-
Notifications
You must be signed in to change notification settings - Fork 62
Description
I started writing a document explaining how people can deploy their models, and soon I was overwhelmed by the number of options out there.
One could simply implement a REST endpoint like what is done in the https://github.com/huggingface/api-inference-community repo, but that has issues with requiring feature names for instance, and that it uses JSON to transfer data which is not the fastest when it comes to numerical data.
Another option would be a simple gRPC which would use protobuf, but that requires defining the structure of the data. If the developer is okay with that, they could follow a bunch of guidelines out there on the internet, but it's not general and it'd be different for each use case, since it involves some manual work to define the data and the model.
There are also tons of companies which have their own solutions to deploy sklearn based models and a bunch of them are open sourced. Should we have a list of them?
Right now a user/developer who's new to deploying models, would be quite overwhelmed by the ecosystem if they were to find a solution, and I think a contribution of this library can be to ease that load, but I'm not sure how.
WDYT @skops-dev/maintainers
Also cc'ing some folks who might have better ideas: @ogrisel @thomasjpfan @mrocklin @jnothman @betatim