Adding separate pod/deployment option to documentation #4092
Comments
Thank you very much for this feedback on the documentation. We would be keen to add a note highlighting this; some points are a bit hidden, so it may be worth rethinking where this fits best. Would you be interested in adding this as a documentation contribution? @edshee recently restructured the docs, so he would be best placed to point out where in the docs it would be most appropriate, if you would like to contribute.
You're welcome! Sure, if you could let me know which directory it should be added to, I'll be glad to contribute to the documentation.
You could add it to doc/source/graph
@cliveseldon Thank you, I'll add it this week.
* Create graph-modes.md describing #4092
* Link added and typo fixed
Hi Seldon team,
I followed the https://docs.seldon.io/projects/seldon-core/en/latest/examples/transformers-v2-protocol.html and https://docs.seldon.io/projects/seldon-core/en/latest/examples/graph-metadata.html examples to build a simple inference graph, and I first ended up with the following setup:
It was pretty similar to most of the examples in the documentation. However, for my use case I needed a separate pod for each of the three nodes instead of having them all in the same pod, and I struggled to find a solution. I finally ended up with the following, which I think does what I need:
I couldn't find a solid explanation in the documentation of the differences between the two cases, or of the availability of the second method (separate pods). I found my answer by accident while checking how to replicate each node in the scaling documentation (https://docs.seldon.io/projects/seldon-core/en/latest/graph/scaling.html). I therefore think it would be nice to add some explanation somewhere in the documentation about the distinction between the two options and their availability, and maybe the pros and cons (e.g. I think per-node scaling is only available with the separate-pod option).
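To make the distinction concrete, here is a minimal sketch of the two layouts. It is not the configuration from this report: the deployment names, node names (`node-one`, `node-two`, `node-three`), and image tags are hypothetical placeholders. It assumes the Seldon Core v1 `SeldonDeployment` schema, in which every entry under `componentSpecs` is a pod spec of its own, so the number of entries determines how many pods the graph is spread across, and in which `replicas` can be set per `componentSpecs` entry as described on the scaling page.

```yaml
# Layout 1: all graph nodes as containers of ONE pod
# (a single componentSpecs entry holding three containers).
apiVersion: machinelearning.seldon.io/v1
kind: SeldonDeployment
metadata:
  name: graph-single-pod          # hypothetical name
spec:
  predictors:
  - name: default
    replicas: 1
    componentSpecs:
    - spec:
        containers:
        - name: node-one
          image: example/node-one:0.1     # hypothetical image
        - name: node-two
          image: example/node-two:0.1     # hypothetical image
        - name: node-three
          image: example/node-three:0.1   # hypothetical image
    graph:
      name: node-one
      type: MODEL
      children:
      - name: node-two
        type: MODEL
        children:
        - name: node-three
          type: MODEL
          children: []
---
# Layout 2: one pod PER graph node
# (three componentSpecs entries, one container each).
apiVersion: machinelearning.seldon.io/v1
kind: SeldonDeployment
metadata:
  name: graph-separate-pods       # hypothetical name
spec:
  predictors:
  - name: default
    replicas: 1
    componentSpecs:
    - spec:
        containers:
        - name: node-one
          image: example/node-one:0.1
    - spec:
        containers:
        - name: node-two
          image: example/node-two:0.1
      replicas: 2                 # scales only the pod that runs node-two
    - spec:
        containers:
        - name: node-three
          image: example/node-three:0.1
    graph:                        # same graph as in layout 1
      name: node-one
      type: MODEL
      children:
      - name: node-two
        type: MODEL
        children:
        - name: node-three
          type: MODEL
          children: []
```

The `graph` section is the same in both manifests; only the grouping of containers under `componentSpecs` changes. In the first layout all three nodes share one pod, so they can only be scaled together, while the second layout allows a separate replica count for each node.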
Thank you!