v2 control plane grpc + various fixes for scalability (#229)
* add transport settings for v2client
* add error codes
* fix httpmock tests
* add model to cache if failed unload
* add error handling for unload model
* tidy up logic
* add test
* lint
* limit http / grpc connections
* increase http connection timeout
* tidy up tests
* lint
* fix test
* update server snapshot creation in scheduler
* read events in goroutines
* add v2 control plane protos
* initial v2 grpc impl
* add counter for load / unload models
* add test
* compute counts without models
* add triton control plane to v2 dataplane for now
* protos
* compute memory including loading models
* fix scheduler test
* fix test
* add a note
* create a copy
* cap
* add a mutex around state map management
* do not access map directly in memory store
* return err
* remove sleep
* refactor and fix test
* add v2 grpc test
* add more test cases
* add aggregate infer metrics and test
* fix test and lint

Co-authored-by: Clive Cox <[email protected]>