-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement fallbacking for lang model pool #43
Comments
Merged
roma-glushko
added a commit
that referenced
this issue
Jan 7, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 7, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 7, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 7, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 7, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 8, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 9, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 9, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 9, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 9, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 11, 2024
roma-glushko
added a commit
that referenced
this issue
Jan 11, 2024
- Implemented lang router handling flow - Implemented the priority routing strategy - Implemented exponential retry when no model is healthy - Implemented model health (based on the token bucket) and basic rate limit tracker - Implemented a custom error budget field to accept budgets in form of human-friendly string e.g. "10/m". - Moved build/factory methods on the config level
Merged
roma-glushko
added a commit
that referenced
this issue
Jan 21, 2024
- ✨ [providers] Support for OpenAI Chat API #3 (@mkrueger12 ) - ✨ [API] #54 Unified Chat API (@mkrueger12 ) - ✨ [providers] Support for Cohere Chat API #5 (@mkrueger12 ) - ✨ [providers] Support for Azure OpenAI Chat API #4 (@mkrueger12 ) - ✨ [providers] Support for OctoML Chat API #58 (@mkrueger12 ) - ✨ [routing] The Routing Mechanism, Adaptive Health Tracking, and Fallbacks #42 #43 #51 (@roma-glushko) - ✨ [routing] Support for round robin routing strategy #44 (@roma-glushko) - ✨ [routing] Support for the least latency routing strategy #46 (@roma-glushko) - ✨ [routing] Support for weighted round robin routing strategy #45 (@roma-glushko) - ✨ [providers] Support for Anthropic Chat API #60 (@mkrueger12 ) - ✨ [docs] OpenAPI specifications #22 (@roma-glushko ) - 🔧 [chores] Inited the project #6 (@roma-glushko) - 🔊 [telemetry] Inited logging #14 (@roma-glushko) - 🔧 [chores] Inited Glide's CLI #12 (@roma-glushko) - 👷 [chores] Setup CI workflows #8 (@roma-glushko) - ⚙️ [config] Inited configs #11 (@roma-glushko) - 🔧 [chores] Automatic coverage reports #39 (@roma-glushko) - 👷 [build] Setup release workflows #9 (@roma-glushko)
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Implement fallbacking on provider failures to another healthy provider.
Create a GEP to explain how will that work (GEP0005).
The text was updated successfully, but these errors were encountered: