chore(sdk, api): fix types, enhance chat, audio and completion tests #1038
There seems to be an issue with our GPU runner configurations that is blocking the vLLM E2E tests from running: https://github.com/defenseunicorns/leapfrogai/actions/runs/10894219888/job/30230952202
The upstream UDS Common action is unable to install UDS CLI into the bin directory. It seems like it needs
Blocked until #1085 is merged into main.
Description
Creates and enables vLLM E2E tests that touch each SDK gRPC endpoint at least once with a "happy path" test. Standardizes the new `Completions` test from vLLM's E2E tests to the LLaMA-CPP-Python E2E tests. Fixes the `FinishReason` enum typing and `object` field issues across `Completions` and `ChatCompletions`. Adds `Audio` unit tests and refactors Repeater to enable running those tests.
BREAKING CHANGES
- `FinishReason` to be an enum in both Completions and ChatCompletions protobufs
- `Enum` class to define and transform the stub responses
- `object` and `created` field for `Completions` type as defined in the OpenAI API specification
- `text_completion` for `object`
- `ChatCompletions` type's `created` for `Completions` `created`
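The enum and field changes listed above can be illustrated with a minimal Python sketch. It is not the code from this PR: the `FinishReason` members and their numeric values, the `to_openai` helper, the dataclasses, and `build_completion` are all hypothetical names chosen for illustration. The only details taken from the OpenAI API specification are the `"text_completion"` value for `object`, the `"stop"` and `"length"` finish reasons, and `created` being a Unix timestamp in seconds.

```python
import time
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class FinishReason(Enum):
    """Hypothetical stand-in for a protobuf-generated FinishReason enum."""

    NONE = 0
    STOP = 1
    LENGTH = 2

    def to_openai(self) -> Optional[str]:
        """Map the enum member to the string used in OpenAI-style responses."""
        mapping = {
            FinishReason.NONE: None,
            FinishReason.STOP: "stop",
            FinishReason.LENGTH: "length",
        }
        return mapping[self]


@dataclass
class CompletionChoice:
    index: int
    text: str
    finish_reason: Optional[str]


@dataclass
class CompletionResponse:
    id: str
    model: str
    choices: List[CompletionChoice]
    # The OpenAI spec uses "text_completion" as the object type for Completions.
    object: str = "text_completion"
    # Unix timestamp (seconds) recorded when the response object is built.
    created: int = field(default_factory=lambda: int(time.time()))


def build_completion(text: str, reason: FinishReason, model: str) -> CompletionResponse:
    """Assemble a response from a backend result (illustrative helper only)."""
    choice = CompletionChoice(index=0, text=text, finish_reason=reason.to_openai())
    return CompletionResponse(id="cmpl-example", model=model, choices=[choice])
```

A "happy path" test in this style would build a response and check that `object`, `created`, and `finish_reason` have the expected types and values.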
CHANGES
- `FinishReason` being the wrong type
Related Issue
Fixes #1037
Relates to #854
Checklist before merging