chore(sdk, api): fix types, enhance chat, audio and completion tests #1038

justinthelaw · 2024-09-17T01:07:06Z

Description

Creates and enables vLLM E2E tests that touch each SDK gRPC endpoint at least once with a "happy path" test. Standardizes the new Completions test from vLLM's E2E tests to the LLaMA-CPP-Python E2E tests. Fixes the FinishReason enum typing and object field issues across Completions and ChatCompletions. Adds Audio unit tests and refactors Repeater to enable running those tests.

BREAKING CHANGES

fixes FinishReason to be an enum in both Completions and ChatCompletions protobufs
- modifies API gRPC handler, typing utils, and helper utils to use a Enum class to define and transform the stub responses
fixes object and created field for Completions type as defined in the OpenAI API specification
- uses Literal text_completion for object
- uses ChatCompletions type's created for Completions created

CHANGES

adds and enables simple ("happy path") E2E vLLM testing for local environments only (GPU runner is broken, see commit history)
adds Completions E2E test to llama-cpp-python and vLLM to catch more potential issues, like FinishReason being the wrong type
adds Audio and Completions unit testing
condenses tests into a single test file with an ENV parser (default, warning and helper text included) for model name
adds more comprehensive Make target for test and typing artifact clean-up
improves testing documentation and Makefile targets

Related Issue

Fixes #1037

Relates to #854

Checklist before merging

Tests, documentation, ADR added or updated as needed
Followed the Contributor Guide Steps

netlify · 2024-09-17T01:07:22Z

✅ Deploy Preview for leapfrogai-docs canceled.

Name	Link
🔨 Latest commit	`366cc79`
🔍 Latest deploy log	https://app.netlify.com/sites/leapfrogai-docs/deploys/66edbef41176bd00083aad64

justinthelaw · 2024-09-17T01:08:55Z

There seems to be an issue with our GPU runner configurations that is blocking the vLLM E2E tests from running: https://github.com/defenseunicorns/leapfrogai/actions/runs/10894219888/job/30230952202

The upstream UDS Common action is unable to install UDS CLI into the bin directory. It seems like it needs sudo or higher permissions, even though that is not usually required in our CPU (large or regular) runners. As a side note, the vLLM E2E tests run locally.

…nt-e2e-testing-for-vllm

justinthelaw · 2024-09-20T13:49:42Z

Blocked until #1085 is merged into main.

docs/DEVELOPMENT.md

src/leapfrogai_api/typedef/completion/completion_types.py

src/leapfrogai_sdk/chat/chat_pb2_grpc.py

fix FinishReason, add vLLM E2E

79272d1

justinthelaw added tech-debt Not a feature, but still necessary blocked 🛑 Something needs to happen before this issues is worked labels Sep 17, 2024

justinthelaw added this to the Current - RAG UX Enhancements | Model Directory | API Odds and Ends milestone Sep 17, 2024

justinthelaw self-assigned this Sep 17, 2024

justinthelaw requested a review from a team as a code owner September 17, 2024 01:07

justinthelaw linked an issue Sep 17, 2024 that may be closed by this pull request

chore(vllm): implement e2e testing for vllm #1037

Closed

justinthelaw added 4 commits September 16, 2024 21:37

llama completion test, add CompleteStreamChoice

927ad25

condense e2e to 1 file, add max_new_tokens

e9e434f

formatting fix

d8c6767

max_tokens for OpenAI client

29a9785

justinthelaw mentioned this pull request Sep 17, 2024

feat(vllm)!: upgrade vllm backend and refactor deployment #854

Merged

justinthelaw and others added 10 commits September 16, 2024 22:43

fix singular model_name arg

a166c93

isolate model_name to single test

1c63741

fix e2e-llama-cpp-python.yaml

2e82a9f

Update e2e-vllm.yaml

807128e

model_name fixture

e48331f

Merge remote-tracking branch 'origin/main' into 1037-testvllm-impleme…

e88b29f

…nt-e2e-testing-for-vllm

workaround GPU runner issue

8552ce0

workaround GPU runner issue, pt.2

af4e4ca

workaround GPU runner issue, pt.3

5b1532a

workaround GPU runner issue, pt.4

a8551e5

justinthelaw marked this pull request as draft September 17, 2024 18:00

justinthelaw and others added 5 commits September 17, 2024 14:01

temp turn on e2e vllm, add nvidia-smi

5f1b3c1

add nvidia setp

1e7e98c

fix cluster cmd, play with prompt

c46731a

k3d permissions

161fb3a

Update e2e-vllm.yaml

84a0388

add FinishReason enum back in

c90d820

justinthelaw changed the title ~~fix(sdk, test): fix finish reason, chat and completion e2e tests~~ fix(sdk, test): fix finish reason, chat, audsio and completion tests Sep 19, 2024

justinthelaw changed the title ~~fix(sdk, test): fix finish reason, chat, audsio and completion tests~~ fix(sdk, test): fix finish reason, chat, audio and completion tests Sep 19, 2024

justinthelaw changed the title ~~fix(sdk, test): fix finish reason, chat, audio and completion tests~~ test(sdk, api): fix finish reason, chat, audio and completion tests Sep 19, 2024

justinthelaw marked this pull request as draft September 19, 2024 15:03

passing unit tests

a1a03c1

justinthelaw force-pushed the 1037-testvllm-implement-e2e-testing-for-vllm branch from 70f58ad to a1a03c1 Compare September 19, 2024 20:07

Merge branch 'main' into 1037-testvllm-implement-e2e-testing-for-vllm

3387974

justinthelaw marked this pull request as ready for review September 19, 2024 22:11

justinthelaw removed the api label Sep 20, 2024

justinthelaw changed the title ~~test(sdk, api): fix finish reason, chat, audio and completion tests~~ chore(sdk, api): fix finish reason, chat, audio and completion tests Sep 20, 2024

justinthelaw force-pushed the 1037-testvllm-implement-e2e-testing-for-vllm branch from cac0fdb to 3387974 Compare September 20, 2024 13:47

justinthelaw marked this pull request as draft September 20, 2024 13:49

Merge branch 'main' into 1037-testvllm-implement-e2e-testing-for-vllm

6df5ebb

justinthelaw marked this pull request as ready for review September 20, 2024 16:22

gphorvath reviewed Sep 20, 2024

View reviewed changes

docs/DEVELOPMENT.md Outdated Show resolved Hide resolved

justinthelaw commented Sep 20, 2024

View reviewed changes

src/leapfrogai_api/typedef/completion/completion_types.py Show resolved Hide resolved

justinthelaw commented Sep 20, 2024

View reviewed changes

src/leapfrogai_sdk/chat/chat_pb2_grpc.py Outdated Show resolved Hide resolved

justinthelaw and others added 2 commits September 20, 2024 14:13

PR review fixes

da1399b

Merge branch 'main' into 1037-testvllm-implement-e2e-testing-for-vllm

e10ce50

CollectiveUnicorn approved these changes Sep 20, 2024

View reviewed changes

Merge branch 'main' into 1037-testvllm-implement-e2e-testing-for-vllm

366cc79

justinthelaw changed the title ~~chore(sdk, api): fix finish reason, chat, audio and completion tests~~ chore(sdk, api): fix types, enhance chat, audio and completion tests Sep 20, 2024

justinthelaw requested review from gphorvath and a team September 20, 2024 19:27

YrrepNoj approved these changes Sep 20, 2024

View reviewed changes

justinthelaw merged commit 014329c into main Sep 20, 2024
32 checks passed

justinthelaw deleted the 1037-testvllm-implement-e2e-testing-for-vllm branch September 20, 2024 19:52

github-actions bot mentioned this pull request Sep 20, 2024

chore(main): release 0.13.0 #1014

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(sdk, api): fix types, enhance chat, audio and completion tests #1038

chore(sdk, api): fix types, enhance chat, audio and completion tests #1038

justinthelaw commented Sep 17, 2024 •

edited

Loading

netlify bot commented Sep 17, 2024 •

edited

Loading

justinthelaw commented Sep 17, 2024

justinthelaw commented Sep 20, 2024

chore(sdk, api): fix types, enhance chat, audio and completion tests #1038

chore(sdk, api): fix types, enhance chat, audio and completion tests #1038

Conversation

justinthelaw commented Sep 17, 2024 • edited Loading

Description

BREAKING CHANGES

CHANGES

Related Issue

Checklist before merging

netlify bot commented Sep 17, 2024 • edited Loading

✅ Deploy Preview for leapfrogai-docs canceled.

justinthelaw commented Sep 17, 2024

justinthelaw commented Sep 20, 2024

justinthelaw commented Sep 17, 2024 •

edited

Loading

netlify bot commented Sep 17, 2024 •

edited

Loading