feat: model preheat #1

Closed
wants to merge 830 commits into from

Conversation

frostming
Owner

Signed-off-by: Frost Ming [email protected]

What does this PR address?

Fixes #(issue)

Before submitting:

frostming and others added 30 commits March 5, 2024 11:37
* feat: export component context at top namespace

Signed-off-by: Frost Ming <[email protected]>
* feat: mount asgi app and fastapi support

Signed-off-by: Frost Ming <[email protected]>

* add test cases

Signed-off-by: Frost Ming <[email protected]>

* fix: update dependencies

Signed-off-by: Frost Ming <[email protected]>

* fix: only collect metrics when enabled

Signed-off-by: Frost Ming <[email protected]>

* fix types

Signed-off-by: Frost Ming <[email protected]>

---------

Signed-off-by: Frost Ming <[email protected]>
* fix: add envs to image

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* fix: crash when pulling bento from organization with proxy mode

Signed-off-by: Frost Ming <[email protected]>
* fix: add str in enum for deployment schema

Signed-off-by: FogDong <[email protected]>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

* fix test

Signed-off-by: FogDong <[email protected]>

---------

Signed-off-by: FogDong <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Add envs field in bentofile

Signed-off-by: Sherlock113 <[email protected]>
Add GitHub repo links for use case docs

Signed-off-by: Sherlock113 <[email protected]>
Update the vllm use case doc

Signed-off-by: Sherlock113 <[email protected]>
* Add how to get deployment info doc

Signed-off-by: Sherlock113 <[email protected]>

* Add other methods and attributes

Signed-off-by: Sherlock113 <[email protected]>

* Update the docs

Signed-off-by: Sherlock113 <[email protected]>

---------

Signed-off-by: Sherlock113 <[email protected]>
* fix: wait until server ready when initializing client

Signed-off-by: Frost Ming <[email protected]>

* fix format

Signed-off-by: Frost Ming <[email protected]>

* fix: read token from env

* fix: don't wait for proxy client

Signed-off-by: Frost Ming <[email protected]>

* fix: fetch schema url

Signed-off-by: Frost Ming <[email protected]>

* fix tests

Signed-off-by: Frost Ming <[email protected]>

* try fixing

Signed-off-by: Frost Ming <[email protected]>

* why readyz isn't working

Signed-off-by: Frost Ming <[email protected]>

* revert change

Signed-off-by: Frost Ming <[email protected]>
Add doc for server_ready_timeout

Signed-off-by: Sherlock113 <[email protected]>
* fix: deprecate all enum in schema

Signed-off-by: FogDong <[email protected]>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

* fix: fix test

Signed-off-by: FogDong <[email protected]>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Signed-off-by: FogDong <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* feat: threaded serving

Signed-off-by: Frost Ming <[email protected]>

* fix: add context manager

Signed-off-by: Frost Ming <[email protected]>

* remove test code from quickstart

Signed-off-by: Frost Ming <[email protected]>

* fix: change the server url

Signed-off-by: Frost Ming <[email protected]>

* fix: add server on deployment

Signed-off-by: Frost Ming <[email protected]>
Adjust existing Bento deployment wording

Signed-off-by: Sherlock113 <[email protected]>
* Add doc for using pydantic model at the top level

Signed-off-by: Sherlock113 <[email protected]>

* Import bentoml package

Signed-off-by: Sherlock113 <[email protected]>

* Add explanations for accessing parameters

Signed-off-by: Sherlock113 <[email protected]>

---------

Signed-off-by: Sherlock113 <[email protected]>
* set upload timeouts to 10 hours

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* fix: respect the content type config for file output

Signed-off-by: Frost Ming <[email protected]>

* fix issubclass check

Signed-off-by: Frost Ming <[email protected]>

---------

Signed-off-by: Frost Ming <[email protected]>
* Update the adaptive batching doc

Signed-off-by: Sherlock113 <[email protected]>

* Add @bentoml.api decorator

Signed-off-by: Sherlock113 <[email protected]>

* Add a section to explain how to handle multiple parameters

Signed-off-by: Sherlock113 <[email protected]>

---------

Signed-off-by: Sherlock113 <[email protected]>
* feat: Service.to_asgi() method

Signed-off-by: Frost Ming <[email protected]>
* Add asgi doc

Signed-off-by: Sherlock113 <[email protected]>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Signed-off-by: Sherlock113 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* fix: allow legacy runnable methods when calling API

Signed-off-by: Frost Ming <[email protected]>

* fix: output content type

Signed-off-by: Frost Ming <[email protected]>

* fix compatibility with python 3.9

Signed-off-by: Frost Ming <[email protected]>
fix(stub): update env params construction from __init__

Signed-off-by: Aaron <[email protected]>
* Add testing doc

Signed-off-by: Sherlock113 <[email protected]>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Signed-off-by: Sherlock113 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
chore(stub): use TypedDict

Signed-off-by: Aaron <[email protected]>
frostming and others added 29 commits July 24, 2024 20:42
…nexpected keyword argument 'task'" (#4880)

Fixes #4879

Signed-off-by: Frost Ming <[email protected]>
* fix: silence git output

Signed-off-by: Frost Ming <[email protected]>
* fix: surface config error on bento build

Signed-off-by: Frost Ming <[email protected]>
* Refactor tasks doc

Signed-off-by: Sherlock113 <[email protected]>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Signed-off-by: Sherlock113 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
…page (#4891)

* Sync the BYOC doc with blog

Signed-off-by: Sherlock113 <[email protected]>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Signed-off-by: Sherlock113 <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
…4893)

* fix: move user setup to base template to make custom base image work

Signed-off-by: Frost Ming <[email protected]>
* ci: pre-commit autoupdate [skip ci]

updates:
- [github.com/astral-sh/ruff-pre-commit: v0.5.0 → v0.5.6](astral-sh/ruff-pre-commit@v0.5.0...v0.5.6)
- [github.com/pdm-project/pdm: 2.16.1 → 2.17.3](pdm-project/pdm@2.16.1...2.17.3)
- [github.com/bufbuild/buf: v1.34.0 → v1.35.1](bufbuild/buf@v1.34.0...v1.35.1)

* chore: update check to use isinstance

Signed-off-by: Aaron Pham <[email protected]>

---------

Signed-off-by: Aaron Pham <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <[email protected]>
Update doc images

Signed-off-by: Sherlock113 <[email protected]>
* fix: fix resource type

Signed-off-by: FogDong <[email protected]>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Signed-off-by: FogDong <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* fix: direct return response if tracer id is null

Signed-off-by: FogDong <[email protected]>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Signed-off-by: FogDong <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Frost Ming <[email protected]>
@frostming frostming closed this Aug 20, 2024