forked from bentoml/BentoML
feat: model preheat #1
Closed
* feat: export component context at top namespace Signed-off-by: Frost Ming <[email protected]>
Signed-off-by: Frost Ming <[email protected]>
* feat: mount asgi app and fastapi support Signed-off-by: Frost Ming <[email protected]> * add test cases Signed-off-by: Frost Ming <[email protected]> * fix: update dependencies Signed-off-by: Frost Ming <[email protected]> * fix: only collect metrics when enabled Signed-off-by: Frost Ming <[email protected]> * fix types Signed-off-by: Frost Ming <[email protected]> --------- Signed-off-by: Frost Ming <[email protected]>
* fix: add envs to image * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* fix: crash when pulling bento from organization with proxy mode Signed-off-by: Frost Ming <[email protected]>
* fix: add str in enum for deployment schema Signed-off-by: FogDong <[email protected]> * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci * fix test Signed-off-by: FogDong <[email protected]> --------- Signed-off-by: FogDong <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Add envs field in bentofile Signed-off-by: Sherlock113 <[email protected]>
Add GitHub repo links for use case docs Signed-off-by: Sherlock113 <[email protected]>
Update the vllm use case doc Signed-off-by: Sherlock113 <[email protected]>
* Add how to get deployment info doc Signed-off-by: Sherlock113 <[email protected]> * Add other methods and attributes Signed-off-by: Sherlock113 <[email protected]> * Update the docs Signed-off-by: Sherlock113 <[email protected]> --------- Signed-off-by: Sherlock113 <[email protected]>
Signed-off-by: Frost Ming <[email protected]>
* fix: wait until server ready when initializing client Signed-off-by: Frost Ming <[email protected]> * fix format Signed-off-by: Frost Ming <[email protected]> * fix: read token from env * fix: don't wait for proxy client Signed-off-by: Frost Ming <[email protected]> * fix: fetch schema url Signed-off-by: Frost Ming <[email protected]> * fix tests Signed-off-by: Frost Ming <[email protected]> * try fixing Signed-off-by: Frost Ming <[email protected]> * why readyz isn't working Signed-off-by: Frost Ming <[email protected]> * revert change Signed-off-by: Frost Ming <[email protected]>
…gic (#4561) Signed-off-by: Frost Ming <[email protected]>
Add doc for server_ready_timeout Signed-off-by: Sherlock113 <[email protected]>
* fix: depracate all enum in schema Signed-off-by: FogDong <[email protected]> * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci * fix: fix test Signed-off-by: FogDong <[email protected]> * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci --------- Signed-off-by: FogDong <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* feat: threaded serving Signed-off-by: Frost Ming <[email protected]> * fix: add context manager Signed-off-by: Frost Ming <[email protected]> * remove test code from quickstart Signed-off-by: Frost Ming <[email protected]> * fix: change the server url Signed-off-by: Frost Ming <[email protected]> * fix: add server on deployment Signed-off-by: Frost Ming <[email protected]>
Adjust existing Bento deployment wording Signed-off-by: Sherlock113 <[email protected]>
Signed-off-by: FogDong <[email protected]>
* Add doc for using pydantic model at the top level Signed-off-by: Sherlock113 <[email protected]> * Import bentoml package Signed-off-by: Sherlock113 <[email protected]> * Add explanations for accessing parameters Signed-off-by: Sherlock113 <[email protected]> --------- Signed-off-by: Sherlock113 <[email protected]>
Signed-off-by: FogDong <[email protected]>
* set upload timeouts to 10 hours * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* fix: respect the content type config for file output Signed-off-by: Frost Ming <[email protected]> * fix issubclass check Signed-off-by: Frost Ming <[email protected]> --------- Signed-off-by: Frost Ming <[email protected]>
* Update the adaptive batching doc Signed-off-by: Sherlock113 <[email protected]> * Add @bentoml.api decorator Signed-off-by: Sherlock113 <[email protected]> * Add a section to explain how to handle multiple parameters Signed-off-by: Sherlock113 <[email protected]> --------- Signed-off-by: Sherlock113 <[email protected]>
* feat: Service.to_asgi() method Signed-off-by: Frost Ming <[email protected]>
* Add asgi doc Signed-off-by: Sherlock113 <[email protected]> * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci --------- Signed-off-by: Sherlock113 <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* fix: allow legacy runnable methods when calling API Signed-off-by: Frost Ming <[email protected]> * fix: output content type Signed-off-by: Frost Ming <[email protected]> * fix compatibility with python 3.9 Signed-off-by: Frost Ming <[email protected]>
fix(stub): update env params construction from __init__ Signed-off-by: Aaron <[email protected]>
* Add testing doc Signed-off-by: Sherlock113 <[email protected]> * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci --------- Signed-off-by: Sherlock113 <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
chore(stub): use TypedDict Signed-off-by: Aaron <[email protected]>
…nexpected keyword argument 'task'" (#4880) Fixes #4879 Signed-off-by: Frost Ming <[email protected]>
* fix: silence git output Signed-off-by: Frost Ming <[email protected]>
* fix: surface config error on bento build Signed-off-by: Frost Ming <[email protected]>
* Refactor tasks doc Signed-off-by: Sherlock113 <[email protected]> * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci --------- Signed-off-by: Sherlock113 <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
…page (#4891) * Sync the BYOC doc with blog Signed-off-by: Sherlock113 <[email protected]> * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci --------- Signed-off-by: Sherlock113 <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
…4893) * fix: move user setup to base template to make custom base image work Signed-off-by: Frost Ming <[email protected]>
…rvice(metrics=...)` (#4895) Fixes #4859 Signed-off-by: Frost Ming <[email protected]>
* ci: pre-commit autoupdate [skip ci] updates: - [github.com/astral-sh/ruff-pre-commit: v0.5.0 → v0.5.6](astral-sh/ruff-pre-commit@v0.5.0...v0.5.6) - [github.com/pdm-project/pdm: 2.16.1 → 2.17.3](pdm-project/pdm@2.16.1...2.17.3) - [github.com/bufbuild/buf: v1.34.0 → v1.35.1](bufbuild/buf@v1.34.0...v1.35.1) * chore: update check to use isinstance Signed-off-by: Aaron Pham <[email protected]> --------- Signed-off-by: Aaron Pham <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Aaron Pham <[email protected]>
Update doc images Signed-off-by: Sherlock113 <[email protected]>
* fix: fix resource type Signed-off-by: FogDong <[email protected]> * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci --------- Signed-off-by: FogDong <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* fix: direct return response if tracer id is null Signed-off-by: FogDong <[email protected]> * ci: auto fixes from pre-commit.ci For more information, see https://pre-commit.ci --------- Signed-off-by: FogDong <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Frost Ming [email protected]