Skip to content

Commit

Permalink
docs: Add BentoML deployment doc (vllm-project#3336)
Browse files Browse the repository at this point in the history
Signed-off-by: Sherlock113 <[email protected]>
  • Loading branch information
Sherlock113 authored Mar 12, 2024
1 parent 72425a5 commit 55d2491
Show file tree
Hide file tree
Showing 2 changed files with 9 additions and 0 deletions.
1 change: 1 addition & 0 deletions docs/source/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -73,6 +73,7 @@ Documentation
serving/run_on_sky
serving/deploying_with_kserve
serving/deploying_with_triton
serving/deploying_with_bentoml
serving/deploying_with_docker
serving/serving_with_langchain
serving/metrics
Expand Down
8 changes: 8 additions & 0 deletions docs/source/serving/deploying_with_bentoml.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
.. _deploying_with_bentoml:

Deploying with BentoML
======================

`BentoML <https://github.com/bentoml/BentoML>`_ allows you to deploy a large language model (LLM) server with vLLM as the backend, which exposes OpenAI-compatible endpoints. You can serve the model locally or containerize it as an OCI-complicant image and deploy it on Kubernetes.

For details, see the tutorial `vLLM inference in the BentoML documentation <https://docs.bentoml.com/en/latest/use-cases/large-language-models/vllm.html>`_.

0 comments on commit 55d2491

Please sign in to comment.