This is a BentoML example project, showing you how to serve and deploy Moshi with BentoML.
See here for a full list of BentoML example projects.
If you want to test the Service locally, we recommend using an Nvidia GPU with at least 48 GB of VRAM.
```bash
git clone https://github.com/bentoml/BentoMoshi.git && cd BentoMoshi

# option 1: bentoml serve [RECOMMENDED]
bentoml serve . --debug

# option 2: uv
uvx --from . server
```
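Before pointing the client at a server, it can be handy to confirm the Service is up. The sketch below assumes the standard BentoML readiness endpoint (`/readyz`); the function names are illustrative, not part of this repo.

```python
import urllib.request


def readyz_url(base: str) -> str:
    """Build the readiness-probe URL for a BentoML server at ``base``."""
    return base.rstrip("/") + "/readyz"


def check_ready(base: str, timeout: float = 5.0) -> bool:
    """Return True if the server answers its readiness probe with HTTP 200."""
    try:
        with urllib.request.urlopen(readyz_url(base), timeout=timeout) as resp:
            return resp.status == 200
    except OSError:
        # Connection refused, DNS failure, timeout, etc.
        return False
```

For a local `bentoml serve` run, `check_ready("http://localhost:3000")` should return `True` once the model has finished loading.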
To use the client, set the `URL` environment variable to your BentoCloud endpoint:
```bash
# option 1: uv [RECOMMENDED]
URL=<bentocloud-endpoint> uvx --from . client

# option 2: using python
URL=<bentocloud-endpoint> python bentomoshi/client.py
```
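Both client commands read the endpoint from the `URL` environment variable. A minimal sketch of that resolution logic, assuming a fallback to BentoML's default local port when the variable is unset (the function name is illustrative):

```python
import os


def resolve_endpoint(env=os.environ) -> str:
    """Return the server endpoint from the URL env var, or a local default."""
    # http://localhost:3000 matches BentoML's default serving port.
    return env.get("URL", "http://localhost:3000")
```

With `URL` exported, `resolve_endpoint()` returns your BentoCloud endpoint; without it, the client would target a locally served instance.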
Note

If you are hosting this on your own server, make sure to include the port the model is served on in the URL, for example:

```bash
URL=http://localhost:3000 uvx --from . client
```