TorchServe

TorchServe is a flexible and easy to use tool for serving and scaling PyTorch models in production.

Requires python > 3.8

curl http://127.0.0.1:8080/predictions/bert -T input.txt

🚀 Quick start with TorchServe

# Install dependencies
# cuda is optional
python ./ts_scripts/install_dependencies.py --cuda=cu111

# Latest release
pip install torchserve torch-model-archiver torch-workflow-archiver

# Nightly build
pip install torchserve-nightly torch-model-archiver-nightly torch-workflow-archiver-nightly

Getting started guide

🐳 Quick Start with Docker

docker pull pytorch/torchserve

Refer to torchserve docker for details.

⚡ Why TorchServe

Model Management API: multi model management with optimized worker to model allocation
Inference API: REST and gRPC support for batched inference
TorchServe Workflows: deploy complex DAGs with multiple interdependent models
Default way to serve PyTorch models in
- Kubeflow
- MLflow
- Sagemaker
- Vertex AI
Export your model for optimized inference
- Torchscript out of the box
- ORT
- IPEX
- TensorRT
- FasterTransformer
Performance Guide: builtin support to optimize, benchmark and profile PyTorch and TorchServe performance
Expressive handlers: An expressive handler architecture that makes it trivial to support inferencing for your usecase with many supported out of the box
Metrics API: out of box support for system level metrics with Prometheus exports, custom metrics and PyTorch profiler support

🤔 How does TorchServe work

Model Server for PyTorch Documentation: Full documentation
TorchServe internals: How TorchServe was built
Contributing guide: How to contribute to TorchServe

🏆 Highlighted Examples

🤗 HuggingFace Transformers
Model parallel inference
MultiModal models with MMF combining text, audio and video
Dual Neural Machine Translation for a complex workflow DAG

For more examples

🤓 Learn More

https://pytorch.org/serve

🫂 Contributing

We welcome all contributions!

To learn more about how to contribute, see the contributor guide here.

To file a bug or request a feature, please file a GitHub issue. For filing pull requests, please use the template here.

📰 News

💖 All Contributors

Made with contrib.rocks.

⚖️ Disclaimer

This repository is jointly operated and maintained by Amazon, Meta and a number of individual contributors listed in the CONTRIBUTORS file. For questions directed at Meta, please send an email to [email protected]. For questions directed at Amazon, please send an email to [email protected]. For all other questions, please open up an issue in this repository here.

TorchServe acknowledges the Multi Model Server (MMS) project from which it was derived

Name		Name	Last commit message	Last commit date
Latest commit History 2,999 Commits
.github		.github
benchmarks		benchmarks
binaries		binaries
ci		ci
docker		docker
docs		docs
examples		examples
experimental/torchprep		experimental/torchprep
frontend		frontend
kubernetes		kubernetes
model-archiver		model-archiver
plugins		plugins
requirements		requirements
serving-sdk		serving-sdk
test		test
ts		ts
ts_scripts		ts_scripts
workflow-archiver		workflow-archiver
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
LICENSE.txt		LICENSE.txt
MANIFEST.in		MANIFEST.in
PyPiDescription.rst		PyPiDescription.rst
README.md		README.md
SECURITY.md		SECURITY.md
_config.yml		_config.yml
link_check_config.json		link_check_config.json
pull_request_template.md		pull_request_template.md
setup.py		setup.py
torchserve_sanity.py		torchserve_sanity.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

TorchServe

🚀 Quick start with TorchServe

🐳 Quick Start with Docker

⚡ Why TorchServe

🤔 How does TorchServe work

🏆 Highlighted Examples

🤓 Learn More

🫂 Contributing

📰 News

💖 All Contributors

⚖️ Disclaimer

About

Licenses found

Releases

Packages

Languages

License

Licenses found

KRuok/serve

Folders and files

Latest commit

History

Repository files navigation

TorchServe

🚀 Quick start with TorchServe

🐳 Quick Start with Docker

⚡ Why TorchServe

🤔 How does TorchServe work

🏆 Highlighted Examples

🤓 Learn More

🫂 Contributing

📰 News

💖 All Contributors

⚖️ Disclaimer

About

Resources

License

Licenses found

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages