Installation and usage guide for Triton TRT-LLM v0.7.1
- Follow the NGC guide if you are using the NGC Triton TRT-LLM container and want to deploy a GPT model
- Follow the Build Your Own Container (BYOC) guide if you are building your own Triton TRT-LLM container and want to deploy a GPT model
- Follow the README_LLaMA_BYOCTritonTRTLLM guide to deploy a LLaMA model
- Follow the Falcon_BYOCTritonTRTLLM.md guide to deploy Falcon-40B on 8 A100 GPUs