Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Triton LangChain Provider #6655

Closed
peterableda opened this issue Dec 3, 2023 · 5 comments
Closed

Triton LangChain Provider #6655

peterableda opened this issue Dec 3, 2023 · 5 comments
Assignees
Labels
enhancement New feature or request

Comments

@peterableda
Copy link

Is your feature request related to a problem? Please describe.
LangChain is one of the most widely used libraries to build LLM based applications with a wide range of integrations to LLM providers. Triton Inference Server should be supported within that community.

I would like to host open source LLMs from HuggingFace in Triton as a Coding Assistant for JupyterLab. I would use the Jupyter-AI extension that already integrates with a wide range of LLM Providers, it relies on LangChain to abstract the interactions.

Describe the solution you'd like
Build an NVIDIA Triton Inference Server LangChain Provider.

Additional context
A similar request was raised in the LangChain community: langchain-ai/langchain#12474

I also saw LangChain coming up as an alternative solution in the #6583 request.

@fpetrini15
Copy link
Contributor

fpetrini15 commented Dec 5, 2023

Thank you @peterableda for submitting a feature request! I have filed a ticket for us to discuss and investigate further.

@fpetrini15 fpetrini15 added the enhancement New feature or request label Dec 5, 2023
@nnshah1
Copy link
Contributor

nnshah1 commented Dec 6, 2023

@peterableda,

@npuichigo has posted this on the TRT-LLM discussion.

NVIDIA/TensorRT-LLM#591

@nnshah1 nnshah1 self-assigned this Dec 6, 2023
@npuichigo
Copy link

@peterableda,

@npuichigo has posted this on the TRT-LLM discussion.

NVIDIA/TensorRT-LLM#591

Welcome to have a try with https://github.com/npuichigo/openai_trtllm if anyone want an openai-like API to use with LangChain.

@peterableda
Copy link
Author

peterableda commented Dec 20, 2023

This seems to be the same/v similar request: langchain-ai/langchain#13975

@dyastremsky
Copy link
Contributor

Closing issue due to inactivity. Please reopen if you would like to follow up with this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Development

No branches or pull requests

5 participants