Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
I'd rather created issue for discussion, but this repo doesn't have issues enabled. First of all such prepending seems redundant especially for long RAG prompts. Then, it's an actual problem since I notice that triton gRPC crops the long response. Curiously, REST doesn't crop payload and full concatenation of prompt and output arrives to client. Here I put more details of the issue langchain-ai/langchain#12474 (comment)
- Loading branch information