nvidia-trt:add TritonTensorRTLLM(verbose_client=False) #16848
mkhludnev commented on Jan 31, 2024
- Description: adds a verbose_client flag to TritonTensorRTLLM
- Issue: none
- Dependencies: none
- Twitter handle:
captured = StringIO()
sys.stdout = captured
with pytest.raises(InferenceServerException):
    llm.client.is_server_live()
This is not perfect: it still tries to request this address, which might cause cloud, CI, or security issues. I don't know. Open to any other ideas.
Improved in the recent push.
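For illustration, one way to check verbose output without any network attempt is to swap in a stand-in client and capture stdout with `contextlib.redirect_stdout`, which restores `sys.stdout` automatically. This is only a sketch under assumptions: `FakeVerboseClient`, `capture_liveness_output`, and the `localhost:8001` address are hypothetical and are not part of tritonclient or LangChain; the actual test in this PR may patch the client differently.

```python
import contextlib
import io


class FakeVerboseClient:
    """Hypothetical stand-in for a gRPC client; it never opens a socket."""

    def __init__(self, url: str, verbose: bool = False) -> None:
        self.url = url
        self.verbose = verbose

    def is_server_live(self) -> bool:
        # Mimic a verbose client by printing the call name to stdout.
        if self.verbose:
            print(f"is_server_live, url={self.url}")
        return True


def capture_liveness_output(verbose: bool) -> str:
    """Run the liveness check and return whatever was printed."""
    client = FakeVerboseClient("localhost:8001", verbose=verbose)  # assumed address
    buffer = io.StringIO()
    # redirect_stdout restores sys.stdout even if the call raises.
    with contextlib.redirect_stdout(buffer):
        assert client.is_server_live()
    return buffer.getvalue()


quiet_output = capture_liveness_output(verbose=False)
verbose_output = capture_liveness_output(verbose=True)
```

Compared with assigning to `sys.stdout` directly, the context manager avoids leaving stdout redirected if an assertion fails partway through the test.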
LLMs already have a `verbose` attribute, which is meant for configuring callbacks. Should we give this a different name? Maybe `client_verbose`?
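A minimal sketch of the naming concern, assuming a wrapper that carries both flags: the existing `verbose` keeps its callback-related meaning, while a separately named flag is the only one forwarded to the underlying client. `SketchLLM` and `client_kwargs` are hypothetical names for illustration, not the real TritonTensorRTLLM implementation; the flag name follows the PR title.

```python
from dataclasses import dataclass


@dataclass
class SketchLLM:
    """Hypothetical wrapper, not the real TritonTensorRTLLM."""

    # Existing meaning: verbosity for callbacks.
    verbose: bool = False
    # Separate flag: forwarded to the underlying client only.
    verbose_client: bool = False

    def client_kwargs(self) -> dict:
        # Only the client-specific flag reaches the client constructor.
        return {"verbose": self.verbose_client}


callbacks_only = SketchLLM(verbose=True).client_kwargs()
client_only = SketchLLM(verbose_client=True).client_kwargs()
```

Keeping the names distinct means enabling callback verbosity does not silently turn on client logging, and vice versa.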
make no network attempts in unit test
ee7d24d to 9f05293
@baskaryan let me know if I can improve it further.
…hain-ai#16848) - **Description:** adding verbose flag to TritonTensorRTLLM, - **Issue:** nope, - **Dependencies:** not any, - **Twitter handle:**