Skip to content

How triton decide what value to return within a BaseModelOutputWithPoolingAndCrossAttentions #5985

Answered by dyastremsky
sfc-gh-zhwang asked this question in Q&A
Discussion options

You must be logged in to vote

Please see here: https://github.com/triton-inference-server/server/blob/main/docs/user_guide/model_configuration.md#special-conventions-for-pytorch-backend

It looks like because the naming did not follow one of the conventions, Triton assumes the input and output order matches that provided in your model. Your config only has one output, so it looks to be selecting the first output returned in your model, which I'd guess is last_hidden_state.

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by dyastremsky
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants