Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve log messages around the max sequence length #103

Merged
merged 4 commits into from
Jun 28, 2024

Conversation

maxdebayser
Copy link
Collaborator

Motivation

The existing messages were confusing to the users.

Modifications

In the router the error message was rephrased to make it more understandable for users who arent familiar with the internals.

In the server we now print the maximum possible sequence length limited by the model sequence length. The existing print was showing how much output tokens can fit into the memory if you pass max_sequence_length input tokens and vice-versa. I don't know what I was thinking when I wrote that.

Related Issues

https://github.ibm.com/ai-foundation/watson-fm-stack-tracker/issues/958

In the router the error message was rephrased to make it more
understandable for users who arent familiar with the internals.

In the server we now print the maximum possible sequence length
limited by the model sequence length. The existing print was
showing how much output tokens can fit into the memory if you
pass max_sequence_length input tokens and vice-versa. I don't
know what I was thinking when I wrote that.

Signed-off-by: Max de Bayser <[email protected]>
router/src/server.rs Outdated Show resolved Hide resolved
Copy link
Collaborator

@joerunde joerunde left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🌶️🌶️🌶️

I'm fine with this or an even simpler message

Remove mention to `max_batch_weight` so as not to confuse users.

Signed-off-by: Maximilien de Bayser <[email protected]>
Signed-off-by: Maximilien de Bayser <[email protected]>
@maxdebayser maxdebayser merged commit 5b5938e into main Jun 28, 2024
7 checks passed
@tjohnson31415 tjohnson31415 deleted the improve_seq_len_messages branch July 31, 2024 16:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants