Improve log messages around the max sequence length #103

maxdebayser · 2024-06-20T16:22:55Z

Motivation

The existing messages were confusing to the users.

Modifications

In the router, the error message was rephrased to make it more understandable for users who aren't familiar with the internals.

In the server we now print the maximum possible sequence length, limited by the model's sequence length. The existing print showed how many output tokens fit into memory if you pass max_sequence_length input tokens, and vice versa. I don't know what I was thinking when I wrote that.
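
As a rough illustration of the new server-side message (a sketch, not the actual server code): the reported value is the smaller of what fits in memory and what the model supports. The function name, both parameter names, and the message wording below are hypothetical, and `tokens_fitting_in_memory` stands in for whatever the server's memory estimate produces.

```python
def max_usable_sequence_length(
    tokens_fitting_in_memory: int,
    model_max_sequence_length: int,
) -> int:
    """Return the longest sequence that can be served.

    Hypothetical helper: the memory-derived limit is capped by the
    model's own maximum sequence length.
    """
    return min(tokens_fitting_in_memory, model_max_sequence_length)


def format_limit_message(
    tokens_fitting_in_memory: int,
    model_max_sequence_length: int,
) -> str:
    # Assumed wording, for illustration only.
    limit = max_usable_sequence_length(
        tokens_fitting_in_memory, model_max_sequence_length
    )
    return f"Maximum possible sequence length: {limit}"


if __name__ == "__main__":
    # Memory would allow 6000 tokens, but the model caps at 4096.
    print(format_limit_message(6000, 4096))
    # Memory is the tighter limit here.
    print(format_limit_message(3000, 4096))
```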

Related Issues

https://github.ibm.com/ai-foundation/watson-fm-stack-tracker/issues/958

In the router, the error message was rephrased to make it more understandable for users who aren't familiar with the internals. In the server we now print the maximum possible sequence length, limited by the model's sequence length. The existing print showed how many output tokens fit into memory if you pass max_sequence_length input tokens, and vice versa. I don't know what I was thinking when I wrote that. Signed-off-by: Max de Bayser <[email protected]>

router/src/server.rs

joerunde

🌶️🌶️🌶️

I'm fine with this or an even simpler message

Remove mention to `max_batch_weight` so as not to confuse users. Signed-off-by: Maximilien de Bayser <[email protected]>

Signed-off-by: Maximilien de Bayser <[email protected]>

maxdebayser requested review from joerunde, tjohnson31415 and njhill June 20, 2024 16:22

joerunde reviewed Jun 27, 2024


router/src/server.rs (outdated, resolved)

joerunde approved these changes Jun 27, 2024


maxdebayser added 3 commits June 27, 2024 14:49

Simplify error message

01c6ec6

Remove mention to `max_batch_weight` so as not to confuse users. Signed-off-by: Maximilien de Bayser <[email protected]>

Remove second print of max_batch_weight

cdbcfa7

Signed-off-by: Maximilien de Bayser <[email protected]>

Merge branch 'main' into improve_seq_len_messages

48ed1c9

maxdebayser merged commit 5b5938e into main Jun 28, 2024
7 checks passed

tjohnson31415 deleted the improve_seq_len_messages branch July 31, 2024 16:59
