bug: max_token limit exceeded when changing LLM #8027

Closed · 5 tasks done
Weaxs opened this issue Sep 5, 2024 · 1 comment

Labels: 🐞 bug (Something isn't working)

Comments

Weaxs (Contributor) commented Sep 5, 2024

Self Checks

  • This is only for bug reports; if you would like to ask a question, please head to Discussions.
  • I have searched for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (I have read and agree to the Language Policy).
  • [FOR CHINESE USERS] Please be sure to submit issues in English, otherwise they will be closed. Thank you! :)
  • Please do not modify this template :) and fill in all the required fields.

Dify version

0.7.3

Cloud or Self Hosted

Cloud

Steps to reproduce

  1. Choose GPT-4o-mini.
  2. Set max_tokens to 16384 and save.
  3. Change the model to SiliconFlow Meta-Llama-3.1-405B-Instruct.
  4. Run the workflow.
  5. The LLM node raises an error.

(Screenshots of the LLM node error attached.)

✔️ Expected Behavior

No response

❌ Actual Behavior

No response


dosubot bot commented Sep 5, 2024

I found two similar discussions that might be relevant to your issue:

  1. Max Token Limits Incorrect? - This discussion is still open and was last updated on August 14, 2024.
  2. Error when I modify the max_tokens of the custom Maximum chunk length from 1000 to 2000 - This discussion is also open and was last updated on May 4, 2024 [1][2].

Regarding your specific issue: the max_tokens parameter for the meta-llama/Meta-Llama-3.1-405B-Instruct model may be set to a high value in the configuration interface, but the actual operational limit is 4096 tokens. This discrepancy can arise from implementation constraints, API limitations, or practical performance considerations that are not reflected in the configuration interface [3][4].
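
To make the mismatch concrete, here is a minimal, self-contained sketch (not Dify's actual code; the model names and the 4096 cap are taken from this thread) of why a max_tokens value saved for one model fails once the node is switched to a model with a lower cap:

```python
# Illustrative sketch only, not Dify's implementation. It mirrors the
# situation above: max_tokens=16384 is valid for GPT-4o-mini but exceeds
# the 4096-token operational cap reported for the Llama 3.1 405B model.

MODEL_MAX_TOKENS = {
    "gpt-4o-mini": 16384,
    "meta-llama/Meta-Llama-3.1-405B-Instruct": 4096,
}

def validate_max_tokens(model: str, requested: int) -> int:
    """Raise if the requested max_tokens exceeds the model's cap."""
    limit = MODEL_MAX_TOKENS[model]
    if requested > limit:
        raise ValueError(
            f"max_tokens={requested} exceeds the limit of {limit} for {model}"
        )
    return requested

validate_max_tokens("gpt-4o-mini", 16384)  # fine for the original model

try:
    # The LLM node keeps max_tokens=16384 after the model is switched.
    validate_max_tokens("meta-llama/Meta-Llama-3.1-405B-Instruct", 16384)
except ValueError as err:
    print(err)  # the kind of error the workflow surfaces
```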

The default value for the max_tokens parameter in the Dify configuration for the meta-llama/Meta-Llama-3.1-405B-Instruct model is 512, with a maximum allowed value of 4096 [5].

The limit for the max_tokens parameter is defined in the api/core/model_runtime/model_providers/openrouter/llm/llama-3.1-8b-instruct.yaml file, where it is restricted to a maximum of 4096 tokens [6][7].
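
As a rough illustration (not the actual schema or code from the cited YAML file; the field names here are assumptions), the rule above, default 512 with a cap of 4096, can be thought of as data plus a clamp:

```python
# Sketch of applying a parameter rule like the one described above
# (default 512, maximum 4096). Field names are assumptions for illustration;
# the authoritative definition lives in the cited YAML file.

max_tokens_rule = {"name": "max_tokens", "default": 512, "min": 1, "max": 4096}

def resolve_max_tokens(requested, rule):
    """Fall back to the default, then clamp into the rule's [min, max] range."""
    value = rule["default"] if requested is None else requested
    return max(rule["min"], min(value, rule["max"]))

print(resolve_max_tokens(None, max_tokens_rule))   # 512  -> the default
print(resolve_max_tokens(16384, max_tokens_rule))  # 4096 -> clamped to the cap
```

Clamping the saved value when the model changes, rather than passing it through unchanged, would avoid this error; until then, lowering max_tokens in the LLM node to the new model's cap (4096 here) should work around it.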

If you need further assistance, you might want to check the configuration file api/core/model_runtime/model_providers/openrouter/llm/llama-3.1-405b-instruct.yaml for detailed settings and limitations [8][9].

To continue talking to Dosu, mention @dosu.

Weaxs changed the title from "bug: siliconflow llama-3.1-405B-Instruct max_token is too large" to "bug: max_token limit exceeded when changing LLM" on Sep 5, 2024
dosubot added the 🐞 bug (Something isn't working) label on Sep 5, 2024
dosubot added the stale (Issue has not had recent activity or appears to be solved; stale issues will be automatically closed) label on Oct 6, 2024
dosubot closed this as not planned (won't fix, can't repro, duplicate, stale) on Oct 21, 2024
dosubot removed the stale label on Oct 21, 2024