chore: update Qwen model params #2892
Conversation
api/core/model_runtime/model_providers/tongyi/llm/qwen-max-1201.yaml
  max: 2.0
  help:
    zh_Hans: 用于控制随机性和多样性的程度。具体来说,temperature值控制了生成文本时对每个候选词的概率分布进行平滑的程度。较高的temperature值会降低概率分布的峰值,使得更多的低概率词被选择,生成结果更加多样化;而较低的temperature值则会增强概率分布的峰值,使得高概率词更容易被选择,生成结果更加确定。
    en_US: Used to control the degree of randomness and diversity. Specifically, the temperature value controls the degree to which the probability distribution over candidate words is smoothed when generating text. A higher temperature value flattens the peak of the probability distribution, allowing more low-probability words to be selected, so the generated results are more diverse; a lower temperature value sharpens the peak of the probability distribution, making it easier for high-probability words to be selected, so the generated results are more deterministic.
- name: max_tokens
  use_template: max_tokens
  required: false
It's quite confusing that all these common parameters (max_tokens, top_p, etc.) are set to not required by default.
These configs are not just for calling the LLM APIs; they also affect how productive and deliverable the apps built on them are.
What I'm hoping for is that these options are unchecked by default when the front end displays them.
You are more familiar with the project, so I will follow your suggestion.
okay, done
cc @crazywoola
Btw, as you're updating the Qwen model's specification, use |
LGTM
@@ -8,54 +8,65 @@ model_properties:
 parameter_rules:
 - name: temperature
   use_template: temperature
-  default: 1.0
+  type: float
+  default: 0.85
Btw, the default temperature here seems too high. In our practice with Qwen, it makes the LLM's answers unrelated to the recalled knowledge. I'd prefer setting it relatively low, around 0.1, for more stable answers.
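To make the difference concrete, here is a minimal Python sketch of how temperature rescales the next-token distribution before sampling (the logits are invented for illustration; real Qwen decoding also applies top_p/top_k on top of this):

import numpy as np

def softmax_with_temperature(logits, temperature):
    # Divide the logits by the temperature, then apply a numerically stable softmax.
    z = np.asarray(logits, dtype=float) / temperature
    z -= z.max()
    p = np.exp(z)
    return p / p.sum()

logits = [2.0, 1.0, 0.5, 0.1]  # hypothetical logits for four candidate tokens

print(softmax_with_temperature(logits, 0.85))  # ~[0.63, 0.19, 0.11, 0.07]: mass spread over several tokens
print(softmax_with_temperature(logits, 0.10))  # ~[1.00, 0.00, 0.00, 0.00]: near-deterministic

At 0.1 essentially all probability mass lands on the top-ranked token, which is why a low temperature keeps answers closer to the recalled knowledge.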
Description
Updated the model calling parameters based on the official documentation and the SDK parameters.
https://help.aliyun.com/zh/dashscope/developer-reference/api-details?disableWebsiteRedirect=true
https://help.aliyun.com/zh/dashscope/developer-reference/api-details
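For reference, a minimal sketch of calling Qwen through the DashScope Python SDK with the parameters this PR touches (the model name and values are illustrative, not the PR's defaults for every model; parameter names follow the linked API docs):

import os

import dashscope
from dashscope import Generation

# Assumes a DASHSCOPE_API_KEY environment variable is set.
dashscope.api_key = os.environ["DASHSCOPE_API_KEY"]

response = Generation.call(
    model="qwen-max",  # illustrative; the PR also updates e.g. qwen-max-1201
    messages=[{"role": "user", "content": "Hello"}],
    result_format="message",  # return an OpenAI-style message payload
    temperature=0.85,  # the new default in this PR
    top_p=0.8,
    max_tokens=1500,
)

if response.status_code == 200:
    print(response.output.choices[0].message.content)
else:
    print(response.code, response.message)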
Type of Change
How Has This Been Tested?
Suggested Checklist:
- I ran dev/reformat (backend) and cd web && npx lint-staged (frontend) to appease the lint gods
- optional I have made corresponding changes to the documentation
- optional I have added tests that prove my fix is effective or that my feature works
- optional New and existing unit tests pass locally with my changes