[Improvement-15910][UI] Supposed to provide a default value for the custom parallelism when using the mode of parallel execution. #15912

calvinjiang · 2024-04-25T10:12:01Z

Purpose of the pull request

This PR will close #15910 .

Brief change log

Verify this pull request

This change added tests and can be verified as follows:

Manually verified the change by testing locally.

The test result is as follows:

ruanwenjun · 2024-04-25T10:31:50Z

dolphinscheduler-ui/src/views/projects/workflow/definition/components/start-modal.tsx

+                      max='100'
+                      min='1'


It's not a good idea to restrict the parallelism should between 1 and 100, this shouldn't be restricted.

I understand that an upper limit should be set. If there is no upper limit, all complement tasks will be triggered, which will put a lot of pressure on the system.

I have encountered a real scenario where the user needs to supplement three years of data. After selecting the parallel mode, 1500+ workflow instances are generated at one time, which directly causes the dolphinscheduler worker service to go down. I suggest that there is an upper limit to limit it. We can discuss this upper limit. How much is appropriate to set.

Master/Worker has its own protection strategy, I don't think we should rely on a workflow config to protect the system. And I don't think we can get the correct upper limit at this discussion, different user have different cluster size and usage Scenarios, if we don't know how to set the max value, why we need to set. In additional, if we want to avoid the system crash in a large parallel, the correct way is to limit the rate at server side.

If an appropriate value cannot be negotiated, it should be returned by api-server and allow the administrator to configure it.

You can try to set task-execute-threads-full-policy to be REJECT, then the worker will only receive 100 tasks.

dolphinscheduler/dolphinscheduler-worker/src/main/resources/application.yaml

Line 66 in d1135ea

task-execute-threads-full-policy: REJECT

BTW, I agree with @Gallardot if we want to set a maximal restriction, it's better to make this value to be configurable.

You can try to set task-execute-threads-full-policy to be REJECT, then the worker will only receive 100 tasks.

dolphinscheduler/dolphinscheduler-worker/src/main/resources/application.yaml

Line 66 in d1135ea

task-execute-threads-full-policy: REJECT

BTW, I agree with @Gallardot if we want to set a maximal restriction, it's better to make this value to be configurable.

+1

I'v improved the front-end parallelism strategy so that it won't be limited to a default max value. It'll just offer a tip.

Because Master/Worker has its own protection strategy, giving user prompts is a solution.

The warning message is too strange, why it's better not to be over 10?

ruanwenjun · 2024-04-25T10:35:27Z

dolphinscheduler-ui/src/views/projects/workflow/definition/components/use-form.ts

@@ -66,7 +66,7 @@ export const useForm = () => {
      tenantCode: 'default',
      environmentCode: null,
      startParams: null,
-      expectedParallelismNumber: '',
+      expectedParallelismNumber: '2',


Why the default value is 2 rather than 3? If you want to set a default value, I would prefer 1 or a default empty value.

I understand that if parallel mode is selected, it should be greater than 1, otherwise just select serial mode. Having default values will make the user experience more friendly and allow users to do as few operations as possible.

If my habit is to use maximum parallelism every time, then this change will not be friendly to me.

If my habit is to use maximum parallelism every time, then this change will not be friendly to me.

You could also use the maximum parallelism 100. But limit-free will be risky to the system.

If this will bring risky, there will exist a lot of risky in the system. We need to limit the scheduler number, limit the task number in one workflow, limit the size (parallelism in each backfill * backfill request number)...... The main thing is that why this will bring risky to the system, if this will bring risky to system we should fix this in a correct way, rather than simple limit the input form, can this restriction fix this problem? From my point, this restriction just make it more inconvinent for me to use, but it does not solve the problem caused by the high stress.

Not using the maximum parallelism has another meaning, ensuring the execution of other normal scheduled tasks. If the user directly uses the maximum degree of parallelism, the normally scheduled tasks will not be executed normally. I understand that the user needs to know whether his complement tasks during this period will affect other tasks before selecting the maximum degree of parallelism.

OK, this make sense to me, the maximum paramllelism does really will affect the current running scheduler workflow. We should direct users to try not to use such large parallel reads unless there is some special reason.

ruanwenjun · 2024-04-25T10:38:20Z

Please don't add ready-to-merge label before receive enough committers' approve.

codecov-commenter · 2024-04-25T10:39:14Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 39.66%. Comparing base (7a55ade) to head (e20bf33).

❗ Current head e20bf33 differs from pull request most recent head 050491b. Consider uploading reports for the commit 050491b to get more accurate results

Additional details and impacted files

@@             Coverage Diff              @@
##                dev   #15912      +/-   ##
============================================
- Coverage     39.72%   39.66%   -0.07%     
+ Complexity     5038     5013      -25     
============================================
  Files          1349     1349              
  Lines         45615    45615              
  Branches       4890     4891       +1     
============================================
- Hits          18122    18091      -31     
- Misses        25578    25609      +31     
  Partials       1915     1915

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

sonarcloud · 2024-04-25T11:29:35Z

Please retry analysis of this Pull-Request directly on SonarCloud

…er into f-parallelism

sonarcloud · 2024-04-28T06:10:22Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code

See analysis details on SonarCloud

zhuangchong

LGTM.

worked out the issue

13feb08

calvinjiang requested review from songjianet and Amy0104 as code owners April 25, 2024 10:12

github-actions bot added the UI ui and front end related label Apr 25, 2024

calvinjiang added 3.2.2 and removed UI ui and front end related labels Apr 25, 2024

calvinjiang added this to the 3.2.2 milestone Apr 25, 2024

calvinjiang added UI ui and front end related improvement make more easy to user or prompt friendly ready-to-merge labels Apr 25, 2024

calvinjiang requested a review from zhuangchong April 25, 2024 10:13

ruanwenjun reviewed Apr 25, 2024

View reviewed changes

ruanwenjun removed the ready-to-merge label Apr 25, 2024

Merge branch 'dev' into f-parallelism

867a61f

ruanwenjun assigned calvinjiang Apr 25, 2024

calvinjiang added 5 commits April 28, 2024 13:18

imrpove the parallism strategy

861a5f6

imrpove the parallism strategy

3bde625

Merge branch 'dev' into f-parallelism

08349f5

Merge branch 'f-parallelism' of github.com:calvinjiang/dolphinschedul…

921934e

…er into f-parallelism

merge from dev

050491b

zhuangchong added the ready-to-merge label Apr 28, 2024

zhuangchong approved these changes Apr 28, 2024

View reviewed changes

songjianet approved these changes Apr 28, 2024

View reviewed changes

songjianet merged commit 5a6c6c3 into apache:dev Apr 28, 2024
62 checks passed

ruanwenjun mentioned this pull request Apr 28, 2024

Revert "[Improvement-15910][UI] Supposed to provide a default value for the custom parallelism when using the mode of parallel execution." #15930

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Improvement-15910][UI] Supposed to provide a default value for the custom parallelism when using the mode of parallel execution. #15912

[Improvement-15910][UI] Supposed to provide a default value for the custom parallelism when using the mode of parallel execution. #15912

calvinjiang commented Apr 25, 2024

ruanwenjun Apr 25, 2024

zhuangchong Apr 25, 2024

zhuangchong Apr 25, 2024

ruanwenjun Apr 25, 2024

Gallardot Apr 25, 2024

ruanwenjun Apr 26, 2024 •

edited

Loading

SbloodyS Apr 28, 2024

calvinjiang Apr 28, 2024

zhuangchong Apr 28, 2024

ruanwenjun Apr 28, 2024

ruanwenjun Apr 25, 2024

zhuangchong Apr 25, 2024

ruanwenjun Apr 25, 2024

calvinjiang Apr 25, 2024

ruanwenjun Apr 25, 2024

zhuangchong Apr 26, 2024

ruanwenjun Apr 26, 2024

ruanwenjun commented Apr 25, 2024

codecov-commenter commented Apr 25, 2024 •

edited

Loading

sonarcloud bot commented Apr 25, 2024

sonarcloud bot commented Apr 28, 2024

zhuangchong left a comment

[Improvement-15910][UI] Supposed to provide a default value for the custom parallelism when using the mode of parallel execution. #15912

[Improvement-15910][UI] Supposed to provide a default value for the custom parallelism when using the mode of parallel execution. #15912

Conversation

calvinjiang commented Apr 25, 2024

Purpose of the pull request

Brief change log

Verify this pull request

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ruanwenjun Apr 26, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ruanwenjun commented Apr 25, 2024

codecov-commenter commented Apr 25, 2024 • edited Loading

Codecov Report

sonarcloud bot commented Apr 25, 2024

sonarcloud bot commented Apr 28, 2024

Quality Gate passed

zhuangchong left a comment

Choose a reason for hiding this comment

ruanwenjun Apr 26, 2024 •

edited

Loading

codecov-commenter commented Apr 25, 2024 •

edited

Loading