Performance regression - max_workers not being used #235

Closed
TomNicholas opened this issue Jun 27, 2023 · 3 comments · Fixed by #249

Comments

@TomNicholas
Member

TomNicholas commented Jun 27, 2023

I'm fairly sure there has been a performance regression in the last few weeks. It shows up in the timeline visualization for the quadratic means workload:

Before:
image

After:
image

Other things changed too, so ignore the other differences between these plots. The thing to notice is that for some steps a group of tasks (exactly 334 out of 5334) completes as a separate bunch later on. Looking at this again, I wonder whether it is actually an issue with the visualization instead...

@tomwhite you said you had an idea which commit might have caused this? If you tell me which one(s) you suspect, I can check out a version prior to that and try it out now.

Ideally this sort of thing would be caught by an automated regression test, see #234.

@tomwhite
Member

> @tomwhite you said you had an idea which commit might have caused this? If you tell me which one(s) you suspect, I can check out a version prior to that and try it out now.

Possibly #211.

I haven't merged it yet, but #217 is another change in a similar vein, with a better implementation of backups.

@TomNicholas
Member Author

TomNicholas commented Jun 27, 2023

I checked out the commit immediately prior to #211, but I'm still seeing the same problem.

Screenshot from 2023-06-27 14-20-31

EDIT: It is of course possible that it's something I've changed in the configuration rather than the code that is causing this.

@tomwhite
Member

This looks like the case described in #222, which is a rechunk operation with two stages of 5000 and 334 tasks. You can check by seeing if there is an array-139-intermediate after running.

This isn't a new problem, but it may have arisen after changing memory settings.
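
One way to check for a leftover intermediate array, as suggested above, is to scan the working directory after a run. This is only a rough sketch: the function name `find_intermediate_arrays`, the example path, and the assumption that intermediate arrays appear as directories whose names contain `-intermediate` are illustrative and not confirmed by this thread.

```python
from pathlib import Path


def find_intermediate_arrays(work_dir):
    """Return sorted names of directories under work_dir that look like
    intermediate arrays (name contains '-intermediate').

    The naming pattern and on-disk layout are assumptions for illustration.
    """
    root = Path(work_dir)
    if not root.is_dir():
        return []
    # Search recursively, since the exact directory nesting is not known here.
    return sorted(p.name for p in root.rglob("*-intermediate*") if p.is_dir())


if __name__ == "__main__":
    # Hypothetical path: replace with the work directory used for the run.
    for name in find_intermediate_arrays("/tmp/example-work-dir"):
        print(name)
```

If a name such as array-139-intermediate shows up, that would be consistent with the two-stage rechunk (5000 + 334 = 5334 tasks) described above.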
