Make analytics report update job scheduling more efficient #7576

SpecLad · 2024-03-08T15:51:19Z

Motivation and context

Currently, schedule_analytics_report_autoupdate_job attempts to debounce job scheduling by examining existing jobs before scheduling a new one. Unfortunately, the scheduler.get_jobs function, which it uses for this purpose, scales poorly. Not only does it fetch IDs of all scheduled jobs (and not just ones related to the current object), but it then fetches information about every job, one by one. The current logic doesn't even need this information, but RQ Scheduler provides no method to get just the IDs.

Replace the current logic with a new lightweight approach that uses a custom Redis key to block scheduling of additional jobs.
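
For illustration, the blocking-key idea can be sketched as below. This is a minimal sketch under stated assumptions, not the code from this patch: `InMemoryRedis`, `schedule_if_not_blocked`, and the key name are hypothetical, and the in-memory class stands in for a real Redis client so the example runs without a server. The only real-client behaviour it relies on is that `SET` with `NX` and `EX` creates a key with an expiry only if the key is absent.

```python
import time
from typing import Callable, Dict, List, Optional, Tuple


class InMemoryRedis:
    """Hypothetical stand-in for a Redis client; supports only set() with nx/ex."""

    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock
        # key -> (value, expiry timestamp or None)
        self._data: Dict[str, Tuple[str, Optional[float]]] = {}

    def set(self, name: str, value: str, ex: Optional[int] = None,
            nx: bool = False) -> Optional[bool]:
        now = self._clock()
        current = self._data.get(name)
        if current is not None and current[1] is not None and current[1] <= now:
            del self._data[name]  # the key has expired
            current = None
        if nx and current is not None:
            return None  # NX prevented the write
        self._data[name] = (value, None if ex is None else now + ex)
        return True


def schedule_if_not_blocked(conn, key: str, delay_seconds: int,
                            schedule: Callable[[], None]) -> bool:
    """Call schedule() only if no blocking key exists; return True if it was called."""
    # One round trip, independent of how many jobs are already scheduled.
    if conn.set(key, "1", nx=True, ex=delay_seconds):
        schedule()
        return True
    return False


# Usage with a fake clock, so that expiry can be simulated deterministically.
fake_now = [0.0]
conn = InMemoryRedis(clock=lambda: fake_now[0])
scheduled: List[str] = []
key = "hypothetical:autoupdate-block:task-42"  # assumed key layout

results = [
    schedule_if_not_blocked(conn, key, 60, lambda: scheduled.append("job")),
    schedule_if_not_blocked(conn, key, 60, lambda: scheduled.append("job")),
]
fake_now[0] = 61.0  # the blocking key has expired by now
results.append(schedule_if_not_blocked(conn, key, 60, lambda: scheduled.append("job")))
print(results)  # [True, False, True]
```

The point of the design is that the check costs a single key operation per update, instead of listing every scheduled job and fetching each one.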

How has this been tested?

Manual testing.

Checklist

I submit my changes into the develop branch
I have created a changelog fragment
~~[ ] I have updated the documentation accordingly~~
~~[ ] I have added tests to cover my changes~~
~~[ ] I have linked related issues (see GitHub docs)~~
[ ] I have increased versions of npm packages if it is necessary (cvat-canvas, cvat-core, cvat-data and cvat-ui)

License

I submit my code changes under the same MIT License that covers the project.
Feel free to contact the maintainers if that's a concern.

codecov · 2024-03-08T16:32:25Z

Codecov Report

Merging #7576 (144381c) into develop (bfb902f) will decrease coverage by 0.09%.
Report is 6 commits behind head on develop.
The diff coverage is 100.00%.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #7576      +/-   ##
===========================================
- Coverage    83.53%   83.44%   -0.09%     
===========================================
  Files          372      373       +1     
  Lines        39700    39739      +39     
  Branches      3729     3741      +12     
===========================================
  Hits         33162    33162              
- Misses        6538     6577      +39

| Components | Coverage Δ | |
|---|---|---|
| cvat-ui | `79.24% <ø> (-0.19%)` | ⬇️ |
| cvat-server | `87.33% <100.00%> (+0.01%)` | ⬆️ |

klakhov

Generally, patch works for me.

cvat/apps/analytics_report/report/create.py

Currently, `schedule_analytics_report_autoupdate_job` attempts to debounce job scheduling by examining existing jobs before scheduling a new one. Unfortunately, the `scheduler.get_jobs` function, which it uses for this purpose, scales poorly. Not only does it fetch a list of all scheduled jobs (and not just ones related to the current object), but it then fetches information about every job, one by one. The current logic doesn't even need this information, but RQ Scheduler provides no method to get just the IDs. Replace the current logic with a new lightweight approach that uses a custom Redis key to block scheduling of additional jobs.

…ts (#7596)

### Motivation and context

See #7576 for more details. This patch extracts the high-level throttling functionality added in that patch and reuses it for quality reports.

Note: in that patch I referred to this functionality as debouncing, but throttling seems like a more accurate description. It would be debouncing if the autoupdate job only ran after no updates occurred for a period, which is not how it actually works.

### How has this been tested?

### Checklist

- [x] I submit my changes into the `develop` branch
- [x] I have created a changelog fragment
- ~~[ ] I have updated the documentation accordingly~~
- ~~[ ] I have added tests to cover my changes~~
- ~~[ ] I have linked related issues (see [GitHub docs](https://help.github.com/en/github/managing-your-work-on-github/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword))~~
- ~~[ ] I have increased versions of npm packages if it is necessary ([cvat-canvas](https://github.com/opencv/cvat/tree/develop/cvat-canvas#versioning), [cvat-core](https://github.com/opencv/cvat/tree/develop/cvat-core#versioning), [cvat-data](https://github.com/opencv/cvat/tree/develop/cvat-data#versioning) and [cvat-ui](https://github.com/opencv/cvat/tree/develop/cvat-ui#versioning))~~

### License

- [x] I submit _my code changes_ under the same [MIT License](https://github.com/opencv/cvat/blob/develop/LICENSE) that covers the project. Feel free to contact the maintainers if that's a concern.
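
The difference between throttling and debouncing mentioned in the note above can be made concrete with a small simulation. This is only a sketch: both function names are hypothetical, and it assumes a throttled job runs once its blocking delay elapses, which may not match the real scheduling details.

```python
from typing import List


def throttled_run_times(events: List[float], delay: float) -> List[float]:
    """Throttling: the first update starts a window; updates inside it are ignored."""
    runs: List[float] = []
    blocked_until = float("-inf")
    for t in sorted(events):
        if t >= blocked_until:
            blocked_until = t + delay
            runs.append(blocked_until)  # assumed: the job runs when the window ends
    return runs


def debounced_run_times(events: List[float], quiet: float) -> List[float]:
    """Debouncing: the job runs only after `quiet` time units with no updates."""
    runs: List[float] = []
    ordered = sorted(events)
    for i, t in enumerate(ordered):
        is_last = i == len(ordered) - 1
        # Every update resets the timer, so only an update followed by a
        # sufficiently long gap leads to a run.
        if is_last or ordered[i + 1] - t >= quiet:
            runs.append(t + quiet)
    return runs


updates = [0, 4, 8, 12, 30]
print(throttled_run_times(updates, 10))  # [10, 22, 40]
print(debounced_run_times(updates, 10))  # [22, 40]
```

With a steady stream of updates, the throttled job still runs at a bounded interval, while the debounced one keeps being postponed until the updates stop.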

SpecLad force-pushed the efficient-analytics-debounce branch from 9173a2a to 0f78ac8 Compare March 8, 2024 15:58

SpecLad marked this pull request as ready for review March 8, 2024 15:58

SpecLad requested review from Marishka17 and nmanovic as code owners March 8, 2024 15:58

SpecLad requested a review from klakhov March 8, 2024 15:59

klakhov reviewed Mar 11, 2024

cvat/apps/analytics_report/report/create.py (outdated, resolved)

SpecLad force-pushed the efficient-analytics-debounce branch from 0f78ac8 to 144381c Compare March 11, 2024 15:07

klakhov approved these changes Mar 11, 2024

SpecLad merged commit 009f9f8 into cvat-ai:develop Mar 11, 2024
34 checks passed

SpecLad deleted the efficient-analytics-debounce branch March 11, 2024 17:38

cvat-bot bot mentioned this pull request Mar 11, 2024

Release v2.11.2 #7592

Merged

SpecLad mentioned this pull request Mar 12, 2024

Use the same algorithm to throttle quality reports as analytics reports #7596

Merged

3 tasks
