Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

optimize: remove fsync when dumping rwlock data #4052

Merged
merged 1 commit into from
Jun 16, 2020

Conversation

skshetry
Copy link
Member

@skshetry skshetry commented Jun 16, 2020

Fixes #3653

Before:

Screenshot from 2020-06-16 20-00-43

After:

Screenshot from 2020-06-16 20-02-49

(146s with 124.9s spent on fsync vs 14.8s cumulative)

Still 5-6 seconds are spent on 178461 relpath() calls. Will be handled on a separate PR.

  • ❗ I have followed the Contributing to DVC checklist.

  • 📖 If this PR requires documentation updates, I have created a separate PR (or issue, at least) in dvc.org and linked it here. If the CLI API is changed, I have updated tab completion scripts.

  • ❌ I will check DeepSource, CodeClimate, and other sanity checks below. (We consider them recommendatory and don't expect everything to be addressed. Please fix things that actually improve code or fix bugs.)

Thank you for the contribution - we'll try to review it as soon as possible. 🙏

@skshetry skshetry changed the title [WIP] optimize: remove fsync when dumping rwlock data optimize: remove fsync when dumping rwlock data Jun 16, 2020
@skshetry skshetry marked this pull request as ready for review June 16, 2020 14:21
@skshetry skshetry requested review from efiop, pared and pmrowla June 16, 2020 14:21
@skshetry skshetry self-assigned this Jun 16, 2020
@skshetry skshetry added the performance improvement over resource / time consuming tasks label Jun 16, 2020
@@ -449,7 +449,7 @@ def test_repro_when_new_outs_added_does_not_exist(tmp_dir, dvc):
{
"stages": {
"run-copy": {
"cmd": "python copy {} {}".format("foo", "foobar"),
"cmd": "python copy.py {} {}".format("foo", "foobar"),
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Snakeoil tests, don't know how this was even passing.

@skshetry skshetry mentioned this pull request Jun 16, 2020
3 tasks
@efiop efiop merged commit 387867f into iterative:master Jun 16, 2020
@skshetry skshetry deleted the fix-3653 branch June 16, 2020 14:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance improvement over resource / time consuming tasks
Projects
None yet
Development

Successfully merging this pull request may close these issues.

On Windows, dvc repro spends an awful lot of time in rwlock
2 participants