Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

20.2 → 21.1 roachperf benchmark regressions #62322

Closed
9 of 15 tasks
erikgrinaker opened this issue Mar 21, 2021 · 3 comments
Closed
9 of 15 tasks

20.2 → 21.1 roachperf benchmark regressions #62322

erikgrinaker opened this issue Mar 21, 2021 · 3 comments
Assignees
Labels
C-investigation Further steps needed to qualify. C-label will change. C-performance Perf of queries or internals. Solution not expected to change functional behavior.

Comments

@erikgrinaker
Copy link
Contributor

erikgrinaker commented Mar 21, 2021

Tracking issue for 20.2 → 21.1 roachperf benchmark regressions.

See also investigation spreadsheet.

Branch Benchmarks

kv95/enc=false/nodes=1/cpu=32 on GCE, median cumulative ops/s over 5 runs.

Date release-20.2 release-21.1 master
2021-03-25 76,018 ops/s 53,462 ops/s 70,149 ops/s
2021-03-29 76,713 ops/s 70,672 ops/s 70,763 ops/s

Follow-up tasks

@erikgrinaker erikgrinaker added C-investigation Further steps needed to qualify. C-label will change. C-performance Perf of queries or internals. Solution not expected to change functional behavior. labels Mar 21, 2021
@erikgrinaker erikgrinaker self-assigned this Mar 21, 2021
@erikgrinaker
Copy link
Contributor Author

erikgrinaker commented Mar 29, 2021

We've picked all of the low-hanging fruit at this point. We haven't been able to pinpoint the causes of the gradual decline in performance in the Dec-Feb time frame (#62156), the only remaining known regressions are due to separated intents lock table reads (#62078) and tracing. We did a few runs with these disabled:

  • release-20.2: 76,713 ops/s
  • master: 70,763 ops/s
  • w/o lock table reads: 74,707 ops/s
  • w/o lock table reads + tracing: 75,422 ops/s

This roughly gets us back to 20.2. I suspect vectorized execution may make up the remainder -- or just an accumulation of other minor regressions.

@erikgrinaker
Copy link
Contributor Author

I think we've addressed the regressions that we can at this point, and we'll need more targeted optimization efforts to improve hot path performance.

@erikgrinaker
Copy link
Contributor Author

Did a comparison of 20.2 and 21.1 on all YCSB workloads and a few kv95 ones here: #62078 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
C-investigation Further steps needed to qualify. C-label will change. C-performance Perf of queries or internals. Solution not expected to change functional behavior.
Projects
None yet
Development

No branches or pull requests

1 participant