-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Run the Pyston benchmarks and compare how we're doing with those versus CPython 3.10. #164
Comments
FYI, I have the pyston benchmarks running on our internal benchmarking machine. However, only 3 of the 12 are currently succeeding. (See #257.) For now, here's the results for those three, comparing the 3.10.2 release against main (bebaa95fd0):
So in those 3 benchmarks we're showing a decent speedup at least. |
Interestingly those Pyston benchmarks that will actually run end up being roughly the same percentage faster as the PyPerformance benchmarks (comparing the results you posted for 3.10.4 to 3.11.0a7, I see a geometric mean of 22% faster). Compare this to the speedup claimed by Pyston: 34% on Intel, 30% on ARM. Though that's comparing 3.8 to Pyston, so not quite apples to apples. Also, we only run three of the 12 Pyston benchmarks, so there may be surprises ahead as we get more of those to work. To make comparisons easier, perhaps we could add the following:
We could also add results reported by Pyston, if we can get them in JSON format. |
(The results reported by Pyston can never be compared directly because the hardware/OS they used to run their benchmarks are different from ours. Comparing their baseline results for 3.8 our results for 3.8 (for the same benchmarks) might help knowing how much bias there is due to this -- though it may well differ per benchmark as hardware features like branch prediction may favor some benchmarks over others.) |
This is a high-level goal ("epic"?). Tasks like refactoring the PyPerformance suite and making changes to the Pyston benchmarks so they run under 3.10 and 3.11 (subtasks of #60) are supportive of this goal. [Or is #60 just the overall "epic" goal? Then we can close/delete this issue.]
Ideally it would be easy to run the Pyston benchmarks for an arbitrary PR or commit.
The text was updated successfully, but these errors were encountered: