Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Change doc image #2632

Merged
merged 7 commits into from
Dec 4, 2024
Merged

[CI] Change doc image #2632

merged 7 commits into from
Dec 4, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Dec 4, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: b04f17ccea73a72d7b87f5e4f20e498f2c310158
Pull Request resolved: #2632
Copy link

pytorch-bot bot commented Dec 4, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2632

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 9 Unrelated Failures

As of commit b5657ea with merge base 594462d (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 4, 2024
@vmoens vmoens added documentation Improvements or additions to documentation CI Has to do with CI setup (e.g. wheels & builds, tests...) ciflow/docs tests full docs CI labels Dec 4, 2024
Copy link

pytorch-bot bot commented Dec 4, 2024

No ciflow labels are configured for this repo.
For information on how to enable CIFlow bot see this wiki

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: bea234057cfa50f253c211c0b16457df9566fcd6
Pull Request resolved: #2632
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: 98196145049c1aa0d0c1c3856c803e62cd5f76c5
Pull Request resolved: #2632
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: f63bfd471e29388ce05a25d8323af6793af700da
Pull Request resolved: #2632
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: 1333d3b90aa93622fcd2c786c9483a3417c81740
Pull Request resolved: #2632
Copy link

github-actions bot commented Dec 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}19$. Worsened: $\large\color{#d91a1a}5$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4296s 0.4276s 2.3386 Ops/s 2.1597 Ops/s $\textbf{\color{#35bf28}+8.29\%}$
test_transformed 0.6112s 0.6082s 1.6443 Ops/s 1.5860 Ops/s $\color{#35bf28}+3.67\%$
test_serial 1.3924s 1.3686s 0.7307 Ops/s 0.7137 Ops/s $\color{#35bf28}+2.38\%$
test_parallel 1.4127s 1.3171s 0.7592 Ops/s 0.7458 Ops/s $\color{#35bf28}+1.81\%$
test_step_mdp_speed[True-True-True-True-True] 0.2239ms 29.6788μs 33.6940 KOps/s 32.9664 KOps/s $\color{#35bf28}+2.21\%$
test_step_mdp_speed[True-True-True-True-False] 74.3790μs 17.5706μs 56.9132 KOps/s 56.2930 KOps/s $\color{#35bf28}+1.10\%$
test_step_mdp_speed[True-True-True-False-True] 43.2810μs 16.8708μs 59.2741 KOps/s 58.5240 KOps/s $\color{#35bf28}+1.28\%$
test_step_mdp_speed[True-True-True-False-False] 62.0960μs 10.0765μs 99.2410 KOps/s 98.3812 KOps/s $\color{#35bf28}+0.87\%$
test_step_mdp_speed[True-True-False-True-True] 69.5000μs 31.6946μs 31.5511 KOps/s 31.1316 KOps/s $\color{#35bf28}+1.35\%$
test_step_mdp_speed[True-True-False-True-False] 73.2270μs 19.3778μs 51.6053 KOps/s 51.2493 KOps/s $\color{#35bf28}+0.69\%$
test_step_mdp_speed[True-True-False-False-True] 45.5750μs 18.5940μs 53.7808 KOps/s 53.1178 KOps/s $\color{#35bf28}+1.25\%$
test_step_mdp_speed[True-True-False-False-False] 67.5360μs 11.8857μs 84.1347 KOps/s 82.8118 KOps/s $\color{#35bf28}+1.60\%$
test_step_mdp_speed[True-False-True-True-True] 80.4900μs 33.5014μs 29.8495 KOps/s 29.3578 KOps/s $\color{#35bf28}+1.67\%$
test_step_mdp_speed[True-False-True-True-False] 63.6990μs 21.3201μs 46.9041 KOps/s 46.3156 KOps/s $\color{#35bf28}+1.27\%$
test_step_mdp_speed[True-False-True-False-True] 75.2010μs 18.6116μs 53.7298 KOps/s 52.2955 KOps/s $\color{#35bf28}+2.74\%$
test_step_mdp_speed[True-False-True-False-False] 46.4770μs 12.1081μs 82.5897 KOps/s 83.2268 KOps/s $\color{#d91a1a}-0.77\%$
test_step_mdp_speed[True-False-False-True-True] 96.5600μs 35.4521μs 28.2071 KOps/s 27.4345 KOps/s $\color{#35bf28}+2.82\%$
test_step_mdp_speed[True-False-False-True-False] 81.5920μs 23.1789μs 43.1427 KOps/s 42.2518 KOps/s $\color{#35bf28}+2.11\%$
test_step_mdp_speed[True-False-False-False-True] 75.5310μs 20.9996μs 47.6199 KOps/s 47.8779 KOps/s $\color{#d91a1a}-0.54\%$
test_step_mdp_speed[True-False-False-False-False] 64.7510μs 13.6585μs 73.2146 KOps/s 72.1635 KOps/s $\color{#35bf28}+1.46\%$
test_step_mdp_speed[False-True-True-True-True] 78.4070μs 34.1513μs 29.2815 KOps/s 28.8276 KOps/s $\color{#35bf28}+1.57\%$
test_step_mdp_speed[False-True-True-True-False] 77.7860μs 21.4264μs 46.6714 KOps/s 45.6569 KOps/s $\color{#35bf28}+2.22\%$
test_step_mdp_speed[False-True-True-False-True] 46.7970μs 21.0221μs 47.5690 KOps/s 45.8280 KOps/s $\color{#35bf28}+3.80\%$
test_step_mdp_speed[False-True-True-False-False] 59.3010μs 13.1923μs 75.8018 KOps/s 75.5549 KOps/s $\color{#35bf28}+0.33\%$
test_step_mdp_speed[False-True-False-True-True] 90.6300μs 35.5509μs 28.1287 KOps/s 27.6898 KOps/s $\color{#35bf28}+1.59\%$
test_step_mdp_speed[False-True-False-True-False] 62.0160μs 23.0703μs 43.3458 KOps/s 42.6081 KOps/s $\color{#35bf28}+1.73\%$
test_step_mdp_speed[False-True-False-False-True] 2.5918ms 22.7076μs 44.0381 KOps/s 42.1777 KOps/s $\color{#35bf28}+4.41\%$
test_step_mdp_speed[False-True-False-False-False] 46.4770μs 14.7586μs 67.7572 KOps/s 66.5925 KOps/s $\color{#35bf28}+1.75\%$
test_step_mdp_speed[False-False-True-True-True] 94.1960μs 37.2188μs 26.8682 KOps/s 26.3051 KOps/s $\color{#35bf28}+2.14\%$
test_step_mdp_speed[False-False-True-True-False] 0.2514ms 25.5402μs 39.1540 KOps/s 39.4874 KOps/s $\color{#d91a1a}-0.84\%$
test_step_mdp_speed[False-False-True-False-True] 57.9380μs 23.0272μs 43.4269 KOps/s 42.7904 KOps/s $\color{#35bf28}+1.49\%$
test_step_mdp_speed[False-False-True-False-False] 60.5130μs 14.7926μs 67.6015 KOps/s 66.7332 KOps/s $\color{#35bf28}+1.30\%$
test_step_mdp_speed[False-False-False-True-True] 75.8910μs 38.7935μs 25.7775 KOps/s 25.0839 KOps/s $\color{#35bf28}+2.77\%$
test_step_mdp_speed[False-False-False-True-False] 76.2830μs 26.5725μs 37.6329 KOps/s 37.3104 KOps/s $\color{#35bf28}+0.86\%$
test_step_mdp_speed[False-False-False-False-True] 77.6850μs 23.8634μs 41.9052 KOps/s 40.4092 KOps/s $\color{#35bf28}+3.70\%$
test_step_mdp_speed[False-False-False-False-False] 41.9790μs 16.3185μs 61.2802 KOps/s 59.8046 KOps/s $\color{#35bf28}+2.47\%$
test_values[generalized_advantage_estimate-True-True] 11.5710ms 9.3765ms 106.6499 Ops/s 105.9787 Ops/s $\color{#35bf28}+0.63\%$
test_values[vec_generalized_advantage_estimate-True-True] 35.7852ms 33.4409ms 29.9035 Ops/s 29.8619 Ops/s $\color{#35bf28}+0.14\%$
test_values[td0_return_estimate-False-False] 0.2440ms 0.1792ms 5.5789 KOps/s 5.2974 KOps/s $\textbf{\color{#35bf28}+5.31\%}$
test_values[td1_return_estimate-False-False] 27.6306ms 23.8714ms 41.8911 Ops/s 41.6366 Ops/s $\color{#35bf28}+0.61\%$
test_values[vec_td1_return_estimate-False-False] 35.3753ms 33.4654ms 29.8816 Ops/s 29.7311 Ops/s $\color{#35bf28}+0.51\%$
test_values[td_lambda_return_estimate-True-False] 37.2986ms 34.1294ms 29.3002 Ops/s 28.2758 Ops/s $\color{#35bf28}+3.62\%$
test_values[vec_td_lambda_return_estimate-True-False] 49.7934ms 34.2874ms 29.1653 Ops/s 29.7067 Ops/s $\color{#d91a1a}-1.82\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.3800ms 8.2748ms 120.8484 Ops/s 121.3930 Ops/s $\color{#d91a1a}-0.45\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.2901ms 2.0236ms 494.1748 Ops/s 499.4785 Ops/s $\color{#d91a1a}-1.06\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.4315ms 0.3595ms 2.7814 KOps/s 2.8077 KOps/s $\color{#d91a1a}-0.94\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 43.9172ms 40.5155ms 24.6819 Ops/s 23.4166 Ops/s $\textbf{\color{#35bf28}+5.40\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.9524ms 3.0554ms 327.2912 Ops/s 326.4606 Ops/s $\color{#35bf28}+0.25\%$
test_dqn_speed[False-None] 1.5296ms 1.3918ms 718.4986 Ops/s 706.9895 Ops/s $\color{#35bf28}+1.63\%$
test_dqn_speed[False-backward] 1.9485ms 1.8759ms 533.0703 Ops/s 526.1930 Ops/s $\color{#35bf28}+1.31\%$
test_dqn_speed[True-None] 0.7046ms 0.4752ms 2.1045 KOps/s 2.1148 KOps/s $\color{#d91a1a}-0.49\%$
test_dqn_speed[True-backward] 0.9823ms 0.8999ms 1.1112 KOps/s 843.1476 Ops/s $\textbf{\color{#35bf28}+31.80\%}$
test_dqn_speed[reduce-overhead-None] 1.5121ms 0.4728ms 2.1150 KOps/s 2.1238 KOps/s $\color{#d91a1a}-0.41\%$
test_dqn_speed[reduce-overhead-backward] 0.9312ms 0.8879ms 1.1263 KOps/s 1.1157 KOps/s $\color{#35bf28}+0.95\%$
test_ddpg_speed[False-None] 4.5182ms 2.8575ms 349.9605 Ops/s 340.6353 Ops/s $\color{#35bf28}+2.74\%$
test_ddpg_speed[False-backward] 4.1015ms 3.9832ms 251.0547 Ops/s 244.8816 Ops/s $\color{#35bf28}+2.52\%$
test_ddpg_speed[True-None] 1.1245ms 1.0011ms 998.8549 Ops/s 992.7716 Ops/s $\color{#35bf28}+0.61\%$
test_ddpg_speed[True-backward] 1.9705ms 1.9080ms 524.1089 Ops/s 513.1611 Ops/s $\color{#35bf28}+2.13\%$
test_ddpg_speed[reduce-overhead-None] 2.1766ms 1.0074ms 992.6195 Ops/s 987.2620 Ops/s $\color{#35bf28}+0.54\%$
test_ddpg_speed[reduce-overhead-backward] 1.9897ms 1.9139ms 522.4996 Ops/s 450.6246 Ops/s $\textbf{\color{#35bf28}+15.95\%}$
test_sac_speed[False-None] 9.1496ms 8.0075ms 124.8835 Ops/s 118.3494 Ops/s $\textbf{\color{#35bf28}+5.52\%}$
test_sac_speed[False-backward] 0.2208s 15.1100ms 66.1814 Ops/s 88.6398 Ops/s $\textbf{\color{#d91a1a}-25.34\%}$
test_sac_speed[True-None] 2.3558ms 1.8391ms 543.7470 Ops/s 528.2014 Ops/s $\color{#35bf28}+2.94\%$
test_sac_speed[True-backward] 3.7924ms 3.5361ms 282.7972 Ops/s 269.0783 Ops/s $\textbf{\color{#35bf28}+5.10\%}$
test_sac_speed[reduce-overhead-None] 2.4036ms 1.8407ms 543.2693 Ops/s 536.1614 Ops/s $\color{#35bf28}+1.33\%$
test_sac_speed[reduce-overhead-backward] 3.9860ms 3.5647ms 280.5295 Ops/s 275.7725 Ops/s $\color{#35bf28}+1.72\%$
test_redq_speed[False-None] 15.1363ms 12.8426ms 77.8656 Ops/s 75.2336 Ops/s $\color{#35bf28}+3.50\%$
test_redq_speed[False-backward] 23.0683ms 21.9521ms 45.5536 Ops/s 44.0893 Ops/s $\color{#35bf28}+3.32\%$
test_redq_speed[True-None] 5.2128ms 4.5406ms 220.2347 Ops/s 208.4652 Ops/s $\textbf{\color{#35bf28}+5.65\%}$
test_redq_speed[True-backward] 13.6185ms 12.1293ms 82.4453 Ops/s 79.0140 Ops/s $\color{#35bf28}+4.34\%$
test_redq_speed[reduce-overhead-None] 5.3709ms 4.5984ms 217.4679 Ops/s 197.8787 Ops/s $\textbf{\color{#35bf28}+9.90\%}$
test_redq_speed[reduce-overhead-backward] 13.7231ms 12.1481ms 82.3175 Ops/s 78.6380 Ops/s $\color{#35bf28}+4.68\%$
test_redq_deprec_speed[False-None] 14.0020ms 12.7022ms 78.7263 Ops/s 70.5219 Ops/s $\textbf{\color{#35bf28}+11.63\%}$
test_redq_deprec_speed[False-backward] 19.9336ms 18.6208ms 53.7035 Ops/s 49.0631 Ops/s $\textbf{\color{#35bf28}+9.46\%}$
test_redq_deprec_speed[True-None] 4.3448ms 3.5885ms 278.6641 Ops/s 266.0660 Ops/s $\color{#35bf28}+4.73\%$
test_redq_deprec_speed[True-backward] 9.2063ms 8.1144ms 123.2381 Ops/s 110.7111 Ops/s $\textbf{\color{#35bf28}+11.32\%}$
test_redq_deprec_speed[reduce-overhead-None] 4.2590ms 3.5538ms 281.3904 Ops/s 255.2125 Ops/s $\textbf{\color{#35bf28}+10.26\%}$
test_redq_deprec_speed[reduce-overhead-backward] 8.2656ms 8.0211ms 124.6716 Ops/s 122.3536 Ops/s $\color{#35bf28}+1.89\%$
test_td3_speed[False-None] 8.2069ms 7.9572ms 125.6719 Ops/s 120.1552 Ops/s $\color{#35bf28}+4.59\%$
test_td3_speed[False-backward] 11.5595ms 10.3777ms 96.3603 Ops/s 90.0007 Ops/s $\textbf{\color{#35bf28}+7.07\%}$
test_td3_speed[True-None] 1.9174ms 1.7080ms 585.4830 Ops/s 576.0308 Ops/s $\color{#35bf28}+1.64\%$
test_td3_speed[True-backward] 4.1156ms 3.3384ms 299.5473 Ops/s 281.7845 Ops/s $\textbf{\color{#35bf28}+6.30\%}$
test_td3_speed[reduce-overhead-None] 1.8735ms 1.7136ms 583.5509 Ops/s 577.3880 Ops/s $\color{#35bf28}+1.07\%$
test_td3_speed[reduce-overhead-backward] 4.4331ms 3.7045ms 269.9399 Ops/s 290.8777 Ops/s $\textbf{\color{#d91a1a}-7.20\%}$
test_cql_speed[False-None] 38.0206ms 35.8847ms 27.8670 Ops/s 27.1989 Ops/s $\color{#35bf28}+2.46\%$
test_cql_speed[False-backward] 48.4342ms 46.3566ms 21.5719 Ops/s 21.1839 Ops/s $\color{#35bf28}+1.83\%$
test_cql_speed[True-None] 16.7661ms 15.7238ms 63.5980 Ops/s 63.5083 Ops/s $\color{#35bf28}+0.14\%$
test_cql_speed[True-backward] 24.1627ms 22.7370ms 43.9811 Ops/s 43.8136 Ops/s $\color{#35bf28}+0.38\%$
test_cql_speed[reduce-overhead-None] 16.3168ms 15.5149ms 64.4542 Ops/s 64.0676 Ops/s $\color{#35bf28}+0.60\%$
test_cql_speed[reduce-overhead-backward] 23.1701ms 22.2873ms 44.8687 Ops/s 44.3631 Ops/s $\color{#35bf28}+1.14\%$
test_a2c_speed[False-None] 9.3048ms 7.1729ms 139.4142 Ops/s 136.0377 Ops/s $\color{#35bf28}+2.48\%$
test_a2c_speed[False-backward] 14.7155ms 14.2767ms 70.0440 Ops/s 68.1212 Ops/s $\color{#35bf28}+2.82\%$
test_a2c_speed[True-None] 4.6000ms 4.2055ms 237.7847 Ops/s 236.7738 Ops/s $\color{#35bf28}+0.43\%$
test_a2c_speed[True-backward] 11.6820ms 10.9032ms 91.7160 Ops/s 93.1362 Ops/s $\color{#d91a1a}-1.52\%$
test_a2c_speed[reduce-overhead-None] 4.9005ms 4.1950ms 238.3805 Ops/s 232.9966 Ops/s $\color{#35bf28}+2.31\%$
test_a2c_speed[reduce-overhead-backward] 11.6764ms 11.0087ms 90.8373 Ops/s 91.7176 Ops/s $\color{#d91a1a}-0.96\%$
test_ppo_speed[False-None] 0.2766s 9.3915ms 106.4794 Ops/s 128.5140 Ops/s $\textbf{\color{#d91a1a}-17.15\%}$
test_ppo_speed[False-backward] 16.6342ms 15.1089ms 66.1860 Ops/s 66.3379 Ops/s $\color{#d91a1a}-0.23\%$
test_ppo_speed[True-None] 4.1462ms 3.6824ms 271.5625 Ops/s 269.0742 Ops/s $\color{#35bf28}+0.92\%$
test_ppo_speed[True-backward] 11.9069ms 9.7965ms 102.0774 Ops/s 102.7042 Ops/s $\color{#d91a1a}-0.61\%$
test_ppo_speed[reduce-overhead-None] 3.9530ms 3.6687ms 272.5739 Ops/s 268.9069 Ops/s $\color{#35bf28}+1.36\%$
test_ppo_speed[reduce-overhead-backward] 10.2137ms 9.7294ms 102.7815 Ops/s 102.9460 Ops/s $\color{#d91a1a}-0.16\%$
test_reinforce_speed[False-None] 8.3537ms 6.5067ms 153.6885 Ops/s 151.4455 Ops/s $\color{#35bf28}+1.48\%$
test_reinforce_speed[False-backward] 10.9229ms 9.8955ms 101.0557 Ops/s 100.0274 Ops/s $\color{#35bf28}+1.03\%$
test_reinforce_speed[True-None] 2.9700ms 2.6491ms 377.4894 Ops/s 375.6470 Ops/s $\color{#35bf28}+0.49\%$
test_reinforce_speed[True-backward] 9.0758ms 8.6907ms 115.0652 Ops/s 116.1112 Ops/s $\color{#d91a1a}-0.90\%$
test_reinforce_speed[reduce-overhead-None] 3.2630ms 2.6342ms 379.6206 Ops/s 373.9697 Ops/s $\color{#35bf28}+1.51\%$
test_reinforce_speed[reduce-overhead-backward] 9.0791ms 8.6499ms 115.6086 Ops/s 113.9105 Ops/s $\color{#35bf28}+1.49\%$
test_iql_speed[False-None] 32.8726ms 31.6089ms 31.6366 Ops/s 31.1068 Ops/s $\color{#35bf28}+1.70\%$
test_iql_speed[False-backward] 46.2166ms 44.5640ms 22.4396 Ops/s 21.7352 Ops/s $\color{#35bf28}+3.24\%$
test_iql_speed[True-None] 11.4744ms 10.6288ms 94.0838 Ops/s 93.6979 Ops/s $\color{#35bf28}+0.41\%$
test_iql_speed[True-backward] 24.9642ms 21.7234ms 46.0334 Ops/s 45.9800 Ops/s $\color{#35bf28}+0.12\%$
test_iql_speed[reduce-overhead-None] 11.4082ms 10.6498ms 93.8983 Ops/s 91.7053 Ops/s $\color{#35bf28}+2.39\%$
test_iql_speed[reduce-overhead-backward] 22.3905ms 21.5891ms 46.3198 Ops/s 45.6734 Ops/s $\color{#35bf28}+1.42\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.8354ms 4.9037ms 203.9285 Ops/s 196.8025 Ops/s $\color{#35bf28}+3.62\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9535ms 0.5105ms 1.9591 KOps/s 1.9026 KOps/s $\color{#35bf28}+2.96\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7528ms 0.4845ms 2.0640 KOps/s 1.9925 KOps/s $\color{#35bf28}+3.59\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.0984ms 4.6871ms 213.3530 Ops/s 199.1612 Ops/s $\textbf{\color{#35bf28}+7.13\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.4723ms 0.4967ms 2.0133 KOps/s 1.9637 KOps/s $\color{#35bf28}+2.53\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.8317ms 0.4747ms 2.1067 KOps/s 2.0833 KOps/s $\color{#35bf28}+1.13\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.2222ms 1.6285ms 614.0684 Ops/s 607.0698 Ops/s $\color{#35bf28}+1.15\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.2910ms 1.5734ms 635.5727 Ops/s 621.2185 Ops/s $\color{#35bf28}+2.31\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.5841ms 4.8236ms 207.3132 Ops/s 195.1863 Ops/s $\textbf{\color{#35bf28}+6.21\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.0388ms 0.6387ms 1.5656 KOps/s 1.5298 KOps/s $\color{#35bf28}+2.34\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.0070ms 0.6165ms 1.6220 KOps/s 1.5903 KOps/s $\color{#35bf28}+1.99\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.0954ms 4.6463ms 215.2248 Ops/s 208.1475 Ops/s $\color{#35bf28}+3.40\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9582ms 0.5121ms 1.9529 KOps/s 1.9131 KOps/s $\color{#35bf28}+2.08\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6797ms 0.4843ms 2.0648 KOps/s 2.0036 KOps/s $\color{#35bf28}+3.05\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.7854ms 4.6610ms 214.5464 Ops/s 207.5278 Ops/s $\color{#35bf28}+3.38\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.0040ms 0.4995ms 2.0019 KOps/s 1.9737 KOps/s $\color{#35bf28}+1.43\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7592ms 0.4742ms 2.1090 KOps/s 2.1131 KOps/s $\color{#d91a1a}-0.19\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.5772ms 4.8159ms 207.6454 Ops/s 204.7907 Ops/s $\color{#35bf28}+1.39\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.7661ms 0.6409ms 1.5603 KOps/s 1.5406 KOps/s $\color{#35bf28}+1.28\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8965ms 0.6221ms 1.6073 KOps/s 1.5782 KOps/s $\color{#35bf28}+1.85\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.4616ms 4.1410ms 241.4864 Ops/s 253.9306 Ops/s $\color{#d91a1a}-4.90\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 0.3966s 10.1294ms 98.7223 Ops/s 463.7002 Ops/s $\textbf{\color{#d91a1a}-78.71\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 4.1352ms 1.3008ms 768.7434 Ops/s 646.3969 Ops/s $\textbf{\color{#35bf28}+18.93\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 5.5416ms 4.1861ms 238.8843 Ops/s 37.7550 Ops/s $\textbf{\color{#35bf28}+532.72\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.9655ms 2.3095ms 432.9997 Ops/s 440.2074 Ops/s $\color{#d91a1a}-1.64\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 1.9132ms 1.2639ms 791.2188 Ops/s 830.5380 Ops/s $\color{#d91a1a}-4.73\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.3627s 11.5425ms 86.6362 Ops/s 223.7783 Ops/s $\textbf{\color{#d91a1a}-61.28\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 6.0427ms 2.4421ms 409.4914 Ops/s 406.0148 Ops/s $\color{#35bf28}+0.86\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 7.3276ms 1.5265ms 655.0793 Ops/s 669.4602 Ops/s $\color{#d91a1a}-2.15\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 11.5573ms 11.2827ms 88.6316 Ops/s 87.5965 Ops/s $\color{#35bf28}+1.18\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.1988ms 14.5734ms 68.6183 Ops/s 68.6177 Ops/s $+0.00\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 20.3426ms 19.9214ms 50.1973 Ops/s 49.6391 Ops/s $\color{#35bf28}+1.12\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 16.3749ms 14.6915ms 68.0665 Ops/s 67.7193 Ops/s $\color{#35bf28}+0.51\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 21.4155ms 20.0127ms 49.9683 Ops/s 49.9544 Ops/s $\color{#35bf28}+0.03\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 16.8959ms 15.9055ms 62.8712 Ops/s 64.0117 Ops/s $\color{#d91a1a}-1.78\%$

Copy link

github-actions bot commented Dec 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}16$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7355s 0.7313s 1.3675 Ops/s 1.2972 Ops/s $\textbf{\color{#35bf28}+5.42\%}$
test_transformed 1.0727s 0.9909s 1.0092 Ops/s 1.0220 Ops/s $\color{#d91a1a}-1.25\%$
test_serial 2.2023s 2.1185s 0.4720 Ops/s 0.4737 Ops/s $\color{#d91a1a}-0.35\%$
test_parallel 2.0556s 1.9754s 0.5062 Ops/s 0.5097 Ops/s $\color{#d91a1a}-0.68\%$
test_step_mdp_speed[True-True-True-True-True] 0.2232ms 38.3349μs 26.0859 KOps/s 25.7719 KOps/s $\color{#35bf28}+1.22\%$
test_step_mdp_speed[True-True-True-True-False] 50.0710μs 21.7537μs 45.9692 KOps/s 45.7088 KOps/s $\color{#35bf28}+0.57\%$
test_step_mdp_speed[True-True-True-False-True] 50.0710μs 21.2613μs 47.0339 KOps/s 47.7344 KOps/s $\color{#d91a1a}-1.47\%$
test_step_mdp_speed[True-True-True-False-False] 40.6100μs 12.2919μs 81.3546 KOps/s 81.7893 KOps/s $\color{#d91a1a}-0.53\%$
test_step_mdp_speed[True-True-False-True-True] 81.2000μs 41.1302μs 24.3131 KOps/s 24.0935 KOps/s $\color{#35bf28}+0.91\%$
test_step_mdp_speed[True-True-False-True-False] 58.0510μs 23.7316μs 42.1380 KOps/s 41.9576 KOps/s $\color{#35bf28}+0.43\%$
test_step_mdp_speed[True-True-False-False-True] 89.6110μs 23.8308μs 41.9625 KOps/s 42.7491 KOps/s $\color{#d91a1a}-1.84\%$
test_step_mdp_speed[True-True-False-False-False] 41.3200μs 14.2405μs 70.2220 KOps/s 69.9794 KOps/s $\color{#35bf28}+0.35\%$
test_step_mdp_speed[True-False-True-True-True] 84.2210μs 43.0453μs 23.2313 KOps/s 23.4714 KOps/s $\color{#d91a1a}-1.02\%$
test_step_mdp_speed[True-False-True-True-False] 57.5610μs 26.2760μs 38.0575 KOps/s 39.3452 KOps/s $\color{#d91a1a}-3.27\%$
test_step_mdp_speed[True-False-True-False-True] 55.9100μs 23.6457μs 42.2910 KOps/s 43.5265 KOps/s $\color{#d91a1a}-2.84\%$
test_step_mdp_speed[True-False-True-False-False] 42.8100μs 14.3553μs 69.6607 KOps/s 70.7796 KOps/s $\color{#d91a1a}-1.58\%$
test_step_mdp_speed[True-False-False-True-True] 91.8810μs 44.7300μs 22.3564 KOps/s 22.9531 KOps/s $\color{#d91a1a}-2.60\%$
test_step_mdp_speed[True-False-False-True-False] 71.9400μs 28.0961μs 35.5921 KOps/s 35.9738 KOps/s $\color{#d91a1a}-1.06\%$
test_step_mdp_speed[True-False-False-False-True] 60.1410μs 24.8727μs 40.2047 KOps/s 39.5869 KOps/s $\color{#35bf28}+1.56\%$
test_step_mdp_speed[True-False-False-False-False] 46.4910μs 16.5048μs 60.5884 KOps/s 62.1607 KOps/s $\color{#d91a1a}-2.53\%$
test_step_mdp_speed[False-True-True-True-True] 70.4610μs 43.1073μs 23.1979 KOps/s 23.7042 KOps/s $\color{#d91a1a}-2.14\%$
test_step_mdp_speed[False-True-True-True-False] 95.7710μs 26.1073μs 38.3035 KOps/s 38.9904 KOps/s $\color{#d91a1a}-1.76\%$
test_step_mdp_speed[False-True-True-False-True] 58.8600μs 26.6767μs 37.4858 KOps/s 37.7102 KOps/s $\color{#d91a1a}-0.59\%$
test_step_mdp_speed[False-True-True-False-False] 46.3800μs 15.5494μs 64.3112 KOps/s 63.8283 KOps/s $\color{#35bf28}+0.76\%$
test_step_mdp_speed[False-True-False-True-True] 82.8710μs 44.5213μs 22.4612 KOps/s 22.7446 KOps/s $\color{#d91a1a}-1.25\%$
test_step_mdp_speed[False-True-False-True-False] 58.8210μs 28.0158μs 35.6941 KOps/s 35.8162 KOps/s $\color{#d91a1a}-0.34\%$
test_step_mdp_speed[False-True-False-False-True] 3.4437ms 29.1417μs 34.3151 KOps/s 34.2087 KOps/s $\color{#35bf28}+0.31\%$
test_step_mdp_speed[False-True-False-False-False] 49.1710μs 17.8725μs 55.9518 KOps/s 56.0258 KOps/s $\color{#d91a1a}-0.13\%$
test_step_mdp_speed[False-False-True-True-True] 74.5000μs 47.3276μs 21.1293 KOps/s 21.6612 KOps/s $\color{#d91a1a}-2.46\%$
test_step_mdp_speed[False-False-True-True-False] 62.9710μs 30.4502μs 32.8405 KOps/s 33.2738 KOps/s $\color{#d91a1a}-1.30\%$
test_step_mdp_speed[False-False-True-False-True] 68.1200μs 29.0820μs 34.3855 KOps/s 35.0632 KOps/s $\color{#d91a1a}-1.93\%$
test_step_mdp_speed[False-False-True-False-False] 51.1700μs 18.4935μs 54.0729 KOps/s 56.0040 KOps/s $\color{#d91a1a}-3.45\%$
test_step_mdp_speed[False-False-False-True-True] 80.2210μs 48.7792μs 20.5005 KOps/s 20.9176 KOps/s $\color{#d91a1a}-1.99\%$
test_step_mdp_speed[False-False-False-True-False] 61.2910μs 32.4811μs 30.7872 KOps/s 31.1119 KOps/s $\color{#d91a1a}-1.04\%$
test_step_mdp_speed[False-False-False-False-True] 65.8700μs 30.0896μs 33.2340 KOps/s 33.0903 KOps/s $\color{#35bf28}+0.43\%$
test_step_mdp_speed[False-False-False-False-False] 54.1400μs 19.7380μs 50.6636 KOps/s 51.4758 KOps/s $\color{#d91a1a}-1.58\%$
test_values[generalized_advantage_estimate-True-True] 24.5723ms 24.0497ms 41.5805 Ops/s 41.2139 Ops/s $\color{#35bf28}+0.89\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1095s 3.0842ms 324.2371 Ops/s 354.5697 Ops/s $\textbf{\color{#d91a1a}-8.55\%}$
test_values[td0_return_estimate-False-False] 0.1005ms 79.5140μs 12.5764 KOps/s 12.6117 KOps/s $\color{#d91a1a}-0.28\%$
test_values[td1_return_estimate-False-False] 54.5056ms 54.0179ms 18.5124 Ops/s 17.6097 Ops/s $\textbf{\color{#35bf28}+5.13\%}$
test_values[vec_td1_return_estimate-False-False] 1.2642ms 1.0742ms 930.9456 Ops/s 930.4403 Ops/s $\color{#35bf28}+0.05\%$
test_values[td_lambda_return_estimate-True-False] 86.6003ms 85.3334ms 11.7187 Ops/s 11.6007 Ops/s $\color{#35bf28}+1.02\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3232ms 1.0679ms 936.4536 Ops/s 940.5517 Ops/s $\color{#d91a1a}-0.44\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 24.4881ms 24.2216ms 41.2854 Ops/s 41.9252 Ops/s $\color{#d91a1a}-1.53\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0220ms 0.7389ms 1.3534 KOps/s 1.3494 KOps/s $\color{#35bf28}+0.30\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7679ms 0.6595ms 1.5164 KOps/s 1.5205 KOps/s $\color{#d91a1a}-0.27\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5201ms 1.4698ms 680.3751 Ops/s 679.9631 Ops/s $\color{#35bf28}+0.06\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7214ms 0.6705ms 1.4914 KOps/s 1.4858 KOps/s $\color{#35bf28}+0.38\%$
test_dqn_speed[False-None] 7.0749ms 1.4370ms 695.8874 Ops/s 687.6436 Ops/s $\color{#35bf28}+1.20\%$
test_dqn_speed[False-backward] 2.0849ms 2.0399ms 490.2142 Ops/s 485.6630 Ops/s $\color{#35bf28}+0.94\%$
test_dqn_speed[True-None] 0.9387ms 0.5334ms 1.8749 KOps/s 1.8180 KOps/s $\color{#35bf28}+3.13\%$
test_dqn_speed[True-backward] 1.1203ms 1.0612ms 942.3598 Ops/s 840.6761 Ops/s $\textbf{\color{#35bf28}+12.10\%}$
test_dqn_speed[reduce-overhead-None] 0.9285ms 0.5349ms 1.8694 KOps/s 1.8035 KOps/s $\color{#35bf28}+3.65\%$
test_dqn_speed[reduce-overhead-backward] 1.0324ms 0.9490ms 1.0538 KOps/s 1.0605 KOps/s $\color{#d91a1a}-0.64\%$
test_ddpg_speed[False-None] 3.0735ms 2.7006ms 370.2816 Ops/s 363.8626 Ops/s $\color{#35bf28}+1.76\%$
test_ddpg_speed[False-backward] 4.1019ms 3.9692ms 251.9381 Ops/s 250.9473 Ops/s $\color{#35bf28}+0.39\%$
test_ddpg_speed[True-None] 1.4805ms 1.0545ms 948.3333 Ops/s 930.1981 Ops/s $\color{#35bf28}+1.95\%$
test_ddpg_speed[True-backward] 2.1935ms 2.1123ms 473.4195 Ops/s 475.1272 Ops/s $\color{#d91a1a}-0.36\%$
test_ddpg_speed[reduce-overhead-None] 1.4955ms 1.0818ms 924.4224 Ops/s 962.3691 Ops/s $\color{#d91a1a}-3.94\%$
test_ddpg_speed[reduce-overhead-backward] 1.6837ms 1.5853ms 630.7994 Ops/s 627.2593 Ops/s $\color{#35bf28}+0.56\%$
test_sac_speed[False-None] 8.3635ms 7.8055ms 128.1141 Ops/s 126.9970 Ops/s $\color{#35bf28}+0.88\%$
test_sac_speed[False-backward] 11.3611ms 10.6886ms 93.5580 Ops/s 91.7743 Ops/s $\color{#35bf28}+1.94\%$
test_sac_speed[True-None] 1.8872ms 1.4775ms 676.7988 Ops/s 671.9706 Ops/s $\color{#35bf28}+0.72\%$
test_sac_speed[True-backward] 3.3530ms 3.2966ms 303.3442 Ops/s 317.0692 Ops/s $\color{#d91a1a}-4.33\%$
test_sac_speed[reduce-overhead-None] 22.8265ms 12.5870ms 79.4473 Ops/s 79.5164 Ops/s $\color{#d91a1a}-0.09\%$
test_sac_speed[reduce-overhead-backward] 1.5661ms 1.4793ms 675.9736 Ops/s 742.6511 Ops/s $\textbf{\color{#d91a1a}-8.98\%}$
test_redq_speed[False-None] 8.0226ms 7.2461ms 138.0057 Ops/s 133.2521 Ops/s $\color{#35bf28}+3.57\%$
test_redq_speed[False-backward] 12.4734ms 11.4174ms 87.5860 Ops/s 87.7805 Ops/s $\color{#d91a1a}-0.22\%$
test_redq_speed[True-None] 2.4080ms 1.9352ms 516.7411 Ops/s 500.8595 Ops/s $\color{#35bf28}+3.17\%$
test_redq_speed[True-backward] 3.8028ms 3.7545ms 266.3475 Ops/s 265.2570 Ops/s $\color{#35bf28}+0.41\%$
test_redq_speed[reduce-overhead-None] 2.0184ms 1.9357ms 516.6010 Ops/s 509.3612 Ops/s $\color{#35bf28}+1.42\%$
test_redq_speed[reduce-overhead-backward] 3.7525ms 3.6084ms 277.1277 Ops/s 259.9026 Ops/s $\textbf{\color{#35bf28}+6.63\%}$
test_redq_deprec_speed[False-None] 9.5551ms 8.7754ms 113.9555 Ops/s 112.6540 Ops/s $\color{#35bf28}+1.16\%$
test_redq_deprec_speed[False-backward] 12.4039ms 11.7360ms 85.2078 Ops/s 82.1639 Ops/s $\color{#35bf28}+3.70\%$
test_redq_deprec_speed[True-None] 2.5457ms 2.3046ms 433.9206 Ops/s 443.3486 Ops/s $\color{#d91a1a}-2.13\%$
test_redq_deprec_speed[True-backward] 3.9133ms 3.8374ms 260.5945 Ops/s 257.6360 Ops/s $\color{#35bf28}+1.15\%$
test_redq_deprec_speed[reduce-overhead-None] 2.4488ms 2.2554ms 443.3861 Ops/s 443.4123 Ops/s $-0.01\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.4475ms 4.0357ms 247.7915 Ops/s 256.6934 Ops/s $\color{#d91a1a}-3.47\%$
test_td3_speed[False-None] 7.6144ms 7.5792ms 131.9401 Ops/s 129.8306 Ops/s $\color{#35bf28}+1.62\%$
test_td3_speed[False-backward] 10.5439ms 10.0794ms 99.2122 Ops/s 99.4809 Ops/s $\color{#d91a1a}-0.27\%$
test_td3_speed[True-None] 1.5235ms 1.5002ms 666.5625 Ops/s 661.4173 Ops/s $\color{#35bf28}+0.78\%$
test_td3_speed[True-backward] 3.2679ms 3.1604ms 316.4178 Ops/s 329.1724 Ops/s $\color{#d91a1a}-3.87\%$
test_td3_speed[reduce-overhead-None] 48.4420ms 24.7136ms 40.4636 Ops/s 38.1583 Ops/s $\textbf{\color{#35bf28}+6.04\%}$
test_td3_speed[reduce-overhead-backward] 1.3969ms 1.2841ms 778.7345 Ops/s 716.1313 Ops/s $\textbf{\color{#35bf28}+8.74\%}$
test_cql_speed[False-None] 16.3689ms 15.6492ms 63.9009 Ops/s 63.5191 Ops/s $\color{#35bf28}+0.60\%$
test_cql_speed[False-backward] 21.2209ms 20.7146ms 48.2752 Ops/s 47.5603 Ops/s $\color{#35bf28}+1.50\%$
test_cql_speed[True-None] 2.9110ms 2.7814ms 359.5299 Ops/s 357.5064 Ops/s $\color{#35bf28}+0.57\%$
test_cql_speed[True-backward] 5.4007ms 4.8984ms 204.1502 Ops/s 196.1958 Ops/s $\color{#35bf28}+4.05\%$
test_cql_speed[reduce-overhead-None] 20.7872ms 12.7846ms 78.2193 Ops/s 76.5850 Ops/s $\color{#35bf28}+2.13\%$
test_cql_speed[reduce-overhead-backward] 1.7160ms 1.6500ms 606.0666 Ops/s 672.5768 Ops/s $\textbf{\color{#d91a1a}-9.89\%}$
test_a2c_speed[False-None] 3.1672ms 3.0557ms 327.2623 Ops/s 320.5524 Ops/s $\color{#35bf28}+2.09\%$
test_a2c_speed[False-backward] 7.1099ms 6.1451ms 162.7323 Ops/s 164.0601 Ops/s $\color{#d91a1a}-0.81\%$
test_a2c_speed[True-None] 1.0086ms 0.9523ms 1.0501 KOps/s 1.0372 KOps/s $\color{#35bf28}+1.24\%$
test_a2c_speed[True-backward] 2.7610ms 2.7031ms 369.9425 Ops/s 373.4639 Ops/s $\color{#d91a1a}-0.94\%$
test_a2c_speed[reduce-overhead-None] 0.4000s 12.2851ms 81.3991 Ops/s 86.6659 Ops/s $\textbf{\color{#d91a1a}-6.08\%}$
test_a2c_speed[reduce-overhead-backward] 1.1768ms 1.1328ms 882.7581 Ops/s 888.4795 Ops/s $\color{#d91a1a}-0.64\%$
test_ppo_speed[False-None] 3.7609ms 3.5304ms 283.2548 Ops/s 278.6904 Ops/s $\color{#35bf28}+1.64\%$
test_ppo_speed[False-backward] 7.2143ms 6.8611ms 145.7484 Ops/s 144.5784 Ops/s $\color{#35bf28}+0.81\%$
test_ppo_speed[True-None] 1.0098ms 0.9251ms 1.0810 KOps/s 1.0354 KOps/s $\color{#35bf28}+4.41\%$
test_ppo_speed[True-backward] 2.7757ms 2.6558ms 376.5316 Ops/s 402.5541 Ops/s $\textbf{\color{#d91a1a}-6.46\%}$
test_ppo_speed[reduce-overhead-None] 0.5326ms 0.4793ms 2.0862 KOps/s 1.9407 KOps/s $\textbf{\color{#35bf28}+7.50\%}$
test_ppo_speed[reduce-overhead-backward] 1.1502ms 1.1121ms 899.2091 Ops/s 1.0219 KOps/s $\textbf{\color{#d91a1a}-12.01\%}$
test_reinforce_speed[False-None] 2.2542ms 2.1451ms 466.1737 Ops/s 453.8884 Ops/s $\color{#35bf28}+2.71\%$
test_reinforce_speed[False-backward] 3.6364ms 3.2183ms 310.7229 Ops/s 314.5901 Ops/s $\color{#d91a1a}-1.23\%$
test_reinforce_speed[True-None] 0.8529ms 0.7948ms 1.2581 KOps/s 1.2466 KOps/s $\color{#35bf28}+0.92\%$
test_reinforce_speed[True-backward] 2.6309ms 2.4956ms 400.7053 Ops/s 426.8596 Ops/s $\textbf{\color{#d91a1a}-6.13\%}$
test_reinforce_speed[reduce-overhead-None] 22.6334ms 11.6877ms 85.5597 Ops/s 86.7769 Ops/s $\color{#d91a1a}-1.40\%$
test_reinforce_speed[reduce-overhead-backward] 1.2197ms 1.1815ms 846.4173 Ops/s 954.1269 Ops/s $\textbf{\color{#d91a1a}-11.29\%}$
test_iql_speed[False-None] 9.4070ms 9.0017ms 111.0900 Ops/s 111.3862 Ops/s $\color{#d91a1a}-0.27\%$
test_iql_speed[False-backward] 13.3491ms 12.8646ms 77.7325 Ops/s 78.6947 Ops/s $\color{#d91a1a}-1.22\%$
test_iql_speed[True-None] 1.8617ms 1.7908ms 558.4033 Ops/s 600.8307 Ops/s $\textbf{\color{#d91a1a}-7.06\%}$
test_iql_speed[True-backward] 4.3521ms 4.2520ms 235.1817 Ops/s 234.8173 Ops/s $\color{#35bf28}+0.16\%$
test_iql_speed[reduce-overhead-None] 20.1042ms 11.4795ms 87.1116 Ops/s 87.5755 Ops/s $\color{#d91a1a}-0.53\%$
test_iql_speed[reduce-overhead-backward] 1.6514ms 1.5724ms 635.9835 Ops/s 709.1768 Ops/s $\textbf{\color{#d91a1a}-10.32\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.8276ms 6.2380ms 160.3076 Ops/s 158.7312 Ops/s $\color{#35bf28}+0.99\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.6167ms 0.3457ms 2.8928 KOps/s 2.8756 KOps/s $\color{#35bf28}+0.60\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5755ms 0.2959ms 3.3797 KOps/s 3.8299 KOps/s $\textbf{\color{#d91a1a}-11.75\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.2710ms 6.0154ms 166.2390 Ops/s 164.9599 Ops/s $\color{#35bf28}+0.78\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.7927ms 0.2819ms 3.5469 KOps/s 3.5020 KOps/s $\color{#35bf28}+1.28\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4835ms 0.2464ms 4.0580 KOps/s 3.6871 KOps/s $\textbf{\color{#35bf28}+10.06\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.6400ms 1.2537ms 797.6561 Ops/s 787.7563 Ops/s $\color{#35bf28}+1.26\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6283ms 1.1997ms 833.5646 Ops/s 817.3333 Ops/s $\color{#35bf28}+1.99\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3708ms 6.2029ms 161.2138 Ops/s 161.6288 Ops/s $\color{#d91a1a}-0.26\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.6779ms 0.4030ms 2.4812 KOps/s 1.9020 KOps/s $\textbf{\color{#35bf28}+30.45\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7375ms 0.4249ms 2.3535 KOps/s 2.5587 KOps/s $\textbf{\color{#d91a1a}-8.02\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.2297ms 6.0286ms 165.8767 Ops/s 165.4429 Ops/s $\color{#35bf28}+0.26\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7873ms 0.3030ms 3.3000 KOps/s 2.9897 KOps/s $\textbf{\color{#35bf28}+10.38\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5076ms 0.3119ms 3.2066 KOps/s 3.9859 KOps/s $\textbf{\color{#d91a1a}-19.55\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.2735ms 5.9833ms 167.1313 Ops/s 167.0315 Ops/s $\color{#35bf28}+0.06\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.6127ms 0.2600ms 3.8465 KOps/s 3.2326 KOps/s $\textbf{\color{#35bf28}+18.99\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4701ms 0.2656ms 3.7654 KOps/s 3.6120 KOps/s $\color{#35bf28}+4.25\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3423ms 6.1855ms 161.6679 Ops/s 161.5430 Ops/s $\color{#35bf28}+0.08\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.2940ms 0.4146ms 2.4121 KOps/s 2.0848 KOps/s $\textbf{\color{#35bf28}+15.70\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8294ms 0.3905ms 2.5611 KOps/s 2.1631 KOps/s $\textbf{\color{#35bf28}+18.40\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.6176ms 5.2774ms 189.4867 Ops/s 191.1783 Ops/s $\color{#d91a1a}-0.88\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 3.9514ms 1.7515ms 570.9384 Ops/s 440.8026 Ops/s $\textbf{\color{#35bf28}+29.52\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 8.7520ms 1.2273ms 814.8215 Ops/s 793.4725 Ops/s $\color{#35bf28}+2.69\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4995s 15.2057ms 65.7648 Ops/s 192.8039 Ops/s $\textbf{\color{#d91a1a}-65.89\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.9067ms 1.9656ms 508.7412 Ops/s 442.7297 Ops/s $\textbf{\color{#35bf28}+14.91\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.9794ms 1.1824ms 845.7194 Ops/s 863.4599 Ops/s $\color{#d91a1a}-2.05\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.1577ms 5.5040ms 181.6856 Ops/s 32.9493 Ops/s $\textbf{\color{#35bf28}+451.41\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 9.7434ms 2.1784ms 459.0424 Ops/s 481.1496 Ops/s $\color{#d91a1a}-4.59\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.5089ms 1.3854ms 721.8158 Ops/s 734.2446 Ops/s $\color{#d91a1a}-1.69\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.3060ms 13.1768ms 75.8908 Ops/s 75.9376 Ops/s $\color{#d91a1a}-0.06\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 22.9931ms 17.5176ms 57.0855 Ops/s 60.1903 Ops/s $\textbf{\color{#d91a1a}-5.16\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.6517ms 17.8563ms 56.0027 Ops/s 55.1137 Ops/s $\color{#35bf28}+1.61\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.5844ms 17.0164ms 58.7667 Ops/s 58.9322 Ops/s $\color{#d91a1a}-0.28\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 18.5952ms 17.7321ms 56.3949 Ops/s 56.1054 Ops/s $\color{#35bf28}+0.52\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 20.2760ms 18.1804ms 55.0044 Ops/s 54.7128 Ops/s $\color{#35bf28}+0.53\%$

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: a614bae500801f1c834e23979ca067d0dba9b966
Pull Request resolved: #2632
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Dec 4, 2024
ghstack-source-id: eceab242294ec55135d79f29e848345a5d5d455e
Pull Request resolved: #2632
@vmoens vmoens merged commit b5657ea into gh/vmoens/49/base Dec 4, 2024
64 of 74 checks passed
@vmoens vmoens deleted the gh/vmoens/49/head branch December 4, 2024 13:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI Has to do with CI setup (e.g. wheels & builds, tests...) ciflow/docs tests full docs CI CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants