Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] inline hold_out_net #2499

Merged
merged 1 commit into from
Oct 17, 2024
Merged

[Feature] inline hold_out_net #2499

merged 1 commit into from
Oct 17, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Oct 17, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Oct 17, 2024
ghstack-source-id: c315202c8af55f0852195fe488ae855966386c4c
Pull Request resolved: #2499
Copy link

pytorch-bot bot commented Oct 17, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2499

Note: Links to docs will display an error until the docs builds have been completed.

❌ 18 New Failures, 4 Unrelated Failures

As of commit 9a5e15e with merge base d894358 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 17, 2024
@vmoens vmoens merged commit 9a5e15e into gh/vmoens/33/base Oct 17, 2024
14 of 23 checks passed
vmoens added a commit that referenced this pull request Oct 17, 2024
ghstack-source-id: c315202c8af55f0852195fe488ae855966386c4c
Pull Request resolved: #2499
@vmoens vmoens deleted the gh/vmoens/33/head branch October 17, 2024 14:00
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 143. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}8$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4256s 0.4231s 2.3636 Ops/s 2.4258 Ops/s $\color{#d91a1a}-2.56\%$
test_transformed 0.7001s 0.6180s 1.6180 Ops/s 1.7085 Ops/s $\textbf{\color{#d91a1a}-5.30\%}$
test_serial 1.4485s 1.3654s 0.7324 Ops/s 0.7427 Ops/s $\color{#d91a1a}-1.39\%$
test_parallel 1.3199s 1.3147s 0.7606 Ops/s 0.7510 Ops/s $\color{#35bf28}+1.28\%$
test_step_mdp_speed[True-True-True-True-True] 0.1536ms 29.2155μs 34.2284 KOps/s 34.1633 KOps/s $\color{#35bf28}+0.19\%$
test_step_mdp_speed[True-True-True-True-False] 56.7560μs 17.3792μs 57.5399 KOps/s 57.9899 KOps/s $\color{#d91a1a}-0.78\%$
test_step_mdp_speed[True-True-True-False-True] 62.4570μs 16.2093μs 61.6931 KOps/s 62.5826 KOps/s $\color{#d91a1a}-1.42\%$
test_step_mdp_speed[True-True-True-False-False] 83.9730μs 9.4598μs 105.7101 KOps/s 108.7525 KOps/s $\color{#d91a1a}-2.80\%$
test_step_mdp_speed[True-True-False-True-True] 66.9460μs 31.2403μs 32.0099 KOps/s 31.9770 KOps/s $\color{#35bf28}+0.10\%$
test_step_mdp_speed[True-True-False-True-False] 75.1210μs 19.4478μs 51.4197 KOps/s 51.1443 KOps/s $\color{#35bf28}+0.54\%$
test_step_mdp_speed[True-True-False-False-True] 78.3460μs 18.3016μs 54.6402 KOps/s 55.0248 KOps/s $\color{#d91a1a}-0.70\%$
test_step_mdp_speed[True-True-False-False-False] 39.1130μs 11.5804μs 86.3525 KOps/s 86.0521 KOps/s $\color{#35bf28}+0.35\%$
test_step_mdp_speed[True-False-True-True-True] 93.3040μs 33.8043μs 29.5821 KOps/s 29.9062 KOps/s $\color{#d91a1a}-1.08\%$
test_step_mdp_speed[True-False-True-True-False] 51.0150μs 21.4612μs 46.5956 KOps/s 46.6376 KOps/s $\color{#d91a1a}-0.09\%$
test_step_mdp_speed[True-False-True-False-True] 79.7990μs 18.6469μs 53.6282 KOps/s 55.8736 KOps/s $\color{#d91a1a}-4.02\%$
test_step_mdp_speed[True-False-True-False-False] 36.6380μs 11.5318μs 86.7164 KOps/s 86.3400 KOps/s $\color{#35bf28}+0.44\%$
test_step_mdp_speed[True-False-False-True-True] 90.8500μs 35.7123μs 28.0015 KOps/s 28.5968 KOps/s $\color{#d91a1a}-2.08\%$
test_step_mdp_speed[True-False-False-True-False] 69.9910μs 23.7121μs 42.1726 KOps/s 42.6093 KOps/s $\color{#d91a1a}-1.02\%$
test_step_mdp_speed[True-False-False-False-True] 51.8470μs 20.6168μs 48.5040 KOps/s 49.6827 KOps/s $\color{#d91a1a}-2.37\%$
test_step_mdp_speed[True-False-False-False-False] 46.2060μs 13.5774μs 73.6517 KOps/s 73.2021 KOps/s $\color{#35bf28}+0.61\%$
test_step_mdp_speed[False-True-True-True-True] 90.9900μs 33.4300μs 29.9132 KOps/s 30.3649 KOps/s $\color{#d91a1a}-1.49\%$
test_step_mdp_speed[False-True-True-True-False] 64.6510μs 21.6401μs 46.2106 KOps/s 45.8279 KOps/s $\color{#35bf28}+0.84\%$
test_step_mdp_speed[False-True-True-False-True] 74.0580μs 22.0312μs 45.3902 KOps/s 45.8140 KOps/s $\color{#d91a1a}-0.93\%$
test_step_mdp_speed[False-True-True-False-False] 58.4590μs 13.5282μs 73.9197 KOps/s 73.9367 KOps/s $\color{#d91a1a}-0.02\%$
test_step_mdp_speed[False-True-False-True-True] 79.5190μs 35.4062μs 28.2437 KOps/s 28.3079 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[False-True-False-True-False] 81.9030μs 23.1936μs 43.1154 KOps/s 42.0639 KOps/s $\color{#35bf28}+2.50\%$
test_step_mdp_speed[False-True-False-False-True] 2.8290ms 24.1746μs 41.3657 KOps/s 41.4622 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[False-True-False-False-False] 42.5390μs 15.3471μs 65.1588 KOps/s 65.2317 KOps/s $\color{#d91a1a}-0.11\%$
test_step_mdp_speed[False-False-True-True-True] 0.1394ms 37.4799μs 26.6809 KOps/s 26.6818 KOps/s $-0.00\%$
test_step_mdp_speed[False-False-True-True-False] 66.2340μs 25.6441μs 38.9953 KOps/s 38.5014 KOps/s $\color{#35bf28}+1.28\%$
test_step_mdp_speed[False-False-True-False-True] 62.3670μs 24.0411μs 41.5954 KOps/s 42.2977 KOps/s $\color{#d91a1a}-1.66\%$
test_step_mdp_speed[False-False-True-False-False] 83.7640μs 15.3195μs 65.2765 KOps/s 64.9296 KOps/s $\color{#35bf28}+0.53\%$
test_step_mdp_speed[False-False-False-True-True] 0.1333ms 39.7835μs 25.1360 KOps/s 25.3575 KOps/s $\color{#d91a1a}-0.87\%$
test_step_mdp_speed[False-False-False-True-False] 88.6350μs 27.8694μs 35.8816 KOps/s 36.0397 KOps/s $\color{#d91a1a}-0.44\%$
test_step_mdp_speed[False-False-False-False-True] 93.3280μs 25.1947μs 39.6908 KOps/s 39.6594 KOps/s $\color{#35bf28}+0.08\%$
test_step_mdp_speed[False-False-False-False-False] 73.1130μs 17.3049μs 57.7869 KOps/s 57.9369 KOps/s $\color{#d91a1a}-0.26\%$
test_values[generalized_advantage_estimate-True-True] 9.8623ms 9.5474ms 104.7409 Ops/s 104.9552 Ops/s $\color{#d91a1a}-0.20\%$
test_values[vec_generalized_advantage_estimate-True-True] 41.4671ms 36.4091ms 27.4657 Ops/s 29.5629 Ops/s $\textbf{\color{#d91a1a}-7.09\%}$
test_values[td0_return_estimate-False-False] 0.2443ms 0.1786ms 5.5984 KOps/s 5.5376 KOps/s $\color{#35bf28}+1.10\%$
test_values[td1_return_estimate-False-False] 28.8332ms 24.6060ms 40.6405 Ops/s 41.2702 Ops/s $\color{#d91a1a}-1.53\%$
test_values[vec_td1_return_estimate-False-False] 41.2836ms 37.1097ms 26.9471 Ops/s 29.5697 Ops/s $\textbf{\color{#d91a1a}-8.87\%}$
test_values[td_lambda_return_estimate-True-False] 39.3043ms 35.2807ms 28.3441 Ops/s 28.9802 Ops/s $\color{#d91a1a}-2.19\%$
test_values[vec_td_lambda_return_estimate-True-False] 40.1417ms 37.1487ms 26.9188 Ops/s 29.5712 Ops/s $\textbf{\color{#d91a1a}-8.97\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.9170ms 8.2852ms 120.6965 Ops/s 119.2856 Ops/s $\color{#35bf28}+1.18\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.6810ms 2.0205ms 494.9220 Ops/s 499.1166 Ops/s $\color{#d91a1a}-0.84\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.6445ms 0.3617ms 2.7650 KOps/s 2.7216 KOps/s $\color{#35bf28}+1.60\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 51.1083ms 49.0685ms 20.3797 Ops/s 23.4334 Ops/s $\textbf{\color{#d91a1a}-13.03\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.1727ms 3.0414ms 328.7937 Ops/s 323.6506 Ops/s $\color{#35bf28}+1.59\%$
test_dqn_speed[False-None] 1.9269ms 1.3701ms 729.8509 Ops/s 721.7085 Ops/s $\color{#35bf28}+1.13\%$
test_dqn_speed[False-backward] 1.9428ms 1.8603ms 537.5411 Ops/s 538.2370 Ops/s $\color{#d91a1a}-0.13\%$
test_dqn_speed[True-None] 0.7095ms 0.4762ms 2.1001 KOps/s 2.1017 KOps/s $\color{#d91a1a}-0.08\%$
test_dqn_speed[True-backward] 0.9701ms 0.8975ms 1.1141 KOps/s 1.0601 KOps/s $\textbf{\color{#35bf28}+5.10\%}$
test_dqn_speed[reduce-overhead-None] 0.8018ms 0.4747ms 2.1066 KOps/s 2.0775 KOps/s $\color{#35bf28}+1.40\%$
test_dqn_speed[reduce-overhead-backward] 0.9792ms 0.8985ms 1.1130 KOps/s 1.1051 KOps/s $\color{#35bf28}+0.71\%$
test_ddpg_speed[False-None] 3.9640ms 2.8219ms 354.3695 Ops/s 347.4011 Ops/s $\color{#35bf28}+2.01\%$
test_ddpg_speed[False-backward] 4.1608ms 3.9770ms 251.4444 Ops/s 241.0807 Ops/s $\color{#35bf28}+4.30\%$
test_ddpg_speed[True-None] 1.3770ms 1.0118ms 988.3810 Ops/s 988.6960 Ops/s $\color{#d91a1a}-0.03\%$
test_ddpg_speed[True-backward] 1.9921ms 1.9123ms 522.9436 Ops/s 518.9902 Ops/s $\color{#35bf28}+0.76\%$
test_ddpg_speed[reduce-overhead-None] 1.2998ms 1.0097ms 990.4318 Ops/s 978.3860 Ops/s $\color{#35bf28}+1.23\%$
test_ddpg_speed[reduce-overhead-backward] 2.0670ms 1.9222ms 520.2447 Ops/s 522.6272 Ops/s $\color{#d91a1a}-0.46\%$
test_sac_speed[False-None] 10.3583ms 8.1012ms 123.4385 Ops/s 123.4445 Ops/s $-0.00\%$
test_sac_speed[False-backward] 11.2399ms 10.8353ms 92.2910 Ops/s 92.2863 Ops/s $+0.01\%$
test_sac_speed[True-None] 2.7117ms 1.8767ms 532.8617 Ops/s 537.9093 Ops/s $\color{#d91a1a}-0.94\%$
test_sac_speed[True-backward] 3.6352ms 3.5605ms 280.8555 Ops/s 277.3083 Ops/s $\color{#35bf28}+1.28\%$
test_sac_speed[reduce-overhead-None] 2.5587ms 1.8658ms 535.9714 Ops/s 530.5757 Ops/s $\color{#35bf28}+1.02\%$
test_sac_speed[reduce-overhead-backward] 4.4609ms 3.6168ms 276.4905 Ops/s 282.3755 Ops/s $\color{#d91a1a}-2.08\%$
test_redq_speed[False-None] 19.1165ms 13.2209ms 75.6379 Ops/s 77.2343 Ops/s $\color{#d91a1a}-2.07\%$
test_redq_speed[False-backward] 24.1991ms 22.5181ms 44.4088 Ops/s 44.9768 Ops/s $\color{#d91a1a}-1.26\%$
test_redq_speed[True-None] 5.7263ms 5.0348ms 198.6193 Ops/s 200.8178 Ops/s $\color{#d91a1a}-1.09\%$
test_redq_speed[True-backward] 13.1050ms 12.5138ms 79.9115 Ops/s 79.2521 Ops/s $\color{#35bf28}+0.83\%$
test_redq_speed[reduce-overhead-None] 5.7344ms 4.9690ms 201.2459 Ops/s 204.4112 Ops/s $\color{#d91a1a}-1.55\%$
test_redq_speed[reduce-overhead-backward] 13.2652ms 12.8065ms 78.0855 Ops/s 79.5951 Ops/s $\color{#d91a1a}-1.90\%$
test_redq_deprec_speed[False-None] 14.8191ms 12.8748ms 77.6714 Ops/s 75.8004 Ops/s $\color{#35bf28}+2.47\%$
test_redq_deprec_speed[False-backward] 20.6960ms 18.7129ms 53.4392 Ops/s 52.5227 Ops/s $\color{#35bf28}+1.75\%$
test_redq_deprec_speed[True-None] 4.2520ms 3.6412ms 274.6310 Ops/s 268.8891 Ops/s $\color{#35bf28}+2.14\%$
test_redq_deprec_speed[True-backward] 9.4458ms 8.4262ms 118.6778 Ops/s 119.9146 Ops/s $\color{#d91a1a}-1.03\%$
test_redq_deprec_speed[reduce-overhead-None] 4.3854ms 3.6581ms 273.3641 Ops/s 213.8987 Ops/s $\textbf{\color{#35bf28}+27.80\%}$
test_redq_deprec_speed[reduce-overhead-backward] 9.8295ms 8.6822ms 115.1782 Ops/s 121.3879 Ops/s $\textbf{\color{#d91a1a}-5.12\%}$
test_td3_speed[False-None] 8.3422ms 8.0288ms 124.5517 Ops/s 125.5475 Ops/s $\color{#d91a1a}-0.79\%$
test_td3_speed[False-backward] 11.0858ms 10.5009ms 95.2296 Ops/s 95.8431 Ops/s $\color{#d91a1a}-0.64\%$
test_td3_speed[True-None] 2.0566ms 1.7832ms 560.7749 Ops/s 567.1496 Ops/s $\color{#d91a1a}-1.12\%$
test_td3_speed[True-backward] 3.5070ms 3.4158ms 292.7551 Ops/s 297.1876 Ops/s $\color{#d91a1a}-1.49\%$
test_td3_speed[reduce-overhead-None] 2.0049ms 1.7913ms 558.2416 Ops/s 565.5545 Ops/s $\color{#d91a1a}-1.29\%$
test_td3_speed[reduce-overhead-backward] 3.5212ms 3.4148ms 292.8390 Ops/s 296.2771 Ops/s $\color{#d91a1a}-1.16\%$
test_cql_speed[False-None] 40.4263ms 36.8908ms 27.1070 Ops/s 27.8105 Ops/s $\color{#d91a1a}-2.53\%$
test_cql_speed[False-backward] 0.3342s 52.9889ms 18.8719 Ops/s 21.5794 Ops/s $\textbf{\color{#d91a1a}-12.55\%}$
test_cql_speed[True-None] 17.5664ms 16.0247ms 62.4037 Ops/s 62.7234 Ops/s $\color{#d91a1a}-0.51\%$
test_cql_speed[True-backward] 29.3566ms 23.6241ms 42.3297 Ops/s 43.9643 Ops/s $\color{#d91a1a}-3.72\%$
test_cql_speed[reduce-overhead-None] 17.0541ms 16.1057ms 62.0897 Ops/s 62.5720 Ops/s $\color{#d91a1a}-0.77\%$
test_cql_speed[reduce-overhead-backward] 25.0047ms 23.1611ms 43.1758 Ops/s 43.0396 Ops/s $\color{#35bf28}+0.32\%$
test_a2c_speed[False-None] 8.9004ms 7.2463ms 138.0020 Ops/s 137.4239 Ops/s $\color{#35bf28}+0.42\%$
test_a2c_speed[False-backward] 16.3247ms 14.6693ms 68.1695 Ops/s 67.8319 Ops/s $\color{#35bf28}+0.50\%$
test_a2c_speed[True-None] 3.7051ms 3.3621ms 297.4317 Ops/s 295.9445 Ops/s $\color{#35bf28}+0.50\%$
test_a2c_speed[True-backward] 10.7923ms 10.1579ms 98.4453 Ops/s 98.2777 Ops/s $\color{#35bf28}+0.17\%$
test_a2c_speed[reduce-overhead-None] 3.8581ms 3.3902ms 294.9705 Ops/s 295.4498 Ops/s $\color{#d91a1a}-0.16\%$
test_a2c_speed[reduce-overhead-backward] 10.7448ms 10.1061ms 98.9498 Ops/s 98.3928 Ops/s $\color{#35bf28}+0.57\%$
test_ppo_speed[False-None] 9.0412ms 7.6139ms 131.3387 Ops/s 131.9097 Ops/s $\color{#d91a1a}-0.43\%$
test_ppo_speed[False-backward] 16.2335ms 15.1764ms 65.8917 Ops/s 66.6391 Ops/s $\color{#d91a1a}-1.12\%$
test_ppo_speed[True-None] 4.5079ms 3.7613ms 265.8643 Ops/s 264.1288 Ops/s $\color{#35bf28}+0.66\%$
test_ppo_speed[True-backward] 11.0651ms 9.9345ms 100.6589 Ops/s 101.0294 Ops/s $\color{#d91a1a}-0.37\%$
test_ppo_speed[reduce-overhead-None] 4.0590ms 3.7580ms 266.0958 Ops/s 266.0968 Ops/s $-0.00\%$
test_ppo_speed[reduce-overhead-backward] 10.7816ms 9.9474ms 100.5284 Ops/s 101.0500 Ops/s $\color{#d91a1a}-0.52\%$
test_reinforce_speed[False-None] 7.4410ms 6.6287ms 150.8584 Ops/s 152.4688 Ops/s $\color{#d91a1a}-1.06\%$
test_reinforce_speed[False-backward] 12.0349ms 9.9578ms 100.4242 Ops/s 100.8736 Ops/s $\color{#d91a1a}-0.45\%$
test_reinforce_speed[True-None] 3.2106ms 2.7256ms 366.8895 Ops/s 367.0282 Ops/s $\color{#d91a1a}-0.04\%$
test_reinforce_speed[True-backward] 9.3586ms 8.9382ms 111.8788 Ops/s 113.1418 Ops/s $\color{#d91a1a}-1.12\%$
test_reinforce_speed[reduce-overhead-None] 3.1375ms 2.7320ms 366.0372 Ops/s 366.0036 Ops/s $+0.01\%$
test_reinforce_speed[reduce-overhead-backward] 9.1522ms 8.8545ms 112.9373 Ops/s 111.4280 Ops/s $\color{#35bf28}+1.35\%$
test_iql_speed[False-None] 34.1252ms 32.6389ms 30.6383 Ops/s 30.5145 Ops/s $\color{#35bf28}+0.41\%$
test_iql_speed[False-backward] 47.7387ms 45.9718ms 21.7525 Ops/s 21.7533 Ops/s $-0.00\%$
test_iql_speed[True-None] 12.8078ms 11.0568ms 90.4425 Ops/s 90.6068 Ops/s $\color{#d91a1a}-0.18\%$
test_iql_speed[True-backward] 23.3051ms 22.4286ms 44.5860 Ops/s 44.2814 Ops/s $\color{#35bf28}+0.69\%$
test_iql_speed[reduce-overhead-None] 12.4956ms 11.1099ms 90.0096 Ops/s 90.5303 Ops/s $\color{#d91a1a}-0.58\%$
test_iql_speed[reduce-overhead-backward] 25.0090ms 22.6118ms 44.2246 Ops/s 44.0049 Ops/s $\color{#35bf28}+0.50\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.8380ms 5.0169ms 199.3249 Ops/s 201.1785 Ops/s $\color{#d91a1a}-0.92\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 3.6945ms 0.4891ms 2.0447 KOps/s 2.0294 KOps/s $\color{#35bf28}+0.75\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7724ms 0.4669ms 2.1416 KOps/s 2.1258 KOps/s $\color{#35bf28}+0.74\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.4871ms 5.0066ms 199.7353 Ops/s 202.3988 Ops/s $\color{#d91a1a}-1.32\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.8117ms 0.4812ms 2.0783 KOps/s 2.0630 KOps/s $\color{#35bf28}+0.74\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7165ms 0.4558ms 2.1942 KOps/s 2.1586 KOps/s $\color{#35bf28}+1.64\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.2058ms 1.6054ms 622.8813 Ops/s 616.3858 Ops/s $\color{#35bf28}+1.05\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.1458ms 1.5509ms 644.7982 Ops/s 638.2747 Ops/s $\color{#35bf28}+1.02\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.8840ms 5.1118ms 195.6266 Ops/s 196.8208 Ops/s $\color{#d91a1a}-0.61\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.9399ms 0.6244ms 1.6016 KOps/s 1.5806 KOps/s $\color{#35bf28}+1.33\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8439ms 0.5956ms 1.6790 KOps/s 1.6415 KOps/s $\color{#35bf28}+2.28\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.3079ms 4.9615ms 201.5536 Ops/s 200.1306 Ops/s $\color{#35bf28}+0.71\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.9378ms 0.4918ms 2.0333 KOps/s 2.0496 KOps/s $\color{#d91a1a}-0.80\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7002ms 0.4592ms 2.1775 KOps/s 2.1257 KOps/s $\color{#35bf28}+2.44\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.0498ms 4.7885ms 208.8319 Ops/s 201.8041 Ops/s $\color{#35bf28}+3.48\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.6514ms 0.4786ms 2.0896 KOps/s 2.0617 KOps/s $\color{#35bf28}+1.35\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 7.4038ms 0.4627ms 2.1611 KOps/s 2.1632 KOps/s $\color{#d91a1a}-0.10\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.7444ms 5.0104ms 199.5865 Ops/s 198.4696 Ops/s $\color{#35bf28}+0.56\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.8826ms 0.6233ms 1.6043 KOps/s 1.6063 KOps/s $\color{#d91a1a}-0.12\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 7.9147ms 0.6074ms 1.6464 KOps/s 1.6769 KOps/s $\color{#d91a1a}-1.82\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.7774ms 4.3277ms 231.0680 Ops/s 238.9579 Ops/s $\color{#d91a1a}-3.30\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 6.4694ms 2.2942ms 435.8871 Ops/s 448.6138 Ops/s $\color{#d91a1a}-2.84\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 9.1180ms 1.5627ms 639.9259 Ops/s 713.0855 Ops/s $\textbf{\color{#d91a1a}-10.26\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4122s 12.5650ms 79.5863 Ops/s 33.9622 Ops/s $\textbf{\color{#35bf28}+134.34\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.4884ms 2.3501ms 425.5156 Ops/s 407.3255 Ops/s $\color{#35bf28}+4.47\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 5.3169ms 1.3436ms 744.2474 Ops/s 729.7795 Ops/s $\color{#35bf28}+1.98\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 8.6557ms 4.6057ms 217.1238 Ops/s 219.4546 Ops/s $\color{#d91a1a}-1.06\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 8.2101ms 2.4482ms 408.4571 Ops/s 405.6187 Ops/s $\color{#35bf28}+0.70\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.7735ms 1.4026ms 712.9593 Ops/s 662.4112 Ops/s $\textbf{\color{#35bf28}+7.63\%}$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 143. Improved: $\large\color{#35bf28}10$. Worsened: $\large\color{#d91a1a}8$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7362s 0.7336s 1.3632 Ops/s 1.3488 Ops/s $\color{#35bf28}+1.07\%$
test_transformed 1.0736s 1.0033s 0.9967 Ops/s 1.0165 Ops/s $\color{#d91a1a}-1.95\%$
test_serial 2.2775s 2.1973s 0.4551 Ops/s 0.4580 Ops/s $\color{#d91a1a}-0.62\%$
test_parallel 2.0738s 2.0058s 0.4986 Ops/s 0.4966 Ops/s $\color{#35bf28}+0.39\%$
test_step_mdp_speed[True-True-True-True-True] 0.1850ms 39.0363μs 25.6172 KOps/s 26.0838 KOps/s $\color{#d91a1a}-1.79\%$
test_step_mdp_speed[True-True-True-True-False] 0.1560ms 22.7065μs 44.0403 KOps/s 43.3539 KOps/s $\color{#35bf28}+1.58\%$
test_step_mdp_speed[True-True-True-False-True] 65.1310μs 21.1338μs 47.3177 KOps/s 47.6099 KOps/s $\color{#d91a1a}-0.61\%$
test_step_mdp_speed[True-True-True-False-False] 37.6110μs 12.3530μs 80.9520 KOps/s 80.4458 KOps/s $\color{#35bf28}+0.63\%$
test_step_mdp_speed[True-True-False-True-True] 75.1310μs 41.8690μs 23.8840 KOps/s 23.6364 KOps/s $\color{#35bf28}+1.05\%$
test_step_mdp_speed[True-True-False-True-False] 53.4710μs 25.4995μs 39.2165 KOps/s 39.3604 KOps/s $\color{#d91a1a}-0.37\%$
test_step_mdp_speed[True-True-False-False-True] 52.9610μs 24.4271μs 40.9381 KOps/s 41.5193 KOps/s $\color{#d91a1a}-1.40\%$
test_step_mdp_speed[True-True-False-False-False] 44.8510μs 15.1365μs 66.0653 KOps/s 66.6395 KOps/s $\color{#d91a1a}-0.86\%$
test_step_mdp_speed[True-False-True-True-True] 83.2520μs 45.0165μs 22.2141 KOps/s 22.6559 KOps/s $\color{#d91a1a}-1.95\%$
test_step_mdp_speed[True-False-True-True-False] 56.1710μs 28.0334μs 35.6717 KOps/s 35.7888 KOps/s $\color{#d91a1a}-0.33\%$
test_step_mdp_speed[True-False-True-False-True] 56.6710μs 23.9033μs 41.8352 KOps/s 42.1057 KOps/s $\color{#d91a1a}-0.64\%$
test_step_mdp_speed[True-False-True-False-False] 0.1955ms 14.3674μs 69.6019 KOps/s 66.5008 KOps/s $\color{#35bf28}+4.66\%$
test_step_mdp_speed[True-False-False-True-True] 95.8920μs 46.6154μs 21.4521 KOps/s 21.0113 KOps/s $\color{#35bf28}+2.10\%$
test_step_mdp_speed[True-False-False-True-False] 64.2010μs 30.4855μs 32.8025 KOps/s 32.8248 KOps/s $\color{#d91a1a}-0.07\%$
test_step_mdp_speed[True-False-False-False-True] 52.5710μs 26.1761μs 38.2028 KOps/s 37.7895 KOps/s $\color{#35bf28}+1.09\%$
test_step_mdp_speed[True-False-False-False-False] 43.1010μs 17.6140μs 56.7731 KOps/s 57.0031 KOps/s $\color{#d91a1a}-0.40\%$
test_step_mdp_speed[False-True-True-True-True] 82.4220μs 44.4879μs 22.4780 KOps/s 22.2086 KOps/s $\color{#35bf28}+1.21\%$
test_step_mdp_speed[False-True-True-True-False] 65.2810μs 27.9166μs 35.8210 KOps/s 35.6109 KOps/s $\color{#35bf28}+0.59\%$
test_step_mdp_speed[False-True-True-False-True] 77.8310μs 28.7618μs 34.7683 KOps/s 35.8017 KOps/s $\color{#d91a1a}-2.89\%$
test_step_mdp_speed[False-True-True-False-False] 42.3710μs 17.2818μs 57.8644 KOps/s 58.1507 KOps/s $\color{#d91a1a}-0.49\%$
test_step_mdp_speed[False-True-False-True-True] 86.3620μs 47.4753μs 21.0636 KOps/s 21.1132 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[False-True-False-True-False] 69.3820μs 30.5449μs 32.7387 KOps/s 32.7848 KOps/s $\color{#d91a1a}-0.14\%$
test_step_mdp_speed[False-True-False-False-True] 3.4528ms 32.0513μs 31.2000 KOps/s 31.4567 KOps/s $\color{#d91a1a}-0.82\%$
test_step_mdp_speed[False-True-False-False-False] 50.8410μs 20.1910μs 49.5271 KOps/s 49.1186 KOps/s $\color{#35bf28}+0.83\%$
test_step_mdp_speed[False-False-True-True-True] 78.8710μs 50.3614μs 19.8565 KOps/s 19.9460 KOps/s $\color{#d91a1a}-0.45\%$
test_step_mdp_speed[False-False-True-True-False] 81.7320μs 33.8343μs 29.5558 KOps/s 29.7178 KOps/s $\color{#d91a1a}-0.55\%$
test_step_mdp_speed[False-False-True-False-True] 71.2920μs 31.6855μs 31.5601 KOps/s 32.9083 KOps/s $\color{#d91a1a}-4.10\%$
test_step_mdp_speed[False-False-True-False-False] 53.2210μs 20.2158μs 49.4662 KOps/s 49.9407 KOps/s $\color{#d91a1a}-0.95\%$
test_step_mdp_speed[False-False-False-True-True] 0.1402ms 52.4860μs 19.0527 KOps/s 19.2303 KOps/s $\color{#d91a1a}-0.92\%$
test_step_mdp_speed[False-False-False-True-False] 67.5120μs 36.0726μs 27.7219 KOps/s 28.0610 KOps/s $\color{#d91a1a}-1.21\%$
test_step_mdp_speed[False-False-False-False-True] 65.7320μs 33.6116μs 29.7516 KOps/s 30.4033 KOps/s $\color{#d91a1a}-2.14\%$
test_step_mdp_speed[False-False-False-False-False] 57.5620μs 22.4630μs 44.5177 KOps/s 45.1418 KOps/s $\color{#d91a1a}-1.38\%$
test_values[generalized_advantage_estimate-True-True] 25.4215ms 24.8998ms 40.1609 Ops/s 40.0732 Ops/s $\color{#35bf28}+0.22\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1088s 3.0682ms 325.9212 Ops/s 326.7359 Ops/s $\color{#d91a1a}-0.25\%$
test_values[td0_return_estimate-False-False] 87.5910μs 66.4047μs 15.0592 KOps/s 15.0776 KOps/s $\color{#d91a1a}-0.12\%$
test_values[td1_return_estimate-False-False] 56.2491ms 55.7246ms 17.9454 Ops/s 17.8416 Ops/s $\color{#35bf28}+0.58\%$
test_values[vec_td1_return_estimate-False-False] 1.2706ms 1.0756ms 929.6953 Ops/s 925.3882 Ops/s $\color{#35bf28}+0.47\%$
test_values[td_lambda_return_estimate-True-False] 90.2582ms 88.3865ms 11.3139 Ops/s 11.3332 Ops/s $\color{#d91a1a}-0.17\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3005ms 1.0778ms 927.8268 Ops/s 931.5194 Ops/s $\color{#d91a1a}-0.40\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 26.4439ms 24.8789ms 40.1947 Ops/s 41.3956 Ops/s $\color{#d91a1a}-2.90\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0311ms 0.7454ms 1.3416 KOps/s 1.3476 KOps/s $\color{#d91a1a}-0.44\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7615ms 0.6614ms 1.5118 KOps/s 1.5092 KOps/s $\color{#35bf28}+0.18\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5798ms 1.4711ms 679.7579 Ops/s 680.8800 Ops/s $\color{#d91a1a}-0.16\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7114ms 0.6766ms 1.4779 KOps/s 1.4789 KOps/s $\color{#d91a1a}-0.06\%$
test_dqn_speed[False-None] 6.7599ms 1.3419ms 745.2382 Ops/s 669.5173 Ops/s $\textbf{\color{#35bf28}+11.31\%}$
test_dqn_speed[False-backward] 1.9373ms 1.8775ms 532.6102 Ops/s 530.6029 Ops/s $\color{#35bf28}+0.38\%$
test_dqn_speed[True-None] 1.2204ms 0.5700ms 1.7544 KOps/s 1.6728 KOps/s $\color{#35bf28}+4.87\%$
test_dqn_speed[True-backward] 1.0531ms 1.0150ms 985.2196 Ops/s 857.7337 Ops/s $\textbf{\color{#35bf28}+14.86\%}$
test_dqn_speed[reduce-overhead-None] 0.6427ms 0.5652ms 1.7692 KOps/s 1.6968 KOps/s $\color{#35bf28}+4.27\%$
test_dqn_speed[reduce-overhead-backward] 1.0640ms 1.0252ms 975.3907 Ops/s 963.9457 Ops/s $\color{#35bf28}+1.19\%$
test_ddpg_speed[False-None] 3.1522ms 2.7570ms 362.7096 Ops/s 362.7525 Ops/s $\color{#d91a1a}-0.01\%$
test_ddpg_speed[False-backward] 4.2360ms 4.0224ms 248.6084 Ops/s 251.3929 Ops/s $\color{#d91a1a}-1.11\%$
test_ddpg_speed[True-None] 1.4859ms 1.2775ms 782.7936 Ops/s 777.9191 Ops/s $\color{#35bf28}+0.63\%$
test_ddpg_speed[True-backward] 2.3964ms 2.2618ms 442.1294 Ops/s 435.6966 Ops/s $\color{#35bf28}+1.48\%$
test_ddpg_speed[reduce-overhead-None] 1.4460ms 1.2767ms 783.2599 Ops/s 772.1952 Ops/s $\color{#35bf28}+1.43\%$
test_ddpg_speed[reduce-overhead-backward] 2.3700ms 2.2714ms 440.2616 Ops/s 435.3570 Ops/s $\color{#35bf28}+1.13\%$
test_sac_speed[False-None] 8.0127ms 7.7319ms 129.3341 Ops/s 128.8705 Ops/s $\color{#35bf28}+0.36\%$
test_sac_speed[False-backward] 11.4421ms 11.0213ms 90.7335 Ops/s 91.2957 Ops/s $\color{#d91a1a}-0.62\%$
test_sac_speed[True-None] 2.2550ms 2.0997ms 476.2642 Ops/s 474.9581 Ops/s $\color{#35bf28}+0.27\%$
test_sac_speed[True-backward] 4.1548ms 4.0547ms 246.6248 Ops/s 230.4145 Ops/s $\textbf{\color{#35bf28}+7.04\%}$
test_sac_speed[reduce-overhead-None] 2.2512ms 2.0985ms 476.5262 Ops/s 472.9446 Ops/s $\color{#35bf28}+0.76\%$
test_sac_speed[reduce-overhead-backward] 4.3575ms 4.0948ms 244.2135 Ops/s 245.0894 Ops/s $\color{#d91a1a}-0.36\%$
test_redq_speed[False-None] 14.5971ms 10.3611ms 96.5152 Ops/s 94.4562 Ops/s $\color{#35bf28}+2.18\%$
test_redq_speed[False-backward] 18.6765ms 17.8958ms 55.8791 Ops/s 54.7261 Ops/s $\color{#35bf28}+2.11\%$
test_redq_speed[True-None] 4.0335ms 3.7350ms 267.7395 Ops/s 271.3290 Ops/s $\color{#d91a1a}-1.32\%$
test_redq_speed[True-backward] 9.4183ms 8.9321ms 111.9563 Ops/s 113.9583 Ops/s $\color{#d91a1a}-1.76\%$
test_redq_speed[reduce-overhead-None] 4.0279ms 3.6783ms 271.8619 Ops/s 275.1692 Ops/s $\color{#d91a1a}-1.20\%$
test_redq_speed[reduce-overhead-backward] 9.5136ms 8.9285ms 112.0007 Ops/s 113.0666 Ops/s $\color{#d91a1a}-0.94\%$
test_redq_deprec_speed[False-None] 12.5800ms 10.8932ms 91.8005 Ops/s 93.3912 Ops/s $\color{#d91a1a}-1.70\%$
test_redq_deprec_speed[False-backward] 16.2020ms 15.7699ms 63.4118 Ops/s 64.2576 Ops/s $\color{#d91a1a}-1.32\%$
test_redq_deprec_speed[True-None] 4.1547ms 3.3010ms 302.9347 Ops/s 308.7553 Ops/s $\color{#d91a1a}-1.89\%$
test_redq_deprec_speed[True-backward] 7.6544ms 7.2718ms 137.5166 Ops/s 137.9488 Ops/s $\color{#d91a1a}-0.31\%$
test_redq_deprec_speed[reduce-overhead-None] 3.6748ms 3.2687ms 305.9360 Ops/s 301.7485 Ops/s $\color{#35bf28}+1.39\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.5446ms 7.2241ms 138.4258 Ops/s 136.2013 Ops/s $\color{#35bf28}+1.63\%$
test_td3_speed[False-None] 7.8126ms 7.6691ms 130.3941 Ops/s 128.7733 Ops/s $\color{#35bf28}+1.26\%$
test_td3_speed[False-backward] 10.8381ms 10.5154ms 95.0987 Ops/s 93.8744 Ops/s $\color{#35bf28}+1.30\%$
test_td3_speed[True-None] 2.0226ms 1.9615ms 509.8173 Ops/s 508.8090 Ops/s $\color{#35bf28}+0.20\%$
test_td3_speed[True-backward] 3.8789ms 3.7657ms 265.5572 Ops/s 260.1509 Ops/s $\color{#35bf28}+2.08\%$
test_td3_speed[reduce-overhead-None] 2.0041ms 1.9605ms 510.0780 Ops/s 508.3182 Ops/s $\color{#35bf28}+0.35\%$
test_td3_speed[reduce-overhead-backward] 3.9318ms 3.8120ms 262.3326 Ops/s 255.0095 Ops/s $\color{#35bf28}+2.87\%$
test_cql_speed[False-None] 30.0183ms 25.9391ms 38.5519 Ops/s 38.6337 Ops/s $\color{#d91a1a}-0.21\%$
test_cql_speed[False-backward] 38.1225ms 35.0402ms 28.5387 Ops/s 27.8043 Ops/s $\color{#35bf28}+2.64\%$
test_cql_speed[True-None] 11.7777ms 11.3319ms 88.2467 Ops/s 88.7193 Ops/s $\color{#d91a1a}-0.53\%$
test_cql_speed[True-backward] 17.7506ms 17.1360ms 58.3566 Ops/s 57.1405 Ops/s $\color{#35bf28}+2.13\%$
test_cql_speed[reduce-overhead-None] 11.6778ms 11.2941ms 88.5419 Ops/s 83.0010 Ops/s $\textbf{\color{#35bf28}+6.68\%}$
test_cql_speed[reduce-overhead-backward] 17.4860ms 16.9950ms 58.8410 Ops/s 58.1005 Ops/s $\color{#35bf28}+1.27\%$
test_a2c_speed[False-None] 7.5876ms 5.4298ms 184.1676 Ops/s 181.9529 Ops/s $\color{#35bf28}+1.22\%$
test_a2c_speed[False-backward] 12.2711ms 11.9159ms 83.9217 Ops/s 82.8365 Ops/s $\color{#35bf28}+1.31\%$
test_a2c_speed[True-None] 3.5147ms 3.1279ms 319.7077 Ops/s 313.2928 Ops/s $\color{#35bf28}+2.05\%$
test_a2c_speed[True-backward] 9.0501ms 8.7283ms 114.5701 Ops/s 113.2987 Ops/s $\color{#35bf28}+1.12\%$
test_a2c_speed[reduce-overhead-None] 3.5574ms 3.1787ms 314.5951 Ops/s 310.3181 Ops/s $\color{#35bf28}+1.38\%$
test_a2c_speed[reduce-overhead-backward] 9.1518ms 8.6677ms 115.3714 Ops/s 115.7683 Ops/s $\color{#d91a1a}-0.34\%$
test_ppo_speed[False-None] 6.2559ms 5.7489ms 173.9469 Ops/s 175.8565 Ops/s $\color{#d91a1a}-1.09\%$
test_ppo_speed[False-backward] 12.9215ms 12.5386ms 79.7534 Ops/s 80.6681 Ops/s $\color{#d91a1a}-1.13\%$
test_ppo_speed[True-None] 3.7221ms 3.5501ms 281.6802 Ops/s 277.0032 Ops/s $\color{#35bf28}+1.69\%$
test_ppo_speed[True-backward] 8.8016ms 8.4611ms 118.1885 Ops/s 110.9831 Ops/s $\textbf{\color{#35bf28}+6.49\%}$
test_ppo_speed[reduce-overhead-None] 3.8992ms 3.5157ms 284.4420 Ops/s 282.7471 Ops/s $\color{#35bf28}+0.60\%$
test_ppo_speed[reduce-overhead-backward] 8.8356ms 8.4621ms 118.1733 Ops/s 118.6302 Ops/s $\color{#d91a1a}-0.39\%$
test_reinforce_speed[False-None] 6.3309ms 4.5526ms 219.6546 Ops/s 217.7369 Ops/s $\color{#35bf28}+0.88\%$
test_reinforce_speed[False-backward] 7.8274ms 7.4351ms 134.4970 Ops/s 134.0572 Ops/s $\color{#35bf28}+0.33\%$
test_reinforce_speed[True-None] 2.5064ms 2.2931ms 436.0818 Ops/s 440.7474 Ops/s $\color{#d91a1a}-1.06\%$
test_reinforce_speed[True-backward] 7.5172ms 7.2588ms 137.7647 Ops/s 137.8019 Ops/s $\color{#d91a1a}-0.03\%$
test_reinforce_speed[reduce-overhead-None] 2.6904ms 2.3108ms 432.7535 Ops/s 435.2093 Ops/s $\color{#d91a1a}-0.56\%$
test_reinforce_speed[reduce-overhead-backward] 7.5935ms 7.2526ms 137.8812 Ops/s 137.8093 Ops/s $\color{#35bf28}+0.05\%$
test_iql_speed[False-None] 24.5468ms 20.2512ms 49.3798 Ops/s 48.6710 Ops/s $\color{#35bf28}+1.46\%$
test_iql_speed[False-backward] 38.3147ms 31.4210ms 31.8259 Ops/s 31.9142 Ops/s $\color{#d91a1a}-0.28\%$
test_iql_speed[True-None] 7.4377ms 6.9661ms 143.5531 Ops/s 143.1114 Ops/s $\color{#35bf28}+0.31\%$
test_iql_speed[True-backward] 16.1747ms 15.7890ms 63.3353 Ops/s 61.4197 Ops/s $\color{#35bf28}+3.12\%$
test_iql_speed[reduce-overhead-None] 7.5524ms 6.9908ms 143.0458 Ops/s 147.1113 Ops/s $\color{#d91a1a}-2.76\%$
test_iql_speed[reduce-overhead-backward] 16.5318ms 15.9585ms 62.6624 Ops/s 63.1073 Ops/s $\color{#d91a1a}-0.71\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.5520ms 6.2659ms 159.5943 Ops/s 161.6914 Ops/s $\color{#d91a1a}-1.30\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.3773ms 0.3543ms 2.8227 KOps/s 3.7340 KOps/s $\textbf{\color{#d91a1a}-24.41\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7205ms 0.3195ms 3.1298 KOps/s 3.6349 KOps/s $\textbf{\color{#d91a1a}-13.90\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.5214ms 6.1812ms 161.7800 Ops/s 165.2441 Ops/s $\color{#d91a1a}-2.10\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.2715ms 0.3345ms 2.9891 KOps/s 3.1289 KOps/s $\color{#d91a1a}-4.47\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6143ms 0.3138ms 3.1872 KOps/s 3.2990 KOps/s $\color{#d91a1a}-3.39\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.6286ms 1.4001ms 714.2204 Ops/s 789.5704 Ops/s $\textbf{\color{#d91a1a}-9.54\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.5812ms 1.3442ms 743.9212 Ops/s 761.0130 Ops/s $\color{#d91a1a}-2.25\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.4743ms 6.3241ms 158.1257 Ops/s 160.2700 Ops/s $\color{#d91a1a}-1.34\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.6703ms 0.4799ms 2.0837 KOps/s 2.1921 KOps/s $\color{#d91a1a}-4.95\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6094ms 0.4070ms 2.4573 KOps/s 2.7124 KOps/s $\textbf{\color{#d91a1a}-9.40\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.3045ms 6.1189ms 163.4277 Ops/s 164.0857 Ops/s $\color{#d91a1a}-0.40\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.3665ms 0.3486ms 2.8685 KOps/s 3.7866 KOps/s $\textbf{\color{#d91a1a}-24.25\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 1.1991ms 0.2563ms 3.9010 KOps/s 3.1068 KOps/s $\textbf{\color{#35bf28}+25.56\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3898ms 6.0764ms 164.5711 Ops/s 163.9536 Ops/s $\color{#35bf28}+0.38\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.8437ms 0.3372ms 2.9658 KOps/s 3.2019 KOps/s $\textbf{\color{#d91a1a}-7.37\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5771ms 0.3363ms 2.9733 KOps/s 3.1926 KOps/s $\textbf{\color{#d91a1a}-6.87\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.6269ms 6.3861ms 156.5889 Ops/s 156.1360 Ops/s $\color{#35bf28}+0.29\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.6814ms 0.4324ms 2.3126 KOps/s 2.3090 KOps/s $\color{#35bf28}+0.16\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 10.1594ms 0.4574ms 2.1864 KOps/s 2.2974 KOps/s $\color{#d91a1a}-4.83\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.9230ms 5.3114ms 188.2728 Ops/s 184.8784 Ops/s $\color{#35bf28}+1.84\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.1232ms 2.0366ms 491.0057 Ops/s 405.4538 Ops/s $\textbf{\color{#35bf28}+21.10\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.7127ms 1.2115ms 825.4517 Ops/s 844.8863 Ops/s $\color{#d91a1a}-2.30\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4061s 13.3838ms 74.7171 Ops/s 184.7474 Ops/s $\textbf{\color{#d91a1a}-59.56\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.5160ms 2.0406ms 490.0501 Ops/s 436.7656 Ops/s $\textbf{\color{#35bf28}+12.20\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.8390ms 1.2368ms 808.5429 Ops/s 790.2765 Ops/s $\color{#35bf28}+2.31\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.1682ms 5.5237ms 181.0372 Ops/s 181.8420 Ops/s $\color{#d91a1a}-0.44\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 11.1998ms 2.2680ms 440.9202 Ops/s 411.0649 Ops/s $\textbf{\color{#35bf28}+7.26\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.3825ms 1.2875ms 776.6933 Ops/s 734.0604 Ops/s $\textbf{\color{#35bf28}+5.81\%}$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants