[Feature] inline hold_out_net
#2499
Merged
vmoens added a commit that referenced this pull request on Oct 17, 2024:
ghstack-source-id: c315202c8af55f0852195fe488ae855966386c4c
Pull Request resolved: #2499
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2499
Note: Links to docs will display an error until the docs builds have been completed.
❌ 18 New Failures, 4 Unrelated Failures
As of commit 9a5e15e with merge base d894358:
NEW FAILURES - The following jobs have failed:
BROKEN TRUNK - The following jobs failed but were present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes.
facebook-github-bot added the CLA Signed label on Oct 17, 2024
vmoens added a commit that referenced this pull request on Oct 17, 2024:
ghstack-source-id: c315202c8af55f0852195fe488ae855966386c4c
Pull Request resolved: #2499
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_simple | 0.4256s | 0.4231s | 2.3636 Ops/s | 2.4258 Ops/s | |
test_transformed | 0.7001s | 0.6180s | 1.6180 Ops/s | 1.7085 Ops/s | |
test_serial | 1.4485s | 1.3654s | 0.7324 Ops/s | 0.7427 Ops/s | |
test_parallel | 1.3199s | 1.3147s | 0.7606 Ops/s | 0.7510 Ops/s | |
test_step_mdp_speed[True-True-True-True-True] | 0.1536ms | 29.2155μs | 34.2284 KOps/s | 34.1633 KOps/s | |
test_step_mdp_speed[True-True-True-True-False] | 56.7560μs | 17.3792μs | 57.5399 KOps/s | 57.9899 KOps/s | |
test_step_mdp_speed[True-True-True-False-True] | 62.4570μs | 16.2093μs | 61.6931 KOps/s | 62.5826 KOps/s | |
test_step_mdp_speed[True-True-True-False-False] | 83.9730μs | 9.4598μs | 105.7101 KOps/s | 108.7525 KOps/s | |
test_step_mdp_speed[True-True-False-True-True] | 66.9460μs | 31.2403μs | 32.0099 KOps/s | 31.9770 KOps/s | |
test_step_mdp_speed[True-True-False-True-False] | 75.1210μs | 19.4478μs | 51.4197 KOps/s | 51.1443 KOps/s | |
test_step_mdp_speed[True-True-False-False-True] | 78.3460μs | 18.3016μs | 54.6402 KOps/s | 55.0248 KOps/s | |
test_step_mdp_speed[True-True-False-False-False] | 39.1130μs | 11.5804μs | 86.3525 KOps/s | 86.0521 KOps/s | |
test_step_mdp_speed[True-False-True-True-True] | 93.3040μs | 33.8043μs | 29.5821 KOps/s | 29.9062 KOps/s | |
test_step_mdp_speed[True-False-True-True-False] | 51.0150μs | 21.4612μs | 46.5956 KOps/s | 46.6376 KOps/s | |
test_step_mdp_speed[True-False-True-False-True] | 79.7990μs | 18.6469μs | 53.6282 KOps/s | 55.8736 KOps/s | |
test_step_mdp_speed[True-False-True-False-False] | 36.6380μs | 11.5318μs | 86.7164 KOps/s | 86.3400 KOps/s | |
test_step_mdp_speed[True-False-False-True-True] | 90.8500μs | 35.7123μs | 28.0015 KOps/s | 28.5968 KOps/s | |
test_step_mdp_speed[True-False-False-True-False] | 69.9910μs | 23.7121μs | 42.1726 KOps/s | 42.6093 KOps/s | |
test_step_mdp_speed[True-False-False-False-True] | 51.8470μs | 20.6168μs | 48.5040 KOps/s | 49.6827 KOps/s | |
test_step_mdp_speed[True-False-False-False-False] | 46.2060μs | 13.5774μs | 73.6517 KOps/s | 73.2021 KOps/s | |
test_step_mdp_speed[False-True-True-True-True] | 90.9900μs | 33.4300μs | 29.9132 KOps/s | 30.3649 KOps/s | |
test_step_mdp_speed[False-True-True-True-False] | 64.6510μs | 21.6401μs | 46.2106 KOps/s | 45.8279 KOps/s | |
test_step_mdp_speed[False-True-True-False-True] | 74.0580μs | 22.0312μs | 45.3902 KOps/s | 45.8140 KOps/s | |
test_step_mdp_speed[False-True-True-False-False] | 58.4590μs | 13.5282μs | 73.9197 KOps/s | 73.9367 KOps/s | |
test_step_mdp_speed[False-True-False-True-True] | 79.5190μs | 35.4062μs | 28.2437 KOps/s | 28.3079 KOps/s | |
test_step_mdp_speed[False-True-False-True-False] | 81.9030μs | 23.1936μs | 43.1154 KOps/s | 42.0639 KOps/s | |
test_step_mdp_speed[False-True-False-False-True] | 2.8290ms | 24.1746μs | 41.3657 KOps/s | 41.4622 KOps/s | |
test_step_mdp_speed[False-True-False-False-False] | 42.5390μs | 15.3471μs | 65.1588 KOps/s | 65.2317 KOps/s | |
test_step_mdp_speed[False-False-True-True-True] | 0.1394ms | 37.4799μs | 26.6809 KOps/s | 26.6818 KOps/s | |
test_step_mdp_speed[False-False-True-True-False] | 66.2340μs | 25.6441μs | 38.9953 KOps/s | 38.5014 KOps/s | |
test_step_mdp_speed[False-False-True-False-True] | 62.3670μs | 24.0411μs | 41.5954 KOps/s | 42.2977 KOps/s | |
test_step_mdp_speed[False-False-True-False-False] | 83.7640μs | 15.3195μs | 65.2765 KOps/s | 64.9296 KOps/s | |
test_step_mdp_speed[False-False-False-True-True] | 0.1333ms | 39.7835μs | 25.1360 KOps/s | 25.3575 KOps/s | |
test_step_mdp_speed[False-False-False-True-False] | 88.6350μs | 27.8694μs | 35.8816 KOps/s | 36.0397 KOps/s | |
test_step_mdp_speed[False-False-False-False-True] | 93.3280μs | 25.1947μs | 39.6908 KOps/s | 39.6594 KOps/s | |
test_step_mdp_speed[False-False-False-False-False] | 73.1130μs | 17.3049μs | 57.7869 KOps/s | 57.9369 KOps/s | |
test_values[generalized_advantage_estimate-True-True] | 9.8623ms | 9.5474ms | 104.7409 Ops/s | 104.9552 Ops/s | |
test_values[vec_generalized_advantage_estimate-True-True] | 41.4671ms | 36.4091ms | 27.4657 Ops/s | 29.5629 Ops/s | |
test_values[td0_return_estimate-False-False] | 0.2443ms | 0.1786ms | 5.5984 KOps/s | 5.5376 KOps/s | |
test_values[td1_return_estimate-False-False] | 28.8332ms | 24.6060ms | 40.6405 Ops/s | 41.2702 Ops/s | |
test_values[vec_td1_return_estimate-False-False] | 41.2836ms | 37.1097ms | 26.9471 Ops/s | 29.5697 Ops/s | |
test_values[td_lambda_return_estimate-True-False] | 39.3043ms | 35.2807ms | 28.3441 Ops/s | 28.9802 Ops/s | |
test_values[vec_td_lambda_return_estimate-True-False] | 40.1417ms | 37.1487ms | 26.9188 Ops/s | 29.5712 Ops/s | |
test_gae_speed[generalized_advantage_estimate-False-1-512] | 8.9170ms | 8.2852ms | 120.6965 Ops/s | 119.2856 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 2.6810ms | 2.0205ms | 494.9220 Ops/s | 499.1166 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.6445ms | 0.3617ms | 2.7650 KOps/s | 2.7216 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 51.1083ms | 49.0685ms | 20.3797 Ops/s | 23.4334 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 3.1727ms | 3.0414ms | 328.7937 Ops/s | 323.6506 Ops/s | |
test_dqn_speed[False-None] | 1.9269ms | 1.3701ms | 729.8509 Ops/s | 721.7085 Ops/s | |
test_dqn_speed[False-backward] | 1.9428ms | 1.8603ms | 537.5411 Ops/s | 538.2370 Ops/s | |
test_dqn_speed[True-None] | 0.7095ms | 0.4762ms | 2.1001 KOps/s | 2.1017 KOps/s | |
test_dqn_speed[True-backward] | 0.9701ms | 0.8975ms | 1.1141 KOps/s | 1.0601 KOps/s | |
test_dqn_speed[reduce-overhead-None] | 0.8018ms | 0.4747ms | 2.1066 KOps/s | 2.0775 KOps/s | |
test_dqn_speed[reduce-overhead-backward] | 0.9792ms | 0.8985ms | 1.1130 KOps/s | 1.1051 KOps/s | |
test_ddpg_speed[False-None] | 3.9640ms | 2.8219ms | 354.3695 Ops/s | 347.4011 Ops/s | |
test_ddpg_speed[False-backward] | 4.1608ms | 3.9770ms | 251.4444 Ops/s | 241.0807 Ops/s | |
test_ddpg_speed[True-None] | 1.3770ms | 1.0118ms | 988.3810 Ops/s | 988.6960 Ops/s | |
test_ddpg_speed[True-backward] | 1.9921ms | 1.9123ms | 522.9436 Ops/s | 518.9902 Ops/s | |
test_ddpg_speed[reduce-overhead-None] | 1.2998ms | 1.0097ms | 990.4318 Ops/s | 978.3860 Ops/s | |
test_ddpg_speed[reduce-overhead-backward] | 2.0670ms | 1.9222ms | 520.2447 Ops/s | 522.6272 Ops/s | |
test_sac_speed[False-None] | 10.3583ms | 8.1012ms | 123.4385 Ops/s | 123.4445 Ops/s | |
test_sac_speed[False-backward] | 11.2399ms | 10.8353ms | 92.2910 Ops/s | 92.2863 Ops/s | |
test_sac_speed[True-None] | 2.7117ms | 1.8767ms | 532.8617 Ops/s | 537.9093 Ops/s | |
test_sac_speed[True-backward] | 3.6352ms | 3.5605ms | 280.8555 Ops/s | 277.3083 Ops/s | |
test_sac_speed[reduce-overhead-None] | 2.5587ms | 1.8658ms | 535.9714 Ops/s | 530.5757 Ops/s | |
test_sac_speed[reduce-overhead-backward] | 4.4609ms | 3.6168ms | 276.4905 Ops/s | 282.3755 Ops/s | |
test_redq_speed[False-None] | 19.1165ms | 13.2209ms | 75.6379 Ops/s | 77.2343 Ops/s | |
test_redq_speed[False-backward] | 24.1991ms | 22.5181ms | 44.4088 Ops/s | 44.9768 Ops/s | |
test_redq_speed[True-None] | 5.7263ms | 5.0348ms | 198.6193 Ops/s | 200.8178 Ops/s | |
test_redq_speed[True-backward] | 13.1050ms | 12.5138ms | 79.9115 Ops/s | 79.2521 Ops/s | |
test_redq_speed[reduce-overhead-None] | 5.7344ms | 4.9690ms | 201.2459 Ops/s | 204.4112 Ops/s | |
test_redq_speed[reduce-overhead-backward] | 13.2652ms | 12.8065ms | 78.0855 Ops/s | 79.5951 Ops/s | |
test_redq_deprec_speed[False-None] | 14.8191ms | 12.8748ms | 77.6714 Ops/s | 75.8004 Ops/s | |
test_redq_deprec_speed[False-backward] | 20.6960ms | 18.7129ms | 53.4392 Ops/s | 52.5227 Ops/s | |
test_redq_deprec_speed[True-None] | 4.2520ms | 3.6412ms | 274.6310 Ops/s | 268.8891 Ops/s | |
test_redq_deprec_speed[True-backward] | 9.4458ms | 8.4262ms | 118.6778 Ops/s | 119.9146 Ops/s | |
test_redq_deprec_speed[reduce-overhead-None] | 4.3854ms | 3.6581ms | 273.3641 Ops/s | 213.8987 Ops/s | |
test_redq_deprec_speed[reduce-overhead-backward] | 9.8295ms | 8.6822ms | 115.1782 Ops/s | 121.3879 Ops/s | |
test_td3_speed[False-None] | 8.3422ms | 8.0288ms | 124.5517 Ops/s | 125.5475 Ops/s | |
test_td3_speed[False-backward] | 11.0858ms | 10.5009ms | 95.2296 Ops/s | 95.8431 Ops/s | |
test_td3_speed[True-None] | 2.0566ms | 1.7832ms | 560.7749 Ops/s | 567.1496 Ops/s | |
test_td3_speed[True-backward] | 3.5070ms | 3.4158ms | 292.7551 Ops/s | 297.1876 Ops/s | |
test_td3_speed[reduce-overhead-None] | 2.0049ms | 1.7913ms | 558.2416 Ops/s | 565.5545 Ops/s | |
test_td3_speed[reduce-overhead-backward] | 3.5212ms | 3.4148ms | 292.8390 Ops/s | 296.2771 Ops/s | |
test_cql_speed[False-None] | 40.4263ms | 36.8908ms | 27.1070 Ops/s | 27.8105 Ops/s | |
test_cql_speed[False-backward] | 0.3342s | 52.9889ms | 18.8719 Ops/s | 21.5794 Ops/s | |
test_cql_speed[True-None] | 17.5664ms | 16.0247ms | 62.4037 Ops/s | 62.7234 Ops/s | |
test_cql_speed[True-backward] | 29.3566ms | 23.6241ms | 42.3297 Ops/s | 43.9643 Ops/s | |
test_cql_speed[reduce-overhead-None] | 17.0541ms | 16.1057ms | 62.0897 Ops/s | 62.5720 Ops/s | |
test_cql_speed[reduce-overhead-backward] | 25.0047ms | 23.1611ms | 43.1758 Ops/s | 43.0396 Ops/s | |
test_a2c_speed[False-None] | 8.9004ms | 7.2463ms | 138.0020 Ops/s | 137.4239 Ops/s | |
test_a2c_speed[False-backward] | 16.3247ms | 14.6693ms | 68.1695 Ops/s | 67.8319 Ops/s | |
test_a2c_speed[True-None] | 3.7051ms | 3.3621ms | 297.4317 Ops/s | 295.9445 Ops/s | |
test_a2c_speed[True-backward] | 10.7923ms | 10.1579ms | 98.4453 Ops/s | 98.2777 Ops/s | |
test_a2c_speed[reduce-overhead-None] | 3.8581ms | 3.3902ms | 294.9705 Ops/s | 295.4498 Ops/s | |
test_a2c_speed[reduce-overhead-backward] | 10.7448ms | 10.1061ms | 98.9498 Ops/s | 98.3928 Ops/s | |
test_ppo_speed[False-None] | 9.0412ms | 7.6139ms | 131.3387 Ops/s | 131.9097 Ops/s | |
test_ppo_speed[False-backward] | 16.2335ms | 15.1764ms | 65.8917 Ops/s | 66.6391 Ops/s | |
test_ppo_speed[True-None] | 4.5079ms | 3.7613ms | 265.8643 Ops/s | 264.1288 Ops/s | |
test_ppo_speed[True-backward] | 11.0651ms | 9.9345ms | 100.6589 Ops/s | 101.0294 Ops/s | |
test_ppo_speed[reduce-overhead-None] | 4.0590ms | 3.7580ms | 266.0958 Ops/s | 266.0968 Ops/s | |
test_ppo_speed[reduce-overhead-backward] | 10.7816ms | 9.9474ms | 100.5284 Ops/s | 101.0500 Ops/s | |
test_reinforce_speed[False-None] | 7.4410ms | 6.6287ms | 150.8584 Ops/s | 152.4688 Ops/s | |
test_reinforce_speed[False-backward] | 12.0349ms | 9.9578ms | 100.4242 Ops/s | 100.8736 Ops/s | |
test_reinforce_speed[True-None] | 3.2106ms | 2.7256ms | 366.8895 Ops/s | 367.0282 Ops/s | |
test_reinforce_speed[True-backward] | 9.3586ms | 8.9382ms | 111.8788 Ops/s | 113.1418 Ops/s | |
test_reinforce_speed[reduce-overhead-None] | 3.1375ms | 2.7320ms | 366.0372 Ops/s | 366.0036 Ops/s | |
test_reinforce_speed[reduce-overhead-backward] | 9.1522ms | 8.8545ms | 112.9373 Ops/s | 111.4280 Ops/s | |
test_iql_speed[False-None] | 34.1252ms | 32.6389ms | 30.6383 Ops/s | 30.5145 Ops/s | |
test_iql_speed[False-backward] | 47.7387ms | 45.9718ms | 21.7525 Ops/s | 21.7533 Ops/s | |
test_iql_speed[True-None] | 12.8078ms | 11.0568ms | 90.4425 Ops/s | 90.6068 Ops/s | |
test_iql_speed[True-backward] | 23.3051ms | 22.4286ms | 44.5860 Ops/s | 44.2814 Ops/s | |
test_iql_speed[reduce-overhead-None] | 12.4956ms | 11.1099ms | 90.0096 Ops/s | 90.5303 Ops/s | |
test_iql_speed[reduce-overhead-backward] | 25.0090ms | 22.6118ms | 44.2246 Ops/s | 44.0049 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 7.8380ms | 5.0169ms | 199.3249 Ops/s | 201.1785 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 3.6945ms | 0.4891ms | 2.0447 KOps/s | 2.0294 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7724ms | 0.4669ms | 2.1416 KOps/s | 2.1258 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 7.4871ms | 5.0066ms | 199.7353 Ops/s | 202.3988 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.8117ms | 0.4812ms | 2.0783 KOps/s | 2.0630 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.7165ms | 0.4558ms | 2.1942 KOps/s | 2.1586 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 2.2058ms | 1.6054ms | 622.8813 Ops/s | 616.3858 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 2.1458ms | 1.5509ms | 644.7982 Ops/s | 638.2747 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.8840ms | 5.1118ms | 195.6266 Ops/s | 196.8208 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.9399ms | 0.6244ms | 1.6016 KOps/s | 1.5806 KOps/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.8439ms | 0.5956ms | 1.6790 KOps/s | 1.6415 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 5.3079ms | 4.9615ms | 201.5536 Ops/s | 200.1306 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.9378ms | 0.4918ms | 2.0333 KOps/s | 2.0496 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7002ms | 0.4592ms | 2.1775 KOps/s | 2.1257 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 5.0498ms | 4.7885ms | 208.8319 Ops/s | 201.8041 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.6514ms | 0.4786ms | 2.0896 KOps/s | 2.0617 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 7.4038ms | 0.4627ms | 2.1611 KOps/s | 2.1632 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 7.7444ms | 5.0104ms | 199.5865 Ops/s | 198.4696 Ops/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.8826ms | 0.6233ms | 1.6043 KOps/s | 1.6063 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 7.9147ms | 0.6074ms | 1.6464 KOps/s | 1.6769 KOps/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 5.7774ms | 4.3277ms | 231.0680 Ops/s | 238.9579 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 6.4694ms | 2.2942ms | 435.8871 Ops/s | 448.6138 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 9.1180ms | 1.5627ms | 639.9259 Ops/s | 713.0855 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.4122s | 12.5650ms | 79.5863 Ops/s | 33.9622 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 8.4884ms | 2.3501ms | 425.5156 Ops/s | 407.3255 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 5.3169ms | 1.3436ms | 744.2474 Ops/s | 729.7795 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 8.6557ms | 4.6057ms | 217.1238 Ops/s | 219.4546 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 8.2101ms | 2.4482ms | 408.4571 Ops/s | 405.6187 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 2.7735ms | 1.4026ms | 712.9593 Ops/s | 662.4112 Ops/s |
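The Change column is empty in this capture of the table. As a rough reading aid, the sketch below computes a relative change from the Ops and Ops on Repo HEAD columns for three rows copied from the table above. The helper names `parse_ops` and `relative_change` are hypothetical, and the formula (percent difference relative to repo HEAD) is an assumption, not necessarily what the benchmark bot reports.

```python
# Hypothetical helpers: not part of the PR or of the benchmark tooling.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3}  # only the units seen in the table


def parse_ops(cell: str) -> float:
    """Convert a cell such as '2.1001 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * UNIT_SCALE[unit]


def relative_change(ops: str, ops_head: str) -> float:
    """Percent change of the PR throughput relative to repo HEAD (assumed formula)."""
    new, base = parse_ops(ops), parse_ops(ops_head)
    return (new - base) / base * 100.0


# (Ops, Ops on Repo HEAD) pairs copied verbatim from the table above.
rows = {
    "test_simple": ("2.3636 Ops/s", "2.4258 Ops/s"),
    "test_dqn_speed[True-None]": ("2.1001 KOps/s", "2.1017 KOps/s"),
    "test_redq_deprec_speed[reduce-overhead-None]": ("273.3641 Ops/s", "213.8987 Ops/s"),
}

for name, (ops, head) in rows.items():
    print(f"{name}: {relative_change(ops, head):+.2f}%")
```

A positive value means the PR branch measured more operations per second than repo HEAD. Single-run differences of a few percent are likely within benchmark noise, so only large gaps are worth a closer look.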
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_simple | 0.7362s | 0.7336s | 1.3632 Ops/s | 1.3488 Ops/s | |
test_transformed | 1.0736s | 1.0033s | 0.9967 Ops/s | 1.0165 Ops/s | |
test_serial | 2.2775s | 2.1973s | 0.4551 Ops/s | 0.4580 Ops/s | |
test_parallel | 2.0738s | 2.0058s | 0.4986 Ops/s | 0.4966 Ops/s | |
test_step_mdp_speed[True-True-True-True-True] | 0.1850ms | 39.0363μs | 25.6172 KOps/s | 26.0838 KOps/s | |
test_step_mdp_speed[True-True-True-True-False] | 0.1560ms | 22.7065μs | 44.0403 KOps/s | 43.3539 KOps/s | |
test_step_mdp_speed[True-True-True-False-True] | 65.1310μs | 21.1338μs | 47.3177 KOps/s | 47.6099 KOps/s | |
test_step_mdp_speed[True-True-True-False-False] | 37.6110μs | 12.3530μs | 80.9520 KOps/s | 80.4458 KOps/s | |
test_step_mdp_speed[True-True-False-True-True] | 75.1310μs | 41.8690μs | 23.8840 KOps/s | 23.6364 KOps/s | |
test_step_mdp_speed[True-True-False-True-False] | 53.4710μs | 25.4995μs | 39.2165 KOps/s | 39.3604 KOps/s | |
test_step_mdp_speed[True-True-False-False-True] | 52.9610μs | 24.4271μs | 40.9381 KOps/s | 41.5193 KOps/s | |
test_step_mdp_speed[True-True-False-False-False] | 44.8510μs | 15.1365μs | 66.0653 KOps/s | 66.6395 KOps/s | |
test_step_mdp_speed[True-False-True-True-True] | 83.2520μs | 45.0165μs | 22.2141 KOps/s | 22.6559 KOps/s | |
test_step_mdp_speed[True-False-True-True-False] | 56.1710μs | 28.0334μs | 35.6717 KOps/s | 35.7888 KOps/s | |
test_step_mdp_speed[True-False-True-False-True] | 56.6710μs | 23.9033μs | 41.8352 KOps/s | 42.1057 KOps/s | |
test_step_mdp_speed[True-False-True-False-False] | 0.1955ms | 14.3674μs | 69.6019 KOps/s | 66.5008 KOps/s | |
test_step_mdp_speed[True-False-False-True-True] | 95.8920μs | 46.6154μs | 21.4521 KOps/s | 21.0113 KOps/s | |
test_step_mdp_speed[True-False-False-True-False] | 64.2010μs | 30.4855μs | 32.8025 KOps/s | 32.8248 KOps/s | |
test_step_mdp_speed[True-False-False-False-True] | 52.5710μs | 26.1761μs | 38.2028 KOps/s | 37.7895 KOps/s | |
test_step_mdp_speed[True-False-False-False-False] | 43.1010μs | 17.6140μs | 56.7731 KOps/s | 57.0031 KOps/s | |
test_step_mdp_speed[False-True-True-True-True] | 82.4220μs | 44.4879μs | 22.4780 KOps/s | 22.2086 KOps/s | |
test_step_mdp_speed[False-True-True-True-False] | 65.2810μs | 27.9166μs | 35.8210 KOps/s | 35.6109 KOps/s | |
test_step_mdp_speed[False-True-True-False-True] | 77.8310μs | 28.7618μs | 34.7683 KOps/s | 35.8017 KOps/s | |
test_step_mdp_speed[False-True-True-False-False] | 42.3710μs | 17.2818μs | 57.8644 KOps/s | 58.1507 KOps/s | |
test_step_mdp_speed[False-True-False-True-True] | 86.3620μs | 47.4753μs | 21.0636 KOps/s | 21.1132 KOps/s | |
test_step_mdp_speed[False-True-False-True-False] | 69.3820μs | 30.5449μs | 32.7387 KOps/s | 32.7848 KOps/s | |
test_step_mdp_speed[False-True-False-False-True] | 3.4528ms | 32.0513μs | 31.2000 KOps/s | 31.4567 KOps/s | |
test_step_mdp_speed[False-True-False-False-False] | 50.8410μs | 20.1910μs | 49.5271 KOps/s | 49.1186 KOps/s | |
test_step_mdp_speed[False-False-True-True-True] | 78.8710μs | 50.3614μs | 19.8565 KOps/s | 19.9460 KOps/s | |
test_step_mdp_speed[False-False-True-True-False] | 81.7320μs | 33.8343μs | 29.5558 KOps/s | 29.7178 KOps/s | |
test_step_mdp_speed[False-False-True-False-True] | 71.2920μs | 31.6855μs | 31.5601 KOps/s | 32.9083 KOps/s | |
test_step_mdp_speed[False-False-True-False-False] | 53.2210μs | 20.2158μs | 49.4662 KOps/s | 49.9407 KOps/s | |
test_step_mdp_speed[False-False-False-True-True] | 0.1402ms | 52.4860μs | 19.0527 KOps/s | 19.2303 KOps/s | |
test_step_mdp_speed[False-False-False-True-False] | 67.5120μs | 36.0726μs | 27.7219 KOps/s | 28.0610 KOps/s | |
test_step_mdp_speed[False-False-False-False-True] | 65.7320μs | 33.6116μs | 29.7516 KOps/s | 30.4033 KOps/s | |
test_step_mdp_speed[False-False-False-False-False] | 57.5620μs | 22.4630μs | 44.5177 KOps/s | 45.1418 KOps/s | |
test_values[generalized_advantage_estimate-True-True] | 25.4215ms | 24.8998ms | 40.1609 Ops/s | 40.0732 Ops/s | |
test_values[vec_generalized_advantage_estimate-True-True] | 0.1088s | 3.0682ms | 325.9212 Ops/s | 326.7359 Ops/s | |
test_values[td0_return_estimate-False-False] | 87.5910μs | 66.4047μs | 15.0592 KOps/s | 15.0776 KOps/s | |
test_values[td1_return_estimate-False-False] | 56.2491ms | 55.7246ms | 17.9454 Ops/s | 17.8416 Ops/s | |
test_values[vec_td1_return_estimate-False-False] | 1.2706ms | 1.0756ms | 929.6953 Ops/s | 925.3882 Ops/s | |
test_values[td_lambda_return_estimate-True-False] | 90.2582ms | 88.3865ms | 11.3139 Ops/s | 11.3332 Ops/s | |
test_values[vec_td_lambda_return_estimate-True-False] | 1.3005ms | 1.0778ms | 927.8268 Ops/s | 931.5194 Ops/s | |
test_gae_speed[generalized_advantage_estimate-False-1-512] | 26.4439ms | 24.8789ms | 40.1947 Ops/s | 41.3956 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 1.0311ms | 0.7454ms | 1.3416 KOps/s | 1.3476 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.7615ms | 0.6614ms | 1.5118 KOps/s | 1.5092 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 1.5798ms | 1.4711ms | 679.7579 Ops/s | 680.8800 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 0.7114ms | 0.6766ms | 1.4779 KOps/s | 1.4789 KOps/s | |
test_dqn_speed[False-None] | 6.7599ms | 1.3419ms | 745.2382 Ops/s | 669.5173 Ops/s | |
test_dqn_speed[False-backward] | 1.9373ms | 1.8775ms | 532.6102 Ops/s | 530.6029 Ops/s | |
test_dqn_speed[True-None] | 1.2204ms | 0.5700ms | 1.7544 KOps/s | 1.6728 KOps/s | |
test_dqn_speed[True-backward] | 1.0531ms | 1.0150ms | 985.2196 Ops/s | 857.7337 Ops/s | |
test_dqn_speed[reduce-overhead-None] | 0.6427ms | 0.5652ms | 1.7692 KOps/s | 1.6968 KOps/s | |
test_dqn_speed[reduce-overhead-backward] | 1.0640ms | 1.0252ms | 975.3907 Ops/s | 963.9457 Ops/s | |
test_ddpg_speed[False-None] | 3.1522ms | 2.7570ms | 362.7096 Ops/s | 362.7525 Ops/s | |
test_ddpg_speed[False-backward] | 4.2360ms | 4.0224ms | 248.6084 Ops/s | 251.3929 Ops/s | |
test_ddpg_speed[True-None] | 1.4859ms | 1.2775ms | 782.7936 Ops/s | 777.9191 Ops/s | |
test_ddpg_speed[True-backward] | 2.3964ms | 2.2618ms | 442.1294 Ops/s | 435.6966 Ops/s | |
test_ddpg_speed[reduce-overhead-None] | 1.4460ms | 1.2767ms | 783.2599 Ops/s | 772.1952 Ops/s | |
test_ddpg_speed[reduce-overhead-backward] | 2.3700ms | 2.2714ms | 440.2616 Ops/s | 435.3570 Ops/s | |
test_sac_speed[False-None] | 8.0127ms | 7.7319ms | 129.3341 Ops/s | 128.8705 Ops/s | |
test_sac_speed[False-backward] | 11.4421ms | 11.0213ms | 90.7335 Ops/s | 91.2957 Ops/s | |
test_sac_speed[True-None] | 2.2550ms | 2.0997ms | 476.2642 Ops/s | 474.9581 Ops/s | |
test_sac_speed[True-backward] | 4.1548ms | 4.0547ms | 246.6248 Ops/s | 230.4145 Ops/s | |
test_sac_speed[reduce-overhead-None] | 2.2512ms | 2.0985ms | 476.5262 Ops/s | 472.9446 Ops/s | |
test_sac_speed[reduce-overhead-backward] | 4.3575ms | 4.0948ms | 244.2135 Ops/s | 245.0894 Ops/s | |
test_redq_speed[False-None] | 14.5971ms | 10.3611ms | 96.5152 Ops/s | 94.4562 Ops/s | |
test_redq_speed[False-backward] | 18.6765ms | 17.8958ms | 55.8791 Ops/s | 54.7261 Ops/s | |
test_redq_speed[True-None] | 4.0335ms | 3.7350ms | 267.7395 Ops/s | 271.3290 Ops/s | |
test_redq_speed[True-backward] | 9.4183ms | 8.9321ms | 111.9563 Ops/s | 113.9583 Ops/s | |
test_redq_speed[reduce-overhead-None] | 4.0279ms | 3.6783ms | 271.8619 Ops/s | 275.1692 Ops/s | |
test_redq_speed[reduce-overhead-backward] | 9.5136ms | 8.9285ms | 112.0007 Ops/s | 113.0666 Ops/s | |
test_redq_deprec_speed[False-None] | 12.5800ms | 10.8932ms | 91.8005 Ops/s | 93.3912 Ops/s | |
test_redq_deprec_speed[False-backward] | 16.2020ms | 15.7699ms | 63.4118 Ops/s | 64.2576 Ops/s | |
test_redq_deprec_speed[True-None] | 4.1547ms | 3.3010ms | 302.9347 Ops/s | 308.7553 Ops/s | |
test_redq_deprec_speed[True-backward] | 7.6544ms | 7.2718ms | 137.5166 Ops/s | 137.9488 Ops/s | |
test_redq_deprec_speed[reduce-overhead-None] | 3.6748ms | 3.2687ms | 305.9360 Ops/s | 301.7485 Ops/s | |
test_redq_deprec_speed[reduce-overhead-backward] | 7.5446ms | 7.2241ms | 138.4258 Ops/s | 136.2013 Ops/s | |
test_td3_speed[False-None] | 7.8126ms | 7.6691ms | 130.3941 Ops/s | 128.7733 Ops/s | |
test_td3_speed[False-backward] | 10.8381ms | 10.5154ms | 95.0987 Ops/s | 93.8744 Ops/s | |
test_td3_speed[True-None] | 2.0226ms | 1.9615ms | 509.8173 Ops/s | 508.8090 Ops/s | |
test_td3_speed[True-backward] | 3.8789ms | 3.7657ms | 265.5572 Ops/s | 260.1509 Ops/s | |
test_td3_speed[reduce-overhead-None] | 2.0041ms | 1.9605ms | 510.0780 Ops/s | 508.3182 Ops/s | |
test_td3_speed[reduce-overhead-backward] | 3.9318ms | 3.8120ms | 262.3326 Ops/s | 255.0095 Ops/s | |
test_cql_speed[False-None] | 30.0183ms | 25.9391ms | 38.5519 Ops/s | 38.6337 Ops/s | |
test_cql_speed[False-backward] | 38.1225ms | 35.0402ms | 28.5387 Ops/s | 27.8043 Ops/s | |
test_cql_speed[True-None] | 11.7777ms | 11.3319ms | 88.2467 Ops/s | 88.7193 Ops/s | |
test_cql_speed[True-backward] | 17.7506ms | 17.1360ms | 58.3566 Ops/s | 57.1405 Ops/s | |
test_cql_speed[reduce-overhead-None] | 11.6778ms | 11.2941ms | 88.5419 Ops/s | 83.0010 Ops/s | |
test_cql_speed[reduce-overhead-backward] | 17.4860ms | 16.9950ms | 58.8410 Ops/s | 58.1005 Ops/s | |
test_a2c_speed[False-None] | 7.5876ms | 5.4298ms | 184.1676 Ops/s | 181.9529 Ops/s | |
test_a2c_speed[False-backward] | 12.2711ms | 11.9159ms | 83.9217 Ops/s | 82.8365 Ops/s | |
test_a2c_speed[True-None] | 3.5147ms | 3.1279ms | 319.7077 Ops/s | 313.2928 Ops/s | |
test_a2c_speed[True-backward] | 9.0501ms | 8.7283ms | 114.5701 Ops/s | 113.2987 Ops/s | |
test_a2c_speed[reduce-overhead-None] | 3.5574ms | 3.1787ms | 314.5951 Ops/s | 310.3181 Ops/s | |
test_a2c_speed[reduce-overhead-backward] | 9.1518ms | 8.6677ms | 115.3714 Ops/s | 115.7683 Ops/s | |
test_ppo_speed[False-None] | 6.2559ms | 5.7489ms | 173.9469 Ops/s | 175.8565 Ops/s | |
test_ppo_speed[False-backward] | 12.9215ms | 12.5386ms | 79.7534 Ops/s | 80.6681 Ops/s | |
test_ppo_speed[True-None] | 3.7221ms | 3.5501ms | 281.6802 Ops/s | 277.0032 Ops/s | |
test_ppo_speed[True-backward] | 8.8016ms | 8.4611ms | 118.1885 Ops/s | 110.9831 Ops/s | |
test_ppo_speed[reduce-overhead-None] | 3.8992ms | 3.5157ms | 284.4420 Ops/s | 282.7471 Ops/s | |
test_ppo_speed[reduce-overhead-backward] | 8.8356ms | 8.4621ms | 118.1733 Ops/s | 118.6302 Ops/s | |
test_reinforce_speed[False-None] | 6.3309ms | 4.5526ms | 219.6546 Ops/s | 217.7369 Ops/s | |
test_reinforce_speed[False-backward] | 7.8274ms | 7.4351ms | 134.4970 Ops/s | 134.0572 Ops/s | |
test_reinforce_speed[True-None] | 2.5064ms | 2.2931ms | 436.0818 Ops/s | 440.7474 Ops/s | |
test_reinforce_speed[True-backward] | 7.5172ms | 7.2588ms | 137.7647 Ops/s | 137.8019 Ops/s | |
test_reinforce_speed[reduce-overhead-None] | 2.6904ms | 2.3108ms | 432.7535 Ops/s | 435.2093 Ops/s | |
test_reinforce_speed[reduce-overhead-backward] | 7.5935ms | 7.2526ms | 137.8812 Ops/s | 137.8093 Ops/s | |
test_iql_speed[False-None] | 24.5468ms | 20.2512ms | 49.3798 Ops/s | 48.6710 Ops/s | |
test_iql_speed[False-backward] | 38.3147ms | 31.4210ms | 31.8259 Ops/s | 31.9142 Ops/s | |
test_iql_speed[True-None] | 7.4377ms | 6.9661ms | 143.5531 Ops/s | 143.1114 Ops/s | |
test_iql_speed[True-backward] | 16.1747ms | 15.7890ms | 63.3353 Ops/s | 61.4197 Ops/s | |
test_iql_speed[reduce-overhead-None] | 7.5524ms | 6.9908ms | 143.0458 Ops/s | 147.1113 Ops/s | |
test_iql_speed[reduce-overhead-backward] | 16.5318ms | 15.9585ms | 62.6624 Ops/s | 63.1073 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 6.5520ms | 6.2659ms | 159.5943 Ops/s | 161.6914 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 2.3773ms | 0.3543ms | 2.8227 KOps/s | 3.7340 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7205ms | 0.3195ms | 3.1298 KOps/s | 3.6349 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 6.5214ms | 6.1812ms | 161.7800 Ops/s | 165.2441 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 2.2715ms | 0.3345ms | 2.9891 KOps/s | 3.1289 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6143ms | 0.3138ms | 3.1872 KOps/s | 3.2990 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 1.6286ms | 1.4001ms | 714.2204 Ops/s | 789.5704 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 1.5812ms | 1.3442ms | 743.9212 Ops/s | 761.0130 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 6.4743ms | 6.3241ms | 158.1257 Ops/s | 160.2700 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 2.6703ms | 0.4799ms | 2.0837 KOps/s | 2.1921 KOps/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.6094ms | 0.4070ms | 2.4573 KOps/s | 2.7124 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 6.3045ms | 6.1189ms | 163.4277 Ops/s | 164.0857 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 2.3665ms | 0.3486ms | 2.8685 KOps/s | 3.7866 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 1.1991ms | 0.2563ms | 3.9010 KOps/s | 3.1068 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 6.3898ms | 6.0764ms | 164.5711 Ops/s | 163.9536 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 1.8437ms | 0.3372ms | 2.9658 KOps/s | 3.2019 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.5771ms | 0.3363ms | 2.9733 KOps/s | 3.1926 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 6.6269ms | 6.3861ms | 156.5889 Ops/s | 156.1360 Ops/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.6814ms | 0.4324ms | 2.3126 KOps/s | 2.3090 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 10.1594ms | 0.4574ms | 2.1864 KOps/s | 2.2974 KOps/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 6.9230ms | 5.3114ms | 188.2728 Ops/s | 184.8784 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 9.1232ms | 2.0366ms | 491.0057 Ops/s | 405.4538 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 7.7127ms | 1.2115ms | 825.4517 Ops/s | 844.8863 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.4061s | 13.3838ms | 74.7171 Ops/s | 184.7474 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 9.5160ms | 2.0406ms | 490.0501 Ops/s | 436.7656 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 6.8390ms | 1.2368ms | 808.5429 Ops/s | 790.2765 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 7.1682ms | 5.5237ms | 181.0372 Ops/s | 181.8420 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 11.1998ms | 2.2680ms | 440.9202 Ops/s | 411.0649 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 2.3825ms | 1.2875ms | 776.6933 Ops/s | 734.0604 Ops/s |
Stack from ghstack (oldest at bottom):
hold_out_net
#2499