[Benchmark] Memmap tensordict benchmarks #432
Merged
facebook-github-bot added the CLA Signed label on Jun 19, 2023. (This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.)

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_items | 13.0000μs | 3.6489μs | 274.0515 KOps/s | 315.4834 KOps/s | |
test_items_nested | 1.2939ms | 0.4872ms | 2.0526 KOps/s | 2.0460 KOps/s | |
test_items_nested_leaf | 0.3508ms | 0.2892ms | 3.4581 KOps/s | 3.4315 KOps/s | |
test_items_stack_nested | 26.2477ms | 23.9985ms | 41.6693 Ops/s | 42.0800 Ops/s | |
test_items_stack_nested_leaf | 17.5271ms | 13.2491ms | 75.4766 Ops/s | 76.7638 Ops/s | |
test_keys | 74.2010μs | 7.3868μs | 135.3768 KOps/s | 143.3242 KOps/s | |
test_keys_nested | 0.2875ms | 0.1866ms | 5.3586 KOps/s | 5.4647 KOps/s | |
test_keys_nested_leaf | 0.2949ms | 0.1844ms | 5.4232 KOps/s | 5.5108 KOps/s | |
test_keys_stack_nested | 1.9570ms | 1.7542ms | 570.0714 Ops/s | 572.4613 Ops/s | |
test_keys_stack_nested_leaf | 1.9514ms | 1.7536ms | 570.2491 Ops/s | 577.8733 Ops/s | |
test_values | 5.6000μs | 2.2827μs | 438.0741 KOps/s | 451.2768 KOps/s | |
test_values_nested | 0.5404ms | 0.4841ms | 2.0659 KOps/s | 2.0484 KOps/s | |
test_values_nested_leaf | 0.3625ms | 0.2891ms | 3.4592 KOps/s | 3.4393 KOps/s | |
test_values_stack_nested | 27.1391ms | 23.9204ms | 41.8053 Ops/s | 42.8161 Ops/s | |
test_values_stack_nested_leaf | 14.9192ms | 13.1568ms | 76.0065 Ops/s | 79.1617 Ops/s | |
test_membership | 16.9000μs | 3.8552μs | 259.3873 KOps/s | 300.0166 KOps/s | |
test_membership_nested | 31.9010μs | 7.2286μs | 138.3399 KOps/s | 148.2362 KOps/s | |
test_membership_nested_leaf | 19.7000μs | 6.8661μs | 145.6437 KOps/s | 155.6863 KOps/s | |
test_membership_stacked_nested | 35.9000μs | 8.1831μs | 122.2029 KOps/s | 103.6075 KOps/s | |
test_membership_stacked_nested_leaf | 24.2010μs | 8.1982μs | 121.9781 KOps/s | 100.7832 KOps/s | |
test_stacked_getleaf | 1.3514ms | 1.1684ms | 855.8855 Ops/s | 888.1331 Ops/s | |
test_stacked_get | 1.2871ms | 1.1161ms | 895.9566 Ops/s | 937.3480 Ops/s | |
test_common_ops | 0.9909ms | 0.9749ms | 1.0257 KOps/s | 1.0420 KOps/s | |
test_creation | 4.6901μs | 4.2761μs | 233.8581 KOps/s | 232.7368 KOps/s | |
test_creation_empty | 12.1981μs | 11.1793μs | 89.4511 KOps/s | 92.1893 KOps/s | |
test_creation_nested_1 | 26.9043μs | 21.2555μs | 47.0467 KOps/s | 49.7255 KOps/s | |
test_creation_nested_2 | 22.7642μs | 22.0919μs | 45.2654 KOps/s | 47.4513 KOps/s | |
test_clone | 25.2383μs | 20.8537μs | 47.9530 KOps/s | 49.3350 KOps/s | |
test_getitem[int] | 30.0285μs | 26.3154μs | 38.0005 KOps/s | 38.4724 KOps/s | |
test_getitem[slice_int] | 64.4604μs | 60.0198μs | 16.6612 KOps/s | 16.7869 KOps/s | |
test_getitem[range] | 65.8843μs | 59.7825μs | 16.7273 KOps/s | 17.0300 KOps/s | |
test_getitem[tuple] | 58.7836μs | 55.4191μs | 18.0443 KOps/s | 17.9770 KOps/s | |
test_getitem[list] | 55.8915μs | 52.6602μs | 18.9897 KOps/s | 19.2527 KOps/s | |
test_setitem_dim[int] | 83.4010μs | 42.6600μs | 23.4412 KOps/s | 24.8577 KOps/s | |
test_setitem_dim[slice_int] | 0.1253ms | 80.7280μs | 12.3873 KOps/s | 12.8439 KOps/s | |
test_setitem_dim[range] | 0.1496ms | 75.0665μs | 13.3215 KOps/s | 14.1239 KOps/s | |
test_setitem_dim[tuple] | 0.1281ms | 73.0613μs | 13.6871 KOps/s | 14.1808 KOps/s | |
test_setitem | 27.7493μs | 26.4340μs | 37.8301 KOps/s | 39.4136 KOps/s | |
test_set | 26.9932μs | 25.8746μs | 38.6480 KOps/s | 40.6163 KOps/s | |
test_set_shared | 0.1485ms | 0.1457ms | 6.8612 KOps/s | 6.3373 KOps/s | |
test_update | 31.5123μs | 29.3079μs | 34.1205 KOps/s | 35.0615 KOps/s | |
test_update_nested | 44.6154μs | 43.4254μs | 23.0280 KOps/s | 23.7707 KOps/s | |
test_set_nested | 38.4394μs | 36.3819μs | 27.4862 KOps/s | 28.5471 KOps/s | |
test_set_nested_new | 53.5255μs | 52.2223μs | 19.1489 KOps/s | 19.5300 KOps/s | |
test_select | 95.7009μs | 86.2516μs | 11.5940 KOps/s | 11.8463 KOps/s | |
test_creation[device0] | 1.3187ms | 0.5016ms | 1.9936 KOps/s | 2.0583 KOps/s | |
test_creation_from_tensor | 0.5852ms | 0.4577ms | 2.1848 KOps/s | 2.1970 KOps/s | |
test_add_one[memmap_tensor0] | 34.2503μs | 31.8379μs | 31.4091 KOps/s | 32.3545 KOps/s | |
test_contiguous[memmap_tensor0] | 9.0571μs | 8.3819μs | 119.3054 KOps/s | 120.6115 KOps/s | |
test_stack[memmap_tensor0] | 0.1583ms | 41.9203μs | 23.8548 KOps/s | 25.2523 KOps/s | |
test_reshape_pytree | 38.4154μs | 35.9348μs | 27.8282 KOps/s | 28.3052 KOps/s | |
test_reshape_td | 41.4604μs | 39.1378μs | 25.5507 KOps/s | 26.0853 KOps/s | |
test_view_pytree | 34.2933μs | 32.9657μs | 30.3346 KOps/s | 31.1175 KOps/s | |
test_view_td | 9.8981μs | 9.0522μs | 110.4709 KOps/s | 112.4788 KOps/s | |
test_unbind_pytree | 38.4123μs | 37.0029μs | 27.0249 KOps/s | 27.6981 KOps/s | |
test_unbind_td | 0.1215ms | 0.1195ms | 8.3658 KOps/s | 8.4587 KOps/s | |
test_split_pytree | 43.9444μs | 41.9206μs | 23.8546 KOps/s | 24.0195 KOps/s | |
test_split_td | 0.1046ms | 0.1007ms | 9.9346 KOps/s | 10.0078 KOps/s | |
test_add_pytree | 46.6314μs | 45.1509μs | 22.1479 KOps/s | 22.7410 KOps/s | |
test_add_td | 53.6605μs | 51.8624μs | 19.2818 KOps/s | 19.7142 KOps/s | |
test_distributed | 88.8000μs | 88.8000μs | 11.2613 KOps/s | 14.1044 KOps/s | |
test_tdmodule | 64.2010μs | 23.7975μs | 42.0211 KOps/s | 44.5772 KOps/s | |
test_tdmodule_dispatch | 0.2694ms | 54.2964μs | 18.4174 KOps/s | 20.0196 KOps/s | |
test_tdseq | 0.2415ms | 34.4640μs | 29.0158 KOps/s | 35.0184 KOps/s | |
test_tdseq_dispatch | 0.2352ms | 65.6611μs | 15.2297 KOps/s | 14.7098 KOps/s | |
test_instantiation_functorch | 10.5584ms | 1.6598ms | 602.4846 Ops/s | 639.7883 Ops/s | |
test_instantiation_td | 1.4014ms | 1.2153ms | 822.8716 Ops/s | 828.2471 Ops/s | |
test_exec_functorch | 0.1898ms | 0.1846ms | 5.4157 KOps/s | 5.6378 KOps/s | |
test_exec_td | 0.1763ms | 0.1720ms | 5.8156 KOps/s | 5.9759 KOps/s | |
test_vmap_mlp_speed[True-True] | 1.5639ms | 1.4735ms | 678.6638 Ops/s | 714.6856 Ops/s | |
test_vmap_mlp_speed[True-False] | 0.7010ms | 0.6707ms | 1.4910 KOps/s | 1.5872 KOps/s | |
test_vmap_mlp_speed[False-True] | 1.6535ms | 1.2664ms | 789.6526 Ops/s | 835.5476 Ops/s | |
test_vmap_mlp_speed[False-False] | 0.6962ms | 0.5570ms | 1.7953 KOps/s | 1.8778 KOps/s | |
test_vmap_transformer_speed[True-True] | 21.6093ms | 17.6126ms | 56.7776 Ops/s | 60.7536 Ops/s | |
test_vmap_transformer_speed[True-False] | 9.0201ms | 8.7367ms | 114.4591 Ops/s | 120.9275 Ops/s | |
test_vmap_transformer_speed[False-True] | 17.4961ms | 16.7805ms | 59.5931 Ops/s | 62.4206 Ops/s | |
test_vmap_transformer_speed[False-False] | 9.0050ms | 8.7104ms | 114.8049 Ops/s | 122.8104 Ops/s |
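
The Change column in the table above is empty, so the difference between this PR and Repo HEAD has to be read off by hand. As a minimal sketch (the helper names `parse_ops` and `relative_change` are hypothetical and are not part of this PR or its benchmark tooling), the relative change in throughput can be computed from the Ops and Ops on Repo HEAD cells like this:

```python
# Hypothetical helpers for reading the table above; not part of the PR's tooling.
# Assumption: every throughput cell is "<number> <unit>" with one of these units.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '274.0515 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * UNIT_SCALE[unit]


def relative_change(ops_pr: str, ops_head: str) -> float:
    """Percent change of the PR's throughput relative to Repo HEAD.

    Positive means the PR ran faster than HEAD, negative means slower.
    """
    pr = parse_ops(ops_pr)
    head = parse_ops(ops_head)
    return (pr - head) / head * 100.0


# Two rows copied from the table: (Ops, Ops on Repo HEAD).
rows = {
    "test_items": ("274.0515 KOps/s", "315.4834 KOps/s"),
    "test_membership_stacked_nested": ("122.2029 KOps/s", "103.6075 KOps/s"),
}

changes = {name: relative_change(*cells) for name, cells in rows.items()}
for name, pct in changes.items():
    print(f"{name}: {pct:+.2f}%")
```

For these two rows the sign differs: `test_items` is slower than HEAD and `test_membership_stacked_nested` is faster. Each row comes from a single benchmark run, and Max is often well above Mean (for example 13.0000μs against 3.6489μs for `test_items`), so small differences may be run-to-run noise rather than a real regression or speed-up.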
No description provided.