-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Quality] Better use of StrEnum in set_interaction_type #1087
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Nov 13, 2024
ghstack-source-id: c91a7a6be513fb46be6914df0b3bde779fa5528f Pull Request resolved: #1087
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 13, 2024
vmoens
added a commit
that referenced
this pull request
Nov 13, 2024
ghstack-source-id: c91a7a6be513fb46be6914df0b3bde779fa5528f Pull Request resolved: #1087
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 69.4200μs | 18.5043μs | 54.0414 KOps/s | 61.3913 KOps/s | |
test_plain_set_stack_nested | 56.8160μs | 18.2389μs | 54.8278 KOps/s | 59.5321 KOps/s | |
test_plain_set_nested_inplace | 70.7920μs | 20.1252μs | 49.6890 KOps/s | 55.4136 KOps/s | |
test_plain_set_stack_nested_inplace | 72.8160μs | 20.3519μs | 49.1354 KOps/s | 55.2703 KOps/s | |
test_items | 22.8630μs | 4.0736μs | 245.4833 KOps/s | 240.4326 KOps/s | |
test_items_nested | 0.6099ms | 0.3438ms | 2.9089 KOps/s | 2.9061 KOps/s | |
test_items_nested_locked | 0.7060ms | 0.3441ms | 2.9060 KOps/s | 2.8820 KOps/s | |
test_items_nested_leaf | 0.1296ms | 70.8181μs | 14.1207 KOps/s | 13.9680 KOps/s | |
test_items_stack_nested | 0.7075ms | 0.3471ms | 2.8806 KOps/s | 2.8770 KOps/s | |
test_items_stack_nested_leaf | 0.1356ms | 71.6340μs | 13.9598 KOps/s | 13.5257 KOps/s | |
test_items_stack_nested_locked | 0.6545ms | 0.3437ms | 2.9092 KOps/s | 2.8825 KOps/s | |
test_keys | 21.3600μs | 3.4983μs | 285.8515 KOps/s | 286.3059 KOps/s | |
test_keys_nested | 0.2248ms | 0.1368ms | 7.3124 KOps/s | 7.3575 KOps/s | |
test_keys_nested_locked | 1.6865ms | 0.1396ms | 7.1634 KOps/s | 7.1183 KOps/s | |
test_keys_nested_leaf | 0.1781ms | 0.1145ms | 8.7329 KOps/s | 8.5039 KOps/s | |
test_keys_stack_nested | 0.2934ms | 0.1339ms | 7.4662 KOps/s | 7.3979 KOps/s | |
test_keys_stack_nested_leaf | 0.2370ms | 0.1156ms | 8.6528 KOps/s | 8.6812 KOps/s | |
test_keys_stack_nested_locked | 0.2511ms | 0.1390ms | 7.1960 KOps/s | 7.1719 KOps/s | |
test_values | 4.7930μs | 1.0406μs | 961.0133 KOps/s | 949.2138 KOps/s | |
test_values_nested | 0.1639ms | 53.3917μs | 18.7295 KOps/s | 17.9216 KOps/s | |
test_values_nested_locked | 0.1042ms | 53.9652μs | 18.5305 KOps/s | 16.2071 KOps/s | |
test_values_nested_leaf | 0.1437ms | 59.4806μs | 16.8122 KOps/s | 16.4956 KOps/s | |
test_values_stack_nested | 0.1121ms | 54.9260μs | 18.2063 KOps/s | 17.3581 KOps/s | |
test_values_stack_nested_leaf | 0.1161ms | 59.4014μs | 16.8346 KOps/s | 16.5827 KOps/s | |
test_values_stack_nested_locked | 0.1082ms | 54.1999μs | 18.4502 KOps/s | 17.4159 KOps/s | |
test_membership | 5.3386μs | 0.7265μs | 1.3764 MOps/s | 1.1314 MOps/s | |
test_membership_nested | 19.8260μs | 2.7139μs | 368.4706 KOps/s | 358.7783 KOps/s | |
test_membership_nested_leaf | 39.6840μs | 2.7138μs | 368.4852 KOps/s | 361.6472 KOps/s | |
test_membership_stacked_nested | 21.4100μs | 2.6868μs | 372.1891 KOps/s | 368.1639 KOps/s | |
test_membership_stacked_nested_leaf | 28.8030μs | 2.7015μs | 370.1666 KOps/s | 361.4936 KOps/s | |
test_membership_nested_last | 60.0720μs | 4.0000μs | 249.9974 KOps/s | 242.7534 KOps/s | |
test_membership_nested_leaf_last | 26.6800μs | 3.9935μs | 250.4074 KOps/s | 244.4852 KOps/s | |
test_membership_stacked_nested_last | 23.9150μs | 3.9860μs | 250.8796 KOps/s | 137.8739 KOps/s | |
test_membership_stacked_nested_leaf_last | 25.4680μs | 4.0570μs | 246.4892 KOps/s | 140.5604 KOps/s | |
test_nested_getleaf | 46.9270μs | 10.4404μs | 95.7821 KOps/s | 94.3164 KOps/s | |
test_nested_get | 36.0270μs | 9.7941μs | 102.1028 KOps/s | 99.4722 KOps/s | |
test_stacked_getleaf | 37.7200μs | 10.3890μs | 96.2559 KOps/s | 94.9059 KOps/s | |
test_stacked_get | 47.1780μs | 9.7106μs | 102.9802 KOps/s | 100.2969 KOps/s | |
test_nested_getitemleaf | 46.1460μs | 10.7313μs | 93.1857 KOps/s | 91.7030 KOps/s | |
test_nested_getitem | 45.8960μs | 10.1908μs | 98.1281 KOps/s | 98.2984 KOps/s | |
test_stacked_getitemleaf | 36.4480μs | 10.8796μs | 91.9151 KOps/s | 91.8816 KOps/s | |
test_stacked_getitem | 52.1080μs | 10.1948μs | 98.0895 KOps/s | 99.3743 KOps/s | |
test_lock_nested | 3.0270ms | 0.4465ms | 2.2395 KOps/s | 1.8101 KOps/s | |
test_lock_stack_nested | 0.7020ms | 0.4124ms | 2.4246 KOps/s | 2.4620 KOps/s | |
test_unlock_nested | 0.7213ms | 0.3580ms | 2.7931 KOps/s | 2.7682 KOps/s | |
test_unlock_stack_nested | 0.5079ms | 0.3310ms | 3.0211 KOps/s | 3.0868 KOps/s | |
test_flatten_speed | 0.1626ms | 92.0235μs | 10.8668 KOps/s | 11.0268 KOps/s | |
test_unflatten_speed | 0.8372ms | 0.4676ms | 2.1387 KOps/s | 2.0866 KOps/s | |
test_common_ops | 5.0381ms | 0.7868ms | 1.2710 KOps/s | 1.4250 KOps/s | |
test_creation | 60.8240μs | 2.0475μs | 488.3893 KOps/s | 472.0002 KOps/s | |
test_creation_empty | 49.0320μs | 12.0304μs | 83.1230 KOps/s | 117.2208 KOps/s | |
test_creation_nested_1 | 40.3650μs | 14.7785μs | 67.6659 KOps/s | 88.3209 KOps/s | |
test_creation_nested_2 | 50.6850μs | 19.0358μs | 52.5325 KOps/s | 63.9391 KOps/s | |
test_clone | 67.2950μs | 13.1843μs | 75.8479 KOps/s | 75.4151 KOps/s | |
test_getitem[int] | 1.0628ms | 13.2545μs | 75.4460 KOps/s | 80.0128 KOps/s | |
test_getitem[slice_int] | 0.1526ms | 25.0160μs | 39.9744 KOps/s | 41.5446 KOps/s | |
test_getitem[range] | 0.2008ms | 48.7615μs | 20.5080 KOps/s | 21.2662 KOps/s | |
test_getitem[tuple] | 0.1368ms | 20.7408μs | 48.2142 KOps/s | 50.0513 KOps/s | |
test_getitem[list] | 0.1918ms | 44.3601μs | 22.5428 KOps/s | 23.3529 KOps/s | |
test_setitem_dim[int] | 48.4110μs | 25.6116μs | 39.0448 KOps/s | 39.3948 KOps/s | |
test_setitem_dim[slice_int] | 91.2000μs | 51.4188μs | 19.4481 KOps/s | 19.3618 KOps/s | |
test_setitem_dim[range] | 0.1300ms | 73.3779μs | 13.6281 KOps/s | 13.8368 KOps/s | |
test_setitem_dim[tuple] | 78.7070μs | 40.5910μs | 24.6360 KOps/s | 24.5011 KOps/s | |
test_setitem | 83.7160μs | 20.6279μs | 48.4781 KOps/s | 53.0871 KOps/s | |
test_set | 75.9010μs | 20.4848μs | 48.8166 KOps/s | 54.8140 KOps/s | |
test_set_shared | 1.2558ms | 0.1708ms | 5.8564 KOps/s | 6.0029 KOps/s | |
test_update | 0.1950ms | 23.6054μs | 42.3631 KOps/s | 51.4130 KOps/s | |
test_update_nested | 0.1041ms | 32.8426μs | 30.4482 KOps/s | 33.8093 KOps/s | |
test_update__nested | 1.0500ms | 34.6020μs | 28.9001 KOps/s | 30.0866 KOps/s | |
test_set_nested | 80.9810μs | 22.5075μs | 44.4295 KOps/s | 48.5879 KOps/s | |
test_set_nested_new | 75.0300μs | 26.8933μs | 37.1840 KOps/s | 40.1327 KOps/s | |
test_select | 0.1134ms | 43.0814μs | 23.2119 KOps/s | 24.7907 KOps/s | |
test_select_nested | 0.1423ms | 59.4101μs | 16.8321 KOps/s | 16.9226 KOps/s | |
test_exclude_nested | 0.3400ms | 75.1750μs | 13.3023 KOps/s | 13.0666 KOps/s | |
test_empty[True] | 0.5433ms | 0.3456ms | 2.8934 KOps/s | 2.8504 KOps/s | |
test_empty[False] | 8.9590μs | 1.2444μs | 803.6006 KOps/s | 797.2712 KOps/s | |
test_unbind_speed | 0.3365ms | 0.2609ms | 3.8327 KOps/s | 3.8651 KOps/s | |
test_unbind_speed_stack0 | 0.4999ms | 0.2593ms | 3.8561 KOps/s | 3.9803 KOps/s | |
test_unbind_speed_stack1 | 0.1057s | 0.7676ms | 1.3028 KOps/s | 1.4621 KOps/s | |
test_split | 0.1055s | 1.7809ms | 561.5261 Ops/s | 568.7199 Ops/s | |
test_chunk | 0.1021s | 1.7738ms | 563.7591 Ops/s | 573.0017 Ops/s | |
test_consolidate_njt[False-None] | 8.5706ms | 8.1763ms | 122.3042 Ops/s | 122.7524 Ops/s | |
test_creation[device0] | 0.2186ms | 90.5594μs | 11.0425 KOps/s | 11.0113 KOps/s | |
test_creation_from_tensor | 4.7050ms | 96.1511μs | 10.4003 KOps/s | 10.5392 KOps/s | |
test_add_one[memmap_tensor0] | 0.1681ms | 5.0162μs | 199.3529 KOps/s | 205.4100 KOps/s | |
test_contiguous[memmap_tensor0] | 12.6540μs | 0.5153μs | 1.9405 MOps/s | 1.9633 MOps/s | |
test_stack[memmap_tensor0] | 32.6010μs | 3.5534μs | 281.4245 KOps/s | 293.9670 KOps/s | |
test_memmaptd_index | 0.9247ms | 0.2410ms | 4.1493 KOps/s | 4.2093 KOps/s | |
test_memmaptd_index_astensor | 0.5784ms | 0.3188ms | 3.1371 KOps/s | 3.1639 KOps/s | |
test_memmaptd_index_op | 1.1280ms | 0.6168ms | 1.6212 KOps/s | 1.8424 KOps/s | |
test_serialize_model | 0.1278s | 0.1157s | 8.6398 Ops/s | 8.8378 Ops/s | |
test_serialize_model_pickle | 0.4454s | 0.3868s | 2.5852 Ops/s | 2.5565 Ops/s | |
test_serialize_weights | 0.2147s | 0.1281s | 7.8041 Ops/s | 7.9610 Ops/s | |
test_serialize_weights_returnearly | 0.1677s | 0.1587s | 6.3009 Ops/s | 6.5679 Ops/s | |
test_serialize_weights_pickle | 0.5071s | 0.4527s | 2.2091 Ops/s | 1.0816 Ops/s | |
test_serialize_weights_filesystem | 0.1488s | 0.1397s | 7.1556 Ops/s | 7.2451 Ops/s | |
test_serialize_model_filesystem | 0.2395s | 0.1576s | 6.3456 Ops/s | 6.4908 Ops/s | |
test_reshape_pytree | 59.9720μs | 26.7817μs | 37.3389 KOps/s | 36.7046 KOps/s | |
test_reshape_td | 70.2610μs | 33.5069μs | 29.8446 KOps/s | 30.3681 KOps/s | |
test_view_pytree | 74.4680μs | 27.3882μs | 36.5121 KOps/s | 37.0260 KOps/s | |
test_view_td | 78.7370μs | 38.2249μs | 26.1610 KOps/s | 25.5332 KOps/s | |
test_unbind_pytree | 69.1390μs | 30.1428μs | 33.1754 KOps/s | 33.4777 KOps/s | |
test_unbind_td | 0.3201ms | 38.3959μs | 26.0445 KOps/s | 26.3251 KOps/s | |
test_split_pytree | 81.0010μs | 30.0997μs | 33.2230 KOps/s | 33.8392 KOps/s | |
test_split_td | 0.5132ms | 44.7321μs | 22.3553 KOps/s | 22.7975 KOps/s | |
test_add_pytree | 76.6230μs | 35.8683μs | 27.8797 KOps/s | 27.5891 KOps/s | |
test_add_td | 0.1073ms | 58.6806μs | 17.0414 KOps/s | 19.0059 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1154ms | 63.0498μs | 15.8605 KOps/s | 16.4320 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.5165ms | 0.1620ms | 6.1734 KOps/s | 6.2384 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1603ms | 45.8977μs | 21.7876 KOps/s | 22.1039 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2667ms | 0.1206ms | 8.2925 KOps/s | 8.5047 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 68.6680μs | 26.7237μs | 37.4199 KOps/s | 38.7123 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1147ms | 53.9360μs | 18.5405 KOps/s | 18.3902 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1675ms | 78.3640μs | 12.7610 KOps/s | 12.5745 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1649ms | 67.0907μs | 14.9052 KOps/s | 14.7841 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1841ms | 0.1060ms | 9.4298 KOps/s | 9.7242 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4215ms | 0.1992ms | 5.0190 KOps/s | 5.0558 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1641ms | 46.6190μs | 21.4505 KOps/s | 22.5946 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4589ms | 63.0654μs | 15.8565 KOps/s | 16.2988 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1799ms | 0.1038ms | 9.6295 KOps/s | 9.9606 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.2900ms | 0.2001ms | 4.9978 KOps/s | 5.0160 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3675ms | 0.2106ms | 4.7477 KOps/s | 4.7806 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2151ms | 0.1086ms | 9.2088 KOps/s | 9.6580 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2731ms | 58.6120μs | 17.0614 KOps/s | 18.7553 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2867ms | 48.3853μs | 20.6674 KOps/s | 22.3968 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6388ms | 0.1578ms | 6.3374 KOps/s | 6.3222 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2302ms | 0.1044ms | 9.5772 KOps/s | 10.0192 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 52.3880μs | 21.3674μs | 46.8002 KOps/s | 47.3257 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1916ms | 59.4683μs | 16.8157 KOps/s | 16.6475 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1569ms | 80.8619μs | 12.3668 KOps/s | 12.4715 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1368ms | 68.6262μs | 14.5717 KOps/s | 14.4908 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2998ms | 0.2105ms | 4.7509 KOps/s | 4.8669 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.0292ms | 1.2704ms | 787.1579 Ops/s | 777.7806 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3069ms | 0.2045ms | 4.8891 KOps/s | 5.0413 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.6139ms | 0.7820ms | 1.2788 KOps/s | 1.3000 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5545ms | 0.4596ms | 2.1759 KOps/s | 2.2230 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.4323ms | 2.6898ms | 371.7715 Ops/s | 416.0850 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 91.2100μs | 37.5304μs | 26.6450 KOps/s | 28.4713 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5083ms | 33.9138μs | 29.4865 KOps/s | 31.4695 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 99.7550μs | 30.1028μs | 33.2195 KOps/s | 34.7470 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 68.9880μs | 23.8304μs | 41.9632 KOps/s | 43.0562 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 78.6870μs | 30.7162μs | 32.5561 KOps/s | 33.6219 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 81.8120μs | 24.2047μs | 41.3144 KOps/s | 42.3829 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1222ms | 51.9306μs | 19.2565 KOps/s | 19.5228 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5787ms | 20.4952μs | 48.7919 KOps/s | 50.3628 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1018ms | 44.3031μs | 22.5718 KOps/s | 22.9807 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 83.8360μs | 19.0741μs | 52.4271 KOps/s | 53.2498 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1037ms | 44.1338μs | 22.6584 KOps/s | 22.4780 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 52.1980μs | 18.7462μs | 53.3442 KOps/s | 52.9495 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1225ms | 51.9727μs | 19.2409 KOps/s | 19.3580 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9210ms | 20.7097μs | 48.2864 KOps/s | 51.0076 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1183ms | 44.8467μs | 22.2982 KOps/s | 22.4373 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.5420ms | 19.0468μs | 52.5022 KOps/s | 53.8246 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1189ms | 44.3702μs | 22.5376 KOps/s | 22.6880 KOps/s | |
test_compile_indexing[int-pytree-eager] | 53.1890μs | 18.9328μs | 52.8183 KOps/s | 53.5607 KOps/s | |
test_mod_add[eager] | 97.3810μs | 27.2951μs | 36.6366 KOps/s | 39.5830 KOps/s | |
test_mod_add[compile] | 0.1100ms | 45.8519μs | 21.8093 KOps/s | 21.9963 KOps/s | |
test_mod_add[compile-overhead] | 0.1051ms | 45.7395μs | 21.8630 KOps/s | 22.0983 KOps/s | |
test_mod_wrap[eager] | 0.3791ms | 0.2204ms | 4.5380 KOps/s | 4.7858 KOps/s | |
test_mod_wrap[compile] | 1.2420ms | 0.2080ms | 4.8088 KOps/s | 4.9766 KOps/s | |
test_mod_wrap[compile-overhead] | 1.4759ms | 0.2071ms | 4.8293 KOps/s | 4.9975 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.2620ms | 11.8514ms | 84.3784 Ops/s | 92.0458 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.9273ms | 12.8694ms | 77.7039 Ops/s | 93.7276 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 16.0870ms | 11.4145ms | 87.6082 Ops/s | 91.9859 Ops/s | |
test_seq_add[eager] | 0.1642ms | 92.3512μs | 10.8282 KOps/s | 11.3659 KOps/s | |
test_seq_add[compile] | 0.1425ms | 61.9244μs | 16.1487 KOps/s | 16.9519 KOps/s | |
test_seq_add[compile-overhead] | 0.1121ms | 60.9247μs | 16.4137 KOps/s | 17.3890 KOps/s | |
test_seq_wrap[eager] | 0.6089ms | 0.3986ms | 2.5090 KOps/s | 2.6623 KOps/s | |
test_seq_wrap[compile] | 0.3285ms | 0.2299ms | 4.3491 KOps/s | 4.5137 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4188ms | 0.2292ms | 4.3635 KOps/s | 4.5128 KOps/s | |
test_func_call_runtime[False-eager] | 0.8624ms | 0.5624ms | 1.7781 KOps/s | 1.8724 KOps/s | |
test_func_call_runtime[False-compile] | 0.8044ms | 0.4369ms | 2.2890 KOps/s | 2.3675 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7996ms | 0.4357ms | 2.2953 KOps/s | 2.3421 KOps/s | |
test_func_call_runtime[True-eager] | 0.9298ms | 0.7719ms | 1.2955 KOps/s | 1.3402 KOps/s | |
test_func_call_runtime[True-compile] | 0.5764ms | 0.4709ms | 2.1237 KOps/s | 2.1541 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6769ms | 0.4713ms | 2.1218 KOps/s | 2.1652 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7617ms | 0.5619ms | 1.7796 KOps/s | 1.8772 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.7744ms | 0.4353ms | 2.2975 KOps/s | 2.3539 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7611ms | 0.4367ms | 2.2899 KOps/s | 2.3458 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0763ms | 0.9089ms | 1.1003 KOps/s | 1.1128 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.1400ms | 0.5019ms | 1.9925 KOps/s | 2.0508 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9198ms | 0.5060ms | 1.9761 KOps/s | 2.0549 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4227ms | 1.9092ms | 523.7663 Ops/s | 530.1247 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9215ms | 0.5261ms | 1.9006 KOps/s | 1.9516 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7036ms | 0.5227ms | 1.9133 KOps/s | 1.9519 KOps/s | |
test_distributed | 0.2415ms | 0.1309ms | 7.6413 KOps/s | 7.9004 KOps/s | |
test_tdmodule | 47.8890μs | 19.5054μs | 51.2680 KOps/s | 58.3648 KOps/s | |
test_tdmodule_dispatch | 70.3320μs | 38.1131μs | 26.2377 KOps/s | 30.3810 KOps/s | |
test_tdseq | 42.9000μs | 23.2419μs | 43.0258 KOps/s | 51.3279 KOps/s | |
test_tdseq_dispatch | 75.7520μs | 44.6838μs | 22.3795 KOps/s | 26.3995 KOps/s | |
test_instantiation_functorch | 1.8304ms | 1.5521ms | 644.2837 Ops/s | 658.8427 Ops/s | |
test_exec_functorch | 0.3271ms | 0.1803ms | 5.5459 KOps/s | 5.6909 KOps/s | |
test_exec_functional_call | 0.2741ms | 0.1783ms | 5.6087 KOps/s | 5.9328 KOps/s | |
test_exec_td_decorator | 0.4724ms | 0.2344ms | 4.2655 KOps/s | 4.3882 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.4189ms | 0.6457ms | 1.5487 KOps/s | 1.6139 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8766ms | 0.6477ms | 1.5440 KOps/s | 1.6011 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8559ms | 0.5266ms | 1.8990 KOps/s | 1.9199 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7358ms | 0.5246ms | 1.9063 KOps/s | 1.9344 KOps/s | |
test_to_module_speed[True] | 2.0365ms | 1.3144ms | 760.8227 Ops/s | 781.1927 Ops/s | |
test_to_module_speed[False] | 1.9700ms | 1.2681ms | 788.5705 Ops/s | 799.4008 Ops/s | |
test_tc_init | 94.5760μs | 45.6625μs | 21.8998 KOps/s | 23.0962 KOps/s | |
test_tc_init_nested | 0.1939ms | 90.0539μs | 11.1045 KOps/s | 11.3617 KOps/s | |
test_tc_first_layer_tensor | 21.0990μs | 1.5176μs | 658.9179 KOps/s | 613.9251 KOps/s | |
test_tc_first_layer_nontensor | 22.9930μs | 4.5723μs | 218.7063 KOps/s | 206.8475 KOps/s | |
test_tc_second_layer_tensor | 22.3120μs | 2.7701μs | 361.0006 KOps/s | 335.0777 KOps/s | |
test_tc_second_layer_nontensor | 35.7670μs | 5.9698μs | 167.5085 KOps/s | 160.8698 KOps/s | |
test_unbind | 0.2069s | 13.0928ms | 76.3777 Ops/s | 83.0251 Ops/s | |
test_full_like | 8.1580ms | 7.0680ms | 141.4821 Ops/s | 136.9367 Ops/s | |
test_zeros_like | 3.0569ms | 2.6881ms | 372.0062 Ops/s | 358.8711 Ops/s | |
test_ones_like | 3.7032ms | 3.2408ms | 308.5613 Ops/s | 301.9916 Ops/s | |
test_clone | 5.1122ms | 4.8817ms | 204.8483 Ops/s | 198.5908 Ops/s | |
test_squeeze | 54.8620μs | 12.0681μs | 82.8629 KOps/s | 82.6870 KOps/s | |
test_unsqueeze | 0.1714ms | 87.4070μs | 11.4407 KOps/s | 11.3761 KOps/s | |
test_split | 0.8531ms | 0.1915ms | 5.2215 KOps/s | 5.2384 KOps/s | |
test_permute | 0.3458ms | 0.2205ms | 4.5355 KOps/s | 4.6784 KOps/s | |
test_stack | 27.7131ms | 25.5232ms | 39.1801 Ops/s | 39.6075 Ops/s | |
test_cat | 30.0245ms | 25.4532ms | 39.2878 Ops/s | 39.8630 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 31.2010μs | 10.9677μs | 91.1769 KOps/s | 97.5847 KOps/s | |
test_plain_set_stack_nested | 28.1700μs | 11.0420μs | 90.5635 KOps/s | 96.7851 KOps/s | |
test_plain_set_nested_inplace | 39.0810μs | 11.7683μs | 84.9738 KOps/s | 89.1536 KOps/s | |
test_plain_set_stack_nested_inplace | 56.0010μs | 11.7652μs | 84.9961 KOps/s | 89.3660 KOps/s | |
test_items | 29.4610μs | 2.8829μs | 346.8775 KOps/s | 340.6430 KOps/s | |
test_items_nested | 0.3720ms | 0.3175ms | 3.1500 KOps/s | 3.1382 KOps/s | |
test_items_nested_locked | 0.3744ms | 0.3212ms | 3.1132 KOps/s | 3.1228 KOps/s | |
test_items_nested_leaf | 87.3510μs | 58.0675μs | 17.2213 KOps/s | 17.2384 KOps/s | |
test_items_stack_nested | 0.3736ms | 0.3224ms | 3.1019 KOps/s | 3.1133 KOps/s | |
test_items_stack_nested_leaf | 0.1061ms | 59.6377μs | 16.7679 KOps/s | 16.9815 KOps/s | |
test_items_stack_nested_locked | 0.3558ms | 0.3225ms | 3.1011 KOps/s | 3.0907 KOps/s | |
test_keys | 31.5110μs | 3.4806μs | 287.3065 KOps/s | 268.0537 KOps/s | |
test_keys_nested | 98.4920μs | 71.0969μs | 14.0653 KOps/s | 14.1769 KOps/s | |
test_keys_nested_locked | 0.6674ms | 75.8116μs | 13.1906 KOps/s | 13.0458 KOps/s | |
test_keys_nested_leaf | 89.4220μs | 61.5430μs | 16.2488 KOps/s | 16.0298 KOps/s | |
test_keys_stack_nested | 0.1128ms | 71.2178μs | 14.0414 KOps/s | 13.8997 KOps/s | |
test_keys_stack_nested_leaf | 88.1810μs | 62.2982μs | 16.0518 KOps/s | 15.9453 KOps/s | |
test_keys_stack_nested_locked | 0.1051ms | 76.7125μs | 13.0357 KOps/s | 13.0043 KOps/s | |
test_values | 14.1020μs | 0.8643μs | 1.1570 MOps/s | 1.1516 MOps/s | |
test_values_nested | 90.4320μs | 31.4089μs | 31.8381 KOps/s | 31.7128 KOps/s | |
test_values_nested_locked | 55.7820μs | 32.9762μs | 30.3249 KOps/s | 30.1718 KOps/s | |
test_values_nested_leaf | 72.9810μs | 33.9301μs | 29.4724 KOps/s | 29.4624 KOps/s | |
test_values_stack_nested | 56.1610μs | 32.1659μs | 31.0888 KOps/s | 31.4827 KOps/s | |
test_values_stack_nested_leaf | 61.0120μs | 34.2698μs | 29.1802 KOps/s | 29.1072 KOps/s | |
test_values_stack_nested_locked | 73.8310μs | 33.7009μs | 29.6728 KOps/s | 29.8773 KOps/s | |
test_membership | 1.7540μs | 0.5262μs | 1.9003 MOps/s | 1.8839 MOps/s | |
test_membership_nested | 17.3355μs | 1.8872μs | 529.8716 KOps/s | 540.2062 KOps/s | |
test_membership_nested_leaf | 18.1337μs | 1.8577μs | 538.3098 KOps/s | 542.5843 KOps/s | |
test_membership_stacked_nested | 35.5810μs | 1.9492μs | 513.0247 KOps/s | 505.3051 KOps/s | |
test_membership_stacked_nested_leaf | 35.4610μs | 1.9780μs | 505.5737 KOps/s | 505.7108 KOps/s | |
test_membership_nested_last | 28.9810μs | 2.8039μs | 356.6461 KOps/s | 360.5842 KOps/s | |
test_membership_nested_leaf_last | 28.6100μs | 2.7901μs | 358.4059 KOps/s | 359.2276 KOps/s | |
test_membership_stacked_nested_last | 25.5200μs | 3.2472μs | 307.9569 KOps/s | 352.7165 KOps/s | |
test_membership_stacked_nested_leaf_last | 36.3710μs | 3.2410μs | 308.5472 KOps/s | 358.0241 KOps/s | |
test_nested_getleaf | 40.8710μs | 5.9433μs | 168.2556 KOps/s | 165.9670 KOps/s | |
test_nested_get | 27.1210μs | 5.6665μs | 176.4758 KOps/s | 174.7200 KOps/s | |
test_stacked_getleaf | 36.8810μs | 5.9734μs | 167.4102 KOps/s | 166.8124 KOps/s | |
test_stacked_get | 44.4910μs | 5.6912μs | 175.7108 KOps/s | 175.4465 KOps/s | |
test_nested_getitemleaf | 28.8100μs | 6.0761μs | 164.5779 KOps/s | 164.1802 KOps/s | |
test_nested_getitem | 33.5310μs | 5.7747μs | 173.1688 KOps/s | 173.6816 KOps/s | |
test_stacked_getitemleaf | 27.6210μs | 6.0349μs | 165.7028 KOps/s | 163.8201 KOps/s | |
test_stacked_getitem | 39.8110μs | 5.7743μs | 173.1822 KOps/s | 173.1067 KOps/s | |
test_lock_nested | 9.3811ms | 0.3781ms | 2.6449 KOps/s | 2.6994 KOps/s | |
test_lock_stack_nested | 0.3757ms | 0.3384ms | 2.9552 KOps/s | 2.9796 KOps/s | |
test_unlock_nested | 0.7793ms | 0.3092ms | 3.2340 KOps/s | 3.2448 KOps/s | |
test_unlock_stack_nested | 0.3087ms | 0.2776ms | 3.6023 KOps/s | 3.6413 KOps/s | |
test_flatten_speed | 0.1063ms | 73.2237μs | 13.6568 KOps/s | 13.9558 KOps/s | |
test_unflatten_speed | 0.3289ms | 0.2927ms | 3.4167 KOps/s | 3.4374 KOps/s | |
test_common_ops | 1.7747ms | 0.5840ms | 1.7124 KOps/s | 1.7390 KOps/s | |
test_creation | 0.1012ms | 1.4774μs | 676.8866 KOps/s | 671.7622 KOps/s | |
test_creation_empty | 36.2610μs | 8.0111μs | 124.8265 KOps/s | 150.8052 KOps/s | |
test_creation_nested_1 | 1.6372ms | 9.4935μs | 105.3348 KOps/s | 123.0840 KOps/s | |
test_creation_nested_2 | 40.6200μs | 12.1388μs | 82.3807 KOps/s | 94.2240 KOps/s | |
test_clone | 50.1910μs | 10.0723μs | 99.2818 KOps/s | 92.7969 KOps/s | |
test_getitem[int] | 93.5192ms | 15.6549μs | 63.8778 KOps/s | 93.6130 KOps/s | |
test_getitem[slice_int] | 0.1039ms | 19.9262μs | 50.1851 KOps/s | 48.6599 KOps/s | |
test_getitem[range] | 0.1316ms | 36.6065μs | 27.3176 KOps/s | 26.8884 KOps/s | |
test_getitem[tuple] | 0.1091ms | 17.5860μs | 56.8633 KOps/s | 55.0919 KOps/s | |
test_getitem[list] | 0.2248ms | 32.0869μs | 31.1653 KOps/s | 30.4066 KOps/s | |
test_setitem_dim[int] | 38.9610μs | 17.8038μs | 56.1678 KOps/s | 52.8444 KOps/s | |
test_setitem_dim[slice_int] | 62.6720μs | 36.1534μs | 27.6599 KOps/s | 26.7797 KOps/s | |
test_setitem_dim[range] | 76.7610μs | 52.2357μs | 19.1440 KOps/s | 18.8946 KOps/s | |
test_setitem_dim[tuple] | 52.6810μs | 30.8996μs | 32.3628 KOps/s | 31.6466 KOps/s | |
test_setitem | 74.2810μs | 14.5792μs | 68.5907 KOps/s | 68.8240 KOps/s | |
test_set | 86.1720μs | 13.6925μs | 73.0325 KOps/s | 70.4541 KOps/s | |
test_set_shared | 1.4890ms | 0.1461ms | 6.8446 KOps/s | 6.7406 KOps/s | |
test_update | 0.3449ms | 16.5403μs | 60.4582 KOps/s | 61.3159 KOps/s | |
test_update_nested | 84.0620μs | 21.6430μs | 46.2042 KOps/s | 48.3300 KOps/s | |
test_update__nested | 0.5825ms | 23.7796μs | 42.0529 KOps/s | 41.0022 KOps/s | |
test_set_nested | 82.6220μs | 15.2352μs | 65.6373 KOps/s | 65.6441 KOps/s | |
test_set_nested_new | 86.2120μs | 17.3377μs | 57.6779 KOps/s | 57.9964 KOps/s | |
test_select | 91.2420μs | 29.1017μs | 34.3622 KOps/s | 35.1078 KOps/s | |
test_select_nested | 82.4520μs | 42.3158μs | 23.6319 KOps/s | 24.0781 KOps/s | |
test_exclude_nested | 94.1420μs | 59.5318μs | 16.7978 KOps/s | 16.8439 KOps/s | |
test_empty[True] | 0.3117ms | 0.2579ms | 3.8776 KOps/s | 3.8767 KOps/s | |
test_empty[False] | 3.2870μs | 0.7515μs | 1.3306 MOps/s | 1.3462 MOps/s | |
test_to | 86.2610μs | 56.0297μs | 17.8477 KOps/s | 18.0297 KOps/s | |
test_to_nonblocking | 85.2510μs | 45.0099μs | 22.2173 KOps/s | 21.9977 KOps/s | |
test_unbind_speed | 0.2621ms | 0.2333ms | 4.2859 KOps/s | 4.2836 KOps/s | |
test_unbind_speed_stack0 | 0.2907ms | 0.2364ms | 4.2298 KOps/s | 4.2830 KOps/s | |
test_unbind_speed_stack1 | 92.5633ms | 0.6636ms | 1.5069 KOps/s | 1.5328 KOps/s | |
test_split | 94.2461ms | 1.5586ms | 641.6218 Ops/s | 585.3625 Ops/s | |
test_chunk | 96.3190ms | 1.5718ms | 636.1943 Ops/s | 634.6425 Ops/s | |
test_consolidate[False-None] | 2.6877ms | 2.5796ms | 387.6632 Ops/s | 387.1267 Ops/s | |
test_consolidate[default-None] | 1.7053ms | 1.6342ms | 611.9167 Ops/s | 606.3289 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.7293ms | 1.6718ms | 598.1558 Ops/s | 592.3347 Ops/s | |
test_consolidate_njt[False-None] | 6.5812ms | 6.3191ms | 158.2504 Ops/s | 154.0608 Ops/s | |
test_to[False-False-None] | 1.7594ms | 1.6670ms | 599.8811 Ops/s | 592.5163 Ops/s | |
test_to[True-False-None] | 1.4974ms | 1.2252ms | 816.1811 Ops/s | 766.0645 Ops/s | |
test_to[within-False-None] | 4.0146ms | 3.9091ms | 255.8125 Ops/s | 250.9306 Ops/s | |
test_to[True-default-None] | 5.5736ms | 5.0950ms | 196.2709 Ops/s | 189.6743 Ops/s | |
test_to_njt[False-False-None] | 6.9679ms | 6.8082ms | 146.8823 Ops/s | 143.9741 Ops/s | |
test_to_njt[True-False-None] | 5.5276ms | 5.3148ms | 188.1539 Ops/s | 183.5675 Ops/s | |
test_to_njt[within-False-None] | 12.1252ms | 11.9164ms | 83.9176 Ops/s | 82.8130 Ops/s | |
test_creation[device0] | 0.4635ms | 78.6284μs | 12.7181 KOps/s | 11.9623 KOps/s | |
test_creation_from_tensor | 0.4571ms | 82.1826μs | 12.1680 KOps/s | 11.5215 KOps/s | |
test_add_one[memmap_tensor0] | 0.7969ms | 6.7283μs | 148.6270 KOps/s | 147.4919 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8110μs | 0.4113μs | 2.4315 MOps/s | 2.4656 MOps/s | |
test_stack[memmap_tensor0] | 38.7900μs | 4.5922μs | 217.7596 KOps/s | 226.9119 KOps/s | |
test_memmaptd_index | 1.7373ms | 0.2521ms | 3.9672 KOps/s | 4.0959 KOps/s | |
test_memmaptd_index_astensor | 0.5797ms | 0.3123ms | 3.2023 KOps/s | 3.2993 KOps/s | |
test_memmaptd_index_op | 1.0236ms | 0.5861ms | 1.7063 KOps/s | 1.7781 KOps/s | |
test_serialize_model | 0.4169s | 0.1716s | 5.8267 Ops/s | 7.6867 Ops/s | |
test_serialize_model_pickle | 1.3787s | 1.2186s | 0.8206 Ops/s | 0.8240 Ops/s | |
test_serialize_weights | 0.1309s | 0.1304s | 7.6694 Ops/s | 7.7109 Ops/s | |
test_serialize_weights_returnearly | 0.3186s | 53.4740ms | 18.7007 Ops/s | 14.5993 Ops/s | |
test_serialize_weights_pickle | 1.3529s | 1.2125s | 0.8247 Ops/s | 0.8258 Ops/s | |
test_reshape_pytree | 51.6510μs | 21.8991μs | 45.6639 KOps/s | 45.3120 KOps/s | |
test_reshape_td | 47.4910μs | 26.1908μs | 38.1813 KOps/s | 37.0378 KOps/s | |
test_view_pytree | 48.8010μs | 21.9828μs | 45.4902 KOps/s | 44.6860 KOps/s | |
test_view_td | 56.4410μs | 29.1695μs | 34.2824 KOps/s | 32.8259 KOps/s | |
test_unbind_pytree | 54.5410μs | 27.5916μs | 36.2429 KOps/s | 34.9198 KOps/s | |
test_unbind_td | 0.7630ms | 35.5266μs | 28.1479 KOps/s | 26.8295 KOps/s | |
test_split_pytree | 59.8810μs | 30.0568μs | 33.2704 KOps/s | 33.4884 KOps/s | |
test_split_td | 0.9307ms | 37.5154μs | 26.6557 KOps/s | 25.4092 KOps/s | |
test_add_pytree | 63.9410μs | 33.9946μs | 29.4165 KOps/s | 28.5215 KOps/s | |
test_add_td | 82.4810μs | 45.2332μs | 22.1077 KOps/s | 19.4817 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1742ms | 0.1186ms | 8.4315 KOps/s | 8.1282 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2216ms | 0.1249ms | 8.0090 KOps/s | 8.0225 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1697ms | 94.1642μs | 10.6197 KOps/s | 10.3960 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.6611ms | 0.1491ms | 6.7060 KOps/s | 6.5872 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 53.4710μs | 22.7985μs | 43.8626 KOps/s | 42.7160 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 73.4510μs | 26.6527μs | 37.5197 KOps/s | 36.5454 KOps/s | |
test_compile_copy_nested[pytree-compile] | 96.5720μs | 64.5708μs | 15.4869 KOps/s | 15.2507 KOps/s | |
test_compile_copy_nested[pytree-eager] | 80.7510μs | 49.2407μs | 20.3084 KOps/s | 19.7471 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1973ms | 0.1420ms | 7.0401 KOps/s | 6.8777 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2931ms | 0.2075ms | 4.8188 KOps/s | 4.8125 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1344ms | 97.5529μs | 10.2509 KOps/s | 10.0189 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1092ms | 51.5175μs | 19.4109 KOps/s | 19.4830 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2588ms | 0.1354ms | 7.3840 KOps/s | 7.2832 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5327ms | 0.4766ms | 2.0980 KOps/s | 2.0347 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4012ms | 0.2481ms | 4.0302 KOps/s | 4.0200 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1821ms | 0.1424ms | 7.0203 KOps/s | 6.9140 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1444ms | 60.7100μs | 16.4718 KOps/s | 16.2893 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1607ms | 98.1031μs | 10.1934 KOps/s | 10.1036 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4566ms | 0.4092ms | 2.4438 KOps/s | 2.4606 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1737ms | 0.1363ms | 7.3384 KOps/s | 7.2921 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 56.7610μs | 19.7475μs | 50.6393 KOps/s | 53.6197 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 57.8410μs | 26.9664μs | 37.0832 KOps/s | 36.7959 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2052ms | 70.0202μs | 14.2816 KOps/s | 14.2516 KOps/s | |
test_compile_copy_flat[pytree-eager] | 81.3820μs | 51.7809μs | 19.3122 KOps/s | 19.4565 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.5840ms | 0.3847ms | 2.5994 KOps/s | 2.2540 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.6679ms | 2.5535ms | 391.6177 Ops/s | 390.7478 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5611ms | 0.3763ms | 2.6573 KOps/s | 2.2605 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.1382ms | 2.6021ms | 384.3041 Ops/s | 378.3354 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.8365ms | 0.1170ms | 8.5451 KOps/s | 8.8909 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5856ms | 82.2530μs | 12.1576 KOps/s | 12.5904 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.5952ms | 0.1088ms | 9.1948 KOps/s | 9.5318 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1132ms | 68.0479μs | 14.6955 KOps/s | 14.8589 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1540ms | 0.1090ms | 9.1732 KOps/s | 9.4903 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1770ms | 68.1975μs | 14.6633 KOps/s | 14.7897 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1469ms | 99.4305μs | 10.0573 KOps/s | 9.7389 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1438ms | 16.4721μs | 60.7086 KOps/s | 56.9352 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1460ms | 94.3945μs | 10.5938 KOps/s | 10.3476 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 40.9110μs | 15.6089μs | 64.0661 KOps/s | 62.7979 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1393ms | 94.9804μs | 10.5285 KOps/s | 10.3238 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 43.1310μs | 15.4736μs | 64.6263 KOps/s | 63.2458 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1424ms | 99.2176μs | 10.0789 KOps/s | 9.8373 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5663ms | 16.3640μs | 61.1098 KOps/s | 58.0525 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1353ms | 94.8169μs | 10.5466 KOps/s | 10.3179 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 47.0610μs | 15.3868μs | 64.9908 KOps/s | 63.1010 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1412ms | 94.6627μs | 10.5638 KOps/s | 10.2971 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1558ms | 15.7878μs | 63.3401 KOps/s | 63.4452 KOps/s | |
test_mod_add[eager] | 67.5610μs | 30.8196μs | 32.4469 KOps/s | 33.2078 KOps/s | |
test_mod_add[compile] | 0.3869ms | 75.9105μs | 13.1734 KOps/s | 13.0511 KOps/s | |
test_mod_add[compile-overhead] | 0.3108ms | 0.1636ms | 6.1133 KOps/s | 5.7644 KOps/s | |
test_mod_wrap[eager] | 0.3207ms | 0.2501ms | 3.9977 KOps/s | 3.8689 KOps/s | |
test_mod_wrap[compile] | 1.5730ms | 0.2769ms | 3.6118 KOps/s | 3.5242 KOps/s | |
test_mod_wrap[compile-overhead] | 7.2769ms | 3.8053ms | 262.7931 Ops/s | 261.7007 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.8077ms | 1.4177ms | 705.3699 Ops/s | 691.0061 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5020ms | 1.3409ms | 745.7586 Ops/s | 723.1784 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4974ms | 1.0192ms | 981.1355 Ops/s | 970.1675 Ops/s | |
test_seq_add[eager] | 0.1398ms | 94.1734μs | 10.6187 KOps/s | 10.3616 KOps/s | |
test_seq_add[compile] | 0.1834ms | 85.8046μs | 11.6544 KOps/s | 11.6049 KOps/s | |
test_seq_add[compile-overhead] | 0.1694ms | 0.1276ms | 7.8342 KOps/s | 7.8193 KOps/s | |
test_seq_wrap[eager] | 0.4639ms | 0.3851ms | 2.5968 KOps/s | 2.6549 KOps/s | |
test_seq_wrap[compile] | 0.3669ms | 0.2914ms | 3.4313 KOps/s | 3.3202 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2696ms | 0.2215ms | 4.5153 KOps/s | 4.4835 KOps/s | |
test_func_call_runtime[False-eager] | 0.7831ms | 0.7209ms | 1.3872 KOps/s | 1.3350 KOps/s | |
test_func_call_runtime[False-compile] | 0.7725ms | 0.7230ms | 1.3830 KOps/s | 1.3007 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4097ms | 0.3587ms | 2.7876 KOps/s | 2.7943 KOps/s | |
test_func_call_runtime[True-eager] | 0.9699ms | 0.8863ms | 1.1283 KOps/s | 1.1049 KOps/s | |
test_func_call_runtime[True-compile] | 0.7953ms | 0.7461ms | 1.3403 KOps/s | 1.3065 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4296ms | 0.3806ms | 2.6272 KOps/s | 2.6164 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8634ms | 0.7178ms | 1.3932 KOps/s | 1.2515 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8217ms | 0.7308ms | 1.3683 KOps/s | 1.3407 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4209ms | 0.3604ms | 2.7746 KOps/s | 2.7418 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0621ms | 0.9792ms | 1.0212 KOps/s | 959.0361 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8293ms | 0.7784ms | 1.2847 KOps/s | 1.2587 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4694ms | 0.4072ms | 2.4557 KOps/s | 2.4217 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5412ms | 2.0565ms | 486.2523 Ops/s | 475.4813 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8772ms | 0.7976ms | 1.2538 KOps/s | 1.2300 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4582ms | 0.4097ms | 2.4406 KOps/s | 2.4337 KOps/s | |
test_distributed | 1.7201ms | 0.2028ms | 4.9319 KOps/s | 8.7220 KOps/s | |
test_tdmodule | 44.5600μs | 13.8620μs | 72.1397 KOps/s | 75.2266 KOps/s | |
test_tdmodule_dispatch | 46.9710μs | 27.2872μs | 36.6472 KOps/s | 39.1769 KOps/s | |
test_tdseq | 34.8210μs | 15.2230μs | 65.6902 KOps/s | 69.4380 KOps/s | |
test_tdseq_dispatch | 51.9010μs | 30.3052μs | 32.9976 KOps/s | 34.6197 KOps/s | |
test_instantiation_functorch | 1.6432ms | 1.5092ms | 662.6124 Ops/s | 640.8817 Ops/s | |
test_exec_functorch | 0.1746ms | 0.1398ms | 7.1506 KOps/s | 6.7815 KOps/s | |
test_exec_functional_call | 0.1683ms | 0.1331ms | 7.5154 KOps/s | 7.0908 KOps/s | |
test_exec_td_decorator | 0.3608ms | 0.1789ms | 5.5902 KOps/s | 5.3638 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7807ms | 0.6678ms | 1.4976 KOps/s | 1.4619 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8589ms | 0.6812ms | 1.4680 KOps/s | 1.4624 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7073ms | 0.5855ms | 1.7079 KOps/s | 1.6553 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7190ms | 0.5996ms | 1.6679 KOps/s | 1.6479 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 18.9815ms | 18.9114ms | 52.8782 Ops/s | 52.1106 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.1376ms | 18.9925ms | 52.6523 Ops/s | 52.0261 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 18.8820ms | 18.8333ms | 53.0974 Ops/s | 52.5011 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 18.9062ms | 18.8224ms | 53.1281 Ops/s | 52.4551 Ops/s | |
test_to_module_speed[True] | 1.0431ms | 0.9299ms | 1.0753 KOps/s | 1.0634 KOps/s | |
test_to_module_speed[False] | 1.3826ms | 0.9128ms | 1.0956 KOps/s | 1.0750 KOps/s | |
test_tc_init | 74.4320μs | 35.2166μs | 28.3957 KOps/s | 30.2155 KOps/s | |
test_tc_init_nested | 0.1080ms | 67.9034μs | 14.7268 KOps/s | 15.0906 KOps/s | |
test_tc_first_layer_tensor | 5.0759μs | 0.6993μs | 1.4299 MOps/s | 1.4026 MOps/s | |
test_tc_first_layer_nontensor | 32.8310μs | 2.3425μs | 426.8929 KOps/s | 426.6787 KOps/s | |
test_tc_second_layer_tensor | 8.1953μs | 1.4049μs | 711.7858 KOps/s | 699.8819 KOps/s | |
test_tc_second_layer_nontensor | 26.5610μs | 3.0305μs | 329.9742 KOps/s | 326.7604 KOps/s | |
test_unbind | 0.2233s | 9.8794ms | 101.2209 Ops/s | 151.7244 Ops/s | |
test_full_like | 11.2402ms | 9.1188ms | 109.6632 Ops/s | 107.6621 Ops/s | |
test_zeros_like | 4.9071ms | 4.3142ms | 231.7926 Ops/s | 137.0900 Ops/s | |
test_ones_like | 4.8515ms | 4.3145ms | 231.7740 Ops/s | 231.9640 Ops/s | |
test_clone | 6.4869ms | 6.3253ms | 158.0964 Ops/s | 158.4570 Ops/s | |
test_squeeze | 57.7610μs | 9.2803μs | 107.7548 KOps/s | 109.8987 KOps/s | |
test_unsqueeze | 0.1215ms | 69.7481μs | 14.3373 KOps/s | 14.0978 KOps/s | |
test_split | 0.3890ms | 0.1593ms | 6.2772 KOps/s | 6.3662 KOps/s | |
test_permute | 0.2368ms | 0.1821ms | 5.4920 KOps/s | 5.5774 KOps/s | |
test_stack | 53.4491ms | 50.9235ms | 19.6373 Ops/s | 19.8729 Ops/s | |
test_cat | 50.6291ms | 50.3972ms | 19.8424 Ops/s | 20.1854 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):