-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Refactor] Use IntEnum for interaction types #989
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Sep 12, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 46.0970μs | 19.4204μs | 51.4923 KOps/s | 47.3198 KOps/s | |
test_plain_set_stack_nested | 50.2040μs | 19.8492μs | 50.3798 KOps/s | 46.4635 KOps/s | |
test_plain_set_nested_inplace | 53.7710μs | 21.2595μs | 47.0378 KOps/s | 44.0342 KOps/s | |
test_plain_set_stack_nested_inplace | 56.2760μs | 21.3210μs | 46.9020 KOps/s | 43.8728 KOps/s | |
test_items | 29.2740μs | 4.1798μs | 239.2460 KOps/s | 249.5432 KOps/s | |
test_items_nested | 0.6360ms | 0.3299ms | 3.0316 KOps/s | 3.0422 KOps/s | |
test_items_nested_locked | 0.4972ms | 0.3298ms | 3.0322 KOps/s | 3.0513 KOps/s | |
test_items_nested_leaf | 0.1615ms | 86.2459μs | 11.5948 KOps/s | 11.8367 KOps/s | |
test_items_stack_nested | 0.4054ms | 0.3321ms | 3.0109 KOps/s | 3.0238 KOps/s | |
test_items_stack_nested_leaf | 0.1329ms | 86.7124μs | 11.5324 KOps/s | 11.9625 KOps/s | |
test_items_stack_nested_locked | 0.6124ms | 0.3298ms | 3.0323 KOps/s | 3.0231 KOps/s | |
test_keys | 30.6080μs | 3.6129μs | 276.7856 KOps/s | 282.5521 KOps/s | |
test_keys_nested | 0.1639ms | 97.0415μs | 10.3049 KOps/s | 10.4422 KOps/s | |
test_keys_nested_locked | 1.5612ms | 0.1029ms | 9.7228 KOps/s | 9.9328 KOps/s | |
test_keys_nested_leaf | 0.1385ms | 82.7494μs | 12.0847 KOps/s | 12.3829 KOps/s | |
test_keys_stack_nested | 0.1647ms | 98.7073μs | 10.1310 KOps/s | 10.3860 KOps/s | |
test_keys_stack_nested_leaf | 0.1545ms | 83.5396μs | 11.9704 KOps/s | 12.3882 KOps/s | |
test_keys_stack_nested_locked | 0.1692ms | 0.1026ms | 9.7464 KOps/s | 9.8885 KOps/s | |
test_values | 6.0334μs | 1.0736μs | 931.4412 KOps/s | 918.6619 KOps/s | |
test_values_nested | 94.1060μs | 48.3899μs | 20.6655 KOps/s | 20.7737 KOps/s | |
test_values_nested_locked | 0.1050ms | 48.1448μs | 20.7707 KOps/s | 20.6774 KOps/s | |
test_values_nested_leaf | 0.1076ms | 42.6908μs | 23.4243 KOps/s | 23.4539 KOps/s | |
test_values_stack_nested | 89.6680μs | 48.4066μs | 20.6583 KOps/s | 20.7047 KOps/s | |
test_values_stack_nested_leaf | 77.2140μs | 42.4738μs | 23.5439 KOps/s | 23.8347 KOps/s | |
test_values_stack_nested_locked | 94.9170μs | 48.5480μs | 20.5982 KOps/s | 20.2248 KOps/s | |
test_membership | 29.4750μs | 0.8525μs | 1.1731 MOps/s | 1.1950 MOps/s | |
test_membership_nested | 28.4130μs | 2.5901μs | 386.0810 KOps/s | 389.2657 KOps/s | |
test_membership_nested_leaf | 28.9140μs | 2.5831μs | 387.1336 KOps/s | 387.4860 KOps/s | |
test_membership_stacked_nested | 73.7650μs | 2.5524μs | 391.7883 KOps/s | 393.3325 KOps/s | |
test_membership_stacked_nested_leaf | 39.8840μs | 2.5942μs | 385.4756 KOps/s | 389.8656 KOps/s | |
test_membership_nested_last | 27.5120μs | 3.7751μs | 264.8967 KOps/s | 265.2980 KOps/s | |
test_membership_nested_leaf_last | 35.2360μs | 3.7810μs | 264.4787 KOps/s | 260.9538 KOps/s | |
test_membership_stacked_nested_last | 25.4770μs | 3.7925μs | 263.6753 KOps/s | 171.8946 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.2770μs | 3.7645μs | 265.6370 KOps/s | 170.7513 KOps/s | |
test_nested_getleaf | 44.6330μs | 10.8575μs | 92.1025 KOps/s | 94.4631 KOps/s | |
test_nested_get | 38.0010μs | 10.1514μs | 98.5083 KOps/s | 99.4281 KOps/s | |
test_stacked_getleaf | 30.8980μs | 10.7272μs | 93.2209 KOps/s | 95.0193 KOps/s | |
test_stacked_get | 75.4340μs | 9.7304μs | 102.7708 KOps/s | 100.3640 KOps/s | |
test_nested_getitemleaf | 34.4140μs | 10.9673μs | 91.1805 KOps/s | 90.9330 KOps/s | |
test_nested_getitem | 55.9840μs | 9.9088μs | 100.9201 KOps/s | 98.3549 KOps/s | |
test_stacked_getitemleaf | 38.5520μs | 10.9052μs | 91.6995 KOps/s | 92.6345 KOps/s | |
test_stacked_getitem | 33.5430μs | 10.2475μs | 97.5850 KOps/s | 98.1807 KOps/s | |
test_lock_nested | 0.1022s | 0.6245ms | 1.6012 KOps/s | 2.0610 KOps/s | |
test_lock_stack_nested | 0.5876ms | 0.4846ms | 2.0634 KOps/s | 2.2079 KOps/s | |
test_unlock_nested | 0.1009s | 0.5477ms | 1.8258 KOps/s | 2.4220 KOps/s | |
test_unlock_stack_nested | 0.5169ms | 0.3996ms | 2.5023 KOps/s | 2.6807 KOps/s | |
test_flatten_speed | 0.2559ms | 0.1064ms | 9.4001 KOps/s | 9.5609 KOps/s | |
test_unflatten_speed | 0.8589ms | 0.4736ms | 2.1115 KOps/s | 2.1232 KOps/s | |
test_common_ops | 4.2851ms | 1.1205ms | 892.4485 Ops/s | 858.4087 Ops/s | |
test_creation | 20.3880μs | 2.0832μs | 480.0344 KOps/s | 471.2161 KOps/s | |
test_creation_empty | 52.8380μs | 16.2868μs | 61.3993 KOps/s | 52.6857 KOps/s | |
test_creation_nested_1 | 62.2270μs | 19.6462μs | 50.9003 KOps/s | 45.4853 KOps/s | |
test_creation_nested_2 | 64.4500μs | 23.6482μs | 42.2865 KOps/s | 37.4480 KOps/s | |
test_clone | 0.2030ms | 17.6449μs | 56.6737 KOps/s | 57.5838 KOps/s | |
test_getitem[int] | 1.2835ms | 17.7867μs | 56.2219 KOps/s | 58.0418 KOps/s | |
test_getitem[slice_int] | 0.1707ms | 31.9889μs | 31.2609 KOps/s | 32.3412 KOps/s | |
test_getitem[range] | 0.5035ms | 61.1561μs | 16.3516 KOps/s | 16.9198 KOps/s | |
test_getitem[tuple] | 0.1677ms | 26.4541μs | 37.8013 KOps/s | 39.3879 KOps/s | |
test_getitem[list] | 0.4397ms | 56.5360μs | 17.6878 KOps/s | 18.2523 KOps/s | |
test_setitem_dim[int] | 85.8410μs | 33.2104μs | 30.1111 KOps/s | 29.4785 KOps/s | |
test_setitem_dim[slice_int] | 0.1145ms | 62.4887μs | 16.0029 KOps/s | 15.9528 KOps/s | |
test_setitem_dim[range] | 0.1334ms | 85.9325μs | 11.6370 KOps/s | 11.9443 KOps/s | |
test_setitem_dim[tuple] | 97.1610μs | 49.5528μs | 20.1805 KOps/s | 20.1807 KOps/s | |
test_setitem | 0.2585ms | 29.0440μs | 34.4306 KOps/s | 32.1902 KOps/s | |
test_set | 0.1748ms | 28.2571μs | 35.3893 KOps/s | 32.8568 KOps/s | |
test_set_shared | 2.8256ms | 0.2223ms | 4.4985 KOps/s | 4.4859 KOps/s | |
test_update | 0.2202ms | 35.1762μs | 28.4283 KOps/s | 25.8553 KOps/s | |
test_update_nested | 0.1881ms | 45.2124μs | 22.1178 KOps/s | 20.4580 KOps/s | |
test_update__nested | 0.1921ms | 34.8523μs | 28.6925 KOps/s | 29.0685 KOps/s | |
test_set_nested | 0.1748ms | 31.1238μs | 32.1297 KOps/s | 30.0923 KOps/s | |
test_set_nested_new | 0.2002ms | 36.8937μs | 27.1049 KOps/s | 26.1973 KOps/s | |
test_select | 0.2164ms | 55.2571μs | 18.0972 KOps/s | 18.1195 KOps/s | |
test_select_nested | 1.0356ms | 65.0259μs | 15.3785 KOps/s | 16.4418 KOps/s | |
test_exclude_nested | 0.1707ms | 81.6996μs | 12.2400 KOps/s | 13.0098 KOps/s | |
test_empty[True] | 0.5012ms | 0.3225ms | 3.1012 KOps/s | 3.1422 KOps/s | |
test_empty[False] | 9.7683μs | 1.4597μs | 685.0551 KOps/s | 782.8172 KOps/s | |
test_unbind_speed | 0.4248ms | 0.3213ms | 3.1122 KOps/s | 3.3076 KOps/s | |
test_unbind_speed_stack0 | 0.5120ms | 0.3169ms | 3.1555 KOps/s | 3.4436 KOps/s | |
test_unbind_speed_stack1 | 0.1101s | 0.8925ms | 1.1204 KOps/s | 1.3631 KOps/s | |
test_split | 0.1008s | 2.2941ms | 435.8948 Ops/s | 446.1048 Ops/s | |
test_chunk | 3.1965ms | 2.0830ms | 480.0665 Ops/s | 447.1463 Ops/s | |
test_creation[device0] | 0.2972ms | 0.1202ms | 8.3186 KOps/s | 8.4274 KOps/s | |
test_creation_from_tensor | 4.1505ms | 0.1218ms | 8.2070 KOps/s | 8.3339 KOps/s | |
test_add_one[memmap_tensor0] | 0.6170ms | 7.4213μs | 134.7467 KOps/s | 136.5835 KOps/s | |
test_contiguous[memmap_tensor0] | 36.9090μs | 1.9601μs | 510.1688 KOps/s | 544.1024 KOps/s | |
test_stack[memmap_tensor0] | 0.1124ms | 5.8791μs | 170.0932 KOps/s | 178.1235 KOps/s | |
test_memmaptd_index | 1.2513ms | 0.4118ms | 2.4282 KOps/s | 2.5080 KOps/s | |
test_memmaptd_index_astensor | 0.9188ms | 0.4799ms | 2.0838 KOps/s | 2.0846 KOps/s | |
test_memmaptd_index_op | 1.9467ms | 1.0005ms | 999.5424 Ops/s | 949.7870 Ops/s | |
test_serialize_model | 0.2317s | 0.1434s | 6.9729 Ops/s | 8.1698 Ops/s | |
test_serialize_model_pickle | 0.4691s | 0.4003s | 2.4978 Ops/s | 2.5043 Ops/s | |
test_serialize_weights | 0.1340s | 0.1251s | 7.9913 Ops/s | 7.3135 Ops/s | |
test_serialize_weights_returnearly | 0.1880s | 0.1652s | 6.0531 Ops/s | 6.3454 Ops/s | |
test_serialize_weights_pickle | 0.5957s | 0.4629s | 2.1603 Ops/s | 1.1403 Ops/s | |
test_serialize_weights_filesystem | 0.2539s | 0.1620s | 6.1730 Ops/s | 6.7385 Ops/s | |
test_serialize_model_filesystem | 0.1696s | 0.1529s | 6.5383 Ops/s | 6.0466 Ops/s | |
test_reshape_pytree | 84.8280μs | 38.9291μs | 25.6877 KOps/s | 24.9752 KOps/s | |
test_reshape_td | 96.5710μs | 46.8593μs | 21.3405 KOps/s | 21.5404 KOps/s | |
test_view_pytree | 83.5970μs | 39.0032μs | 25.6389 KOps/s | 25.5325 KOps/s | |
test_view_td | 0.1068ms | 52.3498μs | 19.1023 KOps/s | 19.0209 KOps/s | |
test_unbind_pytree | 84.3880μs | 36.7492μs | 27.2115 KOps/s | 27.8865 KOps/s | |
test_unbind_td | 0.3662ms | 47.3110μs | 21.1368 KOps/s | 22.0461 KOps/s | |
test_split_pytree | 79.2880μs | 38.3531μs | 26.0735 KOps/s | 26.5109 KOps/s | |
test_split_td | 0.2364ms | 59.2781μs | 16.8696 KOps/s | 17.3949 KOps/s | |
test_add_pytree | 0.1112ms | 44.3078μs | 22.5694 KOps/s | 22.0341 KOps/s | |
test_add_td | 0.1718ms | 79.3215μs | 12.6069 KOps/s | 11.5134 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1457ms | 56.5032μs | 17.6981 KOps/s | 17.6139 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 1.1739ms | 0.1885ms | 5.3062 KOps/s | 5.2679 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1231ms | 56.5601μs | 17.6803 KOps/s | 17.2043 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2444ms | 0.1382ms | 7.2349 KOps/s | 7.0400 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1019ms | 20.5185μs | 48.7366 KOps/s | 48.8185 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1587ms | 68.3823μs | 14.6237 KOps/s | 14.9971 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1620ms | 75.5576μs | 13.2349 KOps/s | 12.9760 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1318ms | 68.2018μs | 14.6624 KOps/s | 14.2922 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2994ms | 0.1729ms | 5.7840 KOps/s | 5.7644 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3584ms | 0.1886ms | 5.3019 KOps/s | 5.2492 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1194ms | 46.7883μs | 21.3729 KOps/s | 20.4297 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.6234ms | 71.2774μs | 14.0297 KOps/s | 13.7869 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3583ms | 0.1795ms | 5.5703 KOps/s | 5.6601 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6081ms | 0.2905ms | 3.4421 KOps/s | 3.4881 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4112ms | 0.2018ms | 4.9561 KOps/s | 4.9500 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3280ms | 0.1783ms | 5.6091 KOps/s | 5.6746 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1874ms | 63.4790μs | 15.7532 KOps/s | 15.8221 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1043ms | 47.7870μs | 20.9262 KOps/s | 20.3093 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4655ms | 0.2343ms | 4.2679 KOps/s | 4.3148 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3575ms | 0.1762ms | 5.6750 KOps/s | 5.6897 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2312ms | 0.1013ms | 9.8733 KOps/s | 9.7232 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1583ms | 57.4596μs | 17.4035 KOps/s | 17.4919 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1642ms | 75.9716μs | 13.1628 KOps/s | 13.0742 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1418ms | 69.1928μs | 14.4524 KOps/s | 14.6049 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3151ms | 0.2002ms | 4.9958 KOps/s | 5.0846 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.5583ms | 1.6665ms | 600.0442 Ops/s | 595.3344 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2786ms | 0.1937ms | 5.1620 KOps/s | 5.2021 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.2179ms | 1.1059ms | 904.2245 Ops/s | 919.9146 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.7844ms | 0.4297ms | 2.3272 KOps/s | 2.3552 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.1664ms | 3.8471ms | 259.9332 Ops/s | 255.5375 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 83.2560μs | 34.6854μs | 28.8305 KOps/s | 27.5292 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6291ms | 48.5016μs | 20.6179 KOps/s | 20.2995 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 77.8560μs | 29.5615μs | 33.8278 KOps/s | 32.4175 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 2.2043ms | 28.8061μs | 34.7148 KOps/s | 34.3189 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 91.5110μs | 29.8286μs | 33.5249 KOps/s | 32.3089 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 89.4370μs | 28.6057μs | 34.9581 KOps/s | 34.7656 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1567ms | 75.0317μs | 13.3277 KOps/s | 12.9612 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6272ms | 28.0561μs | 35.6428 KOps/s | 35.7707 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1435ms | 70.5019μs | 14.1840 KOps/s | 14.2219 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 87.3330μs | 23.4245μs | 42.6903 KOps/s | 43.3057 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1562ms | 69.9615μs | 14.2936 KOps/s | 14.3181 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 83.2860μs | 23.5536μs | 42.4564 KOps/s | 43.0885 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1492ms | 75.1785μs | 13.3017 KOps/s | 13.2513 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9876ms | 27.8992μs | 35.8433 KOps/s | 36.3125 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1497ms | 70.3680μs | 14.2110 KOps/s | 14.2965 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 65.3320μs | 23.3691μs | 42.7915 KOps/s | 43.1237 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1585ms | 68.9895μs | 14.4950 KOps/s | 14.3829 KOps/s | |
test_compile_indexing[int-pytree-eager] | 72.9870μs | 23.2831μs | 42.9496 KOps/s | 43.3056 KOps/s | |
test_mod_add[eager] | 0.1094ms | 22.9307μs | 43.6096 KOps/s | 40.4862 KOps/s | |
test_mod_add[compile] | 0.1020ms | 40.6868μs | 24.5780 KOps/s | 25.4296 KOps/s | |
test_mod_add[compile-overhead] | 0.1175ms | 38.5835μs | 25.9178 KOps/s | 25.2645 KOps/s | |
test_mod_wrap[eager] | 0.3865ms | 0.2162ms | 4.6244 KOps/s | 4.6592 KOps/s | |
test_mod_wrap[compile] | 0.3844ms | 0.2344ms | 4.2655 KOps/s | 4.1793 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3408ms | 0.2365ms | 4.2285 KOps/s | 4.1791 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.7846ms | 11.3012ms | 88.4859 Ops/s | 89.8801 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.5131ms | 11.3066ms | 88.4437 Ops/s | 89.5865 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.8004ms | 11.3277ms | 88.2794 Ops/s | 88.4682 Ops/s | |
test_seq_add[eager] | 0.1625ms | 86.0388μs | 11.6227 KOps/s | 11.0473 KOps/s | |
test_seq_add[compile] | 0.1209ms | 63.3090μs | 15.7955 KOps/s | 15.0324 KOps/s | |
test_seq_add[compile-overhead] | 0.1188ms | 62.2181μs | 16.0725 KOps/s | 15.8007 KOps/s | |
test_seq_wrap[eager] | 0.6361ms | 0.3738ms | 2.6752 KOps/s | 2.5772 KOps/s | |
test_seq_wrap[compile] | 0.4592ms | 0.2740ms | 3.6503 KOps/s | 3.6049 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3688ms | 0.2751ms | 3.6345 KOps/s | 3.6009 KOps/s | |
test_func_call_runtime[False-eager] | 0.7093ms | 0.5271ms | 1.8972 KOps/s | 1.7975 KOps/s | |
test_func_call_runtime[False-compile] | 0.7224ms | 0.5111ms | 1.9567 KOps/s | 2.0020 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8870ms | 0.5103ms | 1.9595 KOps/s | 1.9806 KOps/s | |
test_func_call_runtime[True-eager] | 1.3034ms | 0.7522ms | 1.3294 KOps/s | 1.3070 KOps/s | |
test_func_call_runtime[True-compile] | 1.0102ms | 0.5237ms | 1.9094 KOps/s | 1.9730 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.9919ms | 0.5245ms | 1.9065 KOps/s | 1.9587 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.0931ms | 0.5157ms | 1.9391 KOps/s | 1.8758 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0834ms | 0.5108ms | 1.9576 KOps/s | 1.9810 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.8944ms | 0.5088ms | 1.9655 KOps/s | 1.9988 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.3826ms | 0.8790ms | 1.1376 KOps/s | 1.1141 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.2281ms | 0.7499ms | 1.3335 KOps/s | 1.3014 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2707ms | 0.7507ms | 1.3321 KOps/s | 1.2925 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.8774ms | 1.9289ms | 518.4189 Ops/s | 525.2050 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 3.3874ms | 2.0383ms | 490.6098 Ops/s | 507.1329 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 3.3262ms | 1.9706ms | 507.4624 Ops/s | 503.7586 Ops/s | |
test_distributed | 0.3310ms | 0.1262ms | 7.9232 KOps/s | 7.6375 KOps/s | |
test_tdmodule | 35.5860μs | 16.0838μs | 62.1745 KOps/s | 53.0792 KOps/s | |
test_tdmodule_dispatch | 59.4610μs | 33.8415μs | 29.5495 KOps/s | 25.8635 KOps/s | |
test_tdseq | 59.5810μs | 19.2920μs | 51.8349 KOps/s | 47.5407 KOps/s | |
test_tdseq_dispatch | 65.1220μs | 39.3341μs | 25.4232 KOps/s | 23.8099 KOps/s | |
test_instantiation_functorch | 2.1866ms | 1.6073ms | 622.1713 Ops/s | 610.6745 Ops/s | |
test_instantiation_td | 2.4700ms | 1.2318ms | 811.8457 Ops/s | 821.2324 Ops/s | |
test_exec_functorch | 0.3856ms | 0.1889ms | 5.2924 KOps/s | 5.3653 KOps/s | |
test_exec_functional_call | 0.3330ms | 0.1760ms | 5.6817 KOps/s | 5.3561 KOps/s | |
test_exec_td | 0.3256ms | 0.1688ms | 5.9245 KOps/s | 5.3508 KOps/s | |
test_exec_td_decorator | 0.4146ms | 0.2263ms | 4.4189 KOps/s | 4.4031 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.9010ms | 0.6555ms | 1.5256 KOps/s | 1.5174 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8657ms | 0.6504ms | 1.5375 KOps/s | 1.5361 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7302ms | 0.5052ms | 1.9793 KOps/s | 1.9719 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.8142ms | 0.5055ms | 1.9781 KOps/s | 1.9660 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.4111ms | 0.6327ms | 1.5806 KOps/s | 1.5812 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9889ms | 0.6326ms | 1.5808 KOps/s | 1.5773 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8403ms | 0.5221ms | 1.9153 KOps/s | 1.9370 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8487ms | 0.5206ms | 1.9210 KOps/s | 1.9169 KOps/s | |
test_to_module_speed[True] | 1.9665ms | 1.3008ms | 768.7647 Ops/s | 778.3459 Ops/s | |
test_to_module_speed[False] | 1.8628ms | 1.2709ms | 786.8431 Ops/s | 797.1385 Ops/s | |
test_tc_init | 84.5880μs | 41.7046μs | 23.9781 KOps/s | 21.8683 KOps/s | |
test_tc_init_nested | 0.1705ms | 85.2052μs | 11.7364 KOps/s | 10.5359 KOps/s | |
test_tc_first_layer_tensor | 23.8150μs | 1.5230μs | 656.6012 KOps/s | 662.3591 KOps/s | |
test_tc_first_layer_nontensor | 42.2100μs | 4.7719μs | 209.5583 KOps/s | 214.7310 KOps/s | |
test_tc_second_layer_tensor | 23.7050μs | 2.8651μs | 349.0294 KOps/s | 352.1194 KOps/s | |
test_tc_second_layer_nontensor | 31.8690μs | 6.0761μs | 164.5805 KOps/s | 168.3531 KOps/s | |
test_unbind | 0.5218s | 14.0864ms | 70.9905 Ops/s | 130.5467 Ops/s | |
test_full_like | 10.9617ms | 9.4466ms | 105.8582 Ops/s | 109.2207 Ops/s | |
test_zeros_like | 4.0533ms | 3.4788ms | 287.4554 Ops/s | 282.3360 Ops/s | |
test_ones_like | 5.2846ms | 3.9893ms | 250.6707 Ops/s | 265.2456 Ops/s | |
test_clone | 6.9680ms | 6.1728ms | 162.0022 Ops/s | 169.2519 Ops/s | |
test_squeeze | 74.1780μs | 12.1322μs | 82.4256 KOps/s | 79.8046 KOps/s | |
test_unsqueeze | 0.1738ms | 92.9741μs | 10.7557 KOps/s | 10.7255 KOps/s | |
test_split | 0.5820ms | 0.1984ms | 5.0404 KOps/s | 5.1256 KOps/s | |
test_permute | 0.4100ms | 0.2256ms | 4.4319 KOps/s | 4.4607 KOps/s | |
test_stack | 38.8289ms | 27.5902ms | 36.2448 Ops/s | 37.6680 Ops/s | |
test_cat | 31.7289ms | 26.5915ms | 37.6060 Ops/s | 37.6863 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.1482ms | 13.9780μs | 71.5410 KOps/s | 73.1680 KOps/s | |
test_plain_set_stack_nested | 32.1310μs | 14.3549μs | 69.6629 KOps/s | 72.7407 KOps/s | |
test_plain_set_nested_inplace | 84.1710μs | 15.1846μs | 65.8562 KOps/s | 67.2330 KOps/s | |
test_plain_set_stack_nested_inplace | 40.2010μs | 15.0208μs | 66.5744 KOps/s | 67.3967 KOps/s | |
test_items | 31.7800μs | 2.8655μs | 348.9742 KOps/s | 345.7707 KOps/s | |
test_items_nested | 0.5017ms | 0.3113ms | 3.2127 KOps/s | 3.1935 KOps/s | |
test_items_nested_locked | 0.3663ms | 0.3147ms | 3.1779 KOps/s | 3.1622 KOps/s | |
test_items_nested_leaf | 87.3220μs | 63.3160μs | 15.7938 KOps/s | 15.8744 KOps/s | |
test_items_stack_nested | 0.3776ms | 0.3141ms | 3.1842 KOps/s | 3.1435 KOps/s | |
test_items_stack_nested_leaf | 0.1150ms | 64.1678μs | 15.5841 KOps/s | 15.7461 KOps/s | |
test_items_stack_nested_locked | 0.3980ms | 0.3147ms | 3.1774 KOps/s | 3.1839 KOps/s | |
test_keys | 63.9510μs | 3.3939μs | 294.6435 KOps/s | 292.8954 KOps/s | |
test_keys_nested | 89.8020μs | 55.3015μs | 18.0827 KOps/s | 18.2967 KOps/s | |
test_keys_nested_locked | 2.3411ms | 60.4296μs | 16.5482 KOps/s | 16.5428 KOps/s | |
test_keys_nested_leaf | 84.4710μs | 45.8495μs | 21.8105 KOps/s | 22.1604 KOps/s | |
test_keys_stack_nested | 86.8520μs | 55.2343μs | 18.1047 KOps/s | 18.2052 KOps/s | |
test_keys_stack_nested_leaf | 79.9610μs | 47.2329μs | 21.1717 KOps/s | 21.2165 KOps/s | |
test_keys_stack_nested_locked | 0.1018ms | 59.4410μs | 16.8234 KOps/s | 16.7862 KOps/s | |
test_values | 14.1920μs | 0.8065μs | 1.2399 MOps/s | 1.2492 MOps/s | |
test_values_nested | 57.9910μs | 27.4412μs | 36.4416 KOps/s | 36.4561 KOps/s | |
test_values_nested_locked | 62.6410μs | 29.6570μs | 33.7189 KOps/s | 33.9844 KOps/s | |
test_values_nested_leaf | 57.1410μs | 24.2416μs | 41.2514 KOps/s | 41.2326 KOps/s | |
test_values_stack_nested | 68.2310μs | 28.4056μs | 35.2044 KOps/s | 35.1346 KOps/s | |
test_values_stack_nested_leaf | 69.2610μs | 24.9512μs | 40.0783 KOps/s | 39.9515 KOps/s | |
test_values_stack_nested_locked | 60.5010μs | 30.4411μs | 32.8504 KOps/s | 33.1191 KOps/s | |
test_membership | 1.7625μs | 0.4746μs | 2.1071 MOps/s | 2.1229 MOps/s | |
test_membership_nested | 14.7000μs | 1.7392μs | 574.9906 KOps/s | 577.3670 KOps/s | |
test_membership_nested_leaf | 12.4270μs | 1.7156μs | 582.9017 KOps/s | 578.6942 KOps/s | |
test_membership_stacked_nested | 0.3029ms | 1.7630μs | 567.2006 KOps/s | 557.1229 KOps/s | |
test_membership_stacked_nested_leaf | 35.5000μs | 1.7791μs | 562.0683 KOps/s | 557.1000 KOps/s | |
test_membership_nested_last | 29.2400μs | 2.5996μs | 384.6789 KOps/s | 385.4760 KOps/s | |
test_membership_nested_leaf_last | 36.1690μs | 2.6197μs | 381.7188 KOps/s | 391.6284 KOps/s | |
test_membership_stacked_nested_last | 39.8000μs | 3.7416μs | 267.2649 KOps/s | 131.8746 KOps/s | |
test_membership_stacked_nested_leaf_last | 28.5390μs | 3.7500μs | 266.6647 KOps/s | 131.8974 KOps/s | |
test_nested_getleaf | 37.1800μs | 6.1294μs | 163.1494 KOps/s | 164.7503 KOps/s | |
test_nested_get | 33.4800μs | 5.7646μs | 173.4730 KOps/s | 172.4294 KOps/s | |
test_stacked_getleaf | 31.9200μs | 6.1320μs | 163.0795 KOps/s | 165.8539 KOps/s | |
test_stacked_get | 40.8200μs | 5.7663μs | 173.4221 KOps/s | 174.3521 KOps/s | |
test_nested_getitemleaf | 41.6490μs | 6.1571μs | 162.4133 KOps/s | 164.7922 KOps/s | |
test_nested_getitem | 47.0010μs | 5.7372μs | 174.3014 KOps/s | 172.1475 KOps/s | |
test_stacked_getitemleaf | 30.3810μs | 6.1756μs | 161.9276 KOps/s | 164.7756 KOps/s | |
test_stacked_getitem | 37.7610μs | 5.8463μs | 171.0482 KOps/s | 172.3716 KOps/s | |
test_lock_nested | 5.1510ms | 0.4222ms | 2.3688 KOps/s | 2.3674 KOps/s | |
test_lock_stack_nested | 0.4638ms | 0.3800ms | 2.6318 KOps/s | 2.6541 KOps/s | |
test_unlock_nested | 0.8344ms | 0.3561ms | 2.8082 KOps/s | 2.7504 KOps/s | |
test_unlock_stack_nested | 0.3966ms | 0.3190ms | 3.1351 KOps/s | 3.1645 KOps/s | |
test_flatten_speed | 0.3098ms | 81.6849μs | 12.2422 KOps/s | 12.6119 KOps/s | |
test_unflatten_speed | 0.3405ms | 0.2829ms | 3.5346 KOps/s | 3.5598 KOps/s | |
test_common_ops | 1.4691ms | 1.2534ms | 797.8151 Ops/s | 795.9851 Ops/s | |
test_creation | 23.6900μs | 1.4911μs | 670.6559 KOps/s | 667.8573 KOps/s | |
test_creation_empty | 59.2100μs | 15.8834μs | 62.9587 KOps/s | 65.6368 KOps/s | |
test_creation_nested_1 | 62.0210μs | 17.6361μs | 56.7019 KOps/s | 59.0665 KOps/s | |
test_creation_nested_2 | 62.8710μs | 20.1811μs | 49.5513 KOps/s | 51.6078 KOps/s | |
test_clone | 72.5710μs | 29.2340μs | 34.2067 KOps/s | 33.2532 KOps/s | |
test_getitem[int] | 1.3035ms | 16.4822μs | 60.6716 KOps/s | 59.2542 KOps/s | |
test_getitem[slice_int] | 0.1255ms | 27.1071μs | 36.8907 KOps/s | 36.1333 KOps/s | |
test_getitem[range] | 0.2428ms | 0.1087ms | 9.1986 KOps/s | 9.2913 KOps/s | |
test_getitem[tuple] | 0.1212ms | 24.2690μs | 41.2049 KOps/s | 40.4209 KOps/s | |
test_getitem[list] | 0.1958ms | 97.2286μs | 10.2850 KOps/s | 10.1389 KOps/s | |
test_setitem_dim[int] | 77.4410μs | 44.9023μs | 22.2706 KOps/s | 21.9849 KOps/s | |
test_setitem_dim[slice_int] | 89.0320μs | 65.6097μs | 15.2417 KOps/s | 15.1265 KOps/s | |
test_setitem_dim[range] | 0.1764ms | 0.1261ms | 7.9317 KOps/s | 7.8360 KOps/s | |
test_setitem_dim[tuple] | 85.4210μs | 60.6247μs | 16.4949 KOps/s | 16.5911 KOps/s | |
test_setitem | 94.9220μs | 42.7267μs | 23.4046 KOps/s | 23.0270 KOps/s | |
test_set | 85.0710μs | 42.3181μs | 23.6305 KOps/s | 22.3481 KOps/s | |
test_set_shared | 0.3623ms | 51.9753μs | 19.2399 KOps/s | 19.0387 KOps/s | |
test_update | 0.1101ms | 51.9819μs | 19.2375 KOps/s | 19.0070 KOps/s | |
test_update_nested | 0.1091ms | 60.9061μs | 16.4187 KOps/s | 17.2919 KOps/s | |
test_update__nested | 0.1032ms | 63.8609μs | 15.6590 KOps/s | 16.0121 KOps/s | |
test_set_nested | 0.1128ms | 47.2503μs | 21.1639 KOps/s | 20.9101 KOps/s | |
test_set_nested_new | 95.6910μs | 49.5100μs | 20.1979 KOps/s | 19.5356 KOps/s | |
test_select | 0.1014ms | 62.8229μs | 15.9178 KOps/s | 15.1304 KOps/s | |
test_select_nested | 0.1204ms | 40.9431μs | 24.4242 KOps/s | 23.7413 KOps/s | |
test_exclude_nested | 96.1010μs | 58.7026μs | 17.0350 KOps/s | 16.7912 KOps/s | |
test_empty[True] | 0.8767ms | 0.2454ms | 4.0746 KOps/s | 4.0881 KOps/s | |
test_empty[False] | 4.1861μs | 0.7314μs | 1.3672 MOps/s | 1.3292 MOps/s | |
test_to | 59.9110μs | 25.2280μs | 39.6385 KOps/s | 38.8493 KOps/s | |
test_to_nonblocking | 64.2000μs | 24.7756μs | 40.3623 KOps/s | 39.6690 KOps/s | |
test_unbind_speed | 1.0367ms | 0.2717ms | 3.6805 KOps/s | 3.5242 KOps/s | |
test_unbind_speed_stack0 | 0.3441ms | 0.2775ms | 3.6037 KOps/s | 3.6529 KOps/s | |
test_unbind_speed_stack1 | 92.1630ms | 0.7000ms | 1.4286 KOps/s | 1.4279 KOps/s | |
test_split | 93.5889ms | 2.2022ms | 454.0843 Ops/s | 442.7878 Ops/s | |
test_chunk | 95.1377ms | 2.2225ms | 449.9494 Ops/s | 440.2371 Ops/s | |
test_creation[device0] | 0.3341ms | 0.1254ms | 7.9747 KOps/s | 7.6905 KOps/s | |
test_creation_from_tensor | 0.3403ms | 0.1282ms | 7.8033 KOps/s | 7.4565 KOps/s | |
test_add_one[memmap_tensor0] | 0.1383ms | 9.1352μs | 109.4665 KOps/s | 108.9162 KOps/s | |
test_contiguous[memmap_tensor0] | 47.8310μs | 2.2178μs | 450.8918 KOps/s | 453.6030 KOps/s | |
test_stack[memmap_tensor0] | 64.1710μs | 6.7790μs | 147.5142 KOps/s | 143.3043 KOps/s | |
test_memmaptd_index | 1.0805ms | 0.4391ms | 2.2773 KOps/s | 2.2533 KOps/s | |
test_memmaptd_index_astensor | 0.7711ms | 0.4994ms | 2.0025 KOps/s | 1.9960 KOps/s | |
test_memmaptd_index_op | 1.4889ms | 1.0582ms | 945.0176 Ops/s | 955.2802 Ops/s | |
test_serialize_model | 0.1298s | 0.1290s | 7.7548 Ops/s | 7.7548 Ops/s | |
test_serialize_model_pickle | 1.3560s | 1.2141s | 0.8237 Ops/s | 0.8246 Ops/s | |
test_serialize_weights | 0.1294s | 0.1284s | 7.7895 Ops/s | 7.8025 Ops/s | |
test_serialize_weights_returnearly | 56.8401ms | 45.8744ms | 21.7986 Ops/s | 17.5098 Ops/s | |
test_serialize_weights_pickle | 1.3538s | 1.2141s | 0.8237 Ops/s | 0.8176 Ops/s | |
test_reshape_pytree | 64.7910μs | 36.0666μs | 27.7265 KOps/s | 26.6803 KOps/s | |
test_reshape_td | 85.0220μs | 41.3780μs | 24.1675 KOps/s | 23.1168 KOps/s | |
test_view_pytree | 64.0010μs | 35.1702μs | 28.4332 KOps/s | 26.9586 KOps/s | |
test_view_td | 81.8410μs | 46.2949μs | 21.6006 KOps/s | 20.8560 KOps/s | |
test_unbind_pytree | 67.8610μs | 35.0887μs | 28.4992 KOps/s | 27.8414 KOps/s | |
test_unbind_td | 0.3939ms | 43.5848μs | 22.9438 KOps/s | 23.0740 KOps/s | |
test_split_pytree | 79.3010μs | 46.1856μs | 21.6518 KOps/s | 21.1649 KOps/s | |
test_split_td | 0.1788ms | 56.7284μs | 17.6279 KOps/s | 17.2894 KOps/s | |
test_add_pytree | 0.1189ms | 56.2609μs | 17.7743 KOps/s | 17.1402 KOps/s | |
test_add_td | 0.1241ms | 88.9348μs | 11.2442 KOps/s | 10.5683 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4128ms | 0.2067ms | 4.8382 KOps/s | 4.6243 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2520ms | 0.1568ms | 6.3773 KOps/s | 6.2486 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1887ms | 0.1440ms | 6.9429 KOps/s | 6.5429 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2268ms | 0.1841ms | 5.4320 KOps/s | 5.2827 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 52.6710μs | 21.1336μs | 47.3179 KOps/s | 45.7213 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 80.1910μs | 43.5151μs | 22.9805 KOps/s | 22.6237 KOps/s | |
test_compile_copy_nested[pytree-compile] | 99.7710μs | 64.4046μs | 15.5268 KOps/s | 15.5688 KOps/s | |
test_compile_copy_nested[pytree-eager] | 89.5910μs | 49.1060μs | 20.3641 KOps/s | 20.2040 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4110ms | 0.3179ms | 3.1452 KOps/s | 3.1123 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2738ms | 0.2143ms | 4.6653 KOps/s | 4.5239 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1679ms | 0.1278ms | 7.8248 KOps/s | 7.6939 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1212ms | 61.6509μs | 16.2204 KOps/s | 15.2148 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3614ms | 0.3170ms | 3.1545 KOps/s | 3.1213 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7000ms | 0.6299ms | 1.5875 KOps/s | 1.5257 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3363ms | 0.2543ms | 3.9319 KOps/s | 3.8305 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4011ms | 0.3187ms | 3.1373 KOps/s | 3.1207 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1843ms | 72.2726μs | 13.8365 KOps/s | 13.3439 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2637ms | 0.1286ms | 7.7771 KOps/s | 7.6548 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6891ms | 0.5545ms | 1.8035 KOps/s | 1.7955 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.4223ms | 0.3278ms | 3.0504 KOps/s | 3.1356 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 49.3700μs | 18.7419μs | 53.3563 KOps/s | 53.3535 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 72.1710μs | 27.6029μs | 36.2281 KOps/s | 36.4924 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1263ms | 68.2196μs | 14.6585 KOps/s | 14.3226 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1101ms | 51.0922μs | 19.5725 KOps/s | 19.2228 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.3083ms | 0.8057ms | 1.2412 KOps/s | 1.1268 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.3351ms | 3.2160ms | 310.9482 Ops/s | 308.2126 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.3226ms | 0.8049ms | 1.2423 KOps/s | 1.1281 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.3868ms | 3.2357ms | 309.0518 Ops/s | 303.2812 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1554ms | 0.1138ms | 8.7893 KOps/s | 9.1053 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.1944ms | 60.6510μs | 16.4878 KOps/s | 16.1387 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1945ms | 0.1043ms | 9.5878 KOps/s | 9.5149 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1178ms | 42.9463μs | 23.2849 KOps/s | 22.7929 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1660ms | 0.1045ms | 9.5675 KOps/s | 9.4619 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1064ms | 44.7593μs | 22.3417 KOps/s | 22.9393 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1723ms | 0.1377ms | 7.2596 KOps/s | 7.1633 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1583ms | 25.7362μs | 38.8558 KOps/s | 37.3240 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1891ms | 0.1355ms | 7.3789 KOps/s | 7.5346 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 59.8710μs | 21.6465μs | 46.1968 KOps/s | 45.1448 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1991ms | 0.1324ms | 7.5535 KOps/s | 7.4383 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 58.9110μs | 21.5931μs | 46.3110 KOps/s | 44.2920 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1873ms | 0.1386ms | 7.2159 KOps/s | 7.1577 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4881ms | 27.1482μs | 36.8349 KOps/s | 38.5129 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1971ms | 0.1375ms | 7.2719 KOps/s | 7.4722 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 62.8610μs | 21.5015μs | 46.5085 KOps/s | 44.9515 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1851ms | 0.1322ms | 7.5664 KOps/s | 7.4091 KOps/s | |
test_compile_indexing[int-pytree-eager] | 54.4810μs | 21.1567μs | 47.2663 KOps/s | 45.5124 KOps/s | |
test_mod_add[eager] | 82.7410μs | 33.7386μs | 29.6397 KOps/s | 31.6497 KOps/s | |
test_mod_add[compile] | 0.1144ms | 73.1218μs | 13.6758 KOps/s | 13.5568 KOps/s | |
test_mod_add[compile-overhead] | 0.2592ms | 0.1367ms | 7.3176 KOps/s | 7.0182 KOps/s | |
test_mod_wrap[eager] | 0.3224ms | 0.2544ms | 3.9314 KOps/s | 4.1008 KOps/s | |
test_mod_wrap[compile] | 0.4486ms | 0.2956ms | 3.3834 KOps/s | 3.4561 KOps/s | |
test_mod_wrap[compile-overhead] | 7.8674ms | 4.1652ms | 240.0850 Ops/s | 253.0617 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4350ms | 1.3218ms | 756.5283 Ops/s | 698.4141 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3858ms | 1.3096ms | 763.6100 Ops/s | 692.7406 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.2850ms | 0.8731ms | 1.1453 KOps/s | 1.0166 KOps/s | |
test_seq_add[eager] | 0.1495ms | 97.1042μs | 10.2982 KOps/s | 10.2858 KOps/s | |
test_seq_add[compile] | 0.1365ms | 83.0920μs | 12.0349 KOps/s | 12.2845 KOps/s | |
test_seq_add[compile-overhead] | 0.1819ms | 0.1173ms | 8.5258 KOps/s | 8.7318 KOps/s | |
test_seq_wrap[eager] | 0.4420ms | 0.3929ms | 2.5454 KOps/s | 2.6367 KOps/s | |
test_seq_wrap[compile] | 0.3783ms | 0.3065ms | 3.2631 KOps/s | 3.2211 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2740ms | 0.2086ms | 4.7934 KOps/s | 4.7164 KOps/s | |
test_func_call_runtime[False-eager] | 0.8285ms | 0.7392ms | 1.3529 KOps/s | 1.3471 KOps/s | |
test_func_call_runtime[False-compile] | 0.8619ms | 0.7742ms | 1.2917 KOps/s | 1.2540 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4655ms | 0.3489ms | 2.8663 KOps/s | 2.8445 KOps/s | |
test_func_call_runtime[True-eager] | 0.9662ms | 0.8916ms | 1.1215 KOps/s | 1.1020 KOps/s | |
test_func_call_runtime[True-compile] | 0.9037ms | 0.8169ms | 1.2241 KOps/s | 1.2012 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4878ms | 0.3902ms | 2.5630 KOps/s | 2.6025 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8410ms | 0.7665ms | 1.3046 KOps/s | 1.2663 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8668ms | 0.7817ms | 1.2793 KOps/s | 1.2203 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4838ms | 0.3498ms | 2.8589 KOps/s | 2.8181 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0902ms | 0.9896ms | 1.0105 KOps/s | 995.5644 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9332ms | 0.8439ms | 1.1850 KOps/s | 1.1650 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4775ms | 0.4061ms | 2.4626 KOps/s | 2.4595 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5554ms | 2.0545ms | 486.7404 Ops/s | 483.0592 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9680ms | 0.8629ms | 1.1589 KOps/s | 1.1304 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4781ms | 0.4113ms | 2.4314 KOps/s | 2.4163 KOps/s | |
test_distributed | 4.6082ms | 0.2295ms | 4.3577 KOps/s | 8.8819 KOps/s | |
test_tdmodule | 25.0210μs | 14.5014μs | 68.9589 KOps/s | 69.2421 KOps/s | |
test_tdmodule_dispatch | 50.7310μs | 28.9522μs | 34.5397 KOps/s | 35.2964 KOps/s | |
test_tdseq | 36.6510μs | 15.1409μs | 66.0464 KOps/s | 65.2305 KOps/s | |
test_tdseq_dispatch | 50.5410μs | 31.1494μs | 32.1034 KOps/s | 32.3123 KOps/s | |
test_instantiation_functorch | 1.9701ms | 1.8717ms | 534.2696 Ops/s | 523.2429 Ops/s | |
test_instantiation_td | 1.7938ms | 1.1982ms | 834.5729 Ops/s | 818.9905 Ops/s | |
test_exec_functorch | 0.2653ms | 0.2119ms | 4.7186 KOps/s | 4.6625 KOps/s | |
test_exec_functional_call | 0.2607ms | 0.2072ms | 4.8267 KOps/s | 4.7350 KOps/s | |
test_exec_td | 0.2651ms | 0.2176ms | 4.5960 KOps/s | 4.5918 KOps/s | |
test_exec_td_decorator | 0.9801ms | 0.2541ms | 3.9355 KOps/s | 3.8439 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.7561ms | 0.6733ms | 1.4853 KOps/s | 1.4561 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7659ms | 0.6683ms | 1.4963 KOps/s | 1.4188 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6885ms | 0.5656ms | 1.7679 KOps/s | 1.6771 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6776ms | 0.5678ms | 1.7611 KOps/s | 1.6906 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.4135ms | 0.6576ms | 1.5207 KOps/s | 1.4655 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7754ms | 0.6564ms | 1.5234 KOps/s | 1.4902 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6874ms | 0.5789ms | 1.7275 KOps/s | 1.6700 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7043ms | 0.5789ms | 1.7273 KOps/s | 1.7008 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.2794ms | 8.1906ms | 122.0918 Ops/s | 120.2507 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.3043ms | 8.1727ms | 122.3581 Ops/s | 120.2021 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.0556ms | 7.9807ms | 125.3026 Ops/s | 123.6418 Ops/s | |
test_vmap_transformer_speed[False-False] | 9.3348ms | 7.9836ms | 125.2567 Ops/s | 123.4157 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.7457ms | 19.1900ms | 52.1106 Ops/s | 51.7316 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.6134ms | 19.1705ms | 52.1634 Ops/s | 51.6897 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.7428ms | 19.1056ms | 52.3407 Ops/s | 52.0756 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.1158ms | 19.0159ms | 52.5877 Ops/s | 52.0694 Ops/s | |
test_to_module_speed[True] | 1.4005ms | 0.9390ms | 1.0649 KOps/s | 1.0625 KOps/s | |
test_to_module_speed[False] | 1.3215ms | 0.9133ms | 1.0949 KOps/s | 1.0954 KOps/s | |
test_tc_init | 54.9910μs | 33.5714μs | 29.7872 KOps/s | 29.0792 KOps/s | |
test_tc_init_nested | 0.1160ms | 67.7922μs | 14.7510 KOps/s | 13.8946 KOps/s | |
test_tc_first_layer_tensor | 4.5057μs | 0.6781μs | 1.4746 MOps/s | 1.4927 MOps/s | |
test_tc_first_layer_nontensor | 28.0100μs | 2.2054μs | 453.4288 KOps/s | 436.2074 KOps/s | |
test_tc_second_layer_tensor | 8.4953μs | 1.3770μs | 726.2078 KOps/s | 722.5784 KOps/s | |
test_tc_second_layer_nontensor | 30.1210μs | 2.9254μs | 341.8373 KOps/s | 327.8668 KOps/s | |
test_unbind | 0.1938s | 11.8646ms | 84.2843 Ops/s | 95.2824 Ops/s | |
test_full_like | 0.6562ms | 0.5750ms | 1.7393 KOps/s | 1.7381 KOps/s | |
test_zeros_like | 0.2770ms | 0.1980ms | 5.0508 KOps/s | 5.0512 KOps/s | |
test_ones_like | 0.2520ms | 0.1978ms | 5.0559 KOps/s | 5.0564 KOps/s | |
test_clone | 0.4588ms | 0.4135ms | 2.4181 KOps/s | 2.4127 KOps/s | |
test_squeeze | 54.1210μs | 9.6373μs | 103.7633 KOps/s | 103.6408 KOps/s | |
test_unsqueeze | 0.2938ms | 75.8307μs | 13.1873 KOps/s | 13.5814 KOps/s | |
test_split | 0.2595ms | 0.1611ms | 6.2075 KOps/s | 6.2251 KOps/s | |
test_permute | 0.2814ms | 0.1852ms | 5.3999 KOps/s | 5.4688 KOps/s | |
test_stack | 1.2532ms | 0.8677ms | 1.1525 KOps/s | 1.1399 KOps/s | |
test_cat | 1.2509ms | 1.2322ms | 811.5238 Ops/s | 811.9520 Ops/s |
vmoens
changed the title
[Feature] Use IntEnum for interaction types
[Refactor] Use IntEnum for interaction types
Sep 13, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Refactor
Refactoring code - not a new feature
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
Describe your changes in detail.
Motivation and Context
Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax
close #15213
if this solves the issue #15213Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an
x
in all the boxes that apply.If you are unsure about any of these, don't hesitate to ask. We are here to help!