-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Fix buffer identity in Params._apply #1027
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Oct 4, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 55.7440μs | 24.6096μs | 40.6346 KOps/s | 42.4670 KOps/s | |
test_plain_set_stack_nested | 76.8840μs | 24.7640μs | 40.3812 KOps/s | 41.6956 KOps/s | |
test_plain_set_nested_inplace | 58.2390μs | 26.6816μs | 37.4790 KOps/s | 38.2023 KOps/s | |
test_plain_set_stack_nested_inplace | 72.2660μs | 26.5134μs | 37.7167 KOps/s | 38.1048 KOps/s | |
test_items | 36.3580μs | 4.2026μs | 237.9466 KOps/s | 242.7801 KOps/s | |
test_items_nested | 0.6618ms | 0.3868ms | 2.5855 KOps/s | 2.6107 KOps/s | |
test_items_nested_locked | 0.5677ms | 0.3841ms | 2.6037 KOps/s | 2.6154 KOps/s | |
test_items_nested_leaf | 0.1605ms | 79.2218μs | 12.6228 KOps/s | 12.4276 KOps/s | |
test_items_stack_nested | 0.7276ms | 0.3893ms | 2.5688 KOps/s | 2.5739 KOps/s | |
test_items_stack_nested_leaf | 0.1569ms | 81.7171μs | 12.2373 KOps/s | 12.1176 KOps/s | |
test_items_stack_nested_locked | 0.7269ms | 0.3877ms | 2.5795 KOps/s | 2.5843 KOps/s | |
test_keys | 40.4960μs | 3.4518μs | 289.7079 KOps/s | 283.8768 KOps/s | |
test_keys_nested | 0.2583ms | 0.1324ms | 7.5516 KOps/s | 7.4861 KOps/s | |
test_keys_nested_locked | 1.7245ms | 0.1380ms | 7.2487 KOps/s | 7.2017 KOps/s | |
test_keys_nested_leaf | 0.2302ms | 0.1158ms | 8.6348 KOps/s | 8.5272 KOps/s | |
test_keys_stack_nested | 0.2209ms | 0.1322ms | 7.5618 KOps/s | 7.4528 KOps/s | |
test_keys_stack_nested_leaf | 0.2343ms | 0.1156ms | 8.6530 KOps/s | 8.6141 KOps/s | |
test_keys_stack_nested_locked | 0.2562ms | 0.1378ms | 7.2544 KOps/s | 7.1889 KOps/s | |
test_values | 5.6206μs | 1.0032μs | 996.8197 KOps/s | 987.1183 KOps/s | |
test_values_nested | 0.1656ms | 93.0733μs | 10.7442 KOps/s | 10.7583 KOps/s | |
test_values_nested_locked | 0.1761ms | 93.1152μs | 10.7394 KOps/s | 11.1678 KOps/s | |
test_values_nested_leaf | 0.1449ms | 78.2157μs | 12.7851 KOps/s | 12.6278 KOps/s | |
test_values_stack_nested | 0.1717ms | 92.5286μs | 10.8075 KOps/s | 10.6559 KOps/s | |
test_values_stack_nested_leaf | 0.1451ms | 77.9509μs | 12.8286 KOps/s | 12.5958 KOps/s | |
test_values_stack_nested_locked | 0.1929ms | 92.6661μs | 10.7914 KOps/s | 10.2661 KOps/s | |
test_membership | 3.8416μs | 0.7241μs | 1.3810 MOps/s | 1.3878 MOps/s | |
test_membership_nested | 21.3710μs | 2.7066μs | 369.4710 KOps/s | 359.5610 KOps/s | |
test_membership_nested_leaf | 21.7110μs | 2.7499μs | 363.6560 KOps/s | 358.4228 KOps/s | |
test_membership_stacked_nested | 24.9160μs | 2.7630μs | 361.9203 KOps/s | 363.2261 KOps/s | |
test_membership_stacked_nested_leaf | 20.2780μs | 2.7068μs | 369.4373 KOps/s | 362.4257 KOps/s | |
test_membership_nested_last | 34.0320μs | 4.1905μs | 238.6333 KOps/s | 237.8428 KOps/s | |
test_membership_nested_leaf_last | 25.4880μs | 4.2166μs | 237.1598 KOps/s | 235.3067 KOps/s | |
test_membership_stacked_nested_last | 22.7320μs | 4.1738μs | 239.5873 KOps/s | 237.5110 KOps/s | |
test_membership_stacked_nested_leaf_last | 36.0180μs | 4.1990μs | 238.1512 KOps/s | 239.4407 KOps/s | |
test_nested_getleaf | 32.3410μs | 10.3819μs | 96.3219 KOps/s | 96.4603 KOps/s | |
test_nested_get | 54.3920μs | 9.8055μs | 101.9834 KOps/s | 100.7796 KOps/s | |
test_stacked_getleaf | 38.5720μs | 10.3317μs | 96.7899 KOps/s | 96.1952 KOps/s | |
test_stacked_get | 32.1600μs | 9.9580μs | 100.4217 KOps/s | 100.1668 KOps/s | |
test_nested_getitemleaf | 38.6520μs | 10.7978μs | 92.6115 KOps/s | 94.1405 KOps/s | |
test_nested_getitem | 39.7050μs | 10.1137μs | 98.8758 KOps/s | 99.0502 KOps/s | |
test_stacked_getitemleaf | 36.9400μs | 10.7653μs | 92.8910 KOps/s | 91.9990 KOps/s | |
test_stacked_getitem | 28.9740μs | 10.0821μs | 99.1854 KOps/s | 98.2371 KOps/s | |
test_lock_nested | 84.1662ms | 0.5956ms | 1.6791 KOps/s | 1.9725 KOps/s | |
test_lock_stack_nested | 1.0612ms | 0.4776ms | 2.0936 KOps/s | 2.1006 KOps/s | |
test_unlock_nested | 84.8606ms | 0.5123ms | 1.9520 KOps/s | 2.3435 KOps/s | |
test_unlock_stack_nested | 0.7183ms | 0.3918ms | 2.5524 KOps/s | 2.5592 KOps/s | |
test_flatten_speed | 0.2061ms | 0.1010ms | 9.9000 KOps/s | 9.9730 KOps/s | |
test_unflatten_speed | 0.9770ms | 0.5050ms | 1.9801 KOps/s | 1.9451 KOps/s | |
test_common_ops | 2.0914ms | 1.1420ms | 875.6421 Ops/s | 865.4323 Ops/s | |
test_creation | 19.4370μs | 2.0507μs | 487.6280 KOps/s | 487.1589 KOps/s | |
test_creation_empty | 55.4740μs | 18.9703μs | 52.7139 KOps/s | 56.0749 KOps/s | |
test_creation_nested_1 | 53.8410μs | 22.3827μs | 44.6773 KOps/s | 46.5505 KOps/s | |
test_creation_nested_2 | 62.6480μs | 26.6807μs | 37.4803 KOps/s | 38.6081 KOps/s | |
test_clone | 62.3770μs | 17.2017μs | 58.1337 KOps/s | 57.7540 KOps/s | |
test_getitem[int] | 1.1145ms | 16.5052μs | 60.5869 KOps/s | 60.3235 KOps/s | |
test_getitem[slice_int] | 0.1365ms | 30.2472μs | 33.0609 KOps/s | 33.4595 KOps/s | |
test_getitem[range] | 0.1937ms | 57.4611μs | 17.4031 KOps/s | 16.8940 KOps/s | |
test_getitem[tuple] | 0.1282ms | 24.7003μs | 40.4854 KOps/s | 40.4768 KOps/s | |
test_getitem[list] | 0.1912ms | 53.4631μs | 18.7045 KOps/s | 18.5930 KOps/s | |
test_setitem_dim[int] | 77.2740μs | 32.8649μs | 30.4276 KOps/s | 30.2262 KOps/s | |
test_setitem_dim[slice_int] | 0.1019ms | 60.3920μs | 16.5585 KOps/s | 15.7000 KOps/s | |
test_setitem_dim[range] | 0.1159ms | 82.2404μs | 12.1595 KOps/s | 11.6819 KOps/s | |
test_setitem_dim[tuple] | 93.0540μs | 48.2461μs | 20.7271 KOps/s | 20.2839 KOps/s | |
test_setitem | 95.7290μs | 31.3231μs | 31.9254 KOps/s | 32.6005 KOps/s | |
test_set | 0.1008ms | 30.4004μs | 32.8943 KOps/s | 33.3476 KOps/s | |
test_set_shared | 3.6019ms | 0.2202ms | 4.5423 KOps/s | 4.5032 KOps/s | |
test_update | 0.1439ms | 39.4697μs | 25.3359 KOps/s | 25.5976 KOps/s | |
test_update_nested | 0.1051ms | 49.2683μs | 20.2970 KOps/s | 20.1716 KOps/s | |
test_update__nested | 0.1131ms | 36.9488μs | 27.0645 KOps/s | 26.1808 KOps/s | |
test_set_nested | 88.7570μs | 34.2226μs | 29.2205 KOps/s | 30.1459 KOps/s | |
test_set_nested_new | 92.4830μs | 38.5246μs | 25.9574 KOps/s | 26.3441 KOps/s | |
test_select | 0.1102ms | 56.3271μs | 17.7535 KOps/s | 18.3920 KOps/s | |
test_select_nested | 0.1543ms | 59.3087μs | 16.8609 KOps/s | 16.9933 KOps/s | |
test_exclude_nested | 0.1523ms | 73.3650μs | 13.6305 KOps/s | 13.5678 KOps/s | |
test_empty[True] | 0.7202ms | 0.3482ms | 2.8722 KOps/s | 2.8636 KOps/s | |
test_empty[False] | 6.1440μs | 1.2208μs | 819.1466 KOps/s | 836.1229 KOps/s | |
test_unbind_speed | 0.4537ms | 0.2969ms | 3.3685 KOps/s | 3.2797 KOps/s | |
test_unbind_speed_stack0 | 0.5812ms | 0.2971ms | 3.3657 KOps/s | 3.3516 KOps/s | |
test_unbind_speed_stack1 | 88.9067ms | 0.8225ms | 1.2158 KOps/s | 1.3241 KOps/s | |
test_split | 78.9165ms | 2.1433ms | 466.5613 Ops/s | 470.6696 Ops/s | |
test_chunk | 2.6994ms | 1.9822ms | 504.4977 Ops/s | 470.4617 Ops/s | |
test_creation[device0] | 3.6539ms | 0.1198ms | 8.3487 KOps/s | 8.4426 KOps/s | |
test_creation_from_tensor | 0.2708ms | 0.1179ms | 8.4794 KOps/s | 8.3332 KOps/s | |
test_add_one[memmap_tensor0] | 0.1666ms | 7.4014μs | 135.1089 KOps/s | 133.2986 KOps/s | |
test_contiguous[memmap_tensor0] | 13.0240μs | 1.8820μs | 531.3358 KOps/s | 520.9288 KOps/s | |
test_stack[memmap_tensor0] | 38.5630μs | 5.6375μs | 177.3842 KOps/s | 170.0613 KOps/s | |
test_memmaptd_index | 1.1367ms | 0.4084ms | 2.4489 KOps/s | 2.4008 KOps/s | |
test_memmaptd_index_astensor | 0.7890ms | 0.5147ms | 1.9430 KOps/s | 1.9274 KOps/s | |
test_memmaptd_index_op | 1.9089ms | 1.0917ms | 915.9794 Ops/s | 937.7689 Ops/s | |
test_serialize_model | 0.1315s | 0.1186s | 8.4319 Ops/s | 8.4994 Ops/s | |
test_serialize_model_pickle | 0.4457s | 0.3864s | 2.5877 Ops/s | 2.5813 Ops/s | |
test_serialize_weights | 0.1340s | 0.1198s | 8.3485 Ops/s | 8.4741 Ops/s | |
test_serialize_weights_returnearly | 0.1834s | 0.1579s | 6.3314 Ops/s | 6.2450 Ops/s | |
test_serialize_weights_pickle | 0.4658s | 0.4135s | 2.4183 Ops/s | 2.3493 Ops/s | |
test_serialize_weights_filesystem | 0.2253s | 0.1508s | 6.6296 Ops/s | 7.1030 Ops/s | |
test_serialize_model_filesystem | 0.1493s | 0.1427s | 7.0084 Ops/s | 6.2125 Ops/s | |
test_reshape_pytree | 0.1046ms | 38.7083μs | 25.8343 KOps/s | 25.9929 KOps/s | |
test_reshape_td | 0.1112ms | 46.0551μs | 21.7131 KOps/s | 21.9244 KOps/s | |
test_view_pytree | 93.0840μs | 38.5132μs | 25.9651 KOps/s | 26.1955 KOps/s | |
test_view_td | 0.1371ms | 51.0477μs | 19.5895 KOps/s | 19.6081 KOps/s | |
test_unbind_pytree | 82.8550μs | 35.0765μs | 28.5091 KOps/s | 27.8912 KOps/s | |
test_unbind_td | 0.2981ms | 44.8084μs | 22.3173 KOps/s | 22.4790 KOps/s | |
test_split_pytree | 82.0030μs | 37.5809μs | 26.6093 KOps/s | 26.3155 KOps/s | |
test_split_td | 0.5602ms | 56.4583μs | 17.7122 KOps/s | 17.6406 KOps/s | |
test_add_pytree | 0.1186ms | 44.9043μs | 22.2696 KOps/s | 22.2594 KOps/s | |
test_add_td | 0.1771ms | 89.1935μs | 11.2116 KOps/s | 11.7151 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1357ms | 59.4903μs | 16.8095 KOps/s | 17.0709 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2874ms | 0.1949ms | 5.1297 KOps/s | 5.0665 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1225ms | 57.1479μs | 17.4985 KOps/s | 17.7243 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2389ms | 0.1381ms | 7.2392 KOps/s | 7.0571 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 60.6140μs | 23.8347μs | 41.9556 KOps/s | 41.8864 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1538ms | 73.0623μs | 13.6870 KOps/s | 13.4736 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1588ms | 75.3914μs | 13.2641 KOps/s | 13.3838 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1318ms | 67.8762μs | 14.7327 KOps/s | 14.7876 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3357ms | 0.1810ms | 5.5245 KOps/s | 5.4906 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4684ms | 0.2432ms | 4.1124 KOps/s | 4.1261 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1149ms | 48.3995μs | 20.6614 KOps/s | 19.7517 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1564ms | 77.4708μs | 12.9081 KOps/s | 12.8922 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2684ms | 0.1750ms | 5.7158 KOps/s | 5.6919 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4896ms | 0.2869ms | 3.4851 KOps/s | 3.4495 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3988ms | 0.2762ms | 3.6207 KOps/s | 3.6008 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3512ms | 0.1872ms | 5.3411 KOps/s | 5.5152 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1499ms | 74.7216μs | 13.3830 KOps/s | 13.5513 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1076ms | 50.9805μs | 19.6153 KOps/s | 19.6735 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4983ms | 0.2339ms | 4.2745 KOps/s | 4.2062 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3429ms | 0.1757ms | 5.6904 KOps/s | 5.6006 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2462ms | 0.1121ms | 8.9199 KOps/s | 9.0130 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1727ms | 78.1945μs | 12.7886 KOps/s | 12.9967 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1597ms | 76.4944μs | 13.0728 KOps/s | 13.2247 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1502ms | 68.4983μs | 14.5989 KOps/s | 14.7656 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2988ms | 0.1944ms | 5.1429 KOps/s | 5.1171 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.4980ms | 1.7783ms | 562.3300 Ops/s | 560.8074 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2650ms | 0.1922ms | 5.2018 KOps/s | 5.1258 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3620ms | 1.0926ms | 915.2155 Ops/s | 895.1707 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.5876ms | 0.4153ms | 2.4078 KOps/s | 2.3412 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.4277ms | 4.2296ms | 236.4276 Ops/s | 244.9006 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 96.0110μs | 35.0848μs | 28.5023 KOps/s | 28.6364 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.9532ms | 47.7895μs | 20.9251 KOps/s | 20.4903 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 91.0010μs | 30.9884μs | 32.2702 KOps/s | 33.3867 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 94.1170μs | 30.2147μs | 33.0965 KOps/s | 34.5420 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 95.6190μs | 31.2246μs | 32.0260 KOps/s | 33.5730 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 74.8500μs | 29.6742μs | 33.6993 KOps/s | 34.5664 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1680ms | 74.3905μs | 13.4426 KOps/s | 13.5352 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3524ms | 27.7124μs | 36.0849 KOps/s | 36.8127 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1483ms | 68.8621μs | 14.5218 KOps/s | 14.2527 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 64.5710μs | 23.3329μs | 42.8579 KOps/s | 43.0635 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1286ms | 68.5279μs | 14.5926 KOps/s | 14.3659 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 76.6230μs | 22.8899μs | 43.6874 KOps/s | 43.1064 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1640ms | 73.6492μs | 13.5779 KOps/s | 13.2159 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9066ms | 27.1909μs | 36.7770 KOps/s | 37.3400 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1233ms | 68.9636μs | 14.5004 KOps/s | 14.4290 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 67.2660μs | 22.7572μs | 43.9421 KOps/s | 43.8511 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1413ms | 68.2770μs | 14.6462 KOps/s | 14.5487 KOps/s | |
test_compile_indexing[int-pytree-eager] | 71.9250μs | 22.9055μs | 43.6576 KOps/s | 43.4821 KOps/s | |
test_mod_add[eager] | 0.1096ms | 25.3603μs | 39.4317 KOps/s | 40.4079 KOps/s | |
test_mod_add[compile] | 0.1006ms | 39.3955μs | 25.3836 KOps/s | 25.8447 KOps/s | |
test_mod_add[compile-overhead] | 0.1035ms | 40.1595μs | 24.9007 KOps/s | 25.4467 KOps/s | |
test_mod_wrap[eager] | 0.3423ms | 0.2154ms | 4.6419 KOps/s | 4.7290 KOps/s | |
test_mod_wrap[compile] | 0.3880ms | 0.2314ms | 4.3220 KOps/s | 4.2318 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3486ms | 0.2313ms | 4.3230 KOps/s | 4.3073 KOps/s | |
test_mod_wrap_and_backward[eager] | 13.1525ms | 10.7628ms | 92.9125 Ops/s | 72.3895 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.3013ms | 10.7369ms | 93.1366 Ops/s | 75.8383 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.1912ms | 10.5595ms | 94.7016 Ops/s | 81.3074 Ops/s | |
test_seq_add[eager] | 0.1796ms | 94.9141μs | 10.5358 KOps/s | 10.8149 KOps/s | |
test_seq_add[compile] | 0.1253ms | 65.5788μs | 15.2488 KOps/s | 15.3408 KOps/s | |
test_seq_add[compile-overhead] | 0.1315ms | 65.5788μs | 15.2488 KOps/s | 15.5401 KOps/s | |
test_seq_wrap[eager] | 0.6397ms | 0.3958ms | 2.5265 KOps/s | 2.5928 KOps/s | |
test_seq_wrap[compile] | 1.1949ms | 0.2761ms | 3.6215 KOps/s | 3.7036 KOps/s | |
test_seq_wrap[compile-overhead] | 1.1633ms | 0.2733ms | 3.6596 KOps/s | 3.6534 KOps/s | |
test_func_call_runtime[False-eager] | 0.9202ms | 0.5328ms | 1.8767 KOps/s | 1.9053 KOps/s | |
test_func_call_runtime[False-compile] | 0.6225ms | 0.5113ms | 1.9559 KOps/s | 1.9722 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8143ms | 0.5076ms | 1.9700 KOps/s | 1.9684 KOps/s | |
test_func_call_runtime[True-eager] | 0.9788ms | 0.7481ms | 1.3367 KOps/s | 1.3439 KOps/s | |
test_func_call_runtime[True-compile] | 0.6938ms | 0.5248ms | 1.9056 KOps/s | 1.9156 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8749ms | 0.5248ms | 1.9053 KOps/s | 1.9366 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8742ms | 0.5302ms | 1.8859 KOps/s | 1.8906 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6453ms | 0.5069ms | 1.9726 KOps/s | 1.9735 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.8427ms | 0.5121ms | 1.9528 KOps/s | 1.9684 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.9263ms | 0.9103ms | 1.0986 KOps/s | 1.1139 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0845ms | 0.7516ms | 1.3304 KOps/s | 1.3441 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.8695ms | 0.7598ms | 1.3162 KOps/s | 1.3271 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5414ms | 1.9153ms | 522.1048 Ops/s | 520.9826 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 2.5421ms | 1.9772ms | 505.7551 Ops/s | 503.6587 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 2.5956ms | 1.9834ms | 504.1819 Ops/s | 502.4631 Ops/s | |
test_distributed | 0.3164ms | 0.1262ms | 7.9247 KOps/s | 7.7530 KOps/s | |
test_tdmodule | 71.5450μs | 18.5969μs | 53.7724 KOps/s | 54.9578 KOps/s | |
test_tdmodule_dispatch | 66.0740μs | 36.9870μs | 27.0365 KOps/s | 27.7920 KOps/s | |
test_tdseq | 50.0140μs | 21.2976μs | 46.9536 KOps/s | 47.6539 KOps/s | |
test_tdseq_dispatch | 73.9390μs | 42.6893μs | 23.4251 KOps/s | 24.2706 KOps/s | |
test_instantiation_functorch | 1.7617ms | 1.5746ms | 635.0830 Ops/s | 633.7333 Ops/s | |
test_instantiation_td | 1.8123ms | 1.1718ms | 853.3597 Ops/s | 856.5842 Ops/s | |
test_exec_functorch | 0.4215ms | 0.1834ms | 5.4534 KOps/s | 5.3966 KOps/s | |
test_exec_functional_call | 0.3442ms | 0.1755ms | 5.6988 KOps/s | 5.6358 KOps/s | |
test_exec_td | 0.3519ms | 0.2015ms | 4.9623 KOps/s | 4.7445 KOps/s | |
test_exec_td_decorator | 0.3723ms | 0.2349ms | 4.2565 KOps/s | 4.2509 KOps/s | |
test_vmap_mlp_speed[True-True] | 1.1833ms | 0.7115ms | 1.4055 KOps/s | 1.4566 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8471ms | 0.6898ms | 1.4497 KOps/s | 1.4730 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.8725ms | 0.5414ms | 1.8470 KOps/s | 1.8760 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.9029ms | 0.5482ms | 1.8242 KOps/s | 1.8637 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2759ms | 0.6513ms | 1.5353 KOps/s | 1.5565 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8095ms | 0.6505ms | 1.5372 KOps/s | 1.5584 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8903ms | 0.5424ms | 1.8436 KOps/s | 1.8843 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6764ms | 0.5340ms | 1.8725 KOps/s | 1.8739 KOps/s | |
test_to_module_speed[True] | 2.3119ms | 1.3947ms | 717.0183 Ops/s | 714.3941 Ops/s | |
test_to_module_speed[False] | 2.3317ms | 1.3584ms | 736.1858 Ops/s | 730.8041 Ops/s | |
test_tc_init | 0.1109ms | 48.8238μs | 20.4818 KOps/s | 21.0092 KOps/s | |
test_tc_init_nested | 0.2083ms | 95.0836μs | 10.5171 KOps/s | 10.5073 KOps/s | |
test_tc_first_layer_tensor | 15.1590μs | 1.4984μs | 667.3650 KOps/s | 639.1721 KOps/s | |
test_tc_first_layer_nontensor | 43.0020μs | 4.6054μs | 217.1344 KOps/s | 210.5714 KOps/s | |
test_tc_second_layer_tensor | 18.3750μs | 2.7283μs | 366.5276 KOps/s | 345.1004 KOps/s | |
test_tc_second_layer_nontensor | 39.4450μs | 6.0238μs | 166.0073 KOps/s | 161.6530 KOps/s | |
test_unbind | 0.4682s | 13.1706ms | 75.9266 Ops/s | 75.3994 Ops/s | |
test_full_like | 7.8984ms | 7.1706ms | 139.4577 Ops/s | 118.4356 Ops/s | |
test_zeros_like | 3.1112ms | 2.7106ms | 368.9187 Ops/s | 357.6143 Ops/s | |
test_ones_like | 3.5513ms | 3.2596ms | 306.7828 Ops/s | 305.8875 Ops/s | |
test_clone | 5.2088ms | 4.9137ms | 203.5128 Ops/s | 200.1529 Ops/s | |
test_squeeze | 60.5850μs | 12.5073μs | 79.9532 KOps/s | 80.9334 KOps/s | |
test_unsqueeze | 0.3453ms | 93.3821μs | 10.7087 KOps/s | 10.7226 KOps/s | |
test_split | 0.3712ms | 0.1920ms | 5.2074 KOps/s | 5.1868 KOps/s | |
test_permute | 0.3835ms | 0.2186ms | 4.5751 KOps/s | 4.5858 KOps/s | |
test_stack | 28.6305ms | 25.4539ms | 39.2866 Ops/s | 40.1837 Ops/s | |
test_cat | 27.7739ms | 25.2686ms | 39.5747 Ops/s | 40.8933 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.1527ms | 16.4323μs | 60.8559 KOps/s | 59.0341 KOps/s | |
test_plain_set_stack_nested | 41.5010μs | 16.4869μs | 60.6543 KOps/s | 58.9265 KOps/s | |
test_plain_set_nested_inplace | 53.8010μs | 17.5642μs | 56.9340 KOps/s | 55.1198 KOps/s | |
test_plain_set_stack_nested_inplace | 52.3210μs | 17.5003μs | 57.1419 KOps/s | 55.3275 KOps/s | |
test_items | 25.7300μs | 2.9874μs | 334.7382 KOps/s | 346.3573 KOps/s | |
test_items_nested | 0.3810ms | 0.3390ms | 2.9494 KOps/s | 2.9421 KOps/s | |
test_items_nested_locked | 0.4073ms | 0.3361ms | 2.9756 KOps/s | 2.9810 KOps/s | |
test_items_nested_leaf | 95.5520μs | 62.5818μs | 15.9791 KOps/s | 16.0150 KOps/s | |
test_items_stack_nested | 0.5318ms | 0.3409ms | 2.9332 KOps/s | 2.9725 KOps/s | |
test_items_stack_nested_leaf | 92.0320μs | 64.8783μs | 15.4135 KOps/s | 15.7883 KOps/s | |
test_items_stack_nested_locked | 0.3718ms | 0.3402ms | 2.9399 KOps/s | 2.9496 KOps/s | |
test_keys | 32.3210μs | 3.4068μs | 293.5325 KOps/s | 291.7744 KOps/s | |
test_keys_nested | 0.1023ms | 70.3342μs | 14.2178 KOps/s | 14.1924 KOps/s | |
test_keys_nested_locked | 2.4950ms | 76.8917μs | 13.0053 KOps/s | 13.0237 KOps/s | |
test_keys_nested_leaf | 0.1045ms | 60.7283μs | 16.4668 KOps/s | 16.2765 KOps/s | |
test_keys_stack_nested | 0.1702ms | 71.5536μs | 13.9755 KOps/s | 13.9574 KOps/s | |
test_keys_stack_nested_leaf | 93.1720μs | 62.9312μs | 15.8904 KOps/s | 16.0617 KOps/s | |
test_keys_stack_nested_locked | 0.1119ms | 77.6528μs | 12.8778 KOps/s | 12.9425 KOps/s | |
test_values | 8.5685μs | 0.8420μs | 1.1877 MOps/s | 1.1973 MOps/s | |
test_values_nested | 94.7520μs | 48.8223μs | 20.4824 KOps/s | 20.4603 KOps/s | |
test_values_nested_locked | 78.0020μs | 50.2027μs | 19.9192 KOps/s | 19.8116 KOps/s | |
test_values_nested_leaf | 69.5010μs | 42.6271μs | 23.4592 KOps/s | 23.4269 KOps/s | |
test_values_stack_nested | 80.3820μs | 50.0819μs | 19.9673 KOps/s | 20.1586 KOps/s | |
test_values_stack_nested_leaf | 79.2020μs | 44.0966μs | 22.6775 KOps/s | 22.9880 KOps/s | |
test_values_stack_nested_locked | 81.6210μs | 51.5115μs | 19.4132 KOps/s | 19.4339 KOps/s | |
test_membership | 1.7931μs | 0.5007μs | 1.9971 MOps/s | 1.9861 MOps/s | |
test_membership_nested | 16.2255μs | 1.8355μs | 544.8023 KOps/s | 545.0633 KOps/s | |
test_membership_nested_leaf | 13.3603μs | 1.8366μs | 544.4732 KOps/s | 547.2286 KOps/s | |
test_membership_stacked_nested | 31.3200μs | 1.8410μs | 543.1835 KOps/s | 538.9763 KOps/s | |
test_membership_stacked_nested_leaf | 24.8610μs | 1.8861μs | 530.1851 KOps/s | 535.6556 KOps/s | |
test_membership_nested_last | 0.5940ms | 2.9739μs | 336.2557 KOps/s | 337.8659 KOps/s | |
test_membership_nested_leaf_last | 33.1110μs | 2.9295μs | 341.3551 KOps/s | 333.4303 KOps/s | |
test_membership_stacked_nested_last | 31.5000μs | 3.5165μs | 284.3759 KOps/s | 333.8960 KOps/s | |
test_membership_stacked_nested_leaf_last | 34.1000μs | 3.4698μs | 288.1972 KOps/s | 336.2771 KOps/s | |
test_nested_getleaf | 34.0910μs | 6.0634μs | 164.9247 KOps/s | 168.3147 KOps/s | |
test_nested_get | 38.5910μs | 5.7947μs | 172.5703 KOps/s | 175.1260 KOps/s | |
test_stacked_getleaf | 0.1729ms | 5.9962μs | 166.7719 KOps/s | 165.4428 KOps/s | |
test_stacked_get | 34.0110μs | 5.6600μs | 176.6785 KOps/s | 173.2625 KOps/s | |
test_nested_getitemleaf | 0.1885ms | 6.1141μs | 163.5552 KOps/s | 161.7610 KOps/s | |
test_nested_getitem | 34.1810μs | 5.7810μs | 172.9795 KOps/s | 171.8565 KOps/s | |
test_stacked_getitemleaf | 31.3700μs | 6.0764μs | 164.5714 KOps/s | 163.8114 KOps/s | |
test_stacked_getitem | 26.2000μs | 5.7503μs | 173.9053 KOps/s | 173.8409 KOps/s | |
test_lock_nested | 4.3878ms | 0.4233ms | 2.3625 KOps/s | 2.3882 KOps/s | |
test_lock_stack_nested | 0.5202ms | 0.3877ms | 2.5794 KOps/s | 2.5962 KOps/s | |
test_unlock_nested | 0.8589ms | 0.3556ms | 2.8123 KOps/s | 2.7990 KOps/s | |
test_unlock_stack_nested | 0.3543ms | 0.3233ms | 3.0930 KOps/s | 3.0926 KOps/s | |
test_flatten_speed | 0.1658ms | 77.0629μs | 12.9764 KOps/s | 13.0764 KOps/s | |
test_unflatten_speed | 0.3600ms | 0.3264ms | 3.0642 KOps/s | 3.1008 KOps/s | |
test_common_ops | 1.5323ms | 1.2298ms | 813.1221 Ops/s | 763.5003 Ops/s | |
test_creation | 31.5900μs | 1.4518μs | 688.8048 KOps/s | 704.4494 KOps/s | |
test_creation_empty | 0.8790ms | 14.7222μs | 67.9249 KOps/s | 62.5898 KOps/s | |
test_creation_nested_1 | 47.2710μs | 16.3510μs | 61.1583 KOps/s | 56.4307 KOps/s | |
test_creation_nested_2 | 58.3120μs | 18.8627μs | 53.0147 KOps/s | 48.9771 KOps/s | |
test_clone | 0.1660ms | 27.4700μs | 36.4033 KOps/s | 36.8735 KOps/s | |
test_getitem[int] | 92.1347ms | 22.6927μs | 44.0671 KOps/s | 65.3006 KOps/s | |
test_getitem[slice_int] | 0.1592ms | 26.6630μs | 37.5051 KOps/s | 37.8101 KOps/s | |
test_getitem[range] | 0.2200ms | 0.1059ms | 9.4436 KOps/s | 9.4418 KOps/s | |
test_getitem[tuple] | 0.1195ms | 23.0377μs | 43.4071 KOps/s | 43.7493 KOps/s | |
test_getitem[list] | 0.2783ms | 0.1003ms | 9.9731 KOps/s | 10.5205 KOps/s | |
test_setitem_dim[int] | 74.4710μs | 45.4498μs | 22.0023 KOps/s | 22.7761 KOps/s | |
test_setitem_dim[slice_int] | 0.1968ms | 67.0614μs | 14.9117 KOps/s | 15.0944 KOps/s | |
test_setitem_dim[range] | 0.3101ms | 0.1269ms | 7.8819 KOps/s | 8.0406 KOps/s | |
test_setitem_dim[tuple] | 0.2243ms | 60.3292μs | 16.5757 KOps/s | 16.7594 KOps/s | |
test_setitem | 0.1894ms | 40.1520μs | 24.9054 KOps/s | 23.1335 KOps/s | |
test_set | 0.1879ms | 39.1830μs | 25.5213 KOps/s | 23.5060 KOps/s | |
test_set_shared | 0.3452ms | 53.5769μs | 18.6647 KOps/s | 18.9708 KOps/s | |
test_update | 0.2255ms | 50.7522μs | 19.7036 KOps/s | 20.4328 KOps/s | |
test_update_nested | 0.2384ms | 59.1521μs | 16.9056 KOps/s | 17.6677 KOps/s | |
test_update__nested | 0.2265ms | 59.6016μs | 16.7781 KOps/s | 16.8677 KOps/s | |
test_set_nested | 0.1941ms | 42.3555μs | 23.6097 KOps/s | 22.6505 KOps/s | |
test_set_nested_new | 0.2036ms | 45.6338μs | 21.9136 KOps/s | 20.3123 KOps/s | |
test_select | 0.2390ms | 59.9870μs | 16.6703 KOps/s | 15.8197 KOps/s | |
test_select_nested | 0.5056ms | 42.0404μs | 23.7867 KOps/s | 23.8435 KOps/s | |
test_exclude_nested | 88.8620μs | 57.7455μs | 17.3174 KOps/s | 17.1985 KOps/s | |
test_empty[True] | 0.4169ms | 0.2574ms | 3.8853 KOps/s | 3.8835 KOps/s | |
test_empty[False] | 3.1091μs | 0.7294μs | 1.3710 MOps/s | 1.3658 MOps/s | |
test_to | 55.6210μs | 26.5139μs | 37.7161 KOps/s | 38.1303 KOps/s | |
test_to_nonblocking | 0.1397ms | 25.3873μs | 39.3897 KOps/s | 40.4074 KOps/s | |
test_unbind_speed | 0.3237ms | 0.2738ms | 3.6521 KOps/s | 3.6811 KOps/s | |
test_unbind_speed_stack0 | 0.3691ms | 0.2743ms | 3.6450 KOps/s | 3.7322 KOps/s | |
test_unbind_speed_stack1 | 91.6997ms | 0.7016ms | 1.4252 KOps/s | 1.4133 KOps/s | |
test_split | 93.4815ms | 2.0937ms | 477.6176 Ops/s | 475.9170 Ops/s | |
test_chunk | 93.3065ms | 2.0880ms | 478.9242 Ops/s | 473.9808 Ops/s | |
test_creation[device0] | 0.3384ms | 0.1260ms | 7.9375 KOps/s | 8.0476 KOps/s | |
test_creation_from_tensor | 0.3436ms | 0.1326ms | 7.5413 KOps/s | 7.6473 KOps/s | |
test_add_one[memmap_tensor0] | 0.2451ms | 8.3932μs | 119.1446 KOps/s | 119.7406 KOps/s | |
test_contiguous[memmap_tensor0] | 34.1510μs | 2.0978μs | 476.6910 KOps/s | 473.3370 KOps/s | |
test_stack[memmap_tensor0] | 37.2710μs | 6.3965μs | 156.3350 KOps/s | 155.1482 KOps/s | |
test_memmaptd_index | 1.1875ms | 0.4139ms | 2.4159 KOps/s | 2.4252 KOps/s | |
test_memmaptd_index_astensor | 0.7557ms | 0.4838ms | 2.0668 KOps/s | 2.0802 KOps/s | |
test_memmaptd_index_op | 1.3824ms | 0.9875ms | 1.0127 KOps/s | 998.4656 Ops/s | |
test_serialize_model | 0.1305s | 0.1297s | 7.7079 Ops/s | 7.6700 Ops/s | |
test_serialize_model_pickle | 1.3622s | 1.2160s | 0.8224 Ops/s | 0.8244 Ops/s | |
test_serialize_weights | 0.1303s | 0.1294s | 7.7280 Ops/s | 6.9651 Ops/s | |
test_serialize_weights_returnearly | 0.2069s | 55.3873ms | 18.0547 Ops/s | 17.7035 Ops/s | |
test_serialize_weights_pickle | 1.3723s | 1.2169s | 0.8218 Ops/s | 0.8172 Ops/s | |
test_reshape_pytree | 78.4110μs | 35.5523μs | 28.1276 KOps/s | 29.1380 KOps/s | |
test_reshape_td | 0.1698ms | 42.1129μs | 23.7457 KOps/s | 25.1189 KOps/s | |
test_view_pytree | 0.1835ms | 34.0422μs | 29.3753 KOps/s | 29.5595 KOps/s | |
test_view_td | 0.2252ms | 45.4580μs | 21.9983 KOps/s | 21.9944 KOps/s | |
test_unbind_pytree | 0.1818ms | 34.4169μs | 29.0555 KOps/s | 30.0050 KOps/s | |
test_unbind_td | 0.4203ms | 42.0444μs | 23.7844 KOps/s | 23.7180 KOps/s | |
test_split_pytree | 0.1400ms | 46.5980μs | 21.4602 KOps/s | 22.0809 KOps/s | |
test_split_td | 93.6307ms | 64.8835μs | 15.4122 KOps/s | 18.1778 KOps/s | |
test_add_pytree | 0.2020ms | 55.3219μs | 18.0760 KOps/s | 17.2539 KOps/s | |
test_add_td | 0.2714ms | 93.0713μs | 10.7444 KOps/s | 10.4452 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.3107ms | 0.1567ms | 6.3836 KOps/s | 6.0825 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3246ms | 0.1632ms | 6.1288 KOps/s | 6.2705 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2936ms | 0.1401ms | 7.1391 KOps/s | 7.1112 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3669ms | 0.1883ms | 5.3096 KOps/s | 5.6556 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1665ms | 21.3480μs | 46.8427 KOps/s | 47.2196 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1787ms | 47.8348μs | 20.9053 KOps/s | 20.9234 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2287ms | 63.6733μs | 15.7052 KOps/s | 15.5704 KOps/s | |
test_compile_copy_nested[pytree-eager] | 82.7120μs | 49.9901μs | 20.0040 KOps/s | 19.9349 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4523ms | 0.3088ms | 3.2378 KOps/s | 3.1894 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3518ms | 0.2390ms | 4.1846 KOps/s | 4.3252 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2731ms | 0.1241ms | 8.0598 KOps/s | 7.7041 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2575ms | 67.9418μs | 14.7185 KOps/s | 15.5721 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4052ms | 0.3072ms | 3.2555 KOps/s | 3.2274 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7731ms | 0.5988ms | 1.6701 KOps/s | 1.7085 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4045ms | 0.2852ms | 3.5062 KOps/s | 3.5798 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4281ms | 0.3107ms | 3.2189 KOps/s | 3.1982 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2384ms | 76.9039μs | 13.0032 KOps/s | 13.3709 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2848ms | 0.1241ms | 8.0579 KOps/s | 7.9551 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6764ms | 0.5028ms | 1.9888 KOps/s | 1.9856 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3963ms | 0.3063ms | 3.2644 KOps/s | 3.2290 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1466ms | 19.2914μs | 51.8365 KOps/s | 51.1107 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1244ms | 37.7128μs | 26.5162 KOps/s | 24.2778 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1319ms | 69.1124μs | 14.4692 KOps/s | 14.4558 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1345ms | 51.7569μs | 19.3211 KOps/s | 19.2115 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.3070ms | 0.8038ms | 1.2440 KOps/s | 1.1609 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.3563ms | 3.1594ms | 316.5172 Ops/s | 320.3135 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.2423ms | 0.7884ms | 1.2685 KOps/s | 1.1786 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.5497ms | 3.1805ms | 314.4142 Ops/s | 322.6682 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2589ms | 0.1063ms | 9.4103 KOps/s | 8.9620 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2070ms | 57.7580μs | 17.3136 KOps/s | 16.5189 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2792ms | 0.1043ms | 9.5894 KOps/s | 9.5049 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2276ms | 44.9544μs | 22.2448 KOps/s | 24.1124 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2751ms | 0.1068ms | 9.3596 KOps/s | 9.4700 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2177ms | 44.9430μs | 22.2504 KOps/s | 23.6211 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.3102ms | 0.1346ms | 7.4308 KOps/s | 7.2820 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1626ms | 24.3497μs | 41.0682 KOps/s | 41.4173 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2810ms | 0.1271ms | 7.8654 KOps/s | 7.5389 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 72.4620μs | 20.3565μs | 49.1243 KOps/s | 49.7317 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.3063ms | 0.1332ms | 7.5057 KOps/s | 7.5600 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 55.9710μs | 19.9772μs | 50.0571 KOps/s | 49.5459 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2976ms | 0.1343ms | 7.4470 KOps/s | 7.2013 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5119ms | 23.3917μs | 42.7501 KOps/s | 42.4860 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.3167ms | 0.1277ms | 7.8304 KOps/s | 7.6040 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 51.0810μs | 19.9557μs | 50.1111 KOps/s | 49.4455 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.3181ms | 0.1302ms | 7.6793 KOps/s | 7.5391 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1687ms | 19.5161μs | 51.2396 KOps/s | 38.0982 KOps/s | |
test_mod_add[eager] | 0.1665ms | 30.6627μs | 32.6129 KOps/s | 31.5429 KOps/s | |
test_mod_add[compile] | 0.3142ms | 67.6929μs | 14.7726 KOps/s | 14.5336 KOps/s | |
test_mod_add[compile-overhead] | 0.2546ms | 0.1317ms | 7.5905 KOps/s | 6.9130 KOps/s | |
test_mod_wrap[eager] | 0.9182ms | 0.7782ms | 1.2851 KOps/s | 1.2874 KOps/s | |
test_mod_wrap[compile] | 2.0699ms | 0.8414ms | 1.1885 KOps/s | 1.2128 KOps/s | |
test_mod_wrap[compile-overhead] | 4.9231ms | 3.1099ms | 321.5501 Ops/s | 328.4527 Ops/s | |
test_mod_wrap_and_backward[eager] | 4.2209ms | 4.0731ms | 245.5161 Ops/s | 242.8560 Ops/s | |
test_mod_wrap_and_backward[compile] | 4.6332ms | 4.0446ms | 247.2407 Ops/s | 244.0315 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4232ms | 0.9144ms | 1.0936 KOps/s | 988.2712 Ops/s | |
test_seq_add[eager] | 0.2414ms | 93.8996μs | 10.6497 KOps/s | 10.2250 KOps/s | |
test_seq_add[compile] | 0.2234ms | 81.8909μs | 12.2114 KOps/s | 12.6197 KOps/s | |
test_seq_add[compile-overhead] | 0.2954ms | 0.1124ms | 8.9005 KOps/s | 9.0143 KOps/s | |
test_seq_wrap[eager] | 1.0958ms | 0.9249ms | 1.0812 KOps/s | 1.0713 KOps/s | |
test_seq_wrap[compile] | 0.9975ms | 0.8508ms | 1.1754 KOps/s | 1.1865 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3732ms | 0.2153ms | 4.6438 KOps/s | 4.5701 KOps/s | |
test_func_call_runtime[False-eager] | 2.5665ms | 2.3809ms | 420.0015 Ops/s | 425.6374 Ops/s | |
test_func_call_runtime[False-compile] | 2.5984ms | 2.4026ms | 416.2140 Ops/s | 421.4910 Ops/s | |
test_func_call_runtime[False-compile-overhead] | 0.4921ms | 0.3489ms | 2.8664 KOps/s | 2.8364 KOps/s | |
test_func_call_runtime[True-eager] | 2.7300ms | 2.5444ms | 393.0212 Ops/s | 396.1123 Ops/s | |
test_func_call_runtime[True-compile] | 2.6173ms | 2.4415ms | 409.5892 Ops/s | 421.1894 Ops/s | |
test_func_call_runtime[True-compile-overhead] | 0.5165ms | 0.3713ms | 2.6934 KOps/s | 2.7034 KOps/s | |
test_func_call_cm_runtime[False-eager] | 2.5595ms | 2.3865ms | 419.0274 Ops/s | 430.6176 Ops/s | |
test_func_call_cm_runtime[False-compile] | 2.5736ms | 2.4108ms | 414.8066 Ops/s | 423.2245 Ops/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5050ms | 0.3510ms | 2.8488 KOps/s | 2.8259 KOps/s | |
test_func_call_cm_runtime[True-eager] | 2.8114ms | 2.6825ms | 372.7829 Ops/s | 383.4114 Ops/s | |
test_func_call_cm_runtime[True-compile] | 2.6260ms | 2.4662ms | 405.4842 Ops/s | 412.7853 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5386ms | 0.3971ms | 2.5185 KOps/s | 2.5167 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 4.1816ms | 3.7286ms | 268.1956 Ops/s | 266.1307 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 2.6335ms | 2.4751ms | 404.0207 Ops/s | 409.4466 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5028ms | 0.3982ms | 2.5112 KOps/s | 2.5095 KOps/s | |
test_distributed | 3.9205ms | 0.2505ms | 3.9927 KOps/s | 8.8234 KOps/s | |
test_tdmodule | 49.2210μs | 14.8678μs | 67.2596 KOps/s | 62.5852 KOps/s | |
test_tdmodule_dispatch | 64.6610μs | 28.9026μs | 34.5990 KOps/s | 33.9713 KOps/s | |
test_tdseq | 0.1663ms | 16.0630μs | 62.2548 KOps/s | 62.2775 KOps/s | |
test_tdseq_dispatch | 53.3220μs | 31.5261μs | 31.7197 KOps/s | 31.2112 KOps/s | |
test_instantiation_functorch | 2.0609ms | 1.8194ms | 549.6379 Ops/s | 550.6108 Ops/s | |
test_instantiation_td | 1.8237ms | 1.1820ms | 846.0095 Ops/s | 845.6685 Ops/s | |
test_exec_functorch | 1.1282ms | 0.9969ms | 1.0031 KOps/s | 1.0025 KOps/s | |
test_exec_functional_call | 1.1449ms | 1.0148ms | 985.4556 Ops/s | 1.0004 KOps/s | |
test_exec_td | 1.2017ms | 1.0198ms | 980.6073 Ops/s | 968.2874 Ops/s | |
test_exec_td_decorator | 1.4983ms | 1.0649ms | 939.0631 Ops/s | 948.6506 Ops/s | |
test_vmap_mlp_speed[True-True] | 1.7970ms | 1.2599ms | 793.6834 Ops/s | 794.3251 Ops/s | |
test_vmap_mlp_speed[True-False] | 1.4783ms | 1.2593ms | 794.1142 Ops/s | 798.4347 Ops/s | |
test_vmap_mlp_speed[False-True] | 1.2968ms | 1.1578ms | 863.7061 Ops/s | 867.7175 Ops/s | |
test_vmap_mlp_speed[False-False] | 1.4460ms | 1.1610ms | 861.3355 Ops/s | 867.5776 Ops/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.4222ms | 1.2502ms | 799.9019 Ops/s | 809.9387 Ops/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.7866ms | 1.2383ms | 807.5747 Ops/s | 812.7663 Ops/s | |
test_vmap_mlp_speed_decorator[False-True] | 1.3310ms | 1.1627ms | 860.0515 Ops/s | 871.7473 Ops/s | |
test_vmap_mlp_speed_decorator[False-False] | 1.3769ms | 1.1726ms | 852.8295 Ops/s | 872.0383 Ops/s | |
test_vmap_transformer_speed[True-True] | 13.5738ms | 13.1860ms | 75.8379 Ops/s | 76.4798 Ops/s | |
test_vmap_transformer_speed[True-False] | 13.5662ms | 13.1390ms | 76.1093 Ops/s | 76.8451 Ops/s | |
test_vmap_transformer_speed[False-True] | 13.3140ms | 12.8687ms | 77.7082 Ops/s | 78.4436 Ops/s | |
test_vmap_transformer_speed[False-False] | 13.1318ms | 12.7653ms | 78.3375 Ops/s | 78.4858 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 34.2207ms | 33.6485ms | 29.7190 Ops/s | 30.1382 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 34.4222ms | 33.5955ms | 29.7659 Ops/s | 29.9707 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 34.7789ms | 33.4314ms | 29.9120 Ops/s | 30.1098 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 33.7773ms | 33.4094ms | 29.9317 Ops/s | 30.1160 Ops/s | |
test_to_module_speed[True] | 1.3737ms | 0.9979ms | 1.0021 KOps/s | 1.0033 KOps/s | |
test_to_module_speed[False] | 1.3732ms | 0.9743ms | 1.0264 KOps/s | 1.0292 KOps/s | |
test_tc_init | 0.4259ms | 33.8381μs | 29.5525 KOps/s | 28.9957 KOps/s | |
test_tc_init_nested | 0.1127ms | 68.0049μs | 14.7048 KOps/s | 14.0659 KOps/s | |
test_tc_first_layer_tensor | 56.4671μs | 0.6659μs | 1.5017 MOps/s | 1.5043 MOps/s | |
test_tc_first_layer_nontensor | 0.3785ms | 2.1919μs | 456.2215 KOps/s | 453.3263 KOps/s | |
test_tc_second_layer_tensor | 48.8260μs | 1.3444μs | 743.8004 KOps/s | 731.9398 KOps/s | |
test_tc_second_layer_nontensor | 27.4800μs | 2.8982μs | 345.0441 KOps/s | 343.1963 KOps/s | |
test_unbind | 0.1940s | 12.2025ms | 81.9504 Ops/s | 90.9106 Ops/s | |
test_full_like | 0.7866ms | 0.5748ms | 1.7399 KOps/s | 1.7381 KOps/s | |
test_zeros_like | 0.3267ms | 0.1981ms | 5.0470 KOps/s | 5.0472 KOps/s | |
test_ones_like | 0.5514ms | 0.1981ms | 5.0488 KOps/s | 5.0497 KOps/s | |
test_clone | 0.7105ms | 0.4143ms | 2.4136 KOps/s | 2.4051 KOps/s | |
test_squeeze | 0.1329ms | 9.6198μs | 103.9519 KOps/s | 104.3493 KOps/s | |
test_unsqueeze | 0.4491ms | 72.3694μs | 13.8180 KOps/s | 13.7808 KOps/s | |
test_split | 0.5967ms | 0.1534ms | 6.5183 KOps/s | 6.3711 KOps/s | |
test_permute | 0.2439ms | 0.1747ms | 5.7234 KOps/s | 5.7321 KOps/s | |
test_stack | 1.2536ms | 0.8595ms | 1.1634 KOps/s | 1.1587 KOps/s | |
test_cat | 1.3527ms | 1.2320ms | 811.6997 Ops/s | 811.4897 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.