Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Test] Rename duplicated test #997

Merged
merged 1 commit into from
Sep 17, 2024
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Sep 17, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: b40c941b8f7c7561064fe9df4ab528fc8a0ead9b
Pull Request resolved: #997
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 17, 2024
@vmoens vmoens merged commit 5c334c5 into gh/vmoens/22/base Sep 17, 2024
8 of 24 checks passed
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: b40c941b8f7c7561064fe9df4ab528fc8a0ead9b
Pull Request resolved: #997
@vmoens vmoens deleted the gh/vmoens/22/head branch September 17, 2024 02:58
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}16$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 47.1990μs 19.3865μs 51.5823 KOps/s 52.0703 KOps/s $\color{#d91a1a}-0.94\%$
test_plain_set_stack_nested 56.6660μs 19.6048μs 51.0079 KOps/s 51.4187 KOps/s $\color{#d91a1a}-0.80\%$
test_plain_set_nested_inplace 60.8940μs 21.2691μs 47.0166 KOps/s 47.3134 KOps/s $\color{#d91a1a}-0.63\%$
test_plain_set_stack_nested_inplace 0.1000ms 21.2884μs 46.9740 KOps/s 47.2774 KOps/s $\color{#d91a1a}-0.64\%$
test_items 26.8710μs 4.1495μs 240.9933 KOps/s 238.8854 KOps/s $\color{#35bf28}+0.88\%$
test_items_nested 0.5601ms 0.3611ms 2.7692 KOps/s 2.8111 KOps/s $\color{#d91a1a}-1.49\%$
test_items_nested_locked 0.5219ms 0.3600ms 2.7777 KOps/s 2.8059 KOps/s $\color{#d91a1a}-1.01\%$
test_items_nested_leaf 0.1344ms 69.1469μs 14.4620 KOps/s 14.6523 KOps/s $\color{#d91a1a}-1.30\%$
test_items_stack_nested 0.4920ms 0.3606ms 2.7728 KOps/s 2.7530 KOps/s $\color{#35bf28}+0.72\%$
test_items_stack_nested_leaf 0.1389ms 71.0520μs 14.0742 KOps/s 13.9135 KOps/s $\color{#35bf28}+1.15\%$
test_items_stack_nested_locked 0.7265ms 0.3617ms 2.7646 KOps/s 2.7740 KOps/s $\color{#d91a1a}-0.34\%$
test_keys 24.7170μs 3.6332μs 275.2425 KOps/s 282.4546 KOps/s $\color{#d91a1a}-2.55\%$
test_keys_nested 0.1831ms 0.1053ms 9.4990 KOps/s 9.7872 KOps/s $\color{#d91a1a}-2.94\%$
test_keys_nested_locked 1.9111ms 0.1128ms 8.8621 KOps/s 9.3306 KOps/s $\textbf{\color{#d91a1a}-5.02\%}$
test_keys_nested_leaf 0.1647ms 86.4362μs 11.5692 KOps/s 11.8188 KOps/s $\color{#d91a1a}-2.11\%$
test_keys_stack_nested 0.1858ms 0.1046ms 9.5579 KOps/s 10.0350 KOps/s $\color{#d91a1a}-4.75\%$
test_keys_stack_nested_leaf 0.1630ms 86.9701μs 11.4982 KOps/s 12.0983 KOps/s $\color{#d91a1a}-4.96\%$
test_keys_stack_nested_locked 0.1967ms 0.1114ms 8.9783 KOps/s 9.5175 KOps/s $\textbf{\color{#d91a1a}-5.66\%}$
test_values 14.6854μs 1.1029μs 906.6647 KOps/s 901.5656 KOps/s $\color{#35bf28}+0.57\%$
test_values_nested 0.1325ms 74.8313μs 13.3634 KOps/s 13.3286 KOps/s $\color{#35bf28}+0.26\%$
test_values_nested_locked 0.1323ms 74.8201μs 13.3654 KOps/s 13.8224 KOps/s $\color{#d91a1a}-3.31\%$
test_values_nested_leaf 0.1128ms 63.0494μs 15.8606 KOps/s 16.3711 KOps/s $\color{#d91a1a}-3.12\%$
test_values_stack_nested 0.1339ms 75.7879μs 13.1947 KOps/s 13.7218 KOps/s $\color{#d91a1a}-3.84\%$
test_values_stack_nested_leaf 0.1473ms 62.5536μs 15.9863 KOps/s 16.6485 KOps/s $\color{#d91a1a}-3.98\%$
test_values_stack_nested_locked 0.1051ms 75.7066μs 13.2089 KOps/s 13.6503 KOps/s $\color{#d91a1a}-3.23\%$
test_membership 3.5494μs 0.8054μs 1.2416 MOps/s 1.3988 MOps/s $\textbf{\color{#d91a1a}-11.24\%}$
test_membership_nested 21.1100μs 2.8183μs 354.8275 KOps/s 361.6118 KOps/s $\color{#d91a1a}-1.88\%$
test_membership_nested_leaf 18.8050μs 2.8191μs 354.7248 KOps/s 362.9864 KOps/s $\color{#d91a1a}-2.28\%$
test_membership_stacked_nested 23.2330μs 2.8332μs 352.9517 KOps/s 362.2918 KOps/s $\color{#d91a1a}-2.58\%$
test_membership_stacked_nested_leaf 18.3240μs 2.8541μs 350.3760 KOps/s 354.7530 KOps/s $\color{#d91a1a}-1.23\%$
test_membership_nested_last 25.1380μs 4.0367μs 247.7271 KOps/s 256.3854 KOps/s $\color{#d91a1a}-3.38\%$
test_membership_nested_leaf_last 25.8680μs 4.0867μs 244.6946 KOps/s 252.4297 KOps/s $\color{#d91a1a}-3.06\%$
test_membership_stacked_nested_last 23.8150μs 4.7702μs 209.6343 KOps/s 256.3132 KOps/s $\textbf{\color{#d91a1a}-18.21\%}$
test_membership_stacked_nested_leaf_last 27.1410μs 4.7732μs 209.5043 KOps/s 256.4013 KOps/s $\textbf{\color{#d91a1a}-18.29\%}$
test_nested_getleaf 35.5460μs 10.9088μs 91.6688 KOps/s 96.3427 KOps/s $\color{#d91a1a}-4.85\%$
test_nested_get 32.3910μs 10.3357μs 96.7520 KOps/s 99.0856 KOps/s $\color{#d91a1a}-2.36\%$
test_stacked_getleaf 34.1840μs 10.8618μs 92.0659 KOps/s 95.4850 KOps/s $\color{#d91a1a}-3.58\%$
test_stacked_get 37.9000μs 10.3974μs 96.1775 KOps/s 98.3698 KOps/s $\color{#d91a1a}-2.23\%$
test_nested_getitemleaf 36.6490μs 11.1953μs 89.3234 KOps/s 89.8070 KOps/s $\color{#d91a1a}-0.54\%$
test_nested_getitem 34.1040μs 10.4576μs 95.6239 KOps/s 96.9886 KOps/s $\color{#d91a1a}-1.41\%$
test_stacked_getitemleaf 30.6080μs 11.0403μs 90.5772 KOps/s 91.5764 KOps/s $\color{#d91a1a}-1.09\%$
test_stacked_getitem 25.2370μs 10.4005μs 96.1495 KOps/s 97.2413 KOps/s $\color{#d91a1a}-1.12\%$
test_lock_nested 85.1543ms 0.5591ms 1.7886 KOps/s 2.1001 KOps/s $\textbf{\color{#d91a1a}-14.83\%}$
test_lock_stack_nested 0.5183ms 0.4383ms 2.2817 KOps/s 2.2548 KOps/s $\color{#35bf28}+1.19\%$
test_unlock_nested 80.9887ms 0.4770ms 2.0966 KOps/s 2.5159 KOps/s $\textbf{\color{#d91a1a}-16.67\%}$
test_unlock_stack_nested 0.5562ms 0.3628ms 2.7563 KOps/s 2.7372 KOps/s $\color{#35bf28}+0.70\%$
test_flatten_speed 0.1756ms 87.8526μs 11.3827 KOps/s 11.5363 KOps/s $\color{#d91a1a}-1.33\%$
test_unflatten_speed 0.5761ms 0.4648ms 2.1517 KOps/s 2.1625 KOps/s $\color{#d91a1a}-0.50\%$
test_common_ops 1.9369ms 1.0641ms 939.7328 Ops/s 917.0060 Ops/s $\color{#35bf28}+2.48\%$
test_creation 34.2740μs 2.3520μs 425.1771 KOps/s 486.9499 KOps/s $\textbf{\color{#d91a1a}-12.69\%}$
test_creation_empty 45.0650μs 16.3495μs 61.1638 KOps/s 61.0168 KOps/s $\color{#35bf28}+0.24\%$
test_creation_nested_1 49.9240μs 19.4768μs 51.3431 KOps/s 52.0684 KOps/s $\color{#d91a1a}-1.39\%$
test_creation_nested_2 74.5100μs 23.3663μs 42.7967 KOps/s 42.3391 KOps/s $\color{#35bf28}+1.08\%$
test_clone 72.8770μs 16.7192μs 59.8116 KOps/s 60.3123 KOps/s $\color{#d91a1a}-0.83\%$
test_getitem[int] 1.3187ms 16.7761μs 59.6086 KOps/s 60.1993 KOps/s $\color{#d91a1a}-0.98\%$
test_getitem[slice_int] 0.1832ms 30.6352μs 32.6422 KOps/s 33.7509 KOps/s $\color{#d91a1a}-3.28\%$
test_getitem[range] 0.1918ms 55.9694μs 17.8669 KOps/s 17.6458 KOps/s $\color{#35bf28}+1.25\%$
test_getitem[tuple] 0.1697ms 24.7839μs 40.3487 KOps/s 39.9589 KOps/s $\color{#35bf28}+0.98\%$
test_getitem[list] 0.1874ms 50.7689μs 19.6971 KOps/s 19.4779 KOps/s $\color{#35bf28}+1.13\%$
test_setitem_dim[int] 71.1030μs 30.5241μs 32.7610 KOps/s 31.3610 KOps/s $\color{#35bf28}+4.46\%$
test_setitem_dim[slice_int] 0.1311ms 59.2639μs 16.8737 KOps/s 15.6985 KOps/s $\textbf{\color{#35bf28}+7.49\%}$
test_setitem_dim[range] 0.1350ms 81.3589μs 12.2912 KOps/s 12.1088 KOps/s $\color{#35bf28}+1.51\%$
test_setitem_dim[tuple] 90.1980μs 48.0613μs 20.8067 KOps/s 20.8535 KOps/s $\color{#d91a1a}-0.22\%$
test_setitem 95.3290μs 27.8238μs 35.9404 KOps/s 35.6413 KOps/s $\color{#35bf28}+0.84\%$
test_set 91.6620μs 27.3376μs 36.5797 KOps/s 36.6956 KOps/s $\color{#d91a1a}-0.32\%$
test_set_shared 1.2182ms 0.2097ms 4.7677 KOps/s 4.7363 KOps/s $\color{#35bf28}+0.66\%$
test_update 0.1404ms 33.1726μs 30.1453 KOps/s 29.8674 KOps/s $\color{#35bf28}+0.93\%$
test_update_nested 0.1225ms 43.5609μs 22.9564 KOps/s 23.0784 KOps/s $\color{#d91a1a}-0.53\%$
test_update__nested 87.1730μs 33.5559μs 29.8010 KOps/s 30.0184 KOps/s $\color{#d91a1a}-0.72\%$
test_set_nested 88.1550μs 29.4190μs 33.9917 KOps/s 33.2142 KOps/s $\color{#35bf28}+2.34\%$
test_set_nested_new 90.3190μs 34.3570μs 29.1062 KOps/s 28.0890 KOps/s $\color{#35bf28}+3.62\%$
test_select 0.1139ms 52.0571μs 19.2097 KOps/s 19.3495 KOps/s $\color{#d91a1a}-0.72\%$
test_select_nested 0.8992ms 60.4556μs 16.5411 KOps/s 17.0293 KOps/s $\color{#d91a1a}-2.87\%$
test_exclude_nested 0.1516ms 76.3321μs 13.1007 KOps/s 13.4080 KOps/s $\color{#d91a1a}-2.29\%$
test_empty[True] 0.4827ms 0.3208ms 3.1172 KOps/s 3.1000 KOps/s $\color{#35bf28}+0.56\%$
test_empty[False] 7.0858μs 1.3395μs 746.5234 KOps/s 832.6325 KOps/s $\textbf{\color{#d91a1a}-10.34\%}$
test_unbind_speed 0.4969ms 0.2919ms 3.4253 KOps/s 3.4034 KOps/s $\color{#35bf28}+0.64\%$
test_unbind_speed_stack0 0.4814ms 0.2878ms 3.4747 KOps/s 3.4668 KOps/s $\color{#35bf28}+0.23\%$
test_unbind_speed_stack1 87.8403ms 0.7778ms 1.2857 KOps/s 1.3982 KOps/s $\textbf{\color{#d91a1a}-8.04\%}$
test_split 80.1131ms 2.0862ms 479.3439 Ops/s 455.9268 Ops/s $\textbf{\color{#35bf28}+5.14\%}$
test_chunk 2.1462ms 1.9449ms 514.1628 Ops/s 452.0830 Ops/s $\textbf{\color{#35bf28}+13.73\%}$
test_creation[device0] 0.2275ms 0.1132ms 8.8370 KOps/s 8.4444 KOps/s $\color{#35bf28}+4.65\%$
test_creation_from_tensor 3.0019ms 0.1139ms 8.7767 KOps/s 8.3910 KOps/s $\color{#35bf28}+4.60\%$
test_add_one[memmap_tensor0] 0.1911ms 7.1756μs 139.3621 KOps/s 130.6714 KOps/s $\textbf{\color{#35bf28}+6.65\%}$
test_contiguous[memmap_tensor0] 15.9100μs 1.9047μs 525.0103 KOps/s 535.1350 KOps/s $\color{#d91a1a}-1.89\%$
test_stack[memmap_tensor0] 45.0840μs 5.6504μs 176.9784 KOps/s 174.3331 KOps/s $\color{#35bf28}+1.52\%$
test_memmaptd_index 1.1795ms 0.3976ms 2.5148 KOps/s 2.4516 KOps/s $\color{#35bf28}+2.58\%$
test_memmaptd_index_astensor 0.7525ms 0.4734ms 2.1124 KOps/s 2.0781 KOps/s $\color{#35bf28}+1.65\%$
test_memmaptd_index_op 1.5519ms 0.9579ms 1.0440 KOps/s 992.8613 Ops/s $\textbf{\color{#35bf28}+5.15\%}$
test_serialize_model 0.1198s 0.1159s 8.6315 Ops/s 8.2854 Ops/s $\color{#35bf28}+4.18\%$
test_serialize_model_pickle 0.4294s 0.3942s 2.5371 Ops/s 2.5328 Ops/s $\color{#35bf28}+0.17\%$
test_serialize_weights 0.1217s 0.1178s 8.4874 Ops/s 8.7798 Ops/s $\color{#d91a1a}-3.33\%$
test_serialize_weights_returnearly 0.1722s 0.1607s 6.2209 Ops/s 6.3588 Ops/s $\color{#d91a1a}-2.17\%$
test_serialize_weights_pickle 0.5921s 0.4528s 2.2085 Ops/s 2.5150 Ops/s $\textbf{\color{#d91a1a}-12.19\%}$
test_serialize_weights_filesystem 0.1487s 0.1415s 7.0664 Ops/s 7.1268 Ops/s $\color{#d91a1a}-0.85\%$
test_serialize_model_filesystem 0.1526s 0.1459s 6.8548 Ops/s 6.1143 Ops/s $\textbf{\color{#35bf28}+12.11\%}$
test_reshape_pytree 86.0110μs 39.0247μs 25.6248 KOps/s 25.7116 KOps/s $\color{#d91a1a}-0.34\%$
test_reshape_td 90.1980μs 44.6570μs 22.3929 KOps/s 21.0138 KOps/s $\textbf{\color{#35bf28}+6.56\%}$
test_view_pytree 79.1880μs 38.7810μs 25.7858 KOps/s 25.8831 KOps/s $\color{#d91a1a}-0.38\%$
test_view_td 0.1035ms 50.2697μs 19.8927 KOps/s 18.8374 KOps/s $\textbf{\color{#35bf28}+5.60\%}$
test_unbind_pytree 77.2650μs 35.4539μs 28.2057 KOps/s 27.8166 KOps/s $\color{#35bf28}+1.40\%$
test_unbind_td 0.3275ms 44.3230μs 22.5616 KOps/s 22.5629 KOps/s $-0.01\%$
test_split_pytree 82.0730μs 38.0027μs 26.3139 KOps/s 26.1198 KOps/s $\color{#35bf28}+0.74\%$
test_split_td 0.4735ms 56.3917μs 17.7331 KOps/s 17.4787 KOps/s $\color{#35bf28}+1.46\%$
test_add_pytree 0.1079ms 43.6912μs 22.8879 KOps/s 22.1524 KOps/s $\color{#35bf28}+3.32\%$
test_add_td 0.1394ms 73.2050μs 13.6603 KOps/s 13.0878 KOps/s $\color{#35bf28}+4.37\%$
test_compile_add_one_nested[tensordict-compile] 0.1222ms 58.0467μs 17.2275 KOps/s 17.7208 KOps/s $\color{#d91a1a}-2.78\%$
test_compile_add_one_nested[tensordict-eager] 0.3206ms 0.1706ms 5.8608 KOps/s 5.6882 KOps/s $\color{#35bf28}+3.03\%$
test_compile_add_one_nested[pytree-compile] 0.1077ms 57.0637μs 17.5243 KOps/s 17.8860 KOps/s $\color{#d91a1a}-2.02\%$
test_compile_add_one_nested[pytree-eager] 0.2763ms 0.1367ms 7.3167 KOps/s 7.1119 KOps/s $\color{#35bf28}+2.88\%$
test_compile_copy_nested[tensordict-compile] 64.9520μs 21.4095μs 46.7083 KOps/s 48.5429 KOps/s $\color{#d91a1a}-3.78\%$
test_compile_copy_nested[tensordict-eager] 0.1138ms 67.2424μs 14.8716 KOps/s 15.1742 KOps/s $\color{#d91a1a}-1.99\%$
test_compile_copy_nested[pytree-compile] 0.1276ms 75.7485μs 13.2016 KOps/s 13.1135 KOps/s $\color{#35bf28}+0.67\%$
test_compile_copy_nested[pytree-eager] 0.1289ms 69.1327μs 14.4649 KOps/s 14.9969 KOps/s $\color{#d91a1a}-3.55\%$
test_compile_add_one_flat[tensordict-compile] 0.2513ms 0.1729ms 5.7825 KOps/s 5.8513 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_add_one_flat[tensordict-eager] 0.3814ms 0.1857ms 5.3858 KOps/s 5.2910 KOps/s $\color{#35bf28}+1.79\%$
test_compile_add_one_flat[tensorclass-compile] 97.9450μs 46.9874μs 21.2823 KOps/s 20.9877 KOps/s $\color{#35bf28}+1.40\%$
test_compile_add_one_flat[tensorclass-eager] 0.1414ms 65.8634μs 15.1829 KOps/s 14.5918 KOps/s $\color{#35bf28}+4.05\%$
test_compile_add_one_flat[pytree-compile] 0.2639ms 0.1730ms 5.7805 KOps/s 5.8459 KOps/s $\color{#d91a1a}-1.12\%$
test_compile_add_one_flat[pytree-eager] 0.3954ms 0.2891ms 3.4587 KOps/s 3.4065 KOps/s $\color{#35bf28}+1.53\%$
test_compile_add_self_flat[tensordict-eager] 0.3888ms 0.2001ms 4.9967 KOps/s 4.9910 KOps/s $\color{#35bf28}+0.12\%$
test_compile_add_self_flat[tensordict-compile] 0.3364ms 0.1752ms 5.7067 KOps/s 5.8111 KOps/s $\color{#d91a1a}-1.80\%$
test_compile_add_self_flat[tensorclass-eager] 0.1796ms 60.1984μs 16.6117 KOps/s 16.1924 KOps/s $\color{#35bf28}+2.59\%$
test_compile_add_self_flat[tensorclass-compile] 0.1384ms 48.1582μs 20.7649 KOps/s 21.0020 KOps/s $\color{#d91a1a}-1.13\%$
test_compile_add_self_flat[pytree-eager] 0.3423ms 0.2402ms 4.1635 KOps/s 4.2121 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_add_self_flat[pytree-compile] 0.2933ms 0.1720ms 5.8124 KOps/s 5.7441 KOps/s $\color{#35bf28}+1.19\%$
test_compile_copy_flat[tensordict-compile] 0.2249ms 0.1035ms 9.6638 KOps/s 9.8710 KOps/s $\color{#d91a1a}-2.10\%$
test_compile_copy_flat[tensordict-eager] 0.1299ms 58.4656μs 17.1041 KOps/s 17.3154 KOps/s $\color{#d91a1a}-1.22\%$
test_compile_copy_flat[pytree-compile] 0.1542ms 80.6134μs 12.4049 KOps/s 12.9440 KOps/s $\color{#d91a1a}-4.16\%$
test_compile_copy_flat[pytree-eager] 0.1353ms 78.7842μs 12.6929 KOps/s 14.5532 KOps/s $\textbf{\color{#d91a1a}-12.78\%}$
test_compile_assign_and_add[tensordict-compile] 0.2840ms 0.1899ms 5.2658 KOps/s 5.1123 KOps/s $\color{#35bf28}+3.00\%$
test_compile_assign_and_add[tensordict-eager] 1.7357ms 1.6218ms 616.5918 Ops/s 610.7075 Ops/s $\color{#35bf28}+0.96\%$
test_compile_assign_and_add[pytree-compile] 0.2776ms 0.1886ms 5.3022 KOps/s 5.1900 KOps/s $\color{#35bf28}+2.16\%$
test_compile_assign_and_add[pytree-eager] 1.4440ms 1.0910ms 916.6200 Ops/s 893.0636 Ops/s $\color{#35bf28}+2.64\%$
test_compile_assign_and_add_stack[compile] 0.5487ms 0.4122ms 2.4260 KOps/s 2.4066 KOps/s $\color{#35bf28}+0.80\%$
test_compile_assign_and_add_stack[eager] 3.8387ms 3.5249ms 283.6998 Ops/s 276.8852 Ops/s $\color{#35bf28}+2.46\%$
test_compile_indexing[tensor-tensordict-compile] 91.3810μs 34.7532μs 28.7743 KOps/s 30.0015 KOps/s $\color{#d91a1a}-4.09\%$
test_compile_indexing[tensor-tensordict-eager] 0.7645ms 46.8516μs 21.3440 KOps/s 21.0024 KOps/s $\color{#35bf28}+1.63\%$
test_compile_indexing[tensor-tensorclass-compile] 84.2670μs 29.4543μs 33.9509 KOps/s 34.2161 KOps/s $\color{#d91a1a}-0.77\%$
test_compile_indexing[tensor-tensorclass-eager] 92.7830μs 28.3430μs 35.2820 KOps/s 35.4667 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_indexing[tensor-pytree-compile] 71.5940μs 29.2088μs 34.2363 KOps/s 34.8409 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_indexing[tensor-pytree-eager] 95.0470μs 28.2670μs 35.3769 KOps/s 35.6742 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_indexing[slice-tensordict-compile] 0.1474ms 73.0296μs 13.6931 KOps/s 13.5070 KOps/s $\color{#35bf28}+1.38\%$
test_compile_indexing[slice-tensordict-eager] 0.4709ms 27.4576μs 36.4197 KOps/s 35.9823 KOps/s $\color{#35bf28}+1.22\%$
test_compile_indexing[slice-tensorclass-compile] 0.1476ms 66.6347μs 15.0072 KOps/s 14.5819 KOps/s $\color{#35bf28}+2.92\%$
test_compile_indexing[slice-tensorclass-eager] 65.7530μs 23.1552μs 43.1868 KOps/s 42.8978 KOps/s $\color{#35bf28}+0.67\%$
test_compile_indexing[slice-pytree-compile] 0.1357ms 67.0362μs 14.9173 KOps/s 14.6739 KOps/s $\color{#35bf28}+1.66\%$
test_compile_indexing[slice-pytree-eager] 86.9450μs 23.1157μs 43.2606 KOps/s 41.6036 KOps/s $\color{#35bf28}+3.98\%$
test_compile_indexing[int-tensordict-compile] 0.1586ms 70.7935μs 14.1256 KOps/s 13.7822 KOps/s $\color{#35bf28}+2.49\%$
test_compile_indexing[int-tensordict-eager] 0.6506ms 27.0357μs 36.9881 KOps/s 36.4622 KOps/s $\color{#35bf28}+1.44\%$
test_compile_indexing[int-tensorclass-compile] 0.1662ms 66.7492μs 14.9814 KOps/s 14.7714 KOps/s $\color{#35bf28}+1.42\%$
test_compile_indexing[int-tensorclass-eager] 62.4170μs 22.8793μs 43.7077 KOps/s 43.5567 KOps/s $\color{#35bf28}+0.35\%$
test_compile_indexing[int-pytree-compile] 0.1646ms 66.5728μs 15.0211 KOps/s 14.6847 KOps/s $\color{#35bf28}+2.29\%$
test_compile_indexing[int-pytree-eager] 72.9970μs 22.9173μs 43.6351 KOps/s 42.7220 KOps/s $\color{#35bf28}+2.14\%$
test_mod_add[eager] 95.7090μs 22.7004μs 44.0520 KOps/s 40.1603 KOps/s $\textbf{\color{#35bf28}+9.69\%}$
test_mod_add[compile] 92.9640μs 39.1015μs 25.5745 KOps/s 25.9524 KOps/s $\color{#d91a1a}-1.46\%$
test_mod_add[compile-overhead] 80.8110μs 38.4880μs 25.9821 KOps/s 25.7359 KOps/s $\color{#35bf28}+0.96\%$
test_mod_wrap[eager] 0.3598ms 0.1987ms 5.0323 KOps/s 4.8486 KOps/s $\color{#35bf28}+3.79\%$
test_mod_wrap[compile] 0.3841ms 0.2255ms 4.4350 KOps/s 4.3127 KOps/s $\color{#35bf28}+2.84\%$
test_mod_wrap[compile-overhead] 0.3026ms 0.2273ms 4.3995 KOps/s 4.3015 KOps/s $\color{#35bf28}+2.28\%$
test_mod_wrap_and_backward[eager] 11.9127ms 10.5761ms 94.5529 Ops/s 93.6796 Ops/s $\color{#35bf28}+0.93\%$
test_mod_wrap_and_backward[compile] 11.9070ms 10.6768ms 93.6612 Ops/s 87.2714 Ops/s $\textbf{\color{#35bf28}+7.32\%}$
test_mod_wrap_and_backward[compile-overhead] 12.0791ms 10.5303ms 94.9644 Ops/s 86.0937 Ops/s $\textbf{\color{#35bf28}+10.30\%}$
test_seq_add[eager] 0.1480ms 83.9052μs 11.9182 KOps/s 11.4323 KOps/s $\color{#35bf28}+4.25\%$
test_seq_add[compile] 0.1544ms 62.7324μs 15.9407 KOps/s 15.5633 KOps/s $\color{#35bf28}+2.42\%$
test_seq_add[compile-overhead] 0.1092ms 62.8816μs 15.9029 KOps/s 15.8325 KOps/s $\color{#35bf28}+0.44\%$
test_seq_wrap[eager] 0.4525ms 0.3594ms 2.7827 KOps/s 2.6315 KOps/s $\textbf{\color{#35bf28}+5.74\%}$
test_seq_wrap[compile] 0.7854ms 0.2601ms 3.8444 KOps/s 3.7325 KOps/s $\color{#35bf28}+3.00\%$
test_seq_wrap[compile-overhead] 0.7882ms 0.2633ms 3.7977 KOps/s 3.7006 KOps/s $\color{#35bf28}+2.62\%$
test_func_call_runtime[False-eager] 0.7749ms 0.5044ms 1.9827 KOps/s 1.9013 KOps/s $\color{#35bf28}+4.28\%$
test_func_call_runtime[False-compile] 0.8153ms 0.4856ms 2.0591 KOps/s 1.9741 KOps/s $\color{#35bf28}+4.31\%$
test_func_call_runtime[False-compile-overhead] 0.8480ms 0.4866ms 2.0551 KOps/s 1.9424 KOps/s $\textbf{\color{#35bf28}+5.80\%}$
test_func_call_runtime[True-eager] 0.8752ms 0.7100ms 1.4085 KOps/s 1.3610 KOps/s $\color{#35bf28}+3.49\%$
test_func_call_runtime[True-compile] 0.6026ms 0.4978ms 2.0090 KOps/s 1.9519 KOps/s $\color{#35bf28}+2.93\%$
test_func_call_runtime[True-compile-overhead] 0.7639ms 0.5007ms 1.9972 KOps/s 1.9496 KOps/s $\color{#35bf28}+2.45\%$
test_func_call_cm_runtime[False-eager] 0.7481ms 0.5002ms 1.9994 KOps/s 1.9092 KOps/s $\color{#35bf28}+4.72\%$
test_func_call_cm_runtime[False-compile] 0.7167ms 0.4848ms 2.0629 KOps/s 1.9767 KOps/s $\color{#35bf28}+4.36\%$
test_func_call_cm_runtime[False-compile-overhead] 0.8554ms 0.4913ms 2.0356 KOps/s 1.9873 KOps/s $\color{#35bf28}+2.43\%$
test_func_call_cm_runtime[True-eager] 1.2145ms 0.8534ms 1.1718 KOps/s 1.1544 KOps/s $\color{#35bf28}+1.51\%$
test_func_call_cm_runtime[True-compile] 0.9237ms 0.7276ms 1.3743 KOps/s 1.3648 KOps/s $\color{#35bf28}+0.70\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8870ms 0.7365ms 1.3577 KOps/s 1.3601 KOps/s $\color{#d91a1a}-0.17\%$
test_vmap_func_call_cm_runtime[eager] 2.4093ms 1.8083ms 553.0081 Ops/s 538.8220 Ops/s $\color{#35bf28}+2.63\%$
test_vmap_func_call_cm_runtime[compile] 2.5319ms 1.8497ms 540.6370 Ops/s 523.1486 Ops/s $\color{#35bf28}+3.34\%$
test_vmap_func_call_cm_runtime[compile-overhead] 3.0317ms 1.9109ms 523.3212 Ops/s 525.7149 Ops/s $\color{#d91a1a}-0.46\%$
test_distributed 0.2834ms 0.1236ms 8.0906 KOps/s 7.8919 KOps/s $\color{#35bf28}+2.52\%$
test_tdmodule 0.1148ms 17.1645μs 58.2596 KOps/s 57.4078 KOps/s $\color{#35bf28}+1.48\%$
test_tdmodule_dispatch 50.8650μs 33.2567μs 30.0691 KOps/s 28.6436 KOps/s $\color{#35bf28}+4.98\%$
test_tdseq 39.1740μs 19.1127μs 52.3212 KOps/s 51.5372 KOps/s $\color{#35bf28}+1.52\%$
test_tdseq_dispatch 62.0960μs 37.8960μs 26.3880 KOps/s 25.3695 KOps/s $\color{#35bf28}+4.01\%$
test_instantiation_functorch 2.4955ms 1.5723ms 635.9945 Ops/s 627.2795 Ops/s $\color{#35bf28}+1.39\%$
test_instantiation_td 1.9611ms 1.1562ms 864.8788 Ops/s 860.2185 Ops/s $\color{#35bf28}+0.54\%$
test_exec_functorch 0.4142ms 0.1789ms 5.5910 KOps/s 5.4893 KOps/s $\color{#35bf28}+1.85\%$
test_exec_functional_call 0.3535ms 0.1642ms 6.0894 KOps/s 5.7863 KOps/s $\textbf{\color{#35bf28}+5.24\%}$
test_exec_td 0.2950ms 0.1590ms 6.2889 KOps/s 5.9695 KOps/s $\textbf{\color{#35bf28}+5.35\%}$
test_exec_td_decorator 0.4806ms 0.2160ms 4.6299 KOps/s 4.5789 KOps/s $\color{#35bf28}+1.11\%$
test_vmap_mlp_speed[True-True] 1.0474ms 0.6194ms 1.6144 KOps/s 1.5643 KOps/s $\color{#35bf28}+3.20\%$
test_vmap_mlp_speed[True-False] 0.8244ms 0.6175ms 1.6193 KOps/s 1.5653 KOps/s $\color{#35bf28}+3.45\%$
test_vmap_mlp_speed[False-True] 0.7581ms 0.4803ms 2.0821 KOps/s 2.0131 KOps/s $\color{#35bf28}+3.43\%$
test_vmap_mlp_speed[False-False] 0.6091ms 0.4809ms 2.0792 KOps/s 2.0047 KOps/s $\color{#35bf28}+3.72\%$
test_vmap_mlp_speed_decorator[True-True] 1.1417ms 0.5968ms 1.6756 KOps/s 1.6248 KOps/s $\color{#35bf28}+3.13\%$
test_vmap_mlp_speed_decorator[True-False] 0.7366ms 0.5942ms 1.6828 KOps/s 1.6136 KOps/s $\color{#35bf28}+4.29\%$
test_vmap_mlp_speed_decorator[False-True] 0.6638ms 0.4934ms 2.0268 KOps/s 1.9647 KOps/s $\color{#35bf28}+3.16\%$
test_vmap_mlp_speed_decorator[False-False] 0.8625ms 0.4939ms 2.0249 KOps/s 1.9591 KOps/s $\color{#35bf28}+3.36\%$
test_to_module_speed[True] 1.5920ms 1.2872ms 776.8876 Ops/s 785.3384 Ops/s $\color{#d91a1a}-1.08\%$
test_to_module_speed[False] 1.7721ms 1.2503ms 799.8198 Ops/s 797.9045 Ops/s $\color{#35bf28}+0.24\%$
test_tc_init 81.3120μs 41.7483μs 23.9531 KOps/s 23.3927 KOps/s $\color{#35bf28}+2.40\%$
test_tc_init_nested 0.1570ms 82.7524μs 12.0842 KOps/s 11.7796 KOps/s $\color{#35bf28}+2.59\%$
test_tc_first_layer_tensor 27.4620μs 1.6544μs 604.4509 KOps/s 671.0061 KOps/s $\textbf{\color{#d91a1a}-9.92\%}$
test_tc_first_layer_nontensor 42.3650μs 4.9636μs 201.4685 KOps/s 211.6818 KOps/s $\color{#d91a1a}-4.82\%$
test_tc_second_layer_tensor 20.9490μs 3.1212μs 320.3847 KOps/s 360.8173 KOps/s $\textbf{\color{#d91a1a}-11.21\%}$
test_tc_second_layer_nontensor 28.8340μs 6.3857μs 156.6005 KOps/s 166.4009 KOps/s $\textbf{\color{#d91a1a}-5.89\%}$
test_unbind 0.4682s 13.2944ms 75.2194 Ops/s 70.2796 Ops/s $\textbf{\color{#35bf28}+7.03\%}$
test_full_like 7.7934ms 6.9470ms 143.9469 Ops/s 140.7884 Ops/s $\color{#35bf28}+2.24\%$
test_zeros_like 3.2637ms 2.7209ms 367.5252 Ops/s 358.2339 Ops/s $\color{#35bf28}+2.59\%$
test_ones_like 3.2462ms 3.0484ms 328.0445 Ops/s 323.9736 Ops/s $\color{#35bf28}+1.26\%$
test_clone 5.2198ms 4.8338ms 206.8767 Ops/s 205.9282 Ops/s $\color{#35bf28}+0.46\%$
test_squeeze 60.0320μs 12.3675μs 80.8572 KOps/s 81.1446 KOps/s $\color{#d91a1a}-0.35\%$
test_unsqueeze 0.2994ms 91.9259μs 10.8783 KOps/s 10.7051 KOps/s $\color{#35bf28}+1.62\%$
test_split 0.3956ms 0.1920ms 5.2086 KOps/s 5.1818 KOps/s $\color{#35bf28}+0.52\%$
test_permute 0.3696ms 0.2202ms 4.5404 KOps/s 4.3600 KOps/s $\color{#35bf28}+4.14\%$
test_stack 32.4908ms 26.1378ms 38.2587 Ops/s 39.2567 Ops/s $\color{#d91a1a}-2.54\%$
test_cat 32.6565ms 25.7770ms 38.7943 Ops/s 40.0036 Ops/s $\color{#d91a1a}-3.02\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}11$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.7152ms 14.4303μs 69.2989 KOps/s 71.6481 KOps/s $\color{#d91a1a}-3.28\%$
test_plain_set_stack_nested 36.8010μs 14.2322μs 70.2634 KOps/s 72.3158 KOps/s $\color{#d91a1a}-2.84\%$
test_plain_set_nested_inplace 44.8610μs 15.5709μs 64.2222 KOps/s 67.6029 KOps/s $\textbf{\color{#d91a1a}-5.00\%}$
test_plain_set_stack_nested_inplace 42.8010μs 15.1043μs 66.2062 KOps/s 67.4889 KOps/s $\color{#d91a1a}-1.90\%$
test_items 26.3700μs 2.8877μs 346.3009 KOps/s 342.7384 KOps/s $\color{#35bf28}+1.04\%$
test_items_nested 0.3634ms 0.3288ms 3.0413 KOps/s 3.0479 KOps/s $\color{#d91a1a}-0.22\%$
test_items_nested_locked 0.3773ms 0.3300ms 3.0306 KOps/s 3.0547 KOps/s $\color{#d91a1a}-0.79\%$
test_items_nested_leaf 82.1710μs 55.8027μs 17.9203 KOps/s 17.9370 KOps/s $\color{#d91a1a}-0.09\%$
test_items_stack_nested 0.3936ms 0.3303ms 3.0280 KOps/s 3.0600 KOps/s $\color{#d91a1a}-1.05\%$
test_items_stack_nested_leaf 82.3420μs 57.0793μs 17.5195 KOps/s 17.3830 KOps/s $\color{#35bf28}+0.79\%$
test_items_stack_nested_locked 0.3709ms 0.3296ms 3.0342 KOps/s 3.0351 KOps/s $\color{#d91a1a}-0.03\%$
test_keys 27.0110μs 3.4703μs 288.1608 KOps/s 288.9492 KOps/s $\color{#d91a1a}-0.27\%$
test_keys_nested 83.2320μs 57.2196μs 17.4765 KOps/s 18.4010 KOps/s $\textbf{\color{#d91a1a}-5.02\%}$
test_keys_nested_locked 2.3755ms 62.7595μs 15.9339 KOps/s 16.0313 KOps/s $\color{#d91a1a}-0.61\%$
test_keys_nested_leaf 73.0520μs 48.2488μs 20.7259 KOps/s 21.0022 KOps/s $\color{#d91a1a}-1.32\%$
test_keys_stack_nested 80.0910μs 56.7821μs 17.6112 KOps/s 18.2897 KOps/s $\color{#d91a1a}-3.71\%$
test_keys_stack_nested_leaf 78.8620μs 48.9296μs 20.4375 KOps/s 20.7743 KOps/s $\color{#d91a1a}-1.62\%$
test_keys_stack_nested_locked 0.1466ms 61.5520μs 16.2464 KOps/s 16.3994 KOps/s $\color{#d91a1a}-0.93\%$
test_values 5.5818μs 0.8543μs 1.1706 MOps/s 1.1834 MOps/s $\color{#d91a1a}-1.08\%$
test_values_nested 78.7320μs 40.6636μs 24.5920 KOps/s 24.5986 KOps/s $\color{#d91a1a}-0.03\%$
test_values_nested_locked 71.3020μs 42.6493μs 23.4471 KOps/s 23.4119 KOps/s $\color{#35bf28}+0.15\%$
test_values_nested_leaf 66.6410μs 35.2801μs 28.3446 KOps/s 28.2926 KOps/s $\color{#35bf28}+0.18\%$
test_values_stack_nested 69.1810μs 41.8267μs 23.9082 KOps/s 24.1471 KOps/s $\color{#d91a1a}-0.99\%$
test_values_stack_nested_leaf 60.7510μs 35.8016μs 27.9317 KOps/s 28.2090 KOps/s $\color{#d91a1a}-0.98\%$
test_values_stack_nested_locked 72.8120μs 43.5549μs 22.9595 KOps/s 23.1174 KOps/s $\color{#d91a1a}-0.68\%$
test_membership 1.6245μs 0.5031μs 1.9878 MOps/s 1.9882 MOps/s $\color{#d91a1a}-0.02\%$
test_membership_nested 14.4255μs 1.8695μs 534.8949 KOps/s 524.7493 KOps/s $\color{#35bf28}+1.93\%$
test_membership_nested_leaf 14.7705μs 1.8772μs 532.7023 KOps/s 537.8493 KOps/s $\color{#d91a1a}-0.96\%$
test_membership_stacked_nested 31.9010μs 1.9592μs 510.4136 KOps/s 510.5878 KOps/s $\color{#d91a1a}-0.03\%$
test_membership_stacked_nested_leaf 28.4800μs 1.9358μs 516.5820 KOps/s 515.4087 KOps/s $\color{#35bf28}+0.23\%$
test_membership_nested_last 32.3010μs 2.7707μs 360.9139 KOps/s 357.0603 KOps/s $\color{#35bf28}+1.08\%$
test_membership_nested_leaf_last 38.8810μs 2.7514μs 363.4546 KOps/s 361.7695 KOps/s $\color{#35bf28}+0.47\%$
test_membership_stacked_nested_last 30.4300μs 5.8477μs 171.0071 KOps/s 128.0457 KOps/s $\textbf{\color{#35bf28}+33.55\%}$
test_membership_stacked_nested_leaf_last 28.8400μs 5.7744μs 173.1775 KOps/s 129.1121 KOps/s $\textbf{\color{#35bf28}+34.13\%}$
test_nested_getleaf 37.6510μs 6.0918μs 164.1562 KOps/s 163.2735 KOps/s $\color{#35bf28}+0.54\%$
test_nested_get 35.9610μs 5.7503μs 173.9035 KOps/s 172.4364 KOps/s $\color{#35bf28}+0.85\%$
test_stacked_getleaf 49.9810μs 6.0377μs 165.6266 KOps/s 166.2651 KOps/s $\color{#d91a1a}-0.38\%$
test_stacked_get 34.7800μs 5.6648μs 176.5298 KOps/s 176.8914 KOps/s $\color{#d91a1a}-0.20\%$
test_nested_getitemleaf 32.1910μs 6.1484μs 162.6429 KOps/s 159.7124 KOps/s $\color{#35bf28}+1.83\%$
test_nested_getitem 34.8310μs 5.7767μs 173.1084 KOps/s 172.6420 KOps/s $\color{#35bf28}+0.27\%$
test_stacked_getitemleaf 32.0510μs 6.0756μs 164.5929 KOps/s 165.0092 KOps/s $\color{#d91a1a}-0.25\%$
test_stacked_getitem 37.1710μs 5.7188μs 174.8608 KOps/s 175.3851 KOps/s $\color{#d91a1a}-0.30\%$
test_lock_nested 3.0245ms 0.4162ms 2.4030 KOps/s 2.4135 KOps/s $\color{#d91a1a}-0.44\%$
test_lock_stack_nested 0.4104ms 0.3786ms 2.6415 KOps/s 2.6956 KOps/s $\color{#d91a1a}-2.01\%$
test_unlock_nested 0.7368ms 0.3549ms 2.8179 KOps/s 2.8517 KOps/s $\color{#d91a1a}-1.19\%$
test_unlock_stack_nested 0.3527ms 0.3174ms 3.1504 KOps/s 3.2109 KOps/s $\color{#d91a1a}-1.89\%$
test_flatten_speed 0.1045ms 70.1808μs 14.2489 KOps/s 14.5490 KOps/s $\color{#d91a1a}-2.06\%$
test_unflatten_speed 0.3403ms 0.2824ms 3.5405 KOps/s 3.5457 KOps/s $\color{#d91a1a}-0.15\%$
test_common_ops 1.5310ms 1.2783ms 782.3191 Ops/s 829.3586 Ops/s $\textbf{\color{#d91a1a}-5.67\%}$
test_creation 29.1510μs 1.4608μs 684.5661 KOps/s 665.2939 KOps/s $\color{#35bf28}+2.90\%$
test_creation_empty 46.7910μs 15.5105μs 64.4726 KOps/s 65.5291 KOps/s $\color{#d91a1a}-1.61\%$
test_creation_nested_1 52.4920μs 17.2779μs 57.8776 KOps/s 59.0686 KOps/s $\color{#d91a1a}-2.02\%$
test_creation_nested_2 53.4510μs 20.0469μs 49.8830 KOps/s 51.5416 KOps/s $\color{#d91a1a}-3.22\%$
test_clone 70.7920μs 29.0272μs 34.4504 KOps/s 35.3712 KOps/s $\color{#d91a1a}-2.60\%$
test_getitem[int] 92.4424ms 23.0400μs 43.4027 KOps/s 64.5185 KOps/s $\textbf{\color{#d91a1a}-32.73\%}$
test_getitem[slice_int] 0.1401ms 26.9917μs 37.0485 KOps/s 37.3940 KOps/s $\color{#d91a1a}-0.92\%$
test_getitem[range] 0.2238ms 0.1046ms 9.5615 KOps/s 9.3411 KOps/s $\color{#35bf28}+2.36\%$
test_getitem[tuple] 0.1185ms 23.5245μs 42.5088 KOps/s 43.8041 KOps/s $\color{#d91a1a}-2.96\%$
test_getitem[list] 0.1917ms 94.4039μs 10.5928 KOps/s 10.5668 KOps/s $\color{#35bf28}+0.25\%$
test_setitem_dim[int] 65.5620μs 43.4919μs 22.9928 KOps/s 23.3962 KOps/s $\color{#d91a1a}-1.72\%$
test_setitem_dim[slice_int] 0.1021ms 70.8911μs 14.1061 KOps/s 15.1574 KOps/s $\textbf{\color{#d91a1a}-6.94\%}$
test_setitem_dim[range] 0.1574ms 0.1303ms 7.6757 KOps/s 8.0420 KOps/s $\color{#d91a1a}-4.56\%$
test_setitem_dim[tuple] 0.1020ms 63.0167μs 15.8688 KOps/s 16.9642 KOps/s $\textbf{\color{#d91a1a}-6.46\%}$
test_setitem 97.8120μs 42.7214μs 23.4075 KOps/s 24.5513 KOps/s $\color{#d91a1a}-4.66\%$
test_set 87.5220μs 42.1295μs 23.7363 KOps/s 25.2259 KOps/s $\textbf{\color{#d91a1a}-5.90\%}$
test_set_shared 0.3433ms 49.4642μs 20.2166 KOps/s 19.9085 KOps/s $\color{#35bf28}+1.55\%$
test_update 0.2001ms 47.9177μs 20.8691 KOps/s 20.7071 KOps/s $\color{#35bf28}+0.78\%$
test_update_nested 0.2264ms 57.4320μs 17.4119 KOps/s 18.3823 KOps/s $\textbf{\color{#d91a1a}-5.28\%}$
test_update__nested 0.2182ms 61.9398μs 16.1447 KOps/s 17.7892 KOps/s $\textbf{\color{#d91a1a}-9.24\%}$
test_set_nested 82.9420μs 44.8074μs 22.3177 KOps/s 23.7418 KOps/s $\textbf{\color{#d91a1a}-6.00\%}$
test_set_nested_new 0.5128ms 48.0956μs 20.7919 KOps/s 21.9306 KOps/s $\textbf{\color{#d91a1a}-5.19\%}$
test_select 95.9620μs 61.5451μs 16.2483 KOps/s 16.9907 KOps/s $\color{#d91a1a}-4.37\%$
test_select_nested 0.1137ms 41.9956μs 23.8120 KOps/s 23.3760 KOps/s $\color{#35bf28}+1.87\%$
test_exclude_nested 89.8720μs 58.6969μs 17.0367 KOps/s 16.6986 KOps/s $\color{#35bf28}+2.02\%$
test_empty[True] 0.3463ms 0.2420ms 4.1315 KOps/s 4.0985 KOps/s $\color{#35bf28}+0.81\%$
test_empty[False] 3.4331μs 0.7352μs 1.3601 MOps/s 1.3350 MOps/s $\color{#35bf28}+1.88\%$
test_to 51.2310μs 25.6299μs 39.0170 KOps/s 41.0437 KOps/s $\color{#d91a1a}-4.94\%$
test_to_nonblocking 50.9020μs 23.6661μs 42.2545 KOps/s 43.5770 KOps/s $\color{#d91a1a}-3.03\%$
test_unbind_speed 1.6810ms 0.2731ms 3.6620 KOps/s 3.6007 KOps/s $\color{#35bf28}+1.70\%$
test_unbind_speed_stack0 0.3196ms 0.2713ms 3.6862 KOps/s 3.7162 KOps/s $\color{#d91a1a}-0.81\%$
test_unbind_speed_stack1 92.1980ms 0.6977ms 1.4334 KOps/s 1.4397 KOps/s $\color{#d91a1a}-0.44\%$
test_split 93.4584ms 2.1708ms 460.6538 Ops/s 457.0222 Ops/s $\color{#35bf28}+0.79\%$
test_chunk 96.8297ms 2.1842ms 457.8237 Ops/s 460.4273 Ops/s $\color{#d91a1a}-0.57\%$
test_creation[device0] 0.3478ms 0.1283ms 7.7934 KOps/s 7.9968 KOps/s $\color{#d91a1a}-2.54\%$
test_creation_from_tensor 0.3417ms 0.1314ms 7.6099 KOps/s 7.8831 KOps/s $\color{#d91a1a}-3.47\%$
test_add_one[memmap_tensor0] 0.2231ms 8.9960μs 111.1599 KOps/s 120.7129 KOps/s $\textbf{\color{#d91a1a}-7.91\%}$
test_contiguous[memmap_tensor0] 30.8600μs 2.2462μs 445.1934 KOps/s 448.7285 KOps/s $\color{#d91a1a}-0.79\%$
test_stack[memmap_tensor0] 38.3110μs 6.6114μs 151.2533 KOps/s 154.1929 KOps/s $\color{#d91a1a}-1.91\%$
test_memmaptd_index 1.1695ms 0.4189ms 2.3869 KOps/s 2.4407 KOps/s $\color{#d91a1a}-2.20\%$
test_memmaptd_index_astensor 0.7274ms 0.4746ms 2.1071 KOps/s 2.1301 KOps/s $\color{#d91a1a}-1.08\%$
test_memmaptd_index_op 1.4428ms 0.9987ms 1.0013 KOps/s 1.0091 KOps/s $\color{#d91a1a}-0.77\%$
test_serialize_model 0.1298s 0.1289s 7.7605 Ops/s 7.7489 Ops/s $\color{#35bf28}+0.15\%$
test_serialize_model_pickle 1.3514s 1.2119s 0.8252 Ops/s 0.8240 Ops/s $\color{#35bf28}+0.14\%$
test_serialize_weights 0.1294s 0.1283s 7.7919 Ops/s 7.0506 Ops/s $\textbf{\color{#35bf28}+10.51\%}$
test_serialize_weights_returnearly 52.8414ms 45.6327ms 21.9141 Ops/s 17.8897 Ops/s $\textbf{\color{#35bf28}+22.50\%}$
test_serialize_weights_pickle 1.3567s 1.2139s 0.8238 Ops/s 0.8214 Ops/s $\color{#35bf28}+0.29\%$
test_reshape_pytree 81.0420μs 35.8936μs 27.8601 KOps/s 27.9737 KOps/s $\color{#d91a1a}-0.41\%$
test_reshape_td 73.1610μs 42.0521μs 23.7800 KOps/s 22.8824 KOps/s $\color{#35bf28}+3.92\%$
test_view_pytree 70.5420μs 35.0520μs 28.5291 KOps/s 28.8959 KOps/s $\color{#d91a1a}-1.27\%$
test_view_td 77.7610μs 44.7820μs 22.3304 KOps/s 22.3355 KOps/s $\color{#d91a1a}-0.02\%$
test_unbind_pytree 68.9220μs 33.6096μs 29.7534 KOps/s 29.7208 KOps/s $\color{#35bf28}+0.11\%$
test_unbind_td 0.5132ms 41.7144μs 23.9725 KOps/s 23.6307 KOps/s $\color{#35bf28}+1.45\%$
test_split_pytree 76.6210μs 44.8460μs 22.2985 KOps/s 22.6682 KOps/s $\color{#d91a1a}-1.63\%$
test_split_td 0.6844ms 55.5935μs 17.9877 KOps/s 17.4825 KOps/s $\color{#35bf28}+2.89\%$
test_add_pytree 84.6220μs 54.2762μs 18.4243 KOps/s 16.6261 KOps/s $\textbf{\color{#35bf28}+10.82\%}$
test_add_td 0.2354ms 89.7660μs 11.1401 KOps/s 10.8579 KOps/s $\color{#35bf28}+2.60\%$
test_compile_add_one_nested[tensordict-compile] 0.4368ms 0.2132ms 4.6912 KOps/s 4.6877 KOps/s $\color{#35bf28}+0.07\%$
test_compile_add_one_nested[tensordict-eager] 0.2733ms 0.1469ms 6.8060 KOps/s 6.7829 KOps/s $\color{#35bf28}+0.34\%$
test_compile_add_one_nested[pytree-compile] 0.1900ms 0.1483ms 6.7410 KOps/s 6.9701 KOps/s $\color{#d91a1a}-3.29\%$
test_compile_add_one_nested[pytree-eager] 0.2602ms 0.1834ms 5.4540 KOps/s 5.6462 KOps/s $\color{#d91a1a}-3.40\%$
test_compile_copy_nested[tensordict-compile] 56.0810μs 21.5134μs 46.4827 KOps/s 46.1458 KOps/s $\color{#35bf28}+0.73\%$
test_compile_copy_nested[tensordict-eager] 88.5720μs 43.0721μs 23.2169 KOps/s 22.5560 KOps/s $\color{#35bf28}+2.93\%$
test_compile_copy_nested[pytree-compile] 0.1810ms 63.1201μs 15.8428 KOps/s 15.6484 KOps/s $\color{#35bf28}+1.24\%$
test_compile_copy_nested[pytree-eager] 0.1011ms 48.5146μs 20.6123 KOps/s 20.1351 KOps/s $\color{#35bf28}+2.37\%$
test_compile_add_one_flat[tensordict-compile] 0.4005ms 0.3176ms 3.1486 KOps/s 3.1463 KOps/s $\color{#35bf28}+0.07\%$
test_compile_add_one_flat[tensordict-eager] 0.2516ms 0.2029ms 4.9283 KOps/s 4.8245 KOps/s $\color{#35bf28}+2.15\%$
test_compile_add_one_flat[tensorclass-compile] 0.1761ms 0.1260ms 7.9389 KOps/s 7.8949 KOps/s $\color{#35bf28}+0.56\%$
test_compile_add_one_flat[tensorclass-eager] 0.1050ms 60.2547μs 16.5962 KOps/s 17.0544 KOps/s $\color{#d91a1a}-2.69\%$
test_compile_add_one_flat[pytree-compile] 0.3667ms 0.3165ms 3.1591 KOps/s 3.1514 KOps/s $\color{#35bf28}+0.24\%$
test_compile_add_one_flat[pytree-eager] 0.7203ms 0.6470ms 1.5456 KOps/s 1.6430 KOps/s $\textbf{\color{#d91a1a}-5.93\%}$
test_compile_add_self_flat[tensordict-eager] 0.3628ms 0.2395ms 4.1757 KOps/s 4.0825 KOps/s $\color{#35bf28}+2.28\%$
test_compile_add_self_flat[tensordict-compile] 0.4051ms 0.3184ms 3.1403 KOps/s 3.1104 KOps/s $\color{#35bf28}+0.96\%$
test_compile_add_self_flat[tensorclass-eager] 0.1276ms 69.3523μs 14.4191 KOps/s 14.3757 KOps/s $\color{#35bf28}+0.30\%$
test_compile_add_self_flat[tensorclass-compile] 0.1849ms 0.1271ms 7.8676 KOps/s 7.8429 KOps/s $\color{#35bf28}+0.31\%$
test_compile_add_self_flat[pytree-eager] 0.5996ms 0.5046ms 1.9816 KOps/s 1.9433 KOps/s $\color{#35bf28}+1.97\%$
test_compile_add_self_flat[pytree-compile] 0.3924ms 0.3177ms 3.1472 KOps/s 3.1448 KOps/s $\color{#35bf28}+0.07\%$
test_compile_copy_flat[tensordict-compile] 60.9610μs 17.8274μs 56.0934 KOps/s 55.9822 KOps/s $\color{#35bf28}+0.20\%$
test_compile_copy_flat[tensordict-eager] 52.6310μs 27.0118μs 37.0209 KOps/s 37.0391 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_copy_flat[pytree-compile] 0.1011ms 69.1824μs 14.4546 KOps/s 14.4575 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_copy_flat[pytree-eager] 76.7610μs 51.5031μs 19.4163 KOps/s 19.5777 KOps/s $\color{#d91a1a}-0.82\%$
test_compile_assign_and_add[tensordict-compile] 2.3051ms 0.8066ms 1.2397 KOps/s 1.1338 KOps/s $\textbf{\color{#35bf28}+9.34\%}$
test_compile_assign_and_add[tensordict-eager] 3.1547ms 3.0486ms 328.0213 Ops/s 325.7570 Ops/s $\color{#35bf28}+0.70\%$
test_compile_assign_and_add[pytree-compile] 2.2665ms 0.8033ms 1.2448 KOps/s 1.1095 KOps/s $\textbf{\color{#35bf28}+12.20\%}$
test_compile_assign_and_add[pytree-eager] 3.2092ms 3.0235ms 330.7435 Ops/s 319.1284 Ops/s $\color{#35bf28}+3.64\%$
test_compile_indexing[tensor-tensordict-compile] 0.1498ms 0.1077ms 9.2878 KOps/s 9.0344 KOps/s $\color{#35bf28}+2.81\%$
test_compile_indexing[tensor-tensordict-eager] 0.1895ms 58.8945μs 16.9795 KOps/s 16.2570 KOps/s $\color{#35bf28}+4.44\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1323ms 0.1013ms 9.8672 KOps/s 9.3049 KOps/s $\textbf{\color{#35bf28}+6.04\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.1137ms 40.8208μs 24.4973 KOps/s 24.0113 KOps/s $\color{#35bf28}+2.02\%$
test_compile_indexing[tensor-pytree-compile] 0.2923ms 0.1016ms 9.8403 KOps/s 9.7025 KOps/s $\color{#35bf28}+1.42\%$
test_compile_indexing[tensor-pytree-eager] 0.2004ms 42.5399μs 23.5074 KOps/s 24.1687 KOps/s $\color{#d91a1a}-2.74\%$
test_compile_indexing[slice-tensordict-compile] 0.1764ms 0.1354ms 7.3849 KOps/s 7.3847 KOps/s $+0.00\%$
test_compile_indexing[slice-tensordict-eager] 0.1581ms 25.0339μs 39.9458 KOps/s 40.6702 KOps/s $\color{#d91a1a}-1.78\%$
test_compile_indexing[slice-tensorclass-compile] 0.2567ms 0.1291ms 7.7486 KOps/s 7.6885 KOps/s $\color{#35bf28}+0.78\%$
test_compile_indexing[slice-tensorclass-eager] 49.7310μs 20.2018μs 49.5004 KOps/s 48.6656 KOps/s $\color{#35bf28}+1.72\%$
test_compile_indexing[slice-pytree-compile] 0.1952ms 0.1297ms 7.7079 KOps/s 7.6586 KOps/s $\color{#35bf28}+0.64\%$
test_compile_indexing[slice-pytree-eager] 53.1110μs 20.3574μs 49.1221 KOps/s 49.2701 KOps/s $\color{#d91a1a}-0.30\%$
test_compile_indexing[int-tensordict-compile] 0.1818ms 0.1398ms 7.1520 KOps/s 7.3509 KOps/s $\color{#d91a1a}-2.71\%$
test_compile_indexing[int-tensordict-eager] 0.5074ms 24.4678μs 40.8700 KOps/s 40.6237 KOps/s $\color{#35bf28}+0.61\%$
test_compile_indexing[int-tensorclass-compile] 0.1839ms 0.1311ms 7.6257 KOps/s 7.6555 KOps/s $\color{#d91a1a}-0.39\%$
test_compile_indexing[int-tensorclass-eager] 58.2110μs 23.0210μs 43.4387 KOps/s 48.9655 KOps/s $\textbf{\color{#d91a1a}-11.29\%}$
test_compile_indexing[int-pytree-compile] 0.1789ms 0.1336ms 7.4859 KOps/s 7.6594 KOps/s $\color{#d91a1a}-2.27\%$
test_compile_indexing[int-pytree-eager] 56.5910μs 20.4134μs 48.9875 KOps/s 48.7205 KOps/s $\color{#35bf28}+0.55\%$
test_mod_add[eager] 69.8910μs 31.1505μs 32.1022 KOps/s 32.1647 KOps/s $\color{#d91a1a}-0.19\%$
test_mod_add[compile] 0.3646ms 69.1033μs 14.4711 KOps/s 13.8159 KOps/s $\color{#35bf28}+4.74\%$
test_mod_add[compile-overhead] 0.2567ms 0.1328ms 7.5311 KOps/s 7.1752 KOps/s $\color{#35bf28}+4.96\%$
test_mod_wrap[eager] 0.3127ms 0.2409ms 4.1503 KOps/s 4.1811 KOps/s $\color{#d91a1a}-0.74\%$
test_mod_wrap[compile] 0.4519ms 0.2897ms 3.4515 KOps/s 3.3656 KOps/s $\color{#35bf28}+2.55\%$
test_mod_wrap[compile-overhead] 7.4792ms 4.0720ms 245.5805 Ops/s 247.6337 Ops/s $\color{#d91a1a}-0.83\%$
test_mod_wrap_and_backward[eager] 1.3904ms 1.2901ms 775.1626 Ops/s 704.4944 Ops/s $\textbf{\color{#35bf28}+10.03\%}$
test_mod_wrap_and_backward[compile] 1.5284ms 1.2832ms 779.3267 Ops/s 711.2937 Ops/s $\textbf{\color{#35bf28}+9.56\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3279ms 0.9023ms 1.1083 KOps/s 987.3861 Ops/s $\textbf{\color{#35bf28}+12.25\%}$
test_seq_add[eager] 0.2422ms 96.7177μs 10.3394 KOps/s 10.4111 KOps/s $\color{#d91a1a}-0.69\%$
test_seq_add[compile] 0.6088ms 82.5442μs 12.1147 KOps/s 12.4195 KOps/s $\color{#d91a1a}-2.45\%$
test_seq_add[compile-overhead] 0.1529ms 0.1151ms 8.6892 KOps/s 8.8033 KOps/s $\color{#d91a1a}-1.30\%$
test_seq_wrap[eager] 0.5676ms 0.3702ms 2.7014 KOps/s 2.6231 KOps/s $\color{#35bf28}+2.99\%$
test_seq_wrap[compile] 0.3586ms 0.3083ms 3.2435 KOps/s 3.1425 KOps/s $\color{#35bf28}+3.21\%$
test_seq_wrap[compile-overhead] 0.3051ms 0.2187ms 4.5734 KOps/s 4.5224 KOps/s $\color{#35bf28}+1.13\%$
test_func_call_runtime[False-eager] 0.8006ms 0.7010ms 1.4265 KOps/s 1.3667 KOps/s $\color{#35bf28}+4.38\%$
test_func_call_runtime[False-compile] 0.9537ms 0.7712ms 1.2966 KOps/s 1.2722 KOps/s $\color{#35bf28}+1.92\%$
test_func_call_runtime[False-compile-overhead] 0.4076ms 0.3580ms 2.7934 KOps/s 2.7831 KOps/s $\color{#35bf28}+0.37\%$
test_func_call_runtime[True-eager] 0.9988ms 0.8765ms 1.1409 KOps/s 1.1317 KOps/s $\color{#35bf28}+0.81\%$
test_func_call_runtime[True-compile] 0.8842ms 0.8092ms 1.2358 KOps/s 1.2226 KOps/s $\color{#35bf28}+1.08\%$
test_func_call_runtime[True-compile-overhead] 0.5178ms 0.3934ms 2.5422 KOps/s 2.5497 KOps/s $\color{#d91a1a}-0.30\%$
test_func_call_cm_runtime[False-eager] 0.7542ms 0.7025ms 1.4236 KOps/s 1.3786 KOps/s $\color{#35bf28}+3.26\%$
test_func_call_cm_runtime[False-compile] 0.8852ms 0.7785ms 1.2845 KOps/s 1.2700 KOps/s $\color{#35bf28}+1.14\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4105ms 0.3630ms 2.7551 KOps/s 2.7695 KOps/s $\color{#d91a1a}-0.52\%$
test_func_call_cm_runtime[True-eager] 1.0560ms 0.9630ms 1.0384 KOps/s 1.0178 KOps/s $\color{#35bf28}+2.03\%$
test_func_call_cm_runtime[True-compile] 0.9260ms 0.8392ms 1.1916 KOps/s 1.1753 KOps/s $\color{#35bf28}+1.39\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4774ms 0.4194ms 2.3846 KOps/s 2.3851 KOps/s $\color{#d91a1a}-0.02\%$
test_vmap_func_call_cm_runtime[eager] 2.4580ms 2.0028ms 499.3051 Ops/s 496.2029 Ops/s $\color{#35bf28}+0.63\%$
test_vmap_func_call_cm_runtime[compile] 0.9922ms 0.8569ms 1.1670 KOps/s 1.1524 KOps/s $\color{#35bf28}+1.27\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4754ms 0.4239ms 2.3588 KOps/s 2.3541 KOps/s $\color{#35bf28}+0.20\%$
test_distributed 2.6008ms 0.1879ms 5.3223 KOps/s 8.8388 KOps/s $\textbf{\color{#d91a1a}-39.78\%}$
test_tdmodule 47.0610μs 14.9917μs 66.7035 KOps/s 68.6760 KOps/s $\color{#d91a1a}-2.87\%$
test_tdmodule_dispatch 48.7410μs 28.9625μs 34.5274 KOps/s 35.0311 KOps/s $\color{#d91a1a}-1.44\%$
test_tdseq 26.4810μs 15.4843μs 64.5815 KOps/s 65.8581 KOps/s $\color{#d91a1a}-1.94\%$
test_tdseq_dispatch 61.1710μs 31.5966μs 31.6490 KOps/s 31.9934 KOps/s $\color{#d91a1a}-1.08\%$
test_instantiation_functorch 2.1884ms 1.8348ms 545.0065 Ops/s 543.9712 Ops/s $\color{#35bf28}+0.19\%$
test_instantiation_td 1.7955ms 1.1840ms 844.5785 Ops/s 846.1113 Ops/s $\color{#d91a1a}-0.18\%$
test_exec_functorch 0.2487ms 0.2052ms 4.8730 KOps/s 4.8902 KOps/s $\color{#d91a1a}-0.35\%$
test_exec_functional_call 0.2703ms 0.2014ms 4.9651 KOps/s 4.8830 KOps/s $\color{#35bf28}+1.68\%$
test_exec_td 0.3031ms 0.2078ms 4.8115 KOps/s 4.7930 KOps/s $\color{#35bf28}+0.39\%$
test_exec_td_decorator 0.5838ms 0.2505ms 3.9913 KOps/s 3.9648 KOps/s $\color{#35bf28}+0.67\%$
test_vmap_mlp_speed[True-True] 0.7751ms 0.6746ms 1.4823 KOps/s 1.4991 KOps/s $\color{#d91a1a}-1.12\%$
test_vmap_mlp_speed[True-False] 0.7717ms 0.6620ms 1.5106 KOps/s 1.5104 KOps/s $\color{#35bf28}+0.01\%$
test_vmap_mlp_speed[False-True] 0.6721ms 0.5507ms 1.8160 KOps/s 1.7941 KOps/s $\color{#35bf28}+1.22\%$
test_vmap_mlp_speed[False-False] 0.7362ms 0.5573ms 1.7944 KOps/s 1.7898 KOps/s $\color{#35bf28}+0.26\%$
test_vmap_mlp_speed_decorator[True-True] 1.3074ms 0.6520ms 1.5338 KOps/s 1.5381 KOps/s $\color{#d91a1a}-0.28\%$
test_vmap_mlp_speed_decorator[True-False] 0.7670ms 0.6458ms 1.5484 KOps/s 1.5429 KOps/s $\color{#35bf28}+0.36\%$
test_vmap_mlp_speed_decorator[False-True] 0.6604ms 0.5625ms 1.7776 KOps/s 1.7573 KOps/s $\color{#35bf28}+1.16\%$
test_vmap_mlp_speed_decorator[False-False] 0.7424ms 0.5691ms 1.7570 KOps/s 1.7600 KOps/s $\color{#d91a1a}-0.17\%$
test_vmap_transformer_speed[True-True] 8.5125ms 8.0958ms 123.5214 Ops/s 121.6608 Ops/s $\color{#35bf28}+1.53\%$
test_vmap_transformer_speed[True-False] 8.5637ms 8.1005ms 123.4493 Ops/s 122.4709 Ops/s $\color{#35bf28}+0.80\%$
test_vmap_transformer_speed[False-True] 8.2293ms 7.8959ms 126.6487 Ops/s 125.4189 Ops/s $\color{#35bf28}+0.98\%$
test_vmap_transformer_speed[False-False] 8.0582ms 7.8965ms 126.6379 Ops/s 125.4757 Ops/s $\color{#35bf28}+0.93\%$
test_vmap_transformer_speed_decorator[True-True] 19.5810ms 18.8277ms 53.1132 Ops/s 52.5458 Ops/s $\color{#35bf28}+1.08\%$
test_vmap_transformer_speed_decorator[True-False] 19.7018ms 18.9270ms 52.8346 Ops/s 52.6164 Ops/s $\color{#35bf28}+0.41\%$
test_vmap_transformer_speed_decorator[False-True] 19.6799ms 18.9708ms 52.7125 Ops/s 52.9602 Ops/s $\color{#d91a1a}-0.47\%$
test_vmap_transformer_speed_decorator[False-False] 18.9371ms 18.7427ms 53.3541 Ops/s 52.9520 Ops/s $\color{#35bf28}+0.76\%$
test_to_module_speed[True] 1.9858ms 0.9470ms 1.0559 KOps/s 1.0680 KOps/s $\color{#d91a1a}-1.13\%$
test_to_module_speed[False] 1.1698ms 0.9125ms 1.0959 KOps/s 1.0923 KOps/s $\color{#35bf28}+0.33\%$
test_tc_init 67.5510μs 32.8989μs 30.3961 KOps/s 30.0816 KOps/s $\color{#35bf28}+1.05\%$
test_tc_init_nested 96.6820μs 67.1743μs 14.8867 KOps/s 14.5204 KOps/s $\color{#35bf28}+2.52\%$
test_tc_first_layer_tensor 5.3959μs 0.6747μs 1.4821 MOps/s 1.4866 MOps/s $\color{#d91a1a}-0.30\%$
test_tc_first_layer_nontensor 41.5510μs 2.2312μs 448.1832 KOps/s 446.0041 KOps/s $\color{#35bf28}+0.49\%$
test_tc_second_layer_tensor 9.9452μs 1.3556μs 737.6834 KOps/s 723.5963 KOps/s $\color{#35bf28}+1.95\%$
test_tc_second_layer_nontensor 94.2920μs 2.9326μs 340.9989 KOps/s 338.7873 KOps/s $\color{#35bf28}+0.65\%$
test_unbind 0.1930s 10.9939ms 90.9597 Ops/s 92.0399 Ops/s $\color{#d91a1a}-1.17\%$
test_full_like 0.6539ms 0.5735ms 1.7437 KOps/s 1.7365 KOps/s $\color{#35bf28}+0.41\%$
test_zeros_like 0.2646ms 0.1978ms 5.0543 KOps/s 5.0552 KOps/s $\color{#d91a1a}-0.02\%$
test_ones_like 0.2320ms 0.1976ms 5.0598 KOps/s 5.0626 KOps/s $\color{#d91a1a}-0.06\%$
test_clone 0.4545ms 0.4142ms 2.4142 KOps/s 2.4119 KOps/s $\color{#35bf28}+0.10\%$
test_squeeze 38.6710μs 10.0376μs 99.6257 KOps/s 100.1969 KOps/s $\color{#d91a1a}-0.57\%$
test_unsqueeze 0.3035ms 73.2562μs 13.6507 KOps/s 13.5539 KOps/s $\color{#35bf28}+0.71\%$
test_split 0.2544ms 0.1528ms 6.5452 KOps/s 6.3049 KOps/s $\color{#35bf28}+3.81\%$
test_permute 0.2178ms 0.1736ms 5.7615 KOps/s 5.6714 KOps/s $\color{#35bf28}+1.59\%$
test_stack 1.2524ms 0.8635ms 1.1581 KOps/s 1.1606 KOps/s $\color{#d91a1a}-0.21\%$
test_cat 1.2642ms 1.2319ms 811.7668 Ops/s 811.7450 Ops/s $+0.00\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants