Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Fix buffer identity in Params._apply #1027

Merged
merged 1 commit into from
Oct 4, 2024
Merged

[BugFix] Fix buffer identity in Params._apply #1027

merged 1 commit into from
Oct 4, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Oct 4, 2024

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 4, 2024
Copy link

github-actions bot commented Oct 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}9$. Worsened: $\large\color{#d91a1a}5$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 55.7440μs 24.6096μs 40.6346 KOps/s 42.4670 KOps/s $\color{#d91a1a}-4.31\%$
test_plain_set_stack_nested 76.8840μs 24.7640μs 40.3812 KOps/s 41.6956 KOps/s $\color{#d91a1a}-3.15\%$
test_plain_set_nested_inplace 58.2390μs 26.6816μs 37.4790 KOps/s 38.2023 KOps/s $\color{#d91a1a}-1.89\%$
test_plain_set_stack_nested_inplace 72.2660μs 26.5134μs 37.7167 KOps/s 38.1048 KOps/s $\color{#d91a1a}-1.02\%$
test_items 36.3580μs 4.2026μs 237.9466 KOps/s 242.7801 KOps/s $\color{#d91a1a}-1.99\%$
test_items_nested 0.6618ms 0.3868ms 2.5855 KOps/s 2.6107 KOps/s $\color{#d91a1a}-0.97\%$
test_items_nested_locked 0.5677ms 0.3841ms 2.6037 KOps/s 2.6154 KOps/s $\color{#d91a1a}-0.45\%$
test_items_nested_leaf 0.1605ms 79.2218μs 12.6228 KOps/s 12.4276 KOps/s $\color{#35bf28}+1.57\%$
test_items_stack_nested 0.7276ms 0.3893ms 2.5688 KOps/s 2.5739 KOps/s $\color{#d91a1a}-0.20\%$
test_items_stack_nested_leaf 0.1569ms 81.7171μs 12.2373 KOps/s 12.1176 KOps/s $\color{#35bf28}+0.99\%$
test_items_stack_nested_locked 0.7269ms 0.3877ms 2.5795 KOps/s 2.5843 KOps/s $\color{#d91a1a}-0.19\%$
test_keys 40.4960μs 3.4518μs 289.7079 KOps/s 283.8768 KOps/s $\color{#35bf28}+2.05\%$
test_keys_nested 0.2583ms 0.1324ms 7.5516 KOps/s 7.4861 KOps/s $\color{#35bf28}+0.87\%$
test_keys_nested_locked 1.7245ms 0.1380ms 7.2487 KOps/s 7.2017 KOps/s $\color{#35bf28}+0.65\%$
test_keys_nested_leaf 0.2302ms 0.1158ms 8.6348 KOps/s 8.5272 KOps/s $\color{#35bf28}+1.26\%$
test_keys_stack_nested 0.2209ms 0.1322ms 7.5618 KOps/s 7.4528 KOps/s $\color{#35bf28}+1.46\%$
test_keys_stack_nested_leaf 0.2343ms 0.1156ms 8.6530 KOps/s 8.6141 KOps/s $\color{#35bf28}+0.45\%$
test_keys_stack_nested_locked 0.2562ms 0.1378ms 7.2544 KOps/s 7.1889 KOps/s $\color{#35bf28}+0.91\%$
test_values 5.6206μs 1.0032μs 996.8197 KOps/s 987.1183 KOps/s $\color{#35bf28}+0.98\%$
test_values_nested 0.1656ms 93.0733μs 10.7442 KOps/s 10.7583 KOps/s $\color{#d91a1a}-0.13\%$
test_values_nested_locked 0.1761ms 93.1152μs 10.7394 KOps/s 11.1678 KOps/s $\color{#d91a1a}-3.84\%$
test_values_nested_leaf 0.1449ms 78.2157μs 12.7851 KOps/s 12.6278 KOps/s $\color{#35bf28}+1.25\%$
test_values_stack_nested 0.1717ms 92.5286μs 10.8075 KOps/s 10.6559 KOps/s $\color{#35bf28}+1.42\%$
test_values_stack_nested_leaf 0.1451ms 77.9509μs 12.8286 KOps/s 12.5958 KOps/s $\color{#35bf28}+1.85\%$
test_values_stack_nested_locked 0.1929ms 92.6661μs 10.7914 KOps/s 10.2661 KOps/s $\textbf{\color{#35bf28}+5.12\%}$
test_membership 3.8416μs 0.7241μs 1.3810 MOps/s 1.3878 MOps/s $\color{#d91a1a}-0.49\%$
test_membership_nested 21.3710μs 2.7066μs 369.4710 KOps/s 359.5610 KOps/s $\color{#35bf28}+2.76\%$
test_membership_nested_leaf 21.7110μs 2.7499μs 363.6560 KOps/s 358.4228 KOps/s $\color{#35bf28}+1.46\%$
test_membership_stacked_nested 24.9160μs 2.7630μs 361.9203 KOps/s 363.2261 KOps/s $\color{#d91a1a}-0.36\%$
test_membership_stacked_nested_leaf 20.2780μs 2.7068μs 369.4373 KOps/s 362.4257 KOps/s $\color{#35bf28}+1.93\%$
test_membership_nested_last 34.0320μs 4.1905μs 238.6333 KOps/s 237.8428 KOps/s $\color{#35bf28}+0.33\%$
test_membership_nested_leaf_last 25.4880μs 4.2166μs 237.1598 KOps/s 235.3067 KOps/s $\color{#35bf28}+0.79\%$
test_membership_stacked_nested_last 22.7320μs 4.1738μs 239.5873 KOps/s 237.5110 KOps/s $\color{#35bf28}+0.87\%$
test_membership_stacked_nested_leaf_last 36.0180μs 4.1990μs 238.1512 KOps/s 239.4407 KOps/s $\color{#d91a1a}-0.54\%$
test_nested_getleaf 32.3410μs 10.3819μs 96.3219 KOps/s 96.4603 KOps/s $\color{#d91a1a}-0.14\%$
test_nested_get 54.3920μs 9.8055μs 101.9834 KOps/s 100.7796 KOps/s $\color{#35bf28}+1.19\%$
test_stacked_getleaf 38.5720μs 10.3317μs 96.7899 KOps/s 96.1952 KOps/s $\color{#35bf28}+0.62\%$
test_stacked_get 32.1600μs 9.9580μs 100.4217 KOps/s 100.1668 KOps/s $\color{#35bf28}+0.25\%$
test_nested_getitemleaf 38.6520μs 10.7978μs 92.6115 KOps/s 94.1405 KOps/s $\color{#d91a1a}-1.62\%$
test_nested_getitem 39.7050μs 10.1137μs 98.8758 KOps/s 99.0502 KOps/s $\color{#d91a1a}-0.18\%$
test_stacked_getitemleaf 36.9400μs 10.7653μs 92.8910 KOps/s 91.9990 KOps/s $\color{#35bf28}+0.97\%$
test_stacked_getitem 28.9740μs 10.0821μs 99.1854 KOps/s 98.2371 KOps/s $\color{#35bf28}+0.97\%$
test_lock_nested 84.1662ms 0.5956ms 1.6791 KOps/s 1.9725 KOps/s $\textbf{\color{#d91a1a}-14.88\%}$
test_lock_stack_nested 1.0612ms 0.4776ms 2.0936 KOps/s 2.1006 KOps/s $\color{#d91a1a}-0.33\%$
test_unlock_nested 84.8606ms 0.5123ms 1.9520 KOps/s 2.3435 KOps/s $\textbf{\color{#d91a1a}-16.71\%}$
test_unlock_stack_nested 0.7183ms 0.3918ms 2.5524 KOps/s 2.5592 KOps/s $\color{#d91a1a}-0.27\%$
test_flatten_speed 0.2061ms 0.1010ms 9.9000 KOps/s 9.9730 KOps/s $\color{#d91a1a}-0.73\%$
test_unflatten_speed 0.9770ms 0.5050ms 1.9801 KOps/s 1.9451 KOps/s $\color{#35bf28}+1.80\%$
test_common_ops 2.0914ms 1.1420ms 875.6421 Ops/s 865.4323 Ops/s $\color{#35bf28}+1.18\%$
test_creation 19.4370μs 2.0507μs 487.6280 KOps/s 487.1589 KOps/s $\color{#35bf28}+0.10\%$
test_creation_empty 55.4740μs 18.9703μs 52.7139 KOps/s 56.0749 KOps/s $\textbf{\color{#d91a1a}-5.99\%}$
test_creation_nested_1 53.8410μs 22.3827μs 44.6773 KOps/s 46.5505 KOps/s $\color{#d91a1a}-4.02\%$
test_creation_nested_2 62.6480μs 26.6807μs 37.4803 KOps/s 38.6081 KOps/s $\color{#d91a1a}-2.92\%$
test_clone 62.3770μs 17.2017μs 58.1337 KOps/s 57.7540 KOps/s $\color{#35bf28}+0.66\%$
test_getitem[int] 1.1145ms 16.5052μs 60.5869 KOps/s 60.3235 KOps/s $\color{#35bf28}+0.44\%$
test_getitem[slice_int] 0.1365ms 30.2472μs 33.0609 KOps/s 33.4595 KOps/s $\color{#d91a1a}-1.19\%$
test_getitem[range] 0.1937ms 57.4611μs 17.4031 KOps/s 16.8940 KOps/s $\color{#35bf28}+3.01\%$
test_getitem[tuple] 0.1282ms 24.7003μs 40.4854 KOps/s 40.4768 KOps/s $\color{#35bf28}+0.02\%$
test_getitem[list] 0.1912ms 53.4631μs 18.7045 KOps/s 18.5930 KOps/s $\color{#35bf28}+0.60\%$
test_setitem_dim[int] 77.2740μs 32.8649μs 30.4276 KOps/s 30.2262 KOps/s $\color{#35bf28}+0.67\%$
test_setitem_dim[slice_int] 0.1019ms 60.3920μs 16.5585 KOps/s 15.7000 KOps/s $\textbf{\color{#35bf28}+5.47\%}$
test_setitem_dim[range] 0.1159ms 82.2404μs 12.1595 KOps/s 11.6819 KOps/s $\color{#35bf28}+4.09\%$
test_setitem_dim[tuple] 93.0540μs 48.2461μs 20.7271 KOps/s 20.2839 KOps/s $\color{#35bf28}+2.18\%$
test_setitem 95.7290μs 31.3231μs 31.9254 KOps/s 32.6005 KOps/s $\color{#d91a1a}-2.07\%$
test_set 0.1008ms 30.4004μs 32.8943 KOps/s 33.3476 KOps/s $\color{#d91a1a}-1.36\%$
test_set_shared 3.6019ms 0.2202ms 4.5423 KOps/s 4.5032 KOps/s $\color{#35bf28}+0.87\%$
test_update 0.1439ms 39.4697μs 25.3359 KOps/s 25.5976 KOps/s $\color{#d91a1a}-1.02\%$
test_update_nested 0.1051ms 49.2683μs 20.2970 KOps/s 20.1716 KOps/s $\color{#35bf28}+0.62\%$
test_update__nested 0.1131ms 36.9488μs 27.0645 KOps/s 26.1808 KOps/s $\color{#35bf28}+3.38\%$
test_set_nested 88.7570μs 34.2226μs 29.2205 KOps/s 30.1459 KOps/s $\color{#d91a1a}-3.07\%$
test_set_nested_new 92.4830μs 38.5246μs 25.9574 KOps/s 26.3441 KOps/s $\color{#d91a1a}-1.47\%$
test_select 0.1102ms 56.3271μs 17.7535 KOps/s 18.3920 KOps/s $\color{#d91a1a}-3.47\%$
test_select_nested 0.1543ms 59.3087μs 16.8609 KOps/s 16.9933 KOps/s $\color{#d91a1a}-0.78\%$
test_exclude_nested 0.1523ms 73.3650μs 13.6305 KOps/s 13.5678 KOps/s $\color{#35bf28}+0.46\%$
test_empty[True] 0.7202ms 0.3482ms 2.8722 KOps/s 2.8636 KOps/s $\color{#35bf28}+0.30\%$
test_empty[False] 6.1440μs 1.2208μs 819.1466 KOps/s 836.1229 KOps/s $\color{#d91a1a}-2.03\%$
test_unbind_speed 0.4537ms 0.2969ms 3.3685 KOps/s 3.2797 KOps/s $\color{#35bf28}+2.71\%$
test_unbind_speed_stack0 0.5812ms 0.2971ms 3.3657 KOps/s 3.3516 KOps/s $\color{#35bf28}+0.42\%$
test_unbind_speed_stack1 88.9067ms 0.8225ms 1.2158 KOps/s 1.3241 KOps/s $\textbf{\color{#d91a1a}-8.18\%}$
test_split 78.9165ms 2.1433ms 466.5613 Ops/s 470.6696 Ops/s $\color{#d91a1a}-0.87\%$
test_chunk 2.6994ms 1.9822ms 504.4977 Ops/s 470.4617 Ops/s $\textbf{\color{#35bf28}+7.23\%}$
test_creation[device0] 3.6539ms 0.1198ms 8.3487 KOps/s 8.4426 KOps/s $\color{#d91a1a}-1.11\%$
test_creation_from_tensor 0.2708ms 0.1179ms 8.4794 KOps/s 8.3332 KOps/s $\color{#35bf28}+1.75\%$
test_add_one[memmap_tensor0] 0.1666ms 7.4014μs 135.1089 KOps/s 133.2986 KOps/s $\color{#35bf28}+1.36\%$
test_contiguous[memmap_tensor0] 13.0240μs 1.8820μs 531.3358 KOps/s 520.9288 KOps/s $\color{#35bf28}+2.00\%$
test_stack[memmap_tensor0] 38.5630μs 5.6375μs 177.3842 KOps/s 170.0613 KOps/s $\color{#35bf28}+4.31\%$
test_memmaptd_index 1.1367ms 0.4084ms 2.4489 KOps/s 2.4008 KOps/s $\color{#35bf28}+2.00\%$
test_memmaptd_index_astensor 0.7890ms 0.5147ms 1.9430 KOps/s 1.9274 KOps/s $\color{#35bf28}+0.81\%$
test_memmaptd_index_op 1.9089ms 1.0917ms 915.9794 Ops/s 937.7689 Ops/s $\color{#d91a1a}-2.32\%$
test_serialize_model 0.1315s 0.1186s 8.4319 Ops/s 8.4994 Ops/s $\color{#d91a1a}-0.79\%$
test_serialize_model_pickle 0.4457s 0.3864s 2.5877 Ops/s 2.5813 Ops/s $\color{#35bf28}+0.25\%$
test_serialize_weights 0.1340s 0.1198s 8.3485 Ops/s 8.4741 Ops/s $\color{#d91a1a}-1.48\%$
test_serialize_weights_returnearly 0.1834s 0.1579s 6.3314 Ops/s 6.2450 Ops/s $\color{#35bf28}+1.38\%$
test_serialize_weights_pickle 0.4658s 0.4135s 2.4183 Ops/s 2.3493 Ops/s $\color{#35bf28}+2.94\%$
test_serialize_weights_filesystem 0.2253s 0.1508s 6.6296 Ops/s 7.1030 Ops/s $\textbf{\color{#d91a1a}-6.66\%}$
test_serialize_model_filesystem 0.1493s 0.1427s 7.0084 Ops/s 6.2125 Ops/s $\textbf{\color{#35bf28}+12.81\%}$
test_reshape_pytree 0.1046ms 38.7083μs 25.8343 KOps/s 25.9929 KOps/s $\color{#d91a1a}-0.61\%$
test_reshape_td 0.1112ms 46.0551μs 21.7131 KOps/s 21.9244 KOps/s $\color{#d91a1a}-0.96\%$
test_view_pytree 93.0840μs 38.5132μs 25.9651 KOps/s 26.1955 KOps/s $\color{#d91a1a}-0.88\%$
test_view_td 0.1371ms 51.0477μs 19.5895 KOps/s 19.6081 KOps/s $\color{#d91a1a}-0.09\%$
test_unbind_pytree 82.8550μs 35.0765μs 28.5091 KOps/s 27.8912 KOps/s $\color{#35bf28}+2.22\%$
test_unbind_td 0.2981ms 44.8084μs 22.3173 KOps/s 22.4790 KOps/s $\color{#d91a1a}-0.72\%$
test_split_pytree 82.0030μs 37.5809μs 26.6093 KOps/s 26.3155 KOps/s $\color{#35bf28}+1.12\%$
test_split_td 0.5602ms 56.4583μs 17.7122 KOps/s 17.6406 KOps/s $\color{#35bf28}+0.41\%$
test_add_pytree 0.1186ms 44.9043μs 22.2696 KOps/s 22.2594 KOps/s $\color{#35bf28}+0.05\%$
test_add_td 0.1771ms 89.1935μs 11.2116 KOps/s 11.7151 KOps/s $\color{#d91a1a}-4.30\%$
test_compile_add_one_nested[tensordict-compile] 0.1357ms 59.4903μs 16.8095 KOps/s 17.0709 KOps/s $\color{#d91a1a}-1.53\%$
test_compile_add_one_nested[tensordict-eager] 0.2874ms 0.1949ms 5.1297 KOps/s 5.0665 KOps/s $\color{#35bf28}+1.25\%$
test_compile_add_one_nested[pytree-compile] 0.1225ms 57.1479μs 17.4985 KOps/s 17.7243 KOps/s $\color{#d91a1a}-1.27\%$
test_compile_add_one_nested[pytree-eager] 0.2389ms 0.1381ms 7.2392 KOps/s 7.0571 KOps/s $\color{#35bf28}+2.58\%$
test_compile_copy_nested[tensordict-compile] 60.6140μs 23.8347μs 41.9556 KOps/s 41.8864 KOps/s $\color{#35bf28}+0.17\%$
test_compile_copy_nested[tensordict-eager] 0.1538ms 73.0623μs 13.6870 KOps/s 13.4736 KOps/s $\color{#35bf28}+1.58\%$
test_compile_copy_nested[pytree-compile] 0.1588ms 75.3914μs 13.2641 KOps/s 13.3838 KOps/s $\color{#d91a1a}-0.89\%$
test_compile_copy_nested[pytree-eager] 0.1318ms 67.8762μs 14.7327 KOps/s 14.7876 KOps/s $\color{#d91a1a}-0.37\%$
test_compile_add_one_flat[tensordict-compile] 0.3357ms 0.1810ms 5.5245 KOps/s 5.4906 KOps/s $\color{#35bf28}+0.62\%$
test_compile_add_one_flat[tensordict-eager] 0.4684ms 0.2432ms 4.1124 KOps/s 4.1261 KOps/s $\color{#d91a1a}-0.33\%$
test_compile_add_one_flat[tensorclass-compile] 0.1149ms 48.3995μs 20.6614 KOps/s 19.7517 KOps/s $\color{#35bf28}+4.61\%$
test_compile_add_one_flat[tensorclass-eager] 0.1564ms 77.4708μs 12.9081 KOps/s 12.8922 KOps/s $\color{#35bf28}+0.12\%$
test_compile_add_one_flat[pytree-compile] 0.2684ms 0.1750ms 5.7158 KOps/s 5.6919 KOps/s $\color{#35bf28}+0.42\%$
test_compile_add_one_flat[pytree-eager] 0.4896ms 0.2869ms 3.4851 KOps/s 3.4495 KOps/s $\color{#35bf28}+1.03\%$
test_compile_add_self_flat[tensordict-eager] 0.3988ms 0.2762ms 3.6207 KOps/s 3.6008 KOps/s $\color{#35bf28}+0.55\%$
test_compile_add_self_flat[tensordict-compile] 0.3512ms 0.1872ms 5.3411 KOps/s 5.5152 KOps/s $\color{#d91a1a}-3.16\%$
test_compile_add_self_flat[tensorclass-eager] 0.1499ms 74.7216μs 13.3830 KOps/s 13.5513 KOps/s $\color{#d91a1a}-1.24\%$
test_compile_add_self_flat[tensorclass-compile] 0.1076ms 50.9805μs 19.6153 KOps/s 19.6735 KOps/s $\color{#d91a1a}-0.30\%$
test_compile_add_self_flat[pytree-eager] 0.4983ms 0.2339ms 4.2745 KOps/s 4.2062 KOps/s $\color{#35bf28}+1.62\%$
test_compile_add_self_flat[pytree-compile] 0.3429ms 0.1757ms 5.6904 KOps/s 5.6006 KOps/s $\color{#35bf28}+1.60\%$
test_compile_copy_flat[tensordict-compile] 0.2462ms 0.1121ms 8.9199 KOps/s 9.0130 KOps/s $\color{#d91a1a}-1.03\%$
test_compile_copy_flat[tensordict-eager] 0.1727ms 78.1945μs 12.7886 KOps/s 12.9967 KOps/s $\color{#d91a1a}-1.60\%$
test_compile_copy_flat[pytree-compile] 0.1597ms 76.4944μs 13.0728 KOps/s 13.2247 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_copy_flat[pytree-eager] 0.1502ms 68.4983μs 14.5989 KOps/s 14.7656 KOps/s $\color{#d91a1a}-1.13\%$
test_compile_assign_and_add[tensordict-compile] 0.2988ms 0.1944ms 5.1429 KOps/s 5.1171 KOps/s $\color{#35bf28}+0.50\%$
test_compile_assign_and_add[tensordict-eager] 2.4980ms 1.7783ms 562.3300 Ops/s 560.8074 Ops/s $\color{#35bf28}+0.27\%$
test_compile_assign_and_add[pytree-compile] 0.2650ms 0.1922ms 5.2018 KOps/s 5.1258 KOps/s $\color{#35bf28}+1.48\%$
test_compile_assign_and_add[pytree-eager] 1.3620ms 1.0926ms 915.2155 Ops/s 895.1707 Ops/s $\color{#35bf28}+2.24\%$
test_compile_assign_and_add_stack[compile] 0.5876ms 0.4153ms 2.4078 KOps/s 2.3412 KOps/s $\color{#35bf28}+2.84\%$
test_compile_assign_and_add_stack[eager] 4.4277ms 4.2296ms 236.4276 Ops/s 244.9006 Ops/s $\color{#d91a1a}-3.46\%$
test_compile_indexing[tensor-tensordict-compile] 96.0110μs 35.0848μs 28.5023 KOps/s 28.6364 KOps/s $\color{#d91a1a}-0.47\%$
test_compile_indexing[tensor-tensordict-eager] 0.9532ms 47.7895μs 20.9251 KOps/s 20.4903 KOps/s $\color{#35bf28}+2.12\%$
test_compile_indexing[tensor-tensorclass-compile] 91.0010μs 30.9884μs 32.2702 KOps/s 33.3867 KOps/s $\color{#d91a1a}-3.34\%$
test_compile_indexing[tensor-tensorclass-eager] 94.1170μs 30.2147μs 33.0965 KOps/s 34.5420 KOps/s $\color{#d91a1a}-4.18\%$
test_compile_indexing[tensor-pytree-compile] 95.6190μs 31.2246μs 32.0260 KOps/s 33.5730 KOps/s $\color{#d91a1a}-4.61\%$
test_compile_indexing[tensor-pytree-eager] 74.8500μs 29.6742μs 33.6993 KOps/s 34.5664 KOps/s $\color{#d91a1a}-2.51\%$
test_compile_indexing[slice-tensordict-compile] 0.1680ms 74.3905μs 13.4426 KOps/s 13.5352 KOps/s $\color{#d91a1a}-0.68\%$
test_compile_indexing[slice-tensordict-eager] 0.3524ms 27.7124μs 36.0849 KOps/s 36.8127 KOps/s $\color{#d91a1a}-1.98\%$
test_compile_indexing[slice-tensorclass-compile] 0.1483ms 68.8621μs 14.5218 KOps/s 14.2527 KOps/s $\color{#35bf28}+1.89\%$
test_compile_indexing[slice-tensorclass-eager] 64.5710μs 23.3329μs 42.8579 KOps/s 43.0635 KOps/s $\color{#d91a1a}-0.48\%$
test_compile_indexing[slice-pytree-compile] 0.1286ms 68.5279μs 14.5926 KOps/s 14.3659 KOps/s $\color{#35bf28}+1.58\%$
test_compile_indexing[slice-pytree-eager] 76.6230μs 22.8899μs 43.6874 KOps/s 43.1064 KOps/s $\color{#35bf28}+1.35\%$
test_compile_indexing[int-tensordict-compile] 0.1640ms 73.6492μs 13.5779 KOps/s 13.2159 KOps/s $\color{#35bf28}+2.74\%$
test_compile_indexing[int-tensordict-eager] 0.9066ms 27.1909μs 36.7770 KOps/s 37.3400 KOps/s $\color{#d91a1a}-1.51\%$
test_compile_indexing[int-tensorclass-compile] 0.1233ms 68.9636μs 14.5004 KOps/s 14.4290 KOps/s $\color{#35bf28}+0.49\%$
test_compile_indexing[int-tensorclass-eager] 67.2660μs 22.7572μs 43.9421 KOps/s 43.8511 KOps/s $\color{#35bf28}+0.21\%$
test_compile_indexing[int-pytree-compile] 0.1413ms 68.2770μs 14.6462 KOps/s 14.5487 KOps/s $\color{#35bf28}+0.67\%$
test_compile_indexing[int-pytree-eager] 71.9250μs 22.9055μs 43.6576 KOps/s 43.4821 KOps/s $\color{#35bf28}+0.40\%$
test_mod_add[eager] 0.1096ms 25.3603μs 39.4317 KOps/s 40.4079 KOps/s $\color{#d91a1a}-2.42\%$
test_mod_add[compile] 0.1006ms 39.3955μs 25.3836 KOps/s 25.8447 KOps/s $\color{#d91a1a}-1.78\%$
test_mod_add[compile-overhead] 0.1035ms 40.1595μs 24.9007 KOps/s 25.4467 KOps/s $\color{#d91a1a}-2.15\%$
test_mod_wrap[eager] 0.3423ms 0.2154ms 4.6419 KOps/s 4.7290 KOps/s $\color{#d91a1a}-1.84\%$
test_mod_wrap[compile] 0.3880ms 0.2314ms 4.3220 KOps/s 4.2318 KOps/s $\color{#35bf28}+2.13\%$
test_mod_wrap[compile-overhead] 0.3486ms 0.2313ms 4.3230 KOps/s 4.3073 KOps/s $\color{#35bf28}+0.37\%$
test_mod_wrap_and_backward[eager] 13.1525ms 10.7628ms 92.9125 Ops/s 72.3895 Ops/s $\textbf{\color{#35bf28}+28.35\%}$
test_mod_wrap_and_backward[compile] 12.3013ms 10.7369ms 93.1366 Ops/s 75.8383 Ops/s $\textbf{\color{#35bf28}+22.81\%}$
test_mod_wrap_and_backward[compile-overhead] 12.1912ms 10.5595ms 94.7016 Ops/s 81.3074 Ops/s $\textbf{\color{#35bf28}+16.47\%}$
test_seq_add[eager] 0.1796ms 94.9141μs 10.5358 KOps/s 10.8149 KOps/s $\color{#d91a1a}-2.58\%$
test_seq_add[compile] 0.1253ms 65.5788μs 15.2488 KOps/s 15.3408 KOps/s $\color{#d91a1a}-0.60\%$
test_seq_add[compile-overhead] 0.1315ms 65.5788μs 15.2488 KOps/s 15.5401 KOps/s $\color{#d91a1a}-1.87\%$
test_seq_wrap[eager] 0.6397ms 0.3958ms 2.5265 KOps/s 2.5928 KOps/s $\color{#d91a1a}-2.56\%$
test_seq_wrap[compile] 1.1949ms 0.2761ms 3.6215 KOps/s 3.7036 KOps/s $\color{#d91a1a}-2.22\%$
test_seq_wrap[compile-overhead] 1.1633ms 0.2733ms 3.6596 KOps/s 3.6534 KOps/s $\color{#35bf28}+0.17\%$
test_func_call_runtime[False-eager] 0.9202ms 0.5328ms 1.8767 KOps/s 1.9053 KOps/s $\color{#d91a1a}-1.50\%$
test_func_call_runtime[False-compile] 0.6225ms 0.5113ms 1.9559 KOps/s 1.9722 KOps/s $\color{#d91a1a}-0.83\%$
test_func_call_runtime[False-compile-overhead] 0.8143ms 0.5076ms 1.9700 KOps/s 1.9684 KOps/s $\color{#35bf28}+0.08\%$
test_func_call_runtime[True-eager] 0.9788ms 0.7481ms 1.3367 KOps/s 1.3439 KOps/s $\color{#d91a1a}-0.54\%$
test_func_call_runtime[True-compile] 0.6938ms 0.5248ms 1.9056 KOps/s 1.9156 KOps/s $\color{#d91a1a}-0.52\%$
test_func_call_runtime[True-compile-overhead] 0.8749ms 0.5248ms 1.9053 KOps/s 1.9366 KOps/s $\color{#d91a1a}-1.62\%$
test_func_call_cm_runtime[False-eager] 0.8742ms 0.5302ms 1.8859 KOps/s 1.8906 KOps/s $\color{#d91a1a}-0.25\%$
test_func_call_cm_runtime[False-compile] 0.6453ms 0.5069ms 1.9726 KOps/s 1.9735 KOps/s $\color{#d91a1a}-0.05\%$
test_func_call_cm_runtime[False-compile-overhead] 0.8427ms 0.5121ms 1.9528 KOps/s 1.9684 KOps/s $\color{#d91a1a}-0.79\%$
test_func_call_cm_runtime[True-eager] 1.9263ms 0.9103ms 1.0986 KOps/s 1.1139 KOps/s $\color{#d91a1a}-1.38\%$
test_func_call_cm_runtime[True-compile] 1.0845ms 0.7516ms 1.3304 KOps/s 1.3441 KOps/s $\color{#d91a1a}-1.02\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8695ms 0.7598ms 1.3162 KOps/s 1.3271 KOps/s $\color{#d91a1a}-0.82\%$
test_vmap_func_call_cm_runtime[eager] 2.5414ms 1.9153ms 522.1048 Ops/s 520.9826 Ops/s $\color{#35bf28}+0.22\%$
test_vmap_func_call_cm_runtime[compile] 2.5421ms 1.9772ms 505.7551 Ops/s 503.6587 Ops/s $\color{#35bf28}+0.42\%$
test_vmap_func_call_cm_runtime[compile-overhead] 2.5956ms 1.9834ms 504.1819 Ops/s 502.4631 Ops/s $\color{#35bf28}+0.34\%$
test_distributed 0.3164ms 0.1262ms 7.9247 KOps/s 7.7530 KOps/s $\color{#35bf28}+2.22\%$
test_tdmodule 71.5450μs 18.5969μs 53.7724 KOps/s 54.9578 KOps/s $\color{#d91a1a}-2.16\%$
test_tdmodule_dispatch 66.0740μs 36.9870μs 27.0365 KOps/s 27.7920 KOps/s $\color{#d91a1a}-2.72\%$
test_tdseq 50.0140μs 21.2976μs 46.9536 KOps/s 47.6539 KOps/s $\color{#d91a1a}-1.47\%$
test_tdseq_dispatch 73.9390μs 42.6893μs 23.4251 KOps/s 24.2706 KOps/s $\color{#d91a1a}-3.48\%$
test_instantiation_functorch 1.7617ms 1.5746ms 635.0830 Ops/s 633.7333 Ops/s $\color{#35bf28}+0.21\%$
test_instantiation_td 1.8123ms 1.1718ms 853.3597 Ops/s 856.5842 Ops/s $\color{#d91a1a}-0.38\%$
test_exec_functorch 0.4215ms 0.1834ms 5.4534 KOps/s 5.3966 KOps/s $\color{#35bf28}+1.05\%$
test_exec_functional_call 0.3442ms 0.1755ms 5.6988 KOps/s 5.6358 KOps/s $\color{#35bf28}+1.12\%$
test_exec_td 0.3519ms 0.2015ms 4.9623 KOps/s 4.7445 KOps/s $\color{#35bf28}+4.59\%$
test_exec_td_decorator 0.3723ms 0.2349ms 4.2565 KOps/s 4.2509 KOps/s $\color{#35bf28}+0.13\%$
test_vmap_mlp_speed[True-True] 1.1833ms 0.7115ms 1.4055 KOps/s 1.4566 KOps/s $\color{#d91a1a}-3.51\%$
test_vmap_mlp_speed[True-False] 0.8471ms 0.6898ms 1.4497 KOps/s 1.4730 KOps/s $\color{#d91a1a}-1.59\%$
test_vmap_mlp_speed[False-True] 0.8725ms 0.5414ms 1.8470 KOps/s 1.8760 KOps/s $\color{#d91a1a}-1.54\%$
test_vmap_mlp_speed[False-False] 0.9029ms 0.5482ms 1.8242 KOps/s 1.8637 KOps/s $\color{#d91a1a}-2.12\%$
test_vmap_mlp_speed_decorator[True-True] 1.2759ms 0.6513ms 1.5353 KOps/s 1.5565 KOps/s $\color{#d91a1a}-1.36\%$
test_vmap_mlp_speed_decorator[True-False] 0.8095ms 0.6505ms 1.5372 KOps/s 1.5584 KOps/s $\color{#d91a1a}-1.36\%$
test_vmap_mlp_speed_decorator[False-True] 0.8903ms 0.5424ms 1.8436 KOps/s 1.8843 KOps/s $\color{#d91a1a}-2.16\%$
test_vmap_mlp_speed_decorator[False-False] 0.6764ms 0.5340ms 1.8725 KOps/s 1.8739 KOps/s $\color{#d91a1a}-0.07\%$
test_to_module_speed[True] 2.3119ms 1.3947ms 717.0183 Ops/s 714.3941 Ops/s $\color{#35bf28}+0.37\%$
test_to_module_speed[False] 2.3317ms 1.3584ms 736.1858 Ops/s 730.8041 Ops/s $\color{#35bf28}+0.74\%$
test_tc_init 0.1109ms 48.8238μs 20.4818 KOps/s 21.0092 KOps/s $\color{#d91a1a}-2.51\%$
test_tc_init_nested 0.2083ms 95.0836μs 10.5171 KOps/s 10.5073 KOps/s $\color{#35bf28}+0.09\%$
test_tc_first_layer_tensor 15.1590μs 1.4984μs 667.3650 KOps/s 639.1721 KOps/s $\color{#35bf28}+4.41\%$
test_tc_first_layer_nontensor 43.0020μs 4.6054μs 217.1344 KOps/s 210.5714 KOps/s $\color{#35bf28}+3.12\%$
test_tc_second_layer_tensor 18.3750μs 2.7283μs 366.5276 KOps/s 345.1004 KOps/s $\textbf{\color{#35bf28}+6.21\%}$
test_tc_second_layer_nontensor 39.4450μs 6.0238μs 166.0073 KOps/s 161.6530 KOps/s $\color{#35bf28}+2.69\%$
test_unbind 0.4682s 13.1706ms 75.9266 Ops/s 75.3994 Ops/s $\color{#35bf28}+0.70\%$
test_full_like 7.8984ms 7.1706ms 139.4577 Ops/s 118.4356 Ops/s $\textbf{\color{#35bf28}+17.75\%}$
test_zeros_like 3.1112ms 2.7106ms 368.9187 Ops/s 357.6143 Ops/s $\color{#35bf28}+3.16\%$
test_ones_like 3.5513ms 3.2596ms 306.7828 Ops/s 305.8875 Ops/s $\color{#35bf28}+0.29\%$
test_clone 5.2088ms 4.9137ms 203.5128 Ops/s 200.1529 Ops/s $\color{#35bf28}+1.68\%$
test_squeeze 60.5850μs 12.5073μs 79.9532 KOps/s 80.9334 KOps/s $\color{#d91a1a}-1.21\%$
test_unsqueeze 0.3453ms 93.3821μs 10.7087 KOps/s 10.7226 KOps/s $\color{#d91a1a}-0.13\%$
test_split 0.3712ms 0.1920ms 5.2074 KOps/s 5.1868 KOps/s $\color{#35bf28}+0.40\%$
test_permute 0.3835ms 0.2186ms 4.5751 KOps/s 4.5858 KOps/s $\color{#d91a1a}-0.23\%$
test_stack 28.6305ms 25.4539ms 39.2866 Ops/s 40.1837 Ops/s $\color{#d91a1a}-2.23\%$
test_cat 27.7739ms 25.2686ms 39.5747 Ops/s 40.8933 Ops/s $\color{#d91a1a}-3.22\%$

Copy link

github-actions bot commented Oct 4, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}17$. Worsened: $\large\color{#d91a1a}12$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1527ms 16.4323μs 60.8559 KOps/s 59.0341 KOps/s $\color{#35bf28}+3.09\%$
test_plain_set_stack_nested 41.5010μs 16.4869μs 60.6543 KOps/s 58.9265 KOps/s $\color{#35bf28}+2.93\%$
test_plain_set_nested_inplace 53.8010μs 17.5642μs 56.9340 KOps/s 55.1198 KOps/s $\color{#35bf28}+3.29\%$
test_plain_set_stack_nested_inplace 52.3210μs 17.5003μs 57.1419 KOps/s 55.3275 KOps/s $\color{#35bf28}+3.28\%$
test_items 25.7300μs 2.9874μs 334.7382 KOps/s 346.3573 KOps/s $\color{#d91a1a}-3.35\%$
test_items_nested 0.3810ms 0.3390ms 2.9494 KOps/s 2.9421 KOps/s $\color{#35bf28}+0.25\%$
test_items_nested_locked 0.4073ms 0.3361ms 2.9756 KOps/s 2.9810 KOps/s $\color{#d91a1a}-0.18\%$
test_items_nested_leaf 95.5520μs 62.5818μs 15.9791 KOps/s 16.0150 KOps/s $\color{#d91a1a}-0.22\%$
test_items_stack_nested 0.5318ms 0.3409ms 2.9332 KOps/s 2.9725 KOps/s $\color{#d91a1a}-1.32\%$
test_items_stack_nested_leaf 92.0320μs 64.8783μs 15.4135 KOps/s 15.7883 KOps/s $\color{#d91a1a}-2.37\%$
test_items_stack_nested_locked 0.3718ms 0.3402ms 2.9399 KOps/s 2.9496 KOps/s $\color{#d91a1a}-0.33\%$
test_keys 32.3210μs 3.4068μs 293.5325 KOps/s 291.7744 KOps/s $\color{#35bf28}+0.60\%$
test_keys_nested 0.1023ms 70.3342μs 14.2178 KOps/s 14.1924 KOps/s $\color{#35bf28}+0.18\%$
test_keys_nested_locked 2.4950ms 76.8917μs 13.0053 KOps/s 13.0237 KOps/s $\color{#d91a1a}-0.14\%$
test_keys_nested_leaf 0.1045ms 60.7283μs 16.4668 KOps/s 16.2765 KOps/s $\color{#35bf28}+1.17\%$
test_keys_stack_nested 0.1702ms 71.5536μs 13.9755 KOps/s 13.9574 KOps/s $\color{#35bf28}+0.13\%$
test_keys_stack_nested_leaf 93.1720μs 62.9312μs 15.8904 KOps/s 16.0617 KOps/s $\color{#d91a1a}-1.07\%$
test_keys_stack_nested_locked 0.1119ms 77.6528μs 12.8778 KOps/s 12.9425 KOps/s $\color{#d91a1a}-0.50\%$
test_values 8.5685μs 0.8420μs 1.1877 MOps/s 1.1973 MOps/s $\color{#d91a1a}-0.80\%$
test_values_nested 94.7520μs 48.8223μs 20.4824 KOps/s 20.4603 KOps/s $\color{#35bf28}+0.11\%$
test_values_nested_locked 78.0020μs 50.2027μs 19.9192 KOps/s 19.8116 KOps/s $\color{#35bf28}+0.54\%$
test_values_nested_leaf 69.5010μs 42.6271μs 23.4592 KOps/s 23.4269 KOps/s $\color{#35bf28}+0.14\%$
test_values_stack_nested 80.3820μs 50.0819μs 19.9673 KOps/s 20.1586 KOps/s $\color{#d91a1a}-0.95\%$
test_values_stack_nested_leaf 79.2020μs 44.0966μs 22.6775 KOps/s 22.9880 KOps/s $\color{#d91a1a}-1.35\%$
test_values_stack_nested_locked 81.6210μs 51.5115μs 19.4132 KOps/s 19.4339 KOps/s $\color{#d91a1a}-0.11\%$
test_membership 1.7931μs 0.5007μs 1.9971 MOps/s 1.9861 MOps/s $\color{#35bf28}+0.55\%$
test_membership_nested 16.2255μs 1.8355μs 544.8023 KOps/s 545.0633 KOps/s $\color{#d91a1a}-0.05\%$
test_membership_nested_leaf 13.3603μs 1.8366μs 544.4732 KOps/s 547.2286 KOps/s $\color{#d91a1a}-0.50\%$
test_membership_stacked_nested 31.3200μs 1.8410μs 543.1835 KOps/s 538.9763 KOps/s $\color{#35bf28}+0.78\%$
test_membership_stacked_nested_leaf 24.8610μs 1.8861μs 530.1851 KOps/s 535.6556 KOps/s $\color{#d91a1a}-1.02\%$
test_membership_nested_last 0.5940ms 2.9739μs 336.2557 KOps/s 337.8659 KOps/s $\color{#d91a1a}-0.48\%$
test_membership_nested_leaf_last 33.1110μs 2.9295μs 341.3551 KOps/s 333.4303 KOps/s $\color{#35bf28}+2.38\%$
test_membership_stacked_nested_last 31.5000μs 3.5165μs 284.3759 KOps/s 333.8960 KOps/s $\textbf{\color{#d91a1a}-14.83\%}$
test_membership_stacked_nested_leaf_last 34.1000μs 3.4698μs 288.1972 KOps/s 336.2771 KOps/s $\textbf{\color{#d91a1a}-14.30\%}$
test_nested_getleaf 34.0910μs 6.0634μs 164.9247 KOps/s 168.3147 KOps/s $\color{#d91a1a}-2.01\%$
test_nested_get 38.5910μs 5.7947μs 172.5703 KOps/s 175.1260 KOps/s $\color{#d91a1a}-1.46\%$
test_stacked_getleaf 0.1729ms 5.9962μs 166.7719 KOps/s 165.4428 KOps/s $\color{#35bf28}+0.80\%$
test_stacked_get 34.0110μs 5.6600μs 176.6785 KOps/s 173.2625 KOps/s $\color{#35bf28}+1.97\%$
test_nested_getitemleaf 0.1885ms 6.1141μs 163.5552 KOps/s 161.7610 KOps/s $\color{#35bf28}+1.11\%$
test_nested_getitem 34.1810μs 5.7810μs 172.9795 KOps/s 171.8565 KOps/s $\color{#35bf28}+0.65\%$
test_stacked_getitemleaf 31.3700μs 6.0764μs 164.5714 KOps/s 163.8114 KOps/s $\color{#35bf28}+0.46\%$
test_stacked_getitem 26.2000μs 5.7503μs 173.9053 KOps/s 173.8409 KOps/s $\color{#35bf28}+0.04\%$
test_lock_nested 4.3878ms 0.4233ms 2.3625 KOps/s 2.3882 KOps/s $\color{#d91a1a}-1.08\%$
test_lock_stack_nested 0.5202ms 0.3877ms 2.5794 KOps/s 2.5962 KOps/s $\color{#d91a1a}-0.65\%$
test_unlock_nested 0.8589ms 0.3556ms 2.8123 KOps/s 2.7990 KOps/s $\color{#35bf28}+0.47\%$
test_unlock_stack_nested 0.3543ms 0.3233ms 3.0930 KOps/s 3.0926 KOps/s $\color{#35bf28}+0.01\%$
test_flatten_speed 0.1658ms 77.0629μs 12.9764 KOps/s 13.0764 KOps/s $\color{#d91a1a}-0.76\%$
test_unflatten_speed 0.3600ms 0.3264ms 3.0642 KOps/s 3.1008 KOps/s $\color{#d91a1a}-1.18\%$
test_common_ops 1.5323ms 1.2298ms 813.1221 Ops/s 763.5003 Ops/s $\textbf{\color{#35bf28}+6.50\%}$
test_creation 31.5900μs 1.4518μs 688.8048 KOps/s 704.4494 KOps/s $\color{#d91a1a}-2.22\%$
test_creation_empty 0.8790ms 14.7222μs 67.9249 KOps/s 62.5898 KOps/s $\textbf{\color{#35bf28}+8.52\%}$
test_creation_nested_1 47.2710μs 16.3510μs 61.1583 KOps/s 56.4307 KOps/s $\textbf{\color{#35bf28}+8.38\%}$
test_creation_nested_2 58.3120μs 18.8627μs 53.0147 KOps/s 48.9771 KOps/s $\textbf{\color{#35bf28}+8.24\%}$
test_clone 0.1660ms 27.4700μs 36.4033 KOps/s 36.8735 KOps/s $\color{#d91a1a}-1.28\%$
test_getitem[int] 92.1347ms 22.6927μs 44.0671 KOps/s 65.3006 KOps/s $\textbf{\color{#d91a1a}-32.52\%}$
test_getitem[slice_int] 0.1592ms 26.6630μs 37.5051 KOps/s 37.8101 KOps/s $\color{#d91a1a}-0.81\%$
test_getitem[range] 0.2200ms 0.1059ms 9.4436 KOps/s 9.4418 KOps/s $\color{#35bf28}+0.02\%$
test_getitem[tuple] 0.1195ms 23.0377μs 43.4071 KOps/s 43.7493 KOps/s $\color{#d91a1a}-0.78\%$
test_getitem[list] 0.2783ms 0.1003ms 9.9731 KOps/s 10.5205 KOps/s $\textbf{\color{#d91a1a}-5.20\%}$
test_setitem_dim[int] 74.4710μs 45.4498μs 22.0023 KOps/s 22.7761 KOps/s $\color{#d91a1a}-3.40\%$
test_setitem_dim[slice_int] 0.1968ms 67.0614μs 14.9117 KOps/s 15.0944 KOps/s $\color{#d91a1a}-1.21\%$
test_setitem_dim[range] 0.3101ms 0.1269ms 7.8819 KOps/s 8.0406 KOps/s $\color{#d91a1a}-1.97\%$
test_setitem_dim[tuple] 0.2243ms 60.3292μs 16.5757 KOps/s 16.7594 KOps/s $\color{#d91a1a}-1.10\%$
test_setitem 0.1894ms 40.1520μs 24.9054 KOps/s 23.1335 KOps/s $\textbf{\color{#35bf28}+7.66\%}$
test_set 0.1879ms 39.1830μs 25.5213 KOps/s 23.5060 KOps/s $\textbf{\color{#35bf28}+8.57\%}$
test_set_shared 0.3452ms 53.5769μs 18.6647 KOps/s 18.9708 KOps/s $\color{#d91a1a}-1.61\%$
test_update 0.2255ms 50.7522μs 19.7036 KOps/s 20.4328 KOps/s $\color{#d91a1a}-3.57\%$
test_update_nested 0.2384ms 59.1521μs 16.9056 KOps/s 17.6677 KOps/s $\color{#d91a1a}-4.31\%$
test_update__nested 0.2265ms 59.6016μs 16.7781 KOps/s 16.8677 KOps/s $\color{#d91a1a}-0.53\%$
test_set_nested 0.1941ms 42.3555μs 23.6097 KOps/s 22.6505 KOps/s $\color{#35bf28}+4.23\%$
test_set_nested_new 0.2036ms 45.6338μs 21.9136 KOps/s 20.3123 KOps/s $\textbf{\color{#35bf28}+7.88\%}$
test_select 0.2390ms 59.9870μs 16.6703 KOps/s 15.8197 KOps/s $\textbf{\color{#35bf28}+5.38\%}$
test_select_nested 0.5056ms 42.0404μs 23.7867 KOps/s 23.8435 KOps/s $\color{#d91a1a}-0.24\%$
test_exclude_nested 88.8620μs 57.7455μs 17.3174 KOps/s 17.1985 KOps/s $\color{#35bf28}+0.69\%$
test_empty[True] 0.4169ms 0.2574ms 3.8853 KOps/s 3.8835 KOps/s $\color{#35bf28}+0.05\%$
test_empty[False] 3.1091μs 0.7294μs 1.3710 MOps/s 1.3658 MOps/s $\color{#35bf28}+0.39\%$
test_to 55.6210μs 26.5139μs 37.7161 KOps/s 38.1303 KOps/s $\color{#d91a1a}-1.09\%$
test_to_nonblocking 0.1397ms 25.3873μs 39.3897 KOps/s 40.4074 KOps/s $\color{#d91a1a}-2.52\%$
test_unbind_speed 0.3237ms 0.2738ms 3.6521 KOps/s 3.6811 KOps/s $\color{#d91a1a}-0.79\%$
test_unbind_speed_stack0 0.3691ms 0.2743ms 3.6450 KOps/s 3.7322 KOps/s $\color{#d91a1a}-2.34\%$
test_unbind_speed_stack1 91.6997ms 0.7016ms 1.4252 KOps/s 1.4133 KOps/s $\color{#35bf28}+0.84\%$
test_split 93.4815ms 2.0937ms 477.6176 Ops/s 475.9170 Ops/s $\color{#35bf28}+0.36\%$
test_chunk 93.3065ms 2.0880ms 478.9242 Ops/s 473.9808 Ops/s $\color{#35bf28}+1.04\%$
test_creation[device0] 0.3384ms 0.1260ms 7.9375 KOps/s 8.0476 KOps/s $\color{#d91a1a}-1.37\%$
test_creation_from_tensor 0.3436ms 0.1326ms 7.5413 KOps/s 7.6473 KOps/s $\color{#d91a1a}-1.39\%$
test_add_one[memmap_tensor0] 0.2451ms 8.3932μs 119.1446 KOps/s 119.7406 KOps/s $\color{#d91a1a}-0.50\%$
test_contiguous[memmap_tensor0] 34.1510μs 2.0978μs 476.6910 KOps/s 473.3370 KOps/s $\color{#35bf28}+0.71\%$
test_stack[memmap_tensor0] 37.2710μs 6.3965μs 156.3350 KOps/s 155.1482 KOps/s $\color{#35bf28}+0.76\%$
test_memmaptd_index 1.1875ms 0.4139ms 2.4159 KOps/s 2.4252 KOps/s $\color{#d91a1a}-0.38\%$
test_memmaptd_index_astensor 0.7557ms 0.4838ms 2.0668 KOps/s 2.0802 KOps/s $\color{#d91a1a}-0.65\%$
test_memmaptd_index_op 1.3824ms 0.9875ms 1.0127 KOps/s 998.4656 Ops/s $\color{#35bf28}+1.42\%$
test_serialize_model 0.1305s 0.1297s 7.7079 Ops/s 7.6700 Ops/s $\color{#35bf28}+0.49\%$
test_serialize_model_pickle 1.3622s 1.2160s 0.8224 Ops/s 0.8244 Ops/s $\color{#d91a1a}-0.24\%$
test_serialize_weights 0.1303s 0.1294s 7.7280 Ops/s 6.9651 Ops/s $\textbf{\color{#35bf28}+10.95\%}$
test_serialize_weights_returnearly 0.2069s 55.3873ms 18.0547 Ops/s 17.7035 Ops/s $\color{#35bf28}+1.98\%$
test_serialize_weights_pickle 1.3723s 1.2169s 0.8218 Ops/s 0.8172 Ops/s $\color{#35bf28}+0.56\%$
test_reshape_pytree 78.4110μs 35.5523μs 28.1276 KOps/s 29.1380 KOps/s $\color{#d91a1a}-3.47\%$
test_reshape_td 0.1698ms 42.1129μs 23.7457 KOps/s 25.1189 KOps/s $\textbf{\color{#d91a1a}-5.47\%}$
test_view_pytree 0.1835ms 34.0422μs 29.3753 KOps/s 29.5595 KOps/s $\color{#d91a1a}-0.62\%$
test_view_td 0.2252ms 45.4580μs 21.9983 KOps/s 21.9944 KOps/s $\color{#35bf28}+0.02\%$
test_unbind_pytree 0.1818ms 34.4169μs 29.0555 KOps/s 30.0050 KOps/s $\color{#d91a1a}-3.16\%$
test_unbind_td 0.4203ms 42.0444μs 23.7844 KOps/s 23.7180 KOps/s $\color{#35bf28}+0.28\%$
test_split_pytree 0.1400ms 46.5980μs 21.4602 KOps/s 22.0809 KOps/s $\color{#d91a1a}-2.81\%$
test_split_td 93.6307ms 64.8835μs 15.4122 KOps/s 18.1778 KOps/s $\textbf{\color{#d91a1a}-15.21\%}$
test_add_pytree 0.2020ms 55.3219μs 18.0760 KOps/s 17.2539 KOps/s $\color{#35bf28}+4.77\%$
test_add_td 0.2714ms 93.0713μs 10.7444 KOps/s 10.4452 KOps/s $\color{#35bf28}+2.86\%$
test_compile_add_one_nested[tensordict-compile] 0.3107ms 0.1567ms 6.3836 KOps/s 6.0825 KOps/s $\color{#35bf28}+4.95\%$
test_compile_add_one_nested[tensordict-eager] 0.3246ms 0.1632ms 6.1288 KOps/s 6.2705 KOps/s $\color{#d91a1a}-2.26\%$
test_compile_add_one_nested[pytree-compile] 0.2936ms 0.1401ms 7.1391 KOps/s 7.1112 KOps/s $\color{#35bf28}+0.39\%$
test_compile_add_one_nested[pytree-eager] 0.3669ms 0.1883ms 5.3096 KOps/s 5.6556 KOps/s $\textbf{\color{#d91a1a}-6.12\%}$
test_compile_copy_nested[tensordict-compile] 0.1665ms 21.3480μs 46.8427 KOps/s 47.2196 KOps/s $\color{#d91a1a}-0.80\%$
test_compile_copy_nested[tensordict-eager] 0.1787ms 47.8348μs 20.9053 KOps/s 20.9234 KOps/s $\color{#d91a1a}-0.09\%$
test_compile_copy_nested[pytree-compile] 0.2287ms 63.6733μs 15.7052 KOps/s 15.5704 KOps/s $\color{#35bf28}+0.87\%$
test_compile_copy_nested[pytree-eager] 82.7120μs 49.9901μs 20.0040 KOps/s 19.9349 KOps/s $\color{#35bf28}+0.35\%$
test_compile_add_one_flat[tensordict-compile] 0.4523ms 0.3088ms 3.2378 KOps/s 3.1894 KOps/s $\color{#35bf28}+1.52\%$
test_compile_add_one_flat[tensordict-eager] 0.3518ms 0.2390ms 4.1846 KOps/s 4.3252 KOps/s $\color{#d91a1a}-3.25\%$
test_compile_add_one_flat[tensorclass-compile] 0.2731ms 0.1241ms 8.0598 KOps/s 7.7041 KOps/s $\color{#35bf28}+4.62\%$
test_compile_add_one_flat[tensorclass-eager] 0.2575ms 67.9418μs 14.7185 KOps/s 15.5721 KOps/s $\textbf{\color{#d91a1a}-5.48\%}$
test_compile_add_one_flat[pytree-compile] 0.4052ms 0.3072ms 3.2555 KOps/s 3.2274 KOps/s $\color{#35bf28}+0.87\%$
test_compile_add_one_flat[pytree-eager] 0.7731ms 0.5988ms 1.6701 KOps/s 1.7085 KOps/s $\color{#d91a1a}-2.24\%$
test_compile_add_self_flat[tensordict-eager] 0.4045ms 0.2852ms 3.5062 KOps/s 3.5798 KOps/s $\color{#d91a1a}-2.06\%$
test_compile_add_self_flat[tensordict-compile] 0.4281ms 0.3107ms 3.2189 KOps/s 3.1982 KOps/s $\color{#35bf28}+0.65\%$
test_compile_add_self_flat[tensorclass-eager] 0.2384ms 76.9039μs 13.0032 KOps/s 13.3709 KOps/s $\color{#d91a1a}-2.75\%$
test_compile_add_self_flat[tensorclass-compile] 0.2848ms 0.1241ms 8.0579 KOps/s 7.9551 KOps/s $\color{#35bf28}+1.29\%$
test_compile_add_self_flat[pytree-eager] 0.6764ms 0.5028ms 1.9888 KOps/s 1.9856 KOps/s $\color{#35bf28}+0.17\%$
test_compile_add_self_flat[pytree-compile] 0.3963ms 0.3063ms 3.2644 KOps/s 3.2290 KOps/s $\color{#35bf28}+1.10\%$
test_compile_copy_flat[tensordict-compile] 0.1466ms 19.2914μs 51.8365 KOps/s 51.1107 KOps/s $\color{#35bf28}+1.42\%$
test_compile_copy_flat[tensordict-eager] 0.1244ms 37.7128μs 26.5162 KOps/s 24.2778 KOps/s $\textbf{\color{#35bf28}+9.22\%}$
test_compile_copy_flat[pytree-compile] 0.1319ms 69.1124μs 14.4692 KOps/s 14.4558 KOps/s $\color{#35bf28}+0.09\%$
test_compile_copy_flat[pytree-eager] 0.1345ms 51.7569μs 19.3211 KOps/s 19.2115 KOps/s $\color{#35bf28}+0.57\%$
test_compile_assign_and_add[tensordict-compile] 2.3070ms 0.8038ms 1.2440 KOps/s 1.1609 KOps/s $\textbf{\color{#35bf28}+7.16\%}$
test_compile_assign_and_add[tensordict-eager] 3.3563ms 3.1594ms 316.5172 Ops/s 320.3135 Ops/s $\color{#d91a1a}-1.19\%$
test_compile_assign_and_add[pytree-compile] 2.2423ms 0.7884ms 1.2685 KOps/s 1.1786 KOps/s $\textbf{\color{#35bf28}+7.63\%}$
test_compile_assign_and_add[pytree-eager] 3.5497ms 3.1805ms 314.4142 Ops/s 322.6682 Ops/s $\color{#d91a1a}-2.56\%$
test_compile_indexing[tensor-tensordict-compile] 0.2589ms 0.1063ms 9.4103 KOps/s 8.9620 KOps/s $\textbf{\color{#35bf28}+5.00\%}$
test_compile_indexing[tensor-tensordict-eager] 0.2070ms 57.7580μs 17.3136 KOps/s 16.5189 KOps/s $\color{#35bf28}+4.81\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2792ms 0.1043ms 9.5894 KOps/s 9.5049 KOps/s $\color{#35bf28}+0.89\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2276ms 44.9544μs 22.2448 KOps/s 24.1124 KOps/s $\textbf{\color{#d91a1a}-7.75\%}$
test_compile_indexing[tensor-pytree-compile] 0.2751ms 0.1068ms 9.3596 KOps/s 9.4700 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_indexing[tensor-pytree-eager] 0.2177ms 44.9430μs 22.2504 KOps/s 23.6211 KOps/s $\textbf{\color{#d91a1a}-5.80\%}$
test_compile_indexing[slice-tensordict-compile] 0.3102ms 0.1346ms 7.4308 KOps/s 7.2820 KOps/s $\color{#35bf28}+2.04\%$
test_compile_indexing[slice-tensordict-eager] 0.1626ms 24.3497μs 41.0682 KOps/s 41.4173 KOps/s $\color{#d91a1a}-0.84\%$
test_compile_indexing[slice-tensorclass-compile] 0.2810ms 0.1271ms 7.8654 KOps/s 7.5389 KOps/s $\color{#35bf28}+4.33\%$
test_compile_indexing[slice-tensorclass-eager] 72.4620μs 20.3565μs 49.1243 KOps/s 49.7317 KOps/s $\color{#d91a1a}-1.22\%$
test_compile_indexing[slice-pytree-compile] 0.3063ms 0.1332ms 7.5057 KOps/s 7.5600 KOps/s $\color{#d91a1a}-0.72\%$
test_compile_indexing[slice-pytree-eager] 55.9710μs 19.9772μs 50.0571 KOps/s 49.5459 KOps/s $\color{#35bf28}+1.03\%$
test_compile_indexing[int-tensordict-compile] 0.2976ms 0.1343ms 7.4470 KOps/s 7.2013 KOps/s $\color{#35bf28}+3.41\%$
test_compile_indexing[int-tensordict-eager] 0.5119ms 23.3917μs 42.7501 KOps/s 42.4860 KOps/s $\color{#35bf28}+0.62\%$
test_compile_indexing[int-tensorclass-compile] 0.3167ms 0.1277ms 7.8304 KOps/s 7.6040 KOps/s $\color{#35bf28}+2.98\%$
test_compile_indexing[int-tensorclass-eager] 51.0810μs 19.9557μs 50.1111 KOps/s 49.4455 KOps/s $\color{#35bf28}+1.35\%$
test_compile_indexing[int-pytree-compile] 0.3181ms 0.1302ms 7.6793 KOps/s 7.5391 KOps/s $\color{#35bf28}+1.86\%$
test_compile_indexing[int-pytree-eager] 0.1687ms 19.5161μs 51.2396 KOps/s 38.0982 KOps/s $\textbf{\color{#35bf28}+34.49\%}$
test_mod_add[eager] 0.1665ms 30.6627μs 32.6129 KOps/s 31.5429 KOps/s $\color{#35bf28}+3.39\%$
test_mod_add[compile] 0.3142ms 67.6929μs 14.7726 KOps/s 14.5336 KOps/s $\color{#35bf28}+1.64\%$
test_mod_add[compile-overhead] 0.2546ms 0.1317ms 7.5905 KOps/s 6.9130 KOps/s $\textbf{\color{#35bf28}+9.80\%}$
test_mod_wrap[eager] 0.9182ms 0.7782ms 1.2851 KOps/s 1.2874 KOps/s $\color{#d91a1a}-0.18\%$
test_mod_wrap[compile] 2.0699ms 0.8414ms 1.1885 KOps/s 1.2128 KOps/s $\color{#d91a1a}-2.01\%$
test_mod_wrap[compile-overhead] 4.9231ms 3.1099ms 321.5501 Ops/s 328.4527 Ops/s $\color{#d91a1a}-2.10\%$
test_mod_wrap_and_backward[eager] 4.2209ms 4.0731ms 245.5161 Ops/s 242.8560 Ops/s $\color{#35bf28}+1.10\%$
test_mod_wrap_and_backward[compile] 4.6332ms 4.0446ms 247.2407 Ops/s 244.0315 Ops/s $\color{#35bf28}+1.32\%$
test_mod_wrap_and_backward[compile-overhead] 1.4232ms 0.9144ms 1.0936 KOps/s 988.2712 Ops/s $\textbf{\color{#35bf28}+10.66\%}$
test_seq_add[eager] 0.2414ms 93.8996μs 10.6497 KOps/s 10.2250 KOps/s $\color{#35bf28}+4.15\%$
test_seq_add[compile] 0.2234ms 81.8909μs 12.2114 KOps/s 12.6197 KOps/s $\color{#d91a1a}-3.24\%$
test_seq_add[compile-overhead] 0.2954ms 0.1124ms 8.9005 KOps/s 9.0143 KOps/s $\color{#d91a1a}-1.26\%$
test_seq_wrap[eager] 1.0958ms 0.9249ms 1.0812 KOps/s 1.0713 KOps/s $\color{#35bf28}+0.93\%$
test_seq_wrap[compile] 0.9975ms 0.8508ms 1.1754 KOps/s 1.1865 KOps/s $\color{#d91a1a}-0.93\%$
test_seq_wrap[compile-overhead] 0.3732ms 0.2153ms 4.6438 KOps/s 4.5701 KOps/s $\color{#35bf28}+1.61\%$
test_func_call_runtime[False-eager] 2.5665ms 2.3809ms 420.0015 Ops/s 425.6374 Ops/s $\color{#d91a1a}-1.32\%$
test_func_call_runtime[False-compile] 2.5984ms 2.4026ms 416.2140 Ops/s 421.4910 Ops/s $\color{#d91a1a}-1.25\%$
test_func_call_runtime[False-compile-overhead] 0.4921ms 0.3489ms 2.8664 KOps/s 2.8364 KOps/s $\color{#35bf28}+1.06\%$
test_func_call_runtime[True-eager] 2.7300ms 2.5444ms 393.0212 Ops/s 396.1123 Ops/s $\color{#d91a1a}-0.78\%$
test_func_call_runtime[True-compile] 2.6173ms 2.4415ms 409.5892 Ops/s 421.1894 Ops/s $\color{#d91a1a}-2.75\%$
test_func_call_runtime[True-compile-overhead] 0.5165ms 0.3713ms 2.6934 KOps/s 2.7034 KOps/s $\color{#d91a1a}-0.37\%$
test_func_call_cm_runtime[False-eager] 2.5595ms 2.3865ms 419.0274 Ops/s 430.6176 Ops/s $\color{#d91a1a}-2.69\%$
test_func_call_cm_runtime[False-compile] 2.5736ms 2.4108ms 414.8066 Ops/s 423.2245 Ops/s $\color{#d91a1a}-1.99\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5050ms 0.3510ms 2.8488 KOps/s 2.8259 KOps/s $\color{#35bf28}+0.81\%$
test_func_call_cm_runtime[True-eager] 2.8114ms 2.6825ms 372.7829 Ops/s 383.4114 Ops/s $\color{#d91a1a}-2.77\%$
test_func_call_cm_runtime[True-compile] 2.6260ms 2.4662ms 405.4842 Ops/s 412.7853 Ops/s $\color{#d91a1a}-1.77\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5386ms 0.3971ms 2.5185 KOps/s 2.5167 KOps/s $\color{#35bf28}+0.07\%$
test_vmap_func_call_cm_runtime[eager] 4.1816ms 3.7286ms 268.1956 Ops/s 266.1307 Ops/s $\color{#35bf28}+0.78\%$
test_vmap_func_call_cm_runtime[compile] 2.6335ms 2.4751ms 404.0207 Ops/s 409.4466 Ops/s $\color{#d91a1a}-1.33\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5028ms 0.3982ms 2.5112 KOps/s 2.5095 KOps/s $\color{#35bf28}+0.07\%$
test_distributed 3.9205ms 0.2505ms 3.9927 KOps/s 8.8234 KOps/s $\textbf{\color{#d91a1a}-54.75\%}$
test_tdmodule 49.2210μs 14.8678μs 67.2596 KOps/s 62.5852 KOps/s $\textbf{\color{#35bf28}+7.47\%}$
test_tdmodule_dispatch 64.6610μs 28.9026μs 34.5990 KOps/s 33.9713 KOps/s $\color{#35bf28}+1.85\%$
test_tdseq 0.1663ms 16.0630μs 62.2548 KOps/s 62.2775 KOps/s $\color{#d91a1a}-0.04\%$
test_tdseq_dispatch 53.3220μs 31.5261μs 31.7197 KOps/s 31.2112 KOps/s $\color{#35bf28}+1.63\%$
test_instantiation_functorch 2.0609ms 1.8194ms 549.6379 Ops/s 550.6108 Ops/s $\color{#d91a1a}-0.18\%$
test_instantiation_td 1.8237ms 1.1820ms 846.0095 Ops/s 845.6685 Ops/s $\color{#35bf28}+0.04\%$
test_exec_functorch 1.1282ms 0.9969ms 1.0031 KOps/s 1.0025 KOps/s $\color{#35bf28}+0.06\%$
test_exec_functional_call 1.1449ms 1.0148ms 985.4556 Ops/s 1.0004 KOps/s $\color{#d91a1a}-1.50\%$
test_exec_td 1.2017ms 1.0198ms 980.6073 Ops/s 968.2874 Ops/s $\color{#35bf28}+1.27\%$
test_exec_td_decorator 1.4983ms 1.0649ms 939.0631 Ops/s 948.6506 Ops/s $\color{#d91a1a}-1.01\%$
test_vmap_mlp_speed[True-True] 1.7970ms 1.2599ms 793.6834 Ops/s 794.3251 Ops/s $\color{#d91a1a}-0.08\%$
test_vmap_mlp_speed[True-False] 1.4783ms 1.2593ms 794.1142 Ops/s 798.4347 Ops/s $\color{#d91a1a}-0.54\%$
test_vmap_mlp_speed[False-True] 1.2968ms 1.1578ms 863.7061 Ops/s 867.7175 Ops/s $\color{#d91a1a}-0.46\%$
test_vmap_mlp_speed[False-False] 1.4460ms 1.1610ms 861.3355 Ops/s 867.5776 Ops/s $\color{#d91a1a}-0.72\%$
test_vmap_mlp_speed_decorator[True-True] 1.4222ms 1.2502ms 799.9019 Ops/s 809.9387 Ops/s $\color{#d91a1a}-1.24\%$
test_vmap_mlp_speed_decorator[True-False] 1.7866ms 1.2383ms 807.5747 Ops/s 812.7663 Ops/s $\color{#d91a1a}-0.64\%$
test_vmap_mlp_speed_decorator[False-True] 1.3310ms 1.1627ms 860.0515 Ops/s 871.7473 Ops/s $\color{#d91a1a}-1.34\%$
test_vmap_mlp_speed_decorator[False-False] 1.3769ms 1.1726ms 852.8295 Ops/s 872.0383 Ops/s $\color{#d91a1a}-2.20\%$
test_vmap_transformer_speed[True-True] 13.5738ms 13.1860ms 75.8379 Ops/s 76.4798 Ops/s $\color{#d91a1a}-0.84\%$
test_vmap_transformer_speed[True-False] 13.5662ms 13.1390ms 76.1093 Ops/s 76.8451 Ops/s $\color{#d91a1a}-0.96\%$
test_vmap_transformer_speed[False-True] 13.3140ms 12.8687ms 77.7082 Ops/s 78.4436 Ops/s $\color{#d91a1a}-0.94\%$
test_vmap_transformer_speed[False-False] 13.1318ms 12.7653ms 78.3375 Ops/s 78.4858 Ops/s $\color{#d91a1a}-0.19\%$
test_vmap_transformer_speed_decorator[True-True] 34.2207ms 33.6485ms 29.7190 Ops/s 30.1382 Ops/s $\color{#d91a1a}-1.39\%$
test_vmap_transformer_speed_decorator[True-False] 34.4222ms 33.5955ms 29.7659 Ops/s 29.9707 Ops/s $\color{#d91a1a}-0.68\%$
test_vmap_transformer_speed_decorator[False-True] 34.7789ms 33.4314ms 29.9120 Ops/s 30.1098 Ops/s $\color{#d91a1a}-0.66\%$
test_vmap_transformer_speed_decorator[False-False] 33.7773ms 33.4094ms 29.9317 Ops/s 30.1160 Ops/s $\color{#d91a1a}-0.61\%$
test_to_module_speed[True] 1.3737ms 0.9979ms 1.0021 KOps/s 1.0033 KOps/s $\color{#d91a1a}-0.13\%$
test_to_module_speed[False] 1.3732ms 0.9743ms 1.0264 KOps/s 1.0292 KOps/s $\color{#d91a1a}-0.28\%$
test_tc_init 0.4259ms 33.8381μs 29.5525 KOps/s 28.9957 KOps/s $\color{#35bf28}+1.92\%$
test_tc_init_nested 0.1127ms 68.0049μs 14.7048 KOps/s 14.0659 KOps/s $\color{#35bf28}+4.54\%$
test_tc_first_layer_tensor 56.4671μs 0.6659μs 1.5017 MOps/s 1.5043 MOps/s $\color{#d91a1a}-0.18\%$
test_tc_first_layer_nontensor 0.3785ms 2.1919μs 456.2215 KOps/s 453.3263 KOps/s $\color{#35bf28}+0.64\%$
test_tc_second_layer_tensor 48.8260μs 1.3444μs 743.8004 KOps/s 731.9398 KOps/s $\color{#35bf28}+1.62\%$
test_tc_second_layer_nontensor 27.4800μs 2.8982μs 345.0441 KOps/s 343.1963 KOps/s $\color{#35bf28}+0.54\%$
test_unbind 0.1940s 12.2025ms 81.9504 Ops/s 90.9106 Ops/s $\textbf{\color{#d91a1a}-9.86\%}$
test_full_like 0.7866ms 0.5748ms 1.7399 KOps/s 1.7381 KOps/s $\color{#35bf28}+0.10\%$
test_zeros_like 0.3267ms 0.1981ms 5.0470 KOps/s 5.0472 KOps/s $-0.00\%$
test_ones_like 0.5514ms 0.1981ms 5.0488 KOps/s 5.0497 KOps/s $\color{#d91a1a}-0.02\%$
test_clone 0.7105ms 0.4143ms 2.4136 KOps/s 2.4051 KOps/s $\color{#35bf28}+0.36\%$
test_squeeze 0.1329ms 9.6198μs 103.9519 KOps/s 104.3493 KOps/s $\color{#d91a1a}-0.38\%$
test_unsqueeze 0.4491ms 72.3694μs 13.8180 KOps/s 13.7808 KOps/s $\color{#35bf28}+0.27\%$
test_split 0.5967ms 0.1534ms 6.5183 KOps/s 6.3711 KOps/s $\color{#35bf28}+2.31\%$
test_permute 0.2439ms 0.1747ms 5.7234 KOps/s 5.7321 KOps/s $\color{#d91a1a}-0.15\%$
test_stack 1.2536ms 0.8595ms 1.1634 KOps/s 1.1587 KOps/s $\color{#35bf28}+0.41\%$
test_cat 1.3527ms 1.2320ms 811.6997 Ops/s 811.4897 Ops/s $\color{#35bf28}+0.03\%$

@vmoens vmoens added the bug Something isn't working label Oct 4, 2024
@vmoens vmoens merged commit 04faf40 into main Oct 4, 2024
52 of 57 checks passed
@vmoens vmoens deleted the fix-apply-td branch October 4, 2024 12:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants