[Quality] Better type hints for __init__ #1014

Merged (1 commit) on Oct 1, 2024

vmoens (Contributor) commented on Sep 30, 2024:

No description provided.

@facebook-github-bot added the "CLA Signed" label on Sep 30, 2024.

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}20$. Worsened: $\large\color{#d91a1a}9$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 44.0120μs 21.0774μs 47.4441 KOps/s 46.6907 KOps/s $\color{#35bf28}+1.61\%$
test_plain_set_stack_nested 51.6360μs 20.9695μs 47.6882 KOps/s 46.1934 KOps/s $\color{#35bf28}+3.24\%$
test_plain_set_nested_inplace 71.0330μs 22.5010μs 44.4425 KOps/s 42.3702 KOps/s $\color{#35bf28}+4.89\%$
test_plain_set_stack_nested_inplace 61.5850μs 22.3309μs 44.7810 KOps/s 42.8727 KOps/s $\color{#35bf28}+4.45\%$
test_items 43.6010μs 4.2160μs 237.1940 KOps/s 242.3828 KOps/s $\color{#d91a1a}-2.14\%$
test_items_nested 0.6444ms 0.3670ms 2.7249 KOps/s 2.7741 KOps/s $\color{#d91a1a}-1.77\%$
test_items_nested_locked 0.4855ms 0.3647ms 2.7423 KOps/s 2.7203 KOps/s $\color{#35bf28}+0.81\%$
test_items_nested_leaf 0.1336ms 68.7473μs 14.5460 KOps/s 14.4169 KOps/s $\color{#35bf28}+0.90\%$
test_items_stack_nested 0.5467ms 0.3665ms 2.7284 KOps/s 2.7488 KOps/s $\color{#d91a1a}-0.74\%$
test_items_stack_nested_leaf 0.1488ms 71.2546μs 14.0342 KOps/s 13.8255 KOps/s $\color{#35bf28}+1.51\%$
test_items_stack_nested_locked 0.7999ms 0.3684ms 2.7146 KOps/s 2.7465 KOps/s $\color{#d91a1a}-1.16\%$
test_keys 42.2590μs 3.5393μs 282.5456 KOps/s 288.4870 KOps/s $\color{#d91a1a}-2.06\%$
test_keys_nested 0.1752ms 0.1002ms 9.9832 KOps/s 9.8483 KOps/s $\color{#35bf28}+1.37\%$
test_keys_nested_locked 1.7074ms 0.1063ms 9.4073 KOps/s 9.3540 KOps/s $\color{#35bf28}+0.57\%$
test_keys_nested_leaf 0.1539ms 82.5993μs 12.1066 KOps/s 11.5557 KOps/s $\color{#35bf28}+4.77\%$
test_keys_stack_nested 0.1881ms 0.1005ms 9.9478 KOps/s 9.7029 KOps/s $\color{#35bf28}+2.52\%$
test_keys_stack_nested_leaf 0.1644ms 82.3960μs 12.1365 KOps/s 11.4535 KOps/s $\textbf{\color{#35bf28}+5.96\%}$
test_keys_stack_nested_locked 0.2291ms 0.1056ms 9.4693 KOps/s 9.2834 KOps/s $\color{#35bf28}+2.00\%$
test_values 8.0868μs 1.0416μs 960.0628 KOps/s 881.5024 KOps/s $\textbf{\color{#35bf28}+8.91\%}$
test_values_nested 0.1507ms 76.1195μs 13.1372 KOps/s 13.2374 KOps/s $\color{#d91a1a}-0.76\%$
test_values_nested_locked 0.1238ms 74.8137μs 13.3665 KOps/s 13.2398 KOps/s $\color{#35bf28}+0.96\%$
test_values_nested_leaf 0.1015ms 61.8068μs 16.1794 KOps/s 16.4209 KOps/s $\color{#d91a1a}-1.47\%$
test_values_stack_nested 0.1209ms 76.8370μs 13.0146 KOps/s 13.1394 KOps/s $\color{#d91a1a}-0.95\%$
test_values_stack_nested_leaf 0.1181ms 61.2225μs 16.3339 KOps/s 16.2425 KOps/s $\color{#35bf28}+0.56\%$
test_values_stack_nested_locked 0.1749ms 76.6338μs 13.0491 KOps/s 13.1715 KOps/s $\color{#d91a1a}-0.93\%$
test_membership 6.3847μs 0.7609μs 1.3143 MOps/s 1.3825 MOps/s $\color{#d91a1a}-4.93\%$
test_membership_nested 19.7860μs 2.8210μs 354.4802 KOps/s 361.8672 KOps/s $\color{#d91a1a}-2.04\%$
test_membership_nested_leaf 39.2830μs 2.7933μs 358.0051 KOps/s 367.7848 KOps/s $\color{#d91a1a}-2.66\%$
test_membership_stacked_nested 21.8010μs 2.8117μs 355.6625 KOps/s 366.9278 KOps/s $\color{#d91a1a}-3.07\%$
test_membership_stacked_nested_leaf 44.0420μs 2.7663μs 361.4954 KOps/s 363.8794 KOps/s $\color{#d91a1a}-0.66\%$
test_membership_nested_last 39.6230μs 4.0723μs 245.5602 KOps/s 250.0769 KOps/s $\color{#d91a1a}-1.81\%$
test_membership_nested_leaf_last 41.0560μs 3.9872μs 250.8055 KOps/s 251.0687 KOps/s $\color{#d91a1a}-0.10\%$
test_membership_stacked_nested_last 47.1850μs 6.0338μs 165.7338 KOps/s 247.0787 KOps/s $\textbf{\color{#d91a1a}-32.92\%}$
test_membership_stacked_nested_leaf_last 23.1430μs 6.0469μs 165.3750 KOps/s 254.7932 KOps/s $\textbf{\color{#d91a1a}-35.09\%}$
test_nested_getleaf 36.1580μs 10.6344μs 94.0347 KOps/s 92.6083 KOps/s $\color{#35bf28}+1.54\%$
test_nested_get 42.7600μs 10.1054μs 98.9573 KOps/s 98.5970 KOps/s $\color{#35bf28}+0.37\%$
test_stacked_getleaf 35.2850μs 10.5619μs 94.6798 KOps/s 94.9915 KOps/s $\color{#d91a1a}-0.33\%$
test_stacked_get 31.5990μs 10.0632μs 99.3720 KOps/s 99.6309 KOps/s $\color{#d91a1a}-0.26\%$
test_nested_getitemleaf 32.9520μs 11.0696μs 90.3372 KOps/s 90.5003 KOps/s $\color{#d91a1a}-0.18\%$
test_nested_getitem 35.6260μs 10.3229μs 96.8719 KOps/s 96.6961 KOps/s $\color{#35bf28}+0.18\%$
test_stacked_getitemleaf 38.1140μs 10.7927μs 92.6551 KOps/s 91.3784 KOps/s $\color{#35bf28}+1.40\%$
test_stacked_getitem 37.6000μs 10.2940μs 97.1441 KOps/s 95.5473 KOps/s $\color{#35bf28}+1.67\%$
test_lock_nested 85.4173ms 0.5832ms 1.7147 KOps/s 2.0098 KOps/s $\textbf{\color{#d91a1a}-14.68\%}$
test_lock_stack_nested 0.5450ms 0.4589ms 2.1789 KOps/s 2.1288 KOps/s $\color{#35bf28}+2.35\%$
test_unlock_nested 88.2302ms 0.5037ms 1.9853 KOps/s 2.3915 KOps/s $\textbf{\color{#d91a1a}-16.99\%}$
test_unlock_stack_nested 0.8301ms 0.3756ms 2.6624 KOps/s 2.5689 KOps/s $\color{#35bf28}+3.64\%$
test_flatten_speed 0.1801ms 89.1430μs 11.2179 KOps/s 11.5302 KOps/s $\color{#d91a1a}-2.71\%$
test_unflatten_speed 0.8030ms 0.4677ms 2.1383 KOps/s 2.1717 KOps/s $\color{#d91a1a}-1.54\%$
test_common_ops 4.4715ms 1.1447ms 873.6072 Ops/s 842.8991 Ops/s $\color{#35bf28}+3.64\%$
test_creation 32.8310μs 2.1392μs 467.4727 KOps/s 470.6146 KOps/s $\color{#d91a1a}-0.67\%$
test_creation_empty 62.1960μs 19.2708μs 51.8919 KOps/s 50.4085 KOps/s $\color{#35bf28}+2.94\%$
test_creation_nested_1 0.1029ms 22.3093μs 44.8244 KOps/s 43.2674 KOps/s $\color{#35bf28}+3.60\%$
test_creation_nested_2 69.2790μs 26.8594μs 37.2309 KOps/s 35.6080 KOps/s $\color{#35bf28}+4.56\%$
test_clone 54.0920μs 16.8710μs 59.2731 KOps/s 57.2052 KOps/s $\color{#35bf28}+3.61\%$
test_getitem[int] 1.2416ms 16.8257μs 59.4329 KOps/s 57.5689 KOps/s $\color{#35bf28}+3.24\%$
test_getitem[slice_int] 0.1389ms 30.8377μs 32.4278 KOps/s 31.3548 KOps/s $\color{#35bf28}+3.42\%$
test_getitem[range] 0.1704ms 58.8995μs 16.9781 KOps/s 16.7406 KOps/s $\color{#35bf28}+1.42\%$
test_getitem[tuple] 0.1627ms 25.3701μs 39.4164 KOps/s 38.8745 KOps/s $\color{#35bf28}+1.39\%$
test_getitem[list] 0.1689ms 54.2877μs 18.4204 KOps/s 18.1328 KOps/s $\color{#35bf28}+1.59\%$
test_setitem_dim[int] 66.2040μs 33.2291μs 30.0941 KOps/s 29.2279 KOps/s $\color{#35bf28}+2.96\%$
test_setitem_dim[slice_int] 0.1099ms 62.5459μs 15.9883 KOps/s 15.7417 KOps/s $\color{#35bf28}+1.57\%$
test_setitem_dim[range] 0.1881ms 85.7753μs 11.6584 KOps/s 11.6425 KOps/s $\color{#35bf28}+0.14\%$
test_setitem_dim[tuple] 0.1263ms 50.4566μs 19.8190 KOps/s 19.5962 KOps/s $\color{#35bf28}+1.14\%$
test_setitem 77.6050μs 30.1788μs 33.1358 KOps/s 31.8930 KOps/s $\color{#35bf28}+3.90\%$
test_set 86.0910μs 29.4854μs 33.9151 KOps/s 33.0064 KOps/s $\color{#35bf28}+2.75\%$
test_set_shared 1.3341ms 0.2096ms 4.7708 KOps/s 4.5743 KOps/s $\color{#35bf28}+4.30\%$
test_update 0.1367ms 37.7601μs 26.4830 KOps/s 26.3043 KOps/s $\color{#35bf28}+0.68\%$
test_update_nested 0.1216ms 47.7780μs 20.9301 KOps/s 20.0611 KOps/s $\color{#35bf28}+4.33\%$
test_update__nested 0.1035ms 34.6719μs 28.8418 KOps/s 28.5212 KOps/s $\color{#35bf28}+1.12\%$
test_set_nested 0.1047ms 32.2348μs 31.0223 KOps/s 30.1107 KOps/s $\color{#35bf28}+3.03\%$
test_set_nested_new 0.1093ms 37.9508μs 26.3499 KOps/s 26.2814 KOps/s $\color{#35bf28}+0.26\%$
test_select 0.1132ms 55.6202μs 17.9791 KOps/s 17.3437 KOps/s $\color{#35bf28}+3.66\%$
test_select_nested 0.1403ms 62.9603μs 15.8830 KOps/s 16.0309 KOps/s $\color{#d91a1a}-0.92\%$
test_exclude_nested 0.1604ms 77.2830μs 12.9395 KOps/s 12.9548 KOps/s $\color{#d91a1a}-0.12\%$
test_empty[True] 0.4121ms 0.3217ms 3.1086 KOps/s 3.1162 KOps/s $\color{#d91a1a}-0.24\%$
test_empty[False] 11.1233μs 1.3560μs 737.4496 KOps/s 806.4615 KOps/s $\textbf{\color{#d91a1a}-8.56\%}$
test_unbind_speed 0.6174ms 0.3133ms 3.1919 KOps/s 3.1770 KOps/s $\color{#35bf28}+0.47\%$
test_unbind_speed_stack0 0.5593ms 0.3028ms 3.3022 KOps/s 3.2510 KOps/s $\color{#35bf28}+1.58\%$
test_unbind_speed_stack1 91.5832ms 0.7683ms 1.3016 KOps/s 1.3205 KOps/s $\color{#d91a1a}-1.43\%$
test_split 98.5730ms 2.3694ms 422.0453 Ops/s 440.8101 Ops/s $\color{#d91a1a}-4.26\%$
test_chunk 3.0278ms 1.9852ms 503.7386 Ops/s 446.8641 Ops/s $\textbf{\color{#35bf28}+12.73\%}$
test_creation[device0] 0.2530ms 0.1153ms 8.6731 KOps/s 8.4368 KOps/s $\color{#35bf28}+2.80\%$
test_creation_from_tensor 0.2626ms 0.1143ms 8.7503 KOps/s 8.3238 KOps/s $\textbf{\color{#35bf28}+5.12\%}$
test_add_one[memmap_tensor0] 0.1884ms 6.8662μs 145.6406 KOps/s 136.3610 KOps/s $\textbf{\color{#35bf28}+6.81\%}$
test_contiguous[memmap_tensor0] 23.3730μs 1.9269μs 518.9767 KOps/s 517.5216 KOps/s $\color{#35bf28}+0.28\%$
test_stack[memmap_tensor0] 45.0540μs 5.4370μs 183.9236 KOps/s 173.1370 KOps/s $\textbf{\color{#35bf28}+6.23\%}$
test_memmaptd_index 1.1810ms 0.3999ms 2.5006 KOps/s 2.4354 KOps/s $\color{#35bf28}+2.68\%$
test_memmaptd_index_astensor 0.9524ms 0.4765ms 2.0986 KOps/s 2.0551 KOps/s $\color{#35bf28}+2.11\%$
test_memmaptd_index_op 1.8363ms 1.0267ms 973.9634 Ops/s 930.4954 Ops/s $\color{#35bf28}+4.67\%$
test_serialize_model 0.2127s 0.1350s 7.4059 Ops/s 8.3025 Ops/s $\textbf{\color{#d91a1a}-10.80\%}$
test_serialize_model_pickle 0.4670s 0.3979s 2.5133 Ops/s 2.5485 Ops/s $\color{#d91a1a}-1.38\%$
test_serialize_weights 0.1265s 0.1128s 8.8649 Ops/s 7.7833 Ops/s $\textbf{\color{#35bf28}+13.90\%}$
test_serialize_weights_returnearly 0.2163s 0.1662s 6.0169 Ops/s 6.2934 Ops/s $\color{#d91a1a}-4.39\%$
test_serialize_weights_pickle 1.1555s 0.7405s 1.3503 Ops/s 2.4760 Ops/s $\textbf{\color{#d91a1a}-45.46\%}$
test_serialize_weights_filesystem 0.2212s 0.1517s 6.5933 Ops/s 6.9449 Ops/s $\textbf{\color{#d91a1a}-5.06\%}$
test_serialize_model_filesystem 0.1480s 0.1435s 6.9662 Ops/s 6.5951 Ops/s $\textbf{\color{#35bf28}+5.63\%}$
test_reshape_pytree 0.1209ms 39.5748μs 25.2686 KOps/s 24.4831 KOps/s $\color{#35bf28}+3.21\%$
test_reshape_td 0.1013ms 44.8006μs 22.3211 KOps/s 20.4547 KOps/s $\textbf{\color{#35bf28}+9.12\%}$
test_view_pytree 0.1064ms 38.3381μs 26.0837 KOps/s 25.7073 KOps/s $\color{#35bf28}+1.46\%$
test_view_td 0.1082ms 50.0528μs 19.9789 KOps/s 18.9302 KOps/s $\textbf{\color{#35bf28}+5.54\%}$
test_unbind_pytree 69.6800μs 35.0220μs 28.5535 KOps/s 27.4339 KOps/s $\color{#35bf28}+4.08\%$
test_unbind_td 0.3158ms 45.9306μs 21.7720 KOps/s 21.4642 KOps/s $\color{#35bf28}+1.43\%$
test_split_pytree 89.3070μs 36.7442μs 27.2152 KOps/s 25.8113 KOps/s $\textbf{\color{#35bf28}+5.44\%}$
test_split_td 0.4934ms 58.3400μs 17.1409 KOps/s 16.7800 KOps/s $\color{#35bf28}+2.15\%$
test_add_pytree 91.7410μs 44.3993μs 22.5229 KOps/s 21.9347 KOps/s $\color{#35bf28}+2.68\%$
test_add_td 0.1529ms 79.5487μs 12.5709 KOps/s 11.4448 KOps/s $\textbf{\color{#35bf28}+9.84\%}$
test_compile_add_one_nested[tensordict-compile] 0.1538ms 56.7522μs 17.6205 KOps/s 17.5662 KOps/s $\color{#35bf28}+0.31\%$
test_compile_add_one_nested[tensordict-eager] 0.3434ms 0.1764ms 5.6679 KOps/s 5.6285 KOps/s $\color{#35bf28}+0.70\%$
test_compile_add_one_nested[pytree-compile] 0.1222ms 56.5851μs 17.6725 KOps/s 17.4106 KOps/s $\color{#35bf28}+1.50\%$
test_compile_add_one_nested[pytree-eager] 0.2631ms 0.1390ms 7.1940 KOps/s 7.0621 KOps/s $\color{#35bf28}+1.87\%$
test_compile_copy_nested[tensordict-compile] 64.5500μs 21.5185μs 46.4716 KOps/s 46.2585 KOps/s $\color{#35bf28}+0.46\%$
test_compile_copy_nested[tensordict-eager] 0.1365ms 68.1202μs 14.6799 KOps/s 14.6006 KOps/s $\color{#35bf28}+0.54\%$
test_compile_copy_nested[pytree-compile] 0.1431ms 76.6082μs 13.0534 KOps/s 13.1560 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_copy_nested[pytree-eager] 0.1277ms 69.4016μs 14.4089 KOps/s 14.7274 KOps/s $\color{#d91a1a}-2.16\%$
test_compile_add_one_flat[tensordict-compile] 0.2782ms 0.1718ms 5.8195 KOps/s 5.8478 KOps/s $\color{#d91a1a}-0.48\%$
test_compile_add_one_flat[tensordict-eager] 0.3376ms 0.1883ms 5.3116 KOps/s 5.2043 KOps/s $\color{#35bf28}+2.06\%$
test_compile_add_one_flat[tensorclass-compile] 0.1371ms 46.7727μs 21.3800 KOps/s 21.9388 KOps/s $\color{#d91a1a}-2.55\%$
test_compile_add_one_flat[tensorclass-eager] 0.1353ms 67.9183μs 14.7236 KOps/s 14.4393 KOps/s $\color{#35bf28}+1.97\%$
test_compile_add_one_flat[pytree-compile] 0.3797ms 0.1777ms 5.6283 KOps/s 5.7867 KOps/s $\color{#d91a1a}-2.74\%$
test_compile_add_one_flat[pytree-eager] 0.4971ms 0.2845ms 3.5152 KOps/s 3.3704 KOps/s $\color{#35bf28}+4.30\%$
test_compile_add_self_flat[tensordict-eager] 0.4146ms 0.2037ms 4.9090 KOps/s 4.9231 KOps/s $\color{#d91a1a}-0.28\%$
test_compile_add_self_flat[tensordict-compile] 0.3485ms 0.1786ms 5.5988 KOps/s 5.7325 KOps/s $\color{#d91a1a}-2.33\%$
test_compile_add_self_flat[tensorclass-eager] 0.1668ms 62.4679μs 16.0082 KOps/s 16.2227 KOps/s $\color{#d91a1a}-1.32\%$
test_compile_add_self_flat[tensorclass-compile] 0.1024ms 46.5334μs 21.4899 KOps/s 20.6943 KOps/s $\color{#35bf28}+3.84\%$
test_compile_add_self_flat[pytree-eager] 0.4310ms 0.2345ms 4.2635 KOps/s 4.1958 KOps/s $\color{#35bf28}+1.61\%$
test_compile_add_self_flat[pytree-compile] 0.3242ms 0.1739ms 5.7491 KOps/s 5.7894 KOps/s $\color{#d91a1a}-0.70\%$
test_compile_copy_flat[tensordict-compile] 0.1873ms 0.1036ms 9.6542 KOps/s 9.7329 KOps/s $\color{#d91a1a}-0.81\%$
test_compile_copy_flat[tensordict-eager] 0.1246ms 56.3497μs 17.7463 KOps/s 17.9221 KOps/s $\color{#d91a1a}-0.98\%$
test_compile_copy_flat[pytree-compile] 0.1671ms 78.5123μs 12.7369 KOps/s 13.1424 KOps/s $\color{#d91a1a}-3.09\%$
test_compile_copy_flat[pytree-eager] 0.1352ms 70.3928μs 14.2060 KOps/s 14.6015 KOps/s $\color{#d91a1a}-2.71\%$
test_compile_assign_and_add[tensordict-compile] 0.2806ms 0.1951ms 5.1247 KOps/s 5.1185 KOps/s $\color{#35bf28}+0.12\%$
test_compile_assign_and_add[tensordict-eager] 1.8879ms 1.6562ms 603.7965 Ops/s 591.2945 Ops/s $\color{#35bf28}+2.11\%$
test_compile_assign_and_add[pytree-compile] 0.3781ms 0.1899ms 5.2673 KOps/s 5.1424 KOps/s $\color{#35bf28}+2.43\%$
test_compile_assign_and_add[pytree-eager] 1.3229ms 1.0867ms 920.2327 Ops/s 894.6870 Ops/s $\color{#35bf28}+2.86\%$
test_compile_assign_and_add_stack[compile] 0.5177ms 0.4195ms 2.3838 KOps/s 2.2821 KOps/s $\color{#35bf28}+4.46\%$
test_compile_assign_and_add_stack[eager] 5.3896ms 3.9257ms 254.7286 Ops/s 248.0452 Ops/s $\color{#35bf28}+2.69\%$
test_compile_indexing[tensor-tensordict-compile] 0.1155ms 34.2815μs 29.1702 KOps/s 29.2925 KOps/s $\color{#d91a1a}-0.42\%$
test_compile_indexing[tensor-tensordict-eager] 0.8144ms 50.1193μs 19.9524 KOps/s 19.4178 KOps/s $\color{#35bf28}+2.75\%$
test_compile_indexing[tensor-tensorclass-compile] 75.1230μs 29.0070μs 34.4744 KOps/s 34.3270 KOps/s $\color{#35bf28}+0.43\%$
test_compile_indexing[tensor-tensorclass-eager] 99.8410μs 29.6156μs 33.7659 KOps/s 34.0474 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_indexing[tensor-pytree-compile] 0.1082ms 29.2003μs 34.2463 KOps/s 33.8306 KOps/s $\color{#35bf28}+1.23\%$
test_compile_indexing[tensor-pytree-eager] 75.4230μs 29.6809μs 33.6917 KOps/s 33.8768 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_indexing[slice-tensordict-compile] 0.1556ms 73.1056μs 13.6788 KOps/s 13.6447 KOps/s $\color{#35bf28}+0.25\%$
test_compile_indexing[slice-tensordict-eager] 0.6645ms 27.7986μs 35.9730 KOps/s 34.6192 KOps/s $\color{#35bf28}+3.91\%$
test_compile_indexing[slice-tensorclass-compile] 0.1438ms 66.4312μs 15.0532 KOps/s 14.7249 KOps/s $\color{#35bf28}+2.23\%$
test_compile_indexing[slice-tensorclass-eager] 90.5020μs 23.4557μs 42.6336 KOps/s 43.4961 KOps/s $\color{#d91a1a}-1.98\%$
test_compile_indexing[slice-pytree-compile] 0.1382ms 65.9773μs 15.1567 KOps/s 14.7242 KOps/s $\color{#35bf28}+2.94\%$
test_compile_indexing[slice-pytree-eager] 86.4390μs 23.2944μs 42.9288 KOps/s 43.0081 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_indexing[int-tensordict-compile] 0.1406ms 72.5105μs 13.7911 KOps/s 13.7826 KOps/s $\color{#35bf28}+0.06\%$
test_compile_indexing[int-tensordict-eager] 0.9344ms 28.0329μs 35.6724 KOps/s 35.3376 KOps/s $\color{#35bf28}+0.95\%$
test_compile_indexing[int-tensorclass-compile] 0.1259ms 65.6456μs 15.2333 KOps/s 14.6962 KOps/s $\color{#35bf28}+3.65\%$
test_compile_indexing[int-tensorclass-eager] 83.6010μs 22.5818μs 44.2834 KOps/s 43.9619 KOps/s $\color{#35bf28}+0.73\%$
test_compile_indexing[int-pytree-compile] 0.1557ms 66.3889μs 15.0628 KOps/s 14.6292 KOps/s $\color{#35bf28}+2.96\%$
test_compile_indexing[int-pytree-eager] 77.8350μs 23.0616μs 43.3622 KOps/s 43.7250 KOps/s $\color{#d91a1a}-0.83\%$
test_mod_add[eager] 0.1053ms 26.1953μs 38.1749 KOps/s 36.6905 KOps/s $\color{#35bf28}+4.05\%$
test_mod_add[compile] 83.2160μs 39.2093μs 25.5041 KOps/s 25.7465 KOps/s $\color{#d91a1a}-0.94\%$
test_mod_add[compile-overhead] 0.1236ms 38.6709μs 25.8592 KOps/s 25.5787 KOps/s $\color{#35bf28}+1.10\%$
test_mod_wrap[eager] 0.4188ms 0.2071ms 4.8295 KOps/s 4.6797 KOps/s $\color{#35bf28}+3.20\%$
test_mod_wrap[compile] 0.3848ms 0.2279ms 4.3888 KOps/s 4.2219 KOps/s $\color{#35bf28}+3.95\%$
test_mod_wrap[compile-overhead] 0.3889ms 0.2346ms 4.2620 KOps/s 4.2607 KOps/s $\color{#35bf28}+0.03\%$
test_mod_wrap_and_backward[eager] 12.6892ms 10.7559ms 92.9721 Ops/s 93.3055 Ops/s $\color{#d91a1a}-0.36\%$
test_mod_wrap_and_backward[compile] 12.8900ms 10.8119ms 92.4904 Ops/s 85.3572 Ops/s $\textbf{\color{#35bf28}+8.36\%}$
test_mod_wrap_and_backward[compile-overhead] 12.0607ms 10.9284ms 91.5045 Ops/s 81.5348 Ops/s $\textbf{\color{#35bf28}+12.23\%}$
test_seq_add[eager] 0.1556ms 93.1427μs 10.7362 KOps/s 10.5911 KOps/s $\color{#35bf28}+1.37\%$
test_seq_add[compile] 0.1577ms 65.2016μs 15.3371 KOps/s 15.3753 KOps/s $\color{#d91a1a}-0.25\%$
test_seq_add[compile-overhead] 0.1321ms 64.3526μs 15.5394 KOps/s 15.5708 KOps/s $\color{#d91a1a}-0.20\%$
test_seq_wrap[eager] 0.5267ms 0.3893ms 2.5687 KOps/s 2.4842 KOps/s $\color{#35bf28}+3.40\%$
test_seq_wrap[compile] 1.2352ms 0.2629ms 3.8038 KOps/s 3.6247 KOps/s $\color{#35bf28}+4.94\%$
test_seq_wrap[compile-overhead] 1.2591ms 0.2640ms 3.7883 KOps/s 3.5879 KOps/s $\textbf{\color{#35bf28}+5.58\%}$
test_func_call_runtime[False-eager] 0.6358ms 0.5085ms 1.9665 KOps/s 1.9105 KOps/s $\color{#35bf28}+2.93\%$
test_func_call_runtime[False-compile] 0.9334ms 0.4934ms 2.0266 KOps/s 1.9923 KOps/s $\color{#35bf28}+1.72\%$
test_func_call_runtime[False-compile-overhead] 0.6218ms 0.4946ms 2.0219 KOps/s 1.9973 KOps/s $\color{#35bf28}+1.23\%$
test_func_call_runtime[True-eager] 1.1622ms 0.7353ms 1.3600 KOps/s 1.3431 KOps/s $\color{#35bf28}+1.26\%$
test_func_call_runtime[True-compile] 1.1623ms 0.5374ms 1.8609 KOps/s 1.9366 KOps/s $\color{#d91a1a}-3.91\%$
test_func_call_runtime[True-compile-overhead] 0.6177ms 0.5090ms 1.9648 KOps/s 1.9429 KOps/s $\color{#35bf28}+1.13\%$
test_func_call_cm_runtime[False-eager] 0.8298ms 0.5090ms 1.9648 KOps/s 1.9174 KOps/s $\color{#35bf28}+2.47\%$
test_func_call_cm_runtime[False-compile] 0.7064ms 0.4986ms 2.0055 KOps/s 1.9790 KOps/s $\color{#35bf28}+1.34\%$
test_func_call_cm_runtime[False-compile-overhead] 1.0224ms 0.4986ms 2.0055 KOps/s 1.9870 KOps/s $\color{#35bf28}+0.93\%$
test_func_call_cm_runtime[True-eager] 1.0819ms 0.8731ms 1.1454 KOps/s 1.1468 KOps/s $\color{#d91a1a}-0.13\%$
test_func_call_cm_runtime[True-compile] 0.9198ms 0.7384ms 1.3542 KOps/s 1.3290 KOps/s $\color{#35bf28}+1.90\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1034ms 0.7423ms 1.3471 KOps/s 1.3280 KOps/s $\color{#35bf28}+1.44\%$
test_vmap_func_call_cm_runtime[eager] 3.4547ms 1.8674ms 535.5166 Ops/s 527.2857 Ops/s $\color{#35bf28}+1.56\%$
test_vmap_func_call_cm_runtime[compile] 2.6852ms 1.9054ms 524.8326 Ops/s 512.5661 Ops/s $\color{#35bf28}+2.39\%$
test_vmap_func_call_cm_runtime[compile-overhead] 2.7134ms 1.9107ms 523.3613 Ops/s 511.2862 Ops/s $\color{#35bf28}+2.36\%$
test_distributed 0.2739ms 0.1278ms 7.8236 KOps/s 7.8289 KOps/s $\color{#d91a1a}-0.07\%$
test_tdmodule 78.3670μs 19.1651μs 52.1782 KOps/s 51.3447 KOps/s $\color{#35bf28}+1.62\%$
test_tdmodule_dispatch 63.3690μs 38.0526μs 26.2794 KOps/s 26.3188 KOps/s $\color{#d91a1a}-0.15\%$
test_tdseq 40.4460μs 21.5189μs 46.4708 KOps/s 44.9019 KOps/s $\color{#35bf28}+3.49\%$
test_tdseq_dispatch 81.6530μs 43.4887μs 22.9945 KOps/s 22.6597 KOps/s $\color{#35bf28}+1.48\%$
test_instantiation_functorch 2.1023ms 1.5452ms 647.1759 Ops/s 631.6476 Ops/s $\color{#35bf28}+2.46\%$
test_instantiation_td 1.9449ms 1.1653ms 858.1572 Ops/s 848.0021 Ops/s $\color{#35bf28}+1.20\%$
test_exec_functorch 5.1691ms 0.1854ms 5.3945 KOps/s 5.3655 KOps/s $\color{#35bf28}+0.54\%$
test_exec_functional_call 0.3376ms 0.1748ms 5.7220 KOps/s 5.7595 KOps/s $\color{#d91a1a}-0.65\%$
test_exec_td 0.4111ms 0.1694ms 5.9016 KOps/s 5.8331 KOps/s $\color{#35bf28}+1.17\%$
test_exec_td_decorator 1.1186ms 0.2290ms 4.3664 KOps/s 4.5134 KOps/s $\color{#d91a1a}-3.26\%$
test_vmap_mlp_speed[True-True] 0.8811ms 0.6405ms 1.5613 KOps/s 1.4384 KOps/s $\textbf{\color{#35bf28}+8.54\%}$
test_vmap_mlp_speed[True-False] 0.9672ms 0.6468ms 1.5460 KOps/s 1.5052 KOps/s $\color{#35bf28}+2.71\%$
test_vmap_mlp_speed[False-True] 0.9304ms 0.4974ms 2.0106 KOps/s 1.9636 KOps/s $\color{#35bf28}+2.40\%$
test_vmap_mlp_speed[False-False] 0.7759ms 0.4915ms 2.0346 KOps/s 1.9735 KOps/s $\color{#35bf28}+3.09\%$
test_vmap_mlp_speed_decorator[True-True] 1.3772ms 0.6253ms 1.5992 KOps/s 1.5950 KOps/s $\color{#35bf28}+0.27\%$
test_vmap_mlp_speed_decorator[True-False] 1.0357ms 0.6267ms 1.5958 KOps/s 1.5759 KOps/s $\color{#35bf28}+1.26\%$
test_vmap_mlp_speed_decorator[False-True] 0.8489ms 0.5090ms 1.9647 KOps/s 1.9254 KOps/s $\color{#35bf28}+2.04\%$
test_vmap_mlp_speed_decorator[False-False] 1.0313ms 0.5112ms 1.9562 KOps/s 1.9240 KOps/s $\color{#35bf28}+1.68\%$
test_to_module_speed[True] 2.2002ms 1.3547ms 738.1616 Ops/s 775.0455 Ops/s $\color{#d91a1a}-4.76\%$
test_to_module_speed[False] 2.1598ms 1.3127ms 761.7668 Ops/s 785.8471 Ops/s $\color{#d91a1a}-3.06\%$
test_tc_init 90.9410μs 45.0276μs 22.2086 KOps/s 22.0844 KOps/s $\color{#35bf28}+0.56\%$
test_tc_init_nested 0.2269ms 89.8871μs 11.1251 KOps/s 11.0273 KOps/s $\color{#35bf28}+0.89\%$
test_tc_first_layer_tensor 25.4980μs 1.5451μs 647.1888 KOps/s 649.6073 KOps/s $\color{#d91a1a}-0.37\%$
test_tc_first_layer_nontensor 52.3380μs 4.7549μs 210.3092 KOps/s 211.0597 KOps/s $\color{#d91a1a}-0.36\%$
test_tc_second_layer_tensor 29.2640μs 2.8799μs 347.2383 KOps/s 354.0892 KOps/s $\color{#d91a1a}-1.93\%$
test_tc_second_layer_nontensor 26.7700μs 6.0734μs 164.6525 KOps/s 166.2106 KOps/s $\color{#d91a1a}-0.94\%$
test_unbind 0.4674s 15.7573ms 63.4626 Ops/s 74.5779 Ops/s $\textbf{\color{#d91a1a}-14.90\%}$
test_full_like 9.6342ms 7.2572ms 137.7935 Ops/s 142.0512 Ops/s $\color{#d91a1a}-3.00\%$
test_zeros_like 3.8232ms 2.8773ms 347.5506 Ops/s 158.6426 Ops/s $\textbf{\color{#35bf28}+119.08\%}$
test_ones_like 4.5631ms 3.3602ms 297.6037 Ops/s 129.1472 Ops/s $\textbf{\color{#35bf28}+130.44\%}$
test_clone 6.4730ms 5.1791ms 193.0844 Ops/s 107.8055 Ops/s $\textbf{\color{#35bf28}+79.10\%}$
test_squeeze 66.3740μs 12.2899μs 81.3677 KOps/s 76.1606 KOps/s $\textbf{\color{#35bf28}+6.84\%}$
test_unsqueeze 0.1907ms 91.1522μs 10.9707 KOps/s 10.7639 KOps/s $\color{#35bf28}+1.92\%$
test_split 0.5203ms 0.1957ms 5.1099 KOps/s 4.9924 KOps/s $\color{#35bf28}+2.35\%$
test_permute 0.3295ms 0.2226ms 4.4923 KOps/s 4.4525 KOps/s $\color{#35bf28}+0.89\%$
test_stack 32.6168ms 24.8700ms 40.2092 Ops/s 41.0645 Ops/s $\color{#d91a1a}-2.08\%$
test_cat 35.7418ms 25.1663ms 39.7357 Ops/s 41.5359 Ops/s $\color{#d91a1a}-4.33\%$
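
For readers checking the Change column by hand, the values in the table above are consistent with the relative difference between "Ops" and "Ops on Repo HEAD". The bold entries appear to be those whose change exceeds 5% in magnitude, and counting them reproduces the 20 improved / 9 worsened totals. The sketch below is only an illustration under those assumptions: `percent_change`, `classify`, and `THRESHOLD_PCT` are hypothetical names, not the benchmark bot's actual code.

```python
# Hypothetical helpers for reading the benchmark table; not the bot's real code.
THRESHOLD_PCT = 5.0  # assumption: inferred from which rows are bold in the table


def percent_change(ops: float, ops_head: float) -> float:
    """Relative throughput change of the PR versus the repo HEAD, in percent."""
    return (ops / ops_head - 1.0) * 100.0


def classify(change_pct: float, threshold: float = THRESHOLD_PCT) -> str:
    """Label a change as improved, worsened, or unchanged (within the threshold)."""
    if change_pct > threshold:
        return "improved"
    if change_pct < -threshold:
        return "worsened"
    return "unchanged"


# (Ops, Ops on Repo HEAD) pairs copied from three rows of the CPU table, in KOps/s.
rows = {
    "test_plain_set_nested": (47.4441, 46.6907),
    "test_values": (960.0628, 881.5024),
    "test_membership_stacked_nested_last": (165.7338, 247.0787),
}

for name, (ops, ops_head) in rows.items():
    change = percent_change(ops, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Both values in a pair must use the same unit (for example both in KOps/s) for the ratio to be meaningful.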


$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}11$. Worsened: $\large\color{#d91a1a}12$.
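
In these benchmark tables the "Ops" column appears to be the reciprocal of the "Mean" column (operations per second = 1 / mean time), up to rounding of the displayed mean. The following minimal sketch shows that conversion; `parse_duration` and `ops_per_second` are hypothetical helper names, and the unit list is an assumption based on the units that occur in this report.

```python
# Hypothetical helpers; the unit table only covers units seen in this report.
UNIT_SECONDS = {"s": 1.0, "ms": 1e-3, "μs": 1e-6}


def parse_duration(text: str) -> float:
    """Convert a duration such as '12.7625μs' or '0.3301ms' to seconds."""
    # Check the two-character units first, because 'ms' and 'μs' also end in 's'.
    for unit in ("ms", "μs", "s"):
        if text.endswith(unit):
            return float(text[: -len(unit)]) * UNIT_SECONDS[unit]
    raise ValueError(f"unrecognized duration: {text!r}")


def ops_per_second(mean: str) -> float:
    """Throughput implied by a mean time per operation."""
    return 1.0 / parse_duration(mean)


# Mean values copied from the GPU table.
for mean in ("12.7625μs", "0.3301ms", "0.1268s"):
    print(f"{mean} -> {ops_per_second(mean):.4f} Ops/s")
```

Small differences from the reported "Ops" values are expected, since the table shows the mean rounded to a few digits.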

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1286ms 12.7625μs 78.3548 KOps/s 76.5828 KOps/s $\color{#35bf28}+2.31\%$
test_plain_set_stack_nested 37.9010μs 12.9700μs 77.1009 KOps/s 75.8520 KOps/s $\color{#35bf28}+1.65\%$
test_plain_set_nested_inplace 40.3210μs 13.9640μs 71.6128 KOps/s 70.4966 KOps/s $\color{#35bf28}+1.58\%$
test_plain_set_stack_nested_inplace 47.5300μs 13.8491μs 72.2069 KOps/s 70.0093 KOps/s $\color{#35bf28}+3.14\%$
test_items 25.5010μs 2.8682μs 348.6484 KOps/s 344.5602 KOps/s $\color{#35bf28}+1.19\%$
test_items_nested 0.5213ms 0.3301ms 3.0290 KOps/s 3.0830 KOps/s $\color{#d91a1a}-1.75\%$
test_items_nested_locked 0.4536ms 0.3280ms 3.0489 KOps/s 3.0646 KOps/s $\color{#d91a1a}-0.52\%$
test_items_nested_leaf 85.6610μs 55.5284μs 18.0088 KOps/s 17.9577 KOps/s $\color{#35bf28}+0.28\%$
test_items_stack_nested 0.4141ms 0.3266ms 3.0621 KOps/s 3.0631 KOps/s $\color{#d91a1a}-0.03\%$
test_items_stack_nested_leaf 89.2710μs 56.5876μs 17.6717 KOps/s 17.6095 KOps/s $\color{#35bf28}+0.35\%$
test_items_stack_nested_locked 0.5139ms 0.3287ms 3.0424 KOps/s 3.0481 KOps/s $\color{#d91a1a}-0.19\%$
test_keys 29.6100μs 3.4032μs 293.8428 KOps/s 288.9614 KOps/s $\color{#35bf28}+1.69\%$
test_keys_nested 0.2184ms 56.3540μs 17.7450 KOps/s 18.1657 KOps/s $\color{#d91a1a}-2.32\%$
test_keys_nested_locked 2.7546ms 62.2688μs 16.0594 KOps/s 15.9873 KOps/s $\color{#35bf28}+0.45\%$
test_keys_nested_leaf 76.6810μs 47.7350μs 20.9490 KOps/s 20.9864 KOps/s $\color{#d91a1a}-0.18\%$
test_keys_stack_nested 0.1138ms 55.6000μs 17.9856 KOps/s 17.6755 KOps/s $\color{#35bf28}+1.75\%$
test_keys_stack_nested_leaf 84.3120μs 48.0182μs 20.8255 KOps/s 20.9461 KOps/s $\color{#d91a1a}-0.58\%$
test_keys_stack_nested_locked 0.1140ms 61.7513μs 16.1940 KOps/s 16.0424 KOps/s $\color{#35bf28}+0.94\%$
test_values 6.5717μs 0.8587μs 1.1645 MOps/s 1.2047 MOps/s $\color{#d91a1a}-3.33\%$
test_values_nested 67.6910μs 41.0691μs 24.3492 KOps/s 24.5530 KOps/s $\color{#d91a1a}-0.83\%$
test_values_nested_locked 77.2220μs 42.6989μs 23.4198 KOps/s 23.4912 KOps/s $\color{#d91a1a}-0.30\%$
test_values_nested_leaf 76.2610μs 35.5033μs 28.1664 KOps/s 28.3536 KOps/s $\color{#d91a1a}-0.66\%$
test_values_stack_nested 86.6420μs 41.7576μs 23.9477 KOps/s 24.0294 KOps/s $\color{#d91a1a}-0.34\%$
test_values_stack_nested_leaf 83.3410μs 36.0565μs 27.7342 KOps/s 27.8225 KOps/s $\color{#d91a1a}-0.32\%$
test_values_stack_nested_locked 79.4410μs 43.4747μs 23.0019 KOps/s 22.9939 KOps/s $\color{#35bf28}+0.03\%$
test_membership 1.6790μs 0.5056μs 1.9780 MOps/s 1.9822 MOps/s $\color{#d91a1a}-0.21\%$
test_membership_nested 19.1950μs 1.9243μs 519.6673 KOps/s 528.2088 KOps/s $\color{#d91a1a}-1.62\%$
test_membership_nested_leaf 18.0555μs 1.9148μs 522.2351 KOps/s 538.5011 KOps/s $\color{#d91a1a}-3.02\%$
test_membership_stacked_nested 25.5010μs 1.9345μs 516.9227 KOps/s 519.2095 KOps/s $\color{#d91a1a}-0.44\%$
test_membership_stacked_nested_leaf 28.5610μs 1.9366μs 516.3574 KOps/s 514.1185 KOps/s $\color{#35bf28}+0.44\%$
test_membership_nested_last 28.4600μs 2.8185μs 354.8007 KOps/s 358.9195 KOps/s $\color{#d91a1a}-1.15\%$
test_membership_nested_leaf_last 27.9010μs 2.8190μs 354.7315 KOps/s 353.6111 KOps/s $\color{#35bf28}+0.32\%$
test_membership_stacked_nested_last 28.3200μs 3.2351μs 309.1092 KOps/s 363.7873 KOps/s $\textbf{\color{#d91a1a}-15.03\%}$
test_membership_stacked_nested_leaf_last 21.6110μs 3.2462μs 308.0495 KOps/s 358.9866 KOps/s $\textbf{\color{#d91a1a}-14.19\%}$
test_nested_getleaf 32.8010μs 6.1442μs 162.7556 KOps/s 163.3223 KOps/s $\color{#d91a1a}-0.35\%$
test_nested_get 29.6810μs 5.7217μs 174.7739 KOps/s 172.7451 KOps/s $\color{#35bf28}+1.17\%$
test_stacked_getleaf 36.5300μs 6.0992μs 163.9570 KOps/s 164.4275 KOps/s $\color{#d91a1a}-0.29\%$
test_stacked_get 32.0100μs 5.7398μs 174.2207 KOps/s 176.1309 KOps/s $\color{#d91a1a}-1.08\%$
test_nested_getitemleaf 36.2210μs 6.1782μs 161.8601 KOps/s 163.8321 KOps/s $\color{#d91a1a}-1.20\%$
test_nested_getitem 68.6910μs 5.7492μs 173.9387 KOps/s 173.3549 KOps/s $\color{#35bf28}+0.34\%$
test_stacked_getitemleaf 43.7500μs 6.1764μs 161.9065 KOps/s 163.9375 KOps/s $\color{#d91a1a}-1.24\%$
test_stacked_getitem 27.6900μs 5.7715μs 173.2652 KOps/s 173.8925 KOps/s $\color{#d91a1a}-0.36\%$
test_lock_nested 6.9683ms 0.4261ms 2.3470 KOps/s 2.3705 KOps/s $\color{#d91a1a}-0.99\%$
test_lock_stack_nested 0.5498ms 0.3872ms 2.5829 KOps/s 2.6147 KOps/s $\color{#d91a1a}-1.21\%$
test_unlock_nested 0.7920ms 0.3566ms 2.8044 KOps/s 2.8009 KOps/s $\color{#35bf28}+0.13\%$
test_unlock_stack_nested 0.4600ms 0.3235ms 3.0914 KOps/s 3.0891 KOps/s $\color{#35bf28}+0.07\%$
test_flatten_speed 0.2487ms 69.7732μs 14.3322 KOps/s 14.3791 KOps/s $\color{#d91a1a}-0.33\%$
test_unflatten_speed 0.3137ms 0.2859ms 3.4983 KOps/s 3.5505 KOps/s $\color{#d91a1a}-1.47\%$
test_common_ops 1.5475ms 1.2104ms 826.1746 Ops/s 820.3379 Ops/s $\color{#35bf28}+0.71\%$
test_creation 20.7400μs 1.4817μs 674.9195 KOps/s 666.9297 KOps/s $\color{#35bf28}+1.20\%$
test_creation_empty 38.5310μs 13.5570μs 73.7628 KOps/s 70.7430 KOps/s $\color{#35bf28}+4.27\%$
test_creation_nested_1 56.3010μs 15.4678μs 64.6502 KOps/s 63.0168 KOps/s $\color{#35bf28}+2.59\%$
test_creation_nested_2 49.9310μs 17.8128μs 56.1394 KOps/s 54.3670 KOps/s $\color{#35bf28}+3.26\%$
test_clone 0.1810ms 29.5631μs 33.8260 KOps/s 33.5992 KOps/s $\color{#35bf28}+0.67\%$
test_getitem[int] 1.3526ms 15.8946μs 62.9145 KOps/s 61.0984 KOps/s $\color{#35bf28}+2.97\%$
test_getitem[slice_int] 0.1172ms 27.6376μs 36.1826 KOps/s 35.4290 KOps/s $\color{#35bf28}+2.13\%$
test_getitem[range] 0.1497ms 0.1089ms 9.1806 KOps/s 9.0574 KOps/s $\color{#35bf28}+1.36\%$
test_getitem[tuple] 0.2056ms 23.8775μs 41.8804 KOps/s 43.4377 KOps/s $\color{#d91a1a}-3.59\%$
test_getitem[list] 0.3029ms 0.1060ms 9.4371 KOps/s 10.0031 KOps/s $\textbf{\color{#d91a1a}-5.66\%}$
test_setitem_dim[int] 72.7710μs 47.6093μs 21.0043 KOps/s 22.1693 KOps/s $\textbf{\color{#d91a1a}-5.25\%}$
test_setitem_dim[slice_int] 0.1855ms 67.2493μs 14.8700 KOps/s 14.9374 KOps/s $\color{#d91a1a}-0.45\%$
test_setitem_dim[range] 0.2734ms 0.1277ms 7.8295 KOps/s 7.8017 KOps/s $\color{#35bf28}+0.36\%$
test_setitem_dim[tuple] 0.2052ms 64.4395μs 15.5184 KOps/s 16.5916 KOps/s $\textbf{\color{#d91a1a}-6.47\%}$
test_setitem 0.1916ms 41.3618μs 24.1769 KOps/s 24.5763 KOps/s $\color{#d91a1a}-1.63\%$
test_set 0.1901ms 40.2284μs 24.8580 KOps/s 25.5278 KOps/s $\color{#d91a1a}-2.62\%$
test_set_shared 0.3668ms 51.3666μs 19.4679 KOps/s 19.6755 KOps/s $\color{#d91a1a}-1.05\%$
test_update 0.1999ms 47.9866μs 20.8391 KOps/s 21.3699 KOps/s $\color{#d91a1a}-2.48\%$
test_update_nested 0.2126ms 55.0796μs 18.1555 KOps/s 18.4356 KOps/s $\color{#d91a1a}-1.52\%$
test_update__nested 0.2074ms 60.7745μs 16.4543 KOps/s 16.8433 KOps/s $\color{#d91a1a}-2.31\%$
test_set_nested 0.1955ms 42.9263μs 23.2958 KOps/s 22.5353 KOps/s $\color{#35bf28}+3.37\%$
test_set_nested_new 0.1952ms 46.5629μs 21.4763 KOps/s 20.4660 KOps/s $\color{#35bf28}+4.94\%$
test_select 0.2109ms 59.4897μs 16.8096 KOps/s 15.9099 KOps/s $\textbf{\color{#35bf28}+5.66\%}$
test_select_nested 77.9710μs 41.5088μs 24.0913 KOps/s 23.8746 KOps/s $\color{#35bf28}+0.91\%$
test_exclude_nested 99.1610μs 59.3828μs 16.8399 KOps/s 17.0295 KOps/s $\color{#d91a1a}-1.11\%$
test_empty[True] 0.2764ms 0.2474ms 4.0426 KOps/s 4.0990 KOps/s $\color{#d91a1a}-1.38\%$
test_empty[False] 4.0060μs 0.7462μs 1.3402 MOps/s 1.3569 MOps/s $\color{#d91a1a}-1.23\%$
test_to 58.9210μs 25.2947μs 39.5340 KOps/s 39.5282 KOps/s $\color{#35bf28}+0.01\%$
test_to_nonblocking 0.1415ms 25.3520μs 39.4447 KOps/s 40.9265 KOps/s $\color{#d91a1a}-3.62\%$
test_unbind_speed 1.4193ms 0.2798ms 3.5737 KOps/s 3.6215 KOps/s $\color{#d91a1a}-1.32\%$
test_unbind_speed_stack0 0.3711ms 0.2776ms 3.6019 KOps/s 3.5831 KOps/s $\color{#35bf28}+0.53\%$
test_unbind_speed_stack1 91.3830ms 0.7168ms 1.3951 KOps/s 1.4095 KOps/s $\color{#d91a1a}-1.02\%$
test_split 93.6847ms 2.1886ms 456.9106 Ops/s 456.4227 Ops/s $\color{#35bf28}+0.11\%$
test_chunk 93.4963ms 2.1888ms 456.8763 Ops/s 455.0309 Ops/s $\color{#35bf28}+0.41\%$
test_creation[device0] 0.3936ms 0.1271ms 7.8693 KOps/s 7.8845 KOps/s $\color{#d91a1a}-0.19\%$
test_creation_from_tensor 0.3967ms 0.1292ms 7.7380 KOps/s 7.7267 KOps/s $\color{#35bf28}+0.15\%$
test_add_one[memmap_tensor0] 0.2804ms 9.1360μs 109.4571 KOps/s 109.7080 KOps/s $\color{#d91a1a}-0.23\%$
test_contiguous[memmap_tensor0] 29.0200μs 2.2135μs 451.7675 KOps/s 461.7786 KOps/s $\color{#d91a1a}-2.17\%$
test_stack[memmap_tensor0] 44.3000μs 7.0759μs 141.3248 KOps/s 148.1735 KOps/s $\color{#d91a1a}-4.62\%$
test_memmaptd_index 1.1793ms 0.4284ms 2.3341 KOps/s 2.3856 KOps/s $\color{#d91a1a}-2.16\%$
test_memmaptd_index_astensor 0.9708ms 0.4860ms 2.0575 KOps/s 2.0847 KOps/s $\color{#d91a1a}-1.30\%$
test_memmaptd_index_op 1.4112ms 1.0082ms 991.8192 Ops/s 988.4657 Ops/s $\color{#35bf28}+0.34\%$
test_serialize_model 0.1280s 0.1268s 7.8876 Ops/s 7.8731 Ops/s $\color{#35bf28}+0.18\%$
test_serialize_model_pickle 1.3491s 1.2125s 0.8248 Ops/s 0.8246 Ops/s $\color{#35bf28}+0.02\%$
test_serialize_weights 0.2207s 0.1393s 7.1785 Ops/s 7.1684 Ops/s $\color{#35bf28}+0.14\%$
test_serialize_weights_returnearly 0.2295s 56.6358ms 17.6567 Ops/s 17.9976 Ops/s $\color{#d91a1a}-1.89\%$
test_serialize_weights_pickle 1.3664s 1.2150s 0.8230 Ops/s 0.8213 Ops/s $\color{#35bf28}+0.21\%$
test_reshape_pytree 0.1116ms 35.9049μs 27.8514 KOps/s 27.5642 KOps/s $\color{#35bf28}+1.04\%$
test_reshape_td 0.1024ms 41.0636μs 24.3525 KOps/s 23.3519 KOps/s $\color{#35bf28}+4.28\%$
test_view_pytree 0.1498ms 35.0650μs 28.5185 KOps/s 27.8001 KOps/s $\color{#35bf28}+2.58\%$
test_view_td 0.1681ms 46.9855μs 21.2832 KOps/s 21.0622 KOps/s $\color{#35bf28}+1.05\%$
test_unbind_pytree 0.1605ms 34.6961μs 28.8217 KOps/s 28.6550 KOps/s $\color{#35bf28}+0.58\%$
test_unbind_td 0.5323ms 42.5189μs 23.5190 KOps/s 23.2270 KOps/s $\color{#35bf28}+1.26\%$
test_split_pytree 0.1378ms 46.5618μs 21.4768 KOps/s 21.4188 KOps/s $\color{#35bf28}+0.27\%$
test_split_td 0.6977ms 61.5678μs 16.2423 KOps/s 17.5389 KOps/s $\textbf{\color{#d91a1a}-7.39\%}$
test_add_pytree 0.2320ms 60.7175μs 16.4697 KOps/s 17.8341 KOps/s $\textbf{\color{#d91a1a}-7.65\%}$
test_add_td 0.2610ms 93.5254μs 10.6923 KOps/s 10.8331 KOps/s $\color{#d91a1a}-1.30\%$
test_compile_add_one_nested[tensordict-compile] 0.4076ms 0.2063ms 4.8480 KOps/s 4.7393 KOps/s $\color{#35bf28}+2.30\%$
test_compile_add_one_nested[tensordict-eager] 0.5246ms 0.1505ms 6.6464 KOps/s 6.5964 KOps/s $\color{#35bf28}+0.76\%$
test_compile_add_one_nested[pytree-compile] 0.3116ms 0.1465ms 6.8263 KOps/s 6.9222 KOps/s $\color{#d91a1a}-1.39\%$
test_compile_add_one_nested[pytree-eager] 0.5794ms 0.1939ms 5.1562 KOps/s 5.5019 KOps/s $\textbf{\color{#d91a1a}-6.28\%}$
test_compile_copy_nested[tensordict-compile] 0.3921ms 22.7889μs 43.8809 KOps/s 46.9284 KOps/s $\textbf{\color{#d91a1a}-6.49\%}$
test_compile_copy_nested[tensordict-eager] 0.4134ms 43.9875μs 22.7337 KOps/s 22.6697 KOps/s $\color{#35bf28}+0.28\%$
test_compile_copy_nested[pytree-compile] 0.4518ms 64.6529μs 15.4672 KOps/s 15.2483 KOps/s $\color{#35bf28}+1.44\%$
test_compile_copy_nested[pytree-eager] 0.4264ms 49.0788μs 20.3754 KOps/s 20.1925 KOps/s $\color{#35bf28}+0.91\%$
test_compile_add_one_flat[tensordict-compile] 0.7050ms 0.3190ms 3.1345 KOps/s 3.1470 KOps/s $\color{#d91a1a}-0.40\%$
test_compile_add_one_flat[tensordict-eager] 0.3736ms 0.2077ms 4.8153 KOps/s 4.7422 KOps/s $\color{#35bf28}+1.54\%$
test_compile_add_one_flat[tensorclass-compile] 0.2821ms 0.1324ms 7.5543 KOps/s 7.8866 KOps/s $\color{#d91a1a}-4.21\%$
test_compile_add_one_flat[tensorclass-eager] 0.4549ms 63.1250μs 15.8416 KOps/s 16.2895 KOps/s $\color{#d91a1a}-2.75\%$
test_compile_add_one_flat[pytree-compile] 0.4807ms 0.3210ms 3.1157 KOps/s 3.1236 KOps/s $\color{#d91a1a}-0.25\%$
test_compile_add_one_flat[pytree-eager] 0.9945ms 0.6256ms 1.5984 KOps/s 1.6352 KOps/s $\color{#d91a1a}-2.25\%$
test_compile_add_self_flat[tensordict-eager] 0.6225ms 0.2485ms 4.0238 KOps/s 3.9823 KOps/s $\color{#35bf28}+1.04\%$
test_compile_add_self_flat[tensordict-compile] 0.4458ms 0.3181ms 3.1439 KOps/s 3.1184 KOps/s $\color{#35bf28}+0.82\%$
test_compile_add_self_flat[tensorclass-eager] 0.2288ms 73.4405μs 13.6165 KOps/s 14.1843 KOps/s $\color{#d91a1a}-4.00\%$
test_compile_add_self_flat[tensorclass-compile] 0.3027ms 0.1334ms 7.4946 KOps/s 7.7917 KOps/s $\color{#d91a1a}-3.81\%$
test_compile_add_self_flat[pytree-eager] 0.6978ms 0.5386ms 1.8568 KOps/s 1.9013 KOps/s $\color{#d91a1a}-2.34\%$
test_compile_add_self_flat[pytree-compile] 0.4686ms 0.3157ms 3.1676 KOps/s 3.1443 KOps/s $\color{#35bf28}+0.74\%$
test_compile_copy_flat[tensordict-compile] 0.1684ms 19.8244μs 50.4429 KOps/s 54.6096 KOps/s $\textbf{\color{#d91a1a}-7.63\%}$
test_compile_copy_flat[tensordict-eager] 0.2091ms 27.4143μs 36.4773 KOps/s 36.7782 KOps/s $\color{#d91a1a}-0.82\%$
test_compile_copy_flat[pytree-compile] 0.1340ms 70.2071μs 14.2436 KOps/s 13.9916 KOps/s $\color{#35bf28}+1.80\%$
test_compile_copy_flat[pytree-eager] 87.3010μs 50.9386μs 19.6315 KOps/s 19.3833 KOps/s $\color{#35bf28}+1.28\%$
test_compile_assign_and_add[tensordict-compile] 2.5028ms 0.8466ms 1.1812 KOps/s 1.1102 KOps/s $\textbf{\color{#35bf28}+6.39\%}$
test_compile_assign_and_add[tensordict-eager] 3.6880ms 3.3489ms 298.6032 Ops/s 300.0491 Ops/s $\color{#d91a1a}-0.48\%$
test_compile_assign_and_add[pytree-compile] 2.3852ms 0.8247ms 1.2125 KOps/s 1.1266 KOps/s $\textbf{\color{#35bf28}+7.63\%}$
test_compile_assign_and_add[pytree-eager] 3.6549ms 3.3706ms 296.6859 Ops/s 309.3098 Ops/s $\color{#d91a1a}-4.08\%$
test_compile_indexing[tensor-tensordict-compile] 0.2984ms 0.1123ms 8.9057 KOps/s 9.2738 KOps/s $\color{#d91a1a}-3.97\%$
test_compile_indexing[tensor-tensordict-eager] 0.2421ms 63.2192μs 15.8180 KOps/s 16.5858 KOps/s $\color{#d91a1a}-4.63\%$
test_compile_indexing[tensor-tensorclass-compile] 0.3305ms 0.1034ms 9.6667 KOps/s 9.3594 KOps/s $\color{#35bf28}+3.28\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2152ms 43.9778μs 22.7387 KOps/s 21.5573 KOps/s $\textbf{\color{#35bf28}+5.48\%}$
test_compile_indexing[tensor-pytree-compile] 0.2707ms 0.1040ms 9.6150 KOps/s 9.1681 KOps/s $\color{#35bf28}+4.87\%$
test_compile_indexing[tensor-pytree-eager] 0.1990ms 43.1312μs 23.1851 KOps/s 21.5074 KOps/s $\textbf{\color{#35bf28}+7.80\%}$
test_compile_indexing[slice-tensordict-compile] 0.3040ms 0.1389ms 7.1992 KOps/s 7.3074 KOps/s $\color{#d91a1a}-1.48\%$
test_compile_indexing[slice-tensordict-eager] 0.1580ms 25.2336μs 39.6297 KOps/s 38.3568 KOps/s $\color{#35bf28}+3.32\%$
test_compile_indexing[slice-tensorclass-compile] 0.2807ms 0.1310ms 7.6321 KOps/s 7.6611 KOps/s $\color{#d91a1a}-0.38\%$
test_compile_indexing[slice-tensorclass-eager] 0.1405ms 21.0779μs 47.4431 KOps/s 46.9924 KOps/s $\color{#35bf28}+0.96\%$
test_compile_indexing[slice-pytree-compile] 0.2839ms 0.1320ms 7.5755 KOps/s 7.6084 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_indexing[slice-pytree-eager] 59.2210μs 21.0670μs 47.4676 KOps/s 47.6136 KOps/s $\color{#d91a1a}-0.31\%$
test_compile_indexing[int-tensordict-compile] 0.2837ms 0.1388ms 7.2029 KOps/s 7.2561 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_indexing[int-tensordict-eager] 0.5028ms 29.6276μs 33.7523 KOps/s 39.4203 KOps/s $\textbf{\color{#d91a1a}-14.38\%}$
test_compile_indexing[int-tensorclass-compile] 0.2630ms 0.1316ms 7.5973 KOps/s 7.3383 KOps/s $\color{#35bf28}+3.53\%$
test_compile_indexing[int-tensorclass-eager] 0.1027ms 21.0089μs 47.5988 KOps/s 47.6253 KOps/s $\color{#d91a1a}-0.06\%$
test_compile_indexing[int-pytree-compile] 0.2807ms 0.1322ms 7.5618 KOps/s 7.4011 KOps/s $\color{#35bf28}+2.17\%$
test_compile_indexing[int-pytree-eager] 92.3320μs 21.2586μs 47.0397 KOps/s 34.8762 KOps/s $\textbf{\color{#35bf28}+34.88\%}$
test_mod_add[eager] 0.1844ms 30.3714μs 32.9257 KOps/s 30.2620 KOps/s $\textbf{\color{#35bf28}+8.80\%}$
test_mod_add[compile] 0.3575ms 71.8776μs 13.9125 KOps/s 14.1843 KOps/s $\color{#d91a1a}-1.92\%$
test_mod_add[compile-overhead] 0.2669ms 0.1365ms 7.3279 KOps/s 7.1598 KOps/s $\color{#35bf28}+2.35\%$
test_mod_wrap[eager] 0.4047ms 0.2409ms 4.1509 KOps/s 4.1207 KOps/s $\color{#35bf28}+0.73\%$
test_mod_wrap[compile] 1.4886ms 0.2964ms 3.3740 KOps/s 3.2361 KOps/s $\color{#35bf28}+4.26\%$
test_mod_wrap[compile-overhead] 7.3649ms 3.9941ms 250.3716 Ops/s 252.0113 Ops/s $\color{#d91a1a}-0.65\%$
test_mod_wrap_and_backward[eager] 1.7159ms 1.3523ms 739.4739 Ops/s 693.6537 Ops/s $\textbf{\color{#35bf28}+6.61\%}$
test_mod_wrap_and_backward[compile] 1.5702ms 1.3442ms 743.9310 Ops/s 696.3805 Ops/s $\textbf{\color{#35bf28}+6.83\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3286ms 0.9123ms 1.0962 KOps/s 984.9214 Ops/s $\textbf{\color{#35bf28}+11.30\%}$
test_seq_add[eager] 0.5102ms 93.1845μs 10.7314 KOps/s 10.3424 KOps/s $\color{#35bf28}+3.76\%$
test_seq_add[compile] 0.4520ms 81.5034μs 12.2694 KOps/s 12.4276 KOps/s $\color{#d91a1a}-1.27\%$
test_seq_add[compile-overhead] 0.2645ms 0.1136ms 8.8047 KOps/s 8.7518 KOps/s $\color{#35bf28}+0.60\%$
test_seq_wrap[eager] 0.5176ms 0.3669ms 2.7252 KOps/s 2.6220 KOps/s $\color{#35bf28}+3.94\%$
test_seq_wrap[compile] 0.4400ms 0.3150ms 3.1747 KOps/s 3.1632 KOps/s $\color{#35bf28}+0.37\%$
test_seq_wrap[compile-overhead] 0.3840ms 0.2190ms 4.5661 KOps/s 4.5304 KOps/s $\color{#35bf28}+0.79\%$
test_func_call_runtime[False-eager] 0.8915ms 0.7401ms 1.3513 KOps/s 1.3450 KOps/s $\color{#35bf28}+0.47\%$
test_func_call_runtime[False-compile] 0.9608ms 0.7988ms 1.2519 KOps/s 1.2568 KOps/s $\color{#d91a1a}-0.39\%$
test_func_call_runtime[False-compile-overhead] 0.5065ms 0.3586ms 2.7883 KOps/s 2.7800 KOps/s $\color{#35bf28}+0.30\%$
test_func_call_runtime[True-eager] 1.0507ms 0.9034ms 1.1070 KOps/s 1.1058 KOps/s $\color{#35bf28}+0.10\%$
test_func_call_runtime[True-compile] 0.9770ms 0.8222ms 1.2163 KOps/s 1.2283 KOps/s $\color{#d91a1a}-0.98\%$
test_func_call_runtime[True-compile-overhead] 0.5261ms 0.3793ms 2.6365 KOps/s 2.6234 KOps/s $\color{#35bf28}+0.50\%$
test_func_call_cm_runtime[False-eager] 0.9564ms 0.7560ms 1.3228 KOps/s 1.3315 KOps/s $\color{#d91a1a}-0.65\%$
test_func_call_cm_runtime[False-compile] 0.9594ms 0.8003ms 1.2495 KOps/s 1.2618 KOps/s $\color{#d91a1a}-0.97\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5165ms 0.3635ms 2.7507 KOps/s 2.7577 KOps/s $\color{#d91a1a}-0.25\%$
test_func_call_cm_runtime[True-eager] 1.1646ms 0.9913ms 1.0088 KOps/s 996.8798 Ops/s $\color{#35bf28}+1.20\%$
test_func_call_cm_runtime[True-compile] 1.0331ms 0.8465ms 1.1813 KOps/s 1.1919 KOps/s $\color{#d91a1a}-0.88\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5868ms 0.4060ms 2.4632 KOps/s 2.4468 KOps/s $\color{#35bf28}+0.67\%$
test_vmap_func_call_cm_runtime[eager] 2.5483ms 2.0612ms 485.1530 Ops/s 483.8035 Ops/s $\color{#35bf28}+0.28\%$
test_vmap_func_call_cm_runtime[compile] 1.0234ms 0.8597ms 1.1632 KOps/s 1.1747 KOps/s $\color{#d91a1a}-0.97\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5599ms 0.4089ms 2.4454 KOps/s 2.4494 KOps/s $\color{#d91a1a}-0.17\%$
test_distributed 6.8135ms 0.2890ms 3.4603 KOps/s 8.9565 KOps/s $\textbf{\color{#d91a1a}-61.37\%}$
test_tdmodule 47.7710μs 13.8464μs 72.2208 KOps/s 69.2176 KOps/s $\color{#35bf28}+4.34\%$
test_tdmodule_dispatch 48.7810μs 26.3581μs 37.9390 KOps/s 36.8716 KOps/s $\color{#35bf28}+2.89\%$
test_tdseq 35.8010μs 14.3293μs 69.7870 KOps/s 68.3529 KOps/s $\color{#35bf28}+2.10\%$
test_tdseq_dispatch 39.3600μs 29.6644μs 33.7105 KOps/s 34.0746 KOps/s $\color{#d91a1a}-1.07\%$
test_instantiation_functorch 2.1142ms 1.8848ms 530.5512 Ops/s 533.3443 Ops/s $\color{#d91a1a}-0.52\%$
test_instantiation_td 1.8157ms 1.2010ms 832.6223 Ops/s 834.1334 Ops/s $\color{#d91a1a}-0.18\%$
test_exec_functorch 0.3424ms 0.2078ms 4.8113 KOps/s 4.6537 KOps/s $\color{#35bf28}+3.39\%$
test_exec_functional_call 0.3962ms 0.2213ms 4.5189 KOps/s 4.4543 KOps/s $\color{#35bf28}+1.45\%$
test_exec_td 0.4068ms 0.2274ms 4.3967 KOps/s 4.3576 KOps/s $\color{#35bf28}+0.90\%$
test_exec_td_decorator 0.6324ms 0.2713ms 3.6864 KOps/s 3.6562 KOps/s $\color{#35bf28}+0.82\%$
test_vmap_mlp_speed[True-True] 0.8762ms 0.6962ms 1.4363 KOps/s 1.4741 KOps/s $\color{#d91a1a}-2.56\%$
test_vmap_mlp_speed[True-False] 0.8249ms 0.6734ms 1.4851 KOps/s 1.4808 KOps/s $\color{#35bf28}+0.29\%$
test_vmap_mlp_speed[False-True] 0.7187ms 0.5720ms 1.7484 KOps/s 1.7496 KOps/s $\color{#d91a1a}-0.07\%$
test_vmap_mlp_speed[False-False] 0.7260ms 0.5726ms 1.7464 KOps/s 1.7345 KOps/s $\color{#35bf28}+0.68\%$
test_vmap_mlp_speed_decorator[True-True] 1.3867ms 0.6626ms 1.5092 KOps/s 1.5027 KOps/s $\color{#35bf28}+0.44\%$
test_vmap_mlp_speed_decorator[True-False] 0.8078ms 0.6618ms 1.5110 KOps/s 1.4979 KOps/s $\color{#35bf28}+0.88\%$
test_vmap_mlp_speed_decorator[False-True] 0.7760ms 0.5879ms 1.7009 KOps/s 1.7068 KOps/s $\color{#d91a1a}-0.35\%$
test_vmap_mlp_speed_decorator[False-False] 0.7523ms 0.5880ms 1.7006 KOps/s 1.7007 KOps/s $-0.01\%$
test_vmap_transformer_speed[True-True] 9.1468ms 8.3249ms 120.1220 Ops/s 118.9209 Ops/s $\color{#35bf28}+1.01\%$
test_vmap_transformer_speed[True-False] 8.4629ms 8.2817ms 120.7483 Ops/s 120.1086 Ops/s $\color{#35bf28}+0.53\%$
test_vmap_transformer_speed[False-True] 8.2762ms 8.1124ms 123.2687 Ops/s 122.2292 Ops/s $\color{#35bf28}+0.85\%$
test_vmap_transformer_speed[False-False] 8.2934ms 8.1275ms 123.0389 Ops/s 122.7432 Ops/s $\color{#35bf28}+0.24\%$
test_vmap_transformer_speed_decorator[True-True] 20.1507ms 19.3451ms 51.6927 Ops/s 51.5207 Ops/s $\color{#35bf28}+0.33\%$
test_vmap_transformer_speed_decorator[True-False] 19.5420ms 19.3895ms 51.5742 Ops/s 51.3087 Ops/s $\color{#35bf28}+0.52\%$
test_vmap_transformer_speed_decorator[False-True] 19.4028ms 19.2594ms 51.9226 Ops/s 51.7887 Ops/s $\color{#35bf28}+0.26\%$
test_vmap_transformer_speed_decorator[False-False] 19.3646ms 19.2311ms 51.9990 Ops/s 51.7338 Ops/s $\color{#35bf28}+0.51\%$
test_to_module_speed[True] 2.0102ms 0.9608ms 1.0408 KOps/s 1.0629 KOps/s $\color{#d91a1a}-2.08\%$
test_to_module_speed[False] 1.1019ms 0.9288ms 1.0766 KOps/s 1.0837 KOps/s $\color{#d91a1a}-0.65\%$
test_tc_init 63.2110μs 31.6787μs 31.5670 KOps/s 30.6401 KOps/s $\color{#35bf28}+3.03\%$
test_tc_init_nested 0.1094ms 64.7385μs 15.4467 KOps/s 14.5635 KOps/s $\textbf{\color{#35bf28}+6.06\%}$
test_tc_first_layer_tensor 10.3831μs 0.6792μs 1.4724 MOps/s 1.4703 MOps/s $\color{#35bf28}+0.14\%$
test_tc_first_layer_nontensor 25.2400μs 2.2505μs 444.3392 KOps/s 443.4587 KOps/s $\color{#35bf28}+0.20\%$
test_tc_second_layer_tensor 48.2680μs 1.3626μs 733.8867 KOps/s 734.4047 KOps/s $\color{#d91a1a}-0.07\%$
test_tc_second_layer_nontensor 33.3800μs 2.9592μs 337.9290 KOps/s 342.0348 KOps/s $\color{#d91a1a}-1.20\%$
test_unbind 0.1967s 10.8776ms 91.9318 Ops/s 93.8256 Ops/s $\color{#d91a1a}-2.02\%$
test_full_like 0.7645ms 0.5768ms 1.7337 KOps/s 1.7394 KOps/s $\color{#d91a1a}-0.33\%$
test_zeros_like 0.3444ms 0.1981ms 5.0469 KOps/s 5.0475 KOps/s $\color{#d91a1a}-0.01\%$
test_ones_like 0.3427ms 0.1980ms 5.0499 KOps/s 5.0511 KOps/s $\color{#d91a1a}-0.02\%$
test_clone 0.5635ms 0.4147ms 2.4114 KOps/s 2.4095 KOps/s $\color{#35bf28}+0.08\%$
test_squeeze 0.1723ms 9.9666μs 100.3351 KOps/s 100.0128 KOps/s $\color{#35bf28}+0.32\%$
test_unsqueeze 0.2575ms 76.2366μs 13.1171 KOps/s 13.1484 KOps/s $\color{#d91a1a}-0.24\%$
test_split 0.4377ms 0.1602ms 6.2434 KOps/s 6.2411 KOps/s $\color{#35bf28}+0.04\%$
test_permute 0.3795ms 0.1865ms 5.3610 KOps/s 5.5610 KOps/s $\color{#d91a1a}-3.60\%$
test_stack 1.3847ms 0.8557ms 1.1687 KOps/s 1.1402 KOps/s $\color{#35bf28}+2.50\%$
test_cat 1.3498ms 1.2318ms 811.8486 Ops/s 811.7793 Ops/s $+0.01\%$

@vmoens vmoens added the Quality label Oct 1, 2024
@vmoens vmoens merged commit 4eff3e4 into main Oct 1, 2024
53 of 57 checks passed
@vmoens vmoens deleted the fix-pyre branch October 1, 2024 07:01