Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Quality] Better use of StrEnum in set_interaction_type #1087

Merged
merged 1 commit into from
Nov 13, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 13, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 13, 2024
ghstack-source-id: c91a7a6be513fb46be6914df0b3bde779fa5528f
Pull Request resolved: #1087
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 13, 2024
@vmoens vmoens merged commit ee3cd74 into gh/vmoens/35/base Nov 13, 2024
19 of 33 checks passed
vmoens added a commit that referenced this pull request Nov 13, 2024
ghstack-source-id: c91a7a6be513fb46be6914df0b3bde779fa5528f
Pull Request resolved: #1087
@vmoens vmoens deleted the gh/vmoens/35/head branch November 13, 2024 18:47
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}10$. Worsened: $\large\color{#d91a1a}41$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 69.4200μs 18.5043μs 54.0414 KOps/s 61.3913 KOps/s $\textbf{\color{#d91a1a}-11.97\%}$
test_plain_set_stack_nested 56.8160μs 18.2389μs 54.8278 KOps/s 59.5321 KOps/s $\textbf{\color{#d91a1a}-7.90\%}$
test_plain_set_nested_inplace 70.7920μs 20.1252μs 49.6890 KOps/s 55.4136 KOps/s $\textbf{\color{#d91a1a}-10.33\%}$
test_plain_set_stack_nested_inplace 72.8160μs 20.3519μs 49.1354 KOps/s 55.2703 KOps/s $\textbf{\color{#d91a1a}-11.10\%}$
test_items 22.8630μs 4.0736μs 245.4833 KOps/s 240.4326 KOps/s $\color{#35bf28}+2.10\%$
test_items_nested 0.6099ms 0.3438ms 2.9089 KOps/s 2.9061 KOps/s $\color{#35bf28}+0.10\%$
test_items_nested_locked 0.7060ms 0.3441ms 2.9060 KOps/s 2.8820 KOps/s $\color{#35bf28}+0.83\%$
test_items_nested_leaf 0.1296ms 70.8181μs 14.1207 KOps/s 13.9680 KOps/s $\color{#35bf28}+1.09\%$
test_items_stack_nested 0.7075ms 0.3471ms 2.8806 KOps/s 2.8770 KOps/s $\color{#35bf28}+0.13\%$
test_items_stack_nested_leaf 0.1356ms 71.6340μs 13.9598 KOps/s 13.5257 KOps/s $\color{#35bf28}+3.21\%$
test_items_stack_nested_locked 0.6545ms 0.3437ms 2.9092 KOps/s 2.8825 KOps/s $\color{#35bf28}+0.92\%$
test_keys 21.3600μs 3.4983μs 285.8515 KOps/s 286.3059 KOps/s $\color{#d91a1a}-0.16\%$
test_keys_nested 0.2248ms 0.1368ms 7.3124 KOps/s 7.3575 KOps/s $\color{#d91a1a}-0.61\%$
test_keys_nested_locked 1.6865ms 0.1396ms 7.1634 KOps/s 7.1183 KOps/s $\color{#35bf28}+0.63\%$
test_keys_nested_leaf 0.1781ms 0.1145ms 8.7329 KOps/s 8.5039 KOps/s $\color{#35bf28}+2.69\%$
test_keys_stack_nested 0.2934ms 0.1339ms 7.4662 KOps/s 7.3979 KOps/s $\color{#35bf28}+0.92\%$
test_keys_stack_nested_leaf 0.2370ms 0.1156ms 8.6528 KOps/s 8.6812 KOps/s $\color{#d91a1a}-0.33\%$
test_keys_stack_nested_locked 0.2511ms 0.1390ms 7.1960 KOps/s 7.1719 KOps/s $\color{#35bf28}+0.34\%$
test_values 4.7930μs 1.0406μs 961.0133 KOps/s 949.2138 KOps/s $\color{#35bf28}+1.24\%$
test_values_nested 0.1639ms 53.3917μs 18.7295 KOps/s 17.9216 KOps/s $\color{#35bf28}+4.51\%$
test_values_nested_locked 0.1042ms 53.9652μs 18.5305 KOps/s 16.2071 KOps/s $\textbf{\color{#35bf28}+14.34\%}$
test_values_nested_leaf 0.1437ms 59.4806μs 16.8122 KOps/s 16.4956 KOps/s $\color{#35bf28}+1.92\%$
test_values_stack_nested 0.1121ms 54.9260μs 18.2063 KOps/s 17.3581 KOps/s $\color{#35bf28}+4.89\%$
test_values_stack_nested_leaf 0.1161ms 59.4014μs 16.8346 KOps/s 16.5827 KOps/s $\color{#35bf28}+1.52\%$
test_values_stack_nested_locked 0.1082ms 54.1999μs 18.4502 KOps/s 17.4159 KOps/s $\textbf{\color{#35bf28}+5.94\%}$
test_membership 5.3386μs 0.7265μs 1.3764 MOps/s 1.1314 MOps/s $\textbf{\color{#35bf28}+21.65\%}$
test_membership_nested 19.8260μs 2.7139μs 368.4706 KOps/s 358.7783 KOps/s $\color{#35bf28}+2.70\%$
test_membership_nested_leaf 39.6840μs 2.7138μs 368.4852 KOps/s 361.6472 KOps/s $\color{#35bf28}+1.89\%$
test_membership_stacked_nested 21.4100μs 2.6868μs 372.1891 KOps/s 368.1639 KOps/s $\color{#35bf28}+1.09\%$
test_membership_stacked_nested_leaf 28.8030μs 2.7015μs 370.1666 KOps/s 361.4936 KOps/s $\color{#35bf28}+2.40\%$
test_membership_nested_last 60.0720μs 4.0000μs 249.9974 KOps/s 242.7534 KOps/s $\color{#35bf28}+2.98\%$
test_membership_nested_leaf_last 26.6800μs 3.9935μs 250.4074 KOps/s 244.4852 KOps/s $\color{#35bf28}+2.42\%$
test_membership_stacked_nested_last 23.9150μs 3.9860μs 250.8796 KOps/s 137.8739 KOps/s $\textbf{\color{#35bf28}+81.96\%}$
test_membership_stacked_nested_leaf_last 25.4680μs 4.0570μs 246.4892 KOps/s 140.5604 KOps/s $\textbf{\color{#35bf28}+75.36\%}$
test_nested_getleaf 46.9270μs 10.4404μs 95.7821 KOps/s 94.3164 KOps/s $\color{#35bf28}+1.55\%$
test_nested_get 36.0270μs 9.7941μs 102.1028 KOps/s 99.4722 KOps/s $\color{#35bf28}+2.64\%$
test_stacked_getleaf 37.7200μs 10.3890μs 96.2559 KOps/s 94.9059 KOps/s $\color{#35bf28}+1.42\%$
test_stacked_get 47.1780μs 9.7106μs 102.9802 KOps/s 100.2969 KOps/s $\color{#35bf28}+2.68\%$
test_nested_getitemleaf 46.1460μs 10.7313μs 93.1857 KOps/s 91.7030 KOps/s $\color{#35bf28}+1.62\%$
test_nested_getitem 45.8960μs 10.1908μs 98.1281 KOps/s 98.2984 KOps/s $\color{#d91a1a}-0.17\%$
test_stacked_getitemleaf 36.4480μs 10.8796μs 91.9151 KOps/s 91.8816 KOps/s $\color{#35bf28}+0.04\%$
test_stacked_getitem 52.1080μs 10.1948μs 98.0895 KOps/s 99.3743 KOps/s $\color{#d91a1a}-1.29\%$
test_lock_nested 3.0270ms 0.4465ms 2.2395 KOps/s 1.8101 KOps/s $\textbf{\color{#35bf28}+23.72\%}$
test_lock_stack_nested 0.7020ms 0.4124ms 2.4246 KOps/s 2.4620 KOps/s $\color{#d91a1a}-1.52\%$
test_unlock_nested 0.7213ms 0.3580ms 2.7931 KOps/s 2.7682 KOps/s $\color{#35bf28}+0.90\%$
test_unlock_stack_nested 0.5079ms 0.3310ms 3.0211 KOps/s 3.0868 KOps/s $\color{#d91a1a}-2.13\%$
test_flatten_speed 0.1626ms 92.0235μs 10.8668 KOps/s 11.0268 KOps/s $\color{#d91a1a}-1.45\%$
test_unflatten_speed 0.8372ms 0.4676ms 2.1387 KOps/s 2.0866 KOps/s $\color{#35bf28}+2.49\%$
test_common_ops 5.0381ms 0.7868ms 1.2710 KOps/s 1.4250 KOps/s $\textbf{\color{#d91a1a}-10.80\%}$
test_creation 60.8240μs 2.0475μs 488.3893 KOps/s 472.0002 KOps/s $\color{#35bf28}+3.47\%$
test_creation_empty 49.0320μs 12.0304μs 83.1230 KOps/s 117.2208 KOps/s $\textbf{\color{#d91a1a}-29.09\%}$
test_creation_nested_1 40.3650μs 14.7785μs 67.6659 KOps/s 88.3209 KOps/s $\textbf{\color{#d91a1a}-23.39\%}$
test_creation_nested_2 50.6850μs 19.0358μs 52.5325 KOps/s 63.9391 KOps/s $\textbf{\color{#d91a1a}-17.84\%}$
test_clone 67.2950μs 13.1843μs 75.8479 KOps/s 75.4151 KOps/s $\color{#35bf28}+0.57\%$
test_getitem[int] 1.0628ms 13.2545μs 75.4460 KOps/s 80.0128 KOps/s $\textbf{\color{#d91a1a}-5.71\%}$
test_getitem[slice_int] 0.1526ms 25.0160μs 39.9744 KOps/s 41.5446 KOps/s $\color{#d91a1a}-3.78\%$
test_getitem[range] 0.2008ms 48.7615μs 20.5080 KOps/s 21.2662 KOps/s $\color{#d91a1a}-3.57\%$
test_getitem[tuple] 0.1368ms 20.7408μs 48.2142 KOps/s 50.0513 KOps/s $\color{#d91a1a}-3.67\%$
test_getitem[list] 0.1918ms 44.3601μs 22.5428 KOps/s 23.3529 KOps/s $\color{#d91a1a}-3.47\%$
test_setitem_dim[int] 48.4110μs 25.6116μs 39.0448 KOps/s 39.3948 KOps/s $\color{#d91a1a}-0.89\%$
test_setitem_dim[slice_int] 91.2000μs 51.4188μs 19.4481 KOps/s 19.3618 KOps/s $\color{#35bf28}+0.45\%$
test_setitem_dim[range] 0.1300ms 73.3779μs 13.6281 KOps/s 13.8368 KOps/s $\color{#d91a1a}-1.51\%$
test_setitem_dim[tuple] 78.7070μs 40.5910μs 24.6360 KOps/s 24.5011 KOps/s $\color{#35bf28}+0.55\%$
test_setitem 83.7160μs 20.6279μs 48.4781 KOps/s 53.0871 KOps/s $\textbf{\color{#d91a1a}-8.68\%}$
test_set 75.9010μs 20.4848μs 48.8166 KOps/s 54.8140 KOps/s $\textbf{\color{#d91a1a}-10.94\%}$
test_set_shared 1.2558ms 0.1708ms 5.8564 KOps/s 6.0029 KOps/s $\color{#d91a1a}-2.44\%$
test_update 0.1950ms 23.6054μs 42.3631 KOps/s 51.4130 KOps/s $\textbf{\color{#d91a1a}-17.60\%}$
test_update_nested 0.1041ms 32.8426μs 30.4482 KOps/s 33.8093 KOps/s $\textbf{\color{#d91a1a}-9.94\%}$
test_update__nested 1.0500ms 34.6020μs 28.9001 KOps/s 30.0866 KOps/s $\color{#d91a1a}-3.94\%$
test_set_nested 80.9810μs 22.5075μs 44.4295 KOps/s 48.5879 KOps/s $\textbf{\color{#d91a1a}-8.56\%}$
test_set_nested_new 75.0300μs 26.8933μs 37.1840 KOps/s 40.1327 KOps/s $\textbf{\color{#d91a1a}-7.35\%}$
test_select 0.1134ms 43.0814μs 23.2119 KOps/s 24.7907 KOps/s $\textbf{\color{#d91a1a}-6.37\%}$
test_select_nested 0.1423ms 59.4101μs 16.8321 KOps/s 16.9226 KOps/s $\color{#d91a1a}-0.53\%$
test_exclude_nested 0.3400ms 75.1750μs 13.3023 KOps/s 13.0666 KOps/s $\color{#35bf28}+1.80\%$
test_empty[True] 0.5433ms 0.3456ms 2.8934 KOps/s 2.8504 KOps/s $\color{#35bf28}+1.51\%$
test_empty[False] 8.9590μs 1.2444μs 803.6006 KOps/s 797.2712 KOps/s $\color{#35bf28}+0.79\%$
test_unbind_speed 0.3365ms 0.2609ms 3.8327 KOps/s 3.8651 KOps/s $\color{#d91a1a}-0.84\%$
test_unbind_speed_stack0 0.4999ms 0.2593ms 3.8561 KOps/s 3.9803 KOps/s $\color{#d91a1a}-3.12\%$
test_unbind_speed_stack1 0.1057s 0.7676ms 1.3028 KOps/s 1.4621 KOps/s $\textbf{\color{#d91a1a}-10.90\%}$
test_split 0.1055s 1.7809ms 561.5261 Ops/s 568.7199 Ops/s $\color{#d91a1a}-1.26\%$
test_chunk 0.1021s 1.7738ms 563.7591 Ops/s 573.0017 Ops/s $\color{#d91a1a}-1.61\%$
test_consolidate_njt[False-None] 8.5706ms 8.1763ms 122.3042 Ops/s 122.7524 Ops/s $\color{#d91a1a}-0.37\%$
test_creation[device0] 0.2186ms 90.5594μs 11.0425 KOps/s 11.0113 KOps/s $\color{#35bf28}+0.28\%$
test_creation_from_tensor 4.7050ms 96.1511μs 10.4003 KOps/s 10.5392 KOps/s $\color{#d91a1a}-1.32\%$
test_add_one[memmap_tensor0] 0.1681ms 5.0162μs 199.3529 KOps/s 205.4100 KOps/s $\color{#d91a1a}-2.95\%$
test_contiguous[memmap_tensor0] 12.6540μs 0.5153μs 1.9405 MOps/s 1.9633 MOps/s $\color{#d91a1a}-1.16\%$
test_stack[memmap_tensor0] 32.6010μs 3.5534μs 281.4245 KOps/s 293.9670 KOps/s $\color{#d91a1a}-4.27\%$
test_memmaptd_index 0.9247ms 0.2410ms 4.1493 KOps/s 4.2093 KOps/s $\color{#d91a1a}-1.43\%$
test_memmaptd_index_astensor 0.5784ms 0.3188ms 3.1371 KOps/s 3.1639 KOps/s $\color{#d91a1a}-0.85\%$
test_memmaptd_index_op 1.1280ms 0.6168ms 1.6212 KOps/s 1.8424 KOps/s $\textbf{\color{#d91a1a}-12.01\%}$
test_serialize_model 0.1278s 0.1157s 8.6398 Ops/s 8.8378 Ops/s $\color{#d91a1a}-2.24\%$
test_serialize_model_pickle 0.4454s 0.3868s 2.5852 Ops/s 2.5565 Ops/s $\color{#35bf28}+1.12\%$
test_serialize_weights 0.2147s 0.1281s 7.8041 Ops/s 7.9610 Ops/s $\color{#d91a1a}-1.97\%$
test_serialize_weights_returnearly 0.1677s 0.1587s 6.3009 Ops/s 6.5679 Ops/s $\color{#d91a1a}-4.06\%$
test_serialize_weights_pickle 0.5071s 0.4527s 2.2091 Ops/s 1.0816 Ops/s $\textbf{\color{#35bf28}+104.25\%}$
test_serialize_weights_filesystem 0.1488s 0.1397s 7.1556 Ops/s 7.2451 Ops/s $\color{#d91a1a}-1.23\%$
test_serialize_model_filesystem 0.2395s 0.1576s 6.3456 Ops/s 6.4908 Ops/s $\color{#d91a1a}-2.24\%$
test_reshape_pytree 59.9720μs 26.7817μs 37.3389 KOps/s 36.7046 KOps/s $\color{#35bf28}+1.73\%$
test_reshape_td 70.2610μs 33.5069μs 29.8446 KOps/s 30.3681 KOps/s $\color{#d91a1a}-1.72\%$
test_view_pytree 74.4680μs 27.3882μs 36.5121 KOps/s 37.0260 KOps/s $\color{#d91a1a}-1.39\%$
test_view_td 78.7370μs 38.2249μs 26.1610 KOps/s 25.5332 KOps/s $\color{#35bf28}+2.46\%$
test_unbind_pytree 69.1390μs 30.1428μs 33.1754 KOps/s 33.4777 KOps/s $\color{#d91a1a}-0.90\%$
test_unbind_td 0.3201ms 38.3959μs 26.0445 KOps/s 26.3251 KOps/s $\color{#d91a1a}-1.07\%$
test_split_pytree 81.0010μs 30.0997μs 33.2230 KOps/s 33.8392 KOps/s $\color{#d91a1a}-1.82\%$
test_split_td 0.5132ms 44.7321μs 22.3553 KOps/s 22.7975 KOps/s $\color{#d91a1a}-1.94\%$
test_add_pytree 76.6230μs 35.8683μs 27.8797 KOps/s 27.5891 KOps/s $\color{#35bf28}+1.05\%$
test_add_td 0.1073ms 58.6806μs 17.0414 KOps/s 19.0059 KOps/s $\textbf{\color{#d91a1a}-10.34\%}$
test_compile_add_one_nested[tensordict-compile] 0.1154ms 63.0498μs 15.8605 KOps/s 16.4320 KOps/s $\color{#d91a1a}-3.48\%$
test_compile_add_one_nested[tensordict-eager] 0.5165ms 0.1620ms 6.1734 KOps/s 6.2384 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_add_one_nested[pytree-compile] 0.1603ms 45.8977μs 21.7876 KOps/s 22.1039 KOps/s $\color{#d91a1a}-1.43\%$
test_compile_add_one_nested[pytree-eager] 0.2667ms 0.1206ms 8.2925 KOps/s 8.5047 KOps/s $\color{#d91a1a}-2.49\%$
test_compile_copy_nested[tensordict-compile] 68.6680μs 26.7237μs 37.4199 KOps/s 38.7123 KOps/s $\color{#d91a1a}-3.34\%$
test_compile_copy_nested[tensordict-eager] 0.1147ms 53.9360μs 18.5405 KOps/s 18.3902 KOps/s $\color{#35bf28}+0.82\%$
test_compile_copy_nested[pytree-compile] 0.1675ms 78.3640μs 12.7610 KOps/s 12.5745 KOps/s $\color{#35bf28}+1.48\%$
test_compile_copy_nested[pytree-eager] 0.1649ms 67.0907μs 14.9052 KOps/s 14.7841 KOps/s $\color{#35bf28}+0.82\%$
test_compile_add_one_flat[tensordict-compile] 0.1841ms 0.1060ms 9.4298 KOps/s 9.7242 KOps/s $\color{#d91a1a}-3.03\%$
test_compile_add_one_flat[tensordict-eager] 0.4215ms 0.1992ms 5.0190 KOps/s 5.0558 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_add_one_flat[tensorclass-compile] 0.1641ms 46.6190μs 21.4505 KOps/s 22.5946 KOps/s $\textbf{\color{#d91a1a}-5.06\%}$
test_compile_add_one_flat[tensorclass-eager] 0.4589ms 63.0654μs 15.8565 KOps/s 16.2988 KOps/s $\color{#d91a1a}-2.71\%$
test_compile_add_one_flat[pytree-compile] 0.1799ms 0.1038ms 9.6295 KOps/s 9.9606 KOps/s $\color{#d91a1a}-3.32\%$
test_compile_add_one_flat[pytree-eager] 0.2900ms 0.2001ms 4.9978 KOps/s 5.0160 KOps/s $\color{#d91a1a}-0.36\%$
test_compile_add_self_flat[tensordict-eager] 0.3675ms 0.2106ms 4.7477 KOps/s 4.7806 KOps/s $\color{#d91a1a}-0.69\%$
test_compile_add_self_flat[tensordict-compile] 0.2151ms 0.1086ms 9.2088 KOps/s 9.6580 KOps/s $\color{#d91a1a}-4.65\%$
test_compile_add_self_flat[tensorclass-eager] 0.2731ms 58.6120μs 17.0614 KOps/s 18.7553 KOps/s $\textbf{\color{#d91a1a}-9.03\%}$
test_compile_add_self_flat[tensorclass-compile] 0.2867ms 48.3853μs 20.6674 KOps/s 22.3968 KOps/s $\textbf{\color{#d91a1a}-7.72\%}$
test_compile_add_self_flat[pytree-eager] 0.6388ms 0.1578ms 6.3374 KOps/s 6.3222 KOps/s $\color{#35bf28}+0.24\%$
test_compile_add_self_flat[pytree-compile] 0.2302ms 0.1044ms 9.5772 KOps/s 10.0192 KOps/s $\color{#d91a1a}-4.41\%$
test_compile_copy_flat[tensordict-compile] 52.3880μs 21.3674μs 46.8002 KOps/s 47.3257 KOps/s $\color{#d91a1a}-1.11\%$
test_compile_copy_flat[tensordict-eager] 0.1916ms 59.4683μs 16.8157 KOps/s 16.6475 KOps/s $\color{#35bf28}+1.01\%$
test_compile_copy_flat[pytree-compile] 0.1569ms 80.8619μs 12.3668 KOps/s 12.4715 KOps/s $\color{#d91a1a}-0.84\%$
test_compile_copy_flat[pytree-eager] 0.1368ms 68.6262μs 14.5717 KOps/s 14.4908 KOps/s $\color{#35bf28}+0.56\%$
test_compile_assign_and_add[tensordict-compile] 0.2998ms 0.2105ms 4.7509 KOps/s 4.8669 KOps/s $\color{#d91a1a}-2.38\%$
test_compile_assign_and_add[tensordict-eager] 2.0292ms 1.2704ms 787.1579 Ops/s 777.7806 Ops/s $\color{#35bf28}+1.21\%$
test_compile_assign_and_add[pytree-compile] 0.3069ms 0.2045ms 4.8891 KOps/s 5.0413 KOps/s $\color{#d91a1a}-3.02\%$
test_compile_assign_and_add[pytree-eager] 1.6139ms 0.7820ms 1.2788 KOps/s 1.3000 KOps/s $\color{#d91a1a}-1.63\%$
test_compile_assign_and_add_stack[compile] 0.5545ms 0.4596ms 2.1759 KOps/s 2.2230 KOps/s $\color{#d91a1a}-2.12\%$
test_compile_assign_and_add_stack[eager] 4.4323ms 2.6898ms 371.7715 Ops/s 416.0850 Ops/s $\textbf{\color{#d91a1a}-10.65\%}$
test_compile_indexing[tensor-tensordict-compile] 91.2100μs 37.5304μs 26.6450 KOps/s 28.4713 KOps/s $\textbf{\color{#d91a1a}-6.41\%}$
test_compile_indexing[tensor-tensordict-eager] 0.5083ms 33.9138μs 29.4865 KOps/s 31.4695 KOps/s $\textbf{\color{#d91a1a}-6.30\%}$
test_compile_indexing[tensor-tensorclass-compile] 99.7550μs 30.1028μs 33.2195 KOps/s 34.7470 KOps/s $\color{#d91a1a}-4.40\%$
test_compile_indexing[tensor-tensorclass-eager] 68.9880μs 23.8304μs 41.9632 KOps/s 43.0562 KOps/s $\color{#d91a1a}-2.54\%$
test_compile_indexing[tensor-pytree-compile] 78.6870μs 30.7162μs 32.5561 KOps/s 33.6219 KOps/s $\color{#d91a1a}-3.17\%$
test_compile_indexing[tensor-pytree-eager] 81.8120μs 24.2047μs 41.3144 KOps/s 42.3829 KOps/s $\color{#d91a1a}-2.52\%$
test_compile_indexing[slice-tensordict-compile] 0.1222ms 51.9306μs 19.2565 KOps/s 19.5228 KOps/s $\color{#d91a1a}-1.36\%$
test_compile_indexing[slice-tensordict-eager] 0.5787ms 20.4952μs 48.7919 KOps/s 50.3628 KOps/s $\color{#d91a1a}-3.12\%$
test_compile_indexing[slice-tensorclass-compile] 0.1018ms 44.3031μs 22.5718 KOps/s 22.9807 KOps/s $\color{#d91a1a}-1.78\%$
test_compile_indexing[slice-tensorclass-eager] 83.8360μs 19.0741μs 52.4271 KOps/s 53.2498 KOps/s $\color{#d91a1a}-1.55\%$
test_compile_indexing[slice-pytree-compile] 0.1037ms 44.1338μs 22.6584 KOps/s 22.4780 KOps/s $\color{#35bf28}+0.80\%$
test_compile_indexing[slice-pytree-eager] 52.1980μs 18.7462μs 53.3442 KOps/s 52.9495 KOps/s $\color{#35bf28}+0.75\%$
test_compile_indexing[int-tensordict-compile] 0.1225ms 51.9727μs 19.2409 KOps/s 19.3580 KOps/s $\color{#d91a1a}-0.61\%$
test_compile_indexing[int-tensordict-eager] 0.9210ms 20.7097μs 48.2864 KOps/s 51.0076 KOps/s $\textbf{\color{#d91a1a}-5.33\%}$
test_compile_indexing[int-tensorclass-compile] 0.1183ms 44.8467μs 22.2982 KOps/s 22.4373 KOps/s $\color{#d91a1a}-0.62\%$
test_compile_indexing[int-tensorclass-eager] 0.5420ms 19.0468μs 52.5022 KOps/s 53.8246 KOps/s $\color{#d91a1a}-2.46\%$
test_compile_indexing[int-pytree-compile] 0.1189ms 44.3702μs 22.5376 KOps/s 22.6880 KOps/s $\color{#d91a1a}-0.66\%$
test_compile_indexing[int-pytree-eager] 53.1890μs 18.9328μs 52.8183 KOps/s 53.5607 KOps/s $\color{#d91a1a}-1.39\%$
test_mod_add[eager] 97.3810μs 27.2951μs 36.6366 KOps/s 39.5830 KOps/s $\textbf{\color{#d91a1a}-7.44\%}$
test_mod_add[compile] 0.1100ms 45.8519μs 21.8093 KOps/s 21.9963 KOps/s $\color{#d91a1a}-0.85\%$
test_mod_add[compile-overhead] 0.1051ms 45.7395μs 21.8630 KOps/s 22.0983 KOps/s $\color{#d91a1a}-1.06\%$
test_mod_wrap[eager] 0.3791ms 0.2204ms 4.5380 KOps/s 4.7858 KOps/s $\textbf{\color{#d91a1a}-5.18\%}$
test_mod_wrap[compile] 1.2420ms 0.2080ms 4.8088 KOps/s 4.9766 KOps/s $\color{#d91a1a}-3.37\%$
test_mod_wrap[compile-overhead] 1.4759ms 0.2071ms 4.8293 KOps/s 4.9975 KOps/s $\color{#d91a1a}-3.37\%$
test_mod_wrap_and_backward[eager] 14.2620ms 11.8514ms 84.3784 Ops/s 92.0458 Ops/s $\textbf{\color{#d91a1a}-8.33\%}$
test_mod_wrap_and_backward[compile] 15.9273ms 12.8694ms 77.7039 Ops/s 93.7276 Ops/s $\textbf{\color{#d91a1a}-17.10\%}$
test_mod_wrap_and_backward[compile-overhead] 16.0870ms 11.4145ms 87.6082 Ops/s 91.9859 Ops/s $\color{#d91a1a}-4.76\%$
test_seq_add[eager] 0.1642ms 92.3512μs 10.8282 KOps/s 11.3659 KOps/s $\color{#d91a1a}-4.73\%$
test_seq_add[compile] 0.1425ms 61.9244μs 16.1487 KOps/s 16.9519 KOps/s $\color{#d91a1a}-4.74\%$
test_seq_add[compile-overhead] 0.1121ms 60.9247μs 16.4137 KOps/s 17.3890 KOps/s $\textbf{\color{#d91a1a}-5.61\%}$
test_seq_wrap[eager] 0.6089ms 0.3986ms 2.5090 KOps/s 2.6623 KOps/s $\textbf{\color{#d91a1a}-5.76\%}$
test_seq_wrap[compile] 0.3285ms 0.2299ms 4.3491 KOps/s 4.5137 KOps/s $\color{#d91a1a}-3.65\%$
test_seq_wrap[compile-overhead] 0.4188ms 0.2292ms 4.3635 KOps/s 4.5128 KOps/s $\color{#d91a1a}-3.31\%$
test_func_call_runtime[False-eager] 0.8624ms 0.5624ms 1.7781 KOps/s 1.8724 KOps/s $\textbf{\color{#d91a1a}-5.03\%}$
test_func_call_runtime[False-compile] 0.8044ms 0.4369ms 2.2890 KOps/s 2.3675 KOps/s $\color{#d91a1a}-3.31\%$
test_func_call_runtime[False-compile-overhead] 0.7996ms 0.4357ms 2.2953 KOps/s 2.3421 KOps/s $\color{#d91a1a}-2.00\%$
test_func_call_runtime[True-eager] 0.9298ms 0.7719ms 1.2955 KOps/s 1.3402 KOps/s $\color{#d91a1a}-3.33\%$
test_func_call_runtime[True-compile] 0.5764ms 0.4709ms 2.1237 KOps/s 2.1541 KOps/s $\color{#d91a1a}-1.41\%$
test_func_call_runtime[True-compile-overhead] 0.6769ms 0.4713ms 2.1218 KOps/s 2.1652 KOps/s $\color{#d91a1a}-2.00\%$
test_func_call_cm_runtime[False-eager] 0.7617ms 0.5619ms 1.7796 KOps/s 1.8772 KOps/s $\textbf{\color{#d91a1a}-5.20\%}$
test_func_call_cm_runtime[False-compile] 0.7744ms 0.4353ms 2.2975 KOps/s 2.3539 KOps/s $\color{#d91a1a}-2.39\%$
test_func_call_cm_runtime[False-compile-overhead] 0.7611ms 0.4367ms 2.2899 KOps/s 2.3458 KOps/s $\color{#d91a1a}-2.38\%$
test_func_call_cm_runtime[True-eager] 1.0763ms 0.9089ms 1.1003 KOps/s 1.1128 KOps/s $\color{#d91a1a}-1.13\%$
test_func_call_cm_runtime[True-compile] 1.1400ms 0.5019ms 1.9925 KOps/s 2.0508 KOps/s $\color{#d91a1a}-2.84\%$
test_func_call_cm_runtime[True-compile-overhead] 0.9198ms 0.5060ms 1.9761 KOps/s 2.0549 KOps/s $\color{#d91a1a}-3.83\%$
test_vmap_func_call_cm_runtime[eager] 2.4227ms 1.9092ms 523.7663 Ops/s 530.1247 Ops/s $\color{#d91a1a}-1.20\%$
test_vmap_func_call_cm_runtime[compile] 0.9215ms 0.5261ms 1.9006 KOps/s 1.9516 KOps/s $\color{#d91a1a}-2.61\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.7036ms 0.5227ms 1.9133 KOps/s 1.9519 KOps/s $\color{#d91a1a}-1.98\%$
test_distributed 0.2415ms 0.1309ms 7.6413 KOps/s 7.9004 KOps/s $\color{#d91a1a}-3.28\%$
test_tdmodule 47.8890μs 19.5054μs 51.2680 KOps/s 58.3648 KOps/s $\textbf{\color{#d91a1a}-12.16\%}$
test_tdmodule_dispatch 70.3320μs 38.1131μs 26.2377 KOps/s 30.3810 KOps/s $\textbf{\color{#d91a1a}-13.64\%}$
test_tdseq 42.9000μs 23.2419μs 43.0258 KOps/s 51.3279 KOps/s $\textbf{\color{#d91a1a}-16.17\%}$
test_tdseq_dispatch 75.7520μs 44.6838μs 22.3795 KOps/s 26.3995 KOps/s $\textbf{\color{#d91a1a}-15.23\%}$
test_instantiation_functorch 1.8304ms 1.5521ms 644.2837 Ops/s 658.8427 Ops/s $\color{#d91a1a}-2.21\%$
test_exec_functorch 0.3271ms 0.1803ms 5.5459 KOps/s 5.6909 KOps/s $\color{#d91a1a}-2.55\%$
test_exec_functional_call 0.2741ms 0.1783ms 5.6087 KOps/s 5.9328 KOps/s $\textbf{\color{#d91a1a}-5.46\%}$
test_exec_td_decorator 0.4724ms 0.2344ms 4.2655 KOps/s 4.3882 KOps/s $\color{#d91a1a}-2.80\%$
test_vmap_mlp_speed_decorator[True-True] 1.4189ms 0.6457ms 1.5487 KOps/s 1.6139 KOps/s $\color{#d91a1a}-4.04\%$
test_vmap_mlp_speed_decorator[True-False] 0.8766ms 0.6477ms 1.5440 KOps/s 1.6011 KOps/s $\color{#d91a1a}-3.57\%$
test_vmap_mlp_speed_decorator[False-True] 0.8559ms 0.5266ms 1.8990 KOps/s 1.9199 KOps/s $\color{#d91a1a}-1.09\%$
test_vmap_mlp_speed_decorator[False-False] 0.7358ms 0.5246ms 1.9063 KOps/s 1.9344 KOps/s $\color{#d91a1a}-1.45\%$
test_to_module_speed[True] 2.0365ms 1.3144ms 760.8227 Ops/s 781.1927 Ops/s $\color{#d91a1a}-2.61\%$
test_to_module_speed[False] 1.9700ms 1.2681ms 788.5705 Ops/s 799.4008 Ops/s $\color{#d91a1a}-1.35\%$
test_tc_init 94.5760μs 45.6625μs 21.8998 KOps/s 23.0962 KOps/s $\textbf{\color{#d91a1a}-5.18\%}$
test_tc_init_nested 0.1939ms 90.0539μs 11.1045 KOps/s 11.3617 KOps/s $\color{#d91a1a}-2.26\%$
test_tc_first_layer_tensor 21.0990μs 1.5176μs 658.9179 KOps/s 613.9251 KOps/s $\textbf{\color{#35bf28}+7.33\%}$
test_tc_first_layer_nontensor 22.9930μs 4.5723μs 218.7063 KOps/s 206.8475 KOps/s $\textbf{\color{#35bf28}+5.73\%}$
test_tc_second_layer_tensor 22.3120μs 2.7701μs 361.0006 KOps/s 335.0777 KOps/s $\textbf{\color{#35bf28}+7.74\%}$
test_tc_second_layer_nontensor 35.7670μs 5.9698μs 167.5085 KOps/s 160.8698 KOps/s $\color{#35bf28}+4.13\%$
test_unbind 0.2069s 13.0928ms 76.3777 Ops/s 83.0251 Ops/s $\textbf{\color{#d91a1a}-8.01\%}$
test_full_like 8.1580ms 7.0680ms 141.4821 Ops/s 136.9367 Ops/s $\color{#35bf28}+3.32\%$
test_zeros_like 3.0569ms 2.6881ms 372.0062 Ops/s 358.8711 Ops/s $\color{#35bf28}+3.66\%$
test_ones_like 3.7032ms 3.2408ms 308.5613 Ops/s 301.9916 Ops/s $\color{#35bf28}+2.18\%$
test_clone 5.1122ms 4.8817ms 204.8483 Ops/s 198.5908 Ops/s $\color{#35bf28}+3.15\%$
test_squeeze 54.8620μs 12.0681μs 82.8629 KOps/s 82.6870 KOps/s $\color{#35bf28}+0.21\%$
test_unsqueeze 0.1714ms 87.4070μs 11.4407 KOps/s 11.3761 KOps/s $\color{#35bf28}+0.57\%$
test_split 0.8531ms 0.1915ms 5.2215 KOps/s 5.2384 KOps/s $\color{#d91a1a}-0.32\%$
test_permute 0.3458ms 0.2205ms 4.5355 KOps/s 4.6784 KOps/s $\color{#d91a1a}-3.05\%$
test_stack 27.7131ms 25.5232ms 39.1801 Ops/s 39.6075 Ops/s $\color{#d91a1a}-1.08\%$
test_cat 30.0245ms 25.4532ms 39.2878 Ops/s 39.8630 Ops/s $\color{#d91a1a}-1.44\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}20$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 31.2010μs 10.9677μs 91.1769 KOps/s 97.5847 KOps/s $\textbf{\color{#d91a1a}-6.57\%}$
test_plain_set_stack_nested 28.1700μs 11.0420μs 90.5635 KOps/s 96.7851 KOps/s $\textbf{\color{#d91a1a}-6.43\%}$
test_plain_set_nested_inplace 39.0810μs 11.7683μs 84.9738 KOps/s 89.1536 KOps/s $\color{#d91a1a}-4.69\%$
test_plain_set_stack_nested_inplace 56.0010μs 11.7652μs 84.9961 KOps/s 89.3660 KOps/s $\color{#d91a1a}-4.89\%$
test_items 29.4610μs 2.8829μs 346.8775 KOps/s 340.6430 KOps/s $\color{#35bf28}+1.83\%$
test_items_nested 0.3720ms 0.3175ms 3.1500 KOps/s 3.1382 KOps/s $\color{#35bf28}+0.38\%$
test_items_nested_locked 0.3744ms 0.3212ms 3.1132 KOps/s 3.1228 KOps/s $\color{#d91a1a}-0.31\%$
test_items_nested_leaf 87.3510μs 58.0675μs 17.2213 KOps/s 17.2384 KOps/s $\color{#d91a1a}-0.10\%$
test_items_stack_nested 0.3736ms 0.3224ms 3.1019 KOps/s 3.1133 KOps/s $\color{#d91a1a}-0.37\%$
test_items_stack_nested_leaf 0.1061ms 59.6377μs 16.7679 KOps/s 16.9815 KOps/s $\color{#d91a1a}-1.26\%$
test_items_stack_nested_locked 0.3558ms 0.3225ms 3.1011 KOps/s 3.0907 KOps/s $\color{#35bf28}+0.34\%$
test_keys 31.5110μs 3.4806μs 287.3065 KOps/s 268.0537 KOps/s $\textbf{\color{#35bf28}+7.18\%}$
test_keys_nested 98.4920μs 71.0969μs 14.0653 KOps/s 14.1769 KOps/s $\color{#d91a1a}-0.79\%$
test_keys_nested_locked 0.6674ms 75.8116μs 13.1906 KOps/s 13.0458 KOps/s $\color{#35bf28}+1.11\%$
test_keys_nested_leaf 89.4220μs 61.5430μs 16.2488 KOps/s 16.0298 KOps/s $\color{#35bf28}+1.37\%$
test_keys_stack_nested 0.1128ms 71.2178μs 14.0414 KOps/s 13.8997 KOps/s $\color{#35bf28}+1.02\%$
test_keys_stack_nested_leaf 88.1810μs 62.2982μs 16.0518 KOps/s 15.9453 KOps/s $\color{#35bf28}+0.67\%$
test_keys_stack_nested_locked 0.1051ms 76.7125μs 13.0357 KOps/s 13.0043 KOps/s $\color{#35bf28}+0.24\%$
test_values 14.1020μs 0.8643μs 1.1570 MOps/s 1.1516 MOps/s $\color{#35bf28}+0.47\%$
test_values_nested 90.4320μs 31.4089μs 31.8381 KOps/s 31.7128 KOps/s $\color{#35bf28}+0.40\%$
test_values_nested_locked 55.7820μs 32.9762μs 30.3249 KOps/s 30.1718 KOps/s $\color{#35bf28}+0.51\%$
test_values_nested_leaf 72.9810μs 33.9301μs 29.4724 KOps/s 29.4624 KOps/s $\color{#35bf28}+0.03\%$
test_values_stack_nested 56.1610μs 32.1659μs 31.0888 KOps/s 31.4827 KOps/s $\color{#d91a1a}-1.25\%$
test_values_stack_nested_leaf 61.0120μs 34.2698μs 29.1802 KOps/s 29.1072 KOps/s $\color{#35bf28}+0.25\%$
test_values_stack_nested_locked 73.8310μs 33.7009μs 29.6728 KOps/s 29.8773 KOps/s $\color{#d91a1a}-0.68\%$
test_membership 1.7540μs 0.5262μs 1.9003 MOps/s 1.8839 MOps/s $\color{#35bf28}+0.87\%$
test_membership_nested 17.3355μs 1.8872μs 529.8716 KOps/s 540.2062 KOps/s $\color{#d91a1a}-1.91\%$
test_membership_nested_leaf 18.1337μs 1.8577μs 538.3098 KOps/s 542.5843 KOps/s $\color{#d91a1a}-0.79\%$
test_membership_stacked_nested 35.5810μs 1.9492μs 513.0247 KOps/s 505.3051 KOps/s $\color{#35bf28}+1.53\%$
test_membership_stacked_nested_leaf 35.4610μs 1.9780μs 505.5737 KOps/s 505.7108 KOps/s $\color{#d91a1a}-0.03\%$
test_membership_nested_last 28.9810μs 2.8039μs 356.6461 KOps/s 360.5842 KOps/s $\color{#d91a1a}-1.09\%$
test_membership_nested_leaf_last 28.6100μs 2.7901μs 358.4059 KOps/s 359.2276 KOps/s $\color{#d91a1a}-0.23\%$
test_membership_stacked_nested_last 25.5200μs 3.2472μs 307.9569 KOps/s 352.7165 KOps/s $\textbf{\color{#d91a1a}-12.69\%}$
test_membership_stacked_nested_leaf_last 36.3710μs 3.2410μs 308.5472 KOps/s 358.0241 KOps/s $\textbf{\color{#d91a1a}-13.82\%}$
test_nested_getleaf 40.8710μs 5.9433μs 168.2556 KOps/s 165.9670 KOps/s $\color{#35bf28}+1.38\%$
test_nested_get 27.1210μs 5.6665μs 176.4758 KOps/s 174.7200 KOps/s $\color{#35bf28}+1.00\%$
test_stacked_getleaf 36.8810μs 5.9734μs 167.4102 KOps/s 166.8124 KOps/s $\color{#35bf28}+0.36\%$
test_stacked_get 44.4910μs 5.6912μs 175.7108 KOps/s 175.4465 KOps/s $\color{#35bf28}+0.15\%$
test_nested_getitemleaf 28.8100μs 6.0761μs 164.5779 KOps/s 164.1802 KOps/s $\color{#35bf28}+0.24\%$
test_nested_getitem 33.5310μs 5.7747μs 173.1688 KOps/s 173.6816 KOps/s $\color{#d91a1a}-0.30\%$
test_stacked_getitemleaf 27.6210μs 6.0349μs 165.7028 KOps/s 163.8201 KOps/s $\color{#35bf28}+1.15\%$
test_stacked_getitem 39.8110μs 5.7743μs 173.1822 KOps/s 173.1067 KOps/s $\color{#35bf28}+0.04\%$
test_lock_nested 9.3811ms 0.3781ms 2.6449 KOps/s 2.6994 KOps/s $\color{#d91a1a}-2.02\%$
test_lock_stack_nested 0.3757ms 0.3384ms 2.9552 KOps/s 2.9796 KOps/s $\color{#d91a1a}-0.82\%$
test_unlock_nested 0.7793ms 0.3092ms 3.2340 KOps/s 3.2448 KOps/s $\color{#d91a1a}-0.33\%$
test_unlock_stack_nested 0.3087ms 0.2776ms 3.6023 KOps/s 3.6413 KOps/s $\color{#d91a1a}-1.07\%$
test_flatten_speed 0.1063ms 73.2237μs 13.6568 KOps/s 13.9558 KOps/s $\color{#d91a1a}-2.14\%$
test_unflatten_speed 0.3289ms 0.2927ms 3.4167 KOps/s 3.4374 KOps/s $\color{#d91a1a}-0.60\%$
test_common_ops 1.7747ms 0.5840ms 1.7124 KOps/s 1.7390 KOps/s $\color{#d91a1a}-1.53\%$
test_creation 0.1012ms 1.4774μs 676.8866 KOps/s 671.7622 KOps/s $\color{#35bf28}+0.76\%$
test_creation_empty 36.2610μs 8.0111μs 124.8265 KOps/s 150.8052 KOps/s $\textbf{\color{#d91a1a}-17.23\%}$
test_creation_nested_1 1.6372ms 9.4935μs 105.3348 KOps/s 123.0840 KOps/s $\textbf{\color{#d91a1a}-14.42\%}$
test_creation_nested_2 40.6200μs 12.1388μs 82.3807 KOps/s 94.2240 KOps/s $\textbf{\color{#d91a1a}-12.57\%}$
test_clone 50.1910μs 10.0723μs 99.2818 KOps/s 92.7969 KOps/s $\textbf{\color{#35bf28}+6.99\%}$
test_getitem[int] 93.5192ms 15.6549μs 63.8778 KOps/s 93.6130 KOps/s $\textbf{\color{#d91a1a}-31.76\%}$
test_getitem[slice_int] 0.1039ms 19.9262μs 50.1851 KOps/s 48.6599 KOps/s $\color{#35bf28}+3.13\%$
test_getitem[range] 0.1316ms 36.6065μs 27.3176 KOps/s 26.8884 KOps/s $\color{#35bf28}+1.60\%$
test_getitem[tuple] 0.1091ms 17.5860μs 56.8633 KOps/s 55.0919 KOps/s $\color{#35bf28}+3.22\%$
test_getitem[list] 0.2248ms 32.0869μs 31.1653 KOps/s 30.4066 KOps/s $\color{#35bf28}+2.50\%$
test_setitem_dim[int] 38.9610μs 17.8038μs 56.1678 KOps/s 52.8444 KOps/s $\textbf{\color{#35bf28}+6.29\%}$
test_setitem_dim[slice_int] 62.6720μs 36.1534μs 27.6599 KOps/s 26.7797 KOps/s $\color{#35bf28}+3.29\%$
test_setitem_dim[range] 76.7610μs 52.2357μs 19.1440 KOps/s 18.8946 KOps/s $\color{#35bf28}+1.32\%$
test_setitem_dim[tuple] 52.6810μs 30.8996μs 32.3628 KOps/s 31.6466 KOps/s $\color{#35bf28}+2.26\%$
test_setitem 74.2810μs 14.5792μs 68.5907 KOps/s 68.8240 KOps/s $\color{#d91a1a}-0.34\%$
test_set 86.1720μs 13.6925μs 73.0325 KOps/s 70.4541 KOps/s $\color{#35bf28}+3.66\%$
test_set_shared 1.4890ms 0.1461ms 6.8446 KOps/s 6.7406 KOps/s $\color{#35bf28}+1.54\%$
test_update 0.3449ms 16.5403μs 60.4582 KOps/s 61.3159 KOps/s $\color{#d91a1a}-1.40\%$
test_update_nested 84.0620μs 21.6430μs 46.2042 KOps/s 48.3300 KOps/s $\color{#d91a1a}-4.40\%$
test_update__nested 0.5825ms 23.7796μs 42.0529 KOps/s 41.0022 KOps/s $\color{#35bf28}+2.56\%$
test_set_nested 82.6220μs 15.2352μs 65.6373 KOps/s 65.6441 KOps/s $\color{#d91a1a}-0.01\%$
test_set_nested_new 86.2120μs 17.3377μs 57.6779 KOps/s 57.9964 KOps/s $\color{#d91a1a}-0.55\%$
test_select 91.2420μs 29.1017μs 34.3622 KOps/s 35.1078 KOps/s $\color{#d91a1a}-2.12\%$
test_select_nested 82.4520μs 42.3158μs 23.6319 KOps/s 24.0781 KOps/s $\color{#d91a1a}-1.85\%$
test_exclude_nested 94.1420μs 59.5318μs 16.7978 KOps/s 16.8439 KOps/s $\color{#d91a1a}-0.27\%$
test_empty[True] 0.3117ms 0.2579ms 3.8776 KOps/s 3.8767 KOps/s $\color{#35bf28}+0.02\%$
test_empty[False] 3.2870μs 0.7515μs 1.3306 MOps/s 1.3462 MOps/s $\color{#d91a1a}-1.15\%$
test_to 86.2610μs 56.0297μs 17.8477 KOps/s 18.0297 KOps/s $\color{#d91a1a}-1.01\%$
test_to_nonblocking 85.2510μs 45.0099μs 22.2173 KOps/s 21.9977 KOps/s $\color{#35bf28}+1.00\%$
test_unbind_speed 0.2621ms 0.2333ms 4.2859 KOps/s 4.2836 KOps/s $\color{#35bf28}+0.05\%$
test_unbind_speed_stack0 0.2907ms 0.2364ms 4.2298 KOps/s 4.2830 KOps/s $\color{#d91a1a}-1.24\%$
test_unbind_speed_stack1 92.5633ms 0.6636ms 1.5069 KOps/s 1.5328 KOps/s $\color{#d91a1a}-1.69\%$
test_split 94.2461ms 1.5586ms 641.6218 Ops/s 585.3625 Ops/s $\textbf{\color{#35bf28}+9.61\%}$
test_chunk 96.3190ms 1.5718ms 636.1943 Ops/s 634.6425 Ops/s $\color{#35bf28}+0.24\%$
test_consolidate[False-None] 2.6877ms 2.5796ms 387.6632 Ops/s 387.1267 Ops/s $\color{#35bf28}+0.14\%$
test_consolidate[default-None] 1.7053ms 1.6342ms 611.9167 Ops/s 606.3289 Ops/s $\color{#35bf28}+0.92\%$
test_consolidate[reduce-overhead-None] 1.7293ms 1.6718ms 598.1558 Ops/s 592.3347 Ops/s $\color{#35bf28}+0.98\%$
test_consolidate_njt[False-None] 6.5812ms 6.3191ms 158.2504 Ops/s 154.0608 Ops/s $\color{#35bf28}+2.72\%$
test_to[False-False-None] 1.7594ms 1.6670ms 599.8811 Ops/s 592.5163 Ops/s $\color{#35bf28}+1.24\%$
test_to[True-False-None] 1.4974ms 1.2252ms 816.1811 Ops/s 766.0645 Ops/s $\textbf{\color{#35bf28}+6.54\%}$
test_to[within-False-None] 4.0146ms 3.9091ms 255.8125 Ops/s 250.9306 Ops/s $\color{#35bf28}+1.95\%$
test_to[True-default-None] 5.5736ms 5.0950ms 196.2709 Ops/s 189.6743 Ops/s $\color{#35bf28}+3.48\%$
test_to_njt[False-False-None] 6.9679ms 6.8082ms 146.8823 Ops/s 143.9741 Ops/s $\color{#35bf28}+2.02\%$
test_to_njt[True-False-None] 5.5276ms 5.3148ms 188.1539 Ops/s 183.5675 Ops/s $\color{#35bf28}+2.50\%$
test_to_njt[within-False-None] 12.1252ms 11.9164ms 83.9176 Ops/s 82.8130 Ops/s $\color{#35bf28}+1.33\%$
test_creation[device0] 0.4635ms 78.6284μs 12.7181 KOps/s 11.9623 KOps/s $\textbf{\color{#35bf28}+6.32\%}$
test_creation_from_tensor 0.4571ms 82.1826μs 12.1680 KOps/s 11.5215 KOps/s $\textbf{\color{#35bf28}+5.61\%}$
test_add_one[memmap_tensor0] 0.7969ms 6.7283μs 148.6270 KOps/s 147.4919 KOps/s $\color{#35bf28}+0.77\%$
test_contiguous[memmap_tensor0] 1.8110μs 0.4113μs 2.4315 MOps/s 2.4656 MOps/s $\color{#d91a1a}-1.38\%$
test_stack[memmap_tensor0] 38.7900μs 4.5922μs 217.7596 KOps/s 226.9119 KOps/s $\color{#d91a1a}-4.03\%$
test_memmaptd_index 1.7373ms 0.2521ms 3.9672 KOps/s 4.0959 KOps/s $\color{#d91a1a}-3.14\%$
test_memmaptd_index_astensor 0.5797ms 0.3123ms 3.2023 KOps/s 3.2993 KOps/s $\color{#d91a1a}-2.94\%$
test_memmaptd_index_op 1.0236ms 0.5861ms 1.7063 KOps/s 1.7781 KOps/s $\color{#d91a1a}-4.04\%$
test_serialize_model 0.4169s 0.1716s 5.8267 Ops/s 7.6867 Ops/s $\textbf{\color{#d91a1a}-24.20\%}$
test_serialize_model_pickle 1.3787s 1.2186s 0.8206 Ops/s 0.8240 Ops/s $\color{#d91a1a}-0.41\%$
test_serialize_weights 0.1309s 0.1304s 7.6694 Ops/s 7.7109 Ops/s $\color{#d91a1a}-0.54\%$
test_serialize_weights_returnearly 0.3186s 53.4740ms 18.7007 Ops/s 14.5993 Ops/s $\textbf{\color{#35bf28}+28.09\%}$
test_serialize_weights_pickle 1.3529s 1.2125s 0.8247 Ops/s 0.8258 Ops/s $\color{#d91a1a}-0.14\%$
test_reshape_pytree 51.6510μs 21.8991μs 45.6639 KOps/s 45.3120 KOps/s $\color{#35bf28}+0.78\%$
test_reshape_td 47.4910μs 26.1908μs 38.1813 KOps/s 37.0378 KOps/s $\color{#35bf28}+3.09\%$
test_view_pytree 48.8010μs 21.9828μs 45.4902 KOps/s 44.6860 KOps/s $\color{#35bf28}+1.80\%$
test_view_td 56.4410μs 29.1695μs 34.2824 KOps/s 32.8259 KOps/s $\color{#35bf28}+4.44\%$
test_unbind_pytree 54.5410μs 27.5916μs 36.2429 KOps/s 34.9198 KOps/s $\color{#35bf28}+3.79\%$
test_unbind_td 0.7630ms 35.5266μs 28.1479 KOps/s 26.8295 KOps/s $\color{#35bf28}+4.91\%$
test_split_pytree 59.8810μs 30.0568μs 33.2704 KOps/s 33.4884 KOps/s $\color{#d91a1a}-0.65\%$
test_split_td 0.9307ms 37.5154μs 26.6557 KOps/s 25.4092 KOps/s $\color{#35bf28}+4.91\%$
test_add_pytree 63.9410μs 33.9946μs 29.4165 KOps/s 28.5215 KOps/s $\color{#35bf28}+3.14\%$
test_add_td 82.4810μs 45.2332μs 22.1077 KOps/s 19.4817 KOps/s $\textbf{\color{#35bf28}+13.48\%}$
test_compile_add_one_nested[tensordict-compile] 0.1742ms 0.1186ms 8.4315 KOps/s 8.1282 KOps/s $\color{#35bf28}+3.73\%$
test_compile_add_one_nested[tensordict-eager] 0.2216ms 0.1249ms 8.0090 KOps/s 8.0225 KOps/s $\color{#d91a1a}-0.17\%$
test_compile_add_one_nested[pytree-compile] 0.1697ms 94.1642μs 10.6197 KOps/s 10.3960 KOps/s $\color{#35bf28}+2.15\%$
test_compile_add_one_nested[pytree-eager] 1.6611ms 0.1491ms 6.7060 KOps/s 6.5872 KOps/s $\color{#35bf28}+1.80\%$
test_compile_copy_nested[tensordict-compile] 53.4710μs 22.7985μs 43.8626 KOps/s 42.7160 KOps/s $\color{#35bf28}+2.68\%$
test_compile_copy_nested[tensordict-eager] 73.4510μs 26.6527μs 37.5197 KOps/s 36.5454 KOps/s $\color{#35bf28}+2.67\%$
test_compile_copy_nested[pytree-compile] 96.5720μs 64.5708μs 15.4869 KOps/s 15.2507 KOps/s $\color{#35bf28}+1.55\%$
test_compile_copy_nested[pytree-eager] 80.7510μs 49.2407μs 20.3084 KOps/s 19.7471 KOps/s $\color{#35bf28}+2.84\%$
test_compile_add_one_flat[tensordict-compile] 0.1973ms 0.1420ms 7.0401 KOps/s 6.8777 KOps/s $\color{#35bf28}+2.36\%$
test_compile_add_one_flat[tensordict-eager] 0.2931ms 0.2075ms 4.8188 KOps/s 4.8125 KOps/s $\color{#35bf28}+0.13\%$
test_compile_add_one_flat[tensorclass-compile] 0.1344ms 97.5529μs 10.2509 KOps/s 10.0189 KOps/s $\color{#35bf28}+2.32\%$
test_compile_add_one_flat[tensorclass-eager] 0.1092ms 51.5175μs 19.4109 KOps/s 19.4830 KOps/s $\color{#d91a1a}-0.37\%$
test_compile_add_one_flat[pytree-compile] 0.2588ms 0.1354ms 7.3840 KOps/s 7.2832 KOps/s $\color{#35bf28}+1.38\%$
test_compile_add_one_flat[pytree-eager] 0.5327ms 0.4766ms 2.0980 KOps/s 2.0347 KOps/s $\color{#35bf28}+3.11\%$
test_compile_add_self_flat[tensordict-eager] 0.4012ms 0.2481ms 4.0302 KOps/s 4.0200 KOps/s $\color{#35bf28}+0.26\%$
test_compile_add_self_flat[tensordict-compile] 0.1821ms 0.1424ms 7.0203 KOps/s 6.9140 KOps/s $\color{#35bf28}+1.54\%$
test_compile_add_self_flat[tensorclass-eager] 0.1444ms 60.7100μs 16.4718 KOps/s 16.2893 KOps/s $\color{#35bf28}+1.12\%$
test_compile_add_self_flat[tensorclass-compile] 0.1607ms 98.1031μs 10.1934 KOps/s 10.1036 KOps/s $\color{#35bf28}+0.89\%$
test_compile_add_self_flat[pytree-eager] 0.4566ms 0.4092ms 2.4438 KOps/s 2.4606 KOps/s $\color{#d91a1a}-0.68\%$
test_compile_add_self_flat[pytree-compile] 0.1737ms 0.1363ms 7.3384 KOps/s 7.2921 KOps/s $\color{#35bf28}+0.63\%$
test_compile_copy_flat[tensordict-compile] 56.7610μs 19.7475μs 50.6393 KOps/s 53.6197 KOps/s $\textbf{\color{#d91a1a}-5.56\%}$
test_compile_copy_flat[tensordict-eager] 57.8410μs 26.9664μs 37.0832 KOps/s 36.7959 KOps/s $\color{#35bf28}+0.78\%$
test_compile_copy_flat[pytree-compile] 0.2052ms 70.0202μs 14.2816 KOps/s 14.2516 KOps/s $\color{#35bf28}+0.21\%$
test_compile_copy_flat[pytree-eager] 81.3820μs 51.7809μs 19.3122 KOps/s 19.4565 KOps/s $\color{#d91a1a}-0.74\%$
test_compile_assign_and_add[tensordict-compile] 1.5840ms 0.3847ms 2.5994 KOps/s 2.2540 KOps/s $\textbf{\color{#35bf28}+15.33\%}$
test_compile_assign_and_add[tensordict-eager] 2.6679ms 2.5535ms 391.6177 Ops/s 390.7478 Ops/s $\color{#35bf28}+0.22\%$
test_compile_assign_and_add[pytree-compile] 1.5611ms 0.3763ms 2.6573 KOps/s 2.2605 KOps/s $\textbf{\color{#35bf28}+17.55\%}$
test_compile_assign_and_add[pytree-eager] 3.1382ms 2.6021ms 384.3041 Ops/s 378.3354 Ops/s $\color{#35bf28}+1.58\%$
test_compile_indexing[tensor-tensordict-compile] 0.8365ms 0.1170ms 8.5451 KOps/s 8.8909 KOps/s $\color{#d91a1a}-3.89\%$
test_compile_indexing[tensor-tensordict-eager] 0.5856ms 82.2530μs 12.1576 KOps/s 12.5904 KOps/s $\color{#d91a1a}-3.44\%$
test_compile_indexing[tensor-tensorclass-compile] 0.5952ms 0.1088ms 9.1948 KOps/s 9.5318 KOps/s $\color{#d91a1a}-3.54\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1132ms 68.0479μs 14.6955 KOps/s 14.8589 KOps/s $\color{#d91a1a}-1.10\%$
test_compile_indexing[tensor-pytree-compile] 0.1540ms 0.1090ms 9.1732 KOps/s 9.4903 KOps/s $\color{#d91a1a}-3.34\%$
test_compile_indexing[tensor-pytree-eager] 0.1770ms 68.1975μs 14.6633 KOps/s 14.7897 KOps/s $\color{#d91a1a}-0.85\%$
test_compile_indexing[slice-tensordict-compile] 0.1469ms 99.4305μs 10.0573 KOps/s 9.7389 KOps/s $\color{#35bf28}+3.27\%$
test_compile_indexing[slice-tensordict-eager] 0.1438ms 16.4721μs 60.7086 KOps/s 56.9352 KOps/s $\textbf{\color{#35bf28}+6.63\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1460ms 94.3945μs 10.5938 KOps/s 10.3476 KOps/s $\color{#35bf28}+2.38\%$
test_compile_indexing[slice-tensorclass-eager] 40.9110μs 15.6089μs 64.0661 KOps/s 62.7979 KOps/s $\color{#35bf28}+2.02\%$
test_compile_indexing[slice-pytree-compile] 0.1393ms 94.9804μs 10.5285 KOps/s 10.3238 KOps/s $\color{#35bf28}+1.98\%$
test_compile_indexing[slice-pytree-eager] 43.1310μs 15.4736μs 64.6263 KOps/s 63.2458 KOps/s $\color{#35bf28}+2.18\%$
test_compile_indexing[int-tensordict-compile] 0.1424ms 99.2176μs 10.0789 KOps/s 9.8373 KOps/s $\color{#35bf28}+2.46\%$
test_compile_indexing[int-tensordict-eager] 0.5663ms 16.3640μs 61.1098 KOps/s 58.0525 KOps/s $\textbf{\color{#35bf28}+5.27\%}$
test_compile_indexing[int-tensorclass-compile] 0.1353ms 94.8169μs 10.5466 KOps/s 10.3179 KOps/s $\color{#35bf28}+2.22\%$
test_compile_indexing[int-tensorclass-eager] 47.0610μs 15.3868μs 64.9908 KOps/s 63.1010 KOps/s $\color{#35bf28}+2.99\%$
test_compile_indexing[int-pytree-compile] 0.1412ms 94.6627μs 10.5638 KOps/s 10.2971 KOps/s $\color{#35bf28}+2.59\%$
test_compile_indexing[int-pytree-eager] 0.1558ms 15.7878μs 63.3401 KOps/s 63.4452 KOps/s $\color{#d91a1a}-0.17\%$
test_mod_add[eager] 67.5610μs 30.8196μs 32.4469 KOps/s 33.2078 KOps/s $\color{#d91a1a}-2.29\%$
test_mod_add[compile] 0.3869ms 75.9105μs 13.1734 KOps/s 13.0511 KOps/s $\color{#35bf28}+0.94\%$
test_mod_add[compile-overhead] 0.3108ms 0.1636ms 6.1133 KOps/s 5.7644 KOps/s $\textbf{\color{#35bf28}+6.05\%}$
test_mod_wrap[eager] 0.3207ms 0.2501ms 3.9977 KOps/s 3.8689 KOps/s $\color{#35bf28}+3.33\%$
test_mod_wrap[compile] 1.5730ms 0.2769ms 3.6118 KOps/s 3.5242 KOps/s $\color{#35bf28}+2.49\%$
test_mod_wrap[compile-overhead] 7.2769ms 3.8053ms 262.7931 Ops/s 261.7007 Ops/s $\color{#35bf28}+0.42\%$
test_mod_wrap_and_backward[eager] 1.8077ms 1.4177ms 705.3699 Ops/s 691.0061 Ops/s $\color{#35bf28}+2.08\%$
test_mod_wrap_and_backward[compile] 1.5020ms 1.3409ms 745.7586 Ops/s 723.1784 Ops/s $\color{#35bf28}+3.12\%$
test_mod_wrap_and_backward[compile-overhead] 1.4974ms 1.0192ms 981.1355 Ops/s 970.1675 Ops/s $\color{#35bf28}+1.13\%$
test_seq_add[eager] 0.1398ms 94.1734μs 10.6187 KOps/s 10.3616 KOps/s $\color{#35bf28}+2.48\%$
test_seq_add[compile] 0.1834ms 85.8046μs 11.6544 KOps/s 11.6049 KOps/s $\color{#35bf28}+0.43\%$
test_seq_add[compile-overhead] 0.1694ms 0.1276ms 7.8342 KOps/s 7.8193 KOps/s $\color{#35bf28}+0.19\%$
test_seq_wrap[eager] 0.4639ms 0.3851ms 2.5968 KOps/s 2.6549 KOps/s $\color{#d91a1a}-2.19\%$
test_seq_wrap[compile] 0.3669ms 0.2914ms 3.4313 KOps/s 3.3202 KOps/s $\color{#35bf28}+3.35\%$
test_seq_wrap[compile-overhead] 0.2696ms 0.2215ms 4.5153 KOps/s 4.4835 KOps/s $\color{#35bf28}+0.71\%$
test_func_call_runtime[False-eager] 0.7831ms 0.7209ms 1.3872 KOps/s 1.3350 KOps/s $\color{#35bf28}+3.91\%$
test_func_call_runtime[False-compile] 0.7725ms 0.7230ms 1.3830 KOps/s 1.3007 KOps/s $\textbf{\color{#35bf28}+6.33\%}$
test_func_call_runtime[False-compile-overhead] 0.4097ms 0.3587ms 2.7876 KOps/s 2.7943 KOps/s $\color{#d91a1a}-0.24\%$
test_func_call_runtime[True-eager] 0.9699ms 0.8863ms 1.1283 KOps/s 1.1049 KOps/s $\color{#35bf28}+2.12\%$
test_func_call_runtime[True-compile] 0.7953ms 0.7461ms 1.3403 KOps/s 1.3065 KOps/s $\color{#35bf28}+2.59\%$
test_func_call_runtime[True-compile-overhead] 0.4296ms 0.3806ms 2.6272 KOps/s 2.6164 KOps/s $\color{#35bf28}+0.41\%$
test_func_call_cm_runtime[False-eager] 0.8634ms 0.7178ms 1.3932 KOps/s 1.2515 KOps/s $\textbf{\color{#35bf28}+11.33\%}$
test_func_call_cm_runtime[False-compile] 0.8217ms 0.7308ms 1.3683 KOps/s 1.3407 KOps/s $\color{#35bf28}+2.06\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4209ms 0.3604ms 2.7746 KOps/s 2.7418 KOps/s $\color{#35bf28}+1.20\%$
test_func_call_cm_runtime[True-eager] 1.0621ms 0.9792ms 1.0212 KOps/s 959.0361 Ops/s $\textbf{\color{#35bf28}+6.48\%}$
test_func_call_cm_runtime[True-compile] 0.8293ms 0.7784ms 1.2847 KOps/s 1.2587 KOps/s $\color{#35bf28}+2.07\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4694ms 0.4072ms 2.4557 KOps/s 2.4217 KOps/s $\color{#35bf28}+1.40\%$
test_vmap_func_call_cm_runtime[eager] 2.5412ms 2.0565ms 486.2523 Ops/s 475.4813 Ops/s $\color{#35bf28}+2.27\%$
test_vmap_func_call_cm_runtime[compile] 0.8772ms 0.7976ms 1.2538 KOps/s 1.2300 KOps/s $\color{#35bf28}+1.93\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4582ms 0.4097ms 2.4406 KOps/s 2.4337 KOps/s $\color{#35bf28}+0.29\%$
test_distributed 1.7201ms 0.2028ms 4.9319 KOps/s 8.7220 KOps/s $\textbf{\color{#d91a1a}-43.45\%}$
test_tdmodule 44.5600μs 13.8620μs 72.1397 KOps/s 75.2266 KOps/s $\color{#d91a1a}-4.10\%$
test_tdmodule_dispatch 46.9710μs 27.2872μs 36.6472 KOps/s 39.1769 KOps/s $\textbf{\color{#d91a1a}-6.46\%}$
test_tdseq 34.8210μs 15.2230μs 65.6902 KOps/s 69.4380 KOps/s $\textbf{\color{#d91a1a}-5.40\%}$
test_tdseq_dispatch 51.9010μs 30.3052μs 32.9976 KOps/s 34.6197 KOps/s $\color{#d91a1a}-4.69\%$
test_instantiation_functorch 1.6432ms 1.5092ms 662.6124 Ops/s 640.8817 Ops/s $\color{#35bf28}+3.39\%$
test_exec_functorch 0.1746ms 0.1398ms 7.1506 KOps/s 6.7815 KOps/s $\textbf{\color{#35bf28}+5.44\%}$
test_exec_functional_call 0.1683ms 0.1331ms 7.5154 KOps/s 7.0908 KOps/s $\textbf{\color{#35bf28}+5.99\%}$
test_exec_td_decorator 0.3608ms 0.1789ms 5.5902 KOps/s 5.3638 KOps/s $\color{#35bf28}+4.22\%$
test_vmap_mlp_speed_decorator[True-True] 0.7807ms 0.6678ms 1.4976 KOps/s 1.4619 KOps/s $\color{#35bf28}+2.44\%$
test_vmap_mlp_speed_decorator[True-False] 0.8589ms 0.6812ms 1.4680 KOps/s 1.4624 KOps/s $\color{#35bf28}+0.38\%$
test_vmap_mlp_speed_decorator[False-True] 0.7073ms 0.5855ms 1.7079 KOps/s 1.6553 KOps/s $\color{#35bf28}+3.18\%$
test_vmap_mlp_speed_decorator[False-False] 0.7190ms 0.5996ms 1.6679 KOps/s 1.6479 KOps/s $\color{#35bf28}+1.21\%$
test_vmap_transformer_speed_decorator[True-True] 18.9815ms 18.9114ms 52.8782 Ops/s 52.1106 Ops/s $\color{#35bf28}+1.47\%$
test_vmap_transformer_speed_decorator[True-False] 19.1376ms 18.9925ms 52.6523 Ops/s 52.0261 Ops/s $\color{#35bf28}+1.20\%$
test_vmap_transformer_speed_decorator[False-True] 18.8820ms 18.8333ms 53.0974 Ops/s 52.5011 Ops/s $\color{#35bf28}+1.14\%$
test_vmap_transformer_speed_decorator[False-False] 18.9062ms 18.8224ms 53.1281 Ops/s 52.4551 Ops/s $\color{#35bf28}+1.28\%$
test_to_module_speed[True] 1.0431ms 0.9299ms 1.0753 KOps/s 1.0634 KOps/s $\color{#35bf28}+1.13\%$
test_to_module_speed[False] 1.3826ms 0.9128ms 1.0956 KOps/s 1.0750 KOps/s $\color{#35bf28}+1.91\%$
test_tc_init 74.4320μs 35.2166μs 28.3957 KOps/s 30.2155 KOps/s $\textbf{\color{#d91a1a}-6.02\%}$
test_tc_init_nested 0.1080ms 67.9034μs 14.7268 KOps/s 15.0906 KOps/s $\color{#d91a1a}-2.41\%$
test_tc_first_layer_tensor 5.0759μs 0.6993μs 1.4299 MOps/s 1.4026 MOps/s $\color{#35bf28}+1.95\%$
test_tc_first_layer_nontensor 32.8310μs 2.3425μs 426.8929 KOps/s 426.6787 KOps/s $\color{#35bf28}+0.05\%$
test_tc_second_layer_tensor 8.1953μs 1.4049μs 711.7858 KOps/s 699.8819 KOps/s $\color{#35bf28}+1.70\%$
test_tc_second_layer_nontensor 26.5610μs 3.0305μs 329.9742 KOps/s 326.7604 KOps/s $\color{#35bf28}+0.98\%$
test_unbind 0.2233s 9.8794ms 101.2209 Ops/s 151.7244 Ops/s $\textbf{\color{#d91a1a}-33.29\%}$
test_full_like 11.2402ms 9.1188ms 109.6632 Ops/s 107.6621 Ops/s $\color{#35bf28}+1.86\%$
test_zeros_like 4.9071ms 4.3142ms 231.7926 Ops/s 137.0900 Ops/s $\textbf{\color{#35bf28}+69.08\%}$
test_ones_like 4.8515ms 4.3145ms 231.7740 Ops/s 231.9640 Ops/s $\color{#d91a1a}-0.08\%$
test_clone 6.4869ms 6.3253ms 158.0964 Ops/s 158.4570 Ops/s $\color{#d91a1a}-0.23\%$
test_squeeze 57.7610μs 9.2803μs 107.7548 KOps/s 109.8987 KOps/s $\color{#d91a1a}-1.95\%$
test_unsqueeze 0.1215ms 69.7481μs 14.3373 KOps/s 14.0978 KOps/s $\color{#35bf28}+1.70\%$
test_split 0.3890ms 0.1593ms 6.2772 KOps/s 6.3662 KOps/s $\color{#d91a1a}-1.40\%$
test_permute 0.2368ms 0.1821ms 5.4920 KOps/s 5.5774 KOps/s $\color{#d91a1a}-1.53\%$
test_stack 53.4491ms 50.9235ms 19.6373 Ops/s 19.8729 Ops/s $\color{#d91a1a}-1.19\%$
test_cat 50.6291ms 50.3972ms 19.8424 Ops/s 20.1854 Ops/s $\color{#d91a1a}-1.70\%$

vmoens added a commit that referenced this pull request Nov 14, 2024
ghstack-source-id: c91a7a6be513fb46be6914df0b3bde779fa5528f
Pull Request resolved: #1087

(cherry picked from commit 79a3345)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants