Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Refactor] Use IntEnum for interaction types #989

Merged
merged 1 commit into from
Sep 13, 2024
Merged

[Refactor] Use IntEnum for interaction types #989

merged 1 commit into from
Sep 13, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Sep 12, 2024

Description

Describe your changes in detail.

Motivation and Context

Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax close #15213 if this solves the issue #15213

  • I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds core functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (update in the documentation)
  • Example (update in the folder of examples)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

  • I have read the CONTRIBUTION guide (required)
  • My change requires a change to the documentation.
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have updated the documentation accordingly.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 12, 2024
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}32$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 46.0970μs 19.4204μs 51.4923 KOps/s 47.3198 KOps/s $\textbf{\color{#35bf28}+8.82\%}$
test_plain_set_stack_nested 50.2040μs 19.8492μs 50.3798 KOps/s 46.4635 KOps/s $\textbf{\color{#35bf28}+8.43\%}$
test_plain_set_nested_inplace 53.7710μs 21.2595μs 47.0378 KOps/s 44.0342 KOps/s $\textbf{\color{#35bf28}+6.82\%}$
test_plain_set_stack_nested_inplace 56.2760μs 21.3210μs 46.9020 KOps/s 43.8728 KOps/s $\textbf{\color{#35bf28}+6.90\%}$
test_items 29.2740μs 4.1798μs 239.2460 KOps/s 249.5432 KOps/s $\color{#d91a1a}-4.13\%$
test_items_nested 0.6360ms 0.3299ms 3.0316 KOps/s 3.0422 KOps/s $\color{#d91a1a}-0.35\%$
test_items_nested_locked 0.4972ms 0.3298ms 3.0322 KOps/s 3.0513 KOps/s $\color{#d91a1a}-0.63\%$
test_items_nested_leaf 0.1615ms 86.2459μs 11.5948 KOps/s 11.8367 KOps/s $\color{#d91a1a}-2.04\%$
test_items_stack_nested 0.4054ms 0.3321ms 3.0109 KOps/s 3.0238 KOps/s $\color{#d91a1a}-0.42\%$
test_items_stack_nested_leaf 0.1329ms 86.7124μs 11.5324 KOps/s 11.9625 KOps/s $\color{#d91a1a}-3.60\%$
test_items_stack_nested_locked 0.6124ms 0.3298ms 3.0323 KOps/s 3.0231 KOps/s $\color{#35bf28}+0.30\%$
test_keys 30.6080μs 3.6129μs 276.7856 KOps/s 282.5521 KOps/s $\color{#d91a1a}-2.04\%$
test_keys_nested 0.1639ms 97.0415μs 10.3049 KOps/s 10.4422 KOps/s $\color{#d91a1a}-1.32\%$
test_keys_nested_locked 1.5612ms 0.1029ms 9.7228 KOps/s 9.9328 KOps/s $\color{#d91a1a}-2.11\%$
test_keys_nested_leaf 0.1385ms 82.7494μs 12.0847 KOps/s 12.3829 KOps/s $\color{#d91a1a}-2.41\%$
test_keys_stack_nested 0.1647ms 98.7073μs 10.1310 KOps/s 10.3860 KOps/s $\color{#d91a1a}-2.46\%$
test_keys_stack_nested_leaf 0.1545ms 83.5396μs 11.9704 KOps/s 12.3882 KOps/s $\color{#d91a1a}-3.37\%$
test_keys_stack_nested_locked 0.1692ms 0.1026ms 9.7464 KOps/s 9.8885 KOps/s $\color{#d91a1a}-1.44\%$
test_values 6.0334μs 1.0736μs 931.4412 KOps/s 918.6619 KOps/s $\color{#35bf28}+1.39\%$
test_values_nested 94.1060μs 48.3899μs 20.6655 KOps/s 20.7737 KOps/s $\color{#d91a1a}-0.52\%$
test_values_nested_locked 0.1050ms 48.1448μs 20.7707 KOps/s 20.6774 KOps/s $\color{#35bf28}+0.45\%$
test_values_nested_leaf 0.1076ms 42.6908μs 23.4243 KOps/s 23.4539 KOps/s $\color{#d91a1a}-0.13\%$
test_values_stack_nested 89.6680μs 48.4066μs 20.6583 KOps/s 20.7047 KOps/s $\color{#d91a1a}-0.22\%$
test_values_stack_nested_leaf 77.2140μs 42.4738μs 23.5439 KOps/s 23.8347 KOps/s $\color{#d91a1a}-1.22\%$
test_values_stack_nested_locked 94.9170μs 48.5480μs 20.5982 KOps/s 20.2248 KOps/s $\color{#35bf28}+1.85\%$
test_membership 29.4750μs 0.8525μs 1.1731 MOps/s 1.1950 MOps/s $\color{#d91a1a}-1.83\%$
test_membership_nested 28.4130μs 2.5901μs 386.0810 KOps/s 389.2657 KOps/s $\color{#d91a1a}-0.82\%$
test_membership_nested_leaf 28.9140μs 2.5831μs 387.1336 KOps/s 387.4860 KOps/s $\color{#d91a1a}-0.09\%$
test_membership_stacked_nested 73.7650μs 2.5524μs 391.7883 KOps/s 393.3325 KOps/s $\color{#d91a1a}-0.39\%$
test_membership_stacked_nested_leaf 39.8840μs 2.5942μs 385.4756 KOps/s 389.8656 KOps/s $\color{#d91a1a}-1.13\%$
test_membership_nested_last 27.5120μs 3.7751μs 264.8967 KOps/s 265.2980 KOps/s $\color{#d91a1a}-0.15\%$
test_membership_nested_leaf_last 35.2360μs 3.7810μs 264.4787 KOps/s 260.9538 KOps/s $\color{#35bf28}+1.35\%$
test_membership_stacked_nested_last 25.4770μs 3.7925μs 263.6753 KOps/s 171.8946 KOps/s $\textbf{\color{#35bf28}+53.39\%}$
test_membership_stacked_nested_leaf_last 30.2770μs 3.7645μs 265.6370 KOps/s 170.7513 KOps/s $\textbf{\color{#35bf28}+55.57\%}$
test_nested_getleaf 44.6330μs 10.8575μs 92.1025 KOps/s 94.4631 KOps/s $\color{#d91a1a}-2.50\%$
test_nested_get 38.0010μs 10.1514μs 98.5083 KOps/s 99.4281 KOps/s $\color{#d91a1a}-0.93\%$
test_stacked_getleaf 30.8980μs 10.7272μs 93.2209 KOps/s 95.0193 KOps/s $\color{#d91a1a}-1.89\%$
test_stacked_get 75.4340μs 9.7304μs 102.7708 KOps/s 100.3640 KOps/s $\color{#35bf28}+2.40\%$
test_nested_getitemleaf 34.4140μs 10.9673μs 91.1805 KOps/s 90.9330 KOps/s $\color{#35bf28}+0.27\%$
test_nested_getitem 55.9840μs 9.9088μs 100.9201 KOps/s 98.3549 KOps/s $\color{#35bf28}+2.61\%$
test_stacked_getitemleaf 38.5520μs 10.9052μs 91.6995 KOps/s 92.6345 KOps/s $\color{#d91a1a}-1.01\%$
test_stacked_getitem 33.5430μs 10.2475μs 97.5850 KOps/s 98.1807 KOps/s $\color{#d91a1a}-0.61\%$
test_lock_nested 0.1022s 0.6245ms 1.6012 KOps/s 2.0610 KOps/s $\textbf{\color{#d91a1a}-22.31\%}$
test_lock_stack_nested 0.5876ms 0.4846ms 2.0634 KOps/s 2.2079 KOps/s $\textbf{\color{#d91a1a}-6.54\%}$
test_unlock_nested 0.1009s 0.5477ms 1.8258 KOps/s 2.4220 KOps/s $\textbf{\color{#d91a1a}-24.62\%}$
test_unlock_stack_nested 0.5169ms 0.3996ms 2.5023 KOps/s 2.6807 KOps/s $\textbf{\color{#d91a1a}-6.66\%}$
test_flatten_speed 0.2559ms 0.1064ms 9.4001 KOps/s 9.5609 KOps/s $\color{#d91a1a}-1.68\%$
test_unflatten_speed 0.8589ms 0.4736ms 2.1115 KOps/s 2.1232 KOps/s $\color{#d91a1a}-0.55\%$
test_common_ops 4.2851ms 1.1205ms 892.4485 Ops/s 858.4087 Ops/s $\color{#35bf28}+3.97\%$
test_creation 20.3880μs 2.0832μs 480.0344 KOps/s 471.2161 KOps/s $\color{#35bf28}+1.87\%$
test_creation_empty 52.8380μs 16.2868μs 61.3993 KOps/s 52.6857 KOps/s $\textbf{\color{#35bf28}+16.54\%}$
test_creation_nested_1 62.2270μs 19.6462μs 50.9003 KOps/s 45.4853 KOps/s $\textbf{\color{#35bf28}+11.91\%}$
test_creation_nested_2 64.4500μs 23.6482μs 42.2865 KOps/s 37.4480 KOps/s $\textbf{\color{#35bf28}+12.92\%}$
test_clone 0.2030ms 17.6449μs 56.6737 KOps/s 57.5838 KOps/s $\color{#d91a1a}-1.58\%$
test_getitem[int] 1.2835ms 17.7867μs 56.2219 KOps/s 58.0418 KOps/s $\color{#d91a1a}-3.14\%$
test_getitem[slice_int] 0.1707ms 31.9889μs 31.2609 KOps/s 32.3412 KOps/s $\color{#d91a1a}-3.34\%$
test_getitem[range] 0.5035ms 61.1561μs 16.3516 KOps/s 16.9198 KOps/s $\color{#d91a1a}-3.36\%$
test_getitem[tuple] 0.1677ms 26.4541μs 37.8013 KOps/s 39.3879 KOps/s $\color{#d91a1a}-4.03\%$
test_getitem[list] 0.4397ms 56.5360μs 17.6878 KOps/s 18.2523 KOps/s $\color{#d91a1a}-3.09\%$
test_setitem_dim[int] 85.8410μs 33.2104μs 30.1111 KOps/s 29.4785 KOps/s $\color{#35bf28}+2.15\%$
test_setitem_dim[slice_int] 0.1145ms 62.4887μs 16.0029 KOps/s 15.9528 KOps/s $\color{#35bf28}+0.31\%$
test_setitem_dim[range] 0.1334ms 85.9325μs 11.6370 KOps/s 11.9443 KOps/s $\color{#d91a1a}-2.57\%$
test_setitem_dim[tuple] 97.1610μs 49.5528μs 20.1805 KOps/s 20.1807 KOps/s $-0.00\%$
test_setitem 0.2585ms 29.0440μs 34.4306 KOps/s 32.1902 KOps/s $\textbf{\color{#35bf28}+6.96\%}$
test_set 0.1748ms 28.2571μs 35.3893 KOps/s 32.8568 KOps/s $\textbf{\color{#35bf28}+7.71\%}$
test_set_shared 2.8256ms 0.2223ms 4.4985 KOps/s 4.4859 KOps/s $\color{#35bf28}+0.28\%$
test_update 0.2202ms 35.1762μs 28.4283 KOps/s 25.8553 KOps/s $\textbf{\color{#35bf28}+9.95\%}$
test_update_nested 0.1881ms 45.2124μs 22.1178 KOps/s 20.4580 KOps/s $\textbf{\color{#35bf28}+8.11\%}$
test_update__nested 0.1921ms 34.8523μs 28.6925 KOps/s 29.0685 KOps/s $\color{#d91a1a}-1.29\%$
test_set_nested 0.1748ms 31.1238μs 32.1297 KOps/s 30.0923 KOps/s $\textbf{\color{#35bf28}+6.77\%}$
test_set_nested_new 0.2002ms 36.8937μs 27.1049 KOps/s 26.1973 KOps/s $\color{#35bf28}+3.46\%$
test_select 0.2164ms 55.2571μs 18.0972 KOps/s 18.1195 KOps/s $\color{#d91a1a}-0.12\%$
test_select_nested 1.0356ms 65.0259μs 15.3785 KOps/s 16.4418 KOps/s $\textbf{\color{#d91a1a}-6.47\%}$
test_exclude_nested 0.1707ms 81.6996μs 12.2400 KOps/s 13.0098 KOps/s $\textbf{\color{#d91a1a}-5.92\%}$
test_empty[True] 0.5012ms 0.3225ms 3.1012 KOps/s 3.1422 KOps/s $\color{#d91a1a}-1.31\%$
test_empty[False] 9.7683μs 1.4597μs 685.0551 KOps/s 782.8172 KOps/s $\textbf{\color{#d91a1a}-12.49\%}$
test_unbind_speed 0.4248ms 0.3213ms 3.1122 KOps/s 3.3076 KOps/s $\textbf{\color{#d91a1a}-5.91\%}$
test_unbind_speed_stack0 0.5120ms 0.3169ms 3.1555 KOps/s 3.4436 KOps/s $\textbf{\color{#d91a1a}-8.37\%}$
test_unbind_speed_stack1 0.1101s 0.8925ms 1.1204 KOps/s 1.3631 KOps/s $\textbf{\color{#d91a1a}-17.80\%}$
test_split 0.1008s 2.2941ms 435.8948 Ops/s 446.1048 Ops/s $\color{#d91a1a}-2.29\%$
test_chunk 3.1965ms 2.0830ms 480.0665 Ops/s 447.1463 Ops/s $\textbf{\color{#35bf28}+7.36\%}$
test_creation[device0] 0.2972ms 0.1202ms 8.3186 KOps/s 8.4274 KOps/s $\color{#d91a1a}-1.29\%$
test_creation_from_tensor 4.1505ms 0.1218ms 8.2070 KOps/s 8.3339 KOps/s $\color{#d91a1a}-1.52\%$
test_add_one[memmap_tensor0] 0.6170ms 7.4213μs 134.7467 KOps/s 136.5835 KOps/s $\color{#d91a1a}-1.34\%$
test_contiguous[memmap_tensor0] 36.9090μs 1.9601μs 510.1688 KOps/s 544.1024 KOps/s $\textbf{\color{#d91a1a}-6.24\%}$
test_stack[memmap_tensor0] 0.1124ms 5.8791μs 170.0932 KOps/s 178.1235 KOps/s $\color{#d91a1a}-4.51\%$
test_memmaptd_index 1.2513ms 0.4118ms 2.4282 KOps/s 2.5080 KOps/s $\color{#d91a1a}-3.18\%$
test_memmaptd_index_astensor 0.9188ms 0.4799ms 2.0838 KOps/s 2.0846 KOps/s $\color{#d91a1a}-0.04\%$
test_memmaptd_index_op 1.9467ms 1.0005ms 999.5424 Ops/s 949.7870 Ops/s $\textbf{\color{#35bf28}+5.24\%}$
test_serialize_model 0.2317s 0.1434s 6.9729 Ops/s 8.1698 Ops/s $\textbf{\color{#d91a1a}-14.65\%}$
test_serialize_model_pickle 0.4691s 0.4003s 2.4978 Ops/s 2.5043 Ops/s $\color{#d91a1a}-0.26\%$
test_serialize_weights 0.1340s 0.1251s 7.9913 Ops/s 7.3135 Ops/s $\textbf{\color{#35bf28}+9.27\%}$
test_serialize_weights_returnearly 0.1880s 0.1652s 6.0531 Ops/s 6.3454 Ops/s $\color{#d91a1a}-4.61\%$
test_serialize_weights_pickle 0.5957s 0.4629s 2.1603 Ops/s 1.1403 Ops/s $\textbf{\color{#35bf28}+89.45\%}$
test_serialize_weights_filesystem 0.2539s 0.1620s 6.1730 Ops/s 6.7385 Ops/s $\textbf{\color{#d91a1a}-8.39\%}$
test_serialize_model_filesystem 0.1696s 0.1529s 6.5383 Ops/s 6.0466 Ops/s $\textbf{\color{#35bf28}+8.13\%}$
test_reshape_pytree 84.8280μs 38.9291μs 25.6877 KOps/s 24.9752 KOps/s $\color{#35bf28}+2.85\%$
test_reshape_td 96.5710μs 46.8593μs 21.3405 KOps/s 21.5404 KOps/s $\color{#d91a1a}-0.93\%$
test_view_pytree 83.5970μs 39.0032μs 25.6389 KOps/s 25.5325 KOps/s $\color{#35bf28}+0.42\%$
test_view_td 0.1068ms 52.3498μs 19.1023 KOps/s 19.0209 KOps/s $\color{#35bf28}+0.43\%$
test_unbind_pytree 84.3880μs 36.7492μs 27.2115 KOps/s 27.8865 KOps/s $\color{#d91a1a}-2.42\%$
test_unbind_td 0.3662ms 47.3110μs 21.1368 KOps/s 22.0461 KOps/s $\color{#d91a1a}-4.12\%$
test_split_pytree 79.2880μs 38.3531μs 26.0735 KOps/s 26.5109 KOps/s $\color{#d91a1a}-1.65\%$
test_split_td 0.2364ms 59.2781μs 16.8696 KOps/s 17.3949 KOps/s $\color{#d91a1a}-3.02\%$
test_add_pytree 0.1112ms 44.3078μs 22.5694 KOps/s 22.0341 KOps/s $\color{#35bf28}+2.43\%$
test_add_td 0.1718ms 79.3215μs 12.6069 KOps/s 11.5134 KOps/s $\textbf{\color{#35bf28}+9.50\%}$
test_compile_add_one_nested[tensordict-compile] 0.1457ms 56.5032μs 17.6981 KOps/s 17.6139 KOps/s $\color{#35bf28}+0.48\%$
test_compile_add_one_nested[tensordict-eager] 1.1739ms 0.1885ms 5.3062 KOps/s 5.2679 KOps/s $\color{#35bf28}+0.73\%$
test_compile_add_one_nested[pytree-compile] 0.1231ms 56.5601μs 17.6803 KOps/s 17.2043 KOps/s $\color{#35bf28}+2.77\%$
test_compile_add_one_nested[pytree-eager] 0.2444ms 0.1382ms 7.2349 KOps/s 7.0400 KOps/s $\color{#35bf28}+2.77\%$
test_compile_copy_nested[tensordict-compile] 0.1019ms 20.5185μs 48.7366 KOps/s 48.8185 KOps/s $\color{#d91a1a}-0.17\%$
test_compile_copy_nested[tensordict-eager] 0.1587ms 68.3823μs 14.6237 KOps/s 14.9971 KOps/s $\color{#d91a1a}-2.49\%$
test_compile_copy_nested[pytree-compile] 0.1620ms 75.5576μs 13.2349 KOps/s 12.9760 KOps/s $\color{#35bf28}+2.00\%$
test_compile_copy_nested[pytree-eager] 0.1318ms 68.2018μs 14.6624 KOps/s 14.2922 KOps/s $\color{#35bf28}+2.59\%$
test_compile_add_one_flat[tensordict-compile] 0.2994ms 0.1729ms 5.7840 KOps/s 5.7644 KOps/s $\color{#35bf28}+0.34\%$
test_compile_add_one_flat[tensordict-eager] 0.3584ms 0.1886ms 5.3019 KOps/s 5.2492 KOps/s $\color{#35bf28}+1.00\%$
test_compile_add_one_flat[tensorclass-compile] 0.1194ms 46.7883μs 21.3729 KOps/s 20.4297 KOps/s $\color{#35bf28}+4.62\%$
test_compile_add_one_flat[tensorclass-eager] 0.6234ms 71.2774μs 14.0297 KOps/s 13.7869 KOps/s $\color{#35bf28}+1.76\%$
test_compile_add_one_flat[pytree-compile] 0.3583ms 0.1795ms 5.5703 KOps/s 5.6601 KOps/s $\color{#d91a1a}-1.59\%$
test_compile_add_one_flat[pytree-eager] 0.6081ms 0.2905ms 3.4421 KOps/s 3.4881 KOps/s $\color{#d91a1a}-1.32\%$
test_compile_add_self_flat[tensordict-eager] 0.4112ms 0.2018ms 4.9561 KOps/s 4.9500 KOps/s $\color{#35bf28}+0.12\%$
test_compile_add_self_flat[tensordict-compile] 0.3280ms 0.1783ms 5.6091 KOps/s 5.6746 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_add_self_flat[tensorclass-eager] 0.1874ms 63.4790μs 15.7532 KOps/s 15.8221 KOps/s $\color{#d91a1a}-0.44\%$
test_compile_add_self_flat[tensorclass-compile] 0.1043ms 47.7870μs 20.9262 KOps/s 20.3093 KOps/s $\color{#35bf28}+3.04\%$
test_compile_add_self_flat[pytree-eager] 0.4655ms 0.2343ms 4.2679 KOps/s 4.3148 KOps/s $\color{#d91a1a}-1.09\%$
test_compile_add_self_flat[pytree-compile] 0.3575ms 0.1762ms 5.6750 KOps/s 5.6897 KOps/s $\color{#d91a1a}-0.26\%$
test_compile_copy_flat[tensordict-compile] 0.2312ms 0.1013ms 9.8733 KOps/s 9.7232 KOps/s $\color{#35bf28}+1.54\%$
test_compile_copy_flat[tensordict-eager] 0.1583ms 57.4596μs 17.4035 KOps/s 17.4919 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_copy_flat[pytree-compile] 0.1642ms 75.9716μs 13.1628 KOps/s 13.0742 KOps/s $\color{#35bf28}+0.68\%$
test_compile_copy_flat[pytree-eager] 0.1418ms 69.1928μs 14.4524 KOps/s 14.6049 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_assign_and_add[tensordict-compile] 0.3151ms 0.2002ms 4.9958 KOps/s 5.0846 KOps/s $\color{#d91a1a}-1.75\%$
test_compile_assign_and_add[tensordict-eager] 2.5583ms 1.6665ms 600.0442 Ops/s 595.3344 Ops/s $\color{#35bf28}+0.79\%$
test_compile_assign_and_add[pytree-compile] 0.2786ms 0.1937ms 5.1620 KOps/s 5.2021 KOps/s $\color{#d91a1a}-0.77\%$
test_compile_assign_and_add[pytree-eager] 1.2179ms 1.1059ms 904.2245 Ops/s 919.9146 Ops/s $\color{#d91a1a}-1.71\%$
test_compile_assign_and_add_stack[compile] 0.7844ms 0.4297ms 2.3272 KOps/s 2.3552 KOps/s $\color{#d91a1a}-1.19\%$
test_compile_assign_and_add_stack[eager] 4.1664ms 3.8471ms 259.9332 Ops/s 255.5375 Ops/s $\color{#35bf28}+1.72\%$
test_compile_indexing[tensor-tensordict-compile] 83.2560μs 34.6854μs 28.8305 KOps/s 27.5292 KOps/s $\color{#35bf28}+4.73\%$
test_compile_indexing[tensor-tensordict-eager] 0.6291ms 48.5016μs 20.6179 KOps/s 20.2995 KOps/s $\color{#35bf28}+1.57\%$
test_compile_indexing[tensor-tensorclass-compile] 77.8560μs 29.5615μs 33.8278 KOps/s 32.4175 KOps/s $\color{#35bf28}+4.35\%$
test_compile_indexing[tensor-tensorclass-eager] 2.2043ms 28.8061μs 34.7148 KOps/s 34.3189 KOps/s $\color{#35bf28}+1.15\%$
test_compile_indexing[tensor-pytree-compile] 91.5110μs 29.8286μs 33.5249 KOps/s 32.3089 KOps/s $\color{#35bf28}+3.76\%$
test_compile_indexing[tensor-pytree-eager] 89.4370μs 28.6057μs 34.9581 KOps/s 34.7656 KOps/s $\color{#35bf28}+0.55\%$
test_compile_indexing[slice-tensordict-compile] 0.1567ms 75.0317μs 13.3277 KOps/s 12.9612 KOps/s $\color{#35bf28}+2.83\%$
test_compile_indexing[slice-tensordict-eager] 0.6272ms 28.0561μs 35.6428 KOps/s 35.7707 KOps/s $\color{#d91a1a}-0.36\%$
test_compile_indexing[slice-tensorclass-compile] 0.1435ms 70.5019μs 14.1840 KOps/s 14.2219 KOps/s $\color{#d91a1a}-0.27\%$
test_compile_indexing[slice-tensorclass-eager] 87.3330μs 23.4245μs 42.6903 KOps/s 43.3057 KOps/s $\color{#d91a1a}-1.42\%$
test_compile_indexing[slice-pytree-compile] 0.1562ms 69.9615μs 14.2936 KOps/s 14.3181 KOps/s $\color{#d91a1a}-0.17\%$
test_compile_indexing[slice-pytree-eager] 83.2860μs 23.5536μs 42.4564 KOps/s 43.0885 KOps/s $\color{#d91a1a}-1.47\%$
test_compile_indexing[int-tensordict-compile] 0.1492ms 75.1785μs 13.3017 KOps/s 13.2513 KOps/s $\color{#35bf28}+0.38\%$
test_compile_indexing[int-tensordict-eager] 0.9876ms 27.8992μs 35.8433 KOps/s 36.3125 KOps/s $\color{#d91a1a}-1.29\%$
test_compile_indexing[int-tensorclass-compile] 0.1497ms 70.3680μs 14.2110 KOps/s 14.2965 KOps/s $\color{#d91a1a}-0.60\%$
test_compile_indexing[int-tensorclass-eager] 65.3320μs 23.3691μs 42.7915 KOps/s 43.1237 KOps/s $\color{#d91a1a}-0.77\%$
test_compile_indexing[int-pytree-compile] 0.1585ms 68.9895μs 14.4950 KOps/s 14.3829 KOps/s $\color{#35bf28}+0.78\%$
test_compile_indexing[int-pytree-eager] 72.9870μs 23.2831μs 42.9496 KOps/s 43.3056 KOps/s $\color{#d91a1a}-0.82\%$
test_mod_add[eager] 0.1094ms 22.9307μs 43.6096 KOps/s 40.4862 KOps/s $\textbf{\color{#35bf28}+7.71\%}$
test_mod_add[compile] 0.1020ms 40.6868μs 24.5780 KOps/s 25.4296 KOps/s $\color{#d91a1a}-3.35\%$
test_mod_add[compile-overhead] 0.1175ms 38.5835μs 25.9178 KOps/s 25.2645 KOps/s $\color{#35bf28}+2.59\%$
test_mod_wrap[eager] 0.3865ms 0.2162ms 4.6244 KOps/s 4.6592 KOps/s $\color{#d91a1a}-0.75\%$
test_mod_wrap[compile] 0.3844ms 0.2344ms 4.2655 KOps/s 4.1793 KOps/s $\color{#35bf28}+2.06\%$
test_mod_wrap[compile-overhead] 0.3408ms 0.2365ms 4.2285 KOps/s 4.1791 KOps/s $\color{#35bf28}+1.18\%$
test_mod_wrap_and_backward[eager] 12.7846ms 11.3012ms 88.4859 Ops/s 89.8801 Ops/s $\color{#d91a1a}-1.55\%$
test_mod_wrap_and_backward[compile] 12.5131ms 11.3066ms 88.4437 Ops/s 89.5865 Ops/s $\color{#d91a1a}-1.28\%$
test_mod_wrap_and_backward[compile-overhead] 12.8004ms 11.3277ms 88.2794 Ops/s 88.4682 Ops/s $\color{#d91a1a}-0.21\%$
test_seq_add[eager] 0.1625ms 86.0388μs 11.6227 KOps/s 11.0473 KOps/s $\textbf{\color{#35bf28}+5.21\%}$
test_seq_add[compile] 0.1209ms 63.3090μs 15.7955 KOps/s 15.0324 KOps/s $\textbf{\color{#35bf28}+5.08\%}$
test_seq_add[compile-overhead] 0.1188ms 62.2181μs 16.0725 KOps/s 15.8007 KOps/s $\color{#35bf28}+1.72\%$
test_seq_wrap[eager] 0.6361ms 0.3738ms 2.6752 KOps/s 2.5772 KOps/s $\color{#35bf28}+3.80\%$
test_seq_wrap[compile] 0.4592ms 0.2740ms 3.6503 KOps/s 3.6049 KOps/s $\color{#35bf28}+1.26\%$
test_seq_wrap[compile-overhead] 0.3688ms 0.2751ms 3.6345 KOps/s 3.6009 KOps/s $\color{#35bf28}+0.93\%$
test_func_call_runtime[False-eager] 0.7093ms 0.5271ms 1.8972 KOps/s 1.7975 KOps/s $\textbf{\color{#35bf28}+5.55\%}$
test_func_call_runtime[False-compile] 0.7224ms 0.5111ms 1.9567 KOps/s 2.0020 KOps/s $\color{#d91a1a}-2.26\%$
test_func_call_runtime[False-compile-overhead] 0.8870ms 0.5103ms 1.9595 KOps/s 1.9806 KOps/s $\color{#d91a1a}-1.06\%$
test_func_call_runtime[True-eager] 1.3034ms 0.7522ms 1.3294 KOps/s 1.3070 KOps/s $\color{#35bf28}+1.71\%$
test_func_call_runtime[True-compile] 1.0102ms 0.5237ms 1.9094 KOps/s 1.9730 KOps/s $\color{#d91a1a}-3.22\%$
test_func_call_runtime[True-compile-overhead] 0.9919ms 0.5245ms 1.9065 KOps/s 1.9587 KOps/s $\color{#d91a1a}-2.66\%$
test_func_call_cm_runtime[False-eager] 1.0931ms 0.5157ms 1.9391 KOps/s 1.8758 KOps/s $\color{#35bf28}+3.38\%$
test_func_call_cm_runtime[False-compile] 1.0834ms 0.5108ms 1.9576 KOps/s 1.9810 KOps/s $\color{#d91a1a}-1.18\%$
test_func_call_cm_runtime[False-compile-overhead] 0.8944ms 0.5088ms 1.9655 KOps/s 1.9988 KOps/s $\color{#d91a1a}-1.67\%$
test_func_call_cm_runtime[True-eager] 1.3826ms 0.8790ms 1.1376 KOps/s 1.1141 KOps/s $\color{#35bf28}+2.11\%$
test_func_call_cm_runtime[True-compile] 1.2281ms 0.7499ms 1.3335 KOps/s 1.3014 KOps/s $\color{#35bf28}+2.47\%$
test_func_call_cm_runtime[True-compile-overhead] 1.2707ms 0.7507ms 1.3321 KOps/s 1.2925 KOps/s $\color{#35bf28}+3.06\%$
test_vmap_func_call_cm_runtime[eager] 2.8774ms 1.9289ms 518.4189 Ops/s 525.2050 Ops/s $\color{#d91a1a}-1.29\%$
test_vmap_func_call_cm_runtime[compile] 3.3874ms 2.0383ms 490.6098 Ops/s 507.1329 Ops/s $\color{#d91a1a}-3.26\%$
test_vmap_func_call_cm_runtime[compile-overhead] 3.3262ms 1.9706ms 507.4624 Ops/s 503.7586 Ops/s $\color{#35bf28}+0.74\%$
test_distributed 0.3310ms 0.1262ms 7.9232 KOps/s 7.6375 KOps/s $\color{#35bf28}+3.74\%$
test_tdmodule 35.5860μs 16.0838μs 62.1745 KOps/s 53.0792 KOps/s $\textbf{\color{#35bf28}+17.14\%}$
test_tdmodule_dispatch 59.4610μs 33.8415μs 29.5495 KOps/s 25.8635 KOps/s $\textbf{\color{#35bf28}+14.25\%}$
test_tdseq 59.5810μs 19.2920μs 51.8349 KOps/s 47.5407 KOps/s $\textbf{\color{#35bf28}+9.03\%}$
test_tdseq_dispatch 65.1220μs 39.3341μs 25.4232 KOps/s 23.8099 KOps/s $\textbf{\color{#35bf28}+6.78\%}$
test_instantiation_functorch 2.1866ms 1.6073ms 622.1713 Ops/s 610.6745 Ops/s $\color{#35bf28}+1.88\%$
test_instantiation_td 2.4700ms 1.2318ms 811.8457 Ops/s 821.2324 Ops/s $\color{#d91a1a}-1.14\%$
test_exec_functorch 0.3856ms 0.1889ms 5.2924 KOps/s 5.3653 KOps/s $\color{#d91a1a}-1.36\%$
test_exec_functional_call 0.3330ms 0.1760ms 5.6817 KOps/s 5.3561 KOps/s $\textbf{\color{#35bf28}+6.08\%}$
test_exec_td 0.3256ms 0.1688ms 5.9245 KOps/s 5.3508 KOps/s $\textbf{\color{#35bf28}+10.72\%}$
test_exec_td_decorator 0.4146ms 0.2263ms 4.4189 KOps/s 4.4031 KOps/s $\color{#35bf28}+0.36\%$
test_vmap_mlp_speed[True-True] 0.9010ms 0.6555ms 1.5256 KOps/s 1.5174 KOps/s $\color{#35bf28}+0.54\%$
test_vmap_mlp_speed[True-False] 0.8657ms 0.6504ms 1.5375 KOps/s 1.5361 KOps/s $\color{#35bf28}+0.09\%$
test_vmap_mlp_speed[False-True] 0.7302ms 0.5052ms 1.9793 KOps/s 1.9719 KOps/s $\color{#35bf28}+0.37\%$
test_vmap_mlp_speed[False-False] 0.8142ms 0.5055ms 1.9781 KOps/s 1.9660 KOps/s $\color{#35bf28}+0.62\%$
test_vmap_mlp_speed_decorator[True-True] 1.4111ms 0.6327ms 1.5806 KOps/s 1.5812 KOps/s $\color{#d91a1a}-0.04\%$
test_vmap_mlp_speed_decorator[True-False] 0.9889ms 0.6326ms 1.5808 KOps/s 1.5773 KOps/s $\color{#35bf28}+0.22\%$
test_vmap_mlp_speed_decorator[False-True] 0.8403ms 0.5221ms 1.9153 KOps/s 1.9370 KOps/s $\color{#d91a1a}-1.12\%$
test_vmap_mlp_speed_decorator[False-False] 0.8487ms 0.5206ms 1.9210 KOps/s 1.9169 KOps/s $\color{#35bf28}+0.21\%$
test_to_module_speed[True] 1.9665ms 1.3008ms 768.7647 Ops/s 778.3459 Ops/s $\color{#d91a1a}-1.23\%$
test_to_module_speed[False] 1.8628ms 1.2709ms 786.8431 Ops/s 797.1385 Ops/s $\color{#d91a1a}-1.29\%$
test_tc_init 84.5880μs 41.7046μs 23.9781 KOps/s 21.8683 KOps/s $\textbf{\color{#35bf28}+9.65\%}$
test_tc_init_nested 0.1705ms 85.2052μs 11.7364 KOps/s 10.5359 KOps/s $\textbf{\color{#35bf28}+11.39\%}$
test_tc_first_layer_tensor 23.8150μs 1.5230μs 656.6012 KOps/s 662.3591 KOps/s $\color{#d91a1a}-0.87\%$
test_tc_first_layer_nontensor 42.2100μs 4.7719μs 209.5583 KOps/s 214.7310 KOps/s $\color{#d91a1a}-2.41\%$
test_tc_second_layer_tensor 23.7050μs 2.8651μs 349.0294 KOps/s 352.1194 KOps/s $\color{#d91a1a}-0.88\%$
test_tc_second_layer_nontensor 31.8690μs 6.0761μs 164.5805 KOps/s 168.3531 KOps/s $\color{#d91a1a}-2.24\%$
test_unbind 0.5218s 14.0864ms 70.9905 Ops/s 130.5467 Ops/s $\textbf{\color{#d91a1a}-45.62\%}$
test_full_like 10.9617ms 9.4466ms 105.8582 Ops/s 109.2207 Ops/s $\color{#d91a1a}-3.08\%$
test_zeros_like 4.0533ms 3.4788ms 287.4554 Ops/s 282.3360 Ops/s $\color{#35bf28}+1.81\%$
test_ones_like 5.2846ms 3.9893ms 250.6707 Ops/s 265.2456 Ops/s $\textbf{\color{#d91a1a}-5.49\%}$
test_clone 6.9680ms 6.1728ms 162.0022 Ops/s 169.2519 Ops/s $\color{#d91a1a}-4.28\%$
test_squeeze 74.1780μs 12.1322μs 82.4256 KOps/s 79.8046 KOps/s $\color{#35bf28}+3.28\%$
test_unsqueeze 0.1738ms 92.9741μs 10.7557 KOps/s 10.7255 KOps/s $\color{#35bf28}+0.28\%$
test_split 0.5820ms 0.1984ms 5.0404 KOps/s 5.1256 KOps/s $\color{#d91a1a}-1.66\%$
test_permute 0.4100ms 0.2256ms 4.4319 KOps/s 4.4607 KOps/s $\color{#d91a1a}-0.65\%$
test_stack 38.8289ms 27.5902ms 36.2448 Ops/s 37.6680 Ops/s $\color{#d91a1a}-3.78\%$
test_cat 31.7289ms 26.5915ms 37.6060 Ops/s 37.6863 Ops/s $\color{#d91a1a}-0.21\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}17$. Worsened: $\large\color{#d91a1a}5$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1482ms 13.9780μs 71.5410 KOps/s 73.1680 KOps/s $\color{#d91a1a}-2.22\%$
test_plain_set_stack_nested 32.1310μs 14.3549μs 69.6629 KOps/s 72.7407 KOps/s $\color{#d91a1a}-4.23\%$
test_plain_set_nested_inplace 84.1710μs 15.1846μs 65.8562 KOps/s 67.2330 KOps/s $\color{#d91a1a}-2.05\%$
test_plain_set_stack_nested_inplace 40.2010μs 15.0208μs 66.5744 KOps/s 67.3967 KOps/s $\color{#d91a1a}-1.22\%$
test_items 31.7800μs 2.8655μs 348.9742 KOps/s 345.7707 KOps/s $\color{#35bf28}+0.93\%$
test_items_nested 0.5017ms 0.3113ms 3.2127 KOps/s 3.1935 KOps/s $\color{#35bf28}+0.60\%$
test_items_nested_locked 0.3663ms 0.3147ms 3.1779 KOps/s 3.1622 KOps/s $\color{#35bf28}+0.50\%$
test_items_nested_leaf 87.3220μs 63.3160μs 15.7938 KOps/s 15.8744 KOps/s $\color{#d91a1a}-0.51\%$
test_items_stack_nested 0.3776ms 0.3141ms 3.1842 KOps/s 3.1435 KOps/s $\color{#35bf28}+1.29\%$
test_items_stack_nested_leaf 0.1150ms 64.1678μs 15.5841 KOps/s 15.7461 KOps/s $\color{#d91a1a}-1.03\%$
test_items_stack_nested_locked 0.3980ms 0.3147ms 3.1774 KOps/s 3.1839 KOps/s $\color{#d91a1a}-0.20\%$
test_keys 63.9510μs 3.3939μs 294.6435 KOps/s 292.8954 KOps/s $\color{#35bf28}+0.60\%$
test_keys_nested 89.8020μs 55.3015μs 18.0827 KOps/s 18.2967 KOps/s $\color{#d91a1a}-1.17\%$
test_keys_nested_locked 2.3411ms 60.4296μs 16.5482 KOps/s 16.5428 KOps/s $\color{#35bf28}+0.03\%$
test_keys_nested_leaf 84.4710μs 45.8495μs 21.8105 KOps/s 22.1604 KOps/s $\color{#d91a1a}-1.58\%$
test_keys_stack_nested 86.8520μs 55.2343μs 18.1047 KOps/s 18.2052 KOps/s $\color{#d91a1a}-0.55\%$
test_keys_stack_nested_leaf 79.9610μs 47.2329μs 21.1717 KOps/s 21.2165 KOps/s $\color{#d91a1a}-0.21\%$
test_keys_stack_nested_locked 0.1018ms 59.4410μs 16.8234 KOps/s 16.7862 KOps/s $\color{#35bf28}+0.22\%$
test_values 14.1920μs 0.8065μs 1.2399 MOps/s 1.2492 MOps/s $\color{#d91a1a}-0.74\%$
test_values_nested 57.9910μs 27.4412μs 36.4416 KOps/s 36.4561 KOps/s $\color{#d91a1a}-0.04\%$
test_values_nested_locked 62.6410μs 29.6570μs 33.7189 KOps/s 33.9844 KOps/s $\color{#d91a1a}-0.78\%$
test_values_nested_leaf 57.1410μs 24.2416μs 41.2514 KOps/s 41.2326 KOps/s $\color{#35bf28}+0.05\%$
test_values_stack_nested 68.2310μs 28.4056μs 35.2044 KOps/s 35.1346 KOps/s $\color{#35bf28}+0.20\%$
test_values_stack_nested_leaf 69.2610μs 24.9512μs 40.0783 KOps/s 39.9515 KOps/s $\color{#35bf28}+0.32\%$
test_values_stack_nested_locked 60.5010μs 30.4411μs 32.8504 KOps/s 33.1191 KOps/s $\color{#d91a1a}-0.81\%$
test_membership 1.7625μs 0.4746μs 2.1071 MOps/s 2.1229 MOps/s $\color{#d91a1a}-0.74\%$
test_membership_nested 14.7000μs 1.7392μs 574.9906 KOps/s 577.3670 KOps/s $\color{#d91a1a}-0.41\%$
test_membership_nested_leaf 12.4270μs 1.7156μs 582.9017 KOps/s 578.6942 KOps/s $\color{#35bf28}+0.73\%$
test_membership_stacked_nested 0.3029ms 1.7630μs 567.2006 KOps/s 557.1229 KOps/s $\color{#35bf28}+1.81\%$
test_membership_stacked_nested_leaf 35.5000μs 1.7791μs 562.0683 KOps/s 557.1000 KOps/s $\color{#35bf28}+0.89\%$
test_membership_nested_last 29.2400μs 2.5996μs 384.6789 KOps/s 385.4760 KOps/s $\color{#d91a1a}-0.21\%$
test_membership_nested_leaf_last 36.1690μs 2.6197μs 381.7188 KOps/s 391.6284 KOps/s $\color{#d91a1a}-2.53\%$
test_membership_stacked_nested_last 39.8000μs 3.7416μs 267.2649 KOps/s 131.8746 KOps/s $\textbf{\color{#35bf28}+102.67\%}$
test_membership_stacked_nested_leaf_last 28.5390μs 3.7500μs 266.6647 KOps/s 131.8974 KOps/s $\textbf{\color{#35bf28}+102.18\%}$
test_nested_getleaf 37.1800μs 6.1294μs 163.1494 KOps/s 164.7503 KOps/s $\color{#d91a1a}-0.97\%$
test_nested_get 33.4800μs 5.7646μs 173.4730 KOps/s 172.4294 KOps/s $\color{#35bf28}+0.61\%$
test_stacked_getleaf 31.9200μs 6.1320μs 163.0795 KOps/s 165.8539 KOps/s $\color{#d91a1a}-1.67\%$
test_stacked_get 40.8200μs 5.7663μs 173.4221 KOps/s 174.3521 KOps/s $\color{#d91a1a}-0.53\%$
test_nested_getitemleaf 41.6490μs 6.1571μs 162.4133 KOps/s 164.7922 KOps/s $\color{#d91a1a}-1.44\%$
test_nested_getitem 47.0010μs 5.7372μs 174.3014 KOps/s 172.1475 KOps/s $\color{#35bf28}+1.25\%$
test_stacked_getitemleaf 30.3810μs 6.1756μs 161.9276 KOps/s 164.7756 KOps/s $\color{#d91a1a}-1.73\%$
test_stacked_getitem 37.7610μs 5.8463μs 171.0482 KOps/s 172.3716 KOps/s $\color{#d91a1a}-0.77\%$
test_lock_nested 5.1510ms 0.4222ms 2.3688 KOps/s 2.3674 KOps/s $\color{#35bf28}+0.06\%$
test_lock_stack_nested 0.4638ms 0.3800ms 2.6318 KOps/s 2.6541 KOps/s $\color{#d91a1a}-0.84\%$
test_unlock_nested 0.8344ms 0.3561ms 2.8082 KOps/s 2.7504 KOps/s $\color{#35bf28}+2.10\%$
test_unlock_stack_nested 0.3966ms 0.3190ms 3.1351 KOps/s 3.1645 KOps/s $\color{#d91a1a}-0.93\%$
test_flatten_speed 0.3098ms 81.6849μs 12.2422 KOps/s 12.6119 KOps/s $\color{#d91a1a}-2.93\%$
test_unflatten_speed 0.3405ms 0.2829ms 3.5346 KOps/s 3.5598 KOps/s $\color{#d91a1a}-0.71\%$
test_common_ops 1.4691ms 1.2534ms 797.8151 Ops/s 795.9851 Ops/s $\color{#35bf28}+0.23\%$
test_creation 23.6900μs 1.4911μs 670.6559 KOps/s 667.8573 KOps/s $\color{#35bf28}+0.42\%$
test_creation_empty 59.2100μs 15.8834μs 62.9587 KOps/s 65.6368 KOps/s $\color{#d91a1a}-4.08\%$
test_creation_nested_1 62.0210μs 17.6361μs 56.7019 KOps/s 59.0665 KOps/s $\color{#d91a1a}-4.00\%$
test_creation_nested_2 62.8710μs 20.1811μs 49.5513 KOps/s 51.6078 KOps/s $\color{#d91a1a}-3.98\%$
test_clone 72.5710μs 29.2340μs 34.2067 KOps/s 33.2532 KOps/s $\color{#35bf28}+2.87\%$
test_getitem[int] 1.3035ms 16.4822μs 60.6716 KOps/s 59.2542 KOps/s $\color{#35bf28}+2.39\%$
test_getitem[slice_int] 0.1255ms 27.1071μs 36.8907 KOps/s 36.1333 KOps/s $\color{#35bf28}+2.10\%$
test_getitem[range] 0.2428ms 0.1087ms 9.1986 KOps/s 9.2913 KOps/s $\color{#d91a1a}-1.00\%$
test_getitem[tuple] 0.1212ms 24.2690μs 41.2049 KOps/s 40.4209 KOps/s $\color{#35bf28}+1.94\%$
test_getitem[list] 0.1958ms 97.2286μs 10.2850 KOps/s 10.1389 KOps/s $\color{#35bf28}+1.44\%$
test_setitem_dim[int] 77.4410μs 44.9023μs 22.2706 KOps/s 21.9849 KOps/s $\color{#35bf28}+1.30\%$
test_setitem_dim[slice_int] 89.0320μs 65.6097μs 15.2417 KOps/s 15.1265 KOps/s $\color{#35bf28}+0.76\%$
test_setitem_dim[range] 0.1764ms 0.1261ms 7.9317 KOps/s 7.8360 KOps/s $\color{#35bf28}+1.22\%$
test_setitem_dim[tuple] 85.4210μs 60.6247μs 16.4949 KOps/s 16.5911 KOps/s $\color{#d91a1a}-0.58\%$
test_setitem 94.9220μs 42.7267μs 23.4046 KOps/s 23.0270 KOps/s $\color{#35bf28}+1.64\%$
test_set 85.0710μs 42.3181μs 23.6305 KOps/s 22.3481 KOps/s $\textbf{\color{#35bf28}+5.74\%}$
test_set_shared 0.3623ms 51.9753μs 19.2399 KOps/s 19.0387 KOps/s $\color{#35bf28}+1.06\%$
test_update 0.1101ms 51.9819μs 19.2375 KOps/s 19.0070 KOps/s $\color{#35bf28}+1.21\%$
test_update_nested 0.1091ms 60.9061μs 16.4187 KOps/s 17.2919 KOps/s $\textbf{\color{#d91a1a}-5.05\%}$
test_update__nested 0.1032ms 63.8609μs 15.6590 KOps/s 16.0121 KOps/s $\color{#d91a1a}-2.21\%$
test_set_nested 0.1128ms 47.2503μs 21.1639 KOps/s 20.9101 KOps/s $\color{#35bf28}+1.21\%$
test_set_nested_new 95.6910μs 49.5100μs 20.1979 KOps/s 19.5356 KOps/s $\color{#35bf28}+3.39\%$
test_select 0.1014ms 62.8229μs 15.9178 KOps/s 15.1304 KOps/s $\textbf{\color{#35bf28}+5.20\%}$
test_select_nested 0.1204ms 40.9431μs 24.4242 KOps/s 23.7413 KOps/s $\color{#35bf28}+2.88\%$
test_exclude_nested 96.1010μs 58.7026μs 17.0350 KOps/s 16.7912 KOps/s $\color{#35bf28}+1.45\%$
test_empty[True] 0.8767ms 0.2454ms 4.0746 KOps/s 4.0881 KOps/s $\color{#d91a1a}-0.33\%$
test_empty[False] 4.1861μs 0.7314μs 1.3672 MOps/s 1.3292 MOps/s $\color{#35bf28}+2.86\%$
test_to 59.9110μs 25.2280μs 39.6385 KOps/s 38.8493 KOps/s $\color{#35bf28}+2.03\%$
test_to_nonblocking 64.2000μs 24.7756μs 40.3623 KOps/s 39.6690 KOps/s $\color{#35bf28}+1.75\%$
test_unbind_speed 1.0367ms 0.2717ms 3.6805 KOps/s 3.5242 KOps/s $\color{#35bf28}+4.44\%$
test_unbind_speed_stack0 0.3441ms 0.2775ms 3.6037 KOps/s 3.6529 KOps/s $\color{#d91a1a}-1.35\%$
test_unbind_speed_stack1 92.1630ms 0.7000ms 1.4286 KOps/s 1.4279 KOps/s $\color{#35bf28}+0.05\%$
test_split 93.5889ms 2.2022ms 454.0843 Ops/s 442.7878 Ops/s $\color{#35bf28}+2.55\%$
test_chunk 95.1377ms 2.2225ms 449.9494 Ops/s 440.2371 Ops/s $\color{#35bf28}+2.21\%$
test_creation[device0] 0.3341ms 0.1254ms 7.9747 KOps/s 7.6905 KOps/s $\color{#35bf28}+3.70\%$
test_creation_from_tensor 0.3403ms 0.1282ms 7.8033 KOps/s 7.4565 KOps/s $\color{#35bf28}+4.65\%$
test_add_one[memmap_tensor0] 0.1383ms 9.1352μs 109.4665 KOps/s 108.9162 KOps/s $\color{#35bf28}+0.51\%$
test_contiguous[memmap_tensor0] 47.8310μs 2.2178μs 450.8918 KOps/s 453.6030 KOps/s $\color{#d91a1a}-0.60\%$
test_stack[memmap_tensor0] 64.1710μs 6.7790μs 147.5142 KOps/s 143.3043 KOps/s $\color{#35bf28}+2.94\%$
test_memmaptd_index 1.0805ms 0.4391ms 2.2773 KOps/s 2.2533 KOps/s $\color{#35bf28}+1.07\%$
test_memmaptd_index_astensor 0.7711ms 0.4994ms 2.0025 KOps/s 1.9960 KOps/s $\color{#35bf28}+0.32\%$
test_memmaptd_index_op 1.4889ms 1.0582ms 945.0176 Ops/s 955.2802 Ops/s $\color{#d91a1a}-1.07\%$
test_serialize_model 0.1298s 0.1290s 7.7548 Ops/s 7.7548 Ops/s $+0.00\%$
test_serialize_model_pickle 1.3560s 1.2141s 0.8237 Ops/s 0.8246 Ops/s $\color{#d91a1a}-0.11\%$
test_serialize_weights 0.1294s 0.1284s 7.7895 Ops/s 7.8025 Ops/s $\color{#d91a1a}-0.17\%$
test_serialize_weights_returnearly 56.8401ms 45.8744ms 21.7986 Ops/s 17.5098 Ops/s $\textbf{\color{#35bf28}+24.49\%}$
test_serialize_weights_pickle 1.3538s 1.2141s 0.8237 Ops/s 0.8176 Ops/s $\color{#35bf28}+0.74\%$
test_reshape_pytree 64.7910μs 36.0666μs 27.7265 KOps/s 26.6803 KOps/s $\color{#35bf28}+3.92\%$
test_reshape_td 85.0220μs 41.3780μs 24.1675 KOps/s 23.1168 KOps/s $\color{#35bf28}+4.55\%$
test_view_pytree 64.0010μs 35.1702μs 28.4332 KOps/s 26.9586 KOps/s $\textbf{\color{#35bf28}+5.47\%}$
test_view_td 81.8410μs 46.2949μs 21.6006 KOps/s 20.8560 KOps/s $\color{#35bf28}+3.57\%$
test_unbind_pytree 67.8610μs 35.0887μs 28.4992 KOps/s 27.8414 KOps/s $\color{#35bf28}+2.36\%$
test_unbind_td 0.3939ms 43.5848μs 22.9438 KOps/s 23.0740 KOps/s $\color{#d91a1a}-0.56\%$
test_split_pytree 79.3010μs 46.1856μs 21.6518 KOps/s 21.1649 KOps/s $\color{#35bf28}+2.30\%$
test_split_td 0.1788ms 56.7284μs 17.6279 KOps/s 17.2894 KOps/s $\color{#35bf28}+1.96\%$
test_add_pytree 0.1189ms 56.2609μs 17.7743 KOps/s 17.1402 KOps/s $\color{#35bf28}+3.70\%$
test_add_td 0.1241ms 88.9348μs 11.2442 KOps/s 10.5683 KOps/s $\textbf{\color{#35bf28}+6.40\%}$
test_compile_add_one_nested[tensordict-compile] 0.4128ms 0.2067ms 4.8382 KOps/s 4.6243 KOps/s $\color{#35bf28}+4.62\%$
test_compile_add_one_nested[tensordict-eager] 0.2520ms 0.1568ms 6.3773 KOps/s 6.2486 KOps/s $\color{#35bf28}+2.06\%$
test_compile_add_one_nested[pytree-compile] 0.1887ms 0.1440ms 6.9429 KOps/s 6.5429 KOps/s $\textbf{\color{#35bf28}+6.11\%}$
test_compile_add_one_nested[pytree-eager] 0.2268ms 0.1841ms 5.4320 KOps/s 5.2827 KOps/s $\color{#35bf28}+2.83\%$
test_compile_copy_nested[tensordict-compile] 52.6710μs 21.1336μs 47.3179 KOps/s 45.7213 KOps/s $\color{#35bf28}+3.49\%$
test_compile_copy_nested[tensordict-eager] 80.1910μs 43.5151μs 22.9805 KOps/s 22.6237 KOps/s $\color{#35bf28}+1.58\%$
test_compile_copy_nested[pytree-compile] 99.7710μs 64.4046μs 15.5268 KOps/s 15.5688 KOps/s $\color{#d91a1a}-0.27\%$
test_compile_copy_nested[pytree-eager] 89.5910μs 49.1060μs 20.3641 KOps/s 20.2040 KOps/s $\color{#35bf28}+0.79\%$
test_compile_add_one_flat[tensordict-compile] 0.4110ms 0.3179ms 3.1452 KOps/s 3.1123 KOps/s $\color{#35bf28}+1.06\%$
test_compile_add_one_flat[tensordict-eager] 0.2738ms 0.2143ms 4.6653 KOps/s 4.5239 KOps/s $\color{#35bf28}+3.13\%$
test_compile_add_one_flat[tensorclass-compile] 0.1679ms 0.1278ms 7.8248 KOps/s 7.6939 KOps/s $\color{#35bf28}+1.70\%$
test_compile_add_one_flat[tensorclass-eager] 0.1212ms 61.6509μs 16.2204 KOps/s 15.2148 KOps/s $\textbf{\color{#35bf28}+6.61\%}$
test_compile_add_one_flat[pytree-compile] 0.3614ms 0.3170ms 3.1545 KOps/s 3.1213 KOps/s $\color{#35bf28}+1.07\%$
test_compile_add_one_flat[pytree-eager] 0.7000ms 0.6299ms 1.5875 KOps/s 1.5257 KOps/s $\color{#35bf28}+4.05\%$
test_compile_add_self_flat[tensordict-eager] 0.3363ms 0.2543ms 3.9319 KOps/s 3.8305 KOps/s $\color{#35bf28}+2.65\%$
test_compile_add_self_flat[tensordict-compile] 0.4011ms 0.3187ms 3.1373 KOps/s 3.1207 KOps/s $\color{#35bf28}+0.53\%$
test_compile_add_self_flat[tensorclass-eager] 0.1843ms 72.2726μs 13.8365 KOps/s 13.3439 KOps/s $\color{#35bf28}+3.69\%$
test_compile_add_self_flat[tensorclass-compile] 0.2637ms 0.1286ms 7.7771 KOps/s 7.6548 KOps/s $\color{#35bf28}+1.60\%$
test_compile_add_self_flat[pytree-eager] 0.6891ms 0.5545ms 1.8035 KOps/s 1.7955 KOps/s $\color{#35bf28}+0.44\%$
test_compile_add_self_flat[pytree-compile] 0.4223ms 0.3278ms 3.0504 KOps/s 3.1356 KOps/s $\color{#d91a1a}-2.72\%$
test_compile_copy_flat[tensordict-compile] 49.3700μs 18.7419μs 53.3563 KOps/s 53.3535 KOps/s $+0.01\%$
test_compile_copy_flat[tensordict-eager] 72.1710μs 27.6029μs 36.2281 KOps/s 36.4924 KOps/s $\color{#d91a1a}-0.72\%$
test_compile_copy_flat[pytree-compile] 0.1263ms 68.2196μs 14.6585 KOps/s 14.3226 KOps/s $\color{#35bf28}+2.35\%$
test_compile_copy_flat[pytree-eager] 0.1101ms 51.0922μs 19.5725 KOps/s 19.2228 KOps/s $\color{#35bf28}+1.82\%$
test_compile_assign_and_add[tensordict-compile] 2.3083ms 0.8057ms 1.2412 KOps/s 1.1268 KOps/s $\textbf{\color{#35bf28}+10.16\%}$
test_compile_assign_and_add[tensordict-eager] 3.3351ms 3.2160ms 310.9482 Ops/s 308.2126 Ops/s $\color{#35bf28}+0.89\%$
test_compile_assign_and_add[pytree-compile] 2.3226ms 0.8049ms 1.2423 KOps/s 1.1281 KOps/s $\textbf{\color{#35bf28}+10.13\%}$
test_compile_assign_and_add[pytree-eager] 3.3868ms 3.2357ms 309.0518 Ops/s 303.2812 Ops/s $\color{#35bf28}+1.90\%$
test_compile_indexing[tensor-tensordict-compile] 0.1554ms 0.1138ms 8.7893 KOps/s 9.1053 KOps/s $\color{#d91a1a}-3.47\%$
test_compile_indexing[tensor-tensordict-eager] 0.1944ms 60.6510μs 16.4878 KOps/s 16.1387 KOps/s $\color{#35bf28}+2.16\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1945ms 0.1043ms 9.5878 KOps/s 9.5149 KOps/s $\color{#35bf28}+0.77\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1178ms 42.9463μs 23.2849 KOps/s 22.7929 KOps/s $\color{#35bf28}+2.16\%$
test_compile_indexing[tensor-pytree-compile] 0.1660ms 0.1045ms 9.5675 KOps/s 9.4619 KOps/s $\color{#35bf28}+1.12\%$
test_compile_indexing[tensor-pytree-eager] 0.1064ms 44.7593μs 22.3417 KOps/s 22.9393 KOps/s $\color{#d91a1a}-2.61\%$
test_compile_indexing[slice-tensordict-compile] 0.1723ms 0.1377ms 7.2596 KOps/s 7.1633 KOps/s $\color{#35bf28}+1.34\%$
test_compile_indexing[slice-tensordict-eager] 0.1583ms 25.7362μs 38.8558 KOps/s 37.3240 KOps/s $\color{#35bf28}+4.10\%$
test_compile_indexing[slice-tensorclass-compile] 0.1891ms 0.1355ms 7.3789 KOps/s 7.5346 KOps/s $\color{#d91a1a}-2.07\%$
test_compile_indexing[slice-tensorclass-eager] 59.8710μs 21.6465μs 46.1968 KOps/s 45.1448 KOps/s $\color{#35bf28}+2.33\%$
test_compile_indexing[slice-pytree-compile] 0.1991ms 0.1324ms 7.5535 KOps/s 7.4383 KOps/s $\color{#35bf28}+1.55\%$
test_compile_indexing[slice-pytree-eager] 58.9110μs 21.5931μs 46.3110 KOps/s 44.2920 KOps/s $\color{#35bf28}+4.56\%$
test_compile_indexing[int-tensordict-compile] 0.1873ms 0.1386ms 7.2159 KOps/s 7.1577 KOps/s $\color{#35bf28}+0.81\%$
test_compile_indexing[int-tensordict-eager] 0.4881ms 27.1482μs 36.8349 KOps/s 38.5129 KOps/s $\color{#d91a1a}-4.36\%$
test_compile_indexing[int-tensorclass-compile] 0.1971ms 0.1375ms 7.2719 KOps/s 7.4722 KOps/s $\color{#d91a1a}-2.68\%$
test_compile_indexing[int-tensorclass-eager] 62.8610μs 21.5015μs 46.5085 KOps/s 44.9515 KOps/s $\color{#35bf28}+3.46\%$
test_compile_indexing[int-pytree-compile] 0.1851ms 0.1322ms 7.5664 KOps/s 7.4091 KOps/s $\color{#35bf28}+2.12\%$
test_compile_indexing[int-pytree-eager] 54.4810μs 21.1567μs 47.2663 KOps/s 45.5124 KOps/s $\color{#35bf28}+3.85\%$
test_mod_add[eager] 82.7410μs 33.7386μs 29.6397 KOps/s 31.6497 KOps/s $\textbf{\color{#d91a1a}-6.35\%}$
test_mod_add[compile] 0.1144ms 73.1218μs 13.6758 KOps/s 13.5568 KOps/s $\color{#35bf28}+0.88\%$
test_mod_add[compile-overhead] 0.2592ms 0.1367ms 7.3176 KOps/s 7.0182 KOps/s $\color{#35bf28}+4.27\%$
test_mod_wrap[eager] 0.3224ms 0.2544ms 3.9314 KOps/s 4.1008 KOps/s $\color{#d91a1a}-4.13\%$
test_mod_wrap[compile] 0.4486ms 0.2956ms 3.3834 KOps/s 3.4561 KOps/s $\color{#d91a1a}-2.10\%$
test_mod_wrap[compile-overhead] 7.8674ms 4.1652ms 240.0850 Ops/s 253.0617 Ops/s $\textbf{\color{#d91a1a}-5.13\%}$
test_mod_wrap_and_backward[eager] 1.4350ms 1.3218ms 756.5283 Ops/s 698.4141 Ops/s $\textbf{\color{#35bf28}+8.32\%}$
test_mod_wrap_and_backward[compile] 1.3858ms 1.3096ms 763.6100 Ops/s 692.7406 Ops/s $\textbf{\color{#35bf28}+10.23\%}$
test_mod_wrap_and_backward[compile-overhead] 1.2850ms 0.8731ms 1.1453 KOps/s 1.0166 KOps/s $\textbf{\color{#35bf28}+12.66\%}$
test_seq_add[eager] 0.1495ms 97.1042μs 10.2982 KOps/s 10.2858 KOps/s $\color{#35bf28}+0.12\%$
test_seq_add[compile] 0.1365ms 83.0920μs 12.0349 KOps/s 12.2845 KOps/s $\color{#d91a1a}-2.03\%$
test_seq_add[compile-overhead] 0.1819ms 0.1173ms 8.5258 KOps/s 8.7318 KOps/s $\color{#d91a1a}-2.36\%$
test_seq_wrap[eager] 0.4420ms 0.3929ms 2.5454 KOps/s 2.6367 KOps/s $\color{#d91a1a}-3.46\%$
test_seq_wrap[compile] 0.3783ms 0.3065ms 3.2631 KOps/s 3.2211 KOps/s $\color{#35bf28}+1.30\%$
test_seq_wrap[compile-overhead] 0.2740ms 0.2086ms 4.7934 KOps/s 4.7164 KOps/s $\color{#35bf28}+1.63\%$
test_func_call_runtime[False-eager] 0.8285ms 0.7392ms 1.3529 KOps/s 1.3471 KOps/s $\color{#35bf28}+0.43\%$
test_func_call_runtime[False-compile] 0.8619ms 0.7742ms 1.2917 KOps/s 1.2540 KOps/s $\color{#35bf28}+3.00\%$
test_func_call_runtime[False-compile-overhead] 0.4655ms 0.3489ms 2.8663 KOps/s 2.8445 KOps/s $\color{#35bf28}+0.77\%$
test_func_call_runtime[True-eager] 0.9662ms 0.8916ms 1.1215 KOps/s 1.1020 KOps/s $\color{#35bf28}+1.77\%$
test_func_call_runtime[True-compile] 0.9037ms 0.8169ms 1.2241 KOps/s 1.2012 KOps/s $\color{#35bf28}+1.91\%$
test_func_call_runtime[True-compile-overhead] 0.4878ms 0.3902ms 2.5630 KOps/s 2.6025 KOps/s $\color{#d91a1a}-1.52\%$
test_func_call_cm_runtime[False-eager] 0.8410ms 0.7665ms 1.3046 KOps/s 1.2663 KOps/s $\color{#35bf28}+3.02\%$
test_func_call_cm_runtime[False-compile] 0.8668ms 0.7817ms 1.2793 KOps/s 1.2203 KOps/s $\color{#35bf28}+4.84\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4838ms 0.3498ms 2.8589 KOps/s 2.8181 KOps/s $\color{#35bf28}+1.45\%$
test_func_call_cm_runtime[True-eager] 1.0902ms 0.9896ms 1.0105 KOps/s 995.5644 Ops/s $\color{#35bf28}+1.51\%$
test_func_call_cm_runtime[True-compile] 0.9332ms 0.8439ms 1.1850 KOps/s 1.1650 KOps/s $\color{#35bf28}+1.72\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4775ms 0.4061ms 2.4626 KOps/s 2.4595 KOps/s $\color{#35bf28}+0.13\%$
test_vmap_func_call_cm_runtime[eager] 2.5554ms 2.0545ms 486.7404 Ops/s 483.0592 Ops/s $\color{#35bf28}+0.76\%$
test_vmap_func_call_cm_runtime[compile] 0.9680ms 0.8629ms 1.1589 KOps/s 1.1304 KOps/s $\color{#35bf28}+2.52\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4781ms 0.4113ms 2.4314 KOps/s 2.4163 KOps/s $\color{#35bf28}+0.63\%$
test_distributed 4.6082ms 0.2295ms 4.3577 KOps/s 8.8819 KOps/s $\textbf{\color{#d91a1a}-50.94\%}$
test_tdmodule 25.0210μs 14.5014μs 68.9589 KOps/s 69.2421 KOps/s $\color{#d91a1a}-0.41\%$
test_tdmodule_dispatch 50.7310μs 28.9522μs 34.5397 KOps/s 35.2964 KOps/s $\color{#d91a1a}-2.14\%$
test_tdseq 36.6510μs 15.1409μs 66.0464 KOps/s 65.2305 KOps/s $\color{#35bf28}+1.25\%$
test_tdseq_dispatch 50.5410μs 31.1494μs 32.1034 KOps/s 32.3123 KOps/s $\color{#d91a1a}-0.65\%$
test_instantiation_functorch 1.9701ms 1.8717ms 534.2696 Ops/s 523.2429 Ops/s $\color{#35bf28}+2.11\%$
test_instantiation_td 1.7938ms 1.1982ms 834.5729 Ops/s 818.9905 Ops/s $\color{#35bf28}+1.90\%$
test_exec_functorch 0.2653ms 0.2119ms 4.7186 KOps/s 4.6625 KOps/s $\color{#35bf28}+1.20\%$
test_exec_functional_call 0.2607ms 0.2072ms 4.8267 KOps/s 4.7350 KOps/s $\color{#35bf28}+1.94\%$
test_exec_td 0.2651ms 0.2176ms 4.5960 KOps/s 4.5918 KOps/s $\color{#35bf28}+0.09\%$
test_exec_td_decorator 0.9801ms 0.2541ms 3.9355 KOps/s 3.8439 KOps/s $\color{#35bf28}+2.39\%$
test_vmap_mlp_speed[True-True] 0.7561ms 0.6733ms 1.4853 KOps/s 1.4561 KOps/s $\color{#35bf28}+2.00\%$
test_vmap_mlp_speed[True-False] 0.7659ms 0.6683ms 1.4963 KOps/s 1.4188 KOps/s $\textbf{\color{#35bf28}+5.46\%}$
test_vmap_mlp_speed[False-True] 0.6885ms 0.5656ms 1.7679 KOps/s 1.6771 KOps/s $\textbf{\color{#35bf28}+5.41\%}$
test_vmap_mlp_speed[False-False] 0.6776ms 0.5678ms 1.7611 KOps/s 1.6906 KOps/s $\color{#35bf28}+4.17\%$
test_vmap_mlp_speed_decorator[True-True] 1.4135ms 0.6576ms 1.5207 KOps/s 1.4655 KOps/s $\color{#35bf28}+3.77\%$
test_vmap_mlp_speed_decorator[True-False] 0.7754ms 0.6564ms 1.5234 KOps/s 1.4902 KOps/s $\color{#35bf28}+2.23\%$
test_vmap_mlp_speed_decorator[False-True] 0.6874ms 0.5789ms 1.7275 KOps/s 1.6700 KOps/s $\color{#35bf28}+3.44\%$
test_vmap_mlp_speed_decorator[False-False] 0.7043ms 0.5789ms 1.7273 KOps/s 1.7008 KOps/s $\color{#35bf28}+1.56\%$
test_vmap_transformer_speed[True-True] 8.2794ms 8.1906ms 122.0918 Ops/s 120.2507 Ops/s $\color{#35bf28}+1.53\%$
test_vmap_transformer_speed[True-False] 8.3043ms 8.1727ms 122.3581 Ops/s 120.2021 Ops/s $\color{#35bf28}+1.79\%$
test_vmap_transformer_speed[False-True] 8.0556ms 7.9807ms 125.3026 Ops/s 123.6418 Ops/s $\color{#35bf28}+1.34\%$
test_vmap_transformer_speed[False-False] 9.3348ms 7.9836ms 125.2567 Ops/s 123.4157 Ops/s $\color{#35bf28}+1.49\%$
test_vmap_transformer_speed_decorator[True-True] 19.7457ms 19.1900ms 52.1106 Ops/s 51.7316 Ops/s $\color{#35bf28}+0.73\%$
test_vmap_transformer_speed_decorator[True-False] 19.6134ms 19.1705ms 52.1634 Ops/s 51.6897 Ops/s $\color{#35bf28}+0.92\%$
test_vmap_transformer_speed_decorator[False-True] 19.7428ms 19.1056ms 52.3407 Ops/s 52.0756 Ops/s $\color{#35bf28}+0.51\%$
test_vmap_transformer_speed_decorator[False-False] 19.1158ms 19.0159ms 52.5877 Ops/s 52.0694 Ops/s $\color{#35bf28}+1.00\%$
test_to_module_speed[True] 1.4005ms 0.9390ms 1.0649 KOps/s 1.0625 KOps/s $\color{#35bf28}+0.23\%$
test_to_module_speed[False] 1.3215ms 0.9133ms 1.0949 KOps/s 1.0954 KOps/s $\color{#d91a1a}-0.04\%$
test_tc_init 54.9910μs 33.5714μs 29.7872 KOps/s 29.0792 KOps/s $\color{#35bf28}+2.43\%$
test_tc_init_nested 0.1160ms 67.7922μs 14.7510 KOps/s 13.8946 KOps/s $\textbf{\color{#35bf28}+6.16\%}$
test_tc_first_layer_tensor 4.5057μs 0.6781μs 1.4746 MOps/s 1.4927 MOps/s $\color{#d91a1a}-1.21\%$
test_tc_first_layer_nontensor 28.0100μs 2.2054μs 453.4288 KOps/s 436.2074 KOps/s $\color{#35bf28}+3.95\%$
test_tc_second_layer_tensor 8.4953μs 1.3770μs 726.2078 KOps/s 722.5784 KOps/s $\color{#35bf28}+0.50\%$
test_tc_second_layer_nontensor 30.1210μs 2.9254μs 341.8373 KOps/s 327.8668 KOps/s $\color{#35bf28}+4.26\%$
test_unbind 0.1938s 11.8646ms 84.2843 Ops/s 95.2824 Ops/s $\textbf{\color{#d91a1a}-11.54\%}$
test_full_like 0.6562ms 0.5750ms 1.7393 KOps/s 1.7381 KOps/s $\color{#35bf28}+0.07\%$
test_zeros_like 0.2770ms 0.1980ms 5.0508 KOps/s 5.0512 KOps/s $-0.01\%$
test_ones_like 0.2520ms 0.1978ms 5.0559 KOps/s 5.0564 KOps/s $-0.01\%$
test_clone 0.4588ms 0.4135ms 2.4181 KOps/s 2.4127 KOps/s $\color{#35bf28}+0.23\%$
test_squeeze 54.1210μs 9.6373μs 103.7633 KOps/s 103.6408 KOps/s $\color{#35bf28}+0.12\%$
test_unsqueeze 0.2938ms 75.8307μs 13.1873 KOps/s 13.5814 KOps/s $\color{#d91a1a}-2.90\%$
test_split 0.2595ms 0.1611ms 6.2075 KOps/s 6.2251 KOps/s $\color{#d91a1a}-0.28\%$
test_permute 0.2814ms 0.1852ms 5.3999 KOps/s 5.4688 KOps/s $\color{#d91a1a}-1.26\%$
test_stack 1.2532ms 0.8677ms 1.1525 KOps/s 1.1399 KOps/s $\color{#35bf28}+1.10\%$
test_cat 1.2509ms 1.2322ms 811.5238 Ops/s 811.9520 Ops/s $\color{#d91a1a}-0.05\%$

@vmoens vmoens added the Refactor Refactoring code - not a new feature label Sep 13, 2024
@vmoens vmoens changed the title [Feature] Use IntEnum for interaction types [Refactor] Use IntEnum for interaction types Sep 13, 2024
@vmoens vmoens merged commit 187541b into main Sep 13, 2024
46 of 54 checks passed
@vmoens vmoens deleted the intenum branch September 13, 2024 08:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Refactor Refactoring code - not a new feature
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants