[Versioning] 0.6.2 #1089

Merged
merged 1 commit into from
Nov 14, 2024


vmoens (Contributor) commented Nov 14, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 14, 2024
ghstack-source-id: 929ad7fb89e0bd6d25a70ed64b340ad7245fd693
Pull Request resolved: #1089
@facebook-github-bot added the CLA Signed label Nov 14, 2024
@vmoens merged commit 7dc7014 into gh/vmoens/36/base Nov 14, 2024
11 of 22 checks passed
@vmoens deleted the gh/vmoens/36/head branch November 14, 2024 06:31
vmoens added a commit that referenced this pull request Nov 14, 2024
ghstack-source-id: 929ad7fb89e0bd6d25a70ed64b340ad7245fd693
Pull Request resolved: #1089

(cherry picked from commit 73b0fd7)

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}25$. Worsened: $\large\color{#d91a1a}10$.

Name | Max | Mean | Ops | Ops on Repo HEAD | Change
test_plain_set_nested 34.0330μs 17.2155μs 58.0873 KOps/s 55.2824 KOps/s $\textbf{\color{#35bf28}+5.07\%}$
test_plain_set_stack_nested 44.9340μs 17.3571μs 57.6132 KOps/s 53.4492 KOps/s $\textbf{\color{#35bf28}+7.79\%}$
test_plain_set_nested_inplace 69.0390μs 18.9931μs 52.6506 KOps/s 49.1327 KOps/s $\textbf{\color{#35bf28}+7.16\%}$
test_plain_set_stack_nested_inplace 48.0400μs 18.7374μs 53.3691 KOps/s 49.9726 KOps/s $\textbf{\color{#35bf28}+6.80\%}$
test_items 39.8940μs 4.1019μs 243.7924 KOps/s 233.5284 KOps/s $\color{#35bf28}+4.40\%$
test_items_nested 0.6590ms 0.3425ms 2.9199 KOps/s 2.9658 KOps/s $\color{#d91a1a}-1.55\%$
test_items_nested_locked 0.5674ms 0.3422ms 2.9223 KOps/s 2.9470 KOps/s $\color{#d91a1a}-0.84\%$
test_items_nested_leaf 0.1429ms 71.5416μs 13.9779 KOps/s 14.1360 KOps/s $\color{#d91a1a}-1.12\%$
test_items_stack_nested 0.6603ms 0.3472ms 2.8805 KOps/s 2.9228 KOps/s $\color{#d91a1a}-1.45\%$
test_items_stack_nested_leaf 0.1290ms 74.2431μs 13.4693 KOps/s 13.5762 KOps/s $\color{#d91a1a}-0.79\%$
test_items_stack_nested_locked 0.6497ms 0.3437ms 2.9091 KOps/s 2.9044 KOps/s $\color{#35bf28}+0.16\%$
test_keys 37.3200μs 3.5116μs 284.7721 KOps/s 280.2664 KOps/s $\color{#35bf28}+1.61\%$
test_keys_nested 0.2037ms 0.1382ms 7.2337 KOps/s 7.4003 KOps/s $\color{#d91a1a}-2.25\%$
test_keys_nested_locked 1.8223ms 0.1439ms 6.9500 KOps/s 7.0457 KOps/s $\color{#d91a1a}-1.36\%$
test_keys_nested_leaf 0.1941ms 0.1158ms 8.6331 KOps/s 8.4954 KOps/s $\color{#35bf28}+1.62\%$
test_keys_stack_nested 0.2604ms 0.1369ms 7.3065 KOps/s 7.3153 KOps/s $\color{#d91a1a}-0.12\%$
test_keys_stack_nested_leaf 0.2053ms 0.1157ms 8.6420 KOps/s 8.5152 KOps/s $\color{#35bf28}+1.49\%$
test_keys_stack_nested_locked 0.2586ms 0.1416ms 7.0623 KOps/s 7.1081 KOps/s $\color{#d91a1a}-0.64\%$
test_values 8.7984μs 1.0463μs 955.7487 KOps/s 976.0191 KOps/s $\color{#d91a1a}-2.08\%$
test_values_nested 0.1111ms 55.9917μs 17.8598 KOps/s 17.9787 KOps/s $\color{#d91a1a}-0.66\%$
test_values_nested_locked 0.1022ms 55.5160μs 18.0128 KOps/s 17.4470 KOps/s $\color{#35bf28}+3.24\%$
test_values_nested_leaf 0.1174ms 59.8479μs 16.7090 KOps/s 16.6513 KOps/s $\color{#35bf28}+0.35\%$
test_values_stack_nested 0.1099ms 57.0915μs 17.5157 KOps/s 17.7004 KOps/s $\color{#d91a1a}-1.04\%$
test_values_stack_nested_leaf 0.1222ms 59.6029μs 16.7777 KOps/s 16.3464 KOps/s $\color{#35bf28}+2.64\%$
test_values_stack_nested_locked 0.1147ms 57.1298μs 17.5040 KOps/s 17.7087 KOps/s $\color{#d91a1a}-1.16\%$
test_membership 36.2080μs 0.8856μs 1.1292 MOps/s 1.1462 MOps/s $\color{#d91a1a}-1.49\%$
test_membership_nested 17.6330μs 2.7299μs 366.3095 KOps/s 361.3660 KOps/s $\color{#35bf28}+1.37\%$
test_membership_nested_leaf 43.4810μs 2.7621μs 362.0422 KOps/s 361.6734 KOps/s $\color{#35bf28}+0.10\%$
test_membership_stacked_nested 36.3370μs 2.7168μs 368.0760 KOps/s 370.6455 KOps/s $\color{#d91a1a}-0.69\%$
test_membership_stacked_nested_leaf 33.1020μs 2.7466μs 364.0831 KOps/s 370.2253 KOps/s $\color{#d91a1a}-1.66\%$
test_membership_nested_last 66.3540μs 4.0223μs 248.6155 KOps/s 249.2027 KOps/s $\color{#d91a1a}-0.24\%$
test_membership_nested_leaf_last 46.7560μs 4.0279μs 248.2711 KOps/s 248.8903 KOps/s $\color{#d91a1a}-0.25\%$
test_membership_stacked_nested_last 30.3370μs 3.9601μs 252.5162 KOps/s 252.0878 KOps/s $\color{#35bf28}+0.17\%$
test_membership_stacked_nested_leaf_last 41.9480μs 3.9306μs 254.4125 KOps/s 250.3836 KOps/s $\color{#35bf28}+1.61\%$
test_nested_getleaf 55.6940μs 10.5469μs 94.8149 KOps/s 93.4902 KOps/s $\color{#35bf28}+1.42\%$
test_nested_get 49.0010μs 10.2675μs 97.3942 KOps/s 95.6813 KOps/s $\color{#35bf28}+1.79\%$
test_stacked_getleaf 39.9740μs 10.7300μs 93.1964 KOps/s 93.8445 KOps/s $\color{#d91a1a}-0.69\%$
test_stacked_get 54.9520μs 9.9539μs 100.4636 KOps/s 99.1376 KOps/s $\color{#35bf28}+1.34\%$
test_nested_getitemleaf 52.4480μs 11.0076μs 90.8462 KOps/s 85.3475 KOps/s $\textbf{\color{#35bf28}+6.44\%}$
test_nested_getitem 36.5580μs 10.4120μs 96.0433 KOps/s 97.4754 KOps/s $\color{#d91a1a}-1.47\%$
test_stacked_getitemleaf 54.0010μs 11.2190μs 89.1347 KOps/s 88.5959 KOps/s $\color{#35bf28}+0.61\%$
test_stacked_getitem 54.3500μs 10.4133μs 96.0308 KOps/s 95.6772 KOps/s $\color{#35bf28}+0.37\%$
test_lock_nested 3.1251ms 0.4386ms 2.2801 KOps/s 1.8538 KOps/s $\textbf{\color{#35bf28}+23.00\%}$
test_lock_stack_nested 0.7402ms 0.4048ms 2.4705 KOps/s 2.4496 KOps/s $\color{#35bf28}+0.85\%$
test_unlock_nested 0.7514ms 0.3507ms 2.8515 KOps/s 2.7762 KOps/s $\color{#35bf28}+2.71\%$
test_unlock_stack_nested 0.5941ms 0.3226ms 3.0995 KOps/s 3.0407 KOps/s $\color{#35bf28}+1.93\%$
test_flatten_speed 0.2124ms 90.7289μs 11.0218 KOps/s 10.9802 KOps/s $\color{#35bf28}+0.38\%$
test_unflatten_speed 0.9748ms 0.4787ms 2.0889 KOps/s 2.1344 KOps/s $\color{#d91a1a}-2.13\%$
test_common_ops 1.5057ms 0.7397ms 1.3520 KOps/s 1.2696 KOps/s $\textbf{\color{#35bf28}+6.49\%}$
test_creation 26.5700μs 2.0368μs 490.9595 KOps/s 481.4325 KOps/s $\color{#35bf28}+1.98\%$
test_creation_empty 39.4830μs 9.5871μs 104.3071 KOps/s 86.8033 KOps/s $\textbf{\color{#35bf28}+20.16\%}$
test_creation_nested_1 42.5690μs 12.4216μs 80.5052 KOps/s 69.9928 KOps/s $\textbf{\color{#35bf28}+15.02\%}$
test_creation_nested_2 79.0570μs 16.5546μs 60.4060 KOps/s 54.3300 KOps/s $\textbf{\color{#35bf28}+11.18\%}$
test_clone 0.1600ms 13.1744μs 75.9050 KOps/s 76.8560 KOps/s $\color{#d91a1a}-1.24\%$
test_getitem[int] 1.1415ms 12.4547μs 80.2910 KOps/s 81.3166 KOps/s $\color{#d91a1a}-1.26\%$
test_getitem[slice_int] 0.1414ms 23.7836μs 42.0458 KOps/s 43.5816 KOps/s $\color{#d91a1a}-3.52\%$
test_getitem[range] 0.1936ms 47.3263μs 21.1299 KOps/s 21.3250 KOps/s $\color{#d91a1a}-0.91\%$
test_getitem[tuple] 0.1321ms 19.5599μs 51.1250 KOps/s 52.7701 KOps/s $\color{#d91a1a}-3.12\%$
test_getitem[list] 0.2159ms 43.2474μs 23.1228 KOps/s 23.5607 KOps/s $\color{#d91a1a}-1.86\%$
test_setitem_dim[int] 61.6350μs 24.7116μs 40.4668 KOps/s 39.8387 KOps/s $\color{#35bf28}+1.58\%$
test_setitem_dim[slice_int] 93.5540μs 48.8392μs 20.4753 KOps/s 19.7550 KOps/s $\color{#35bf28}+3.65\%$
test_setitem_dim[range] 0.1604ms 71.5831μs 13.9698 KOps/s 13.6699 KOps/s $\color{#35bf28}+2.19\%$
test_setitem_dim[tuple] 71.2520μs 38.8808μs 25.7197 KOps/s 25.1520 KOps/s $\color{#35bf28}+2.26\%$
test_setitem 0.1628ms 19.7541μs 50.6224 KOps/s 48.4607 KOps/s $\color{#35bf28}+4.46\%$
test_set 68.3470μs 18.7899μs 53.2200 KOps/s 49.3624 KOps/s $\textbf{\color{#35bf28}+7.81\%}$
test_set_shared 4.1942ms 0.1670ms 5.9880 KOps/s 5.9583 KOps/s $\color{#35bf28}+0.50\%$
test_update 0.1316ms 21.0453μs 47.5165 KOps/s 42.6369 KOps/s $\textbf{\color{#35bf28}+11.44\%}$
test_update_nested 96.0290μs 29.8465μs 33.5048 KOps/s 30.8369 KOps/s $\textbf{\color{#35bf28}+8.65\%}$
test_update__nested 0.1466ms 32.5254μs 30.7452 KOps/s 31.3182 KOps/s $\color{#d91a1a}-1.83\%$
test_set_nested 0.1330ms 21.1756μs 47.2242 KOps/s 45.1861 KOps/s $\color{#35bf28}+4.51\%$
test_set_nested_new 83.9750μs 25.6713μs 38.9540 KOps/s 37.1309 KOps/s $\color{#35bf28}+4.91\%$
test_select 0.2035ms 42.3380μs 23.6195 KOps/s 23.7492 KOps/s $\color{#d91a1a}-0.55\%$
test_select_nested 0.1303ms 60.4908μs 16.5314 KOps/s 16.8990 KOps/s $\color{#d91a1a}-2.17\%$
test_exclude_nested 0.1383ms 75.2388μs 13.2910 KOps/s 13.4896 KOps/s $\color{#d91a1a}-1.47\%$
test_empty[True] 0.4335ms 0.3536ms 2.8283 KOps/s 2.8719 KOps/s $\color{#d91a1a}-1.52\%$
test_empty[False] 9.2397μs 1.2165μs 822.0412 KOps/s 815.5702 KOps/s $\color{#35bf28}+0.79\%$
test_unbind_speed 0.3048ms 0.2592ms 3.8578 KOps/s 3.8999 KOps/s $\color{#d91a1a}-1.08\%$
test_unbind_speed_stack0 0.3168ms 0.2540ms 3.9376 KOps/s 3.9233 KOps/s $\color{#35bf28}+0.36\%$
test_unbind_speed_stack1 0.1074s 0.7507ms 1.3322 KOps/s 1.4055 KOps/s $\textbf{\color{#d91a1a}-5.22\%}$
test_split 0.1032s 1.7199ms 581.4234 Ops/s 576.0875 Ops/s $\color{#35bf28}+0.93\%$
test_chunk 0.1038s 1.7317ms 577.4727 Ops/s 571.2454 Ops/s $\color{#35bf28}+1.09\%$
test_consolidate_njt[False-None] 10.2681ms 8.1167ms 123.2025 Ops/s 122.3270 Ops/s $\color{#35bf28}+0.72\%$
test_creation[device0] 0.2515ms 88.5869μs 11.2884 KOps/s 10.0688 KOps/s $\textbf{\color{#35bf28}+12.11\%}$
test_creation_from_tensor 4.4472ms 93.6161μs 10.6819 KOps/s 10.5141 KOps/s $\color{#35bf28}+1.60\%$
test_add_one[memmap_tensor0] 0.1579ms 4.8670μs 205.4640 KOps/s 212.4853 KOps/s $\color{#d91a1a}-3.30\%$
test_contiguous[memmap_tensor0] 21.6710μs 0.5216μs 1.9171 MOps/s 1.9577 MOps/s $\color{#d91a1a}-2.07\%$
test_stack[memmap_tensor0] 33.1010μs 3.4137μs 292.9336 KOps/s 304.1745 KOps/s $\color{#d91a1a}-3.70\%$
test_memmaptd_index 1.0519ms 0.2323ms 4.3053 KOps/s 4.2958 KOps/s $\color{#35bf28}+0.22\%$
test_memmaptd_index_astensor 0.7123ms 0.3100ms 3.2260 KOps/s 3.2308 KOps/s $\color{#d91a1a}-0.15\%$
test_memmaptd_index_op 1.2933ms 0.5450ms 1.8350 KOps/s 1.7070 KOps/s $\textbf{\color{#35bf28}+7.50\%}$
test_serialize_model 0.1239s 0.1141s 8.7660 Ops/s 7.5202 Ops/s $\textbf{\color{#35bf28}+16.57\%}$
test_serialize_model_pickle 0.4783s 0.3926s 2.5474 Ops/s 2.5102 Ops/s $\color{#35bf28}+1.48\%$
test_serialize_weights 0.1174s 0.1129s 8.8559 Ops/s 8.7591 Ops/s $\color{#35bf28}+1.11\%$
test_serialize_weights_returnearly 0.1800s 0.1551s 6.4475 Ops/s 6.2093 Ops/s $\color{#35bf28}+3.84\%$
test_serialize_weights_pickle 1.1830s 0.7425s 1.3468 Ops/s 2.2347 Ops/s $\textbf{\color{#d91a1a}-39.73\%}$
test_serialize_weights_filesystem 0.1518s 0.1436s 6.9619 Ops/s 6.4377 Ops/s $\textbf{\color{#35bf28}+8.14\%}$
test_serialize_model_filesystem 0.2505s 0.1565s 6.3916 Ops/s 6.6370 Ops/s $\color{#d91a1a}-3.70\%$
test_reshape_pytree 70.6320μs 26.9656μs 37.0843 KOps/s 37.7827 KOps/s $\color{#d91a1a}-1.85\%$
test_reshape_td 66.3030μs 32.3685μs 30.8942 KOps/s 31.6640 KOps/s $\color{#d91a1a}-2.43\%$
test_view_pytree 68.7980μs 27.1409μs 36.8448 KOps/s 38.1008 KOps/s $\color{#d91a1a}-3.30\%$
test_view_td 79.9090μs 38.5424μs 25.9454 KOps/s 27.1742 KOps/s $\color{#d91a1a}-4.52\%$
test_unbind_pytree 86.0200μs 30.1755μs 33.1394 KOps/s 34.1677 KOps/s $\color{#d91a1a}-3.01\%$
test_unbind_td 0.3546ms 38.2840μs 26.1206 KOps/s 26.3039 KOps/s $\color{#d91a1a}-0.70\%$
test_split_pytree 65.5120μs 30.0858μs 33.2383 KOps/s 34.5185 KOps/s $\color{#d91a1a}-3.71\%$
test_split_td 0.2000ms 43.6209μs 22.9248 KOps/s 22.5244 KOps/s $\color{#35bf28}+1.78\%$
test_add_pytree 80.7600μs 35.4442μs 28.2133 KOps/s 27.9739 KOps/s $\color{#35bf28}+0.86\%$
test_add_td 0.1838ms 53.9451μs 18.5374 KOps/s 18.1559 KOps/s $\color{#35bf28}+2.10\%$
test_compile_add_one_nested[tensordict-compile] 0.1180ms 61.0436μs 16.3817 KOps/s 16.3273 KOps/s $\color{#35bf28}+0.33\%$
test_compile_add_one_nested[tensordict-eager] 0.3767ms 0.1602ms 6.2406 KOps/s 6.2716 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_add_one_nested[pytree-compile] 0.1186ms 45.4754μs 21.9899 KOps/s 22.5049 KOps/s $\color{#d91a1a}-2.29\%$
test_compile_add_one_nested[pytree-eager] 0.2555ms 0.1197ms 8.3560 KOps/s 8.5603 KOps/s $\color{#d91a1a}-2.39\%$
test_compile_copy_nested[tensordict-compile] 71.4930μs 25.9722μs 38.5027 KOps/s 37.7693 KOps/s $\color{#35bf28}+1.94\%$
test_compile_copy_nested[tensordict-eager] 0.1158ms 53.5867μs 18.6613 KOps/s 18.5441 KOps/s $\color{#35bf28}+0.63\%$
test_compile_copy_nested[pytree-compile] 0.1700ms 78.3562μs 12.7622 KOps/s 12.7709 KOps/s $\color{#d91a1a}-0.07\%$
test_compile_copy_nested[pytree-eager] 0.1334ms 67.5519μs 14.8034 KOps/s 14.8080 KOps/s $\color{#d91a1a}-0.03\%$
test_compile_add_one_flat[tensordict-compile] 0.1926ms 0.1045ms 9.5722 KOps/s 9.7278 KOps/s $\color{#d91a1a}-1.60\%$
test_compile_add_one_flat[tensordict-eager] 0.2732ms 0.2001ms 4.9981 KOps/s 5.0351 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_add_one_flat[tensorclass-compile] 0.1228ms 44.4211μs 22.5118 KOps/s 22.6403 KOps/s $\color{#d91a1a}-0.57\%$
test_compile_add_one_flat[tensorclass-eager] 0.4950ms 60.8647μs 16.4299 KOps/s 16.4551 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_add_one_flat[pytree-compile] 0.1815ms 0.1030ms 9.7132 KOps/s 9.8997 KOps/s $\color{#d91a1a}-1.88\%$
test_compile_add_one_flat[pytree-eager] 0.3680ms 0.2038ms 4.9074 KOps/s 4.9526 KOps/s $\color{#d91a1a}-0.91\%$
test_compile_add_self_flat[tensordict-eager] 0.3982ms 0.2099ms 4.7642 KOps/s 4.7882 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_add_self_flat[tensordict-compile] 0.2084ms 0.1069ms 9.3558 KOps/s 9.4292 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_add_self_flat[tensorclass-eager] 0.2345ms 52.6583μs 18.9904 KOps/s 18.8650 KOps/s $\color{#35bf28}+0.66\%$
test_compile_add_self_flat[tensorclass-compile] 0.1076ms 45.4351μs 22.0094 KOps/s 22.0112 KOps/s $-0.01\%$
test_compile_add_self_flat[pytree-eager] 0.5911ms 0.1612ms 6.2050 KOps/s 6.3089 KOps/s $\color{#d91a1a}-1.65\%$
test_compile_add_self_flat[pytree-compile] 0.1907ms 0.1041ms 9.6060 KOps/s 9.8073 KOps/s $\color{#d91a1a}-2.05\%$
test_compile_copy_flat[tensordict-compile] 54.7720μs 20.9857μs 47.6515 KOps/s 48.6865 KOps/s $\color{#d91a1a}-2.13\%$
test_compile_copy_flat[tensordict-eager] 0.1140ms 58.3190μs 17.1471 KOps/s 16.7317 KOps/s $\color{#35bf28}+2.48\%$
test_compile_copy_flat[pytree-compile] 0.1512ms 80.9231μs 12.3574 KOps/s 12.3844 KOps/s $\color{#d91a1a}-0.22\%$
test_compile_copy_flat[pytree-eager] 0.1278ms 70.5547μs 14.1734 KOps/s 14.6065 KOps/s $\color{#d91a1a}-2.97\%$
test_compile_assign_and_add[tensordict-compile] 0.3006ms 0.2099ms 4.7653 KOps/s 4.9228 KOps/s $\color{#d91a1a}-3.20\%$
test_compile_assign_and_add[tensordict-eager] 1.3608ms 1.2426ms 804.7553 Ops/s 794.1810 Ops/s $\color{#35bf28}+1.33\%$
test_compile_assign_and_add[pytree-compile] 0.2822ms 0.2027ms 4.9339 KOps/s 5.0445 KOps/s $\color{#d91a1a}-2.19\%$
test_compile_assign_and_add[pytree-eager] 0.8619ms 0.7767ms 1.2875 KOps/s 1.2963 KOps/s $\color{#d91a1a}-0.68\%$
test_compile_assign_and_add_stack[compile] 0.8087ms 0.4555ms 2.1953 KOps/s 2.2377 KOps/s $\color{#d91a1a}-1.90\%$
test_compile_assign_and_add_stack[eager] 2.7034ms 2.4826ms 402.8013 Ops/s 380.4091 Ops/s $\textbf{\color{#35bf28}+5.89\%}$
test_compile_indexing[tensor-tensordict-compile] 98.5630μs 35.4647μs 28.1971 KOps/s 27.7918 KOps/s $\color{#35bf28}+1.46\%$
test_compile_indexing[tensor-tensordict-eager] 0.7250ms 32.3670μs 30.8957 KOps/s 30.6879 KOps/s $\color{#35bf28}+0.68\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1000ms 28.6510μs 34.9028 KOps/s 35.1230 KOps/s $\color{#d91a1a}-0.63\%$
test_compile_indexing[tensor-tensorclass-eager] 86.0700μs 23.0782μs 43.3309 KOps/s 43.5187 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_indexing[tensor-pytree-compile] 78.1150μs 29.5036μs 33.8941 KOps/s 34.2947 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_indexing[tensor-pytree-eager] 99.7360μs 23.1433μs 43.2090 KOps/s 43.5740 KOps/s $\color{#d91a1a}-0.84\%$
test_compile_indexing[slice-tensordict-compile] 0.1268ms 50.8813μs 19.6536 KOps/s 19.6054 KOps/s $\color{#35bf28}+0.25\%$
test_compile_indexing[slice-tensordict-eager] 0.5874ms 19.4789μs 51.3375 KOps/s 52.3084 KOps/s $\color{#d91a1a}-1.86\%$
test_compile_indexing[slice-tensorclass-compile] 87.2120μs 44.5315μs 22.4560 KOps/s 22.9969 KOps/s $\color{#d91a1a}-2.35\%$
test_compile_indexing[slice-tensorclass-eager] 93.5440μs 19.0228μs 52.5685 KOps/s 53.0186 KOps/s $\color{#d91a1a}-0.85\%$
test_compile_indexing[slice-pytree-compile] 0.1202ms 45.6145μs 21.9229 KOps/s 22.4914 KOps/s $\color{#d91a1a}-2.53\%$
test_compile_indexing[slice-pytree-eager] 55.8640μs 18.9196μs 52.8552 KOps/s 54.0902 KOps/s $\color{#d91a1a}-2.28\%$
test_compile_indexing[int-tensordict-compile] 0.1249ms 51.5426μs 19.4014 KOps/s 19.2942 KOps/s $\color{#35bf28}+0.56\%$
test_compile_indexing[int-tensordict-eager] 0.9919ms 19.2224μs 52.0228 KOps/s 52.6676 KOps/s $\color{#d91a1a}-1.22\%$
test_compile_indexing[int-tensorclass-compile] 93.7750μs 45.2785μs 22.0856 KOps/s 22.3264 KOps/s $\color{#d91a1a}-1.08\%$
test_compile_indexing[int-tensorclass-eager] 83.9170μs 18.7238μs 53.4081 KOps/s 54.1108 KOps/s $\color{#d91a1a}-1.30\%$
test_compile_indexing[int-pytree-compile] 0.1236ms 45.3106μs 22.0699 KOps/s 22.2616 KOps/s $\color{#d91a1a}-0.86\%$
test_compile_indexing[int-pytree-eager] 87.4930μs 18.9627μs 52.7351 KOps/s 54.3177 KOps/s $\color{#d91a1a}-2.91\%$
test_mod_add[eager] 79.7190μs 24.9932μs 40.0108 KOps/s 38.1178 KOps/s $\color{#35bf28}+4.97\%$
test_mod_add[compile] 93.4840μs 44.8017μs 22.3206 KOps/s 22.5489 KOps/s $\color{#d91a1a}-1.01\%$
test_mod_add[compile-overhead] 96.7000μs 44.4562μs 22.4941 KOps/s 22.6184 KOps/s $\color{#d91a1a}-0.55\%$
test_mod_wrap[eager] 0.3646ms 0.2077ms 4.8137 KOps/s 4.7064 KOps/s $\color{#35bf28}+2.28\%$
test_mod_wrap[compile] 1.9884ms 0.2014ms 4.9661 KOps/s 4.9556 KOps/s $\color{#35bf28}+0.21\%$
test_mod_wrap[compile-overhead] 1.9420ms 0.2031ms 4.9231 KOps/s 4.9007 KOps/s $\color{#35bf28}+0.46\%$
test_mod_wrap_and_backward[eager] 15.8178ms 12.0110ms 83.2574 Ops/s 80.4709 Ops/s $\color{#35bf28}+3.46\%$
test_mod_wrap_and_backward[compile] 18.4565ms 13.0104ms 76.8617 Ops/s 76.3013 Ops/s $\color{#35bf28}+0.73\%$
test_mod_wrap_and_backward[compile-overhead] 19.3093ms 13.5721ms 73.6804 Ops/s 82.8470 Ops/s $\textbf{\color{#d91a1a}-11.06\%}$
test_seq_add[eager] 0.1739ms 89.8614μs 11.1282 KOps/s 10.4934 KOps/s $\textbf{\color{#35bf28}+6.05\%}$
test_seq_add[compile] 0.1200ms 59.7173μs 16.7456 KOps/s 16.6452 KOps/s $\color{#35bf28}+0.60\%$
test_seq_add[compile-overhead] 0.1691ms 58.8444μs 16.9940 KOps/s 16.8985 KOps/s $\color{#35bf28}+0.56\%$
test_seq_wrap[eager] 0.6373ms 0.3717ms 2.6905 KOps/s 2.5546 KOps/s $\textbf{\color{#35bf28}+5.32\%}$
test_seq_wrap[compile] 0.4100ms 0.2223ms 4.4983 KOps/s 4.5248 KOps/s $\color{#d91a1a}-0.59\%$
test_seq_wrap[compile-overhead] 0.3943ms 0.2220ms 4.5039 KOps/s 4.4812 KOps/s $\color{#35bf28}+0.51\%$
test_func_call_runtime[False-eager] 0.8542ms 0.5480ms 1.8248 KOps/s 1.8508 KOps/s $\color{#d91a1a}-1.41\%$
test_func_call_runtime[False-compile] 0.8585ms 0.4243ms 2.3568 KOps/s 2.3884 KOps/s $\color{#d91a1a}-1.32\%$
test_func_call_runtime[False-compile-overhead] 0.5416ms 0.4248ms 2.3541 KOps/s 2.3759 KOps/s $\color{#d91a1a}-0.92\%$
test_func_call_runtime[True-eager] 1.0400ms 0.7579ms 1.3194 KOps/s 1.3159 KOps/s $\color{#35bf28}+0.27\%$
test_func_call_runtime[True-compile] 0.6152ms 0.4592ms 2.1776 KOps/s 2.1719 KOps/s $\color{#35bf28}+0.26\%$
test_func_call_runtime[True-compile-overhead] 0.6362ms 0.4696ms 2.1293 KOps/s 2.1629 KOps/s $\color{#d91a1a}-1.56\%$
test_func_call_cm_runtime[False-eager] 0.7171ms 0.5430ms 1.8415 KOps/s 1.8462 KOps/s $\color{#d91a1a}-0.26\%$
test_func_call_cm_runtime[False-compile] 0.5478ms 0.4266ms 2.3441 KOps/s 2.3851 KOps/s $\color{#d91a1a}-1.72\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5906ms 0.4270ms 2.3420 KOps/s 2.3944 KOps/s $\color{#d91a1a}-2.19\%$
test_func_call_cm_runtime[True-eager] 1.0721ms 0.8916ms 1.1216 KOps/s 1.1247 KOps/s $\color{#d91a1a}-0.27\%$
test_func_call_cm_runtime[True-compile] 0.6600ms 0.4895ms 2.0430 KOps/s 2.0708 KOps/s $\color{#d91a1a}-1.34\%$
test_func_call_cm_runtime[True-compile-overhead] 0.7783ms 0.4927ms 2.0295 KOps/s 2.0754 KOps/s $\color{#d91a1a}-2.21\%$
test_vmap_func_call_cm_runtime[eager] 2.7004ms 1.9425ms 514.8063 Ops/s 521.7696 Ops/s $\color{#d91a1a}-1.33\%$
test_vmap_func_call_cm_runtime[compile] 0.8765ms 0.5175ms 1.9322 KOps/s 1.9401 KOps/s $\color{#d91a1a}-0.40\%$
test_vmap_func_call_cm_runtime[compile-overhead] 1.0200ms 0.5205ms 1.9214 KOps/s 1.9618 KOps/s $\color{#d91a1a}-2.06\%$
test_distributed 0.4228ms 0.1295ms 7.7216 KOps/s 7.7573 KOps/s $\color{#d91a1a}-0.46\%$
test_tdmodule 62.3780μs 17.4953μs 57.1581 KOps/s 50.7500 KOps/s $\textbf{\color{#35bf28}+12.63\%}$
test_tdmodule_dispatch 70.1110μs 34.7485μs 28.7782 KOps/s 26.4395 KOps/s $\textbf{\color{#35bf28}+8.85\%}$
test_tdseq 58.4290μs 20.0946μs 49.7647 KOps/s 44.9112 KOps/s $\textbf{\color{#35bf28}+10.81\%}$
test_tdseq_dispatch 0.1191ms 40.6443μs 24.6037 KOps/s 23.3835 KOps/s $\textbf{\color{#35bf28}+5.22\%}$
test_instantiation_functorch 2.1139ms 1.5317ms 652.8832 Ops/s 661.1570 Ops/s $\color{#d91a1a}-1.25\%$
test_exec_functorch 0.3121ms 0.1791ms 5.5824 KOps/s 5.6675 KOps/s $\color{#d91a1a}-1.50\%$
test_exec_functional_call 0.3894ms 0.1704ms 5.8701 KOps/s 5.7528 KOps/s $\color{#35bf28}+2.04\%$
test_exec_td_decorator 0.5618ms 0.2262ms 4.4214 KOps/s 4.4221 KOps/s $\color{#d91a1a}-0.02\%$
test_vmap_mlp_speed_decorator[True-True] 0.9589ms 0.6350ms 1.5748 KOps/s 1.5098 KOps/s $\color{#35bf28}+4.31\%$
test_vmap_mlp_speed_decorator[True-False] 0.9503ms 0.6262ms 1.5969 KOps/s 1.5525 KOps/s $\color{#35bf28}+2.86\%$
test_vmap_mlp_speed_decorator[False-True] 0.7506ms 0.5150ms 1.9416 KOps/s 1.9226 KOps/s $\color{#35bf28}+0.98\%$
test_vmap_mlp_speed_decorator[False-False] 0.7433ms 0.5162ms 1.9374 KOps/s 1.9033 KOps/s $\color{#35bf28}+1.79\%$
test_to_module_speed[True] 1.4509ms 1.2893ms 775.6408 Ops/s 763.3252 Ops/s $\color{#35bf28}+1.61\%$
test_to_module_speed[False] 1.3952ms 1.2610ms 793.0327 Ops/s 796.8724 Ops/s $\color{#d91a1a}-0.48\%$
test_tc_init 85.5790μs 45.8882μs 21.7921 KOps/s 21.7004 KOps/s $\color{#35bf28}+0.42\%$
test_tc_init_nested 0.1683ms 89.4738μs 11.1765 KOps/s 10.6694 KOps/s $\color{#35bf28}+4.75\%$
test_tc_first_layer_tensor 33.8330μs 1.4759μs 677.5609 KOps/s 627.1249 KOps/s $\textbf{\color{#35bf28}+8.04\%}$
test_tc_first_layer_nontensor 41.4370μs 4.6490μs 215.0997 KOps/s 210.1343 KOps/s $\color{#35bf28}+2.36\%$
test_tc_second_layer_tensor 41.7780μs 2.7740μs 360.4916 KOps/s 352.0511 KOps/s $\color{#35bf28}+2.40\%$
test_tc_second_layer_nontensor 28.8830μs 6.0326μs 165.7648 KOps/s 162.5791 KOps/s $\color{#35bf28}+1.96\%$
test_unbind 0.2767s 14.4453ms 69.2267 Ops/s 82.1446 Ops/s $\textbf{\color{#d91a1a}-15.73\%}$
test_full_like 13.1051ms 10.5778ms 94.5376 Ops/s 127.6903 Ops/s $\textbf{\color{#d91a1a}-25.96\%}$
test_zeros_like 5.4033ms 4.0175ms 248.9124 Ops/s 351.9781 Ops/s $\textbf{\color{#d91a1a}-29.28\%}$
test_ones_like 5.3278ms 4.3190ms 231.5372 Ops/s 305.2165 Ops/s $\textbf{\color{#d91a1a}-24.14\%}$
test_clone 9.6131ms 6.4196ms 155.7736 Ops/s 196.4489 Ops/s $\textbf{\color{#d91a1a}-20.71\%}$
test_squeeze 67.9860μs 11.7408μs 85.1730 KOps/s 84.7960 KOps/s $\color{#35bf28}+0.44\%$
test_unsqueeze 0.1606ms 86.9902μs 11.4955 KOps/s 11.6183 KOps/s $\color{#d91a1a}-1.06\%$
test_split 0.5586ms 0.1867ms 5.3554 KOps/s 5.4530 KOps/s $\color{#d91a1a}-1.79\%$
test_permute 0.3716ms 0.2165ms 4.6199 KOps/s 4.5542 KOps/s $\color{#35bf28}+1.44\%$
test_stack 37.3730ms 30.3290ms 32.9717 Ops/s 40.9682 Ops/s $\textbf{\color{#d91a1a}-19.52\%}$
test_cat 33.6046ms 30.0310ms 33.2990 Ops/s 41.3347 Ops/s $\textbf{\color{#d91a1a}-19.44\%}$
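
For readers checking the numbers, the Change column above is consistent with the relative difference between the Ops column and the Ops on Repo HEAD column. The sketch below is not the benchmark bot's actual code. The function names are hypothetical, and the 5% cutoff is an assumption inferred from which rows are rendered in bold; counting the bold rows gives the Improved and Worsened totals reported in the summary line.

```python
def percent_change(ops_pr: float, ops_head: float) -> float:
    """Relative throughput change of this PR versus Repo HEAD, in percent."""
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a row; the 5% threshold is an assumption, not a documented value."""
    if change_pct >= threshold:
        return "improved"
    if change_pct <= -threshold:
        return "worsened"
    return "within noise"


# (Ops, Ops on Repo HEAD) pairs copied from the CPU table above.
# Units cancel, so KOps/s or Ops/s can be used as long as both match.
rows = {
    "test_plain_set_nested": (58.0873, 55.2824),
    "test_lock_nested": (2.2801, 1.8538),
    "test_serialize_weights_pickle": (1.3468, 2.2347),
    "test_values": (955.7487, 976.0191),
}

for name, (ops_pr, ops_head) in rows.items():
    change = percent_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Because the table's Ops values are already rounded, a recomputed change can differ from the displayed one in the last digit.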


$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}22$. Worsened: $\large\color{#d91a1a}9$.

Name | Max | Mean | Ops | Ops on Repo HEAD | Change
test_plain_set_nested 32.2310μs 10.3500μs 96.6182 KOps/s 97.1439 KOps/s $\color{#d91a1a}-0.54\%$
test_plain_set_stack_nested 34.3510μs 10.4009μs 96.1459 KOps/s 97.0080 KOps/s $\color{#d91a1a}-0.89\%$
test_plain_set_nested_inplace 42.4710μs 11.2227μs 89.1054 KOps/s 89.6670 KOps/s $\color{#d91a1a}-0.63\%$
test_plain_set_stack_nested_inplace 63.1310μs 11.2195μs 89.1309 KOps/s 89.9676 KOps/s $\color{#d91a1a}-0.93\%$
test_items 29.1000μs 2.9216μs 342.2834 KOps/s 342.1267 KOps/s $\color{#35bf28}+0.05\%$
test_items_nested 0.3680ms 0.3221ms 3.1042 KOps/s 3.1299 KOps/s $\color{#d91a1a}-0.82\%$
test_items_nested_locked 0.3588ms 0.3245ms 3.0817 KOps/s 3.0995 KOps/s $\color{#d91a1a}-0.58\%$
test_items_nested_leaf 0.1000ms 58.5510μs 17.0791 KOps/s 17.1668 KOps/s $\color{#d91a1a}-0.51\%$
test_items_stack_nested 0.3660ms 0.3237ms 3.0895 KOps/s 3.1421 KOps/s $\color{#d91a1a}-1.67\%$
test_items_stack_nested_leaf 86.2220μs 58.2802μs 17.1585 KOps/s 16.8202 KOps/s $\color{#35bf28}+2.01\%$
test_items_stack_nested_locked 0.3992ms 0.3248ms 3.0790 KOps/s 3.1058 KOps/s $\color{#d91a1a}-0.87\%$
test_keys 26.0810μs 3.4911μs 286.4395 KOps/s 291.0012 KOps/s $\color{#d91a1a}-1.57\%$
test_keys_nested 99.8720μs 70.6850μs 14.1473 KOps/s 14.2529 KOps/s $\color{#d91a1a}-0.74\%$
test_keys_nested_locked 0.7585ms 75.7069μs 13.2088 KOps/s 13.2263 KOps/s $\color{#d91a1a}-0.13\%$
test_keys_nested_leaf 0.1729ms 60.9861μs 16.3972 KOps/s 16.3420 KOps/s $\color{#35bf28}+0.34\%$
test_keys_stack_nested 0.1196ms 69.5354μs 14.3812 KOps/s 14.2470 KOps/s $\color{#35bf28}+0.94\%$
test_keys_stack_nested_leaf 93.3320μs 61.1799μs 16.3452 KOps/s 16.2869 KOps/s $\color{#35bf28}+0.36\%$
test_keys_stack_nested_locked 0.1139ms 74.8776μs 13.3551 KOps/s 13.2617 KOps/s $\color{#35bf28}+0.70\%$
test_values 11.5985μs 0.8474μs 1.1801 MOps/s 1.1874 MOps/s $\color{#d91a1a}-0.61\%$
test_values_nested 61.0710μs 31.1214μs 32.1322 KOps/s 31.9960 KOps/s $\color{#35bf28}+0.43\%$
test_values_nested_locked 74.2620μs 32.7650μs 30.5204 KOps/s 30.5716 KOps/s $\color{#d91a1a}-0.17\%$
test_values_nested_leaf 69.0520μs 33.8453μs 29.5462 KOps/s 29.6020 KOps/s $\color{#d91a1a}-0.19\%$
test_values_stack_nested 62.8710μs 31.4036μs 31.8435 KOps/s 31.3535 KOps/s $\color{#35bf28}+1.56\%$
test_values_stack_nested_leaf 63.1310μs 33.9859μs 29.4240 KOps/s 28.8930 KOps/s $\color{#35bf28}+1.84\%$
test_values_stack_nested_locked 67.1620μs 33.4673μs 29.8799 KOps/s 29.9873 KOps/s $\color{#d91a1a}-0.36\%$
test_membership 1.6181μs 0.5118μs 1.9540 MOps/s 1.9716 MOps/s $\color{#d91a1a}-0.89\%$
test_membership_nested 20.5305μs 1.9332μs 517.2874 KOps/s 512.6604 KOps/s $\color{#35bf28}+0.90\%$
test_membership_nested_leaf 27.1305μs 1.9133μs 522.6499 KOps/s 526.0789 KOps/s $\color{#d91a1a}-0.65\%$
test_membership_stacked_nested 20.0900μs 2.0087μs 497.8388 KOps/s 500.1970 KOps/s $\color{#d91a1a}-0.47\%$
test_membership_stacked_nested_leaf 43.5910μs 2.0194μs 495.2010 KOps/s 500.0540 KOps/s $\color{#d91a1a}-0.97\%$
test_membership_nested_last 32.3610μs 2.8376μs 352.4112 KOps/s 352.5223 KOps/s $\color{#d91a1a}-0.03\%$
test_membership_nested_leaf_last 43.0300μs 2.8416μs 351.9121 KOps/s 349.8289 KOps/s $\color{#35bf28}+0.60\%$
test_membership_stacked_nested_last 35.3100μs 2.8641μs 349.1451 KOps/s 287.7714 KOps/s $\textbf{\color{#35bf28}+21.33\%}$
test_membership_stacked_nested_leaf_last 27.3300μs 2.9061μs 344.1049 KOps/s 288.0541 KOps/s $\textbf{\color{#35bf28}+19.46\%}$
test_nested_getleaf 42.6800μs 6.0566μs 165.1083 KOps/s 166.1058 KOps/s $\color{#d91a1a}-0.60\%$
test_nested_get 34.3300μs 5.7287μs 174.5608 KOps/s 175.5863 KOps/s $\color{#d91a1a}-0.58\%$
test_stacked_getleaf 29.5510μs 6.0539μs 165.1839 KOps/s 167.1651 KOps/s $\color{#d91a1a}-1.19\%$
test_stacked_get 35.6300μs 5.7177μs 174.8968 KOps/s 176.0139 KOps/s $\color{#d91a1a}-0.63\%$
test_nested_getitemleaf 36.1800μs 6.1008μs 163.9131 KOps/s 164.3498 KOps/s $\color{#d91a1a}-0.27\%$
test_nested_getitem 32.4300μs 5.7893μs 172.7334 KOps/s 173.3899 KOps/s $\color{#d91a1a}-0.38\%$
test_stacked_getitemleaf 29.1000μs 6.0709μs 164.7207 KOps/s 164.9075 KOps/s $\color{#d91a1a}-0.11\%$
test_stacked_getitem 69.2510μs 5.7448μs 174.0708 KOps/s 174.1637 KOps/s $\color{#d91a1a}-0.05\%$
test_lock_nested 9.5457ms 0.3791ms 2.6379 KOps/s 2.6935 KOps/s $\color{#d91a1a}-2.06\%$
test_lock_stack_nested 0.3768ms 0.3395ms 2.9459 KOps/s 2.9989 KOps/s $\color{#d91a1a}-1.77\%$
test_unlock_nested 0.6752ms 0.3093ms 3.2328 KOps/s 3.2793 KOps/s $\color{#d91a1a}-1.42\%$
test_unlock_stack_nested 0.3086ms 0.2770ms 3.6103 KOps/s 3.6726 KOps/s $\color{#d91a1a}-1.70\%$
test_flatten_speed 0.1081ms 74.1789μs 13.4809 KOps/s 13.7964 KOps/s $\color{#d91a1a}-2.29\%$
test_unflatten_speed 0.3450ms 0.3009ms 3.3231 KOps/s 3.3505 KOps/s $\color{#d91a1a}-0.82\%$
test_common_ops 1.7346ms 0.5890ms 1.6978 KOps/s 1.7160 KOps/s $\color{#d91a1a}-1.07\%$
test_creation 0.1873ms 1.5005μs 666.4657 KOps/s 670.3365 KOps/s $\color{#d91a1a}-0.58\%$
test_creation_empty 35.6110μs 6.9255μs 144.3930 KOps/s 144.5336 KOps/s $\color{#d91a1a}-0.10\%$
test_creation_nested_1 1.6604ms 8.5138μs 117.4562 KOps/s 118.0231 KOps/s $\color{#d91a1a}-0.48\%$
test_creation_nested_2 41.6510μs 11.0968μs 90.1161 KOps/s 90.7160 KOps/s $\color{#d91a1a}-0.66\%$
test_clone 56.6610μs 10.8333μs 92.3079 KOps/s 93.6760 KOps/s $\color{#d91a1a}-1.46\%$
test_getitem[int] 93.0885ms 16.6152μs 60.1859 KOps/s 93.9971 KOps/s $\textbf{\color{#d91a1a}-35.97\%}$
test_getitem[slice_int] 0.1050ms 20.8453μs 47.9724 KOps/s 48.7979 KOps/s $\color{#d91a1a}-1.69\%$
test_getitem[range] 0.1308ms 38.0212μs 26.3011 KOps/s 26.3155 KOps/s $\color{#d91a1a}-0.05\%$
test_getitem[tuple] 0.1039ms 18.4282μs 54.2648 KOps/s 55.8936 KOps/s $\color{#d91a1a}-2.91\%$
test_getitem[list] 0.2133ms 33.8039μs 29.5823 KOps/s 29.8347 KOps/s $\color{#d91a1a}-0.85\%$
test_setitem_dim[int] 37.7410μs 19.0237μs 52.5659 KOps/s 53.2697 KOps/s $\color{#d91a1a}-1.32\%$
test_setitem_dim[slice_int] 68.2710μs 37.8610μs 26.4124 KOps/s 26.3485 KOps/s $\color{#35bf28}+0.24\%$
test_setitem_dim[range] 78.3920μs 54.0727μs 18.4936 KOps/s 18.5517 KOps/s $\color{#d91a1a}-0.31\%$
test_setitem_dim[tuple] 55.4310μs 32.1995μs 31.0563 KOps/s 31.6621 KOps/s $\color{#d91a1a}-1.91\%$
test_setitem 0.1124ms 14.7890μs 67.6179 KOps/s 67.2338 KOps/s $\color{#35bf28}+0.57\%$
test_set 90.8020μs 14.4249μs 69.3245 KOps/s 70.2046 KOps/s $\color{#d91a1a}-1.25\%$
test_set_shared 1.4624ms 0.1464ms 6.8327 KOps/s 6.7860 KOps/s $\color{#35bf28}+0.69\%$
test_update 0.6642ms 16.3444μs 61.1831 KOps/s 60.5520 KOps/s $\color{#35bf28}+1.04\%$
test_update_nested 92.2110μs 21.6734μs 46.1395 KOps/s 46.6154 KOps/s $\color{#d91a1a}-1.02\%$
test_update__nested 0.9515ms 25.1611μs 39.7438 KOps/s 40.6233 KOps/s $\color{#d91a1a}-2.17\%$
test_set_nested 80.3620μs 15.5695μs 64.2281 KOps/s 65.4209 KOps/s $\color{#d91a1a}-1.82\%$
test_set_nested_new 90.9510μs 18.3208μs 54.5826 KOps/s 56.7252 KOps/s $\color{#d91a1a}-3.78\%$
test_select 89.8220μs 30.5272μs 32.7576 KOps/s 33.7320 KOps/s $\color{#d91a1a}-2.89\%$
test_select_nested 69.0210μs 43.7904μs 22.8361 KOps/s 23.1244 KOps/s $\color{#d91a1a}-1.25\%$
test_exclude_nested 93.8720μs 60.9021μs 16.4198 KOps/s 16.5164 KOps/s $\color{#d91a1a}-0.59\%$
test_empty[True] 0.3056ms 0.2657ms 3.7633 KOps/s 3.8199 KOps/s $\color{#d91a1a}-1.48\%$
test_empty[False] 4.1701μs 0.7900μs 1.2658 MOps/s 1.2861 MOps/s $\color{#d91a1a}-1.58\%$
test_to 88.0910μs 60.8981μs 16.4209 KOps/s 18.3387 KOps/s $\textbf{\color{#d91a1a}-10.46\%}$
test_to_nonblocking 0.1413ms 46.9530μs 21.2979 KOps/s 21.7349 KOps/s $\color{#d91a1a}-2.01\%$
test_unbind_speed 1.4043ms 0.2339ms 4.2749 KOps/s 4.2850 KOps/s $\color{#d91a1a}-0.24\%$
test_unbind_speed_stack0 0.2840ms 0.2314ms 4.3213 KOps/s 4.3627 KOps/s $\color{#d91a1a}-0.95\%$
test_unbind_speed_stack1 92.4879ms 0.6489ms 1.5412 KOps/s 1.5583 KOps/s $\color{#d91a1a}-1.10\%$
test_split 94.9225ms 1.7302ms 577.9564 Ops/s 570.0969 Ops/s $\color{#35bf28}+1.38\%$
test_chunk 95.4961ms 1.6014ms 624.4380 Ops/s 677.5376 Ops/s $\textbf{\color{#d91a1a}-7.84\%}$
test_consolidate[False-None] 2.6789ms 2.6148ms 382.4375 Ops/s 347.3850 Ops/s $\textbf{\color{#35bf28}+10.09\%}$
test_consolidate[default-None] 1.7554ms 1.6507ms 605.7921 Ops/s 592.0299 Ops/s $\color{#35bf28}+2.32\%$
test_consolidate[reduce-overhead-None] 1.8142ms 1.6946ms 590.1220 Ops/s 582.6146 Ops/s $\color{#35bf28}+1.29\%$
test_consolidate_njt[False-None] 6.7737ms 6.6155ms 151.1593 Ops/s 150.5161 Ops/s $\color{#35bf28}+0.43\%$
test_to[False-False-None] 1.8296ms 1.7550ms 569.8071 Ops/s 574.6702 Ops/s $\color{#d91a1a}-0.85\%$
test_to[True-False-None] 1.5543ms 1.3208ms 757.1037 Ops/s 765.3564 Ops/s $\color{#d91a1a}-1.08\%$
test_to[within-False-None] 0.2951s 5.2068ms 192.0570 Ops/s 246.4909 Ops/s $\textbf{\color{#d91a1a}-22.08\%}$
test_to[True-default-None] 5.4639ms 5.3109ms 188.2914 Ops/s 186.6891 Ops/s $\color{#35bf28}+0.86\%$
test_to_njt[False-False-None] 8.0129ms 7.0549ms 141.7460 Ops/s 140.9537 Ops/s $\color{#35bf28}+0.56\%$
test_to_njt[True-False-None] 5.9617ms 5.6243ms 177.7984 Ops/s 178.1951 Ops/s $\color{#d91a1a}-0.22\%$
test_to_njt[within-False-None] 12.4829ms 12.3731ms 80.8204 Ops/s 80.0367 Ops/s $\color{#35bf28}+0.98\%$
test_creation[device0] 0.4568ms 78.8810μs 12.6773 KOps/s 12.5432 KOps/s $\color{#35bf28}+1.07\%$
test_creation_from_tensor 0.5177ms 85.4996μs 11.6960 KOps/s 11.9726 KOps/s $\color{#d91a1a}-2.31\%$
test_add_one[memmap_tensor0] 0.6161ms 6.8816μs 145.3143 KOps/s 145.3415 KOps/s $\color{#d91a1a}-0.02\%$
test_contiguous[memmap_tensor0] 1.7600μs 0.4177μs 2.3941 MOps/s 2.4038 MOps/s $\color{#d91a1a}-0.40\%$
test_stack[memmap_tensor0] 39.2610μs 4.4769μs 223.3669 KOps/s 226.6126 KOps/s $\color{#d91a1a}-1.43\%$
test_memmaptd_index 1.8065ms 0.2484ms 4.0253 KOps/s 4.0214 KOps/s $\color{#35bf28}+0.09\%$
test_memmaptd_index_astensor 0.5543ms 0.3081ms 3.2456 KOps/s 3.2563 KOps/s $\color{#d91a1a}-0.33\%$
test_memmaptd_index_op 0.9646ms 0.5786ms 1.7282 KOps/s 1.7185 KOps/s $\color{#35bf28}+0.56\%$
test_serialize_model 0.1315s 0.1306s 7.6558 Ops/s 7.6741 Ops/s $\color{#d91a1a}-0.24\%$
test_serialize_model_pickle 1.3460s 1.1845s 0.8442 Ops/s 0.8218 Ops/s $\color{#35bf28}+2.73\%$
test_serialize_weights 0.1303s 0.1296s 7.7163 Ops/s 7.7017 Ops/s $\color{#35bf28}+0.19\%$
test_serialize_weights_returnearly 0.2592s 62.3979ms 16.0262 Ops/s 23.2122 Ops/s $\textbf{\color{#d91a1a}-30.96\%}$
test_serialize_weights_pickle 1.3772s 1.2165s 0.8220 Ops/s 0.8225 Ops/s $\color{#d91a1a}-0.06\%$
test_reshape_pytree 50.0010μs 22.8245μs 43.8127 KOps/s 43.4044 KOps/s $\color{#35bf28}+0.94\%$
test_reshape_td 59.5510μs 26.8734μs 37.2115 KOps/s 35.8204 KOps/s $\color{#35bf28}+3.88\%$
test_view_pytree 62.4610μs 22.7171μs 44.0197 KOps/s 43.5067 KOps/s $\color{#35bf28}+1.18\%$
test_view_td 65.8820μs 30.4287μs 32.8637 KOps/s 30.9220 KOps/s $\textbf{\color{#35bf28}+6.28\%}$
test_unbind_pytree 61.8110μs 28.5560μs 35.0189 KOps/s 34.8319 KOps/s $\color{#35bf28}+0.54\%$
test_unbind_td 0.5436ms 35.4693μs 28.1934 KOps/s 27.0957 KOps/s $\color{#35bf28}+4.05\%$
test_split_pytree 62.8610μs 30.7845μs 32.4839 KOps/s 32.0312 KOps/s $\color{#35bf28}+1.41\%$
test_split_td 0.6426ms 39.0180μs 25.6292 KOps/s 25.4945 KOps/s $\color{#35bf28}+0.53\%$
test_add_pytree 80.6910μs 35.2154μs 28.3967 KOps/s 28.4277 KOps/s $\color{#d91a1a}-0.11\%$
test_add_td 84.9110μs 47.9794μs 20.8423 KOps/s 22.1682 KOps/s $\textbf{\color{#d91a1a}-5.98\%}$
test_compile_add_one_nested[tensordict-compile] 0.1700ms 0.1187ms 8.4217 KOps/s 7.9639 KOps/s $\textbf{\color{#35bf28}+5.75\%}$
test_compile_add_one_nested[tensordict-eager] 0.2189ms 0.1267ms 7.8948 KOps/s 7.8842 KOps/s $\color{#35bf28}+0.13\%$
test_compile_add_one_nested[pytree-compile] 0.1458ms 97.0497μs 10.3040 KOps/s 10.0585 KOps/s $\color{#35bf28}+2.44\%$
test_compile_add_one_nested[pytree-eager] 0.7738ms 0.1531ms 6.5321 KOps/s 6.4911 KOps/s $\color{#35bf28}+0.63\%$
test_compile_copy_nested[tensordict-compile] 50.3410μs 23.3917μs 42.7503 KOps/s 32.4287 KOps/s $\textbf{\color{#35bf28}+31.83\%}$
test_compile_copy_nested[tensordict-eager] 56.8410μs 27.7052μs 36.0942 KOps/s 34.8009 KOps/s $\color{#35bf28}+3.72\%$
test_compile_copy_nested[pytree-compile] 0.2799ms 65.9043μs 15.1735 KOps/s 15.1316 KOps/s $\color{#35bf28}+0.28\%$
test_compile_copy_nested[pytree-eager] 83.6220μs 50.7019μs 19.7231 KOps/s 19.8810 KOps/s $\color{#d91a1a}-0.79\%$
test_compile_add_one_flat[tensordict-compile] 0.1823ms 0.1433ms 6.9768 KOps/s 6.8178 KOps/s $\color{#35bf28}+2.33\%$
test_compile_add_one_flat[tensordict-eager] 0.2955ms 0.2083ms 4.8005 KOps/s 4.7902 KOps/s $\color{#35bf28}+0.22\%$
test_compile_add_one_flat[tensorclass-compile] 0.1627ms 98.9514μs 10.1060 KOps/s 9.9306 KOps/s $\color{#35bf28}+1.77\%$
test_compile_add_one_flat[tensorclass-eager] 0.1078ms 52.1493μs 19.1757 KOps/s 19.3985 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_add_one_flat[pytree-compile] 0.1856ms 0.1343ms 7.4438 KOps/s 7.1377 KOps/s $\color{#35bf28}+4.29\%$
test_compile_add_one_flat[pytree-eager] 0.6445ms 0.4979ms 2.0085 KOps/s 2.0310 KOps/s $\color{#d91a1a}-1.11\%$
test_compile_add_self_flat[tensordict-eager] 0.3631ms 0.2473ms 4.0429 KOps/s 3.9858 KOps/s $\color{#35bf28}+1.43\%$
test_compile_add_self_flat[tensordict-compile] 0.1881ms 0.1426ms 7.0132 KOps/s 6.8787 KOps/s $\color{#35bf28}+1.95\%$
test_compile_add_self_flat[tensorclass-eager] 0.1390ms 63.8347μs 15.6655 KOps/s 15.9192 KOps/s $\color{#d91a1a}-1.59\%$
test_compile_add_self_flat[tensorclass-compile] 0.1620ms 98.8160μs 10.1198 KOps/s 9.9274 KOps/s $\color{#35bf28}+1.94\%$
test_compile_add_self_flat[pytree-eager] 0.4637ms 0.4222ms 2.3683 KOps/s 2.4094 KOps/s $\color{#d91a1a}-1.71\%$
test_compile_add_self_flat[pytree-compile] 0.1799ms 0.1360ms 7.3541 KOps/s 7.2381 KOps/s $\color{#35bf28}+1.60\%$
test_compile_copy_flat[tensordict-compile] 0.1455ms 18.5325μs 53.9592 KOps/s 38.9116 KOps/s $\textbf{\color{#35bf28}+38.67\%}$
test_compile_copy_flat[tensordict-eager] 0.1425ms 28.3132μs 35.3192 KOps/s 33.7854 KOps/s $\color{#35bf28}+4.54\%$
test_compile_copy_flat[pytree-compile] 0.1034ms 70.4592μs 14.1926 KOps/s 14.1176 KOps/s $\color{#35bf28}+0.53\%$
test_compile_copy_flat[pytree-eager] 85.6310μs 51.8378μs 19.2909 KOps/s 19.5292 KOps/s $\color{#d91a1a}-1.22\%$
test_compile_assign_and_add[tensordict-compile] 1.7292ms 0.4145ms 2.4124 KOps/s 2.1998 KOps/s $\textbf{\color{#35bf28}+9.66\%}$
test_compile_assign_and_add[tensordict-eager] 3.2696ms 2.6291ms 380.3556 Ops/s 373.1304 Ops/s $\color{#35bf28}+1.94\%$
test_compile_assign_and_add[pytree-compile] 1.6631ms 0.3951ms 2.5310 KOps/s 2.2278 KOps/s $\textbf{\color{#35bf28}+13.61\%}$
test_compile_assign_and_add[pytree-eager] 2.8326ms 2.7236ms 367.1668 Ops/s 364.0755 Ops/s $\color{#35bf28}+0.85\%$
test_compile_indexing[tensor-tensordict-compile] 0.7022ms 0.1200ms 8.3328 KOps/s 8.4031 KOps/s $\color{#d91a1a}-0.84\%$
test_compile_indexing[tensor-tensordict-eager] 0.5583ms 82.5714μs 12.1107 KOps/s 11.3429 KOps/s $\textbf{\color{#35bf28}+6.77\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.6553ms 0.1135ms 8.8113 KOps/s 8.3911 KOps/s $\textbf{\color{#35bf28}+5.01\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.1783ms 70.6300μs 14.1583 KOps/s 13.1629 KOps/s $\textbf{\color{#35bf28}+7.56\%}$
test_compile_indexing[tensor-pytree-compile] 0.2427ms 0.1129ms 8.8545 KOps/s 8.9218 KOps/s $\color{#d91a1a}-0.75\%$
test_compile_indexing[tensor-pytree-eager] 0.1258ms 70.7552μs 14.1332 KOps/s 14.3222 KOps/s $\color{#d91a1a}-1.32\%$
test_compile_indexing[slice-tensordict-compile] 0.1612ms 0.1061ms 9.4237 KOps/s 9.6146 KOps/s $\color{#d91a1a}-1.98\%$
test_compile_indexing[slice-tensordict-eager] 0.1430ms 17.2841μs 57.8567 KOps/s 54.3894 KOps/s $\textbf{\color{#35bf28}+6.37\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1562ms 0.1003ms 9.9742 KOps/s 9.9989 KOps/s $\color{#d91a1a}-0.25\%$
test_compile_indexing[slice-tensorclass-eager] 57.3210μs 15.9305μs 62.7725 KOps/s 62.1915 KOps/s $\color{#35bf28}+0.93\%$
test_compile_indexing[slice-pytree-compile] 0.2154ms 0.1019ms 9.8171 KOps/s 9.9680 KOps/s $\color{#d91a1a}-1.51\%$
test_compile_indexing[slice-pytree-eager] 46.3700μs 15.9402μs 62.7345 KOps/s 61.4314 KOps/s $\color{#35bf28}+2.12\%$
test_compile_indexing[int-tensordict-compile] 0.1705ms 0.1045ms 9.5666 KOps/s 9.5939 KOps/s $\color{#d91a1a}-0.28\%$
test_compile_indexing[int-tensordict-eager] 0.6249ms 16.9876μs 58.8665 KOps/s 56.1519 KOps/s $\color{#35bf28}+4.83\%$
test_compile_indexing[int-tensorclass-compile] 0.2514ms 98.4203μs 10.1605 KOps/s 9.9760 KOps/s $\color{#35bf28}+1.85\%$
test_compile_indexing[int-tensorclass-eager] 47.9110μs 15.9304μs 62.7729 KOps/s 62.5164 KOps/s $\color{#35bf28}+0.41\%$
test_compile_indexing[int-pytree-compile] 0.2263ms 98.4201μs 10.1605 KOps/s 9.9621 KOps/s $\color{#35bf28}+1.99\%$
test_compile_indexing[int-pytree-eager] 67.0720μs 16.0605μs 62.2646 KOps/s 62.7801 KOps/s $\color{#d91a1a}-0.82\%$
test_mod_add[eager] 0.1719ms 31.6542μs 31.5914 KOps/s 31.1057 KOps/s $\color{#35bf28}+1.56\%$
test_mod_add[compile] 0.2104ms 77.8398μs 12.8469 KOps/s 12.6486 KOps/s $\color{#35bf28}+1.57\%$
test_mod_add[compile-overhead] 0.3125ms 0.1635ms 6.1163 KOps/s 5.6153 KOps/s $\textbf{\color{#35bf28}+8.92\%}$
test_mod_wrap[eager] 0.3281ms 0.2479ms 4.0346 KOps/s 3.8507 KOps/s $\color{#35bf28}+4.77\%$
test_mod_wrap[compile] 1.6322ms 0.2888ms 3.4623 KOps/s 3.4500 KOps/s $\color{#35bf28}+0.36\%$
test_mod_wrap[compile-overhead] 7.1532ms 3.8028ms 262.9620 Ops/s 265.7953 Ops/s $\color{#d91a1a}-1.07\%$
test_mod_wrap_and_backward[eager] 1.7308ms 1.3820ms 723.5758 Ops/s 676.1047 Ops/s $\textbf{\color{#35bf28}+7.02\%}$
test_mod_wrap_and_backward[compile] 1.6895ms 1.2958ms 771.7347 Ops/s 716.2925 Ops/s $\textbf{\color{#35bf28}+7.74\%}$
test_mod_wrap_and_backward[compile-overhead] 1.4159ms 0.9953ms 1.0047 KOps/s 931.5088 Ops/s $\textbf{\color{#35bf28}+7.86\%}$
test_seq_add[eager] 0.1521ms 96.6033μs 10.3516 KOps/s 10.0610 KOps/s $\color{#35bf28}+2.89\%$
test_seq_add[compile] 0.2776ms 88.9885μs 11.2374 KOps/s 11.1853 KOps/s $\color{#35bf28}+0.47\%$
test_seq_add[compile-overhead] 0.2807ms 0.1309ms 7.6366 KOps/s 7.5761 KOps/s $\color{#35bf28}+0.80\%$
test_seq_wrap[eager] 0.5311ms 0.3807ms 2.6266 KOps/s 2.5759 KOps/s $\color{#35bf28}+1.96\%$
test_seq_wrap[compile] 0.3562ms 0.3062ms 3.2656 KOps/s 3.2505 KOps/s $\color{#35bf28}+0.46\%$
test_seq_wrap[compile-overhead] 0.2721ms 0.2271ms 4.4036 KOps/s 4.3733 KOps/s $\color{#35bf28}+0.69\%$
test_func_call_runtime[False-eager] 1.0293ms 0.7907ms 1.2647 KOps/s 1.2971 KOps/s $\color{#d91a1a}-2.50\%$
test_func_call_runtime[False-compile] 1.0011ms 0.7717ms 1.2959 KOps/s 1.3031 KOps/s $\color{#d91a1a}-0.55\%$
test_func_call_runtime[False-compile-overhead] 0.4273ms 0.3675ms 2.7208 KOps/s 2.7254 KOps/s $\color{#d91a1a}-0.17\%$
test_func_call_runtime[True-eager] 1.2039ms 0.9153ms 1.0925 KOps/s 1.0216 KOps/s $\textbf{\color{#35bf28}+6.95\%}$
test_func_call_runtime[True-compile] 0.8339ms 0.7815ms 1.2796 KOps/s 1.2487 KOps/s $\color{#35bf28}+2.48\%$
test_func_call_runtime[True-compile-overhead] 0.4368ms 0.3896ms 2.5665 KOps/s 2.5780 KOps/s $\color{#d91a1a}-0.45\%$
test_func_call_cm_runtime[False-eager] 0.8337ms 0.7534ms 1.3272 KOps/s 1.2409 KOps/s $\textbf{\color{#35bf28}+6.96\%}$
test_func_call_cm_runtime[False-compile] 0.8455ms 0.7654ms 1.3065 KOps/s 1.2608 KOps/s $\color{#35bf28}+3.63\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4292ms 0.3699ms 2.7034 KOps/s 2.7090 KOps/s $\color{#d91a1a}-0.21\%$
test_func_call_cm_runtime[True-eager] 1.1682ms 1.0196ms 980.7826 Ops/s 950.0709 Ops/s $\color{#35bf28}+3.23\%$
test_func_call_cm_runtime[True-compile] 1.0293ms 0.8169ms 1.2241 KOps/s 1.2430 KOps/s $\color{#d91a1a}-1.52\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4809ms 0.4159ms 2.4042 KOps/s 2.3893 KOps/s $\color{#35bf28}+0.62\%$
test_vmap_func_call_cm_runtime[eager] 2.6272ms 2.1160ms 472.5935 Ops/s 469.9653 Ops/s $\color{#35bf28}+0.56\%$
test_vmap_func_call_cm_runtime[compile] 0.9748ms 0.8263ms 1.2102 KOps/s 1.2163 KOps/s $\color{#d91a1a}-0.50\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4810ms 0.4178ms 2.3932 KOps/s 2.3925 KOps/s $\color{#35bf28}+0.03\%$
test_distributed 3.2001ms 0.1723ms 5.8030 KOps/s 8.7105 KOps/s $\textbf{\color{#d91a1a}-33.38\%}$
test_tdmodule 32.3510μs 13.5113μs 74.0123 KOps/s 73.3333 KOps/s $\color{#35bf28}+0.93\%$
test_tdmodule_dispatch 0.6263ms 26.7055μs 37.4454 KOps/s 36.8491 KOps/s $\color{#35bf28}+1.62\%$
test_tdseq 50.1410μs 15.0267μs 66.5484 KOps/s 65.8946 KOps/s $\color{#35bf28}+0.99\%$
test_tdseq_dispatch 48.2410μs 29.3849μs 34.0310 KOps/s 33.2672 KOps/s $\color{#35bf28}+2.30\%$
test_instantiation_functorch 1.6370ms 1.5615ms 640.4117 Ops/s 636.7610 Ops/s $\color{#35bf28}+0.57\%$
test_exec_functorch 0.1773ms 0.1446ms 6.9162 KOps/s 6.8336 KOps/s $\color{#35bf28}+1.21\%$
test_exec_functional_call 0.1893ms 0.1404ms 7.1211 KOps/s 6.9502 KOps/s $\color{#35bf28}+2.46\%$
test_exec_td_decorator 0.3801ms 0.1857ms 5.3840 KOps/s 5.3395 KOps/s $\color{#35bf28}+0.83\%$
test_vmap_mlp_speed_decorator[True-True] 0.8207ms 0.6829ms 1.4644 KOps/s 1.4589 KOps/s $\color{#35bf28}+0.38\%$
test_vmap_mlp_speed_decorator[True-False] 0.8412ms 0.6828ms 1.4646 KOps/s 1.4560 KOps/s $\color{#35bf28}+0.59\%$
test_vmap_mlp_speed_decorator[False-True] 0.7197ms 0.6021ms 1.6610 KOps/s 1.6557 KOps/s $\color{#35bf28}+0.32\%$
test_vmap_mlp_speed_decorator[False-False] 0.7116ms 0.6016ms 1.6623 KOps/s 1.6532 KOps/s $\color{#35bf28}+0.55\%$
test_vmap_transformer_speed_decorator[True-True] 19.5453ms 19.4750ms 51.3479 Ops/s 51.4195 Ops/s $\color{#d91a1a}-0.14\%$
test_vmap_transformer_speed_decorator[True-False] 20.2083ms 19.4854ms 51.3206 Ops/s 51.1419 Ops/s $\color{#35bf28}+0.35\%$
test_vmap_transformer_speed_decorator[False-True] 19.5006ms 19.4003ms 51.5457 Ops/s 51.8681 Ops/s $\color{#d91a1a}-0.62\%$
test_vmap_transformer_speed_decorator[False-False] 19.4380ms 19.3602ms 51.6524 Ops/s 51.5775 Ops/s $\color{#35bf28}+0.15\%$
test_to_module_speed[True] 1.0541ms 0.9441ms 1.0593 KOps/s 1.0544 KOps/s $\color{#35bf28}+0.47\%$
test_to_module_speed[False] 1.4018ms 0.9316ms 1.0735 KOps/s 1.0730 KOps/s $\color{#35bf28}+0.05\%$
test_tc_init 70.6010μs 36.1690μs 27.6480 KOps/s 28.4691 KOps/s $\color{#d91a1a}-2.88\%$
test_tc_init_nested 0.2285ms 72.1450μs 13.8610 KOps/s 13.8736 KOps/s $\color{#d91a1a}-0.09\%$
test_tc_first_layer_tensor 4.4344μs 0.6998μs 1.4291 MOps/s 1.2431 MOps/s $\textbf{\color{#35bf28}+14.96\%}$
test_tc_first_layer_nontensor 24.8700μs 2.3690μs 422.1225 KOps/s 422.9950 KOps/s $\color{#d91a1a}-0.21\%$
test_tc_second_layer_tensor 7.1628μs 1.4251μs 701.6986 KOps/s 694.4544 KOps/s $\color{#35bf28}+1.04\%$
test_tc_second_layer_nontensor 28.7710μs 3.1113μs 321.4061 KOps/s 325.4738 KOps/s $\color{#d91a1a}-1.25\%$
test_unbind 0.2194s 9.6705ms 103.4077 Ops/s 150.1627 Ops/s $\textbf{\color{#d91a1a}-31.14\%}$
test_full_like 9.8562ms 9.4424ms 105.9048 Ops/s 104.8956 Ops/s $\color{#35bf28}+0.96\%$
test_zeros_like 4.9233ms 4.3394ms 230.4441 Ops/s 113.5344 Ops/s $\textbf{\color{#35bf28}+102.97\%}$
test_ones_like 9.2680ms 7.2738ms 137.4790 Ops/s 229.5147 Ops/s $\textbf{\color{#d91a1a}-40.10\%}$
test_clone 7.2199ms 6.6442ms 150.5073 Ops/s 151.1462 Ops/s $\color{#d91a1a}-0.42\%$
test_squeeze 59.3910μs 9.3824μs 106.5830 KOps/s 105.1477 KOps/s $\color{#35bf28}+1.37\%$
test_unsqueeze 0.1210ms 70.9858μs 14.0873 KOps/s 13.4441 KOps/s $\color{#35bf28}+4.78\%$
test_split 0.3853ms 0.1564ms 6.3926 KOps/s 6.0868 KOps/s $\textbf{\color{#35bf28}+5.02\%}$
test_permute 0.2258ms 0.1772ms 5.6444 KOps/s 5.6125 KOps/s $\color{#35bf28}+0.57\%$
test_stack 51.8320ms 51.3084ms 19.4900 Ops/s 19.4603 Ops/s $\color{#35bf28}+0.15\%$
test_cat 51.5221ms 51.0868ms 19.5745 Ops/s 19.3468 Ops/s $\color{#35bf28}+1.18\%$
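
For readers checking the numbers: the Change column appears to be the relative difference between the Ops value and the Ops on Repo HEAD value, and Ops appears to be the reciprocal of Mean. Both relations are inferred from the rows above, not taken from the benchmark tooling, and the helper names `ops_per_second` and `relative_change` are hypothetical. A minimal sketch that reproduces the `test_to` row:

```python
def ops_per_second(mean_seconds: float) -> float:
    """Throughput implied by a mean runtime (assumed relation: ops = 1 / mean)."""
    return 1.0 / mean_seconds


def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change of the PR throughput relative to repo HEAD (assumed formula)."""
    return (ops_pr - ops_head) / ops_head * 100.0


# test_to row: Mean 60.8981 µs, 16.4209 KOps/s on the PR vs 18.3387 KOps/s on HEAD.
test_to_kops = ops_per_second(60.8981e-6) / 1e3
test_to_change = relative_change(16.4209, 18.3387)

print(f"{test_to_kops:.4f} KOps/s, {test_to_change:+.2f}%")
```

Both Ops values must be in the same unit before they are compared, since the table mixes Ops/s, KOps/s, and MOps/s across rows.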
