Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Fix index in list error #627

Merged
merged 2 commits into from
Jan 17, 2024
Merged

[BugFix] Fix index in list error #627

merged 2 commits into from
Jan 17, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Jan 17, 2024

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 17, 2024
@vmoens vmoens added the bug Something isn't working label Jan 17, 2024
@vmoens vmoens merged commit 4225911 into main Jan 17, 2024
1 check passed
@vmoens vmoens deleted the fix-index-belongs branch January 17, 2024 20:59
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 124. Improved: $\large\color{#35bf28}25$. Worsened: $\large\color{#d91a1a}6$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 34.8150μs 15.6132μs 64.0486 KOps/s 59.7762 KOps/s $\textbf{\color{#35bf28}+7.15\%}$
test_plain_set_stack_nested 0.1973ms 0.1428ms 7.0034 KOps/s 7.0055 KOps/s $\color{#d91a1a}-0.03\%$
test_plain_set_nested_inplace 53.6300μs 17.9498μs 55.7109 KOps/s 51.6536 KOps/s $\textbf{\color{#35bf28}+7.85\%}$
test_plain_set_stack_nested_inplace 0.3004ms 0.1738ms 5.7551 KOps/s 5.7552 KOps/s $-0.00\%$
test_items 20.5090μs 2.3673μs 422.4234 KOps/s 397.8892 KOps/s $\textbf{\color{#35bf28}+6.17\%}$
test_items_nested 0.4785ms 0.2709ms 3.6913 KOps/s 3.7294 KOps/s $\color{#d91a1a}-1.02\%$
test_items_nested_locked 1.3337ms 0.2740ms 3.6494 KOps/s 3.7130 KOps/s $\color{#d91a1a}-1.71\%$
test_items_nested_leaf 0.3182ms 0.1689ms 5.9191 KOps/s 5.9312 KOps/s $\color{#d91a1a}-0.20\%$
test_items_stack_nested 1.5592ms 1.3098ms 763.4481 Ops/s 762.0299 Ops/s $\color{#35bf28}+0.19\%$
test_items_stack_nested_leaf 1.7705ms 1.1830ms 845.3040 Ops/s 849.5588 Ops/s $\color{#d91a1a}-0.50\%$
test_items_stack_nested_locked 1.1307ms 0.8636ms 1.1579 KOps/s 1.1332 KOps/s $\color{#35bf28}+2.18\%$
test_keys 39.9550μs 4.2813μs 233.5758 KOps/s 225.2095 KOps/s $\color{#35bf28}+3.71\%$
test_keys_nested 47.9578ms 0.1528ms 6.5432 KOps/s 6.8168 KOps/s $\color{#d91a1a}-4.01\%$
test_keys_nested_locked 0.2857ms 0.1505ms 6.6448 KOps/s 6.6339 KOps/s $\color{#35bf28}+0.16\%$
test_keys_nested_leaf 0.2341ms 0.1276ms 7.8378 KOps/s 7.8706 KOps/s $\color{#d91a1a}-0.42\%$
test_keys_stack_nested 1.5152ms 1.2610ms 793.0313 Ops/s 799.3916 Ops/s $\color{#d91a1a}-0.80\%$
test_keys_stack_nested_leaf 1.8986ms 1.2608ms 793.1274 Ops/s 802.0668 Ops/s $\color{#d91a1a}-1.11\%$
test_keys_stack_nested_locked 1.0386ms 0.8001ms 1.2499 KOps/s 1.2506 KOps/s $\color{#d91a1a}-0.06\%$
test_values 8.6863μs 1.1639μs 859.2084 KOps/s 860.6981 KOps/s $\color{#d91a1a}-0.17\%$
test_values_nested 93.1530μs 51.2233μs 19.5224 KOps/s 19.3696 KOps/s $\color{#35bf28}+0.79\%$
test_values_nested_locked 0.1023ms 51.5840μs 19.3859 KOps/s 19.3703 KOps/s $\color{#35bf28}+0.08\%$
test_values_nested_leaf 88.6940μs 45.6557μs 21.9031 KOps/s 21.8711 KOps/s $\color{#35bf28}+0.15\%$
test_values_stack_nested 1.5515ms 1.0243ms 976.2987 Ops/s 964.9300 Ops/s $\color{#35bf28}+1.18\%$
test_values_stack_nested_leaf 1.1395ms 1.0117ms 988.4400 Ops/s 966.1060 Ops/s $\color{#35bf28}+2.31\%$
test_values_stack_nested_locked 1.0228ms 0.5970ms 1.6750 KOps/s 1.6305 KOps/s $\color{#35bf28}+2.73\%$
test_membership 20.1970μs 1.3383μs 747.2223 KOps/s 740.5582 KOps/s $\color{#35bf28}+0.90\%$
test_membership_nested 33.1320μs 3.5250μs 283.6858 KOps/s 286.7071 KOps/s $\color{#d91a1a}-1.05\%$
test_membership_nested_leaf 36.9090μs 3.5319μs 283.1303 KOps/s 285.7772 KOps/s $\color{#d91a1a}-0.93\%$
test_membership_stacked_nested 47.5380μs 11.7402μs 85.1774 KOps/s 85.0715 KOps/s $\color{#35bf28}+0.12\%$
test_membership_stacked_nested_leaf 43.5710μs 11.7705μs 84.9583 KOps/s 77.9737 KOps/s $\textbf{\color{#35bf28}+8.96\%}$
test_membership_nested_last 38.5710μs 6.6687μs 149.9544 KOps/s 153.0159 KOps/s $\color{#d91a1a}-2.00\%$
test_membership_nested_leaf_last 47.8590μs 6.7086μs 149.0623 KOps/s 152.2065 KOps/s $\color{#d91a1a}-2.07\%$
test_membership_stacked_nested_last 0.2720ms 0.1739ms 5.7500 KOps/s 5.7983 KOps/s $\color{#d91a1a}-0.83\%$
test_membership_stacked_nested_leaf_last 46.4170μs 13.7727μs 72.6076 KOps/s 72.6214 KOps/s $\color{#d91a1a}-0.02\%$
test_nested_getleaf 39.3030μs 10.5728μs 94.5825 KOps/s 94.8940 KOps/s $\color{#d91a1a}-0.33\%$
test_nested_get 41.6270μs 10.1890μs 98.1453 KOps/s 100.0751 KOps/s $\color{#d91a1a}-1.93\%$
test_stacked_getleaf 0.5350ms 0.3895ms 2.5676 KOps/s 2.5607 KOps/s $\color{#35bf28}+0.27\%$
test_stacked_get 0.4385ms 0.3587ms 2.7879 KOps/s 2.7764 KOps/s $\color{#35bf28}+0.41\%$
test_nested_getitemleaf 35.0750μs 10.8377μs 92.2708 KOps/s 94.0008 KOps/s $\color{#d91a1a}-1.84\%$
test_nested_getitem 35.1860μs 10.1501μs 98.5208 KOps/s 99.5842 KOps/s $\color{#d91a1a}-1.07\%$
test_stacked_getitemleaf 0.7059ms 0.3965ms 2.5223 KOps/s 2.5260 KOps/s $\color{#d91a1a}-0.15\%$
test_stacked_getitem 0.4415ms 0.3605ms 2.7743 KOps/s 2.7701 KOps/s $\color{#35bf28}+0.15\%$
test_lock_nested 1.3090ms 0.3830ms 2.6111 KOps/s 2.5392 KOps/s $\color{#35bf28}+2.83\%$
test_lock_stack_nested 81.9961ms 6.1264ms 163.2273 Ops/s 163.2027 Ops/s $\color{#35bf28}+0.02\%$
test_unlock_nested 70.1113ms 0.4533ms 2.2058 KOps/s 2.5432 KOps/s $\textbf{\color{#d91a1a}-13.26\%}$
test_unlock_stack_nested 74.2966ms 5.7487ms 173.9524 Ops/s 170.4738 Ops/s $\color{#35bf28}+2.04\%$
test_flatten_speed 0.5785ms 0.3630ms 2.7547 KOps/s 2.6500 KOps/s $\color{#35bf28}+3.95\%$
test_unflatten_speed 0.7755ms 0.4598ms 2.1750 KOps/s 2.2373 KOps/s $\color{#d91a1a}-2.78\%$
test_common_ops 4.0232ms 0.6211ms 1.6099 KOps/s 1.4944 KOps/s $\textbf{\color{#35bf28}+7.73\%}$
test_creation 16.7710μs 1.8598μs 537.6808 KOps/s 539.2486 KOps/s $\color{#d91a1a}-0.29\%$
test_creation_empty 27.3110μs 7.6840μs 130.1407 KOps/s 100.8422 KOps/s $\textbf{\color{#35bf28}+29.05\%}$
test_creation_nested_1 36.2780μs 10.2679μs 97.3913 KOps/s 81.8226 KOps/s $\textbf{\color{#35bf28}+19.03\%}$
test_creation_nested_2 41.8480μs 13.4501μs 74.3490 KOps/s 64.2662 KOps/s $\textbf{\color{#35bf28}+15.69\%}$
test_clone 90.6680μs 12.6878μs 78.8156 KOps/s 77.6814 KOps/s $\color{#35bf28}+1.46\%$
test_getitem[int] 32.8210μs 10.8078μs 92.5261 KOps/s 89.9540 KOps/s $\color{#35bf28}+2.86\%$
test_getitem[slice_int] 73.0560μs 21.5204μs 46.4676 KOps/s 45.2269 KOps/s $\color{#35bf28}+2.74\%$
test_getitem[range] 82.3830μs 39.5709μs 25.2711 KOps/s 24.1704 KOps/s $\color{#35bf28}+4.55\%$
test_getitem[tuple] 49.3120μs 18.0878μs 55.2859 KOps/s 56.0170 KOps/s $\color{#d91a1a}-1.31\%$
test_getitem[list] 97.8620μs 35.1491μs 28.4502 KOps/s 28.6488 KOps/s $\color{#d91a1a}-0.69\%$
test_setitem_dim[int] 58.1380μs 26.5827μs 37.6185 KOps/s 35.1596 KOps/s $\textbf{\color{#35bf28}+6.99\%}$
test_setitem_dim[slice_int] 84.1460μs 51.5400μs 19.4024 KOps/s 18.2494 KOps/s $\textbf{\color{#35bf28}+6.32\%}$
test_setitem_dim[range] 0.1073ms 69.8568μs 14.3150 KOps/s 13.3373 KOps/s $\textbf{\color{#35bf28}+7.33\%}$
test_setitem_dim[tuple] 72.7050μs 40.5882μs 24.6377 KOps/s 23.1770 KOps/s $\textbf{\color{#35bf28}+6.30\%}$
test_setitem 0.1655ms 17.4904μs 57.1743 KOps/s 52.3415 KOps/s $\textbf{\color{#35bf28}+9.23\%}$
test_set 0.1744ms 16.7797μs 59.5959 KOps/s 54.2579 KOps/s $\textbf{\color{#35bf28}+9.84\%}$
test_set_shared 2.4815ms 0.1408ms 7.1033 KOps/s 7.2908 KOps/s $\color{#d91a1a}-2.57\%$
test_update 0.1641ms 18.1379μs 55.1333 KOps/s 47.2672 KOps/s $\textbf{\color{#35bf28}+16.64\%}$
test_update_nested 0.1865ms 25.3653μs 39.4239 KOps/s 34.5238 KOps/s $\textbf{\color{#35bf28}+14.19\%}$
test_set_nested 88.5650μs 18.7118μs 53.4421 KOps/s 49.1116 KOps/s $\textbf{\color{#35bf28}+8.82\%}$
test_set_nested_new 93.9360μs 22.3181μs 44.8066 KOps/s 41.2041 KOps/s $\textbf{\color{#35bf28}+8.74\%}$
test_select 0.1093ms 34.3972μs 29.0721 KOps/s 27.1627 KOps/s $\textbf{\color{#35bf28}+7.03\%}$
test_select_nested 0.1162ms 56.9340μs 17.5642 KOps/s 17.3644 KOps/s $\color{#35bf28}+1.15\%$
test_exclude_nested 0.1919ms 0.1077ms 9.2843 KOps/s 9.3543 KOps/s $\color{#d91a1a}-0.75\%$
test_empty[True] 0.8287ms 0.3179ms 3.1461 KOps/s 3.1576 KOps/s $\color{#d91a1a}-0.37\%$
test_empty[False] 4.8690μs 1.0350μs 966.1490 KOps/s 958.3110 KOps/s $\color{#35bf28}+0.82\%$
test_unbind_speed 0.5653ms 0.3084ms 3.2423 KOps/s 3.1652 KOps/s $\color{#35bf28}+2.43\%$
test_unbind_speed_stack0 75.5958ms 4.1071ms 243.4834 Ops/s 259.6939 Ops/s $\textbf{\color{#d91a1a}-6.24\%}$
test_unbind_speed_stack1 1.8875μs 0.6282μs 1.5919 MOps/s 1.5653 MOps/s $\color{#35bf28}+1.70\%$
test_split 67.3985ms 1.5638ms 639.4678 Ops/s 638.7274 Ops/s $\color{#35bf28}+0.12\%$
test_chunk 67.3932ms 1.5511ms 644.7004 Ops/s 647.7901 Ops/s $\color{#d91a1a}-0.48\%$
test_creation[device0] 0.1695ms 97.0448μs 10.3045 KOps/s 10.1912 KOps/s $\color{#35bf28}+1.11\%$
test_creation_from_tensor 3.5371ms 79.2895μs 12.6120 KOps/s 12.6861 KOps/s $\color{#d91a1a}-0.58\%$
test_add_one[memmap_tensor0] 0.2566ms 5.2495μs 190.4941 KOps/s 192.8306 KOps/s $\color{#d91a1a}-1.21\%$
test_contiguous[memmap_tensor0] 9.2480μs 0.6347μs 1.5756 MOps/s 1.5266 MOps/s $\color{#35bf28}+3.21\%$
test_stack[memmap_tensor0] 47.8690μs 3.5639μs 280.5885 KOps/s 287.5165 KOps/s $\color{#d91a1a}-2.41\%$
test_memmaptd_index 0.9405ms 0.2201ms 4.5430 KOps/s 4.5577 KOps/s $\color{#d91a1a}-0.32\%$
test_memmaptd_index_astensor 0.6580ms 0.2812ms 3.5558 KOps/s 3.5832 KOps/s $\color{#d91a1a}-0.76\%$
test_memmaptd_index_op 0.8762ms 0.5226ms 1.9136 KOps/s 1.8235 KOps/s $\color{#35bf28}+4.94\%$
test_serialize_model 0.1783s 0.1078s 9.2746 Ops/s 9.0296 Ops/s $\color{#35bf28}+2.71\%$
test_serialize_model_pickle 0.4464s 0.3751s 2.6663 Ops/s 2.6285 Ops/s $\color{#35bf28}+1.44\%$
test_serialize_weights 0.1607s 0.1044s 9.5799 Ops/s 9.3198 Ops/s $\color{#35bf28}+2.79\%$
test_serialize_weights_returnearly 0.2967s 0.1477s 6.7715 Ops/s 7.5164 Ops/s $\textbf{\color{#d91a1a}-9.91\%}$
test_serialize_weights_pickle 1.2004s 0.5916s 1.6904 Ops/s 2.4050 Ops/s $\textbf{\color{#d91a1a}-29.71\%}$
test_serialize_weights_filesystem 0.1701s 99.3698ms 10.0634 Ops/s 10.8661 Ops/s $\textbf{\color{#d91a1a}-7.39\%}$
test_serialize_model_filesystem 0.1039s 93.0616ms 10.7456 Ops/s 10.1238 Ops/s $\textbf{\color{#35bf28}+6.14\%}$
test_reshape_pytree 56.4540μs 22.8674μs 43.7304 KOps/s 43.7266 KOps/s $+0.01\%$
test_reshape_td 0.1135ms 30.0999μs 33.2227 KOps/s 34.2633 KOps/s $\color{#d91a1a}-3.04\%$
test_view_pytree 56.4650μs 22.8413μs 43.7803 KOps/s 44.2098 KOps/s $\color{#d91a1a}-0.97\%$
test_view_td 19.8370μs 4.8130μs 207.7685 KOps/s 200.0918 KOps/s $\color{#35bf28}+3.84\%$
test_unbind_pytree 56.5950μs 25.9390μs 38.5520 KOps/s 37.9573 KOps/s $\color{#35bf28}+1.57\%$
test_unbind_td 0.1324ms 49.2414μs 20.3081 KOps/s 20.0672 KOps/s $\color{#35bf28}+1.20\%$
test_split_pytree 52.4570μs 25.6722μs 38.9526 KOps/s 39.3508 KOps/s $\color{#d91a1a}-1.01\%$
test_split_td 0.5153ms 39.6188μs 25.2406 KOps/s 25.0672 KOps/s $\color{#35bf28}+0.69\%$
test_add_pytree 78.1050μs 31.9025μs 31.3455 KOps/s 31.7268 KOps/s $\color{#d91a1a}-1.20\%$
test_add_td 0.1012ms 45.2471μs 22.1008 KOps/s 21.0440 KOps/s $\textbf{\color{#35bf28}+5.02\%}$
test_distributed 0.1711ms 96.2625μs 10.3883 KOps/s 9.9906 KOps/s $\color{#35bf28}+3.98\%$
test_tdmodule 0.7989ms 21.3174μs 46.9101 KOps/s 45.8564 KOps/s $\color{#35bf28}+2.30\%$
test_tdmodule_dispatch 0.1885ms 36.5362μs 27.3701 KOps/s 25.3555 KOps/s $\textbf{\color{#35bf28}+7.95\%}$
test_tdseq 47.9390μs 22.6989μs 44.0549 KOps/s 39.1029 KOps/s $\textbf{\color{#35bf28}+12.66\%}$
test_tdseq_dispatch 0.1311ms 39.4281μs 25.3626 KOps/s 22.8855 KOps/s $\textbf{\color{#35bf28}+10.82\%}$
test_instantiation_functorch 2.0445ms 1.2895ms 775.4803 Ops/s 773.7715 Ops/s $\color{#35bf28}+0.22\%$
test_instantiation_td 1.7017ms 0.9847ms 1.0155 KOps/s 1.0018 KOps/s $\color{#35bf28}+1.38\%$
test_exec_functorch 0.3993ms 0.1578ms 6.3371 KOps/s 6.2261 KOps/s $\color{#35bf28}+1.78\%$
test_exec_functional_call 0.2308ms 0.1445ms 6.9182 KOps/s 6.7295 KOps/s $\color{#35bf28}+2.81\%$
test_exec_td 0.2293ms 0.1400ms 7.1421 KOps/s 6.8262 KOps/s $\color{#35bf28}+4.63\%$
test_exec_td_decorator 73.7824ms 0.1952ms 5.1220 KOps/s 5.5247 KOps/s $\textbf{\color{#d91a1a}-7.29\%}$
test_vmap_mlp_speed[True-True] 1.6672ms 0.8753ms 1.1425 KOps/s 1.1358 KOps/s $\color{#35bf28}+0.59\%$
test_vmap_mlp_speed[True-False] 0.6360ms 0.4630ms 2.1598 KOps/s 2.1119 KOps/s $\color{#35bf28}+2.27\%$
test_vmap_mlp_speed[False-True] 1.2536ms 0.7771ms 1.2868 KOps/s 1.2882 KOps/s $\color{#d91a1a}-0.11\%$
test_vmap_mlp_speed[False-False] 0.5772ms 0.3850ms 2.5973 KOps/s 2.5735 KOps/s $\color{#35bf28}+0.92\%$
test_vmap_mlp_speed_decorator[True-True] 3.0742ms 2.3084ms 433.2023 Ops/s 417.4956 Ops/s $\color{#35bf28}+3.76\%$
test_vmap_mlp_speed_decorator[True-False] 0.8070ms 0.5072ms 1.9715 KOps/s 1.7543 KOps/s $\textbf{\color{#35bf28}+12.38\%}$
test_vmap_mlp_speed_decorator[False-True] 2.5953ms 1.8974ms 527.0353 Ops/s 517.0464 Ops/s $\color{#35bf28}+1.93\%$
test_vmap_mlp_speed_decorator[False-False] 0.8352ms 0.3948ms 2.5329 KOps/s 2.4756 KOps/s $\color{#35bf28}+2.32\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 132. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}9$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1343ms 13.3995μs 74.6298 KOps/s 75.2388 KOps/s $\color{#d91a1a}-0.81\%$
test_plain_set_stack_nested 0.1438ms 0.1179ms 8.4835 KOps/s 8.5049 KOps/s $\color{#d91a1a}-0.25\%$
test_plain_set_nested_inplace 0.1098ms 15.1291μs 66.0976 KOps/s 68.1183 KOps/s $\color{#d91a1a}-2.97\%$
test_plain_set_stack_nested_inplace 0.1825ms 0.1446ms 6.9154 KOps/s 6.9063 KOps/s $\color{#35bf28}+0.13\%$
test_items 16.7600μs 4.7130μs 212.1795 KOps/s 209.6091 KOps/s $\color{#35bf28}+1.23\%$
test_items_nested 0.3745ms 0.3375ms 2.9633 KOps/s 2.9529 KOps/s $\color{#35bf28}+0.35\%$
test_items_nested_locked 0.3732ms 0.3398ms 2.9432 KOps/s 2.9319 KOps/s $\color{#35bf28}+0.38\%$
test_items_nested_leaf 0.2285ms 0.2005ms 4.9876 KOps/s 4.9477 KOps/s $\color{#35bf28}+0.81\%$
test_items_stack_nested 1.4110ms 1.2901ms 775.1402 Ops/s 771.9412 Ops/s $\color{#35bf28}+0.41\%$
test_items_stack_nested_leaf 1.5255ms 1.1589ms 862.9159 Ops/s 886.2536 Ops/s $\color{#d91a1a}-2.63\%$
test_items_stack_nested_locked 0.9709ms 0.8982ms 1.1134 KOps/s 1.0884 KOps/s $\color{#35bf28}+2.30\%$
test_keys 24.5610μs 4.5854μs 218.0815 KOps/s 208.9430 KOps/s $\color{#35bf28}+4.37\%$
test_keys_nested 0.8604ms 95.5110μs 10.4700 KOps/s 10.5463 KOps/s $\color{#d91a1a}-0.72\%$
test_keys_nested_locked 0.1381ms 99.4680μs 10.0535 KOps/s 10.2585 KOps/s $\color{#d91a1a}-2.00\%$
test_keys_nested_leaf 0.1820ms 79.4723μs 12.5830 KOps/s 12.7560 KOps/s $\color{#d91a1a}-1.36\%$
test_keys_stack_nested 1.1997ms 1.1379ms 878.8237 Ops/s 872.7241 Ops/s $\color{#35bf28}+0.70\%$
test_keys_stack_nested_leaf 1.2072ms 1.1349ms 881.1706 Ops/s 889.8940 Ops/s $\color{#d91a1a}-0.98\%$
test_keys_stack_nested_locked 0.7611ms 0.7278ms 1.3740 KOps/s 1.3640 KOps/s $\color{#35bf28}+0.73\%$
test_values 8.9870μs 1.8787μs 532.2757 KOps/s 527.6616 KOps/s $\color{#35bf28}+0.87\%$
test_values_nested 70.4730μs 45.3514μs 22.0500 KOps/s 21.9794 KOps/s $\color{#35bf28}+0.32\%$
test_values_nested_locked 69.5530μs 47.6840μs 20.9714 KOps/s 20.9177 KOps/s $\color{#35bf28}+0.26\%$
test_values_nested_leaf 60.6230μs 39.9716μs 25.0178 KOps/s 25.1214 KOps/s $\color{#d91a1a}-0.41\%$
test_values_stack_nested 0.9872ms 0.9448ms 1.0584 KOps/s 1.0515 KOps/s $\color{#35bf28}+0.66\%$
test_values_stack_nested_leaf 1.0123ms 0.9477ms 1.0552 KOps/s 1.0622 KOps/s $\color{#d91a1a}-0.66\%$
test_values_stack_nested_locked 0.7787ms 0.6075ms 1.6462 KOps/s 1.7011 KOps/s $\color{#d91a1a}-3.23\%$
test_membership 13.4266μs 0.9522μs 1.0502 MOps/s 1.0019 MOps/s $\color{#35bf28}+4.82\%$
test_membership_nested 31.8410μs 2.9309μs 341.1944 KOps/s 340.2931 KOps/s $\color{#35bf28}+0.26\%$
test_membership_nested_leaf 0.1161ms 2.9233μs 342.0800 KOps/s 338.5161 KOps/s $\color{#35bf28}+1.05\%$
test_membership_stacked_nested 45.9620μs 11.2571μs 88.8326 KOps/s 89.0357 KOps/s $\color{#d91a1a}-0.23\%$
test_membership_stacked_nested_leaf 29.9210μs 11.2353μs 89.0056 KOps/s 87.2619 KOps/s $\color{#35bf28}+2.00\%$
test_membership_nested_last 32.0910μs 5.3777μs 185.9527 KOps/s 185.0861 KOps/s $\color{#35bf28}+0.47\%$
test_membership_nested_leaf_last 35.1310μs 5.3865μs 185.6495 KOps/s 186.2031 KOps/s $\color{#d91a1a}-0.30\%$
test_membership_stacked_nested_last 0.1830ms 0.1447ms 6.9086 KOps/s 7.0111 KOps/s $\color{#d91a1a}-1.46\%$
test_membership_stacked_nested_leaf_last 47.5520μs 13.1833μs 75.8533 KOps/s 75.6682 KOps/s $\color{#35bf28}+0.24\%$
test_nested_getleaf 39.0320μs 8.5256μs 117.2941 KOps/s 119.1368 KOps/s $\color{#d91a1a}-1.55\%$
test_nested_get 33.9620μs 8.0612μs 124.0505 KOps/s 126.1438 KOps/s $\color{#d91a1a}-1.66\%$
test_stacked_getleaf 0.4487ms 0.3201ms 3.1241 KOps/s 3.1252 KOps/s $\color{#d91a1a}-0.04\%$
test_stacked_get 0.3082ms 0.2878ms 3.4742 KOps/s 3.4690 KOps/s $\color{#35bf28}+0.15\%$
test_nested_getitemleaf 29.7710μs 8.4993μs 117.6564 KOps/s 118.8374 KOps/s $\color{#d91a1a}-0.99\%$
test_nested_getitem 31.6420μs 8.0483μs 124.2505 KOps/s 125.5468 KOps/s $\color{#d91a1a}-1.03\%$
test_stacked_getitemleaf 0.4175ms 0.3199ms 3.1262 KOps/s 3.1297 KOps/s $\color{#d91a1a}-0.11\%$
test_stacked_getitem 0.4102ms 0.2891ms 3.4595 KOps/s 3.4946 KOps/s $\color{#d91a1a}-1.00\%$
test_lock_nested 4.1110ms 0.3994ms 2.5040 KOps/s 2.5365 KOps/s $\color{#d91a1a}-1.28\%$
test_lock_stack_nested 83.7256ms 6.2496ms 160.0093 Ops/s 159.4382 Ops/s $\color{#35bf28}+0.36\%$
test_unlock_nested 0.8009ms 0.3946ms 2.5342 KOps/s 2.5112 KOps/s $\color{#35bf28}+0.91\%$
test_unlock_stack_nested 82.6482ms 6.6414ms 150.5702 Ops/s 148.0358 Ops/s $\color{#35bf28}+1.71\%$
test_flatten_speed 0.3537ms 0.2636ms 3.7935 KOps/s 3.7745 KOps/s $\color{#35bf28}+0.50\%$
test_unflatten_speed 0.4476ms 0.3647ms 2.7419 KOps/s 2.7171 KOps/s $\color{#35bf28}+0.91\%$
test_common_ops 1.1056ms 0.6002ms 1.6661 KOps/s 1.6772 KOps/s $\color{#d91a1a}-0.66\%$
test_creation 19.7910μs 1.6046μs 623.2017 KOps/s 638.6522 KOps/s $\color{#d91a1a}-2.42\%$
test_creation_empty 59.6130μs 7.9608μs 125.6161 KOps/s 131.5113 KOps/s $\color{#d91a1a}-4.48\%$
test_creation_nested_1 42.6220μs 9.6420μs 103.7126 KOps/s 106.1070 KOps/s $\color{#d91a1a}-2.26\%$
test_creation_nested_2 30.5910μs 12.1207μs 82.5034 KOps/s 84.4323 KOps/s $\color{#d91a1a}-2.28\%$
test_clone 0.1289ms 13.4984μs 74.0827 KOps/s 72.9702 KOps/s $\color{#35bf28}+1.52\%$
test_getitem[int] 26.1910μs 10.6402μs 93.9835 KOps/s 93.3358 KOps/s $\color{#35bf28}+0.69\%$
test_getitem[slice_int] 42.8220μs 21.2161μs 47.1341 KOps/s 46.6294 KOps/s $\color{#35bf28}+1.08\%$
test_getitem[range] 0.1788ms 35.4076μs 28.2425 KOps/s 27.2225 KOps/s $\color{#35bf28}+3.75\%$
test_getitem[tuple] 44.2520μs 18.4156μs 54.3017 KOps/s 53.4835 KOps/s $\color{#35bf28}+1.53\%$
test_getitem[list] 0.1232ms 32.3723μs 30.8906 KOps/s 30.0321 KOps/s $\color{#35bf28}+2.86\%$
test_setitem_dim[int] 65.1020μs 26.1412μs 38.2539 KOps/s 38.2536 KOps/s $+0.00\%$
test_setitem_dim[slice_int] 68.7130μs 47.1037μs 21.2298 KOps/s 20.9880 KOps/s $\color{#35bf28}+1.15\%$
test_setitem_dim[range] 89.1640μs 59.6265μs 16.7711 KOps/s 16.3317 KOps/s $\color{#35bf28}+2.69\%$
test_setitem_dim[tuple] 61.8830μs 42.4951μs 23.5322 KOps/s 24.1082 KOps/s $\color{#d91a1a}-2.39\%$
test_setitem 0.1725ms 19.9805μs 50.0487 KOps/s 54.7383 KOps/s $\textbf{\color{#d91a1a}-8.57\%}$
test_set 0.1411ms 19.5453μs 51.1632 KOps/s 57.1370 KOps/s $\textbf{\color{#d91a1a}-10.46\%}$
test_set_shared 2.9059ms 0.1035ms 9.6647 KOps/s 9.6081 KOps/s $\color{#35bf28}+0.59\%$
test_update 0.1274ms 20.8058μs 48.0635 KOps/s 49.9387 KOps/s $\color{#d91a1a}-3.75\%$
test_update_nested 0.1582ms 29.8156μs 33.5394 KOps/s 37.5017 KOps/s $\textbf{\color{#d91a1a}-10.57\%}$
test_set_nested 0.1743ms 20.9856μs 47.6518 KOps/s 53.0024 KOps/s $\textbf{\color{#d91a1a}-10.10\%}$
test_set_nested_new 0.1395ms 23.8219μs 41.9782 KOps/s 45.1200 KOps/s $\textbf{\color{#d91a1a}-6.96\%}$
test_select 0.1817ms 36.5859μs 27.3329 KOps/s 28.1211 KOps/s $\color{#d91a1a}-2.80\%$
test_select_nested 0.9258ms 53.0998μs 18.8325 KOps/s 18.9090 KOps/s $\color{#d91a1a}-0.40\%$
test_exclude_nested 0.1369ms 0.1084ms 9.2274 KOps/s 9.2078 KOps/s $\color{#35bf28}+0.21\%$
test_empty[True] 0.4966ms 0.3244ms 3.0822 KOps/s 3.1250 KOps/s $\color{#d91a1a}-1.37\%$
test_empty[False] 3.0081μs 0.8578μs 1.1658 MOps/s 1.1665 MOps/s $\color{#d91a1a}-0.06\%$
test_to 70.7130μs 51.3742μs 19.4650 KOps/s 18.2787 KOps/s $\textbf{\color{#35bf28}+6.49\%}$
test_to_nonblocking 0.1982ms 34.5596μs 28.9355 KOps/s 29.2824 KOps/s $\color{#d91a1a}-1.18\%$
test_unbind_speed 0.3468ms 0.3124ms 3.2010 KOps/s 3.1773 KOps/s $\color{#35bf28}+0.75\%$
test_unbind_speed_stack0 80.6604ms 3.8785ms 257.8322 Ops/s 256.4163 Ops/s $\color{#35bf28}+0.55\%$
test_unbind_speed_stack1 1.7036μs 0.5283μs 1.8928 MOps/s 1.8767 MOps/s $\color{#35bf28}+0.86\%$
test_split 1.7960ms 1.5235ms 656.3990 Ops/s 660.4033 Ops/s $\color{#d91a1a}-0.61\%$
test_chunk 74.9412ms 1.6288ms 613.9381 Ops/s 661.1826 Ops/s $\textbf{\color{#d91a1a}-7.15\%}$
test_creation[device0] 0.2190ms 71.2244μs 14.0401 KOps/s 13.7857 KOps/s $\color{#35bf28}+1.85\%$
test_creation_from_tensor 0.2038ms 56.1717μs 17.8026 KOps/s 18.6192 KOps/s $\color{#d91a1a}-4.39\%$
test_add_one[memmap_tensor0] 0.1716ms 6.6744μs 149.8259 KOps/s 146.1550 KOps/s $\color{#35bf28}+2.51\%$
test_contiguous[memmap_tensor0] 12.6100μs 0.6264μs 1.5963 MOps/s 1.5831 MOps/s $\color{#35bf28}+0.84\%$
test_stack[memmap_tensor0] 26.5210μs 4.3578μs 229.4726 KOps/s 223.2844 KOps/s $\color{#35bf28}+2.77\%$
test_memmaptd_index 0.9846ms 0.2561ms 3.9043 KOps/s 3.9068 KOps/s $\color{#d91a1a}-0.06\%$
test_memmaptd_index_astensor 0.5952ms 0.3123ms 3.2019 KOps/s 3.1820 KOps/s $\color{#35bf28}+0.63\%$
test_memmaptd_index_op 0.9735ms 0.5951ms 1.6805 KOps/s 1.6905 KOps/s $\color{#d91a1a}-0.59\%$
test_serialize_model 0.1726s 96.7547ms 10.3354 Ops/s 9.7393 Ops/s $\textbf{\color{#35bf28}+6.12\%}$
test_serialize_model_pickle 1.3661s 1.2394s 0.8068 Ops/s 0.8067 Ops/s $\color{#35bf28}+0.02\%$
test_serialize_weights 89.6586ms 86.8224ms 11.5178 Ops/s 9.8474 Ops/s $\textbf{\color{#35bf28}+16.96\%}$
test_serialize_weights_returnearly 0.2575s 80.0977ms 12.4848 Ops/s 14.0886 Ops/s $\textbf{\color{#d91a1a}-11.38\%}$
test_serialize_weights_pickle 1.3573s 1.2365s 0.8087 Ops/s 0.8091 Ops/s $\color{#d91a1a}-0.05\%$
test_reshape_pytree 55.1430μs 24.6724μs 40.5311 KOps/s 40.6000 KOps/s $\color{#d91a1a}-0.17\%$
test_reshape_td 0.2313ms 29.5346μs 33.8586 KOps/s 34.3681 KOps/s $\color{#d91a1a}-1.48\%$
test_view_pytree 0.1543ms 24.3678μs 41.0378 KOps/s 41.2137 KOps/s $\color{#d91a1a}-0.43\%$
test_view_td 0.2105ms 4.4389μs 225.2810 KOps/s 238.9955 KOps/s $\textbf{\color{#d91a1a}-5.74\%}$
test_unbind_pytree 0.1865ms 30.6970μs 32.5765 KOps/s 33.2367 KOps/s $\color{#d91a1a}-1.99\%$
test_unbind_td 78.4230μs 48.9913μs 20.4118 KOps/s 20.4027 KOps/s $\color{#35bf28}+0.04\%$
test_split_pytree 51.3520μs 28.8793μs 34.6269 KOps/s 35.0720 KOps/s $\color{#d91a1a}-1.27\%$
test_split_td 0.6995ms 39.2229μs 25.4953 KOps/s 24.9606 KOps/s $\color{#35bf28}+2.14\%$
test_add_pytree 0.2350ms 36.1072μs 27.6953 KOps/s 26.7895 KOps/s $\color{#35bf28}+3.38\%$
test_add_td 82.9430μs 47.7126μs 20.9588 KOps/s 20.7491 KOps/s $\color{#35bf28}+1.01\%$
test_distributed 1.9635ms 74.0252μs 13.5089 KOps/s 10.7317 KOps/s $\textbf{\color{#35bf28}+25.88\%}$
test_tdmodule 0.1074ms 17.4980μs 57.1494 KOps/s 57.5602 KOps/s $\color{#d91a1a}-0.71\%$
test_tdmodule_dispatch 0.1385ms 33.3942μs 29.9453 KOps/s 30.9519 KOps/s $\color{#d91a1a}-3.25\%$
test_tdseq 36.1520μs 20.0468μs 49.8834 KOps/s 49.8041 KOps/s $\color{#35bf28}+0.16\%$
test_tdseq_dispatch 51.3320μs 35.6027μs 28.0877 KOps/s 28.0409 KOps/s $\color{#35bf28}+0.17\%$
test_instantiation_functorch 1.9162ms 1.6981ms 588.9007 Ops/s 593.8881 Ops/s $\color{#d91a1a}-0.84\%$
test_instantiation_td 1.9264ms 1.2105ms 826.0762 Ops/s 853.7487 Ops/s $\color{#d91a1a}-3.24\%$
test_exec_functorch 0.3184ms 0.1627ms 6.1452 KOps/s 6.2329 KOps/s $\color{#d91a1a}-1.41\%$
test_exec_functional_call 0.2329ms 0.1669ms 5.9904 KOps/s 6.3035 KOps/s $\color{#d91a1a}-4.97\%$
test_exec_td 0.3399ms 0.1599ms 6.2545 KOps/s 6.6894 KOps/s $\textbf{\color{#d91a1a}-6.50\%}$
test_exec_td_decorator 0.9318ms 0.1964ms 5.0925 KOps/s 5.3400 KOps/s $\color{#d91a1a}-4.64\%$
test_vmap_mlp_speed[True-True] 1.3246ms 1.1390ms 877.9398 Ops/s 908.8293 Ops/s $\color{#d91a1a}-3.40\%$
test_vmap_mlp_speed[True-False] 0.8560ms 0.6765ms 1.4781 KOps/s 1.5227 KOps/s $\color{#d91a1a}-2.93\%$
test_vmap_mlp_speed[False-True] 1.2221ms 1.0550ms 947.9113 Ops/s 987.1482 Ops/s $\color{#d91a1a}-3.97\%$
test_vmap_mlp_speed[False-False] 0.7408ms 0.6040ms 1.6557 KOps/s 1.6722 KOps/s $\color{#d91a1a}-0.99\%$
test_vmap_mlp_speed_decorator[True-True] 3.3422ms 2.5661ms 389.7019 Ops/s 391.0500 Ops/s $\color{#d91a1a}-0.34\%$
test_vmap_mlp_speed_decorator[True-False] 1.0612ms 0.7253ms 1.3787 KOps/s 1.4260 KOps/s $\color{#d91a1a}-3.32\%$
test_vmap_mlp_speed_decorator[False-True] 2.6025ms 2.1499ms 465.1313 Ops/s 483.0553 Ops/s $\color{#d91a1a}-3.71\%$
test_vmap_mlp_speed_decorator[False-False] 1.0512ms 0.6243ms 1.6018 KOps/s 1.6520 KOps/s $\color{#d91a1a}-3.04\%$
test_vmap_transformer_speed[True-True] 12.8667ms 12.3287ms 81.1115 Ops/s 80.6423 Ops/s $\color{#35bf28}+0.58\%$
test_vmap_transformer_speed[True-False] 8.1758ms 8.0249ms 124.6118 Ops/s 121.4883 Ops/s $\color{#35bf28}+2.57\%$
test_vmap_transformer_speed[False-True] 12.9882ms 12.2374ms 81.7168 Ops/s 79.2910 Ops/s $\color{#35bf28}+3.06\%$
test_vmap_transformer_speed[False-False] 8.3263ms 7.9534ms 125.7328 Ops/s 121.3813 Ops/s $\color{#35bf28}+3.59\%$
test_vmap_transformer_speed_decorator[True-True] 0.1657s 81.7583ms 12.2312 Ops/s 11.9680 Ops/s $\color{#35bf28}+2.20\%$
test_vmap_transformer_speed_decorator[True-False] 21.9244ms 19.4389ms 51.4432 Ops/s 49.8889 Ops/s $\color{#35bf28}+3.12\%$
test_vmap_transformer_speed_decorator[False-True] 69.7874ms 67.3827ms 14.8406 Ops/s 14.4513 Ops/s $\color{#35bf28}+2.69\%$
test_vmap_transformer_speed_decorator[False-False] 0.1168s 20.9597ms 47.7105 Ops/s 46.5222 Ops/s $\color{#35bf28}+2.55\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants