[Feature, Test] Add tests for partial update #578

Merged: 7 commits, Nov 24, 2023

@vmoens (Contributor) commented Nov 24, 2023

No description provided.

@facebook-github-bot added the CLA Signed label Nov 24, 2023

github-actions bot commented Nov 24, 2023

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 113. Improved: $\large\color{#35bf28}12$. Worsened: $\large\color{#d91a1a}34$.

Detailed results (columns: Name, Max, Mean, Ops, Ops on Repo HEAD, Change):
test_plain_set_nested 41.4070μs 15.8836μs 62.9580 KOps/s 62.4967 KOps/s $\color{#35bf28}+0.74\%$
test_plain_set_stack_nested 0.1988ms 0.1428ms 7.0030 KOps/s 6.9764 KOps/s $\color{#35bf28}+0.38\%$
test_plain_set_nested_inplace 59.8720μs 19.4378μs 51.4461 KOps/s 52.5255 KOps/s $\color{#d91a1a}-2.05\%$
test_plain_set_stack_nested_inplace 0.3269ms 0.1730ms 5.7817 KOps/s 5.8572 KOps/s $\color{#d91a1a}-1.29\%$
test_items 22.0720μs 2.4339μs 410.8612 KOps/s 376.2520 KOps/s $\textbf{\color{#35bf28}+9.20\%}$
test_items_nested 7.8324ms 0.3145ms 3.1797 KOps/s 3.3092 KOps/s $\color{#d91a1a}-3.91\%$
test_items_nested_locked 1.3299ms 0.2843ms 3.5175 KOps/s 3.7221 KOps/s $\textbf{\color{#d91a1a}-5.50\%}$
test_items_nested_leaf 0.2888ms 0.1745ms 5.7319 KOps/s 6.0801 KOps/s $\textbf{\color{#d91a1a}-5.73\%}$
test_items_stack_nested 2.2894ms 1.4958ms 668.5496 Ops/s 670.8179 Ops/s $\color{#d91a1a}-0.34\%$
test_items_stack_nested_leaf 1.6060ms 1.3479ms 741.8721 Ops/s 740.5993 Ops/s $\color{#35bf28}+0.17\%$
test_items_stack_nested_locked 1.8480ms 0.7778ms 1.2857 KOps/s 1.2515 KOps/s $\color{#35bf28}+2.73\%$
test_keys 4.3937ms 5.2670μs 189.8598 KOps/s 241.5038 KOps/s $\textbf{\color{#d91a1a}-21.38\%}$
test_keys_nested 4.4962ms 0.1570ms 6.3675 KOps/s 6.4041 KOps/s $\color{#d91a1a}-0.57\%$
test_keys_nested_locked 6.8025ms 0.1660ms 6.0242 KOps/s 7.0853 KOps/s $\textbf{\color{#d91a1a}-14.98\%}$
test_keys_nested_leaf 8.0634ms 0.1593ms 6.2784 KOps/s 7.1312 KOps/s $\textbf{\color{#d91a1a}-11.96\%}$
test_keys_stack_nested 6.4389ms 1.6027ms 623.9408 Ops/s 710.3476 Ops/s $\textbf{\color{#d91a1a}-12.16\%}$
test_keys_stack_nested_leaf 1.5605ms 1.3963ms 716.1883 Ops/s 712.6182 Ops/s $\color{#35bf28}+0.50\%$
test_keys_stack_nested_locked 7.0955ms 0.8216ms 1.2171 KOps/s 1.4815 KOps/s $\textbf{\color{#d91a1a}-17.85\%}$
test_values 1.9380ms 1.3141μs 760.9776 KOps/s 691.6661 KOps/s $\textbf{\color{#35bf28}+10.02\%}$
test_values_nested 7.7683ms 53.8426μs 18.5727 KOps/s 19.4610 KOps/s $\color{#d91a1a}-4.56\%$
test_values_nested_locked 4.8572ms 56.8990μs 17.5750 KOps/s 19.3875 KOps/s $\textbf{\color{#d91a1a}-9.35\%}$
test_values_nested_leaf 1.2105ms 44.8724μs 22.2854 KOps/s 22.7554 KOps/s $\color{#d91a1a}-2.07\%$
test_values_stack_nested 10.7941ms 1.3804ms 724.4335 Ops/s 821.3357 Ops/s $\textbf{\color{#d91a1a}-11.80\%}$
test_values_stack_nested_leaf 8.0686ms 1.3600ms 735.3016 Ops/s 837.6776 Ops/s $\textbf{\color{#d91a1a}-12.22\%}$
test_values_stack_nested_locked 5.9225ms 0.5869ms 1.7039 KOps/s 1.8998 KOps/s $\textbf{\color{#d91a1a}-10.31\%}$
test_membership 3.6530ms 1.4421μs 693.4131 KOps/s 624.5947 KOps/s $\textbf{\color{#35bf28}+11.02\%}$
test_membership_nested 7.4888ms 3.2724μs 305.5858 KOps/s 307.8388 KOps/s $\color{#d91a1a}-0.73\%$
test_membership_nested_leaf 2.8155ms 3.0445μs 328.4596 KOps/s 297.9249 KOps/s $\textbf{\color{#35bf28}+10.25\%}$
test_membership_stacked_nested 37.0090μs 11.8332μs 84.5081 KOps/s 71.9565 KOps/s $\textbf{\color{#35bf28}+17.44\%}$
test_membership_stacked_nested_leaf 6.6996ms 12.4971μs 80.0185 KOps/s 74.2190 KOps/s $\textbf{\color{#35bf28}+7.81\%}$
test_membership_nested_last 3.5830ms 6.7737μs 147.6302 KOps/s 160.6494 KOps/s $\textbf{\color{#d91a1a}-8.10\%}$
test_membership_nested_leaf_last 1.9340ms 6.2601μs 159.7425 KOps/s 157.5121 KOps/s $\color{#35bf28}+1.42\%$
test_membership_stacked_nested_last 0.4306ms 0.1738ms 5.7532 KOps/s 5.1679 KOps/s $\textbf{\color{#35bf28}+11.32\%}$
test_membership_stacked_nested_leaf_last 40.9570μs 14.1987μs 70.4289 KOps/s 59.7912 KOps/s $\textbf{\color{#35bf28}+17.79\%}$
test_nested_getleaf 31.5090μs 10.6756μs 93.6715 KOps/s 91.5026 KOps/s $\color{#35bf28}+2.37\%$
test_nested_get 28.0520μs 10.1953μs 98.0845 KOps/s 96.4047 KOps/s $\color{#35bf28}+1.74\%$
test_stacked_getleaf 1.1055ms 0.6485ms 1.5419 KOps/s 1.5158 KOps/s $\color{#35bf28}+1.73\%$
test_stacked_get 1.0057ms 0.6223ms 1.6069 KOps/s 1.6269 KOps/s $\color{#d91a1a}-1.23\%$
test_nested_getitemleaf 47.8770μs 10.6787μs 93.6442 KOps/s 93.6181 KOps/s $\color{#35bf28}+0.03\%$
test_nested_getitem 37.6810μs 10.2951μs 97.1338 KOps/s 86.9687 KOps/s $\textbf{\color{#35bf28}+11.69\%}$
test_stacked_getitemleaf 0.7513ms 0.6463ms 1.5472 KOps/s 1.3459 KOps/s $\textbf{\color{#35bf28}+14.96\%}$
test_stacked_getitem 1.0400ms 0.6226ms 1.6061 KOps/s 1.6242 KOps/s $\color{#d91a1a}-1.12\%$
test_lock_nested 56.2455ms 0.6145ms 1.6273 KOps/s 1.7965 KOps/s $\textbf{\color{#d91a1a}-9.42\%}$
test_lock_stack_nested 13.8566ms 5.1284ms 194.9938 Ops/s 197.9010 Ops/s $\color{#d91a1a}-1.47\%$
test_unlock_nested 0.8897ms 0.4410ms 2.2678 KOps/s 2.2622 KOps/s $\color{#35bf28}+0.25\%$
test_unlock_stack_nested 63.8355ms 6.5426ms 152.8433 Ops/s 150.9294 Ops/s $\color{#35bf28}+1.27\%$
test_flatten_speed 0.4817ms 0.2694ms 3.7121 KOps/s 3.7485 KOps/s $\color{#d91a1a}-0.97\%$
test_unflatten_speed 0.9760ms 0.4643ms 2.1536 KOps/s 2.0828 KOps/s $\color{#35bf28}+3.40\%$
test_common_ops 5.2900ms 0.8125ms 1.2308 KOps/s 1.2153 KOps/s $\color{#35bf28}+1.28\%$
test_creation 1.2908ms 2.6609μs 375.8116 KOps/s 397.2106 KOps/s $\textbf{\color{#d91a1a}-5.39\%}$
test_creation_empty 2.6703ms 10.0212μs 99.7880 KOps/s 114.3416 KOps/s $\textbf{\color{#d91a1a}-12.73\%}$
test_creation_nested_1 3.4999ms 14.0230μs 71.3115 KOps/s 81.8056 KOps/s $\textbf{\color{#d91a1a}-12.83\%}$
test_creation_nested_2 0.4953ms 16.2491μs 61.5417 KOps/s 64.2440 KOps/s $\color{#d91a1a}-4.21\%$
test_clone 74.0590μs 13.6181μs 73.4317 KOps/s 72.1061 KOps/s $\color{#35bf28}+1.84\%$
test_getitem[int] 37.5700μs 12.8678μs 77.7136 KOps/s 76.0403 KOps/s $\color{#35bf28}+2.20\%$
test_getitem[slice_int] 65.2120μs 25.7156μs 38.8869 KOps/s 39.2969 KOps/s $\color{#d91a1a}-1.04\%$
test_getitem[range] 94.5770μs 44.8657μs 22.2887 KOps/s 22.2126 KOps/s $\color{#35bf28}+0.34\%$
test_getitem[tuple] 3.7430ms 23.4003μs 42.7344 KOps/s 48.7685 KOps/s $\textbf{\color{#d91a1a}-12.37\%}$
test_getitem[list] 83.4660μs 39.7198μs 25.1764 KOps/s 24.6354 KOps/s $\color{#35bf28}+2.20\%$
test_setitem_dim[int] 48.6810μs 29.5502μs 33.8407 KOps/s 34.9378 KOps/s $\color{#d91a1a}-3.14\%$
test_setitem_dim[slice_int] 94.9180μs 54.4246μs 18.3740 KOps/s 19.0063 KOps/s $\color{#d91a1a}-3.33\%$
test_setitem_dim[range] 0.1117ms 73.0043μs 13.6978 KOps/s 14.0140 KOps/s $\color{#d91a1a}-2.26\%$
test_setitem_dim[tuple] 76.8040μs 42.8147μs 23.3565 KOps/s 23.0459 KOps/s $\color{#35bf28}+1.35\%$
test_setitem 65.1020μs 19.2612μs 51.9178 KOps/s 51.8334 KOps/s $\color{#35bf28}+0.16\%$
test_set 64.5210μs 18.7167μs 53.4281 KOps/s 54.0676 KOps/s $\color{#d91a1a}-1.18\%$
test_set_shared 3.2213ms 0.1467ms 6.8178 KOps/s 7.1438 KOps/s $\color{#d91a1a}-4.56\%$
test_update 0.1445ms 20.2152μs 49.4676 KOps/s 50.9712 KOps/s $\color{#d91a1a}-2.95\%$
test_update_nested 77.2040μs 28.1531μs 35.5200 KOps/s 36.9032 KOps/s $\color{#d91a1a}-3.75\%$
test_set_nested 65.8640μs 20.7740μs 48.1370 KOps/s 48.6278 KOps/s $\color{#d91a1a}-1.01\%$
test_set_nested_new 72.8270μs 25.8908μs 38.6238 KOps/s 39.0938 KOps/s $\color{#d91a1a}-1.20\%$
test_select 0.1142ms 51.4303μs 19.4438 KOps/s 19.5782 KOps/s $\color{#d91a1a}-0.69\%$
test_unbind_speed 6.2259ms 0.4268ms 2.3430 KOps/s 2.6867 KOps/s $\textbf{\color{#d91a1a}-12.79\%}$
test_unbind_speed_stack0 81.1788ms 5.8275ms 171.6014 Ops/s 235.7305 Ops/s $\textbf{\color{#d91a1a}-27.20\%}$
test_unbind_speed_stack1 1.8735μs 0.6242μs 1.6020 MOps/s 1.5934 MOps/s $\color{#35bf28}+0.54\%$
test_split 54.7484ms 1.7515ms 570.9422 Ops/s 567.9958 Ops/s $\color{#35bf28}+0.52\%$
test_chunk 54.7397ms 1.7319ms 577.3988 Ops/s 578.6987 Ops/s $\color{#d91a1a}-0.22\%$
test_creation[device0] 5.0567ms 0.3083ms 3.2438 KOps/s 2.5796 KOps/s $\textbf{\color{#35bf28}+25.75\%}$
test_creation_from_tensor 3.9513ms 0.4188ms 2.3876 KOps/s 2.4987 KOps/s $\color{#d91a1a}-4.45\%$
test_add_one[memmap_tensor0] 7.2385ms 32.2866μs 30.9726 KOps/s 37.6237 KOps/s $\textbf{\color{#d91a1a}-17.68\%}$
test_contiguous[memmap_tensor0] 1.6830ms 6.0805μs 164.4594 KOps/s 175.6347 KOps/s $\textbf{\color{#d91a1a}-6.36\%}$
test_stack[memmap_tensor0] 98.2120μs 19.1448μs 52.2335 KOps/s 53.4359 KOps/s $\color{#d91a1a}-2.25\%$
test_memmaptd_index 0.4934ms 0.4072ms 2.4557 KOps/s 2.4956 KOps/s $\color{#d91a1a}-1.60\%$
test_memmaptd_index_astensor 0.5546ms 0.4639ms 2.1559 KOps/s 2.1664 KOps/s $\color{#d91a1a}-0.48\%$
test_memmaptd_index_op 0.8711ms 0.7275ms 1.3745 KOps/s 1.4012 KOps/s $\color{#d91a1a}-1.90\%$
test_reshape_pytree 57.9480μs 23.8878μs 41.8623 KOps/s 42.7413 KOps/s $\color{#d91a1a}-2.06\%$
test_reshape_td 91.2910μs 32.3566μs 30.9056 KOps/s 31.7864 KOps/s $\color{#d91a1a}-2.77\%$
test_view_pytree 0.3955ms 23.5700μs 42.4268 KOps/s 42.8268 KOps/s $\color{#d91a1a}-0.93\%$
test_view_td 4.9194ms 5.4496μs 183.4997 KOps/s 198.0529 KOps/s $\textbf{\color{#d91a1a}-7.35\%}$
test_unbind_pytree 2.0933ms 33.7788μs 29.6044 KOps/s 38.1245 KOps/s $\textbf{\color{#d91a1a}-22.35\%}$
test_unbind_td 8.2716ms 69.8726μs 14.3118 KOps/s 16.9098 KOps/s $\textbf{\color{#d91a1a}-15.36\%}$
test_split_pytree 2.8866ms 30.0813μs 33.2433 KOps/s 37.9162 KOps/s $\textbf{\color{#d91a1a}-12.32\%}$
test_split_td 7.3225ms 54.4384μs 18.3694 KOps/s 21.4150 KOps/s $\textbf{\color{#d91a1a}-14.22\%}$
test_add_pytree 7.6758ms 37.7164μs 26.5136 KOps/s 31.1058 KOps/s $\textbf{\color{#d91a1a}-14.76\%}$
test_add_td 2.3245ms 54.3920μs 18.3851 KOps/s 21.9329 KOps/s $\textbf{\color{#d91a1a}-16.18\%}$
test_distributed 1.9579ms 7.0877μs 141.0886 KOps/s 160.0809 KOps/s $\textbf{\color{#d91a1a}-11.86\%}$
test_tdmodule 0.4097ms 25.7826μs 38.7859 KOps/s 46.6629 KOps/s $\textbf{\color{#d91a1a}-16.88\%}$
test_tdmodule_dispatch 1.5120ms 47.4781μs 21.0623 KOps/s 24.8453 KOps/s $\textbf{\color{#d91a1a}-15.23\%}$
test_tdseq 46.8980μs 24.3248μs 41.1103 KOps/s 39.6713 KOps/s $\color{#35bf28}+3.63\%$
test_tdseq_dispatch 0.3789ms 42.5343μs 23.5104 KOps/s 22.2381 KOps/s $\textbf{\color{#35bf28}+5.72\%}$
test_instantiation_functorch 1.4183ms 1.2936ms 773.0657 Ops/s 775.7535 Ops/s $\color{#d91a1a}-0.35\%$
test_instantiation_td 61.0796ms 1.1774ms 849.3545 Ops/s 986.8322 Ops/s $\textbf{\color{#d91a1a}-13.93\%}$
test_exec_functorch 4.2961ms 0.1976ms 5.0617 KOps/s 6.3229 KOps/s $\textbf{\color{#d91a1a}-19.95\%}$
test_exec_functional_call 7.1216ms 0.1841ms 5.4333 KOps/s 6.5932 KOps/s $\textbf{\color{#d91a1a}-17.59\%}$
test_exec_td 0.2212ms 0.1429ms 6.9963 KOps/s 6.7469 KOps/s $\color{#35bf28}+3.70\%$
test_exec_td_decorator 0.9366ms 0.1782ms 5.6125 KOps/s 5.5918 KOps/s $\color{#35bf28}+0.37\%$
test_vmap_mlp_speed[True-True] 1.4242ms 0.8895ms 1.1242 KOps/s 1.1249 KOps/s $\color{#d91a1a}-0.06\%$
test_vmap_mlp_speed[True-False] 0.8250ms 0.4713ms 2.1218 KOps/s 2.1231 KOps/s $\color{#d91a1a}-0.06\%$
test_vmap_mlp_speed[False-True] 1.1242ms 0.7695ms 1.2996 KOps/s 1.2933 KOps/s $\color{#35bf28}+0.49\%$
test_vmap_mlp_speed[False-False] 0.4647ms 0.3770ms 2.6522 KOps/s 2.5871 KOps/s $\color{#35bf28}+2.52\%$
test_vmap_mlp_speed_decorator[True-True] 2.6790ms 1.7781ms 562.4032 Ops/s 574.0236 Ops/s $\color{#d91a1a}-2.02\%$
test_vmap_mlp_speed_decorator[True-False] 0.9773ms 0.5119ms 1.9534 KOps/s 1.9277 KOps/s $\color{#35bf28}+1.33\%$
test_vmap_mlp_speed_decorator[False-True] 1.9084ms 1.4630ms 683.5435 Ops/s 684.0606 Ops/s $\color{#d91a1a}-0.08\%$
test_vmap_mlp_speed_decorator[False-False] 0.9883ms 0.3926ms 2.5474 KOps/s 2.5329 KOps/s $\color{#35bf28}+0.57\%$
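
A note on reading the table above: the Change column appears to be the relative difference between Ops (this PR) and Ops on Repo HEAD, and the rows rendered in bold appear to be the ones counted as Improved or Worsened. The sketch below reproduces that reading. The function names are hypothetical, and the 5% threshold is an assumption inferred from which rows are bold; the bot's comment does not state it.

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change in throughput of the PR relative to repo HEAD.

    Both arguments must be in the same unit (e.g. both KOps/s).
    """
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a benchmark; the 5% threshold is an assumed value, not documented."""
    if change_pct > threshold:
        return "improved"
    if change_pct < -threshold:
        return "worsened"
    return "unchanged"


# Rows taken from the CPU table above (values in KOps/s).
rows = {
    "test_plain_set_nested": (62.9580, 62.4967),
    "test_keys": (189.8598, 241.5038),
}

for name, (ops_pr, ops_head) in rows.items():
    pct = relative_change(ops_pr, ops_head)
    print(f"{name}: {pct:+.2f}% ({classify(pct)})")
```

Under this reading, 12 CPU rows exceed +5% and 34 fall below -5%, which matches the summary line of the CPU comment.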

github-actions bot commented Nov 24, 2023

$\color{#35bf28}\textsf{\Large✔\kern{0.2cm}\normalsize OK}$ Result of GPU Benchmark Tests

Total Benchmarks: 127. Improved: $\large\color{#35bf28}6$. Worsened: $\large\color{#d91a1a}0$.

Detailed results (columns: Name, Max, Mean, Ops, Ops on Repo HEAD, Change):
test_plain_set_nested 79.1510μs 12.5106μs 79.9324 KOps/s 79.5431 KOps/s $\color{#35bf28}+0.49\%$
test_plain_set_stack_nested 0.1328ms 0.1155ms 8.6561 KOps/s 8.7322 KOps/s $\color{#d91a1a}-0.87\%$
test_plain_set_nested_inplace 29.1610μs 14.8248μs 67.4544 KOps/s 67.1733 KOps/s $\color{#35bf28}+0.42\%$
test_plain_set_stack_nested_inplace 0.1821ms 0.1404ms 7.1243 KOps/s 7.1759 KOps/s $\color{#d91a1a}-0.72\%$
test_items 21.5700μs 4.6462μs 215.2311 KOps/s 213.2458 KOps/s $\color{#35bf28}+0.93\%$
test_items_nested 0.3697ms 0.3366ms 2.9705 KOps/s 2.9895 KOps/s $\color{#d91a1a}-0.64\%$
test_items_nested_locked 0.3707ms 0.3392ms 2.9483 KOps/s 2.9743 KOps/s $\color{#d91a1a}-0.88\%$
test_items_nested_leaf 0.2178ms 0.1985ms 5.0387 KOps/s 5.0944 KOps/s $\color{#d91a1a}-1.09\%$
test_items_stack_nested 1.5614ms 1.4873ms 672.3446 Ops/s 673.8635 Ops/s $\color{#d91a1a}-0.23\%$
test_items_stack_nested_leaf 1.3728ms 1.3218ms 756.5259 Ops/s 756.5245 Ops/s $+0.00\%$
test_items_stack_nested_locked 2.0429ms 0.8333ms 1.2000 KOps/s 1.2023 KOps/s $\color{#d91a1a}-0.19\%$
test_keys 25.5610μs 4.5758μs 218.5424 KOps/s 217.0800 KOps/s $\color{#35bf28}+0.67\%$
test_keys_nested 0.4603ms 90.2577μs 11.0794 KOps/s 11.0122 KOps/s $\color{#35bf28}+0.61\%$
test_keys_nested_locked 0.1093ms 89.2442μs 11.2052 KOps/s 11.0088 KOps/s $\color{#35bf28}+1.78\%$
test_keys_nested_leaf 42.8081ms 86.4129μs 11.5723 KOps/s 12.0924 KOps/s $\color{#d91a1a}-4.30\%$
test_keys_stack_nested 1.3490ms 1.2857ms 777.7652 Ops/s 770.7290 Ops/s $\color{#35bf28}+0.91\%$
test_keys_stack_nested_leaf 1.3230ms 1.2780ms 782.4623 Ops/s 781.0614 Ops/s $\color{#35bf28}+0.18\%$
test_keys_stack_nested_locked 0.6878ms 0.6239ms 1.6028 KOps/s 1.5839 KOps/s $\color{#35bf28}+1.19\%$
test_values 8.3437μs 1.8811μs 531.5971 KOps/s 526.2443 KOps/s $\color{#35bf28}+1.02\%$
test_values_nested 68.8010μs 43.2429μs 23.1252 KOps/s 23.2978 KOps/s $\color{#d91a1a}-0.74\%$
test_values_nested_locked 67.2910μs 45.5231μs 21.9669 KOps/s 22.2395 KOps/s $\color{#d91a1a}-1.23\%$
test_values_nested_leaf 63.3210μs 37.2970μs 26.8118 KOps/s 27.0082 KOps/s $\color{#d91a1a}-0.73\%$
test_values_stack_nested 1.2030ms 1.1479ms 871.1727 Ops/s 877.3908 Ops/s $\color{#d91a1a}-0.71\%$
test_values_stack_nested_leaf 1.1770ms 1.1296ms 885.2835 Ops/s 880.0373 Ops/s $\color{#35bf28}+0.60\%$
test_values_stack_nested_locked 0.5553ms 0.5021ms 1.9915 KOps/s 1.9640 KOps/s $\color{#35bf28}+1.40\%$
test_membership 4.8302μs 0.9399μs 1.0640 MOps/s 1.0568 MOps/s $\color{#35bf28}+0.68\%$
test_membership_nested 16.2855μs 2.1069μs 474.6411 KOps/s 463.6394 KOps/s $\color{#35bf28}+2.37\%$
test_membership_nested_leaf 15.9050μs 2.1134μs 473.1785 KOps/s 469.8647 KOps/s $\color{#35bf28}+0.71\%$
test_membership_stacked_nested 44.2510μs 10.9266μs 91.5201 KOps/s 91.0351 KOps/s $\color{#35bf28}+0.53\%$
test_membership_stacked_nested_leaf 30.3410μs 10.9477μs 91.3431 KOps/s 91.5017 KOps/s $\color{#d91a1a}-0.17\%$
test_membership_nested_last 32.9910μs 4.6218μs 216.3660 KOps/s 214.9150 KOps/s $\color{#35bf28}+0.68\%$
test_membership_nested_leaf_last 32.0100μs 4.6378μs 215.6189 KOps/s 215.0877 KOps/s $\color{#35bf28}+0.25\%$
test_membership_stacked_nested_last 0.1700ms 0.1342ms 7.4540 KOps/s 7.4190 KOps/s $\color{#35bf28}+0.47\%$
test_membership_stacked_nested_leaf_last 40.4410μs 12.7475μs 78.4469 KOps/s 77.9429 KOps/s $\color{#35bf28}+0.65\%$
test_nested_getleaf 33.3910μs 8.4324μs 118.5900 KOps/s 118.9156 KOps/s $\color{#d91a1a}-0.27\%$
test_nested_get 31.3500μs 7.9618μs 125.6002 KOps/s 125.8740 KOps/s $\color{#d91a1a}-0.22\%$
test_stacked_getleaf 0.5966ms 0.5668ms 1.7644 KOps/s 1.7589 KOps/s $\color{#35bf28}+0.31\%$
test_stacked_get 0.5821ms 0.5249ms 1.9050 KOps/s 1.8493 KOps/s $\color{#35bf28}+3.01\%$
test_nested_getitemleaf 40.4910μs 8.4638μs 118.1498 KOps/s 117.7041 KOps/s $\color{#35bf28}+0.38\%$
test_nested_getitem 38.6110μs 7.9683μs 125.4978 KOps/s 124.6524 KOps/s $\color{#35bf28}+0.68\%$
test_stacked_getitemleaf 0.5974ms 0.5607ms 1.7836 KOps/s 1.7487 KOps/s $\color{#35bf28}+2.00\%$
test_stacked_getitem 0.5795ms 0.5270ms 1.8977 KOps/s 1.8497 KOps/s $\color{#35bf28}+2.59\%$
test_lock_nested 3.1279ms 0.5538ms 1.8056 KOps/s 1.7391 KOps/s $\color{#35bf28}+3.83\%$
test_lock_stack_nested 84.4409ms 7.2308ms 138.2977 Ops/s 135.7110 Ops/s $\color{#35bf28}+1.91\%$
test_unlock_nested 2.3484ms 0.4313ms 2.3186 KOps/s 2.2634 KOps/s $\color{#35bf28}+2.44\%$
test_unlock_stack_nested 67.2396ms 6.2507ms 159.9831 Ops/s 156.1231 Ops/s $\color{#35bf28}+2.47\%$
test_flatten_speed 0.2301ms 0.1872ms 5.3405 KOps/s 5.3413 KOps/s $\color{#d91a1a}-0.01\%$
test_unflatten_speed 0.4062ms 0.3659ms 2.7327 KOps/s 2.7675 KOps/s $\color{#d91a1a}-1.26\%$
test_common_ops 1.0794ms 0.5960ms 1.6779 KOps/s 1.6507 KOps/s $\color{#35bf28}+1.65\%$
test_creation 13.7000μs 2.0723μs 482.5606 KOps/s 469.1407 KOps/s $\color{#35bf28}+2.86\%$
test_creation_empty 36.6510μs 6.7547μs 148.0443 KOps/s 146.0285 KOps/s $\color{#35bf28}+1.38\%$
test_creation_nested_1 23.7000μs 9.1558μs 109.2206 KOps/s 108.5679 KOps/s $\color{#35bf28}+0.60\%$
test_creation_nested_2 38.4800μs 11.7416μs 85.1671 KOps/s 85.3108 KOps/s $\color{#d91a1a}-0.17\%$
test_clone 82.7620μs 14.4921μs 69.0029 KOps/s 66.6124 KOps/s $\color{#35bf28}+3.59\%$
test_getitem[int] 28.0700μs 12.2296μs 81.7691 KOps/s 79.5518 KOps/s $\color{#35bf28}+2.79\%$
test_getitem[slice_int] 47.7610μs 24.0163μs 41.6383 KOps/s 41.1798 KOps/s $\color{#35bf28}+1.11\%$
test_getitem[range] 73.2210μs 40.9242μs 24.4354 KOps/s 24.7642 KOps/s $\color{#d91a1a}-1.33\%$
test_getitem[tuple] 40.7410μs 20.1406μs 49.6511 KOps/s 47.6656 KOps/s $\color{#35bf28}+4.17\%$
test_getitem[list] 0.2306ms 36.2519μs 27.5848 KOps/s 27.4314 KOps/s $\color{#35bf28}+0.56\%$
test_setitem_dim[int] 40.2310μs 24.4423μs 40.9127 KOps/s 38.7197 KOps/s $\textbf{\color{#35bf28}+5.66\%}$
test_setitem_dim[slice_int] 79.5210μs 45.1279μs 22.1593 KOps/s 21.4495 KOps/s $\color{#35bf28}+3.31\%$
test_setitem_dim[range] 81.5810μs 62.4418μs 16.0149 KOps/s 15.3850 KOps/s $\color{#35bf28}+4.09\%$
test_setitem_dim[tuple] 55.8610μs 38.1879μs 26.1863 KOps/s 25.1832 KOps/s $\color{#35bf28}+3.98\%$
test_setitem 75.0710μs 18.0730μs 55.3311 KOps/s 52.5587 KOps/s $\textbf{\color{#35bf28}+5.27\%}$
test_set 78.8610μs 17.3068μs 57.7807 KOps/s 55.0301 KOps/s $\color{#35bf28}+5.00\%$
test_set_shared 2.7794ms 0.1034ms 9.6751 KOps/s 8.6256 KOps/s $\textbf{\color{#35bf28}+12.17\%}$
test_update 80.3610μs 18.3209μs 54.5824 KOps/s 53.0377 KOps/s $\color{#35bf28}+2.91\%$
test_update_nested 83.5020μs 25.2503μs 39.6034 KOps/s 38.8968 KOps/s $\color{#35bf28}+1.82\%$
test_set_nested 74.4910μs 18.8222μs 53.1288 KOps/s 51.3907 KOps/s $\color{#35bf28}+3.38\%$
test_set_nested_new 80.0410μs 22.5224μs 44.4003 KOps/s 42.8016 KOps/s $\color{#35bf28}+3.74\%$
test_select 98.1020μs 45.9176μs 21.7782 KOps/s 21.3143 KOps/s $\color{#35bf28}+2.18\%$
test_to 75.0710μs 52.9282μs 18.8935 KOps/s 18.7928 KOps/s $\color{#35bf28}+0.54\%$
test_to_nonblocking 63.1110μs 34.6767μs 28.8378 KOps/s 28.1667 KOps/s $\color{#35bf28}+2.38\%$
test_unbind_speed 0.3916ms 0.3534ms 2.8300 KOps/s 2.6544 KOps/s $\textbf{\color{#35bf28}+6.62\%}$
test_unbind_speed_stack0 63.5772ms 4.5269ms 220.8999 Ops/s 224.8595 Ops/s $\color{#d91a1a}-1.76\%$
test_unbind_speed_stack1 2.0181μs 0.5210μs 1.9195 MOps/s 1.9202 MOps/s $\color{#d91a1a}-0.04\%$
test_split 53.6768ms 1.8314ms 546.0275 Ops/s 540.7249 Ops/s $\color{#35bf28}+0.98\%$
test_chunk 52.8190ms 1.8084ms 552.9888 Ops/s 547.2495 Ops/s $\color{#35bf28}+1.05\%$
test_creation[device0] 0.3664ms 0.3100ms 3.2258 KOps/s 3.2196 KOps/s $\color{#35bf28}+0.19\%$
test_creation[device1] 0.6517ms 0.3115ms 3.2104 KOps/s 3.1844 KOps/s $\color{#35bf28}+0.82\%$
test_creation_from_tensor 0.6170ms 0.3378ms 2.9603 KOps/s 2.9427 KOps/s $\color{#35bf28}+0.60\%$
test_add_one[memmap_tensor0] 69.1110μs 23.6969μs 42.1996 KOps/s 40.4243 KOps/s $\color{#35bf28}+4.39\%$
test_add_one[memmap_tensor1] 0.1980ms 73.1312μs 13.6741 KOps/s 13.5207 KOps/s $\color{#35bf28}+1.13\%$
test_contiguous[memmap_tensor0] 30.6710μs 5.8740μs 170.2414 KOps/s 162.0886 KOps/s $\textbf{\color{#35bf28}+5.03\%}$
test_contiguous[memmap_tensor1] 50.1010μs 21.7062μs 46.0697 KOps/s 45.1593 KOps/s $\color{#35bf28}+2.02\%$
test_stack[memmap_tensor0] 45.0710μs 19.7103μs 50.7350 KOps/s 48.5308 KOps/s $\color{#35bf28}+4.54\%$
test_stack[memmap_tensor1] 0.1443ms 73.3641μs 13.6306 KOps/s 13.3651 KOps/s $\color{#35bf28}+1.99\%$
test_memmaptd_index 0.4948ms 0.4221ms 2.3689 KOps/s 2.2864 KOps/s $\color{#35bf28}+3.61\%$
test_memmaptd_index_astensor 0.5250ms 0.4898ms 2.0417 KOps/s 2.0085 KOps/s $\color{#35bf28}+1.65\%$
test_memmaptd_index_op 0.8052ms 0.7340ms 1.3624 KOps/s 1.2987 KOps/s $\color{#35bf28}+4.90\%$
test_reshape_pytree 37.1000μs 21.1742μs 47.2273 KOps/s 46.9325 KOps/s $\color{#35bf28}+0.63\%$
test_reshape_td 53.4610μs 29.5333μs 33.8601 KOps/s 32.9248 KOps/s $\color{#35bf28}+2.84\%$
test_view_pytree 34.7400μs 20.6931μs 48.3253 KOps/s 48.2140 KOps/s $\color{#35bf28}+0.23\%$
test_view_td 19.5600μs 4.0756μs 245.3625 KOps/s 246.3557 KOps/s $\color{#d91a1a}-0.40\%$
test_unbind_pytree 49.9510μs 25.9780μs 38.4942 KOps/s 38.2876 KOps/s $\color{#35bf28}+0.54\%$
test_unbind_td 87.7410μs 55.9546μs 17.8716 KOps/s 17.0639 KOps/s $\color{#35bf28}+4.73\%$
test_split_pytree 0.7308ms 24.4612μs 40.8810 KOps/s 41.6514 KOps/s $\color{#d91a1a}-1.85\%$
test_split_td 74.7110μs 46.0227μs 21.7284 KOps/s 21.8803 KOps/s $\color{#d91a1a}-0.69\%$
test_add_pytree 54.3910μs 30.8369μs 32.4287 KOps/s 32.0067 KOps/s $\color{#35bf28}+1.32\%$
test_add_td 65.7710μs 41.3781μs 24.1674 KOps/s 23.4908 KOps/s $\color{#35bf28}+2.88\%$
test_distributed 24.1710μs 5.6311μs 177.5867 KOps/s 180.0694 KOps/s $\color{#d91a1a}-1.38\%$
test_tdmodule 32.8310μs 16.1432μs 61.9457 KOps/s 59.4966 KOps/s $\color{#35bf28}+4.12\%$
test_tdmodule_dispatch 0.2587ms 32.5254μs 30.7452 KOps/s 30.7346 KOps/s $\color{#35bf28}+0.03\%$
test_tdseq 38.0110μs 19.2466μs 51.9572 KOps/s 51.0691 KOps/s $\color{#35bf28}+1.74\%$
test_tdseq_dispatch 58.3210μs 35.4225μs 28.2307 KOps/s 28.3159 KOps/s $\color{#d91a1a}-0.30\%$
test_instantiation_functorch 1.7678ms 1.6771ms 596.2719 Ops/s 588.6078 Ops/s $\color{#35bf28}+1.30\%$
test_instantiation_td 1.7695ms 1.1693ms 855.2041 Ops/s 847.6182 Ops/s $\color{#35bf28}+0.89\%$
test_exec_functorch 0.2180ms 0.1546ms 6.4701 KOps/s 6.3391 KOps/s $\color{#35bf28}+2.07\%$
test_exec_functional_call 0.1879ms 0.1490ms 6.7135 KOps/s 6.6296 KOps/s $\color{#35bf28}+1.27\%$
test_exec_td 0.1726ms 0.1406ms 7.1099 KOps/s 7.0355 KOps/s $\color{#35bf28}+1.06\%$
test_exec_td_decorator 0.7295ms 0.1760ms 5.6826 KOps/s 5.6340 KOps/s $\color{#35bf28}+0.86\%$
test_vmap_mlp_speed[True-True] 1.4687ms 1.0856ms 921.1564 Ops/s 938.2702 Ops/s $\color{#d91a1a}-1.82\%$
test_vmap_mlp_speed[True-False] 0.6478ms 0.6014ms 1.6627 KOps/s 1.6723 KOps/s $\color{#d91a1a}-0.57\%$
test_vmap_mlp_speed[False-True] 1.0423ms 0.9664ms 1.0348 KOps/s 1.0310 KOps/s $\color{#35bf28}+0.37\%$
test_vmap_mlp_speed[False-False] 0.5679ms 0.5285ms 1.8922 KOps/s 1.8765 KOps/s $\color{#35bf28}+0.84\%$
test_vmap_mlp_speed_decorator[True-True] 2.6511ms 2.0253ms 493.7647 Ops/s 493.1741 Ops/s $\color{#35bf28}+0.12\%$
test_vmap_mlp_speed_decorator[True-False] 1.1007ms 0.6427ms 1.5559 KOps/s 1.5565 KOps/s $\color{#d91a1a}-0.04\%$
test_vmap_mlp_speed_decorator[False-True] 2.1803ms 1.7508ms 571.1824 Ops/s 570.7024 Ops/s $\color{#35bf28}+0.08\%$
test_vmap_mlp_speed_decorator[False-False] 1.0345ms 0.5438ms 1.8388 KOps/s 1.8392 KOps/s $\color{#d91a1a}-0.03\%$
test_vmap_transformer_speed[True-True] 12.8396ms 12.3692ms 80.8458 Ops/s 80.1085 Ops/s $\color{#35bf28}+0.92\%$
test_vmap_transformer_speed[True-False] 12.9767ms 8.0911ms 123.5925 Ops/s 124.2474 Ops/s $\color{#d91a1a}-0.53\%$
test_vmap_transformer_speed[False-True] 12.3695ms 12.2703ms 81.4975 Ops/s 80.8495 Ops/s $\color{#35bf28}+0.80\%$
test_vmap_transformer_speed[False-False] 8.0306ms 7.9649ms 125.5515 Ops/s 125.4841 Ops/s $\color{#35bf28}+0.05\%$
test_vmap_transformer_speed_decorator[True-True] 0.1382s 68.6300ms 14.5709 Ops/s 14.5485 Ops/s $\color{#35bf28}+0.15\%$
test_vmap_transformer_speed_decorator[True-False] 21.4792ms 19.3206ms 51.7583 Ops/s 51.3668 Ops/s $\color{#35bf28}+0.76\%$
test_vmap_transformer_speed_decorator[False-True] 59.1405ms 57.9167ms 17.2662 Ops/s 17.2309 Ops/s $\color{#35bf28}+0.21\%$
test_vmap_transformer_speed_decorator[False-False] 20.2046ms 18.9295ms 52.8276 Ops/s 48.4506 Ops/s $\textbf{\color{#35bf28}+9.03\%}$

@vmoens added the Test label Nov 24, 2023
@vmoens changed the title from "[Test] Add tests for partial update" to "[Feature, Test] Add tests for partial update" Nov 24, 2023
@vmoens merged commit 04f6375 into main Nov 24, 2023
41 of 45 checks passed
@vmoens deleted the add-tests-partial-update branch November 24, 2023 17:40