Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Versioning] Bump v0.3.0 #613

Merged
merged 1 commit into from
Jan 8, 2024
Merged

[Versioning] Bump v0.3.0 #613

merged 1 commit into from
Jan 8, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Jan 8, 2024

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 8, 2024
@vmoens vmoens merged commit aaee52d into main Jan 8, 2024
14 of 24 checks passed
@vmoens vmoens deleted the bump-v0.3 branch January 8, 2024 16:23
Copy link

github-actions bot commented Jan 8, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 120. Improved: $\large\color{#35bf28}8$. Worsened: $\large\color{#d91a1a}13$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 44.1620μs 17.1599μs 58.2753 KOps/s 60.9639 KOps/s $\color{#d91a1a}-4.41\%$
test_plain_set_stack_nested 0.1689ms 0.1427ms 7.0055 KOps/s 7.0968 KOps/s $\color{#d91a1a}-1.29\%$
test_plain_set_nested_inplace 56.3820μs 19.3162μs 51.7701 KOps/s 53.0387 KOps/s $\color{#d91a1a}-2.39\%$
test_plain_set_stack_nested_inplace 0.3215ms 0.1768ms 5.6577 KOps/s 5.6926 KOps/s $\color{#d91a1a}-0.61\%$
test_items 16.0100μs 2.3819μs 419.8296 KOps/s 331.1836 KOps/s $\textbf{\color{#35bf28}+26.77\%}$
test_items_nested 0.5567ms 0.2728ms 3.6659 KOps/s 3.7224 KOps/s $\color{#d91a1a}-1.52\%$
test_items_nested_locked 0.3749ms 0.2717ms 3.6804 KOps/s 3.7442 KOps/s $\color{#d91a1a}-1.70\%$
test_items_nested_leaf 0.6920ms 0.1674ms 5.9734 KOps/s 5.9893 KOps/s $\color{#d91a1a}-0.27\%$
test_items_stack_nested 1.5403ms 1.3141ms 761.0000 Ops/s 766.4318 Ops/s $\color{#d91a1a}-0.71\%$
test_items_stack_nested_leaf 1.3998ms 1.1797ms 847.6773 Ops/s 851.5587 Ops/s $\color{#d91a1a}-0.46\%$
test_items_stack_nested_locked 0.9571ms 0.7769ms 1.2872 KOps/s 1.3077 KOps/s $\color{#d91a1a}-1.57\%$
test_keys 20.1480μs 4.3990μs 227.3260 KOps/s 258.3923 KOps/s $\textbf{\color{#d91a1a}-12.02\%}$
test_keys_nested 53.0714ms 0.1556ms 6.4248 KOps/s 6.6873 KOps/s $\color{#d91a1a}-3.93\%$
test_keys_nested_locked 0.2685ms 0.1459ms 6.8526 KOps/s 6.7852 KOps/s $\color{#35bf28}+0.99\%$
test_keys_nested_leaf 0.2308ms 0.1295ms 7.7220 KOps/s 7.7513 KOps/s $\color{#d91a1a}-0.38\%$
test_keys_stack_nested 1.6579ms 1.2780ms 782.4742 Ops/s 797.8215 Ops/s $\color{#d91a1a}-1.92\%$
test_keys_stack_nested_leaf 1.4785ms 1.2699ms 787.4704 Ops/s 810.0006 Ops/s $\color{#d91a1a}-2.78\%$
test_keys_stack_nested_locked 1.2779ms 0.6948ms 1.4393 KOps/s 1.4572 KOps/s $\color{#d91a1a}-1.23\%$
test_values 7.8988μs 1.1250μs 888.8788 KOps/s 871.9960 KOps/s $\color{#35bf28}+1.94\%$
test_values_nested 0.1048ms 52.0070μs 19.2282 KOps/s 19.1425 KOps/s $\color{#35bf28}+0.45\%$
test_values_nested_locked 0.1221ms 52.2448μs 19.1406 KOps/s 19.0064 KOps/s $\color{#35bf28}+0.71\%$
test_values_nested_leaf 2.6032ms 46.8330μs 21.3525 KOps/s 21.6454 KOps/s $\color{#d91a1a}-1.35\%$
test_values_stack_nested 1.6471ms 1.0432ms 958.5724 Ops/s 972.9275 Ops/s $\color{#d91a1a}-1.48\%$
test_values_stack_nested_leaf 1.9792ms 1.0272ms 973.4857 Ops/s 988.5298 Ops/s $\color{#d91a1a}-1.52\%$
test_values_stack_nested_locked 0.9469ms 0.5131ms 1.9488 KOps/s 1.9548 KOps/s $\color{#d91a1a}-0.31\%$
test_membership 14.9070μs 1.3348μs 749.1965 KOps/s 718.9811 KOps/s $\color{#35bf28}+4.20\%$
test_membership_nested 38.0710μs 2.8439μs 351.6296 KOps/s 344.6652 KOps/s $\color{#35bf28}+2.02\%$
test_membership_nested_leaf 38.5620μs 2.8453μs 351.4528 KOps/s 349.4452 KOps/s $\color{#35bf28}+0.57\%$
test_membership_stacked_nested 60.9530μs 11.9815μs 83.4622 KOps/s 85.1710 KOps/s $\color{#d91a1a}-2.01\%$
test_membership_stacked_nested_leaf 49.2920μs 11.6884μs 85.5551 KOps/s 84.1909 KOps/s $\color{#35bf28}+1.62\%$
test_membership_nested_last 26.8400μs 5.9816μs 167.1800 KOps/s 166.7595 KOps/s $\color{#35bf28}+0.25\%$
test_membership_nested_leaf_last 39.3530μs 6.0065μs 166.4867 KOps/s 167.3470 KOps/s $\color{#d91a1a}-0.51\%$
test_membership_stacked_nested_last 0.3273ms 0.1689ms 5.9206 KOps/s 5.8349 KOps/s $\color{#35bf28}+1.47\%$
test_membership_stacked_nested_leaf_last 0.2450ms 14.4162μs 69.3665 KOps/s 72.1367 KOps/s $\color{#d91a1a}-3.84\%$
test_nested_getleaf 44.7240μs 10.8659μs 92.0307 KOps/s 93.5456 KOps/s $\color{#d91a1a}-1.62\%$
test_nested_get 49.3820μs 10.2801μs 97.2752 KOps/s 99.6836 KOps/s $\color{#d91a1a}-2.42\%$
test_stacked_getleaf 0.6699ms 0.4674ms 2.1393 KOps/s 2.1614 KOps/s $\color{#d91a1a}-1.02\%$
test_stacked_get 0.8581ms 0.4401ms 2.2723 KOps/s 2.3321 KOps/s $\color{#d91a1a}-2.56\%$
test_nested_getitemleaf 44.0620μs 10.8045μs 92.5537 KOps/s 94.8998 KOps/s $\color{#d91a1a}-2.47\%$
test_nested_getitem 29.8060μs 10.2860μs 97.2194 KOps/s 100.3653 KOps/s $\color{#d91a1a}-3.13\%$
test_stacked_getitemleaf 0.8232ms 0.4735ms 2.1119 KOps/s 2.1447 KOps/s $\color{#d91a1a}-1.53\%$
test_stacked_getitem 0.5411ms 0.4381ms 2.2828 KOps/s 2.3170 KOps/s $\color{#d91a1a}-1.47\%$
test_lock_nested 1.3210ms 0.4087ms 2.4466 KOps/s 2.4073 KOps/s $\color{#35bf28}+1.63\%$
test_lock_stack_nested 80.6681ms 6.4514ms 155.0041 Ops/s 146.0287 Ops/s $\textbf{\color{#35bf28}+6.15\%}$
test_unlock_nested 67.3733ms 0.4854ms 2.0600 KOps/s 2.3556 KOps/s $\textbf{\color{#d91a1a}-12.55\%}$
test_unlock_stack_nested 78.4057ms 6.2097ms 161.0378 Ops/s 153.4765 Ops/s $\color{#35bf28}+4.93\%$
test_flatten_speed 0.6599ms 0.3652ms 2.7386 KOps/s 2.7151 KOps/s $\color{#35bf28}+0.86\%$
test_unflatten_speed 0.7214ms 0.4452ms 2.2459 KOps/s 2.2258 KOps/s $\color{#35bf28}+0.91\%$
test_common_ops 4.7912ms 0.6974ms 1.4340 KOps/s 1.4606 KOps/s $\color{#d91a1a}-1.82\%$
test_creation 50.2030μs 1.9724μs 506.9972 KOps/s 502.0436 KOps/s $\color{#35bf28}+0.99\%$
test_creation_empty 43.1530μs 10.7430μs 93.0838 KOps/s 104.9131 KOps/s $\textbf{\color{#d91a1a}-11.28\%}$
test_creation_nested_1 50.1230μs 13.6485μs 73.2681 KOps/s 81.4542 KOps/s $\textbf{\color{#d91a1a}-10.05\%}$
test_creation_nested_2 38.7920μs 18.8144μs 53.1509 KOps/s 57.5487 KOps/s $\textbf{\color{#d91a1a}-7.64\%}$
test_clone 0.1160ms 12.5179μs 79.8859 KOps/s 79.3111 KOps/s $\color{#35bf28}+0.72\%$
test_getitem[int] 33.2920μs 11.9224μs 83.8760 KOps/s 80.9472 KOps/s $\color{#35bf28}+3.62\%$
test_getitem[slice_int] 56.5550μs 23.2746μs 42.9653 KOps/s 42.5363 KOps/s $\color{#35bf28}+1.01\%$
test_getitem[range] 97.7820μs 41.7793μs 23.9353 KOps/s 24.4755 KOps/s $\color{#d91a1a}-2.21\%$
test_getitem[tuple] 73.4370μs 19.3984μs 51.5506 KOps/s 52.5370 KOps/s $\color{#d91a1a}-1.88\%$
test_getitem[list] 0.2521ms 37.4586μs 26.6961 KOps/s 27.2751 KOps/s $\color{#d91a1a}-2.12\%$
test_setitem_dim[int] 51.8970μs 30.0086μs 33.3238 KOps/s 34.2368 KOps/s $\color{#d91a1a}-2.67\%$
test_setitem_dim[slice_int] 0.1018ms 55.7364μs 17.9416 KOps/s 18.3568 KOps/s $\color{#d91a1a}-2.26\%$
test_setitem_dim[range] 0.1102ms 74.3470μs 13.4504 KOps/s 14.0286 KOps/s $\color{#d91a1a}-4.12\%$
test_setitem_dim[tuple] 79.4180μs 44.5545μs 22.4444 KOps/s 22.8861 KOps/s $\color{#d91a1a}-1.93\%$
test_setitem 0.1189ms 18.9180μs 52.8596 KOps/s 54.2526 KOps/s $\color{#d91a1a}-2.57\%$
test_set 0.1033ms 18.5120μs 54.0190 KOps/s 56.7025 KOps/s $\color{#d91a1a}-4.73\%$
test_set_shared 2.4430ms 0.1375ms 7.2718 KOps/s 7.1552 KOps/s $\color{#35bf28}+1.63\%$
test_update 0.1031ms 21.4757μs 46.5642 KOps/s 49.6039 KOps/s $\textbf{\color{#d91a1a}-6.13\%}$
test_update_nested 0.1074ms 29.2988μs 34.1311 KOps/s 37.0837 KOps/s $\textbf{\color{#d91a1a}-7.96\%}$
test_set_nested 0.1076ms 20.2128μs 49.4735 KOps/s 51.6498 KOps/s $\color{#d91a1a}-4.21\%$
test_set_nested_new 0.1341ms 24.3878μs 41.0041 KOps/s 42.8353 KOps/s $\color{#d91a1a}-4.27\%$
test_select 96.2900μs 47.3999μs 21.0971 KOps/s 21.1301 KOps/s $\color{#d91a1a}-0.16\%$
test_unbind_speed 0.6151ms 0.3395ms 2.9457 KOps/s 2.9189 KOps/s $\color{#35bf28}+0.92\%$
test_unbind_speed_stack0 73.0063ms 4.3007ms 232.5205 Ops/s 239.7146 Ops/s $\color{#d91a1a}-3.00\%$
test_unbind_speed_stack1 1.6676μs 0.6424μs 1.5568 MOps/s 1.5755 MOps/s $\color{#d91a1a}-1.19\%$
test_split 1.8881ms 1.5420ms 648.5281 Ops/s 584.3691 Ops/s $\textbf{\color{#35bf28}+10.98\%}$
test_chunk 66.1826ms 1.6576ms 603.2965 Ops/s 595.1360 Ops/s $\color{#35bf28}+1.37\%$
test_creation[device0] 0.5499ms 0.2870ms 3.4841 KOps/s 3.4145 KOps/s $\color{#35bf28}+2.04\%$
test_creation_from_tensor 4.0921ms 0.3253ms 3.0737 KOps/s 2.9258 KOps/s $\textbf{\color{#35bf28}+5.05\%}$
test_add_one[memmap_tensor0] 67.2950μs 25.1326μs 39.7890 KOps/s 38.9728 KOps/s $\color{#35bf28}+2.09\%$
test_contiguous[memmap_tensor0] 38.1810μs 5.7935μs 172.6087 KOps/s 170.0070 KOps/s $\color{#35bf28}+1.53\%$
test_stack[memmap_tensor0] 54.4520μs 18.4624μs 54.1642 KOps/s 51.6502 KOps/s $\color{#35bf28}+4.87\%$
test_memmaptd_index 0.3166ms 0.2039ms 4.9053 KOps/s 4.9868 KOps/s $\color{#d91a1a}-1.64\%$
test_memmaptd_index_astensor 0.5307ms 0.2635ms 3.7955 KOps/s 3.8586 KOps/s $\color{#d91a1a}-1.64\%$
test_memmaptd_index_op 0.7714ms 0.5560ms 1.7987 KOps/s 1.8985 KOps/s $\textbf{\color{#d91a1a}-5.26\%}$
test_serialize_model 0.1709s 0.1059s 9.4471 Ops/s 9.2774 Ops/s $\color{#35bf28}+1.83\%$
test_serialize_model_pickle 0.4510s 0.3772s 2.6513 Ops/s 2.6416 Ops/s $\color{#35bf28}+0.37\%$
test_serialize_weights 0.1048s 95.3262ms 10.4903 Ops/s 9.1886 Ops/s $\textbf{\color{#35bf28}+14.17\%}$
test_serialize_weights_returnearly 0.1265s 0.1209s 8.2700 Ops/s 7.5639 Ops/s $\textbf{\color{#35bf28}+9.34\%}$
test_serialize_weights_pickle 1.0908s 0.5848s 1.7099 Ops/s 2.3858 Ops/s $\textbf{\color{#d91a1a}-28.33\%}$
test_serialize_weights_filesystem 93.0991ms 89.7393ms 11.1434 Ops/s 10.8311 Ops/s $\color{#35bf28}+2.88\%$
test_serialize_model_filesystem 0.1762s 0.1008s 9.9184 Ops/s 9.9489 Ops/s $\color{#d91a1a}-0.31\%$
test_reshape_pytree 49.8530μs 22.8252μs 43.8111 KOps/s 44.6143 KOps/s $\color{#d91a1a}-1.80\%$
test_reshape_td 79.9000μs 29.5880μs 33.7975 KOps/s 34.0551 KOps/s $\color{#d91a1a}-0.76\%$
test_view_pytree 54.4220μs 22.7727μs 43.9123 KOps/s 44.6507 KOps/s $\color{#d91a1a}-1.65\%$
test_view_td 21.0190μs 4.8032μs 208.1961 KOps/s 208.6774 KOps/s $\color{#d91a1a}-0.23\%$
test_unbind_pytree 62.6270μs 26.4110μs 37.8630 KOps/s 38.4855 KOps/s $\color{#d91a1a}-1.62\%$
test_unbind_td 0.1368ms 54.0672μs 18.4955 KOps/s 17.9309 KOps/s $\color{#35bf28}+3.15\%$
test_split_pytree 66.5340μs 26.4105μs 37.8638 KOps/s 35.3622 KOps/s $\textbf{\color{#35bf28}+7.07\%}$
test_split_td 0.5315ms 42.1797μs 23.7081 KOps/s 23.7760 KOps/s $\color{#d91a1a}-0.29\%$
test_add_pytree 79.1480μs 32.3876μs 30.8760 KOps/s 31.2481 KOps/s $\color{#d91a1a}-1.19\%$
test_add_td 0.1862ms 49.0585μs 20.3838 KOps/s 21.3317 KOps/s $\color{#d91a1a}-4.44\%$
test_distributed 28.9440μs 6.1493μs 162.6209 KOps/s 167.6727 KOps/s $\color{#d91a1a}-3.01\%$
test_tdmodule 99.3450μs 23.1295μs 43.2348 KOps/s 45.2053 KOps/s $\color{#d91a1a}-4.36\%$
test_tdmodule_dispatch 0.2342ms 43.1383μs 23.1812 KOps/s 25.1922 KOps/s $\textbf{\color{#d91a1a}-7.98\%}$
test_tdseq 67.0560μs 26.8762μs 37.2076 KOps/s 39.3301 KOps/s $\textbf{\color{#d91a1a}-5.40\%}$
test_tdseq_dispatch 0.1457ms 46.6629μs 21.4303 KOps/s 23.0894 KOps/s $\textbf{\color{#d91a1a}-7.19\%}$
test_instantiation_functorch 1.9878ms 1.3022ms 767.9293 Ops/s 777.1424 Ops/s $\color{#d91a1a}-1.19\%$
test_instantiation_td 1.5118ms 0.9958ms 1.0042 KOps/s 1.0012 KOps/s $\color{#35bf28}+0.30\%$
test_exec_functorch 0.4200ms 0.1710ms 5.8478 KOps/s 6.2639 KOps/s $\textbf{\color{#d91a1a}-6.64\%}$
test_exec_functional_call 0.8743ms 0.1460ms 6.8495 KOps/s 6.8459 KOps/s $\color{#35bf28}+0.05\%$
test_exec_td 0.2614ms 0.1424ms 7.0230 KOps/s 7.0584 KOps/s $\color{#d91a1a}-0.50\%$
test_exec_td_decorator 0.9469ms 0.1771ms 5.6456 KOps/s 5.3536 KOps/s $\textbf{\color{#35bf28}+5.45\%}$
test_vmap_mlp_speed[True-True] 1.2987ms 0.8917ms 1.1215 KOps/s 1.1309 KOps/s $\color{#d91a1a}-0.83\%$
test_vmap_mlp_speed[True-False] 0.7984ms 0.4735ms 2.1119 KOps/s 2.1210 KOps/s $\color{#d91a1a}-0.43\%$
test_vmap_mlp_speed[False-True] 1.1343ms 0.7773ms 1.2865 KOps/s 1.2728 KOps/s $\color{#35bf28}+1.08\%$
test_vmap_mlp_speed[False-False] 0.6023ms 0.3890ms 2.5704 KOps/s 2.6305 KOps/s $\color{#d91a1a}-2.29\%$
test_vmap_mlp_speed_decorator[True-True] 2.4648ms 1.7842ms 560.4735 Ops/s 576.8991 Ops/s $\color{#d91a1a}-2.85\%$
test_vmap_mlp_speed_decorator[True-False] 0.9435ms 0.5190ms 1.9268 KOps/s 1.9359 KOps/s $\color{#d91a1a}-0.47\%$
test_vmap_mlp_speed_decorator[False-True] 2.4061ms 1.5082ms 663.0552 Ops/s 688.3776 Ops/s $\color{#d91a1a}-3.68\%$
test_vmap_mlp_speed_decorator[False-False] 0.7750ms 0.4040ms 2.4751 KOps/s 2.5339 KOps/s $\color{#d91a1a}-2.32\%$

Copy link

github-actions bot commented Jan 8, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 132. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}13$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1232ms 13.0938μs 76.3719 KOps/s 77.9412 KOps/s $\color{#d91a1a}-2.01\%$
test_plain_set_stack_nested 0.2418ms 0.1166ms 8.5728 KOps/s 8.6184 KOps/s $\color{#d91a1a}-0.53\%$
test_plain_set_nested_inplace 39.8600μs 14.2696μs 70.0788 KOps/s 70.9106 KOps/s $\color{#d91a1a}-1.17\%$
test_plain_set_stack_nested_inplace 0.1930ms 0.1447ms 6.9128 KOps/s 6.9545 KOps/s $\color{#d91a1a}-0.60\%$
test_items 31.1110μs 4.6970μs 212.9020 KOps/s 208.3904 KOps/s $\color{#35bf28}+2.16\%$
test_items_nested 0.4096ms 0.3385ms 2.9538 KOps/s 2.9502 KOps/s $\color{#35bf28}+0.12\%$
test_items_nested_locked 0.3761ms 0.3427ms 2.9177 KOps/s 2.9413 KOps/s $\color{#d91a1a}-0.80\%$
test_items_nested_leaf 0.2288ms 0.2020ms 4.9501 KOps/s 5.0408 KOps/s $\color{#d91a1a}-1.80\%$
test_items_stack_nested 1.8752ms 1.3245ms 755.0178 Ops/s 760.8751 Ops/s $\color{#d91a1a}-0.77\%$
test_items_stack_nested_leaf 1.3842ms 1.1452ms 873.1865 Ops/s 881.3791 Ops/s $\color{#d91a1a}-0.93\%$
test_items_stack_nested_locked 0.9226ms 0.8277ms 1.2082 KOps/s 1.2109 KOps/s $\color{#d91a1a}-0.22\%$
test_keys 28.5410μs 4.6570μs 214.7297 KOps/s 217.0462 KOps/s $\color{#d91a1a}-1.07\%$
test_keys_nested 0.7730ms 95.7117μs 10.4480 KOps/s 10.6915 KOps/s $\color{#d91a1a}-2.28\%$
test_keys_nested_locked 0.1193ms 94.4244μs 10.5905 KOps/s 10.7149 KOps/s $\color{#d91a1a}-1.16\%$
test_keys_nested_leaf 0.1951ms 78.2511μs 12.7794 KOps/s 12.9037 KOps/s $\color{#d91a1a}-0.96\%$
test_keys_stack_nested 1.3206ms 1.1650ms 858.3804 Ops/s 854.2094 Ops/s $\color{#35bf28}+0.49\%$
test_keys_stack_nested_leaf 1.2569ms 1.1418ms 875.7802 Ops/s 870.6118 Ops/s $\color{#35bf28}+0.59\%$
test_keys_stack_nested_locked 0.7685ms 0.6442ms 1.5524 KOps/s 1.5379 KOps/s $\color{#35bf28}+0.95\%$
test_values 21.3203μs 1.8926μs 528.3643 KOps/s 527.3411 KOps/s $\color{#35bf28}+0.19\%$
test_values_nested 77.5210μs 44.9664μs 22.2388 KOps/s 22.2166 KOps/s $\color{#35bf28}+0.10\%$
test_values_nested_locked 76.1710μs 47.1497μs 21.2090 KOps/s 21.0815 KOps/s $\color{#35bf28}+0.61\%$
test_values_nested_leaf 58.9510μs 39.2277μs 25.4922 KOps/s 25.3003 KOps/s $\color{#35bf28}+0.76\%$
test_values_stack_nested 1.1748ms 0.9769ms 1.0237 KOps/s 1.0355 KOps/s $\color{#d91a1a}-1.15\%$
test_values_stack_nested_leaf 1.0639ms 0.9605ms 1.0412 KOps/s 1.0462 KOps/s $\color{#d91a1a}-0.48\%$
test_values_stack_nested_locked 0.6864ms 0.5072ms 1.9714 KOps/s 1.9864 KOps/s $\color{#d91a1a}-0.75\%$
test_membership 6.0022μs 0.9461μs 1.0570 MOps/s 1.0509 MOps/s $\color{#35bf28}+0.59\%$
test_membership_nested 22.4100μs 2.3096μs 432.9684 KOps/s 432.6735 KOps/s $\color{#35bf28}+0.07\%$
test_membership_nested_leaf 27.8850μs 2.2196μs 450.5376 KOps/s 445.2258 KOps/s $\color{#35bf28}+1.19\%$
test_membership_stacked_nested 50.1510μs 11.1130μs 89.9851 KOps/s 91.0834 KOps/s $\color{#d91a1a}-1.21\%$
test_membership_stacked_nested_leaf 38.5900μs 11.0768μs 90.2789 KOps/s 91.1073 KOps/s $\color{#d91a1a}-0.91\%$
test_membership_nested_last 26.0010μs 4.7766μs 209.3554 KOps/s 212.7732 KOps/s $\color{#d91a1a}-1.61\%$
test_membership_nested_leaf_last 27.7910μs 4.7986μs 208.3948 KOps/s 214.3506 KOps/s $\color{#d91a1a}-2.78\%$
test_membership_stacked_nested_last 0.1681ms 0.1367ms 7.3153 KOps/s 7.4192 KOps/s $\color{#d91a1a}-1.40\%$
test_membership_stacked_nested_leaf_last 31.8010μs 13.0155μs 76.8317 KOps/s 79.4118 KOps/s $\color{#d91a1a}-3.25\%$
test_nested_getleaf 77.7610μs 8.4142μs 118.8468 KOps/s 119.3669 KOps/s $\color{#d91a1a}-0.44\%$
test_nested_get 0.1974ms 7.9538μs 125.7257 KOps/s 126.0070 KOps/s $\color{#d91a1a}-0.22\%$
test_stacked_getleaf 0.5118ms 0.4004ms 2.4977 KOps/s 2.5629 KOps/s $\color{#d91a1a}-2.55\%$
test_stacked_get 0.5113ms 0.3666ms 2.7278 KOps/s 2.7826 KOps/s $\color{#d91a1a}-1.97\%$
test_nested_getitemleaf 0.1763ms 8.4655μs 118.1262 KOps/s 118.6851 KOps/s $\color{#d91a1a}-0.47\%$
test_nested_getitem 28.3610μs 8.0046μs 124.9281 KOps/s 125.2447 KOps/s $\color{#d91a1a}-0.25\%$
test_stacked_getitemleaf 0.4589ms 0.4001ms 2.4995 KOps/s 2.5661 KOps/s $\color{#d91a1a}-2.60\%$
test_stacked_getitem 0.4098ms 0.3621ms 2.7613 KOps/s 2.7563 KOps/s $\color{#35bf28}+0.18\%$
test_lock_nested 4.4665ms 0.4202ms 2.3797 KOps/s 2.4024 KOps/s $\color{#d91a1a}-0.94\%$
test_lock_stack_nested 84.5639ms 6.5957ms 151.6138 Ops/s 152.9391 Ops/s $\color{#d91a1a}-0.87\%$
test_unlock_nested 0.9789ms 0.4137ms 2.4175 KOps/s 2.4137 KOps/s $\color{#35bf28}+0.16\%$
test_unlock_stack_nested 84.2137ms 6.9053ms 144.8159 Ops/s 144.1511 Ops/s $\color{#35bf28}+0.46\%$
test_flatten_speed 0.7928ms 0.2628ms 3.8050 KOps/s 3.8193 KOps/s $\color{#d91a1a}-0.37\%$
test_unflatten_speed 0.4733ms 0.3599ms 2.7784 KOps/s 2.8362 KOps/s $\color{#d91a1a}-2.04\%$
test_common_ops 1.0441ms 0.6078ms 1.6452 KOps/s 1.6905 KOps/s $\color{#d91a1a}-2.68\%$
test_creation 18.2000μs 1.6266μs 614.7678 KOps/s 608.6187 KOps/s $\color{#35bf28}+1.01\%$
test_creation_empty 0.1659ms 7.3445μs 136.1564 KOps/s 145.2596 KOps/s $\textbf{\color{#d91a1a}-6.27\%}$
test_creation_nested_1 39.9400μs 9.4145μs 106.2186 KOps/s 112.3272 KOps/s $\textbf{\color{#d91a1a}-5.44\%}$
test_creation_nested_2 34.2800μs 14.1360μs 70.7414 KOps/s 74.6125 KOps/s $\textbf{\color{#d91a1a}-5.19\%}$
test_clone 0.1184ms 13.4240μs 74.4932 KOps/s 77.3726 KOps/s $\color{#d91a1a}-3.72\%$
test_getitem[int] 0.1869ms 11.4001μs 87.7182 KOps/s 86.8105 KOps/s $\color{#35bf28}+1.05\%$
test_getitem[slice_int] 0.1420ms 21.3636μs 46.8086 KOps/s 44.3546 KOps/s $\textbf{\color{#35bf28}+5.53\%}$
test_getitem[range] 0.2230ms 37.2887μs 26.8178 KOps/s 27.2017 KOps/s $\color{#d91a1a}-1.41\%$
test_getitem[tuple] 45.8210μs 19.0740μs 52.4274 KOps/s 51.3680 KOps/s $\color{#35bf28}+2.06\%$
test_getitem[list] 0.3629ms 35.4175μs 28.2346 KOps/s 29.5457 KOps/s $\color{#d91a1a}-4.44\%$
test_setitem_dim[int] 43.4000μs 27.7243μs 36.0695 KOps/s 38.2716 KOps/s $\textbf{\color{#d91a1a}-5.75\%}$
test_setitem_dim[slice_int] 65.0610μs 48.0062μs 20.8306 KOps/s 20.7832 KOps/s $\color{#35bf28}+0.23\%$
test_setitem_dim[range] 87.8210μs 66.5174μs 15.0337 KOps/s 16.0924 KOps/s $\textbf{\color{#d91a1a}-6.58\%}$
test_setitem_dim[tuple] 60.3610μs 43.3129μs 23.0878 KOps/s 24.1346 KOps/s $\color{#d91a1a}-4.34\%$
test_setitem 0.1525ms 19.5847μs 51.0602 KOps/s 59.5605 KOps/s $\textbf{\color{#d91a1a}-14.27\%}$
test_set 0.1683ms 18.8113μs 53.1596 KOps/s 61.2058 KOps/s $\textbf{\color{#d91a1a}-13.15\%}$
test_set_shared 2.6470ms 0.1047ms 9.5469 KOps/s 9.7795 KOps/s $\color{#d91a1a}-2.38\%$
test_update 0.1560ms 19.4121μs 51.5142 KOps/s 55.2490 KOps/s $\textbf{\color{#d91a1a}-6.76\%}$
test_update_nested 0.1371ms 25.5576μs 39.1274 KOps/s 40.7269 KOps/s $\color{#d91a1a}-3.93\%$
test_set_nested 0.1215ms 18.6520μs 53.6136 KOps/s 57.0502 KOps/s $\textbf{\color{#d91a1a}-6.02\%}$
test_set_nested_new 0.1307ms 21.9149μs 45.6310 KOps/s 48.2942 KOps/s $\textbf{\color{#d91a1a}-5.51\%}$
test_select 0.1835ms 46.1001μs 21.6919 KOps/s 23.2999 KOps/s $\textbf{\color{#d91a1a}-6.90\%}$
test_to 73.2710μs 53.8888μs 18.5567 KOps/s 18.7061 KOps/s $\color{#d91a1a}-0.80\%$
test_to_nonblocking 0.2028ms 34.2754μs 29.1754 KOps/s 30.2487 KOps/s $\color{#d91a1a}-3.55\%$
test_unbind_speed 0.3846ms 0.3328ms 3.0046 KOps/s 2.9692 KOps/s $\color{#35bf28}+1.19\%$
test_unbind_speed_stack0 81.0862ms 3.7993ms 263.2093 Ops/s 261.5643 Ops/s $\color{#35bf28}+0.63\%$
test_unbind_speed_stack1 1.5720μs 0.5169μs 1.9346 MOps/s 1.8986 MOps/s $\color{#35bf28}+1.90\%$
test_split 1.8903ms 1.5851ms 630.8841 Ops/s 575.9320 Ops/s $\textbf{\color{#35bf28}+9.54\%}$
test_chunk 76.1696ms 1.7076ms 585.6065 Ops/s 585.0677 Ops/s $\color{#35bf28}+0.09\%$
test_creation[device0] 0.4510ms 0.3105ms 3.2202 KOps/s 3.2190 KOps/s $\color{#35bf28}+0.04\%$
test_creation[device1] 0.4573ms 0.3137ms 3.1876 KOps/s 3.1566 KOps/s $\color{#35bf28}+0.98\%$
test_creation_from_tensor 78.7456ms 0.3788ms 2.6396 KOps/s 2.8631 KOps/s $\textbf{\color{#d91a1a}-7.81\%}$
test_add_one[memmap_tensor0] 0.1340ms 25.3946μs 39.3785 KOps/s 41.4228 KOps/s $\color{#d91a1a}-4.94\%$
test_add_one[memmap_tensor1] 0.2125ms 72.6278μs 13.7688 KOps/s 13.8099 KOps/s $\color{#d91a1a}-0.30\%$
test_contiguous[memmap_tensor0] 21.9110μs 5.9722μs 167.4418 KOps/s 169.2885 KOps/s $\color{#d91a1a}-1.09\%$
test_contiguous[memmap_tensor1] 0.1774ms 22.2406μs 44.9629 KOps/s 44.8880 KOps/s $\color{#35bf28}+0.17\%$
test_stack[memmap_tensor0] 0.1136ms 19.8363μs 50.4127 KOps/s 46.2560 KOps/s $\textbf{\color{#35bf28}+8.99\%}$
test_stack[memmap_tensor1] 0.1067ms 71.0701μs 14.0706 KOps/s 13.0099 KOps/s $\textbf{\color{#35bf28}+8.15\%}$
test_memmaptd_index 0.3162ms 0.2420ms 4.1317 KOps/s 4.1732 KOps/s $\color{#d91a1a}-0.99\%$
test_memmaptd_index_astensor 0.3679ms 0.3028ms 3.3021 KOps/s 3.3496 KOps/s $\color{#d91a1a}-1.42\%$
test_memmaptd_index_op 0.6376ms 0.5783ms 1.7293 KOps/s 1.7645 KOps/s $\color{#d91a1a}-1.99\%$
test_serialize_model 0.1719s 97.0207ms 10.3071 Ops/s 10.7795 Ops/s $\color{#d91a1a}-4.38\%$
test_serialize_model_pickle 1.3481s 1.2360s 0.8091 Ops/s 0.8083 Ops/s $\color{#35bf28}+0.10\%$
test_serialize_weights 89.2894ms 85.8116ms 11.6534 Ops/s 10.0581 Ops/s $\textbf{\color{#35bf28}+15.86\%}$
test_serialize_weights_returnearly 69.4865ms 57.4317ms 17.4120 Ops/s 12.7434 Ops/s $\textbf{\color{#35bf28}+36.64\%}$
test_serialize_weights_pickle 1.3946s 1.2414s 0.8055 Ops/s 0.8093 Ops/s $\color{#d91a1a}-0.46\%$
test_reshape_pytree 0.2245ms 24.0091μs 41.6509 KOps/s 42.4043 KOps/s $\color{#d91a1a}-1.78\%$
test_reshape_td 0.1494ms 29.5906μs 33.7946 KOps/s 34.6570 KOps/s $\color{#d91a1a}-2.49\%$
test_view_pytree 0.1502ms 23.7286μs 42.1432 KOps/s 43.1837 KOps/s $\color{#d91a1a}-2.41\%$
test_view_td 0.2098ms 4.0676μs 245.8428 KOps/s 247.7353 KOps/s $\color{#d91a1a}-0.76\%$
test_unbind_pytree 0.1313ms 29.5690μs 33.8192 KOps/s 32.6686 KOps/s $\color{#35bf28}+3.52\%$
test_unbind_td 0.2414ms 51.9172μs 19.2615 KOps/s 19.1361 KOps/s $\color{#35bf28}+0.65\%$
test_split_pytree 44.2110μs 28.3623μs 35.2581 KOps/s 36.7124 KOps/s $\color{#d91a1a}-3.96\%$
test_split_td 0.6625ms 41.4459μs 24.1278 KOps/s 24.1893 KOps/s $\color{#d91a1a}-0.25\%$
test_add_pytree 0.2068ms 36.1715μs 27.6461 KOps/s 28.4940 KOps/s $\color{#d91a1a}-2.98\%$
test_add_td 0.2740ms 47.5810μs 21.0168 KOps/s 23.2126 KOps/s $\textbf{\color{#d91a1a}-9.46\%}$
test_distributed 22.4110μs 5.5606μs 179.8381 KOps/s 178.7752 KOps/s $\color{#35bf28}+0.59\%$
test_tdmodule 96.7810μs 17.0671μs 58.5922 KOps/s 59.1967 KOps/s $\color{#d91a1a}-1.02\%$
test_tdmodule_dispatch 0.1214ms 32.4304μs 30.8352 KOps/s 30.8583 KOps/s $\color{#d91a1a}-0.07\%$
test_tdseq 34.8010μs 20.1107μs 49.7247 KOps/s 50.8519 KOps/s $\color{#d91a1a}-2.22\%$
test_tdseq_dispatch 0.2483ms 36.2689μs 27.5718 KOps/s 28.3566 KOps/s $\color{#d91a1a}-2.77\%$
test_instantiation_functorch 1.8227ms 1.6680ms 599.5106 Ops/s 597.5635 Ops/s $\color{#35bf28}+0.33\%$
test_instantiation_td 1.7025ms 1.1648ms 858.5230 Ops/s 849.5544 Ops/s $\color{#35bf28}+1.06\%$
test_exec_functorch 0.3907ms 0.1622ms 6.1643 KOps/s 6.3903 KOps/s $\color{#d91a1a}-3.54\%$
test_exec_functional_call 0.3908ms 0.1606ms 6.2259 KOps/s 6.4398 KOps/s $\color{#d91a1a}-3.32\%$
test_exec_td 0.1792ms 0.1505ms 6.6431 KOps/s 6.8052 KOps/s $\color{#d91a1a}-2.38\%$
test_exec_td_decorator 0.8342ms 0.1899ms 5.2667 KOps/s 5.3275 KOps/s $\color{#d91a1a}-1.14\%$
test_vmap_mlp_speed[True-True] 1.3266ms 1.1083ms 902.2998 Ops/s 907.4048 Ops/s $\color{#d91a1a}-0.56\%$
test_vmap_mlp_speed[True-False] 0.9012ms 0.6582ms 1.5194 KOps/s 1.5306 KOps/s $\color{#d91a1a}-0.73\%$
test_vmap_mlp_speed[False-True] 1.2390ms 1.0162ms 984.0746 Ops/s 992.4719 Ops/s $\color{#d91a1a}-0.85\%$
test_vmap_mlp_speed[False-False] 0.7898ms 0.5880ms 1.7007 KOps/s 1.7236 KOps/s $\color{#d91a1a}-1.32\%$
test_vmap_mlp_speed_decorator[True-True] 2.8027ms 2.0739ms 482.1922 Ops/s 485.9867 Ops/s $\color{#d91a1a}-0.78\%$
test_vmap_mlp_speed_decorator[True-False] 1.1794ms 0.7060ms 1.4165 KOps/s 1.4068 KOps/s $\color{#35bf28}+0.69\%$
test_vmap_mlp_speed_decorator[False-True] 2.1923ms 1.8087ms 552.8883 Ops/s 553.3530 Ops/s $\color{#d91a1a}-0.08\%$
test_vmap_mlp_speed_decorator[False-False] 1.0173ms 0.6071ms 1.6471 KOps/s 1.6637 KOps/s $\color{#d91a1a}-1.00\%$
test_vmap_transformer_speed[True-True] 12.6843ms 12.3516ms 80.9613 Ops/s 78.0983 Ops/s $\color{#35bf28}+3.67\%$
test_vmap_transformer_speed[True-False] 8.3255ms 8.1342ms 122.9374 Ops/s 121.1097 Ops/s $\color{#35bf28}+1.51\%$
test_vmap_transformer_speed[False-True] 12.3565ms 12.1800ms 82.1021 Ops/s 80.9019 Ops/s $\color{#35bf28}+1.48\%$
test_vmap_transformer_speed[False-False] 8.6453ms 8.0649ms 123.9946 Ops/s 122.4442 Ops/s $\color{#35bf28}+1.27\%$
test_vmap_transformer_speed_decorator[True-True] 64.5487ms 63.3144ms 15.7942 Ops/s 15.4707 Ops/s $\color{#35bf28}+2.09\%$
test_vmap_transformer_speed_decorator[True-False] 20.7659ms 19.5487ms 51.1542 Ops/s 45.9004 Ops/s $\textbf{\color{#35bf28}+11.45\%}$
test_vmap_transformer_speed_decorator[False-True] 59.0637ms 57.5735ms 17.3691 Ops/s 17.1104 Ops/s $\color{#35bf28}+1.51\%$
test_vmap_transformer_speed_decorator[False-False] 21.0059ms 19.1376ms 52.2531 Ops/s 51.6259 Ops/s $\color{#35bf28}+1.21\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. versioning
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants