Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Doc] Doc revamp #593

Merged
merged 5 commits into from
Jan 4, 2024
Merged

[Doc] Doc revamp #593

merged 5 commits into from
Jan 4, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Dec 8, 2023

Refresh doc

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 8, 2023
Copy link

github-actions bot commented Dec 8, 2023

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 120. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}11$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 31.9290μs 16.2967μs 61.3621 KOps/s 63.5656 KOps/s $\color{#d91a1a}-3.47\%$
test_plain_set_stack_nested 0.1930ms 0.1398ms 7.1545 KOps/s 7.0651 KOps/s $\color{#35bf28}+1.27\%$
test_plain_set_nested_inplace 42.7900μs 18.3514μs 54.4917 KOps/s 54.3035 KOps/s $\color{#35bf28}+0.35\%$
test_plain_set_stack_nested_inplace 0.2424ms 0.1754ms 5.7028 KOps/s 5.6564 KOps/s $\color{#35bf28}+0.82\%$
test_items 38.3120μs 2.3866μs 419.0083 KOps/s 426.2874 KOps/s $\color{#d91a1a}-1.71\%$
test_items_nested 0.4780ms 0.2764ms 3.6176 KOps/s 3.6723 KOps/s $\color{#d91a1a}-1.49\%$
test_items_nested_locked 0.7153ms 0.2733ms 3.6591 KOps/s 3.5868 KOps/s $\color{#35bf28}+2.02\%$
test_items_nested_leaf 0.4078ms 0.1722ms 5.8084 KOps/s 5.8167 KOps/s $\color{#d91a1a}-0.14\%$
test_items_stack_nested 1.8390ms 1.3130ms 761.6310 Ops/s 733.7796 Ops/s $\color{#35bf28}+3.80\%$
test_items_stack_nested_leaf 1.3063ms 1.1791ms 848.1372 Ops/s 845.6460 Ops/s $\color{#35bf28}+0.29\%$
test_items_stack_nested_locked 1.3647ms 0.7733ms 1.2931 KOps/s 1.2429 KOps/s $\color{#35bf28}+4.04\%$
test_keys 22.3920μs 3.7937μs 263.5949 KOps/s 253.1590 KOps/s $\color{#35bf28}+4.12\%$
test_keys_nested 53.1745ms 0.1547ms 6.4645 KOps/s 6.5892 KOps/s $\color{#d91a1a}-1.89\%$
test_keys_nested_locked 0.1990ms 0.1433ms 6.9781 KOps/s 6.7412 KOps/s $\color{#35bf28}+3.51\%$
test_keys_nested_leaf 0.2997ms 0.1271ms 7.8705 KOps/s 7.7048 KOps/s $\color{#35bf28}+2.15\%$
test_keys_stack_nested 1.9892ms 1.2755ms 784.0164 Ops/s 770.4502 Ops/s $\color{#35bf28}+1.76\%$
test_keys_stack_nested_leaf 1.3682ms 1.2670ms 789.2608 Ops/s 784.5296 Ops/s $\color{#35bf28}+0.60\%$
test_keys_stack_nested_locked 0.8092ms 0.6963ms 1.4362 KOps/s 1.4142 KOps/s $\color{#35bf28}+1.56\%$
test_values 16.6710μs 1.2921μs 773.9279 KOps/s 893.1039 KOps/s $\textbf{\color{#d91a1a}-13.34\%}$
test_values_nested 0.1707ms 51.5368μs 19.4036 KOps/s 19.5098 KOps/s $\color{#d91a1a}-0.54\%$
test_values_nested_locked 97.0300μs 52.3219μs 19.1125 KOps/s 19.3996 KOps/s $\color{#d91a1a}-1.48\%$
test_values_nested_leaf 91.4800μs 45.9019μs 21.7856 KOps/s 21.8375 KOps/s $\color{#d91a1a}-0.24\%$
test_values_stack_nested 1.3829ms 1.0292ms 971.6753 Ops/s 968.2122 Ops/s $\color{#35bf28}+0.36\%$
test_values_stack_nested_leaf 1.2453ms 1.0276ms 973.1666 Ops/s 976.8860 Ops/s $\color{#d91a1a}-0.38\%$
test_values_stack_nested_locked 0.6969ms 0.5091ms 1.9644 KOps/s 1.9244 KOps/s $\color{#35bf28}+2.08\%$
test_membership 33.4530μs 1.3250μs 754.7389 KOps/s 744.2150 KOps/s $\color{#35bf28}+1.41\%$
test_membership_nested 21.1590μs 2.9259μs 341.7769 KOps/s 350.9760 KOps/s $\color{#d91a1a}-2.62\%$
test_membership_nested_leaf 38.6320μs 2.9431μs 339.7835 KOps/s 348.0343 KOps/s $\color{#d91a1a}-2.37\%$
test_membership_stacked_nested 32.0000μs 11.7353μs 85.2130 KOps/s 86.4575 KOps/s $\color{#d91a1a}-1.44\%$
test_membership_stacked_nested_leaf 55.6940μs 11.6720μs 85.6750 KOps/s 84.0667 KOps/s $\color{#35bf28}+1.91\%$
test_membership_nested_last 42.9700μs 6.0962μs 164.0376 KOps/s 167.6850 KOps/s $\color{#d91a1a}-2.18\%$
test_membership_nested_leaf_last 27.0510μs 6.0936μs 164.1068 KOps/s 166.2546 KOps/s $\color{#d91a1a}-1.29\%$
test_membership_stacked_nested_last 0.3806ms 0.1678ms 5.9591 KOps/s 5.9689 KOps/s $\color{#d91a1a}-0.16\%$
test_membership_stacked_nested_leaf_last 37.5590μs 13.6684μs 73.1613 KOps/s 74.0093 KOps/s $\color{#d91a1a}-1.15\%$
test_nested_getleaf 55.5430μs 10.6391μs 93.9933 KOps/s 94.0401 KOps/s $\color{#d91a1a}-0.05\%$
test_nested_get 46.4460μs 10.0163μs 99.8377 KOps/s 98.0131 KOps/s $\color{#35bf28}+1.86\%$
test_stacked_getleaf 1.0467ms 0.4686ms 2.1341 KOps/s 2.1688 KOps/s $\color{#d91a1a}-1.60\%$
test_stacked_get 0.5126ms 0.4299ms 2.3263 KOps/s 2.3288 KOps/s $\color{#d91a1a}-0.10\%$
test_nested_getitemleaf 93.4540μs 10.5523μs 94.7662 KOps/s 94.5564 KOps/s $\color{#35bf28}+0.22\%$
test_nested_getitem 0.1188ms 10.2133μs 97.9116 KOps/s 99.7426 KOps/s $\color{#d91a1a}-1.84\%$
test_stacked_getitemleaf 0.5945ms 0.4665ms 2.1436 KOps/s 2.1522 KOps/s $\color{#d91a1a}-0.40\%$
test_stacked_getitem 0.5213ms 0.4353ms 2.2973 KOps/s 2.3099 KOps/s $\color{#d91a1a}-0.55\%$
test_lock_nested 1.2865ms 0.4100ms 2.4390 KOps/s 2.4216 KOps/s $\color{#35bf28}+0.72\%$
test_lock_stack_nested 77.4025ms 6.5787ms 152.0057 Ops/s 151.4823 Ops/s $\color{#35bf28}+0.35\%$
test_unlock_nested 62.9388ms 0.4771ms 2.0960 KOps/s 2.3958 KOps/s $\textbf{\color{#d91a1a}-12.51\%}$
test_unlock_stack_nested 80.8193ms 6.2050ms 161.1604 Ops/s 161.7092 Ops/s $\color{#d91a1a}-0.34\%$
test_flatten_speed 0.7532ms 0.3598ms 2.7797 KOps/s 2.7426 KOps/s $\color{#35bf28}+1.35\%$
test_unflatten_speed 0.5262ms 0.4499ms 2.2226 KOps/s 2.2084 KOps/s $\color{#35bf28}+0.64\%$
test_common_ops 1.1722ms 0.6764ms 1.4785 KOps/s 1.5136 KOps/s $\color{#d91a1a}-2.32\%$
test_creation 29.8160μs 1.9406μs 515.3116 KOps/s 506.9737 KOps/s $\color{#35bf28}+1.64\%$
test_creation_empty 29.2650μs 9.2237μs 108.4169 KOps/s 124.5057 KOps/s $\textbf{\color{#d91a1a}-12.92\%}$
test_creation_nested_1 42.6990μs 12.1999μs 81.9678 KOps/s 91.4557 KOps/s $\textbf{\color{#d91a1a}-10.37\%}$
test_creation_nested_2 48.2600μs 17.2325μs 58.0298 KOps/s 61.8415 KOps/s $\textbf{\color{#d91a1a}-6.16\%}$
test_clone 0.1013ms 12.4547μs 80.2911 KOps/s 82.4987 KOps/s $\color{#d91a1a}-2.68\%$
test_getitem[int] 45.8850μs 12.2673μs 81.5173 KOps/s 85.0562 KOps/s $\color{#d91a1a}-4.16\%$
test_getitem[slice_int] 59.2800μs 23.1194μs 43.2538 KOps/s 42.8848 KOps/s $\color{#35bf28}+0.86\%$
test_getitem[range] 0.1258ms 41.3201μs 24.2013 KOps/s 23.2534 KOps/s $\color{#35bf28}+4.08\%$
test_getitem[tuple] 51.6860μs 19.0898μs 52.3840 KOps/s 53.8845 KOps/s $\color{#d91a1a}-2.78\%$
test_getitem[list] 0.2980ms 36.3563μs 27.5056 KOps/s 26.1909 KOps/s $\textbf{\color{#35bf28}+5.02\%}$
test_setitem_dim[int] 0.1022ms 29.1407μs 34.3163 KOps/s 34.5376 KOps/s $\color{#d91a1a}-0.64\%$
test_setitem_dim[slice_int] 0.1411ms 55.6436μs 17.9715 KOps/s 18.0338 KOps/s $\color{#d91a1a}-0.35\%$
test_setitem_dim[range] 0.1160ms 73.1337μs 13.6736 KOps/s 13.7036 KOps/s $\color{#d91a1a}-0.22\%$
test_setitem_dim[tuple] 81.5220μs 44.0515μs 22.7007 KOps/s 23.4152 KOps/s $\color{#d91a1a}-3.05\%$
test_setitem 0.2301ms 18.0899μs 55.2796 KOps/s 57.3817 KOps/s $\color{#d91a1a}-3.66\%$
test_set 0.2087ms 17.3535μs 57.6253 KOps/s 59.0520 KOps/s $\color{#d91a1a}-2.42\%$
test_set_shared 2.2473ms 0.1389ms 7.1976 KOps/s 7.2575 KOps/s $\color{#d91a1a}-0.82\%$
test_update 0.2323ms 20.2929μs 49.2782 KOps/s 52.4178 KOps/s $\textbf{\color{#d91a1a}-5.99\%}$
test_update_nested 0.2076ms 27.2302μs 36.7239 KOps/s 38.3256 KOps/s $\color{#d91a1a}-4.18\%$
test_set_nested 0.1099ms 19.1995μs 52.0846 KOps/s 54.5778 KOps/s $\color{#d91a1a}-4.57\%$
test_set_nested_new 0.2066ms 23.7460μs 42.1124 KOps/s 42.5945 KOps/s $\color{#d91a1a}-1.13\%$
test_select 0.2038ms 46.9572μs 21.2960 KOps/s 21.8006 KOps/s $\color{#d91a1a}-2.31\%$
test_unbind_speed 0.4085ms 0.3388ms 2.9519 KOps/s 2.9526 KOps/s $\color{#d91a1a}-0.02\%$
test_unbind_speed_stack0 74.7537ms 4.1668ms 239.9930 Ops/s 229.1001 Ops/s $\color{#35bf28}+4.75\%$
test_unbind_speed_stack1 1.8044μs 0.6287μs 1.5905 MOps/s 1.5949 MOps/s $\color{#d91a1a}-0.27\%$
test_split 64.0930ms 1.6803ms 595.1484 Ops/s 600.8809 Ops/s $\color{#d91a1a}-0.95\%$
test_chunk 62.2187ms 1.6502ms 605.9705 Ops/s 649.3049 Ops/s $\textbf{\color{#d91a1a}-6.67\%}$
test_creation[device0] 0.4413ms 0.2929ms 3.4140 KOps/s 3.3482 KOps/s $\color{#35bf28}+1.96\%$
test_creation_from_tensor 1.8859ms 0.3272ms 3.0558 KOps/s 3.0006 KOps/s $\color{#35bf28}+1.84\%$
test_add_one[memmap_tensor0] 71.0120μs 25.3160μs 39.5007 KOps/s 38.6528 KOps/s $\color{#35bf28}+2.19\%$
test_contiguous[memmap_tensor0] 30.0560μs 5.7583μs 173.6625 KOps/s 174.7171 KOps/s $\color{#d91a1a}-0.60\%$
test_stack[memmap_tensor0] 54.7920μs 18.7006μs 53.4743 KOps/s 50.2496 KOps/s $\textbf{\color{#35bf28}+6.42\%}$
test_memmaptd_index 0.2653ms 0.1998ms 5.0054 KOps/s 5.0592 KOps/s $\color{#d91a1a}-1.06\%$
test_memmaptd_index_astensor 0.3545ms 0.2583ms 3.8711 KOps/s 3.8988 KOps/s $\color{#d91a1a}-0.71\%$
test_memmaptd_index_op 1.0461ms 0.5271ms 1.8973 KOps/s 1.8469 KOps/s $\color{#35bf28}+2.73\%$
test_serialize_model 0.1667s 0.1056s 9.4672 Ops/s 8.9997 Ops/s $\textbf{\color{#35bf28}+5.19\%}$
test_serialize_model_filesystem 0.1532s 97.3967ms 10.2673 Ops/s 10.6964 Ops/s $\color{#d91a1a}-4.01\%$
test_serialize_model_pickle 0.4471s 0.3737s 2.6763 Ops/s 2.5828 Ops/s $\color{#35bf28}+3.62\%$
test_serialize_weights 0.1661s 0.1049s 9.5325 Ops/s 10.2747 Ops/s $\textbf{\color{#d91a1a}-7.22\%}$
test_serialize_weights_filesystem 94.4273ms 89.6608ms 11.1531 Ops/s 10.7585 Ops/s $\color{#35bf28}+3.67\%$
test_serialize_weights_returnearly 0.1908s 0.1291s 7.7440 Ops/s 8.3140 Ops/s $\textbf{\color{#d91a1a}-6.86\%}$
test_serialize_weights_pickle 0.8915s 0.5698s 1.7550 Ops/s 1.4081 Ops/s $\textbf{\color{#35bf28}+24.63\%}$
test_reshape_pytree 66.3640μs 22.8407μs 43.7814 KOps/s 42.8764 KOps/s $\color{#35bf28}+2.11\%$
test_reshape_td 66.1030μs 29.5938μs 33.7908 KOps/s 33.3340 KOps/s $\color{#35bf28}+1.37\%$
test_view_pytree 62.4860μs 22.6209μs 44.2070 KOps/s 43.3940 KOps/s $\color{#35bf28}+1.87\%$
test_view_td 22.3510μs 4.7984μs 208.4008 KOps/s 207.4589 KOps/s $\color{#35bf28}+0.45\%$
test_unbind_pytree 66.3940μs 26.0232μs 38.4272 KOps/s 37.7238 KOps/s $\color{#35bf28}+1.86\%$
test_unbind_td 0.1309ms 54.9001μs 18.2149 KOps/s 18.4732 KOps/s $\color{#d91a1a}-1.40\%$
test_split_pytree 57.7870μs 25.9245μs 38.5735 KOps/s 38.2554 KOps/s $\color{#35bf28}+0.83\%$
test_split_td 0.5148ms 43.9386μs 22.7590 KOps/s 23.4793 KOps/s $\color{#d91a1a}-3.07\%$
test_add_pytree 0.1175ms 31.9653μs 31.2839 KOps/s 31.2707 KOps/s $\color{#35bf28}+0.04\%$
test_add_td 93.2540μs 45.6873μs 21.8879 KOps/s 22.7304 KOps/s $\color{#d91a1a}-3.71\%$
test_distributed 24.7060μs 6.0567μs 165.1058 KOps/s 165.9869 KOps/s $\color{#d91a1a}-0.53\%$
test_tdmodule 0.1763ms 22.5847μs 44.2778 KOps/s 47.8455 KOps/s $\textbf{\color{#d91a1a}-7.46\%}$
test_tdmodule_dispatch 0.2131ms 41.3198μs 24.2015 KOps/s 25.3125 KOps/s $\color{#d91a1a}-4.39\%$
test_tdseq 52.6180μs 25.0068μs 39.9892 KOps/s 41.9414 KOps/s $\color{#d91a1a}-4.65\%$
test_tdseq_dispatch 0.1373ms 44.4674μs 22.4884 KOps/s 23.2555 KOps/s $\color{#d91a1a}-3.30\%$
test_instantiation_functorch 1.4231ms 1.2915ms 774.3173 Ops/s 770.0596 Ops/s $\color{#35bf28}+0.55\%$
test_instantiation_td 1.5753ms 0.9881ms 1.0121 KOps/s 1.0053 KOps/s $\color{#35bf28}+0.68\%$
test_exec_functorch 0.2657ms 0.1577ms 6.3421 KOps/s 6.4143 KOps/s $\color{#d91a1a}-1.13\%$
test_exec_functional_call 0.2582ms 0.1435ms 6.9702 KOps/s 6.7527 KOps/s $\color{#35bf28}+3.22\%$
test_exec_td 0.2176ms 0.1427ms 7.0069 KOps/s 7.0445 KOps/s $\color{#d91a1a}-0.53\%$
test_exec_td_decorator 0.7795ms 0.1749ms 5.7187 KOps/s 5.8643 KOps/s $\color{#d91a1a}-2.48\%$
test_vmap_mlp_speed[True-True] 1.4178ms 0.8972ms 1.1146 KOps/s 1.1392 KOps/s $\color{#d91a1a}-2.16\%$
test_vmap_mlp_speed[True-False] 0.6539ms 0.4707ms 2.1243 KOps/s 2.1703 KOps/s $\color{#d91a1a}-2.12\%$
test_vmap_mlp_speed[False-True] 1.1029ms 0.7714ms 1.2964 KOps/s 1.3126 KOps/s $\color{#d91a1a}-1.23\%$
test_vmap_mlp_speed[False-False] 0.6463ms 0.3831ms 2.6100 KOps/s 2.6288 KOps/s $\color{#d91a1a}-0.71\%$
test_vmap_mlp_speed_decorator[True-True] 75.6249ms 1.8976ms 526.9709 Ops/s 564.7349 Ops/s $\textbf{\color{#d91a1a}-6.69\%}$
test_vmap_mlp_speed_decorator[True-False] 1.1380ms 0.5164ms 1.9366 KOps/s 1.9737 KOps/s $\color{#d91a1a}-1.88\%$
test_vmap_mlp_speed_decorator[False-True] 2.3522ms 1.5055ms 664.2141 Ops/s 683.8032 Ops/s $\color{#d91a1a}-2.86\%$
test_vmap_mlp_speed_decorator[False-False] 0.7782ms 0.4027ms 2.4835 KOps/s 2.5360 KOps/s $\color{#d91a1a}-2.07\%$

Copy link

github-actions bot commented Dec 8, 2023

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 127. Improved: $\large\color{#35bf28}20$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.5848ms 12.5891μs 79.4339 KOps/s 80.6879 KOps/s $\color{#d91a1a}-1.55\%$
test_plain_set_stack_nested 0.1406ms 0.1162ms 8.6037 KOps/s 8.3315 KOps/s $\color{#35bf28}+3.27\%$
test_plain_set_nested_inplace 36.4610μs 13.9118μs 71.8813 KOps/s 72.5795 KOps/s $\color{#d91a1a}-0.96\%$
test_plain_set_stack_nested_inplace 0.1715ms 0.1428ms 7.0005 KOps/s 6.9974 KOps/s $\color{#35bf28}+0.04\%$
test_items 18.4000μs 4.6794μs 213.7045 KOps/s 210.3671 KOps/s $\color{#35bf28}+1.59\%$
test_items_nested 0.3787ms 0.3362ms 2.9744 KOps/s 2.9600 KOps/s $\color{#35bf28}+0.49\%$
test_items_nested_locked 0.4425ms 0.3378ms 2.9607 KOps/s 2.9388 KOps/s $\color{#35bf28}+0.75\%$
test_items_nested_leaf 0.2440ms 0.1968ms 5.0802 KOps/s 4.9979 KOps/s $\color{#35bf28}+1.65\%$
test_items_stack_nested 1.5471ms 1.4536ms 687.9424 Ops/s 677.8348 Ops/s $\color{#35bf28}+1.49\%$
test_items_stack_nested_leaf 1.3544ms 1.2850ms 778.2150 Ops/s 774.6327 Ops/s $\color{#35bf28}+0.46\%$
test_items_stack_nested_locked 2.0828ms 0.8037ms 1.2443 KOps/s 1.2170 KOps/s $\color{#35bf28}+2.24\%$
test_keys 24.8700μs 4.6184μs 216.5238 KOps/s 218.9973 KOps/s $\color{#d91a1a}-1.13\%$
test_keys_nested 0.4700ms 90.9887μs 10.9904 KOps/s 11.0912 KOps/s $\color{#d91a1a}-0.91\%$
test_keys_nested_locked 0.1211ms 90.3312μs 11.0704 KOps/s 11.1787 KOps/s $\color{#d91a1a}-0.97\%$
test_keys_nested_leaf 44.3310ms 87.4568μs 11.4342 KOps/s 12.2851 KOps/s $\textbf{\color{#d91a1a}-6.93\%}$
test_keys_stack_nested 1.3380ms 1.2786ms 782.1080 Ops/s 771.0840 Ops/s $\color{#35bf28}+1.43\%$
test_keys_stack_nested_leaf 1.3461ms 1.2666ms 789.5311 Ops/s 794.2577 Ops/s $\color{#d91a1a}-0.60\%$
test_keys_stack_nested_locked 0.6785ms 0.6126ms 1.6323 KOps/s 1.6090 KOps/s $\color{#35bf28}+1.45\%$
test_values 6.3833μs 1.8812μs 531.5721 KOps/s 526.2295 KOps/s $\color{#35bf28}+1.02\%$
test_values_nested 61.1710μs 43.3859μs 23.0490 KOps/s 23.3302 KOps/s $\color{#d91a1a}-1.21\%$
test_values_nested_locked 66.4010μs 45.7183μs 21.8731 KOps/s 22.2047 KOps/s $\color{#d91a1a}-1.49\%$
test_values_nested_leaf 97.3210μs 37.8019μs 26.4537 KOps/s 26.6681 KOps/s $\color{#d91a1a}-0.80\%$
test_values_stack_nested 1.2415ms 1.1236ms 890.0286 Ops/s 891.9404 Ops/s $\color{#d91a1a}-0.21\%$
test_values_stack_nested_leaf 1.1585ms 1.1083ms 902.2467 Ops/s 881.3297 Ops/s $\color{#35bf28}+2.37\%$
test_values_stack_nested_locked 0.5283ms 0.4851ms 2.0615 KOps/s 2.0188 KOps/s $\color{#35bf28}+2.11\%$
test_membership 15.1900μs 1.0449μs 957.0344 KOps/s 1.0659 MOps/s $\textbf{\color{#d91a1a}-10.22\%}$
test_membership_nested 32.3205μs 2.1291μs 469.6779 KOps/s 441.3892 KOps/s $\textbf{\color{#35bf28}+6.41\%}$
test_membership_nested_leaf 10.6700μs 2.1495μs 465.2210 KOps/s 467.0182 KOps/s $\color{#d91a1a}-0.38\%$
test_membership_stacked_nested 33.6900μs 11.0756μs 90.2884 KOps/s 91.5205 KOps/s $\color{#d91a1a}-1.35\%$
test_membership_stacked_nested_leaf 31.4710μs 11.0704μs 90.3306 KOps/s 90.9343 KOps/s $\color{#d91a1a}-0.66\%$
test_membership_nested_last 26.5910μs 4.6591μs 214.6325 KOps/s 216.5347 KOps/s $\color{#d91a1a}-0.88\%$
test_membership_nested_leaf_last 20.2000μs 4.6624μs 214.4841 KOps/s 218.2070 KOps/s $\color{#d91a1a}-1.71\%$
test_membership_stacked_nested_last 0.1993ms 0.1344ms 7.4412 KOps/s 7.4851 KOps/s $\color{#d91a1a}-0.59\%$
test_membership_stacked_nested_leaf_last 26.6910μs 12.9235μs 77.3782 KOps/s 78.2652 KOps/s $\color{#d91a1a}-1.13\%$
test_nested_getleaf 23.4500μs 8.4180μs 118.7925 KOps/s 119.0574 KOps/s $\color{#d91a1a}-0.22\%$
test_nested_get 30.9900μs 7.9844μs 125.2437 KOps/s 125.9427 KOps/s $\color{#d91a1a}-0.56\%$
test_stacked_getleaf 0.6337ms 0.5647ms 1.7709 KOps/s 1.7622 KOps/s $\color{#35bf28}+0.49\%$
test_stacked_get 0.6029ms 0.5316ms 1.8811 KOps/s 1.8745 KOps/s $\color{#35bf28}+0.35\%$
test_nested_getitemleaf 24.6410μs 8.5163μs 117.4217 KOps/s 118.5845 KOps/s $\color{#d91a1a}-0.98\%$
test_nested_getitem 20.9410μs 8.0416μs 124.3541 KOps/s 125.4593 KOps/s $\color{#d91a1a}-0.88\%$
test_stacked_getitemleaf 0.6007ms 0.5637ms 1.7738 KOps/s 1.7710 KOps/s $\color{#35bf28}+0.16\%$
test_stacked_getitem 0.6013ms 0.5320ms 1.8797 KOps/s 1.8805 KOps/s $\color{#d91a1a}-0.05\%$
test_lock_nested 3.2130ms 0.5546ms 1.8033 KOps/s 1.7987 KOps/s $\color{#35bf28}+0.25\%$
test_lock_stack_nested 85.4229ms 7.2639ms 137.6674 Ops/s 134.7914 Ops/s $\color{#35bf28}+2.13\%$
test_unlock_nested 2.3385ms 0.4331ms 2.3092 KOps/s 2.3262 KOps/s $\color{#d91a1a}-0.73\%$
test_unlock_stack_nested 72.5755ms 6.3496ms 157.4913 Ops/s 158.1401 Ops/s $\color{#d91a1a}-0.41\%$
test_flatten_speed 0.2239ms 0.1871ms 5.3440 KOps/s 5.3588 KOps/s $\color{#d91a1a}-0.28\%$
test_unflatten_speed 0.4051ms 0.3662ms 2.7305 KOps/s 2.7853 KOps/s $\color{#d91a1a}-1.97\%$
test_common_ops 1.0962ms 0.5895ms 1.6965 KOps/s 1.6864 KOps/s $\color{#35bf28}+0.60\%$
test_creation 38.4000μs 2.0623μs 484.8900 KOps/s 470.2777 KOps/s $\color{#35bf28}+3.11\%$
test_creation_empty 33.9210μs 6.5992μs 151.5328 KOps/s 150.0116 KOps/s $\color{#35bf28}+1.01\%$
test_creation_nested_1 38.4400μs 9.0220μs 110.8397 KOps/s 110.9634 KOps/s $\color{#d91a1a}-0.11\%$
test_creation_nested_2 28.0290μs 11.7819μs 84.8758 KOps/s 85.7182 KOps/s $\color{#d91a1a}-0.98\%$
test_clone 66.4110μs 14.1607μs 70.6180 KOps/s 72.3499 KOps/s $\color{#d91a1a}-2.39\%$
test_getitem[int] 81.4810μs 12.1379μs 82.3869 KOps/s 80.3910 KOps/s $\color{#35bf28}+2.48\%$
test_getitem[slice_int] 49.5110μs 23.2826μs 42.9504 KOps/s 42.7323 KOps/s $\color{#35bf28}+0.51\%$
test_getitem[range] 76.9110μs 40.1985μs 24.8765 KOps/s 24.4418 KOps/s $\color{#35bf28}+1.78\%$
test_getitem[tuple] 37.8210μs 19.8149μs 50.4670 KOps/s 50.5593 KOps/s $\color{#d91a1a}-0.18\%$
test_getitem[list] 0.2629ms 35.8492μs 27.8946 KOps/s 27.3817 KOps/s $\color{#35bf28}+1.87\%$
test_setitem_dim[int] 41.1210μs 25.3209μs 39.4930 KOps/s 38.3391 KOps/s $\color{#35bf28}+3.01\%$
test_setitem_dim[slice_int] 68.0600μs 44.1620μs 22.6439 KOps/s 21.3172 KOps/s $\textbf{\color{#35bf28}+6.22\%}$
test_setitem_dim[range] 94.6310μs 62.4463μs 16.0138 KOps/s 15.7704 KOps/s $\color{#35bf28}+1.54\%$
test_setitem_dim[tuple] 54.0610μs 36.9023μs 27.0986 KOps/s 24.8165 KOps/s $\textbf{\color{#35bf28}+9.20\%}$
test_setitem 71.5520μs 17.4716μs 57.2359 KOps/s 54.5775 KOps/s $\color{#35bf28}+4.87\%$
test_set 85.1410μs 16.6717μs 59.9817 KOps/s 56.1572 KOps/s $\textbf{\color{#35bf28}+6.81\%}$
test_set_shared 2.9665ms 0.1051ms 9.5108 KOps/s 8.5507 KOps/s $\textbf{\color{#35bf28}+11.23\%}$
test_update 75.8520μs 17.7773μs 56.2514 KOps/s 54.7994 KOps/s $\color{#35bf28}+2.65\%$
test_update_nested 83.6920μs 24.6812μs 40.5166 KOps/s 38.5425 KOps/s $\textbf{\color{#35bf28}+5.12\%}$
test_set_nested 65.6210μs 18.1613μs 55.0620 KOps/s 50.6113 KOps/s $\textbf{\color{#35bf28}+8.79\%}$
test_set_nested_new 74.9420μs 22.8983μs 43.6713 KOps/s 42.6681 KOps/s $\color{#35bf28}+2.35\%$
test_select 95.5820μs 46.3309μs 21.5839 KOps/s 21.1381 KOps/s $\color{#35bf28}+2.11\%$
test_to 73.8920μs 52.1599μs 19.1718 KOps/s 18.8362 KOps/s $\color{#35bf28}+1.78\%$
test_to_nonblocking 67.5710μs 33.8679μs 29.5265 KOps/s 26.3633 KOps/s $\textbf{\color{#35bf28}+12.00\%}$
test_unbind_speed 0.3984ms 0.3637ms 2.7491 KOps/s 2.7660 KOps/s $\color{#d91a1a}-0.61\%$
test_unbind_speed_stack0 65.2007ms 4.1785ms 239.3179 Ops/s 231.1776 Ops/s $\color{#35bf28}+3.52\%$
test_unbind_speed_stack1 1.3606μs 0.5259μs 1.9014 MOps/s 1.9169 MOps/s $\color{#d91a1a}-0.81\%$
test_split 57.6465ms 1.7647ms 566.6614 Ops/s 564.9153 Ops/s $\color{#35bf28}+0.31\%$
test_chunk 56.9329ms 1.7543ms 570.0314 Ops/s 570.0938 Ops/s $\color{#d91a1a}-0.01\%$
test_creation[device0] 0.3895ms 0.3092ms 3.2345 KOps/s 3.1659 KOps/s $\color{#35bf28}+2.17\%$
test_creation[device1] 57.3072ms 0.3377ms 2.9611 KOps/s 3.1525 KOps/s $\textbf{\color{#d91a1a}-6.07\%}$
test_creation_from_tensor 0.5718ms 0.3378ms 2.9602 KOps/s 2.9409 KOps/s $\color{#35bf28}+0.66\%$
test_add_one[memmap_tensor0] 0.2441ms 24.2436μs 41.2481 KOps/s 37.6884 KOps/s $\textbf{\color{#35bf28}+9.44\%}$
test_add_one[memmap_tensor1] 0.2181ms 72.2711μs 13.8368 KOps/s 12.7677 KOps/s $\textbf{\color{#35bf28}+8.37\%}$
test_contiguous[memmap_tensor0] 26.2100μs 5.8123μs 172.0480 KOps/s 166.9405 KOps/s $\color{#35bf28}+3.06\%$
test_contiguous[memmap_tensor1] 42.9300μs 21.2777μs 46.9975 KOps/s 44.5698 KOps/s $\textbf{\color{#35bf28}+5.45\%}$
test_stack[memmap_tensor0] 52.9600μs 19.9257μs 50.1864 KOps/s 46.3744 KOps/s $\textbf{\color{#35bf28}+8.22\%}$
test_stack[memmap_tensor1] 0.1041ms 71.7894μs 13.9296 KOps/s 13.7904 KOps/s $\color{#35bf28}+1.01\%$
test_memmaptd_index 0.2695ms 0.2316ms 4.3187 KOps/s 4.1660 KOps/s $\color{#35bf28}+3.66\%$
test_memmaptd_index_astensor 0.3568ms 0.2925ms 3.4182 KOps/s 3.3679 KOps/s $\color{#35bf28}+1.50\%$
test_memmaptd_index_op 0.6190ms 0.5387ms 1.8564 KOps/s 1.7978 KOps/s $\color{#35bf28}+3.26\%$
test_reshape_pytree 38.6200μs 21.2883μs 46.9741 KOps/s 47.2784 KOps/s $\color{#d91a1a}-0.64\%$
test_reshape_td 54.6610μs 30.8492μs 32.4157 KOps/s 32.4758 KOps/s $\color{#d91a1a}-0.18\%$
test_view_pytree 36.4410μs 20.8566μs 47.9464 KOps/s 48.0673 KOps/s $\color{#d91a1a}-0.25\%$
test_view_td 18.5000μs 4.0502μs 246.9001 KOps/s 243.3121 KOps/s $\color{#35bf28}+1.47\%$
test_unbind_pytree 49.1320μs 25.5128μs 39.1960 KOps/s 36.7648 KOps/s $\textbf{\color{#35bf28}+6.61\%}$
test_unbind_td 84.4420μs 55.8635μs 17.9008 KOps/s 17.8090 KOps/s $\color{#35bf28}+0.52\%$
test_split_pytree 49.6410μs 24.3466μs 41.0735 KOps/s 39.1521 KOps/s $\color{#35bf28}+4.91\%$
test_split_td 64.6310μs 43.2929μs 23.0985 KOps/s 23.0298 KOps/s $\color{#35bf28}+0.30\%$
test_add_pytree 0.1090ms 31.3094μs 31.9393 KOps/s 31.6217 KOps/s $\color{#35bf28}+1.00\%$
test_add_td 68.4710μs 43.0350μs 23.2369 KOps/s 21.5954 KOps/s $\textbf{\color{#35bf28}+7.60\%}$
test_distributed 20.1810μs 5.5876μs 178.9669 KOps/s 177.4701 KOps/s $\color{#35bf28}+0.84\%$
test_tdmodule 30.0310μs 16.2391μs 61.5798 KOps/s 58.0329 KOps/s $\textbf{\color{#35bf28}+6.11\%}$
test_tdmodule_dispatch 0.1988ms 32.9363μs 30.3616 KOps/s 29.0328 KOps/s $\color{#35bf28}+4.58\%$
test_tdseq 34.4510μs 19.7784μs 50.5601 KOps/s 50.4424 KOps/s $\color{#35bf28}+0.23\%$
test_tdseq_dispatch 52.0320μs 35.8218μs 27.9159 KOps/s 26.7866 KOps/s $\color{#35bf28}+4.22\%$
test_instantiation_functorch 1.8171ms 1.6716ms 598.2124 Ops/s 605.7069 Ops/s $\color{#d91a1a}-1.24\%$
test_instantiation_td 1.7199ms 1.1785ms 848.5575 Ops/s 843.3632 Ops/s $\color{#35bf28}+0.62\%$
test_exec_functorch 0.2328ms 0.1575ms 6.3492 KOps/s 6.2625 KOps/s $\color{#35bf28}+1.38\%$
test_exec_functional_call 0.2179ms 0.1550ms 6.4526 KOps/s 6.3569 KOps/s $\color{#35bf28}+1.50\%$
test_exec_td 0.1916ms 0.1450ms 6.8984 KOps/s 6.6850 KOps/s $\color{#35bf28}+3.19\%$
test_exec_td_decorator 68.2404ms 0.2022ms 4.9459 KOps/s 5.4111 KOps/s $\textbf{\color{#d91a1a}-8.60\%}$
test_vmap_mlp_speed[True-True] 1.1788ms 1.0717ms 933.0821 Ops/s 882.3212 Ops/s $\textbf{\color{#35bf28}+5.75\%}$
test_vmap_mlp_speed[True-False] 1.1197ms 0.6134ms 1.6302 KOps/s 1.5764 KOps/s $\color{#35bf28}+3.41\%$
test_vmap_mlp_speed[False-True] 1.0786ms 0.9845ms 1.0157 KOps/s 953.3319 Ops/s $\textbf{\color{#35bf28}+6.54\%}$
test_vmap_mlp_speed[False-False] 0.6008ms 0.5407ms 1.8494 KOps/s 1.7621 KOps/s $\color{#35bf28}+4.96\%$
test_vmap_mlp_speed_decorator[True-True] 2.7330ms 2.0437ms 489.3013 Ops/s 462.7909 Ops/s $\textbf{\color{#35bf28}+5.73\%}$
test_vmap_mlp_speed_decorator[True-False] 1.1208ms 0.6589ms 1.5176 KOps/s 1.4891 KOps/s $\color{#35bf28}+1.91\%$
test_vmap_mlp_speed_decorator[False-True] 2.2040ms 1.7687ms 565.3799 Ops/s 542.3147 Ops/s $\color{#35bf28}+4.25\%$
test_vmap_mlp_speed_decorator[False-False] 0.9487ms 0.5568ms 1.7960 KOps/s 1.7515 KOps/s $\color{#35bf28}+2.54\%$
test_vmap_transformer_speed[True-True] 13.0825ms 12.6222ms 79.2252 Ops/s 77.5452 Ops/s $\color{#35bf28}+2.17\%$
test_vmap_transformer_speed[True-False] 8.4984ms 8.2422ms 121.3266 Ops/s 119.5933 Ops/s $\color{#35bf28}+1.45\%$
test_vmap_transformer_speed[False-True] 12.8390ms 12.4912ms 80.0561 Ops/s 79.1620 Ops/s $\color{#35bf28}+1.13\%$
test_vmap_transformer_speed[False-False] 8.2452ms 8.1472ms 122.7409 Ops/s 121.0662 Ops/s $\color{#35bf28}+1.38\%$
test_vmap_transformer_speed_decorator[True-True] 65.6556ms 64.5458ms 15.4929 Ops/s 13.8374 Ops/s $\textbf{\color{#35bf28}+11.96\%}$
test_vmap_transformer_speed_decorator[True-False] 22.1164ms 19.9104ms 50.2250 Ops/s 49.1953 Ops/s $\color{#35bf28}+2.09\%$
test_vmap_transformer_speed_decorator[False-True] 60.0099ms 58.6467ms 17.0512 Ops/s 16.0377 Ops/s $\textbf{\color{#35bf28}+6.32\%}$
test_vmap_transformer_speed_decorator[False-False] 21.6508ms 19.5307ms 51.2016 Ops/s 50.3242 Ops/s $\color{#35bf28}+1.74\%$

@vmoens vmoens added the documentation Improvements or additions to documentation label Dec 12, 2023
@vmoens vmoens marked this pull request as ready for review January 4, 2024 15:55
@vmoens vmoens merged commit 3262b39 into main Jan 4, 2024
43 of 45 checks passed
@vmoens vmoens deleted the doc-revamp branch January 4, 2024 17:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants