[BugFix] _FileHandler for windows #577

Merged: 2 commits, Nov 24, 2023


@vmoens (Contributor) commented Nov 24, 2023

No description provided.

@facebook-github-bot added the CLA Signed label Nov 24, 2023
@vmoens added the bug label Nov 24, 2023
@vmoens marked this pull request as ready for review November 24, 2023 14:40
@vmoens merged commit 91ffde1 into main Nov 24, 2023
21 of 33 checks passed
@vmoens deleted the handler-win branch November 24, 2023 14:40

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 113. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}7$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 35.5760μs 16.0001μs 62.4995 KOps/s 63.0784 KOps/s $\color{#d91a1a}-0.92\%$
test_plain_set_stack_nested 0.2726ms 0.1423ms 7.0293 KOps/s 7.0213 KOps/s $\color{#35bf28}+0.11\%$
test_plain_set_nested_inplace 80.0820μs 19.1475μs 52.2260 KOps/s 52.1863 KOps/s $\color{#35bf28}+0.08\%$
test_plain_set_stack_nested_inplace 0.2332ms 0.1744ms 5.7341 KOps/s 5.8182 KOps/s $\color{#d91a1a}-1.45\%$
test_items 20.1380μs 2.4214μs 412.9905 KOps/s 411.1169 KOps/s $\color{#35bf28}+0.46\%$
test_items_nested 0.3336ms 0.2712ms 3.6877 KOps/s 3.7322 KOps/s $\color{#d91a1a}-1.19\%$
test_items_nested_locked 1.4037ms 0.2727ms 3.6674 KOps/s 3.7252 KOps/s $\color{#d91a1a}-1.55\%$
test_items_nested_leaf 0.3247ms 0.1673ms 5.9778 KOps/s 6.0698 KOps/s $\color{#d91a1a}-1.52\%$
test_items_stack_nested 1.5911ms 1.4767ms 677.1792 Ops/s 666.4390 Ops/s $\color{#35bf28}+1.61\%$
test_items_stack_nested_leaf 1.7937ms 1.3541ms 738.5072 Ops/s 733.2015 Ops/s $\color{#35bf28}+0.72\%$
test_items_stack_nested_locked 1.3705ms 0.7682ms 1.3017 KOps/s 1.2904 KOps/s $\color{#35bf28}+0.88\%$
test_keys 40.6260μs 3.8168μs 262.0028 KOps/s 254.4512 KOps/s $\color{#35bf28}+2.97\%$
test_keys_nested 1.4164ms 0.1400ms 7.1450 KOps/s 6.7261 KOps/s $\textbf{\color{#35bf28}+6.23\%}$
test_keys_nested_locked 0.2992ms 0.1401ms 7.1352 KOps/s 7.0777 KOps/s $\color{#35bf28}+0.81\%$
test_keys_nested_leaf 0.3403ms 0.1389ms 7.1977 KOps/s 7.1145 KOps/s $\color{#35bf28}+1.17\%$
test_keys_stack_nested 1.5633ms 1.4061ms 711.2028 Ops/s 707.1758 Ops/s $\color{#35bf28}+0.57\%$
test_keys_stack_nested_leaf 2.2211ms 1.4230ms 702.7485 Ops/s 710.0791 Ops/s $\color{#d91a1a}-1.03\%$
test_keys_stack_nested_locked 4.6358ms 0.6901ms 1.4490 KOps/s 1.4656 KOps/s $\color{#d91a1a}-1.13\%$
test_values 13.5303μs 1.1458μs 872.7480 KOps/s 865.3855 KOps/s $\color{#35bf28}+0.85\%$
test_values_nested 97.6930μs 50.3332μs 19.8676 KOps/s 20.5812 KOps/s $\color{#d91a1a}-3.47\%$
test_values_nested_locked 0.1309ms 50.6012μs 19.7624 KOps/s 19.7289 KOps/s $\color{#35bf28}+0.17\%$
test_values_nested_leaf 76.1630μs 45.4409μs 22.0066 KOps/s 22.9096 KOps/s $\color{#d91a1a}-3.94\%$
test_values_stack_nested 1.3807ms 1.1899ms 840.3835 Ops/s 808.8832 Ops/s $\color{#35bf28}+3.89\%$
test_values_stack_nested_leaf 1.8783ms 1.1796ms 847.7242 Ops/s 843.3100 Ops/s $\color{#35bf28}+0.52\%$
test_values_stack_nested_locked 0.6165ms 0.5119ms 1.9537 KOps/s 1.9105 KOps/s $\color{#35bf28}+2.26\%$
test_membership 15.6090μs 1.3955μs 716.6063 KOps/s 737.6459 KOps/s $\color{#d91a1a}-2.85\%$
test_membership_nested 40.4560μs 2.8698μs 348.4557 KOps/s 359.4427 KOps/s $\color{#d91a1a}-3.06\%$
test_membership_nested_leaf 20.2080μs 2.8459μs 351.3836 KOps/s 358.0080 KOps/s $\color{#d91a1a}-1.85\%$
test_membership_stacked_nested 29.4250μs 11.5927μs 86.2614 KOps/s 86.1542 KOps/s $\color{#35bf28}+0.12\%$
test_membership_stacked_nested_leaf 41.9180μs 11.8594μs 84.3212 KOps/s 85.7788 KOps/s $\color{#d91a1a}-1.70\%$
test_membership_nested_last 51.6370μs 5.9708μs 167.4816 KOps/s 168.4744 KOps/s $\color{#d91a1a}-0.59\%$
test_membership_nested_leaf_last 37.4100μs 5.9852μs 167.0796 KOps/s 170.1675 KOps/s $\color{#d91a1a}-1.81\%$
test_membership_stacked_nested_last 0.3246ms 0.1699ms 5.8866 KOps/s 6.0301 KOps/s $\color{#d91a1a}-2.38\%$
test_membership_stacked_nested_leaf_last 46.3270μs 13.7746μs 72.5975 KOps/s 71.1489 KOps/s $\color{#35bf28}+2.04\%$
test_nested_getleaf 54.7220μs 10.6802μs 93.6310 KOps/s 95.1938 KOps/s $\color{#d91a1a}-1.64\%$
test_nested_get 52.7490μs 10.0681μs 99.3237 KOps/s 100.4868 KOps/s $\color{#d91a1a}-1.16\%$
test_stacked_getleaf 0.8589ms 0.6379ms 1.5675 KOps/s 1.5595 KOps/s $\color{#35bf28}+0.52\%$
test_stacked_get 0.7431ms 0.5996ms 1.6678 KOps/s 1.6215 KOps/s $\color{#35bf28}+2.86\%$
test_nested_getitemleaf 38.9930μs 10.6355μs 94.0249 KOps/s 95.2917 KOps/s $\color{#d91a1a}-1.33\%$
test_nested_getitem 47.7990μs 10.1046μs 98.9644 KOps/s 100.6428 KOps/s $\color{#d91a1a}-1.67\%$
test_stacked_getitemleaf 1.0765ms 0.6371ms 1.5697 KOps/s 1.5562 KOps/s $\color{#35bf28}+0.87\%$
test_stacked_getitem 0.7028ms 0.6063ms 1.6493 KOps/s 1.6269 KOps/s $\color{#35bf28}+1.37\%$
test_lock_nested 56.2351ms 0.5477ms 1.8260 KOps/s 2.0252 KOps/s $\textbf{\color{#d91a1a}-9.84\%}$
test_lock_stack_nested 83.7572ms 8.5243ms 117.3114 Ops/s 118.7394 Ops/s $\color{#d91a1a}-1.20\%$
test_unlock_nested 60.2694ms 0.5088ms 1.9656 KOps/s 1.9221 KOps/s $\color{#35bf28}+2.26\%$
test_unlock_stack_nested 83.7100ms 8.2290ms 121.5215 Ops/s 204.8105 Ops/s $\textbf{\color{#d91a1a}-40.67\%}$
test_flatten_speed 0.5443ms 0.2671ms 3.7434 KOps/s 3.7055 KOps/s $\color{#35bf28}+1.02\%$
test_unflatten_speed 0.9807ms 0.4641ms 2.1549 KOps/s 2.1403 KOps/s $\color{#35bf28}+0.68\%$
test_common_ops 4.9053ms 0.6898ms 1.4497 KOps/s 1.4750 KOps/s $\color{#d91a1a}-1.71\%$
test_creation 92.1530μs 2.3930μs 417.8922 KOps/s 422.4231 KOps/s $\color{#d91a1a}-1.07\%$
test_creation_empty 34.0840μs 8.5982μs 116.3028 KOps/s 116.9233 KOps/s $\color{#d91a1a}-0.53\%$
test_creation_nested_1 61.8460μs 12.0658μs 82.8789 KOps/s 84.1128 KOps/s $\color{#d91a1a}-1.47\%$
test_creation_nested_2 36.8090μs 15.5240μs 64.4164 KOps/s 64.9024 KOps/s $\color{#d91a1a}-0.75\%$
test_clone 0.1010ms 13.2648μs 75.3873 KOps/s 75.0487 KOps/s $\color{#35bf28}+0.45\%$
test_getitem[int] 47.9000μs 12.9527μs 77.2038 KOps/s 77.0703 KOps/s $\color{#35bf28}+0.17\%$
test_getitem[slice_int] 79.0380μs 24.5780μs 40.6867 KOps/s 39.4681 KOps/s $\color{#35bf28}+3.09\%$
test_getitem[range] 0.1022ms 45.0674μs 22.1890 KOps/s 22.2268 KOps/s $\color{#d91a1a}-0.17\%$
test_getitem[tuple] 61.7150μs 20.2325μs 49.4254 KOps/s 49.1292 KOps/s $\color{#35bf28}+0.60\%$
test_getitem[list] 0.3931ms 38.8860μs 25.7162 KOps/s 25.5961 KOps/s $\color{#35bf28}+0.47\%$
test_setitem_dim[int] 51.2060μs 29.4933μs 33.9060 KOps/s 34.0884 KOps/s $\color{#d91a1a}-0.54\%$
test_setitem_dim[slice_int] 85.5310μs 53.7361μs 18.6095 KOps/s 18.1461 KOps/s $\color{#35bf28}+2.55\%$
test_setitem_dim[range] 0.1240ms 72.4302μs 13.8064 KOps/s 13.3732 KOps/s $\color{#35bf28}+3.24\%$
test_setitem_dim[tuple] 0.1044ms 42.7533μs 23.3900 KOps/s 23.2144 KOps/s $\color{#35bf28}+0.76\%$
test_setitem 0.1772ms 18.4271μs 54.2679 KOps/s 54.1021 KOps/s $\color{#35bf28}+0.31\%$
test_set 0.1514ms 17.8021μs 56.1731 KOps/s 55.5016 KOps/s $\color{#35bf28}+1.21\%$
test_set_shared 2.7105ms 0.1448ms 6.9071 KOps/s 7.2248 KOps/s $\color{#d91a1a}-4.40\%$
test_update 0.2253ms 19.0879μs 52.3891 KOps/s 50.7779 KOps/s $\color{#35bf28}+3.17\%$
test_update_nested 0.1802ms 26.2104μs 38.1527 KOps/s 34.1392 KOps/s $\textbf{\color{#35bf28}+11.76\%}$
test_set_nested 0.1455ms 19.8555μs 50.3640 KOps/s 50.6069 KOps/s $\color{#d91a1a}-0.48\%$
test_set_nested_new 0.2020ms 25.2947μs 39.5339 KOps/s 39.6186 KOps/s $\color{#d91a1a}-0.21\%$
test_select 0.2120ms 50.6009μs 19.7625 KOps/s 19.4764 KOps/s $\color{#35bf28}+1.47\%$
test_unbind_speed 0.4763ms 0.3737ms 2.6759 KOps/s 2.6754 KOps/s $\color{#35bf28}+0.02\%$
test_unbind_speed_stack0 68.8004ms 5.3100ms 188.3231 Ops/s 179.1466 Ops/s $\textbf{\color{#35bf28}+5.12\%}$
test_unbind_speed_stack1 11.2310μs 0.8248μs 1.2124 MOps/s 1.5610 MOps/s $\textbf{\color{#d91a1a}-22.33\%}$
test_split 58.1379ms 1.8082ms 553.0296 Ops/s 591.7021 Ops/s $\textbf{\color{#d91a1a}-6.54\%}$
test_chunk 61.2999ms 1.7691ms 565.2642 Ops/s 557.1070 Ops/s $\color{#35bf28}+1.46\%$
test_creation[device0] 4.0936ms 0.3010ms 3.3222 KOps/s 2.9834 KOps/s $\textbf{\color{#35bf28}+11.36\%}$
test_creation_from_tensor 3.4801ms 0.3327ms 3.0057 KOps/s 2.9671 KOps/s $\color{#35bf28}+1.30\%$
test_add_one[memmap_tensor0] 94.3270μs 26.3964μs 37.8839 KOps/s 38.6621 KOps/s $\color{#d91a1a}-2.01\%$
test_contiguous[memmap_tensor0] 28.6830μs 5.8737μs 170.2493 KOps/s 168.8833 KOps/s $\color{#35bf28}+0.81\%$
test_stack[memmap_tensor0] 0.1057ms 19.5791μs 51.0750 KOps/s 50.3293 KOps/s $\color{#35bf28}+1.48\%$
test_memmaptd_index 0.7485ms 0.3948ms 2.5330 KOps/s 2.4046 KOps/s $\textbf{\color{#35bf28}+5.34\%}$
test_memmaptd_index_astensor 0.6124ms 0.4595ms 2.1763 KOps/s 2.1199 KOps/s $\color{#35bf28}+2.66\%$
test_memmaptd_index_op 1.4043ms 0.7272ms 1.3751 KOps/s 1.4001 KOps/s $\color{#d91a1a}-1.79\%$
test_reshape_pytree 50.4940μs 23.9614μs 41.7338 KOps/s 42.0702 KOps/s $\color{#d91a1a}-0.80\%$
test_reshape_td 69.1000μs 31.8577μs 31.3896 KOps/s 31.7851 KOps/s $\color{#d91a1a}-1.24\%$
test_view_pytree 78.0090μs 23.4843μs 42.5817 KOps/s 43.6792 KOps/s $\color{#d91a1a}-2.51\%$
test_view_td 23.4950μs 5.0653μs 197.4207 KOps/s 202.0504 KOps/s $\color{#d91a1a}-2.29\%$
test_unbind_pytree 55.9150μs 26.5780μs 37.6251 KOps/s 38.3744 KOps/s $\color{#d91a1a}-1.95\%$
test_unbind_td 0.1913ms 59.2218μs 16.8857 KOps/s 17.0267 KOps/s $\color{#d91a1a}-0.83\%$
test_split_pytree 67.6370μs 26.3671μs 37.9260 KOps/s 38.4846 KOps/s $\color{#d91a1a}-1.45\%$
test_split_td 92.2230μs 46.0078μs 21.7354 KOps/s 21.4187 KOps/s $\color{#35bf28}+1.48\%$
test_add_pytree 0.1008ms 32.5111μs 30.7588 KOps/s 31.3056 KOps/s $\color{#d91a1a}-1.75\%$
test_add_td 89.6880μs 45.6963μs 21.8836 KOps/s 22.1959 KOps/s $\color{#d91a1a}-1.41\%$
test_distributed 33.4630μs 5.8972μs 169.5730 KOps/s 168.6899 KOps/s $\color{#35bf28}+0.52\%$
test_tdmodule 0.1533ms 21.1195μs 47.3496 KOps/s 47.1233 KOps/s $\color{#35bf28}+0.48\%$
test_tdmodule_dispatch 0.1698ms 39.4029μs 25.3789 KOps/s 24.8649 KOps/s $\color{#35bf28}+2.07\%$
test_tdseq 0.1124ms 23.6508μs 42.2819 KOps/s 41.9675 KOps/s $\color{#35bf28}+0.75\%$
test_tdseq_dispatch 0.1311ms 42.2019μs 23.6956 KOps/s 23.2368 KOps/s $\color{#35bf28}+1.97\%$
test_instantiation_functorch 1.5148ms 1.3163ms 759.6968 Ops/s 759.3679 Ops/s $\color{#35bf28}+0.04\%$
test_instantiation_td 1.6522ms 1.0310ms 969.9756 Ops/s 975.9901 Ops/s $\color{#d91a1a}-0.62\%$
test_exec_functorch 0.2318ms 0.1590ms 6.2880 KOps/s 6.2175 KOps/s $\color{#35bf28}+1.13\%$
test_exec_functional_call 0.2651ms 0.1476ms 6.7762 KOps/s 6.7219 KOps/s $\color{#35bf28}+0.81\%$
test_exec_td 0.2688ms 0.1445ms 6.9197 KOps/s 6.8103 KOps/s $\color{#35bf28}+1.61\%$
test_exec_td_decorator 0.9984ms 0.2245ms 4.4549 KOps/s 5.4794 KOps/s $\textbf{\color{#d91a1a}-18.70\%}$
test_vmap_mlp_speed[True-True] 1.1636ms 0.9143ms 1.0937 KOps/s 1.1055 KOps/s $\color{#d91a1a}-1.06\%$
test_vmap_mlp_speed[True-False] 0.5826ms 0.4767ms 2.0980 KOps/s 2.0968 KOps/s $\color{#35bf28}+0.06\%$
test_vmap_mlp_speed[False-True] 1.4811ms 0.7964ms 1.2556 KOps/s 1.2717 KOps/s $\color{#d91a1a}-1.26\%$
test_vmap_mlp_speed[False-False] 0.6156ms 0.3936ms 2.5405 KOps/s 2.5513 KOps/s $\color{#d91a1a}-0.42\%$
test_vmap_mlp_speed_decorator[True-True] 2.5824ms 1.6117ms 620.4601 Ops/s 558.6627 Ops/s $\textbf{\color{#35bf28}+11.06\%}$
test_vmap_mlp_speed_decorator[True-False] 1.2978ms 0.5702ms 1.7538 KOps/s 1.8730 KOps/s $\textbf{\color{#d91a1a}-6.36\%}$
test_vmap_mlp_speed_decorator[False-True] 2.0388ms 1.3897ms 719.5545 Ops/s 667.0861 Ops/s $\textbf{\color{#35bf28}+7.87\%}$
test_vmap_mlp_speed_decorator[False-False] 0.9253ms 0.4370ms 2.2884 KOps/s 2.4551 KOps/s $\textbf{\color{#d91a1a}-6.79\%}$
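
For readers checking the table by hand, the Change column appears to be the relative difference between the Ops column and the Ops on Repo HEAD column. The sketch below reproduces three CPU rows under that reading. The helper names (`relative_change`, `classify`) and the 5% cutoff for counting a benchmark as improved or worsened are assumptions inferred from which rows are shown in bold, not something the benchmark bot states.

```python
def relative_change(ops: float, ops_head: float) -> float:
    """Percent change in throughput relative to the repo HEAD baseline."""
    return (ops - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption, not a documented value."""
    if change_pct >= threshold:
        return "improved"
    if change_pct <= -threshold:
        return "worsened"
    return "unchanged"


# (Ops, Ops on Repo HEAD) pairs copied from the CPU table; units cancel out.
rows = {
    "test_plain_set_nested": (62.4995, 63.0784),
    "test_keys_nested": (7.1450, 6.7261),
    "test_unlock_stack_nested": (121.5215, 204.8105),
}

for name, (ops, ops_head) in rows.items():
    pct = relative_change(ops, ops_head)
    print(f"{name}: {pct:+.2f}% ({classify(pct)})")
```

Under these assumptions the three rows come out at -0.92%, +6.23%, and -40.67%, matching the values reported in the table.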


$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 127. Improved: $\large\color{#35bf28}16$. Worsened: $\large\color{#d91a1a}4$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.4463ms 12.4431μs 80.3656 KOps/s 80.2606 KOps/s $\color{#35bf28}+0.13\%$
test_plain_set_stack_nested 0.1392ms 0.1135ms 8.8073 KOps/s 8.3707 KOps/s $\textbf{\color{#35bf28}+5.22\%}$
test_plain_set_nested_inplace 35.4910μs 14.8663μs 67.2663 KOps/s 67.0650 KOps/s $\color{#35bf28}+0.30\%$
test_plain_set_stack_nested_inplace 0.2175ms 0.1410ms 7.0909 KOps/s 7.0748 KOps/s $\color{#35bf28}+0.23\%$
test_items 62.6210μs 4.7335μs 211.2621 KOps/s 216.5692 KOps/s $\color{#d91a1a}-2.45\%$
test_items_nested 0.3860ms 0.3445ms 2.9031 KOps/s 2.9803 KOps/s $\color{#d91a1a}-2.59\%$
test_items_nested_locked 0.3839ms 0.3431ms 2.9150 KOps/s 2.9696 KOps/s $\color{#d91a1a}-1.84\%$
test_items_nested_leaf 0.3463ms 0.2008ms 4.9811 KOps/s 5.0206 KOps/s $\color{#d91a1a}-0.79\%$
test_items_stack_nested 1.6936ms 1.4842ms 673.7508 Ops/s 666.5528 Ops/s $\color{#35bf28}+1.08\%$
test_items_stack_nested_leaf 1.4443ms 1.3041ms 766.8043 Ops/s 753.3470 Ops/s $\color{#35bf28}+1.79\%$
test_items_stack_nested_locked 1.7832ms 0.8221ms 1.2165 KOps/s 1.2253 KOps/s $\color{#d91a1a}-0.72\%$
test_keys 23.0800μs 4.5773μs 218.4676 KOps/s 218.5407 KOps/s $\color{#d91a1a}-0.03\%$
test_keys_nested 0.5235ms 90.0893μs 11.1001 KOps/s 11.0755 KOps/s $\color{#35bf28}+0.22\%$
test_keys_nested_locked 0.1629ms 89.1963μs 11.2112 KOps/s 11.1375 KOps/s $\color{#35bf28}+0.66\%$
test_keys_nested_leaf 42.0704ms 86.4221μs 11.5711 KOps/s 12.1857 KOps/s $\textbf{\color{#d91a1a}-5.04\%}$
test_keys_stack_nested 1.4910ms 1.2930ms 773.3902 Ops/s 764.8430 Ops/s $\color{#35bf28}+1.12\%$
test_keys_stack_nested_leaf 1.4706ms 1.2919ms 774.0630 Ops/s 777.6347 Ops/s $\color{#d91a1a}-0.46\%$
test_keys_stack_nested_locked 0.8049ms 0.6122ms 1.6336 KOps/s 1.6092 KOps/s $\color{#35bf28}+1.52\%$
test_values 62.6210μs 1.8818μs 531.3952 KOps/s 529.0342 KOps/s $\color{#35bf28}+0.45\%$
test_values_nested 0.2349ms 42.7495μs 23.3921 KOps/s 23.2052 KOps/s $\color{#35bf28}+0.81\%$
test_values_nested_locked 0.2083ms 42.6310μs 23.4571 KOps/s 23.0755 KOps/s $\color{#35bf28}+1.65\%$
test_values_nested_leaf 0.1031ms 37.3278μs 26.7897 KOps/s 26.6944 KOps/s $\color{#35bf28}+0.36\%$
test_values_stack_nested 1.2747ms 1.1211ms 891.9512 Ops/s 873.8645 Ops/s $\color{#35bf28}+2.07\%$
test_values_stack_nested_leaf 1.2019ms 1.1156ms 896.4047 Ops/s 869.9905 Ops/s $\color{#35bf28}+3.04\%$
test_values_stack_nested_locked 0.6487ms 0.4911ms 2.0364 KOps/s 1.9700 KOps/s $\color{#35bf28}+3.37\%$
test_membership 5.0960μs 0.9458μs 1.0574 MOps/s 1.0474 MOps/s $\color{#35bf28}+0.95\%$
test_membership_nested 15.8410μs 2.0780μs 481.2262 KOps/s 455.4685 KOps/s $\textbf{\color{#35bf28}+5.66\%}$
test_membership_nested_leaf 84.2410μs 2.0948μs 477.3717 KOps/s 473.5382 KOps/s $\color{#35bf28}+0.81\%$
test_membership_stacked_nested 0.1822ms 10.9343μs 91.4551 KOps/s 93.2945 KOps/s $\color{#d91a1a}-1.97\%$
test_membership_stacked_nested_leaf 0.1897ms 10.8700μs 91.9964 KOps/s 93.7599 KOps/s $\color{#d91a1a}-1.88\%$
test_membership_nested_last 0.1625ms 4.5713μs 218.7555 KOps/s 218.2462 KOps/s $\color{#35bf28}+0.23\%$
test_membership_nested_leaf_last 33.6710μs 4.5697μs 218.8310 KOps/s 217.5911 KOps/s $\color{#35bf28}+0.57\%$
test_membership_stacked_nested_last 0.1787ms 0.1342ms 7.4505 KOps/s 7.5305 KOps/s $\color{#d91a1a}-1.06\%$
test_membership_stacked_nested_leaf_last 28.3210μs 12.7743μs 78.2820 KOps/s 79.4346 KOps/s $\color{#d91a1a}-1.45\%$
test_nested_getleaf 21.4610μs 8.3664μs 119.5251 KOps/s 119.3838 KOps/s $\color{#35bf28}+0.12\%$
test_nested_get 28.3310μs 7.8731μs 127.0147 KOps/s 125.6983 KOps/s $\color{#35bf28}+1.05\%$
test_stacked_getleaf 0.7725ms 0.5620ms 1.7793 KOps/s 1.7186 KOps/s $\color{#35bf28}+3.53\%$
test_stacked_get 0.6163ms 0.5309ms 1.8836 KOps/s 1.8307 KOps/s $\color{#35bf28}+2.89\%$
test_nested_getitemleaf 0.1933ms 8.4102μs 118.9036 KOps/s 118.7973 KOps/s $\color{#35bf28}+0.09\%$
test_nested_getitem 29.6500μs 7.9613μs 125.6076 KOps/s 125.3357 KOps/s $\color{#35bf28}+0.22\%$
test_stacked_getitemleaf 0.7351ms 0.5595ms 1.7874 KOps/s 1.7577 KOps/s $\color{#35bf28}+1.69\%$
test_stacked_getitem 0.7326ms 0.5358ms 1.8663 KOps/s 1.8486 KOps/s $\color{#35bf28}+0.96\%$
test_lock_nested 4.3647ms 0.4575ms 2.1857 KOps/s 2.1624 KOps/s $\color{#35bf28}+1.08\%$
test_lock_stack_nested 70.2837ms 6.5561ms 152.5298 Ops/s 148.9118 Ops/s $\color{#35bf28}+2.43\%$
test_unlock_nested 1.2839ms 0.4352ms 2.2979 KOps/s 1.9880 KOps/s $\textbf{\color{#35bf28}+15.59\%}$
test_unlock_stack_nested 66.1927ms 7.2916ms 137.1442 Ops/s 135.3404 Ops/s $\color{#35bf28}+1.33\%$
test_flatten_speed 0.5265ms 0.1863ms 5.3684 KOps/s 5.3407 KOps/s $\color{#35bf28}+0.52\%$
test_unflatten_speed 0.4708ms 0.3578ms 2.7946 KOps/s 2.7591 KOps/s $\color{#35bf28}+1.29\%$
test_common_ops 1.0535ms 0.5933ms 1.6855 KOps/s 1.6956 KOps/s $\color{#d91a1a}-0.60\%$
test_creation 17.9300μs 1.9077μs 524.1878 KOps/s 506.2153 KOps/s $\color{#35bf28}+3.55\%$
test_creation_empty 26.0700μs 6.5482μs 152.7140 KOps/s 153.8967 KOps/s $\color{#d91a1a}-0.77\%$
test_creation_nested_1 41.4410μs 8.9099μs 112.2342 KOps/s 112.9953 KOps/s $\color{#d91a1a}-0.67\%$
test_creation_nested_2 28.0510μs 11.4848μs 87.0719 KOps/s 87.5949 KOps/s $\color{#d91a1a}-0.60\%$
test_clone 0.1569ms 14.2554μs 70.1487 KOps/s 71.6957 KOps/s $\color{#d91a1a}-2.16\%$
test_getitem[int] 0.1473ms 11.8971μs 84.0542 KOps/s 83.9685 KOps/s $\color{#35bf28}+0.10\%$
test_getitem[slice_int] 45.2710μs 23.1719μs 43.1558 KOps/s 44.1459 KOps/s $\color{#d91a1a}-2.24\%$
test_getitem[range] 0.1773ms 38.4470μs 26.0099 KOps/s 24.1202 KOps/s $\textbf{\color{#35bf28}+7.83\%}$
test_getitem[tuple] 0.1104ms 19.9785μs 50.0538 KOps/s 50.7234 KOps/s $\color{#d91a1a}-1.32\%$
test_getitem[list] 0.3636ms 35.0407μs 28.5382 KOps/s 28.7063 KOps/s $\color{#d91a1a}-0.59\%$
test_setitem_dim[int] 46.0210μs 25.3329μs 39.4744 KOps/s 38.5872 KOps/s $\color{#35bf28}+2.30\%$
test_setitem_dim[slice_int] 0.1313ms 45.1381μs 22.1542 KOps/s 21.1882 KOps/s $\color{#35bf28}+4.56\%$
test_setitem_dim[range] 83.5810μs 61.2445μs 16.3280 KOps/s 15.2621 KOps/s $\textbf{\color{#35bf28}+6.98\%}$
test_setitem_dim[tuple] 59.0810μs 38.2864μs 26.1189 KOps/s 25.4011 KOps/s $\color{#35bf28}+2.83\%$
test_setitem 0.1899ms 17.9427μs 55.7331 KOps/s 55.4297 KOps/s $\color{#35bf28}+0.55\%$
test_set 0.1233ms 17.3321μs 57.6963 KOps/s 58.2827 KOps/s $\color{#d91a1a}-1.01\%$
test_set_shared 2.9146ms 0.1019ms 9.8163 KOps/s 9.3015 KOps/s $\textbf{\color{#35bf28}+5.54\%}$
test_update 0.2029ms 18.4483μs 54.2055 KOps/s 54.6678 KOps/s $\color{#d91a1a}-0.85\%$
test_update_nested 0.2189ms 24.9939μs 40.0097 KOps/s 40.4391 KOps/s $\color{#d91a1a}-1.06\%$
test_set_nested 0.1166ms 18.6259μs 53.6887 KOps/s 51.5955 KOps/s $\color{#35bf28}+4.06\%$
test_set_nested_new 0.1222ms 22.6342μs 44.1809 KOps/s 41.8175 KOps/s $\textbf{\color{#35bf28}+5.65\%}$
test_select 0.1736ms 44.2424μs 22.6028 KOps/s 20.9039 KOps/s $\textbf{\color{#35bf28}+8.13\%}$
test_to 71.9830μs 50.7699μs 19.6967 KOps/s 19.4856 KOps/s $\color{#35bf28}+1.08\%$
test_to_nonblocking 0.1852ms 33.9923μs 29.4184 KOps/s 29.7223 KOps/s $\color{#d91a1a}-1.02\%$
test_unbind_speed 0.4629ms 0.3484ms 2.8703 KOps/s 2.8764 KOps/s $\color{#d91a1a}-0.21\%$
test_unbind_speed_stack0 60.8099ms 5.1128ms 195.5878 Ops/s 193.5987 Ops/s $\color{#35bf28}+1.03\%$
test_unbind_speed_stack1 1.9070μs 0.5240μs 1.9083 MOps/s 1.9067 MOps/s $\color{#35bf28}+0.08\%$
test_split 54.4191ms 1.7984ms 556.0382 Ops/s 557.8280 Ops/s $\color{#d91a1a}-0.32\%$
test_chunk 54.0103ms 1.7860ms 559.8984 Ops/s 563.1535 Ops/s $\color{#d91a1a}-0.58\%$
test_creation[device0] 0.4474ms 0.3065ms 3.2623 KOps/s 3.2607 KOps/s $\color{#35bf28}+0.05\%$
test_creation[device1] 0.6569ms 0.3190ms 3.1351 KOps/s 3.2216 KOps/s $\color{#d91a1a}-2.69\%$
test_creation_from_tensor 0.5785ms 0.3353ms 2.9825 KOps/s 2.9838 KOps/s $\color{#d91a1a}-0.04\%$
test_add_one[memmap_tensor0] 71.1420μs 24.0345μs 41.6069 KOps/s 42.5508 KOps/s $\color{#d91a1a}-2.22\%$
test_add_one[memmap_tensor1] 0.2096ms 73.1750μs 13.6659 KOps/s 13.8239 KOps/s $\color{#d91a1a}-1.14\%$
test_contiguous[memmap_tensor0] 0.1329ms 5.7846μs 172.8731 KOps/s 173.0882 KOps/s $\color{#d91a1a}-0.12\%$
test_contiguous[memmap_tensor1] 0.2060ms 21.6539μs 46.1811 KOps/s 46.1244 KOps/s $\color{#35bf28}+0.12\%$
test_stack[memmap_tensor0] 50.5910μs 19.8331μs 50.4207 KOps/s 47.0390 KOps/s $\textbf{\color{#35bf28}+7.19\%}$
test_stack[memmap_tensor1] 0.2130ms 73.9420μs 13.5241 KOps/s 13.6234 KOps/s $\color{#d91a1a}-0.73\%$
test_memmaptd_index 0.4590ms 0.4141ms 2.4152 KOps/s 2.3387 KOps/s $\color{#35bf28}+3.27\%$
test_memmaptd_index_astensor 0.5812ms 0.4721ms 2.1181 KOps/s 2.0598 KOps/s $\color{#35bf28}+2.83\%$
test_memmaptd_index_op 0.8587ms 0.7290ms 1.3717 KOps/s 1.3507 KOps/s $\color{#35bf28}+1.56\%$
test_reshape_pytree 43.6400μs 20.8279μs 48.0125 KOps/s 47.7037 KOps/s $\color{#35bf28}+0.65\%$
test_reshape_td 48.8610μs 28.6188μs 34.9421 KOps/s 34.1756 KOps/s $\color{#35bf28}+2.24\%$
test_view_pytree 46.9210μs 20.8062μs 48.0625 KOps/s 48.3572 KOps/s $\color{#d91a1a}-0.61\%$
test_view_td 23.2600μs 4.0779μs 245.2242 KOps/s 247.9009 KOps/s $\color{#d91a1a}-1.08\%$
test_unbind_pytree 48.9100μs 26.0002μs 38.4612 KOps/s 38.4710 KOps/s $\color{#d91a1a}-0.03\%$
test_unbind_td 87.7810μs 55.0858μs 18.1535 KOps/s 18.1680 KOps/s $\color{#d91a1a}-0.08\%$
test_split_pytree 0.7391ms 24.4786μs 40.8521 KOps/s 41.8977 KOps/s $\color{#d91a1a}-2.50\%$
test_split_td 0.1388ms 43.7536μs 22.8552 KOps/s 23.1908 KOps/s $\color{#d91a1a}-1.45\%$
test_add_pytree 59.5910μs 32.4435μs 30.8228 KOps/s 31.3741 KOps/s $\color{#d91a1a}-1.76\%$
test_add_td 91.2910μs 41.5364μs 24.0753 KOps/s 23.7375 KOps/s $\color{#35bf28}+1.42\%$
test_distributed 38.8210μs 5.5848μs 179.0568 KOps/s 180.4494 KOps/s $\color{#d91a1a}-0.77\%$
test_tdmodule 37.5600μs 15.9787μs 62.5832 KOps/s 61.2766 KOps/s $\color{#35bf28}+2.13\%$
test_tdmodule_dispatch 0.2290ms 31.4258μs 31.8210 KOps/s 31.7589 KOps/s $\color{#35bf28}+0.20\%$
test_tdseq 34.7000μs 19.0994μs 52.3577 KOps/s 51.7748 KOps/s $\color{#35bf28}+1.13\%$
test_tdseq_dispatch 0.1329ms 33.9296μs 29.4728 KOps/s 28.8548 KOps/s $\color{#35bf28}+2.14\%$
test_instantiation_functorch 1.8269ms 1.6731ms 597.7097 Ops/s 590.4721 Ops/s $\color{#35bf28}+1.23\%$
test_instantiation_td 1.9462ms 1.1650ms 858.3782 Ops/s 844.0819 Ops/s $\color{#35bf28}+1.69\%$
test_exec_functorch 0.3461ms 0.1585ms 6.3076 KOps/s 6.2896 KOps/s $\color{#35bf28}+0.29\%$
test_exec_functional_call 0.3556ms 0.1552ms 6.4433 KOps/s 6.3520 KOps/s $\color{#35bf28}+1.44\%$
test_exec_td 0.3506ms 0.1488ms 6.7226 KOps/s 6.6748 KOps/s $\color{#35bf28}+0.72\%$
test_exec_td_decorator 0.9368ms 0.2184ms 4.5794 KOps/s 5.1882 KOps/s $\textbf{\color{#d91a1a}-11.74\%}$
test_vmap_mlp_speed[True-True] 1.2830ms 1.0612ms 942.3722 Ops/s 932.0509 Ops/s $\color{#35bf28}+1.11\%$
test_vmap_mlp_speed[True-False] 0.7631ms 0.6030ms 1.6583 KOps/s 1.5632 KOps/s $\textbf{\color{#35bf28}+6.08\%}$
test_vmap_mlp_speed[False-True] 1.1351ms 0.9746ms 1.0261 KOps/s 961.3071 Ops/s $\textbf{\color{#35bf28}+6.74\%}$
test_vmap_mlp_speed[False-False] 0.6846ms 0.5420ms 1.8450 KOps/s 1.7531 KOps/s $\textbf{\color{#35bf28}+5.24\%}$
test_vmap_mlp_speed_decorator[True-True] 2.6452ms 1.7750ms 563.3958 Ops/s 489.3329 Ops/s $\textbf{\color{#35bf28}+15.14\%}$
test_vmap_mlp_speed_decorator[True-False] 1.1256ms 0.6787ms 1.4735 KOps/s 1.5094 KOps/s $\color{#d91a1a}-2.38\%$
test_vmap_mlp_speed_decorator[False-True] 2.0740ms 1.5948ms 627.0311 Ops/s 564.6522 Ops/s $\textbf{\color{#35bf28}+11.05\%}$
test_vmap_mlp_speed_decorator[False-False] 1.0252ms 0.5823ms 1.7174 KOps/s 1.7621 KOps/s $\color{#d91a1a}-2.54\%$
test_vmap_transformer_speed[True-True] 12.7956ms 12.4941ms 80.0377 Ops/s 79.0104 Ops/s $\color{#35bf28}+1.30\%$
test_vmap_transformer_speed[True-False] 8.5489ms 8.2418ms 121.3322 Ops/s 120.5423 Ops/s $\color{#35bf28}+0.66\%$
test_vmap_transformer_speed[False-True] 12.9465ms 12.4581ms 80.2690 Ops/s 80.3188 Ops/s $\color{#d91a1a}-0.06\%$
test_vmap_transformer_speed[False-False] 8.4831ms 8.1721ms 122.3675 Ops/s 121.3346 Ops/s $\color{#35bf28}+0.85\%$
test_vmap_transformer_speed_decorator[True-True] 44.0502ms 42.5822ms 23.4840 Ops/s 15.5370 Ops/s $\textbf{\color{#35bf28}+51.15\%}$
test_vmap_transformer_speed_decorator[True-False] 97.8755ms 21.8219ms 45.8255 Ops/s 49.5822 Ops/s $\textbf{\color{#d91a1a}-7.58\%}$
test_vmap_transformer_speed_decorator[False-True] 42.9305ms 41.9770ms 23.8225 Ops/s 15.8005 Ops/s $\textbf{\color{#35bf28}+50.77\%}$
test_vmap_transformer_speed_decorator[False-False] 0.1048s 21.5819ms 46.3350 Ops/s 50.5316 Ops/s $\textbf{\color{#d91a1a}-8.30\%}$

Labels: bug, CLA Signed