Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFIx] Fix exclude indent #637

Merged
merged 1 commit into from
Jan 24, 2024
Merged

[BugFIx] Fix exclude indent #637

merged 1 commit into from
Jan 24, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Jan 24, 2024

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 24, 2024
@vmoens vmoens linked an issue Jan 24, 2024 that may be closed by this pull request
3 tasks
@vmoens vmoens added the bug Something isn't working label Jan 24, 2024
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 124. Improved: $\large\color{#35bf28}8$. Worsened: $\large\color{#d91a1a}21$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 66.2330μs 17.0327μs 58.7105 KOps/s 60.7106 KOps/s $\color{#d91a1a}-3.29\%$
test_plain_set_stack_nested 0.2674ms 0.1475ms 6.7800 KOps/s 6.7192 KOps/s $\color{#35bf28}+0.90\%$
test_plain_set_nested_inplace 74.3990μs 19.7979μs 50.5105 KOps/s 53.5738 KOps/s $\textbf{\color{#d91a1a}-5.72\%}$
test_plain_set_stack_nested_inplace 0.4593ms 0.1818ms 5.5002 KOps/s 5.5182 KOps/s $\color{#d91a1a}-0.33\%$
test_items 31.6390μs 2.3751μs 421.0436 KOps/s 412.4770 KOps/s $\color{#35bf28}+2.08\%$
test_items_nested 1.0810ms 0.2744ms 3.6443 KOps/s 3.6350 KOps/s $\color{#35bf28}+0.26\%$
test_items_nested_locked 0.4833ms 0.2735ms 3.6562 KOps/s 3.6173 KOps/s $\color{#35bf28}+1.08\%$
test_items_nested_leaf 0.5701ms 0.1686ms 5.9300 KOps/s 5.8604 KOps/s $\color{#35bf28}+1.19\%$
test_items_stack_nested 2.0390ms 1.3225ms 756.1188 Ops/s 750.2031 Ops/s $\color{#35bf28}+0.79\%$
test_items_stack_nested_leaf 1.4362ms 1.1845ms 844.2467 Ops/s 830.2712 Ops/s $\color{#35bf28}+1.68\%$
test_items_stack_nested_locked 1.5304ms 0.8775ms 1.1396 KOps/s 1.1354 KOps/s $\color{#35bf28}+0.37\%$
test_keys 41.4970μs 3.8408μs 260.3606 KOps/s 254.6277 KOps/s $\color{#35bf28}+2.25\%$
test_keys_nested 60.2529ms 0.1576ms 6.3454 KOps/s 6.5599 KOps/s $\color{#d91a1a}-3.27\%$
test_keys_nested_locked 0.2835ms 0.1511ms 6.6203 KOps/s 6.3897 KOps/s $\color{#35bf28}+3.61\%$
test_keys_nested_leaf 0.2405ms 0.1306ms 7.6598 KOps/s 7.4454 KOps/s $\color{#35bf28}+2.88\%$
test_keys_stack_nested 1.9737ms 1.2568ms 795.6936 Ops/s 784.4975 Ops/s $\color{#35bf28}+1.43\%$
test_keys_stack_nested_leaf 1.4073ms 1.2575ms 795.2166 Ops/s 789.4143 Ops/s $\color{#35bf28}+0.74\%$
test_keys_stack_nested_locked 1.3805ms 0.7928ms 1.2614 KOps/s 1.2461 KOps/s $\color{#35bf28}+1.23\%$
test_values 9.2798μs 1.1776μs 849.1922 KOps/s 777.9312 KOps/s $\textbf{\color{#35bf28}+9.16\%}$
test_values_nested 98.4030μs 51.9217μs 19.2598 KOps/s 19.0549 KOps/s $\color{#35bf28}+1.08\%$
test_values_nested_locked 0.1003ms 52.2131μs 19.1523 KOps/s 19.0032 KOps/s $\color{#35bf28}+0.78\%$
test_values_nested_leaf 94.2450μs 46.4295μs 21.5380 KOps/s 21.3768 KOps/s $\color{#35bf28}+0.75\%$
test_values_stack_nested 1.2284ms 1.0158ms 984.4339 Ops/s 948.5073 Ops/s $\color{#35bf28}+3.79\%$
test_values_stack_nested_leaf 1.7138ms 1.0135ms 986.6332 Ops/s 967.3316 Ops/s $\color{#35bf28}+2.00\%$
test_values_stack_nested_locked 0.7780ms 0.5967ms 1.6758 KOps/s 1.6637 KOps/s $\color{#35bf28}+0.73\%$
test_membership 21.2790μs 1.3534μs 738.8987 KOps/s 733.8427 KOps/s $\color{#35bf28}+0.69\%$
test_membership_nested 24.5260μs 3.4781μs 287.5173 KOps/s 282.3077 KOps/s $\color{#35bf28}+1.85\%$
test_membership_nested_leaf 23.9550μs 3.4721μs 288.0069 KOps/s 287.1162 KOps/s $\color{#35bf28}+0.31\%$
test_membership_stacked_nested 30.3470μs 11.6797μs 85.6190 KOps/s 85.0135 KOps/s $\color{#35bf28}+0.71\%$
test_membership_stacked_nested_leaf 45.3750μs 11.7613μs 85.0249 KOps/s 84.3147 KOps/s $\color{#35bf28}+0.84\%$
test_membership_nested_last 39.2430μs 6.5910μs 151.7216 KOps/s 149.6388 KOps/s $\color{#35bf28}+1.39\%$
test_membership_nested_leaf_last 24.8760μs 6.6363μs 150.6863 KOps/s 149.3016 KOps/s $\color{#35bf28}+0.93\%$
test_membership_stacked_nested_last 0.3182ms 0.1760ms 5.6805 KOps/s 5.6727 KOps/s $\color{#35bf28}+0.14\%$
test_membership_stacked_nested_leaf_last 62.8370μs 13.7742μs 72.5994 KOps/s 70.9201 KOps/s $\color{#35bf28}+2.37\%$
test_nested_getleaf 31.5490μs 10.6869μs 93.5722 KOps/s 93.9599 KOps/s $\color{#d91a1a}-0.41\%$
test_nested_get 31.6580μs 10.0242μs 99.7586 KOps/s 95.2701 KOps/s $\color{#35bf28}+4.71\%$
test_stacked_getleaf 0.5866ms 0.3921ms 2.5504 KOps/s 2.4636 KOps/s $\color{#35bf28}+3.52\%$
test_stacked_get 0.6415ms 0.3656ms 2.7352 KOps/s 2.6852 KOps/s $\color{#35bf28}+1.86\%$
test_nested_getitemleaf 39.1530μs 12.2168μs 81.8547 KOps/s 80.4656 KOps/s $\color{#35bf28}+1.73\%$
test_nested_getitem 32.3110μs 11.7616μs 85.0221 KOps/s 84.5890 KOps/s $\color{#35bf28}+0.51\%$
test_stacked_getitemleaf 0.8834ms 0.4007ms 2.4955 KOps/s 2.4456 KOps/s $\color{#35bf28}+2.04\%$
test_stacked_getitem 0.6827ms 0.3719ms 2.6889 KOps/s 2.6417 KOps/s $\color{#35bf28}+1.79\%$
test_lock_nested 0.8082ms 0.3306ms 3.0244 KOps/s 2.9746 KOps/s $\color{#35bf28}+1.68\%$
test_lock_stack_nested 88.1212ms 5.6412ms 177.2684 Ops/s 183.8740 Ops/s $\color{#d91a1a}-3.59\%$
test_unlock_nested 0.7173ms 0.3334ms 2.9998 KOps/s 2.5125 KOps/s $\textbf{\color{#35bf28}+19.40\%}$
test_unlock_stack_nested 78.2486ms 5.5874ms 178.9757 Ops/s 171.1977 Ops/s $\color{#35bf28}+4.54\%$
test_flatten_speed 0.7760ms 0.3681ms 2.7170 KOps/s 2.4878 KOps/s $\textbf{\color{#35bf28}+9.21\%}$
test_unflatten_speed 0.6613ms 0.4656ms 2.1478 KOps/s 2.1169 KOps/s $\color{#35bf28}+1.46\%$
test_common_ops 1.2028ms 0.6938ms 1.4413 KOps/s 1.5967 KOps/s $\textbf{\color{#d91a1a}-9.73\%}$
test_creation 15.6490μs 1.8901μs 529.0688 KOps/s 525.9095 KOps/s $\color{#35bf28}+0.60\%$
test_creation_empty 31.4590μs 10.5911μs 94.4189 KOps/s 129.2714 KOps/s $\textbf{\color{#d91a1a}-26.96\%}$
test_creation_nested_1 39.4940μs 13.1034μs 76.3162 KOps/s 96.4327 KOps/s $\textbf{\color{#d91a1a}-20.86\%}$
test_creation_nested_2 55.8330μs 16.4614μs 60.7482 KOps/s 73.8576 KOps/s $\textbf{\color{#d91a1a}-17.75\%}$
test_clone 0.2148ms 12.9093μs 77.4638 KOps/s 76.5649 KOps/s $\color{#35bf28}+1.17\%$
test_getitem[int] 32.6300μs 10.9138μs 91.6271 KOps/s 91.1084 KOps/s $\color{#35bf28}+0.57\%$
test_getitem[slice_int] 63.8180μs 22.5400μs 44.3656 KOps/s 43.5379 KOps/s $\color{#35bf28}+1.90\%$
test_getitem[range] 0.1139ms 40.1704μs 24.8940 KOps/s 24.2587 KOps/s $\color{#35bf28}+2.62\%$
test_getitem[tuple] 73.9780μs 18.2681μs 54.7404 KOps/s 51.7544 KOps/s $\textbf{\color{#35bf28}+5.77\%}$
test_getitem[list] 0.4280ms 35.3628μs 28.2783 KOps/s 26.7434 KOps/s $\textbf{\color{#35bf28}+5.74\%}$
test_setitem_dim[int] 62.3560μs 29.9529μs 33.3858 KOps/s 34.9248 KOps/s $\color{#d91a1a}-4.41\%$
test_setitem_dim[slice_int] 98.4940μs 55.4894μs 18.0215 KOps/s 18.1186 KOps/s $\color{#d91a1a}-0.54\%$
test_setitem_dim[range] 0.1711ms 73.0138μs 13.6960 KOps/s 14.0624 KOps/s $\color{#d91a1a}-2.60\%$
test_setitem_dim[tuple] 79.1070μs 45.0425μs 22.2013 KOps/s 23.4005 KOps/s $\textbf{\color{#d91a1a}-5.12\%}$
test_setitem 0.2079ms 19.8028μs 50.4980 KOps/s 56.0338 KOps/s $\textbf{\color{#d91a1a}-9.88\%}$
test_set 0.1670ms 19.3936μs 51.5635 KOps/s 57.9990 KOps/s $\textbf{\color{#d91a1a}-11.10\%}$
test_set_shared 3.5371ms 0.1439ms 6.9498 KOps/s 7.1090 KOps/s $\color{#d91a1a}-2.24\%$
test_update 0.1859ms 22.2384μs 44.9673 KOps/s 53.1638 KOps/s $\textbf{\color{#d91a1a}-15.42\%}$
test_update_nested 0.1356ms 30.0877μs 33.2362 KOps/s 38.0004 KOps/s $\textbf{\color{#d91a1a}-12.54\%}$
test_set_nested 0.1048ms 21.0577μs 47.4885 KOps/s 52.9961 KOps/s $\textbf{\color{#d91a1a}-10.39\%}$
test_set_nested_new 76.9530μs 24.7251μs 40.4447 KOps/s 43.2583 KOps/s $\textbf{\color{#d91a1a}-6.50\%}$
test_select 0.1324ms 37.4437μs 26.7068 KOps/s 27.7445 KOps/s $\color{#d91a1a}-3.74\%$
test_select_nested 0.1297ms 58.7064μs 17.0339 KOps/s 16.7942 KOps/s $\color{#35bf28}+1.43\%$
test_exclude_nested 0.2940ms 0.1185ms 8.4412 KOps/s 9.1272 KOps/s $\textbf{\color{#d91a1a}-7.52\%}$
test_empty[True] 0.6216ms 0.4111ms 2.4328 KOps/s 3.0476 KOps/s $\textbf{\color{#d91a1a}-20.17\%}$
test_empty[False] 7.6602μs 1.0548μs 948.0357 KOps/s 954.9398 KOps/s $\color{#d91a1a}-0.72\%$
test_unbind_speed 0.4277ms 0.2404ms 4.1592 KOps/s 4.0353 KOps/s $\color{#35bf28}+3.07\%$
test_unbind_speed_stack0 70.6294ms 3.4056ms 293.6365 Ops/s 302.6281 Ops/s $\color{#d91a1a}-2.97\%$
test_unbind_speed_stack1 20.5880μs 1.9768μs 505.8784 KOps/s 500.5920 KOps/s $\color{#35bf28}+1.06\%$
test_split 1.6976ms 1.4778ms 676.6701 Ops/s 669.7787 Ops/s $\color{#35bf28}+1.03\%$
test_chunk 68.7073ms 1.5778ms 633.8024 Ops/s 623.1601 Ops/s $\color{#35bf28}+1.71\%$
test_creation[device0] 0.2052ms 0.1003ms 9.9737 KOps/s 9.9282 KOps/s $\color{#35bf28}+0.46\%$
test_creation_from_tensor 5.6791ms 83.5267μs 11.9722 KOps/s 12.5312 KOps/s $\color{#d91a1a}-4.46\%$
test_add_one[memmap_tensor0] 0.3408ms 5.2077μs 192.0234 KOps/s 186.7354 KOps/s $\color{#35bf28}+2.83\%$
test_contiguous[memmap_tensor0] 11.0200μs 0.6483μs 1.5426 MOps/s 1.5408 MOps/s $\color{#35bf28}+0.12\%$
test_stack[memmap_tensor0] 75.5500μs 3.4541μs 289.5104 KOps/s 279.2053 KOps/s $\color{#35bf28}+3.69\%$
test_memmaptd_index 0.9091ms 0.2215ms 4.5145 KOps/s 4.5244 KOps/s $\color{#d91a1a}-0.22\%$
test_memmaptd_index_astensor 0.6417ms 0.2824ms 3.5410 KOps/s 3.5484 KOps/s $\color{#d91a1a}-0.21\%$
test_memmaptd_index_op 0.8913ms 0.5682ms 1.7600 KOps/s 1.9041 KOps/s $\textbf{\color{#d91a1a}-7.57\%}$
test_serialize_model 0.1126s 0.1047s 9.5538 Ops/s 9.7598 Ops/s $\color{#d91a1a}-2.11\%$
test_serialize_model_pickle 0.4477s 0.3772s 2.6510 Ops/s 2.6246 Ops/s $\color{#35bf28}+1.01\%$
test_serialize_weights 0.1734s 0.1050s 9.5211 Ops/s 9.2764 Ops/s $\color{#35bf28}+2.64\%$
test_serialize_weights_returnearly 0.1939s 0.1363s 7.3372 Ops/s 6.2511 Ops/s $\textbf{\color{#35bf28}+17.37\%}$
test_serialize_weights_pickle 1.0901s 0.6041s 1.6553 Ops/s 2.4487 Ops/s $\textbf{\color{#d91a1a}-32.40\%}$
test_serialize_weights_filesystem 0.1549s 97.1528ms 10.2931 Ops/s 10.7061 Ops/s $\color{#d91a1a}-3.86\%$
test_serialize_model_filesystem 0.1515s 97.4996ms 10.2565 Ops/s 10.7599 Ops/s $\color{#d91a1a}-4.68\%$
test_reshape_pytree 55.0520μs 22.7699μs 43.9176 KOps/s 43.9458 KOps/s $\color{#d91a1a}-0.06\%$
test_reshape_td 0.1024ms 29.0645μs 34.4062 KOps/s 32.9462 KOps/s $\color{#35bf28}+4.43\%$
test_view_pytree 55.5030μs 22.5468μs 44.3522 KOps/s 43.3460 KOps/s $\color{#35bf28}+2.32\%$
test_view_td 15.5790μs 4.8412μs 206.5608 KOps/s 199.6029 KOps/s $\color{#35bf28}+3.49\%$
test_unbind_pytree 71.5230μs 26.2991μs 38.0241 KOps/s 37.8251 KOps/s $\color{#35bf28}+0.53\%$
test_unbind_td 0.4380ms 34.9908μs 28.5789 KOps/s 27.4393 KOps/s $\color{#35bf28}+4.15\%$
test_split_pytree 71.3730μs 25.7073μs 38.8994 KOps/s 37.3976 KOps/s $\color{#35bf28}+4.02\%$
test_split_td 0.1117ms 40.0140μs 24.9912 KOps/s 24.6867 KOps/s $\color{#35bf28}+1.23\%$
test_add_pytree 71.6330μs 32.0787μs 31.1733 KOps/s 31.4775 KOps/s $\color{#d91a1a}-0.97\%$
test_add_td 0.1215ms 49.7665μs 20.0938 KOps/s 21.1919 KOps/s $\textbf{\color{#d91a1a}-5.18\%}$
test_distributed 0.1684ms 97.6167μs 10.2441 KOps/s 9.9577 KOps/s $\color{#35bf28}+2.88\%$
test_tdmodule 0.8195ms 23.4452μs 42.6527 KOps/s 47.5480 KOps/s $\textbf{\color{#d91a1a}-10.30\%}$
test_tdmodule_dispatch 0.2121ms 44.9910μs 22.2267 KOps/s 24.4411 KOps/s $\textbf{\color{#d91a1a}-9.06\%}$
test_tdseq 51.2860μs 26.3646μs 37.9297 KOps/s 41.2134 KOps/s $\textbf{\color{#d91a1a}-7.97\%}$
test_tdseq_dispatch 0.1562ms 48.9170μs 20.4428 KOps/s 21.8046 KOps/s $\textbf{\color{#d91a1a}-6.25\%}$
test_instantiation_functorch 1.5122ms 1.2659ms 789.9352 Ops/s 748.4523 Ops/s $\textbf{\color{#35bf28}+5.54\%}$
test_instantiation_td 1.4355ms 0.9880ms 1.0121 KOps/s 975.9120 Ops/s $\color{#35bf28}+3.71\%$
test_exec_functorch 0.6814ms 0.1577ms 6.3431 KOps/s 6.3426 KOps/s $+0.01\%$
test_exec_functional_call 0.8579ms 0.1432ms 6.9829 KOps/s 6.8916 KOps/s $\color{#35bf28}+1.32\%$
test_exec_td 0.2843ms 0.1399ms 7.1456 KOps/s 6.9317 KOps/s $\color{#35bf28}+3.09\%$
test_exec_td_decorator 0.8044ms 0.1741ms 5.7450 KOps/s 5.6197 KOps/s $\color{#35bf28}+2.23\%$
test_vmap_mlp_speed[True-True] 0.9644ms 0.8528ms 1.1727 KOps/s 1.1102 KOps/s $\textbf{\color{#35bf28}+5.63\%}$
test_vmap_mlp_speed[True-False] 0.7081ms 0.4629ms 2.1603 KOps/s 2.1303 KOps/s $\color{#35bf28}+1.41\%$
test_vmap_mlp_speed[False-True] 1.0713ms 0.7470ms 1.3386 KOps/s 1.2900 KOps/s $\color{#35bf28}+3.76\%$
test_vmap_mlp_speed[False-False] 0.6667ms 0.3758ms 2.6610 KOps/s 2.5866 KOps/s $\color{#35bf28}+2.88\%$
test_vmap_mlp_speed_decorator[True-True] 2.9961ms 2.2824ms 438.1315 Ops/s 442.8900 Ops/s $\color{#d91a1a}-1.07\%$
test_vmap_mlp_speed_decorator[True-False] 0.8620ms 0.5101ms 1.9604 KOps/s 1.9185 KOps/s $\color{#35bf28}+2.18\%$
test_vmap_mlp_speed_decorator[False-True] 2.8751ms 1.9321ms 517.5668 Ops/s 539.9235 Ops/s $\color{#d91a1a}-4.14\%$
test_vmap_mlp_speed_decorator[False-False] 0.9165ms 0.3928ms 2.5459 KOps/s 2.5235 KOps/s $\color{#35bf28}+0.89\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 132. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}29$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 56.3820μs 14.4712μs 69.1026 KOps/s 77.9193 KOps/s $\textbf{\color{#d91a1a}-11.32\%}$
test_plain_set_stack_nested 0.1622ms 0.1202ms 8.3212 KOps/s 8.5082 KOps/s $\color{#d91a1a}-2.20\%$
test_plain_set_nested_inplace 32.4710μs 15.7756μs 63.3891 KOps/s 70.6643 KOps/s $\textbf{\color{#d91a1a}-10.30\%}$
test_plain_set_stack_nested_inplace 0.2068ms 0.1482ms 6.7487 KOps/s 6.8177 KOps/s $\color{#d91a1a}-1.01\%$
test_items 28.3200μs 4.7048μs 212.5507 KOps/s 214.1249 KOps/s $\color{#d91a1a}-0.74\%$
test_items_nested 0.4369ms 0.3390ms 2.9497 KOps/s 2.9272 KOps/s $\color{#35bf28}+0.77\%$
test_items_nested_locked 0.4081ms 0.3454ms 2.8955 KOps/s 2.8979 KOps/s $\color{#d91a1a}-0.08\%$
test_items_nested_leaf 0.2664ms 0.2002ms 4.9948 KOps/s 4.9436 KOps/s $\color{#35bf28}+1.04\%$
test_items_stack_nested 1.3969ms 1.2866ms 777.2713 Ops/s 758.6564 Ops/s $\color{#35bf28}+2.45\%$
test_items_stack_nested_leaf 1.2391ms 1.1296ms 885.3022 Ops/s 872.1788 Ops/s $\color{#35bf28}+1.50\%$
test_items_stack_nested_locked 1.9022ms 0.8810ms 1.1351 KOps/s 1.1168 KOps/s $\color{#35bf28}+1.63\%$
test_keys 28.1810μs 4.5665μs 218.9854 KOps/s 219.6340 KOps/s $\color{#d91a1a}-0.30\%$
test_keys_nested 0.4686ms 95.0277μs 10.5233 KOps/s 10.5041 KOps/s $\color{#35bf28}+0.18\%$
test_keys_nested_locked 0.1345ms 98.8565μs 10.1157 KOps/s 10.2523 KOps/s $\color{#d91a1a}-1.33\%$
test_keys_nested_leaf 0.1808ms 78.9355μs 12.6686 KOps/s 12.6537 KOps/s $\color{#35bf28}+0.12\%$
test_keys_stack_nested 1.2382ms 1.1342ms 881.6968 Ops/s 857.6888 Ops/s $\color{#35bf28}+2.80\%$
test_keys_stack_nested_leaf 1.2552ms 1.1272ms 887.1164 Ops/s 882.2501 Ops/s $\color{#35bf28}+0.55\%$
test_keys_stack_nested_locked 0.7635ms 0.7049ms 1.4187 KOps/s 1.3874 KOps/s $\color{#35bf28}+2.25\%$
test_values 8.8370μs 1.8762μs 532.9913 KOps/s 526.6943 KOps/s $\color{#35bf28}+1.20\%$
test_values_nested 76.3710μs 44.7843μs 22.3293 KOps/s 22.1817 KOps/s $\color{#35bf28}+0.67\%$
test_values_nested_locked 83.4020μs 47.3408μs 21.1234 KOps/s 21.1273 KOps/s $\color{#d91a1a}-0.02\%$
test_values_nested_leaf 64.3110μs 39.1498μs 25.5429 KOps/s 25.3725 KOps/s $\color{#35bf28}+0.67\%$
test_values_stack_nested 1.0807ms 0.9417ms 1.0619 KOps/s 1.0630 KOps/s $\color{#d91a1a}-0.10\%$
test_values_stack_nested_leaf 1.0375ms 0.9391ms 1.0649 KOps/s 1.0636 KOps/s $\color{#35bf28}+0.12\%$
test_values_stack_nested_locked 0.6711ms 0.5642ms 1.7724 KOps/s 1.7896 KOps/s $\color{#d91a1a}-0.96\%$
test_membership 28.5110μs 1.0380μs 963.4035 KOps/s 1.0661 MOps/s $\textbf{\color{#d91a1a}-9.63\%}$
test_membership_nested 27.5700μs 2.8409μs 351.9960 KOps/s 353.7340 KOps/s $\color{#d91a1a}-0.49\%$
test_membership_nested_leaf 31.9300μs 2.8545μs 350.3299 KOps/s 352.7069 KOps/s $\color{#d91a1a}-0.67\%$
test_membership_stacked_nested 31.3210μs 11.3431μs 88.1597 KOps/s 88.4341 KOps/s $\color{#d91a1a}-0.31\%$
test_membership_stacked_nested_leaf 45.2020μs 11.3054μs 88.4532 KOps/s 88.4575 KOps/s $-0.00\%$
test_membership_nested_last 21.6000μs 5.2659μs 189.9014 KOps/s 191.4907 KOps/s $\color{#d91a1a}-0.83\%$
test_membership_nested_leaf_last 32.9110μs 5.2639μs 189.9747 KOps/s 193.0565 KOps/s $\color{#d91a1a}-1.60\%$
test_membership_stacked_nested_last 0.1856ms 0.1549ms 6.4538 KOps/s 6.4413 KOps/s $\color{#35bf28}+0.19\%$
test_membership_stacked_nested_leaf_last 29.5700μs 13.1480μs 76.0573 KOps/s 76.4198 KOps/s $\color{#d91a1a}-0.47\%$
test_nested_getleaf 31.4710μs 8.3817μs 119.3081 KOps/s 118.4641 KOps/s $\color{#35bf28}+0.71\%$
test_nested_get 31.2910μs 7.8945μs 126.6708 KOps/s 126.3130 KOps/s $\color{#35bf28}+0.28\%$
test_stacked_getleaf 0.4273ms 0.3227ms 3.0985 KOps/s 3.0469 KOps/s $\color{#35bf28}+1.69\%$
test_stacked_get 0.3864ms 0.2920ms 3.4246 KOps/s 3.3629 KOps/s $\color{#35bf28}+1.84\%$
test_nested_getitemleaf 25.4300μs 9.7182μs 102.8999 KOps/s 101.1343 KOps/s $\color{#35bf28}+1.75\%$
test_nested_getitem 30.6620μs 9.2538μs 108.0636 KOps/s 106.1685 KOps/s $\color{#35bf28}+1.78\%$
test_stacked_getitemleaf 0.4240ms 0.3266ms 3.0619 KOps/s 3.0266 KOps/s $\color{#35bf28}+1.16\%$
test_stacked_getitem 0.3594ms 0.2925ms 3.4184 KOps/s 3.3444 KOps/s $\color{#35bf28}+2.21\%$
test_lock_nested 0.8228ms 0.3524ms 2.8378 KOps/s 2.8196 KOps/s $\color{#35bf28}+0.64\%$
test_lock_stack_nested 83.2428ms 6.2682ms 159.5356 Ops/s 160.7880 Ops/s $\color{#d91a1a}-0.78\%$
test_unlock_nested 0.8255ms 0.3464ms 2.8871 KOps/s 2.3487 KOps/s $\textbf{\color{#35bf28}+22.92\%}$
test_unlock_stack_nested 82.6547ms 6.2493ms 160.0177 Ops/s 159.1435 Ops/s $\color{#35bf28}+0.55\%$
test_flatten_speed 0.5187ms 0.2619ms 3.8189 KOps/s 3.7961 KOps/s $\color{#35bf28}+0.60\%$
test_unflatten_speed 0.4521ms 0.3677ms 2.7194 KOps/s 2.7601 KOps/s $\color{#d91a1a}-1.48\%$
test_common_ops 1.1432ms 0.6255ms 1.5987 KOps/s 1.7354 KOps/s $\textbf{\color{#d91a1a}-7.88\%}$
test_creation 16.7500μs 1.5262μs 655.2340 KOps/s 647.9944 KOps/s $\color{#35bf28}+1.12\%$
test_creation_empty 26.2800μs 9.8990μs 101.0205 KOps/s 150.4851 KOps/s $\textbf{\color{#d91a1a}-32.87\%}$
test_creation_nested_1 49.4110μs 11.6548μs 85.8016 KOps/s 119.0628 KOps/s $\textbf{\color{#d91a1a}-27.94\%}$
test_creation_nested_2 39.5310μs 14.1824μs 70.5098 KOps/s 93.3654 KOps/s $\textbf{\color{#d91a1a}-24.48\%}$
test_clone 87.4010μs 13.9377μs 71.7477 KOps/s 73.3736 KOps/s $\color{#d91a1a}-2.22\%$
test_getitem[int] 26.5220μs 10.9740μs 91.1245 KOps/s 92.0416 KOps/s $\color{#d91a1a}-1.00\%$
test_getitem[slice_int] 49.5120μs 20.7639μs 48.1604 KOps/s 48.4575 KOps/s $\color{#d91a1a}-0.61\%$
test_getitem[range] 65.6510μs 35.7888μs 27.9417 KOps/s 28.1759 KOps/s $\color{#d91a1a}-0.83\%$
test_getitem[tuple] 43.2600μs 18.4870μs 54.0919 KOps/s 54.4031 KOps/s $\color{#d91a1a}-0.57\%$
test_getitem[list] 0.2859ms 31.9351μs 31.3135 KOps/s 31.0217 KOps/s $\color{#35bf28}+0.94\%$
test_setitem_dim[int] 50.0410μs 27.2662μs 36.6754 KOps/s 42.0601 KOps/s $\textbf{\color{#d91a1a}-12.80\%}$
test_setitem_dim[slice_int] 72.9420μs 48.5341μs 20.6041 KOps/s 22.5931 KOps/s $\textbf{\color{#d91a1a}-8.80\%}$
test_setitem_dim[range] 87.7920μs 62.1795μs 16.0825 KOps/s 17.3890 KOps/s $\textbf{\color{#d91a1a}-7.51\%}$
test_setitem_dim[tuple] 62.8310μs 41.6712μs 23.9974 KOps/s 26.0080 KOps/s $\textbf{\color{#d91a1a}-7.73\%}$
test_setitem 89.2830μs 19.0266μs 52.5580 KOps/s 57.1764 KOps/s $\textbf{\color{#d91a1a}-8.08\%}$
test_set 98.5920μs 18.4502μs 54.2000 KOps/s 58.3585 KOps/s $\textbf{\color{#d91a1a}-7.13\%}$
test_set_shared 1.5717ms 0.1015ms 9.8570 KOps/s 9.7617 KOps/s $\color{#35bf28}+0.98\%$
test_update 90.9230μs 21.8978μs 45.6667 KOps/s 53.6306 KOps/s $\textbf{\color{#d91a1a}-14.85\%}$
test_update_nested 0.1034ms 28.3470μs 35.2772 KOps/s 40.4665 KOps/s $\textbf{\color{#d91a1a}-12.82\%}$
test_set_nested 99.8220μs 19.7152μs 50.7223 KOps/s 55.0980 KOps/s $\textbf{\color{#d91a1a}-7.94\%}$
test_set_nested_new 88.3620μs 22.3994μs 44.6440 KOps/s 48.0473 KOps/s $\textbf{\color{#d91a1a}-7.08\%}$
test_select 0.1092ms 35.4872μs 28.1792 KOps/s 29.3413 KOps/s $\color{#d91a1a}-3.96\%$
test_select_nested 84.4220μs 53.2924μs 18.7644 KOps/s 18.6789 KOps/s $\color{#35bf28}+0.46\%$
test_exclude_nested 0.1961ms 0.1178ms 8.4907 KOps/s 9.2434 KOps/s $\textbf{\color{#d91a1a}-8.14\%}$
test_empty[True] 0.4870ms 0.3883ms 2.5751 KOps/s 3.0915 KOps/s $\textbf{\color{#d91a1a}-16.70\%}$
test_empty[False] 3.2441μs 0.8688μs 1.1510 MOps/s 1.1285 MOps/s $\color{#35bf28}+1.99\%$
test_to 72.1110μs 52.2490μs 19.1391 KOps/s 18.6411 KOps/s $\color{#35bf28}+2.67\%$
test_to_nonblocking 67.5920μs 34.1530μs 29.2800 KOps/s 30.1137 KOps/s $\color{#d91a1a}-2.77\%$
test_unbind_speed 0.9043ms 0.2658ms 3.7618 KOps/s 3.7620 KOps/s $-0.00\%$
test_unbind_speed_stack0 82.5771ms 3.7027ms 270.0728 Ops/s 245.6485 Ops/s $\textbf{\color{#35bf28}+9.94\%}$
test_unbind_speed_stack1 8.3733μs 1.7119μs 584.1474 KOps/s 580.5969 KOps/s $\color{#35bf28}+0.61\%$
test_split 1.6452ms 1.5443ms 647.5514 Ops/s 653.3938 Ops/s $\color{#d91a1a}-0.89\%$
test_chunk 75.8640ms 1.6491ms 606.3775 Ops/s 606.6088 Ops/s $\color{#d91a1a}-0.04\%$
test_creation[device0] 0.1390ms 73.0057μs 13.6976 KOps/s 14.3164 KOps/s $\color{#d91a1a}-4.32\%$
test_creation_from_tensor 0.1364ms 55.0784μs 18.1559 KOps/s 19.2381 KOps/s $\textbf{\color{#d91a1a}-5.62\%}$
test_add_one[memmap_tensor0] 0.1581ms 6.5839μs 151.8845 KOps/s 142.3510 KOps/s $\textbf{\color{#35bf28}+6.70\%}$
test_contiguous[memmap_tensor0] 9.2000μs 0.5984μs 1.6712 MOps/s 1.6643 MOps/s $\color{#35bf28}+0.41\%$
test_stack[memmap_tensor0] 54.3610μs 4.2170μs 237.1356 KOps/s 235.9099 KOps/s $\color{#35bf28}+0.52\%$
test_memmaptd_index 1.0266ms 0.2530ms 3.9522 KOps/s 3.8887 KOps/s $\color{#35bf28}+1.63\%$
test_memmaptd_index_astensor 0.6356ms 0.3101ms 3.2251 KOps/s 3.1707 KOps/s $\color{#35bf28}+1.72\%$
test_memmaptd_index_op 0.9539ms 0.6295ms 1.5886 KOps/s 1.7003 KOps/s $\textbf{\color{#d91a1a}-6.57\%}$
test_serialize_model 0.1668s 96.6672ms 10.3448 Ops/s 9.7746 Ops/s $\textbf{\color{#35bf28}+5.83\%}$
test_serialize_model_pickle 1.3498s 1.2357s 0.8093 Ops/s 0.8081 Ops/s $\color{#35bf28}+0.14\%$
test_serialize_weights 0.1666s 94.8227ms 10.5460 Ops/s 9.9591 Ops/s $\textbf{\color{#35bf28}+5.89\%}$
test_serialize_weights_returnearly 0.2718s 77.6764ms 12.8739 Ops/s 12.5037 Ops/s $\color{#35bf28}+2.96\%$
test_serialize_weights_pickle 1.3486s 1.2357s 0.8093 Ops/s 0.8091 Ops/s $\color{#35bf28}+0.02\%$
test_reshape_pytree 0.1398ms 24.4651μs 40.8745 KOps/s 41.0987 KOps/s $\color{#d91a1a}-0.55\%$
test_reshape_td 0.1927ms 30.8551μs 32.4095 KOps/s 34.6809 KOps/s $\textbf{\color{#d91a1a}-6.55\%}$
test_view_pytree 0.2411ms 24.4815μs 40.8471 KOps/s 41.9570 KOps/s $\color{#d91a1a}-2.65\%$
test_view_td 21.2010μs 4.1945μs 238.4080 KOps/s 235.7766 KOps/s $\color{#35bf28}+1.12\%$
test_unbind_pytree 54.1810μs 30.0298μs 33.3003 KOps/s 32.9286 KOps/s $\color{#35bf28}+1.13\%$
test_unbind_td 0.5315ms 40.3586μs 24.7779 KOps/s 25.1597 KOps/s $\color{#d91a1a}-1.52\%$
test_split_pytree 62.1610μs 29.1465μs 34.3094 KOps/s 34.6611 KOps/s $\color{#d91a1a}-1.01\%$
test_split_td 0.2302ms 39.5251μs 25.3004 KOps/s 26.4611 KOps/s $\color{#d91a1a}-4.39\%$
test_add_pytree 57.8110μs 35.4490μs 28.2096 KOps/s 27.4191 KOps/s $\color{#35bf28}+2.88\%$
test_add_td 0.2830ms 49.6785μs 20.1294 KOps/s 22.2418 KOps/s $\textbf{\color{#d91a1a}-9.50\%}$
test_distributed 2.2417ms 79.1778μs 12.6298 KOps/s 13.0417 KOps/s $\color{#d91a1a}-3.16\%$
test_tdmodule 0.1111ms 18.3121μs 54.6087 KOps/s 59.4035 KOps/s $\textbf{\color{#d91a1a}-8.07\%}$
test_tdmodule_dispatch 0.2190ms 38.3405μs 26.0821 KOps/s 27.9452 KOps/s $\textbf{\color{#d91a1a}-6.67\%}$
test_tdseq 37.4500μs 21.5347μs 46.4366 KOps/s 51.7146 KOps/s $\textbf{\color{#d91a1a}-10.21\%}$
test_tdseq_dispatch 57.9810μs 41.0581μs 24.3557 KOps/s 27.0014 KOps/s $\textbf{\color{#d91a1a}-9.80\%}$
test_instantiation_functorch 2.0773ms 1.6675ms 599.7152 Ops/s 600.9495 Ops/s $\color{#d91a1a}-0.21\%$
test_instantiation_td 1.7682ms 1.1609ms 861.4230 Ops/s 860.0954 Ops/s $\color{#35bf28}+0.15\%$
test_exec_functorch 0.1875ms 0.1586ms 6.3061 KOps/s 6.1911 KOps/s $\color{#35bf28}+1.86\%$
test_exec_functional_call 0.2033ms 0.1582ms 6.3201 KOps/s 6.1247 KOps/s $\color{#35bf28}+3.19\%$
test_exec_td 0.1835ms 0.1484ms 6.7393 KOps/s 6.3757 KOps/s $\textbf{\color{#35bf28}+5.70\%}$
test_exec_td_decorator 0.6662ms 0.1883ms 5.3103 KOps/s 5.1753 KOps/s $\color{#35bf28}+2.61\%$
test_vmap_mlp_speed[True-True] 1.0989ms 1.0342ms 966.9379 Ops/s 930.4919 Ops/s $\color{#35bf28}+3.92\%$
test_vmap_mlp_speed[True-False] 0.6990ms 0.6204ms 1.6119 KOps/s 1.6499 KOps/s $\color{#d91a1a}-2.30\%$
test_vmap_mlp_speed[False-True] 1.0773ms 0.9969ms 1.0031 KOps/s 1.0395 KOps/s $\color{#d91a1a}-3.50\%$
test_vmap_mlp_speed[False-False] 0.6845ms 0.5667ms 1.7647 KOps/s 1.8595 KOps/s $\textbf{\color{#d91a1a}-5.10\%}$
test_vmap_mlp_speed_decorator[True-True] 3.0578ms 2.3264ms 429.8401 Ops/s 440.2884 Ops/s $\color{#d91a1a}-2.37\%$
test_vmap_mlp_speed_decorator[True-False] 1.0692ms 0.6744ms 1.4828 KOps/s 1.5446 KOps/s $\color{#d91a1a}-4.00\%$
test_vmap_mlp_speed_decorator[False-True] 2.4509ms 1.9996ms 500.0896 Ops/s 525.4030 Ops/s $\color{#d91a1a}-4.82\%$
test_vmap_mlp_speed_decorator[False-False] 1.0052ms 0.5487ms 1.8227 KOps/s 1.8010 KOps/s $\color{#35bf28}+1.20\%$
test_vmap_transformer_speed[True-True] 12.1441ms 11.9823ms 83.4564 Ops/s 80.1954 Ops/s $\color{#35bf28}+4.07\%$
test_vmap_transformer_speed[True-False] 8.0442ms 7.9737ms 125.4116 Ops/s 123.6434 Ops/s $\color{#35bf28}+1.43\%$
test_vmap_transformer_speed[False-True] 12.1003ms 11.8587ms 84.3264 Ops/s 82.5907 Ops/s $\color{#35bf28}+2.10\%$
test_vmap_transformer_speed[False-False] 7.9801ms 7.8721ms 127.0305 Ops/s 124.9602 Ops/s $\color{#35bf28}+1.66\%$
test_vmap_transformer_speed_decorator[True-True] 0.1625s 78.5331ms 12.7335 Ops/s 13.9216 Ops/s $\textbf{\color{#d91a1a}-8.53\%}$
test_vmap_transformer_speed_decorator[True-False] 21.0376ms 19.2577ms 51.9273 Ops/s 46.4354 Ops/s $\textbf{\color{#35bf28}+11.83\%}$
test_vmap_transformer_speed_decorator[False-True] 65.9660ms 64.6518ms 15.4675 Ops/s 15.3501 Ops/s $\color{#35bf28}+0.76\%$
test_vmap_transformer_speed_decorator[False-False] 20.6270ms 18.8222ms 53.1286 Ops/s 52.5101 Ops/s $\color{#35bf28}+1.18\%$

@vmoens vmoens merged commit 6a38d31 into main Jan 24, 2024
43 of 45 checks passed
@vmoens vmoens deleted the fix-exclude branch January 24, 2024 18:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] exclude adds keys to the resulted tensordict
2 participants