Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Badge] Discord shield #736

Merged
merged 2 commits into from
Apr 19, 2024
Merged

[Badge] Discord shield #736

merged 2 commits into from
Apr 19, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Apr 19, 2024

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 19, 2024
@vmoens vmoens added the enhancement New feature or request label Apr 19, 2024
@vmoens vmoens merged commit 3a44928 into main Apr 19, 2024
9 of 11 checks passed
@vmoens vmoens deleted the discord branch April 19, 2024 13:19
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 127. Improved: $\large\color{#35bf28}26$. Worsened: $\large\color{#d91a1a}3$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 41.4770μs 16.3944μs 60.9965 KOps/s 57.7233 KOps/s $\textbf{\color{#35bf28}+5.67\%}$
test_plain_set_stack_nested 45.4850μs 16.9076μs 59.1451 KOps/s 57.3579 KOps/s $\color{#35bf28}+3.12\%$
test_plain_set_nested_inplace 0.2577ms 19.3633μs 51.6440 KOps/s 50.9831 KOps/s $\color{#35bf28}+1.30\%$
test_plain_set_stack_nested_inplace 64.7570μs 18.5872μs 53.8004 KOps/s 51.7185 KOps/s $\color{#35bf28}+4.03\%$
test_items 42.1490μs 2.4735μs 404.2826 KOps/s 370.6063 KOps/s $\textbf{\color{#35bf28}+9.09\%}$
test_items_nested 1.0451ms 0.2700ms 3.7031 KOps/s 3.7123 KOps/s $\color{#d91a1a}-0.25\%$
test_items_nested_locked 0.4021ms 0.2700ms 3.7031 KOps/s 3.7000 KOps/s $\color{#35bf28}+0.08\%$
test_items_nested_leaf 0.4234ms 77.4581μs 12.9102 KOps/s 13.0264 KOps/s $\color{#d91a1a}-0.89\%$
test_items_stack_nested 0.9198ms 0.2735ms 3.6566 KOps/s 3.7224 KOps/s $\color{#d91a1a}-1.77\%$
test_items_stack_nested_leaf 0.1529ms 78.7753μs 12.6943 KOps/s 12.8284 KOps/s $\color{#d91a1a}-1.05\%$
test_items_stack_nested_locked 0.3509ms 0.2738ms 3.6523 KOps/s 3.7105 KOps/s $\color{#d91a1a}-1.57\%$
test_keys 47.4690μs 3.8473μs 259.9258 KOps/s 260.4344 KOps/s $\color{#d91a1a}-0.20\%$
test_keys_nested 0.2178ms 0.1348ms 7.4209 KOps/s 7.2379 KOps/s $\color{#35bf28}+2.53\%$
test_keys_nested_locked 0.7573ms 0.1385ms 7.2185 KOps/s 7.0261 KOps/s $\color{#35bf28}+2.74\%$
test_keys_nested_leaf 0.2293ms 0.1135ms 8.8125 KOps/s 8.5318 KOps/s $\color{#35bf28}+3.29\%$
test_keys_stack_nested 0.2209ms 0.1325ms 7.5471 KOps/s 7.2661 KOps/s $\color{#35bf28}+3.87\%$
test_keys_stack_nested_leaf 0.1853ms 0.1131ms 8.8397 KOps/s 8.5639 KOps/s $\color{#35bf28}+3.22\%$
test_keys_stack_nested_locked 0.2610ms 0.1381ms 7.2433 KOps/s 6.9707 KOps/s $\color{#35bf28}+3.91\%$
test_values 7.5992μs 1.1388μs 878.1315 KOps/s 878.9963 KOps/s $\color{#d91a1a}-0.10\%$
test_values_nested 0.1035ms 50.3549μs 19.8590 KOps/s 19.3262 KOps/s $\color{#35bf28}+2.76\%$
test_values_nested_locked 0.1062ms 50.5769μs 19.7719 KOps/s 19.3728 KOps/s $\color{#35bf28}+2.06\%$
test_values_nested_leaf 98.6740μs 45.6051μs 21.9274 KOps/s 21.4016 KOps/s $\color{#35bf28}+2.46\%$
test_values_stack_nested 0.1042ms 51.5651μs 19.3930 KOps/s 19.5258 KOps/s $\color{#d91a1a}-0.68\%$
test_values_stack_nested_leaf 0.1643ms 44.5035μs 22.4701 KOps/s 21.5279 KOps/s $\color{#35bf28}+4.38\%$
test_values_stack_nested_locked 2.0708ms 51.4717μs 19.4282 KOps/s 19.2933 KOps/s $\color{#35bf28}+0.70\%$
test_membership 20.0170μs 1.3491μs 741.2361 KOps/s 745.6566 KOps/s $\color{#d91a1a}-0.59\%$
test_membership_nested 36.7090μs 3.4633μs 288.7383 KOps/s 292.3976 KOps/s $\color{#d91a1a}-1.25\%$
test_membership_nested_leaf 48.3000μs 3.4315μs 291.4167 KOps/s 291.8343 KOps/s $\color{#d91a1a}-0.14\%$
test_membership_stacked_nested 21.4300μs 3.4420μs 290.5287 KOps/s 291.9832 KOps/s $\color{#d91a1a}-0.50\%$
test_membership_stacked_nested_leaf 47.5790μs 3.4055μs 293.6465 KOps/s 288.1751 KOps/s $\color{#35bf28}+1.90\%$
test_membership_nested_last 38.8920μs 4.1583μs 240.4834 KOps/s 228.2640 KOps/s $\textbf{\color{#35bf28}+5.35\%}$
test_membership_nested_leaf_last 22.3020μs 4.1859μs 238.8945 KOps/s 236.9139 KOps/s $\color{#35bf28}+0.84\%$
test_membership_stacked_nested_last 35.2260μs 4.1841μs 238.9976 KOps/s 239.1167 KOps/s $\color{#d91a1a}-0.05\%$
test_membership_stacked_nested_leaf_last 20.2580μs 4.1432μs 241.3609 KOps/s 234.6613 KOps/s $\color{#35bf28}+2.85\%$
test_nested_getleaf 59.4310μs 10.5631μs 94.6696 KOps/s 91.9017 KOps/s $\color{#35bf28}+3.01\%$
test_nested_get 46.8270μs 9.8865μs 101.1480 KOps/s 98.7174 KOps/s $\color{#35bf28}+2.46\%$
test_stacked_getleaf 63.7690μs 10.5922μs 94.4092 KOps/s 92.3299 KOps/s $\color{#35bf28}+2.25\%$
test_stacked_get 29.4150μs 9.9246μs 100.7595 KOps/s 99.4523 KOps/s $\color{#35bf28}+1.31\%$
test_nested_getitemleaf 48.5910μs 11.1980μs 89.3013 KOps/s 88.6068 KOps/s $\color{#35bf28}+0.78\%$
test_nested_getitem 57.7380μs 10.2021μs 98.0189 KOps/s 96.7981 KOps/s $\color{#35bf28}+1.26\%$
test_stacked_getitemleaf 50.8750μs 11.2690μs 88.7387 KOps/s 87.0192 KOps/s $\color{#35bf28}+1.98\%$
test_stacked_getitem 56.8870μs 10.1995μs 98.0440 KOps/s 95.9969 KOps/s $\color{#35bf28}+2.13\%$
test_lock_nested 50.8844ms 0.3949ms 2.5322 KOps/s 2.8435 KOps/s $\textbf{\color{#d91a1a}-10.95\%}$
test_lock_stack_nested 0.4282ms 0.3016ms 3.3159 KOps/s 3.1861 KOps/s $\color{#35bf28}+4.08\%$
test_unlock_nested 0.1029s 0.4576ms 2.1852 KOps/s 2.1601 KOps/s $\color{#35bf28}+1.16\%$
test_unlock_stack_nested 0.8197ms 0.3155ms 3.1693 KOps/s 3.0860 KOps/s $\color{#35bf28}+2.70\%$
test_flatten_speed 4.1839ms 92.4467μs 10.8170 KOps/s 10.8609 KOps/s $\color{#d91a1a}-0.40\%$
test_unflatten_speed 0.6936ms 0.3983ms 2.5105 KOps/s 2.4460 KOps/s $\color{#35bf28}+2.64\%$
test_common_ops 1.3349ms 0.7028ms 1.4229 KOps/s 1.3386 KOps/s $\textbf{\color{#35bf28}+6.30\%}$
test_creation 0.1147ms 1.8298μs 546.5101 KOps/s 532.8282 KOps/s $\color{#35bf28}+2.57\%$
test_creation_empty 34.8350μs 9.9601μs 100.4009 KOps/s 92.0514 KOps/s $\textbf{\color{#35bf28}+9.07\%}$
test_creation_nested_1 70.6860μs 12.4629μs 80.2382 KOps/s 73.6364 KOps/s $\textbf{\color{#35bf28}+8.97\%}$
test_creation_nested_2 41.5880μs 15.9361μs 62.7505 KOps/s 58.2654 KOps/s $\textbf{\color{#35bf28}+7.70\%}$
test_clone 92.0320μs 13.2239μs 75.6205 KOps/s 74.2238 KOps/s $\color{#35bf28}+1.88\%$
test_getitem[int] 32.2200μs 11.3703μs 87.9488 KOps/s 88.0324 KOps/s $\color{#d91a1a}-0.10\%$
test_getitem[slice_int] 86.8020μs 22.7282μs 43.9983 KOps/s 43.8211 KOps/s $\color{#35bf28}+0.40\%$
test_getitem[range] 0.3040ms 45.3684μs 22.0418 KOps/s 22.8459 KOps/s $\color{#d91a1a}-3.52\%$
test_getitem[tuple] 61.5650μs 18.3350μs 54.5405 KOps/s 52.8319 KOps/s $\color{#35bf28}+3.23\%$
test_getitem[list] 0.3259ms 40.3910μs 24.7580 KOps/s 25.8492 KOps/s $\color{#d91a1a}-4.22\%$
test_setitem_dim[int] 0.1050ms 34.8656μs 28.6816 KOps/s 27.4934 KOps/s $\color{#35bf28}+4.32\%$
test_setitem_dim[slice_int] 0.1051ms 59.4152μs 16.8307 KOps/s 15.6178 KOps/s $\textbf{\color{#35bf28}+7.77\%}$
test_setitem_dim[range] 0.1242ms 79.7745μs 12.5353 KOps/s 11.9563 KOps/s $\color{#35bf28}+4.84\%$
test_setitem_dim[tuple] 88.8560μs 48.0526μs 20.8105 KOps/s 19.3097 KOps/s $\textbf{\color{#35bf28}+7.77\%}$
test_setitem 0.1399ms 19.4664μs 51.3704 KOps/s 47.9021 KOps/s $\textbf{\color{#35bf28}+7.24\%}$
test_set 0.1408ms 19.1300μs 52.2740 KOps/s 48.9325 KOps/s $\textbf{\color{#35bf28}+6.83\%}$
test_set_shared 5.6596ms 0.1502ms 6.6580 KOps/s 6.8733 KOps/s $\color{#d91a1a}-3.13\%$
test_update 0.2595ms 20.5735μs 48.6061 KOps/s 44.2350 KOps/s $\textbf{\color{#35bf28}+9.88\%}$
test_update_nested 0.1722ms 28.8959μs 34.6070 KOps/s 31.4679 KOps/s $\textbf{\color{#35bf28}+9.98\%}$
test_update__nested 0.1276ms 24.4672μs 40.8711 KOps/s 39.6062 KOps/s $\color{#35bf28}+3.19\%$
test_set_nested 0.1182ms 20.8834μs 47.8850 KOps/s 45.0522 KOps/s $\textbf{\color{#35bf28}+6.29\%}$
test_set_nested_new 0.2578ms 25.9310μs 38.5639 KOps/s 38.0123 KOps/s $\color{#35bf28}+1.45\%$
test_select 0.1494ms 38.5117μs 25.9661 KOps/s 24.1244 KOps/s $\textbf{\color{#35bf28}+7.63\%}$
test_select_nested 0.1420ms 58.8708μs 16.9863 KOps/s 16.7992 KOps/s $\color{#35bf28}+1.11\%$
test_exclude_nested 0.2474ms 0.1174ms 8.5207 KOps/s 8.5057 KOps/s $\color{#35bf28}+0.18\%$
test_empty[True] 1.6896ms 0.3897ms 2.5664 KOps/s 2.5445 KOps/s $\color{#35bf28}+0.86\%$
test_empty[False] 9.6742μs 1.0613μs 942.2302 KOps/s 912.0619 KOps/s $\color{#35bf28}+3.31\%$
test_unbind_speed 0.4367ms 0.2541ms 3.9361 KOps/s 3.9291 KOps/s $\color{#35bf28}+0.18\%$
test_unbind_speed_stack0 0.7718ms 0.2456ms 4.0720 KOps/s 3.9609 KOps/s $\color{#35bf28}+2.81\%$
test_unbind_speed_stack1 0.1453s 0.7074ms 1.4136 KOps/s 1.4253 KOps/s $\color{#d91a1a}-0.82\%$
test_split 1.6139ms 1.4813ms 675.0752 Ops/s 607.2258 Ops/s $\textbf{\color{#35bf28}+11.17\%}$
test_chunk 0.1362s 1.6949ms 590.0201 Ops/s 685.3620 Ops/s $\textbf{\color{#d91a1a}-13.91\%}$
test_creation[device0] 0.2758ms 0.1051ms 9.5182 KOps/s 9.5100 KOps/s $\color{#35bf28}+0.09\%$
test_creation_from_tensor 5.4208ms 85.1248μs 11.7475 KOps/s 12.2639 KOps/s $\color{#d91a1a}-4.21\%$
test_add_one[memmap_tensor0] 0.1235ms 5.4851μs 182.3131 KOps/s 182.7967 KOps/s $\color{#d91a1a}-0.26\%$
test_contiguous[memmap_tensor0] 24.6870μs 0.6546μs 1.5277 MOps/s 1.5761 MOps/s $\color{#d91a1a}-3.07\%$
test_stack[memmap_tensor0] 34.2730μs 3.7330μs 267.8845 KOps/s 285.1781 KOps/s $\textbf{\color{#d91a1a}-6.06\%}$
test_memmaptd_index 1.0888ms 0.2388ms 4.1871 KOps/s 4.2485 KOps/s $\color{#d91a1a}-1.44\%$
test_memmaptd_index_astensor 0.5712ms 0.3006ms 3.3265 KOps/s 3.2159 KOps/s $\color{#35bf28}+3.44\%$
test_memmaptd_index_op 0.8892ms 0.5859ms 1.7069 KOps/s 1.6497 KOps/s $\color{#35bf28}+3.47\%$
test_serialize_model 0.1126s 0.1041s 9.6082 Ops/s 8.1470 Ops/s $\textbf{\color{#35bf28}+17.94\%}$
test_serialize_model_pickle 0.4637s 0.3772s 2.6508 Ops/s 2.5825 Ops/s $\color{#35bf28}+2.65\%$
test_serialize_weights 0.1085s 0.1008s 9.9188 Ops/s 9.9790 Ops/s $\color{#d91a1a}-0.60\%$
test_serialize_weights_returnearly 0.1408s 0.1275s 7.8413 Ops/s 8.1410 Ops/s $\color{#d91a1a}-3.68\%$
test_serialize_weights_pickle 0.7978s 0.4935s 2.0262 Ops/s 1.5012 Ops/s $\textbf{\color{#35bf28}+34.97\%}$
test_serialize_weights_filesystem 98.3749ms 93.6366ms 10.6796 Ops/s 9.3457 Ops/s $\textbf{\color{#35bf28}+14.27\%}$
test_serialize_model_filesystem 0.1040s 95.1582ms 10.5088 Ops/s 10.4778 Ops/s $\color{#35bf28}+0.30\%$
test_reshape_pytree 65.1410μs 20.5472μs 48.6685 KOps/s 47.6433 KOps/s $\color{#35bf28}+2.15\%$
test_reshape_td 82.0460μs 32.2702μs 30.9884 KOps/s 31.2899 KOps/s $\color{#d91a1a}-0.96\%$
test_view_pytree 57.9480μs 20.5526μs 48.6557 KOps/s 48.0023 KOps/s $\color{#35bf28}+1.36\%$
test_view_td 0.1466s 65.3023μs 15.3134 KOps/s 14.3655 KOps/s $\textbf{\color{#35bf28}+6.60\%}$
test_unbind_pytree 61.5640μs 24.1371μs 41.4300 KOps/s 40.4108 KOps/s $\color{#35bf28}+2.52\%$
test_unbind_td 0.1321ms 36.7350μs 27.2220 KOps/s 26.9905 KOps/s $\color{#35bf28}+0.86\%$
test_split_pytree 65.1320μs 23.6147μs 42.3465 KOps/s 42.2052 KOps/s $\color{#35bf28}+0.33\%$
test_split_td 0.1339ms 40.2995μs 24.8142 KOps/s 25.0008 KOps/s $\color{#d91a1a}-0.75\%$
test_add_pytree 75.5010μs 29.5999μs 33.7839 KOps/s 33.1602 KOps/s $\color{#35bf28}+1.88\%$
test_add_td 0.1188ms 52.4343μs 19.0715 KOps/s 18.0950 KOps/s $\textbf{\color{#35bf28}+5.40\%}$
test_distributed 0.2106ms 0.1007ms 9.9264 KOps/s 9.7676 KOps/s $\color{#35bf28}+1.63\%$
test_tdmodule 50.4750μs 16.5861μs 60.2916 KOps/s 54.4310 KOps/s $\textbf{\color{#35bf28}+10.77\%}$
test_tdmodule_dispatch 56.4850μs 32.6062μs 30.6690 KOps/s 28.0408 KOps/s $\textbf{\color{#35bf28}+9.37\%}$
test_tdseq 57.6780μs 19.0298μs 52.5492 KOps/s 47.4031 KOps/s $\textbf{\color{#35bf28}+10.86\%}$
test_tdseq_dispatch 77.9860μs 37.7004μs 26.5249 KOps/s 23.8289 KOps/s $\textbf{\color{#35bf28}+11.31\%}$
test_instantiation_functorch 1.6034ms 1.3093ms 763.7439 Ops/s 756.7543 Ops/s $\color{#35bf28}+0.92\%$
test_instantiation_td 2.1193ms 1.0168ms 983.4473 Ops/s 982.6445 Ops/s $\color{#35bf28}+0.08\%$
test_exec_functorch 0.2365ms 0.1583ms 6.3190 KOps/s 6.2718 KOps/s $\color{#35bf28}+0.75\%$
test_exec_functional_call 0.2950ms 0.1491ms 6.7073 KOps/s 6.7992 KOps/s $\color{#d91a1a}-1.35\%$
test_exec_td 0.2705ms 0.1443ms 6.9283 KOps/s 6.7989 KOps/s $\color{#35bf28}+1.90\%$
test_exec_td_decorator 0.8103ms 0.1966ms 5.0872 KOps/s 5.0303 KOps/s $\color{#35bf28}+1.13\%$
test_vmap_mlp_speed[True-True] 0.6750ms 0.4765ms 2.0988 KOps/s 2.0752 KOps/s $\color{#35bf28}+1.14\%$
test_vmap_mlp_speed[True-False] 0.8067ms 0.4700ms 2.1275 KOps/s 2.0776 KOps/s $\color{#35bf28}+2.40\%$
test_vmap_mlp_speed[False-True] 0.6129ms 0.3870ms 2.5839 KOps/s 2.6002 KOps/s $\color{#d91a1a}-0.63\%$
test_vmap_mlp_speed[False-False] 0.6217ms 0.3876ms 2.5799 KOps/s 2.5950 KOps/s $\color{#d91a1a}-0.58\%$
test_vmap_mlp_speed_decorator[True-True] 0.7565ms 0.4946ms 2.0219 KOps/s 1.9500 KOps/s $\color{#35bf28}+3.69\%$
test_vmap_mlp_speed_decorator[True-False] 0.7028ms 0.4919ms 2.0328 KOps/s 2.0096 KOps/s $\color{#35bf28}+1.16\%$
test_vmap_mlp_speed_decorator[False-True] 0.6243ms 0.4037ms 2.4772 KOps/s 2.4864 KOps/s $\color{#d91a1a}-0.37\%$
test_vmap_mlp_speed_decorator[False-False] 0.5913ms 0.4019ms 2.4879 KOps/s 2.4838 KOps/s $\color{#35bf28}+0.17\%$
test_to_module_speed[True] 1.5095ms 1.3870ms 720.9938 Ops/s 593.3002 Ops/s $\textbf{\color{#35bf28}+21.52\%}$
test_to_module_speed[False] 1.9385ms 1.3708ms 729.5186 Ops/s 707.4872 Ops/s $\color{#35bf28}+3.11\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants