-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Refactor] Refactor keys, items and values #1058
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Oct 24, 2024
ghstack-source-id: 711131dcaabec7f90bf4b57c1dda38c34ee9e028 Pull Request resolved: #1058
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Oct 24, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 50.6140μs | 25.2355μs | 39.6268 KOps/s | 42.4630 KOps/s | |
test_plain_set_stack_nested | 84.0870μs | 25.3015μs | 39.5233 KOps/s | 41.7482 KOps/s | |
test_plain_set_nested_inplace | 68.4180μs | 27.4524μs | 36.4267 KOps/s | 38.8372 KOps/s | |
test_plain_set_stack_nested_inplace | 80.5300μs | 27.1969μs | 36.7689 KOps/s | 38.6319 KOps/s | |
test_items | 42.1890μs | 4.1586μs | 240.4661 KOps/s | 239.1242 KOps/s | |
test_items_nested | 0.4159ms | 0.3501ms | 2.8562 KOps/s | 2.6325 KOps/s | |
test_items_nested_locked | 0.4815ms | 0.3506ms | 2.8520 KOps/s | 2.6350 KOps/s | |
test_items_nested_leaf | 0.1439ms | 80.3698μs | 12.4425 KOps/s | 12.5837 KOps/s | |
test_items_stack_nested | 0.4355ms | 0.3569ms | 2.8017 KOps/s | 2.6287 KOps/s | |
test_items_stack_nested_leaf | 0.1421ms | 84.2466μs | 11.8699 KOps/s | 12.1382 KOps/s | |
test_items_stack_nested_locked | 0.5810ms | 0.3544ms | 2.8218 KOps/s | 2.6119 KOps/s | |
test_keys | 22.4220μs | 3.5760μs | 279.6449 KOps/s | 283.7716 KOps/s | |
test_keys_nested | 0.3252ms | 0.1861ms | 5.3748 KOps/s | 7.3362 KOps/s | |
test_keys_nested_locked | 0.7368ms | 0.1910ms | 5.2365 KOps/s | 7.0125 KOps/s | |
test_keys_nested_leaf | 0.2762ms | 0.1641ms | 6.0946 KOps/s | 8.3710 KOps/s | |
test_keys_stack_nested | 0.3402ms | 0.1906ms | 5.2458 KOps/s | 7.2687 KOps/s | |
test_keys_stack_nested_leaf | 0.3260ms | 0.1664ms | 6.0095 KOps/s | 8.4863 KOps/s | |
test_keys_stack_nested_locked | 0.3317ms | 0.1910ms | 5.2360 KOps/s | 7.0537 KOps/s | |
test_values | 9.0128μs | 1.0771μs | 928.4087 KOps/s | 880.1979 KOps/s | |
test_values_nested | 0.1308ms | 69.4073μs | 14.4077 KOps/s | 10.5993 KOps/s | |
test_values_nested_locked | 0.1267ms | 69.2069μs | 14.4494 KOps/s | 10.6530 KOps/s | |
test_values_nested_leaf | 0.1448ms | 82.5027μs | 12.1208 KOps/s | 11.6920 KOps/s | |
test_values_stack_nested | 0.1289ms | 70.3989μs | 14.2048 KOps/s | 10.5199 KOps/s | |
test_values_stack_nested_leaf | 0.1480ms | 83.3048μs | 12.0041 KOps/s | 12.5909 KOps/s | |
test_values_stack_nested_locked | 0.1222ms | 70.2204μs | 14.2409 KOps/s | 10.5493 KOps/s | |
test_membership | 27.5220μs | 0.9737μs | 1.0271 MOps/s | 1.3770 MOps/s | |
test_membership_nested | 17.3620μs | 2.8669μs | 348.8093 KOps/s | 366.9254 KOps/s | |
test_membership_nested_leaf | 44.7730μs | 2.8552μs | 350.2351 KOps/s | 365.2736 KOps/s | |
test_membership_stacked_nested | 35.1150μs | 2.8586μs | 349.8271 KOps/s | 365.8265 KOps/s | |
test_membership_stacked_nested_leaf | 47.3980μs | 2.8486μs | 351.0527 KOps/s | 361.7046 KOps/s | |
test_membership_nested_last | 18.4340μs | 4.3530μs | 229.7267 KOps/s | 239.4020 KOps/s | |
test_membership_nested_leaf_last | 57.2060μs | 4.3203μs | 231.4663 KOps/s | 238.6918 KOps/s | |
test_membership_stacked_nested_last | 23.1430μs | 5.6205μs | 177.9198 KOps/s | 204.9089 KOps/s | |
test_membership_stacked_nested_leaf_last | 45.2470μs | 5.6068μs | 178.3562 KOps/s | 205.2691 KOps/s | |
test_nested_getleaf | 45.6350μs | 10.5695μs | 94.6120 KOps/s | 93.3648 KOps/s | |
test_nested_get | 56.9560μs | 9.8998μs | 101.0118 KOps/s | 97.9389 KOps/s | |
test_stacked_getleaf | 33.9840μs | 10.6584μs | 93.8227 KOps/s | 92.7969 KOps/s | |
test_stacked_get | 49.6820μs | 10.0439μs | 99.5628 KOps/s | 97.2092 KOps/s | |
test_nested_getitemleaf | 54.2740μs | 10.7503μs | 93.0205 KOps/s | 89.6940 KOps/s | |
test_nested_getitem | 58.8990μs | 10.4923μs | 95.3080 KOps/s | 96.3241 KOps/s | |
test_stacked_getitemleaf | 49.8230μs | 11.0909μs | 90.1640 KOps/s | 89.5494 KOps/s | |
test_stacked_getitem | 55.9240μs | 10.2844μs | 97.2342 KOps/s | 96.3184 KOps/s | |
test_lock_nested | 0.9869ms | 0.5003ms | 1.9987 KOps/s | 1.9130 KOps/s | |
test_lock_stack_nested | 0.6793ms | 0.4674ms | 2.1396 KOps/s | 2.0295 KOps/s | |
test_unlock_nested | 0.7764ms | 0.4154ms | 2.4072 KOps/s | 2.2513 KOps/s | |
test_unlock_stack_nested | 0.4873ms | 0.3844ms | 2.6015 KOps/s | 2.4457 KOps/s | |
test_flatten_speed | 0.1579ms | 0.1013ms | 9.8723 KOps/s | 9.9826 KOps/s | |
test_unflatten_speed | 0.6120ms | 0.5136ms | 1.9469 KOps/s | 1.9516 KOps/s | |
test_common_ops | 2.2057ms | 1.1725ms | 852.9082 Ops/s | 885.1013 Ops/s | |
test_creation | 18.8150μs | 2.1327μs | 468.8995 KOps/s | 473.8742 KOps/s | |
test_creation_empty | 59.6110μs | 19.0178μs | 52.5824 KOps/s | 57.7068 KOps/s | |
test_creation_nested_1 | 86.7610μs | 22.1929μs | 45.0595 KOps/s | 47.7376 KOps/s | |
test_creation_nested_2 | 70.7810μs | 26.7852μs | 37.3340 KOps/s | 40.7247 KOps/s | |
test_clone | 0.1064ms | 17.0011μs | 58.8197 KOps/s | 56.7568 KOps/s | |
test_getitem[int] | 1.0601ms | 16.9183μs | 59.1075 KOps/s | 58.9736 KOps/s | |
test_getitem[slice_int] | 0.1335ms | 30.8463μs | 32.4188 KOps/s | 32.6084 KOps/s | |
test_getitem[range] | 0.1922ms | 58.7845μs | 17.0113 KOps/s | 17.3135 KOps/s | |
test_getitem[tuple] | 0.1320ms | 25.2465μs | 39.6094 KOps/s | 39.5895 KOps/s | |
test_getitem[list] | 0.1969ms | 54.2191μs | 18.4437 KOps/s | 18.9654 KOps/s | |
test_setitem_dim[int] | 69.3490μs | 32.5279μs | 30.7428 KOps/s | 30.5752 KOps/s | |
test_setitem_dim[slice_int] | 0.1038ms | 60.0836μs | 16.6435 KOps/s | 16.0873 KOps/s | |
test_setitem_dim[range] | 0.1303ms | 83.8039μs | 11.9326 KOps/s | 11.5479 KOps/s | |
test_setitem_dim[tuple] | 78.1960μs | 47.9722μs | 20.8454 KOps/s | 20.1398 KOps/s | |
test_setitem | 0.2643ms | 30.0979μs | 33.2249 KOps/s | 32.7870 KOps/s | |
test_set | 69.2690μs | 30.3540μs | 32.9446 KOps/s | 33.3817 KOps/s | |
test_set_shared | 1.3114ms | 0.2181ms | 4.5845 KOps/s | 4.4858 KOps/s | |
test_update | 0.9611ms | 39.1041μs | 25.5728 KOps/s | 26.6470 KOps/s | |
test_update_nested | 0.2982ms | 50.2487μs | 19.9010 KOps/s | 20.5445 KOps/s | |
test_update__nested | 0.3132ms | 43.7414μs | 22.8616 KOps/s | 21.5653 KOps/s | |
test_set_nested | 0.2814ms | 32.4485μs | 30.8181 KOps/s | 30.9389 KOps/s | |
test_set_nested_new | 0.2630ms | 37.7878μs | 26.4636 KOps/s | 25.9842 KOps/s | |
test_select | 0.5070ms | 56.4026μs | 17.7297 KOps/s | 17.7201 KOps/s | |
test_select_nested | 0.1140ms | 59.5319μs | 16.7977 KOps/s | 16.6531 KOps/s | |
test_exclude_nested | 0.1594ms | 74.8550μs | 13.3592 KOps/s | 13.3367 KOps/s | |
test_empty[True] | 0.5379ms | 0.3985ms | 2.5093 KOps/s | 2.8568 KOps/s | |
test_empty[False] | 8.4105μs | 1.2388μs | 807.2210 KOps/s | 833.8254 KOps/s | |
test_unbind_speed | 0.4056ms | 0.3032ms | 3.2982 KOps/s | 3.1448 KOps/s | |
test_unbind_speed_stack0 | 0.5921ms | 0.2980ms | 3.3562 KOps/s | 3.1765 KOps/s | |
test_unbind_speed_stack1 | 0.1087s | 0.8209ms | 1.2181 KOps/s | 1.2600 KOps/s | |
test_split | 5.5630ms | 2.0044ms | 498.8959 Ops/s | 445.3620 Ops/s | |
test_chunk | 0.1035s | 2.2040ms | 453.7275 Ops/s | 447.1676 Ops/s | |
test_creation[device0] | 3.6932ms | 0.1182ms | 8.4601 KOps/s | 8.4134 KOps/s | |
test_creation_from_tensor | 0.2627ms | 0.1160ms | 8.6195 KOps/s | 8.2544 KOps/s | |
test_add_one[memmap_tensor0] | 92.3520μs | 7.0900μs | 141.0439 KOps/s | 128.4856 KOps/s | |
test_contiguous[memmap_tensor0] | 15.5080μs | 2.0014μs | 499.6484 KOps/s | 516.6286 KOps/s | |
test_stack[memmap_tensor0] | 0.1489ms | 5.7056μs | 175.2649 KOps/s | 165.3732 KOps/s | |
test_memmaptd_index | 1.1694ms | 0.4130ms | 2.4211 KOps/s | 2.3284 KOps/s | |
test_memmaptd_index_astensor | 1.0859ms | 0.5159ms | 1.9382 KOps/s | 1.8698 KOps/s | |
test_memmaptd_index_op | 2.1709ms | 1.0691ms | 935.3615 Ops/s | 906.8647 Ops/s | |
test_serialize_model | 0.2238s | 0.1349s | 7.4134 Ops/s | 8.3265 Ops/s | |
test_serialize_model_pickle | 0.4819s | 0.3976s | 2.5152 Ops/s | 2.5551 Ops/s | |
test_serialize_weights | 0.1238s | 0.1165s | 8.5824 Ops/s | 8.4048 Ops/s | |
test_serialize_weights_returnearly | 0.1696s | 0.1598s | 6.2560 Ops/s | 5.5840 Ops/s | |
test_serialize_weights_pickle | 0.5420s | 0.4167s | 2.3999 Ops/s | 2.4791 Ops/s | |
test_serialize_weights_filesystem | 0.2438s | 0.1588s | 6.2955 Ops/s | 6.7991 Ops/s | |
test_serialize_model_filesystem | 0.1594s | 0.1543s | 6.4807 Ops/s | 6.4042 Ops/s | |
test_reshape_pytree | 76.9630μs | 38.5160μs | 25.9632 KOps/s | 24.6231 KOps/s | |
test_reshape_td | 0.1020ms | 46.3052μs | 21.5958 KOps/s | 20.6703 KOps/s | |
test_view_pytree | 80.9100μs | 38.4212μs | 26.0273 KOps/s | 24.8140 KOps/s | |
test_view_td | 0.1194ms | 51.2055μs | 19.5291 KOps/s | 18.8974 KOps/s | |
test_unbind_pytree | 82.3330μs | 35.2233μs | 28.3903 KOps/s | 26.8375 KOps/s | |
test_unbind_td | 0.3164ms | 45.0233μs | 22.2107 KOps/s | 20.9446 KOps/s | |
test_split_pytree | 83.6060μs | 37.4692μs | 26.6886 KOps/s | 25.3287 KOps/s | |
test_split_td | 0.4571ms | 58.3145μs | 17.1484 KOps/s | 17.1364 KOps/s | |
test_add_pytree | 0.1158ms | 44.0222μs | 22.7158 KOps/s | 20.9490 KOps/s | |
test_add_td | 0.2386ms | 85.0853μs | 11.7529 KOps/s | 11.1586 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1353ms | 72.7989μs | 13.7365 KOps/s | 13.2656 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4388ms | 0.2046ms | 4.8865 KOps/s | 4.7424 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1281ms | 54.3208μs | 18.4091 KOps/s | 18.0260 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2422ms | 0.1462ms | 6.8404 KOps/s | 6.5612 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 90.2180μs | 27.9837μs | 35.7351 KOps/s | 34.8110 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1701ms | 76.4116μs | 13.0870 KOps/s | 12.7748 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1500ms | 76.9300μs | 12.9988 KOps/s | 12.7295 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1316ms | 66.8850μs | 14.9510 KOps/s | 14.4107 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1919ms | 0.1223ms | 8.1772 KOps/s | 7.9732 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3491ms | 0.2521ms | 3.9660 KOps/s | 3.8754 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1123ms | 55.6624μs | 17.9655 KOps/s | 18.4201 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1543ms | 76.0017μs | 13.1576 KOps/s | 12.0549 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2045ms | 0.1155ms | 8.6565 KOps/s | 8.6179 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5519ms | 0.2996ms | 3.3374 KOps/s | 3.2120 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.5024ms | 0.2775ms | 3.6030 KOps/s | 3.4711 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2553ms | 0.1285ms | 7.7846 KOps/s | 8.1275 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1374ms | 71.3587μs | 14.0137 KOps/s | 13.0032 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1423ms | 55.7972μs | 17.9220 KOps/s | 18.1889 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3391ms | 0.2431ms | 4.1130 KOps/s | 3.9811 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2079ms | 0.1136ms | 8.8061 KOps/s | 8.8026 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 81.0610μs | 30.0482μs | 33.2798 KOps/s | 33.1477 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.2992ms | 78.2319μs | 12.7825 KOps/s | 11.4184 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1557ms | 80.4580μs | 12.4288 KOps/s | 12.0015 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1342ms | 68.6665μs | 14.5631 KOps/s | 14.2499 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3096ms | 0.2202ms | 4.5414 KOps/s | 4.5389 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.8238ms | 1.8169ms | 550.3838 Ops/s | 532.8897 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3557ms | 0.2133ms | 4.6892 KOps/s | 4.6253 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.5542ms | 1.1544ms | 866.2319 Ops/s | 808.6920 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.6267ms | 0.4700ms | 2.1277 KOps/s | 2.0885 KOps/s | |
test_compile_assign_and_add_stack[eager] | 6.8178ms | 4.2932ms | 232.9285 Ops/s | 236.0931 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1289ms | 44.0775μs | 22.6873 KOps/s | 22.9441 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5796ms | 50.8198μs | 19.6774 KOps/s | 19.5029 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1076ms | 38.7791μs | 25.7871 KOps/s | 26.1213 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1137ms | 29.7821μs | 33.5773 KOps/s | 33.1468 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1199ms | 39.7683μs | 25.1456 KOps/s | 25.6114 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 98.5430μs | 29.5310μs | 33.8627 KOps/s | 33.4610 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1552ms | 79.8629μs | 12.5215 KOps/s | 13.1435 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5829ms | 29.3526μs | 34.0685 KOps/s | 32.7715 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1579ms | 71.8692μs | 13.9142 KOps/s | 14.0436 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1100ms | 23.6101μs | 42.3548 KOps/s | 39.8520 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1642ms | 72.9569μs | 13.7067 KOps/s | 13.8844 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1109ms | 23.4240μs | 42.6913 KOps/s | 39.8824 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1904ms | 80.9603μs | 12.3517 KOps/s | 12.5722 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8544ms | 28.7614μs | 34.7689 KOps/s | 33.6785 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1438ms | 72.3703μs | 13.8178 KOps/s | 13.9124 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 92.5210μs | 23.4801μs | 42.5893 KOps/s | 40.5555 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1413ms | 72.4435μs | 13.8039 KOps/s | 14.0302 KOps/s | |
test_compile_indexing[int-pytree-eager] | 75.5600μs | 23.3749μs | 42.7809 KOps/s | 39.3881 KOps/s | |
test_mod_add[eager] | 82.9240μs | 25.7749μs | 38.7975 KOps/s | 38.3216 KOps/s | |
test_mod_add[compile] | 0.1051ms | 44.5981μs | 22.4225 KOps/s | 20.9645 KOps/s | |
test_mod_add[compile-overhead] | 0.1164ms | 45.1925μs | 22.1275 KOps/s | 22.2923 KOps/s | |
test_mod_wrap[eager] | 0.3738ms | 0.2130ms | 4.6959 KOps/s | 4.6689 KOps/s | |
test_mod_wrap[compile] | 1.7776ms | 0.2060ms | 4.8534 KOps/s | 4.8975 KOps/s | |
test_mod_wrap[compile-overhead] | 4.9491ms | 0.2043ms | 4.8957 KOps/s | 4.9828 KOps/s | |
test_mod_wrap_and_backward[eager] | 15.7511ms | 11.6089ms | 86.1406 Ops/s | 90.0161 Ops/s | |
test_mod_wrap_and_backward[compile] | 16.2223ms | 12.6634ms | 78.9675 Ops/s | 90.7567 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 14.5455ms | 13.4094ms | 74.5744 Ops/s | 91.9516 Ops/s | |
test_seq_add[eager] | 0.1812ms | 92.3275μs | 10.8310 KOps/s | 10.9818 KOps/s | |
test_seq_add[compile] | 0.1349ms | 59.6797μs | 16.7561 KOps/s | 16.8775 KOps/s | |
test_seq_add[compile-overhead] | 0.1195ms | 58.2141μs | 17.1780 KOps/s | 17.0021 KOps/s | |
test_seq_wrap[eager] | 0.5506ms | 0.3846ms | 2.6002 KOps/s | 2.6242 KOps/s | |
test_seq_wrap[compile] | 0.4002ms | 0.2287ms | 4.3722 KOps/s | 4.3810 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4349ms | 0.2259ms | 4.4266 KOps/s | 4.5143 KOps/s | |
test_func_call_runtime[False-eager] | 0.7294ms | 0.5496ms | 1.8195 KOps/s | 1.8287 KOps/s | |
test_func_call_runtime[False-compile] | 0.6852ms | 0.4307ms | 2.3219 KOps/s | 2.3107 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7826ms | 0.4329ms | 2.3101 KOps/s | 2.3150 KOps/s | |
test_func_call_runtime[True-eager] | 0.9781ms | 0.7651ms | 1.3070 KOps/s | 1.3105 KOps/s | |
test_func_call_runtime[True-compile] | 0.5939ms | 0.4690ms | 2.1320 KOps/s | 2.0572 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6280ms | 0.4686ms | 2.1339 KOps/s | 2.1257 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9451ms | 0.5483ms | 1.8239 KOps/s | 1.8293 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5383ms | 0.4284ms | 2.3343 KOps/s | 2.3275 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5425ms | 0.4292ms | 2.3299 KOps/s | 2.3288 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1421ms | 0.9147ms | 1.0932 KOps/s | 1.0697 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9045ms | 0.5006ms | 1.9975 KOps/s | 2.0098 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6743ms | 0.4987ms | 2.0053 KOps/s | 1.9843 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6731ms | 1.9431ms | 514.6536 Ops/s | 512.7626 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.6663ms | 0.5199ms | 1.9233 KOps/s | 1.8824 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.9079ms | 0.5269ms | 1.8978 KOps/s | 1.8842 KOps/s | |
test_distributed | 0.4238ms | 0.1298ms | 7.7036 KOps/s | 7.5105 KOps/s | |
test_tdmodule | 39.6940μs | 19.2837μs | 51.8574 KOps/s | 56.0926 KOps/s | |
test_tdmodule_dispatch | 67.7550μs | 37.0234μs | 27.0099 KOps/s | 29.0831 KOps/s | |
test_tdseq | 38.2110μs | 21.7461μs | 45.9852 KOps/s | 49.8340 KOps/s | |
test_tdseq_dispatch | 85.7090μs | 43.8344μs | 22.8131 KOps/s | 25.7099 KOps/s | |
test_instantiation_functorch | 2.6112ms | 1.5692ms | 637.2621 Ops/s | 644.8378 Ops/s | |
test_exec_functorch | 0.3307ms | 0.1801ms | 5.5524 KOps/s | 5.4630 KOps/s | |
test_exec_functional_call | 0.3436ms | 0.1720ms | 5.8124 KOps/s | 5.8382 KOps/s | |
test_exec_td_decorator | 0.5866ms | 0.2430ms | 4.1154 KOps/s | 4.1580 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9331ms | 0.6548ms | 1.5272 KOps/s | 1.5158 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0065ms | 0.6545ms | 1.5279 KOps/s | 1.5245 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7049ms | 0.5352ms | 1.8684 KOps/s | 1.8595 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8414ms | 0.5335ms | 1.8743 KOps/s | 1.8455 KOps/s | |
test_to_module_speed[True] | 2.0043ms | 1.4124ms | 708.0030 Ops/s | 719.6505 Ops/s | |
test_to_module_speed[False] | 1.9623ms | 1.3694ms | 730.2264 Ops/s | 722.9008 Ops/s | |
test_tc_init | 91.1790μs | 47.7072μs | 20.9612 KOps/s | 21.7903 KOps/s | |
test_tc_init_nested | 0.2346ms | 93.7980μs | 10.6612 KOps/s | 11.0208 KOps/s | |
test_tc_first_layer_tensor | 26.3180μs | 1.5330μs | 652.2969 KOps/s | 673.6307 KOps/s | |
test_tc_first_layer_nontensor | 39.0830μs | 4.7521μs | 210.4334 KOps/s | 208.9891 KOps/s | |
test_tc_second_layer_tensor | 46.3570μs | 2.8292μs | 353.4629 KOps/s | 367.3525 KOps/s | |
test_tc_second_layer_nontensor | 44.3720μs | 6.1124μs | 163.6020 KOps/s | 161.8562 KOps/s | |
test_unbind | 0.2198s | 12.6082ms | 79.3132 Ops/s | 78.0571 Ops/s | |
test_full_like | 9.1414ms | 7.7528ms | 128.9856 Ops/s | 128.6230 Ops/s | |
test_zeros_like | 3.6398ms | 2.8686ms | 348.5985 Ops/s | 331.0730 Ops/s | |
test_ones_like | 4.1220ms | 3.3975ms | 294.3313 Ops/s | 288.0322 Ops/s | |
test_clone | 5.7019ms | 5.1441ms | 194.3965 Ops/s | 194.0650 Ops/s | |
test_squeeze | 58.4690μs | 11.8879μs | 84.1193 KOps/s | 78.2151 KOps/s | |
test_unsqueeze | 0.2978ms | 89.3929μs | 11.1866 KOps/s | 10.6339 KOps/s | |
test_split | 0.3344ms | 0.1845ms | 5.4196 KOps/s | 5.1839 KOps/s | |
test_permute | 0.3596ms | 0.2150ms | 4.6516 KOps/s | 4.5070 KOps/s | |
test_stack | 31.6649ms | 26.4532ms | 37.8026 Ops/s | 40.8091 Ops/s | |
test_cat | 32.5064ms | 26.2408ms | 38.1086 Ops/s | 40.5075 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.3800μs | 15.8412μs | 63.1266 KOps/s | 56.2247 KOps/s | |
test_plain_set_stack_nested | 45.5700μs | 15.9522μs | 62.6873 KOps/s | 55.3444 KOps/s | |
test_plain_set_nested_inplace | 47.3900μs | 17.2179μs | 58.0790 KOps/s | 51.3140 KOps/s | |
test_plain_set_stack_nested_inplace | 0.1229ms | 17.0804μs | 58.5467 KOps/s | 51.7573 KOps/s | |
test_items | 45.5300μs | 2.8853μs | 346.5837 KOps/s | 341.3629 KOps/s | |
test_items_nested | 0.3645ms | 0.3267ms | 3.0611 KOps/s | 2.9479 KOps/s | |
test_items_nested_locked | 0.3584ms | 0.3292ms | 3.0376 KOps/s | 2.9294 KOps/s | |
test_items_nested_leaf | 95.3010μs | 63.8048μs | 15.6728 KOps/s | 15.7055 KOps/s | |
test_items_stack_nested | 0.3681ms | 0.3299ms | 3.0310 KOps/s | 2.9300 KOps/s | |
test_items_stack_nested_leaf | 86.3210μs | 64.3794μs | 15.5329 KOps/s | 15.4174 KOps/s | |
test_items_stack_nested_locked | 0.3773ms | 0.3309ms | 3.0221 KOps/s | 2.9124 KOps/s | |
test_keys | 31.5000μs | 3.7821μs | 264.4052 KOps/s | 291.1214 KOps/s | |
test_keys_nested | 0.1222ms | 94.3423μs | 10.5997 KOps/s | 14.2183 KOps/s | |
test_keys_nested_locked | 2.3715ms | 0.1020ms | 9.8018 KOps/s | 13.1070 KOps/s | |
test_keys_nested_leaf | 0.1156ms | 84.4692μs | 11.8386 KOps/s | 16.4330 KOps/s | |
test_keys_stack_nested | 0.1247ms | 94.4760μs | 10.5847 KOps/s | 13.9780 KOps/s | |
test_keys_stack_nested_leaf | 0.1291ms | 85.4739μs | 11.6995 KOps/s | 15.8496 KOps/s | |
test_keys_stack_nested_locked | 0.1537ms | 0.1009ms | 9.9083 KOps/s | 12.8891 KOps/s | |
test_values | 5.0800μs | 0.8497μs | 1.1769 MOps/s | 1.1742 MOps/s | |
test_values_nested | 66.9810μs | 37.7293μs | 26.5046 KOps/s | 20.1645 KOps/s | |
test_values_nested_locked | 74.2910μs | 39.3103μs | 25.4386 KOps/s | 19.2210 KOps/s | |
test_values_nested_leaf | 94.8210μs | 45.2557μs | 22.0967 KOps/s | 22.9715 KOps/s | |
test_values_stack_nested | 0.1119ms | 37.5899μs | 26.6029 KOps/s | 19.7023 KOps/s | |
test_values_stack_nested_leaf | 68.4410μs | 45.9156μs | 21.7791 KOps/s | 22.2602 KOps/s | |
test_values_stack_nested_locked | 74.4200μs | 39.5417μs | 25.2898 KOps/s | 18.8876 KOps/s | |
test_membership | 1.6636μs | 0.5086μs | 1.9663 MOps/s | 1.9617 MOps/s | |
test_membership_nested | 16.4050μs | 1.8627μs | 536.8435 KOps/s | 530.6628 KOps/s | |
test_membership_nested_leaf | 12.8600μs | 1.8791μs | 532.1795 KOps/s | 528.2532 KOps/s | |
test_membership_stacked_nested | 33.5510μs | 1.9434μs | 514.5566 KOps/s | 503.3658 KOps/s | |
test_membership_stacked_nested_leaf | 63.2210μs | 1.9669μs | 508.4051 KOps/s | 508.3658 KOps/s | |
test_membership_nested_last | 35.6410μs | 2.9941μs | 333.9933 KOps/s | 335.7015 KOps/s | |
test_membership_nested_leaf_last | 34.5000μs | 3.0161μs | 331.5521 KOps/s | 331.6177 KOps/s | |
test_membership_stacked_nested_last | 33.4600μs | 2.9764μs | 335.9722 KOps/s | 330.9265 KOps/s | |
test_membership_stacked_nested_leaf_last | 36.5900μs | 3.0104μs | 332.1845 KOps/s | 329.0447 KOps/s | |
test_nested_getleaf | 27.9610μs | 6.0352μs | 165.6959 KOps/s | 164.6236 KOps/s | |
test_nested_get | 34.3500μs | 5.7247μs | 174.6823 KOps/s | 174.8378 KOps/s | |
test_stacked_getleaf | 30.4900μs | 6.0077μs | 166.4518 KOps/s | 165.3565 KOps/s | |
test_stacked_get | 30.9910μs | 5.7209μs | 174.7983 KOps/s | 175.3693 KOps/s | |
test_nested_getitemleaf | 35.9610μs | 6.1221μs | 163.3436 KOps/s | 163.4131 KOps/s | |
test_nested_getitem | 65.9610μs | 5.7881μs | 172.7689 KOps/s | 172.2301 KOps/s | |
test_stacked_getitemleaf | 35.2500μs | 6.0702μs | 164.7395 KOps/s | 162.7088 KOps/s | |
test_stacked_getitem | 28.2100μs | 5.7655μs | 173.4460 KOps/s | 172.2660 KOps/s | |
test_lock_nested | 4.8086ms | 0.4269ms | 2.3423 KOps/s | 2.3608 KOps/s | |
test_lock_stack_nested | 0.4393ms | 0.3893ms | 2.5689 KOps/s | 2.5528 KOps/s | |
test_unlock_nested | 0.7434ms | 0.3618ms | 2.7637 KOps/s | 2.7608 KOps/s | |
test_unlock_stack_nested | 0.3675ms | 0.3274ms | 3.0540 KOps/s | 3.0336 KOps/s | |
test_flatten_speed | 0.1100ms | 76.7719μs | 13.0256 KOps/s | 12.8587 KOps/s | |
test_unflatten_speed | 0.3573ms | 0.3218ms | 3.1071 KOps/s | 3.1176 KOps/s | |
test_common_ops | 1.4028ms | 1.1956ms | 836.3856 Ops/s | 778.7611 Ops/s | |
test_creation | 30.5300μs | 1.4880μs | 672.0347 KOps/s | 663.2351 KOps/s | |
test_creation_empty | 46.5000μs | 13.7958μs | 72.4861 KOps/s | 57.1571 KOps/s | |
test_creation_nested_1 | 43.3810μs | 15.3119μs | 65.3088 KOps/s | 50.9583 KOps/s | |
test_creation_nested_2 | 47.0410μs | 17.9898μs | 55.5870 KOps/s | 45.1823 KOps/s | |
test_clone | 68.3910μs | 28.1206μs | 35.5612 KOps/s | 35.1191 KOps/s | |
test_getitem[int] | 1.1387ms | 15.7757μs | 63.3886 KOps/s | 62.9054 KOps/s | |
test_getitem[slice_int] | 0.1233ms | 27.9399μs | 35.7910 KOps/s | 35.1601 KOps/s | |
test_getitem[range] | 0.2256ms | 0.1153ms | 8.6762 KOps/s | 8.7593 KOps/s | |
test_getitem[tuple] | 0.1195ms | 24.3737μs | 41.0279 KOps/s | 40.6938 KOps/s | |
test_getitem[list] | 0.2144ms | 0.1077ms | 9.2863 KOps/s | 9.7813 KOps/s | |
test_setitem_dim[int] | 71.5500μs | 45.5827μs | 21.9382 KOps/s | 23.0598 KOps/s | |
test_setitem_dim[slice_int] | 90.5410μs | 65.0907μs | 15.3632 KOps/s | 15.2683 KOps/s | |
test_setitem_dim[range] | 0.1702ms | 0.1349ms | 7.4136 KOps/s | 7.8811 KOps/s | |
test_setitem_dim[tuple] | 88.5210μs | 59.7284μs | 16.7425 KOps/s | 15.8681 KOps/s | |
test_setitem | 78.0210μs | 39.1274μs | 25.5575 KOps/s | 22.0966 KOps/s | |
test_set | 73.8610μs | 38.4184μs | 26.0292 KOps/s | 22.6959 KOps/s | |
test_set_shared | 0.3485ms | 53.2756μs | 18.7703 KOps/s | 18.6485 KOps/s | |
test_update | 88.2510μs | 47.7820μs | 20.9284 KOps/s | 19.1656 KOps/s | |
test_update_nested | 96.4410μs | 54.9531μs | 18.1973 KOps/s | 16.9349 KOps/s | |
test_update__nested | 0.4233ms | 66.6664μs | 15.0001 KOps/s | 14.6238 KOps/s | |
test_set_nested | 88.3910μs | 41.7290μs | 23.9641 KOps/s | 22.3697 KOps/s | |
test_set_nested_new | 85.5410μs | 45.2520μs | 22.0985 KOps/s | 21.2061 KOps/s | |
test_select | 0.1002ms | 60.5434μs | 16.5171 KOps/s | 16.1229 KOps/s | |
test_select_nested | 74.0310μs | 41.8759μs | 23.8801 KOps/s | 24.0499 KOps/s | |
test_exclude_nested | 97.7410μs | 60.0596μs | 16.6501 KOps/s | 16.7375 KOps/s | |
test_empty[True] | 0.3123ms | 0.2803ms | 3.5678 KOps/s | 3.9095 KOps/s | |
test_empty[False] | 3.1371μs | 0.7468μs | 1.3390 MOps/s | 1.3450 MOps/s | |
test_to | 63.8410μs | 26.3647μs | 37.9295 KOps/s | 38.4511 KOps/s | |
test_to_nonblocking | 64.2300μs | 25.2950μs | 39.5335 KOps/s | 38.9749 KOps/s | |
test_unbind_speed | 0.3056ms | 0.2704ms | 3.6984 KOps/s | 3.6654 KOps/s | |
test_unbind_speed_stack0 | 0.3743ms | 0.2664ms | 3.7542 KOps/s | 3.6461 KOps/s | |
test_unbind_speed_stack1 | 93.3544ms | 0.6938ms | 1.4414 KOps/s | 1.4096 KOps/s | |
test_split | 96.5381ms | 2.1592ms | 463.1292 Ops/s | 459.6567 Ops/s | |
test_chunk | 96.7068ms | 2.1659ms | 461.7111 Ops/s | 457.1314 Ops/s | |
test_to[False] | 3.4305ms | 3.3185ms | 301.3410 Ops/s | 300.0900 Ops/s | |
test_to[True] | 4.6846ms | 4.3707ms | 228.7942 Ops/s | 228.9494 Ops/s | |
test_to_njt[False] | 0.2296s | 0.2294s | 4.3599 Ops/s | 4.3325 Ops/s | |
test_to_njt[True] | 0.3634s | 0.2785s | 3.5902 Ops/s | 3.5455 Ops/s | |
test_creation[device0] | 0.3475ms | 0.1316ms | 7.5967 KOps/s | 7.5043 KOps/s | |
test_creation_from_tensor | 0.3573ms | 0.1336ms | 7.4835 KOps/s | 7.4288 KOps/s | |
test_add_one[memmap_tensor0] | 0.1463ms | 8.2815μs | 120.7505 KOps/s | 119.1331 KOps/s | |
test_contiguous[memmap_tensor0] | 19.9110μs | 2.1740μs | 459.9862 KOps/s | 453.5284 KOps/s | |
test_stack[memmap_tensor0] | 39.3610μs | 6.5211μs | 153.3484 KOps/s | 151.4501 KOps/s | |
test_memmaptd_index | 1.0182ms | 0.4158ms | 2.4052 KOps/s | 2.3527 KOps/s | |
test_memmaptd_index_astensor | 0.7359ms | 0.4934ms | 2.0267 KOps/s | 2.0198 KOps/s | |
test_memmaptd_index_op | 1.3524ms | 0.9672ms | 1.0339 KOps/s | 972.6144 Ops/s | |
test_serialize_model | 0.1314s | 0.1301s | 7.6865 Ops/s | 7.6582 Ops/s | |
test_serialize_model_pickle | 1.3487s | 1.2187s | 0.8205 Ops/s | 0.8384 Ops/s | |
test_serialize_weights | 0.1324s | 0.1299s | 7.6993 Ops/s | 6.9393 Ops/s | |
test_serialize_weights_returnearly | 53.1764ms | 47.0914ms | 21.2353 Ops/s | 17.7420 Ops/s | |
test_serialize_weights_pickle | 1.3486s | 1.1904s | 0.8401 Ops/s | 0.8375 Ops/s | |
test_reshape_pytree | 71.2810μs | 36.1277μs | 27.6796 KOps/s | 27.7835 KOps/s | |
test_reshape_td | 73.9110μs | 41.1873μs | 24.2793 KOps/s | 24.3827 KOps/s | |
test_view_pytree | 64.4010μs | 35.4155μs | 28.2363 KOps/s | 27.8425 KOps/s | |
test_view_td | 76.9810μs | 46.8101μs | 21.3629 KOps/s | 21.5368 KOps/s | |
test_unbind_pytree | 68.8310μs | 33.6591μs | 29.7097 KOps/s | 29.3953 KOps/s | |
test_unbind_td | 0.4918ms | 42.1332μs | 23.7342 KOps/s | 24.0912 KOps/s | |
test_split_pytree | 87.2710μs | 45.7757μs | 21.8456 KOps/s | 21.6777 KOps/s | |
test_split_td | 0.6778ms | 56.8438μs | 17.5921 KOps/s | 17.5896 KOps/s | |
test_add_pytree | 94.3610μs | 55.8559μs | 17.9032 KOps/s | 17.8764 KOps/s | |
test_add_td | 0.1502ms | 89.5136μs | 11.1715 KOps/s | 10.5785 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2149ms | 0.1613ms | 6.1979 KOps/s | 6.1446 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2841ms | 0.1608ms | 6.2199 KOps/s | 6.2685 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2145ms | 0.1540ms | 6.4937 KOps/s | 6.3054 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2518ms | 0.1803ms | 5.5464 KOps/s | 5.5824 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 79.0900μs | 21.5088μs | 46.4927 KOps/s | 45.9676 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 94.3510μs | 48.2262μs | 20.7356 KOps/s | 21.0837 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1215ms | 66.6405μs | 15.0059 KOps/s | 15.1132 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1104ms | 50.2480μs | 19.9013 KOps/s | 20.0787 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3604ms | 0.3161ms | 3.1638 KOps/s | 3.1638 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3126ms | 0.2371ms | 4.2184 KOps/s | 4.2386 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2119ms | 0.1292ms | 7.7409 KOps/s | 7.7726 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1183ms | 63.9373μs | 15.6403 KOps/s | 15.2167 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3747ms | 0.3227ms | 3.0988 KOps/s | 3.1002 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6977ms | 0.6012ms | 1.6634 KOps/s | 1.5915 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4032ms | 0.2899ms | 3.4499 KOps/s | 3.4939 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3928ms | 0.3192ms | 3.1329 KOps/s | 3.1633 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1668ms | 76.0865μs | 13.1429 KOps/s | 12.3485 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2560ms | 0.1342ms | 7.4533 KOps/s | 7.7347 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5927ms | 0.5166ms | 1.9358 KOps/s | 1.9501 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3799ms | 0.3243ms | 3.0834 KOps/s | 3.0806 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 49.5610μs | 20.2183μs | 49.4601 KOps/s | 49.2568 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 97.2710μs | 39.1830μs | 25.5213 KOps/s | 25.8611 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1367ms | 70.3823μs | 14.2081 KOps/s | 14.2609 KOps/s | |
test_compile_copy_flat[pytree-eager] | 98.3110μs | 51.9509μs | 19.2490 KOps/s | 19.4575 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.3531ms | 0.8233ms | 1.2146 KOps/s | 1.0993 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.5611ms | 3.1618ms | 316.2756 Ops/s | 306.4293 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.4394ms | 0.8509ms | 1.1752 KOps/s | 1.1000 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.5468ms | 3.2139ms | 311.1515 Ops/s | 319.5848 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1702ms | 0.1250ms | 8.0006 KOps/s | 7.8393 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.4806ms | 66.4832μs | 15.0414 KOps/s | 15.1029 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2081ms | 0.1163ms | 8.5978 KOps/s | 8.1919 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 89.8410μs | 41.3451μs | 24.1867 KOps/s | 23.3473 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1681ms | 0.1151ms | 8.6866 KOps/s | 8.1512 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 96.6510μs | 41.1165μs | 24.3211 KOps/s | 22.4988 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2159ms | 0.1465ms | 6.8280 KOps/s | 6.7693 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1493ms | 26.0953μs | 38.3211 KOps/s | 38.7306 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2402ms | 0.1402ms | 7.1332 KOps/s | 7.0583 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 71.8110μs | 20.3604μs | 49.1149 KOps/s | 48.6207 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2018ms | 0.1410ms | 7.0913 KOps/s | 6.5934 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 67.0510μs | 22.4618μs | 44.5200 KOps/s | 48.3659 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2290ms | 0.1486ms | 6.7312 KOps/s | 6.6702 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4654ms | 26.2341μs | 38.1183 KOps/s | 33.0417 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2157ms | 0.1414ms | 7.0745 KOps/s | 6.5802 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 67.8210μs | 20.4228μs | 48.9650 KOps/s | 48.8872 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2185ms | 0.1416ms | 7.0627 KOps/s | 6.9347 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1061ms | 20.5264μs | 48.7176 KOps/s | 48.9623 KOps/s | |
test_mod_add[eager] | 96.3510μs | 29.5953μs | 33.7892 KOps/s | 28.5292 KOps/s | |
test_mod_add[compile] | 0.1900ms | 81.3718μs | 12.2893 KOps/s | 11.2153 KOps/s | |
test_mod_add[compile-overhead] | 0.3483ms | 0.1589ms | 6.2926 KOps/s | 5.9433 KOps/s | |
test_mod_wrap[eager] | 0.3270ms | 0.2358ms | 4.2401 KOps/s | 3.9255 KOps/s | |
test_mod_wrap[compile] | 0.3567ms | 0.3037ms | 3.2926 KOps/s | 3.1874 KOps/s | |
test_mod_wrap[compile-overhead] | 7.9335ms | 4.1365ms | 241.7526 Ops/s | 234.7315 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4438ms | 1.3201ms | 757.5375 Ops/s | 707.8046 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5816ms | 1.3280ms | 753.0310 Ops/s | 748.2054 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3720ms | 0.9033ms | 1.1070 KOps/s | 1.0875 KOps/s | |
test_seq_add[eager] | 0.1625ms | 97.2957μs | 10.2779 KOps/s | 9.8743 KOps/s | |
test_seq_add[compile] | 0.1416ms | 91.1062μs | 10.9762 KOps/s | 10.8292 KOps/s | |
test_seq_add[compile-overhead] | 0.1727ms | 0.1250ms | 8.0023 KOps/s | 7.9182 KOps/s | |
test_seq_wrap[eager] | 0.4629ms | 0.3853ms | 2.5951 KOps/s | 2.5417 KOps/s | |
test_seq_wrap[compile] | 0.4003ms | 0.3161ms | 3.1640 KOps/s | 3.0509 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2743ms | 0.2202ms | 4.5420 KOps/s | 4.4546 KOps/s | |
test_func_call_runtime[False-eager] | 1.0066ms | 0.7446ms | 1.3430 KOps/s | 1.3316 KOps/s | |
test_func_call_runtime[False-compile] | 0.8974ms | 0.8235ms | 1.2143 KOps/s | 1.2555 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4036ms | 0.3584ms | 2.7900 KOps/s | 2.7591 KOps/s | |
test_func_call_runtime[True-eager] | 0.9539ms | 0.8785ms | 1.1383 KOps/s | 1.1221 KOps/s | |
test_func_call_runtime[True-compile] | 0.8540ms | 0.8147ms | 1.2274 KOps/s | 1.2247 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4323ms | 0.3784ms | 2.6424 KOps/s | 2.6145 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7833ms | 0.7126ms | 1.4033 KOps/s | 1.3753 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8345ms | 0.7947ms | 1.2583 KOps/s | 1.2581 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4109ms | 0.3588ms | 2.7870 KOps/s | 2.7445 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0685ms | 0.9888ms | 1.0114 KOps/s | 999.5376 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9350ms | 0.8378ms | 1.1936 KOps/s | 1.1696 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4766ms | 0.4034ms | 2.4787 KOps/s | 2.4387 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5366ms | 2.0527ms | 487.1635 Ops/s | 482.9895 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9300ms | 0.8542ms | 1.1707 KOps/s | 1.1646 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5238ms | 0.4092ms | 2.4438 KOps/s | 2.4140 KOps/s | |
test_distributed | 0.7928ms | 0.1555ms | 6.4321 KOps/s | 8.6586 KOps/s | |
test_tdmodule | 48.1300μs | 13.5883μs | 73.5927 KOps/s | 61.7354 KOps/s | |
test_tdmodule_dispatch | 0.3129ms | 26.6292μs | 37.5527 KOps/s | 31.0112 KOps/s | |
test_tdseq | 30.4600μs | 14.3175μs | 69.8444 KOps/s | 57.4555 KOps/s | |
test_tdseq_dispatch | 52.1500μs | 29.3288μs | 34.0962 KOps/s | 28.2578 KOps/s | |
test_instantiation_functorch | 1.9918ms | 1.8343ms | 545.1632 Ops/s | 543.3561 Ops/s | |
test_exec_functorch | 0.2536ms | 0.2067ms | 4.8389 KOps/s | 4.7769 KOps/s | |
test_exec_functional_call | 0.3098ms | 0.2046ms | 4.8885 KOps/s | 4.7822 KOps/s | |
test_exec_td_decorator | 0.4195ms | 0.2543ms | 3.9330 KOps/s | 3.8507 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7852ms | 0.6684ms | 1.4960 KOps/s | 1.4692 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7781ms | 0.6682ms | 1.4966 KOps/s | 1.4690 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7036ms | 0.5915ms | 1.6906 KOps/s | 1.6853 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6986ms | 0.5923ms | 1.6882 KOps/s | 1.6829 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.2272ms | 19.1485ms | 52.2234 Ops/s | 52.2768 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.2429ms | 19.1560ms | 52.2029 Ops/s | 52.1947 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.1050ms | 19.0271ms | 52.5567 Ops/s | 52.7072 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.0822ms | 19.0405ms | 52.5195 Ops/s | 52.3717 Ops/s | |
test_to_module_speed[True] | 1.4932ms | 0.9756ms | 1.0250 KOps/s | 1.0071 KOps/s | |
test_to_module_speed[False] | 1.0577ms | 0.9545ms | 1.0477 KOps/s | 1.0180 KOps/s | |
test_tc_init | 54.6810μs | 33.6020μs | 29.7602 KOps/s | 25.9418 KOps/s | |
test_tc_init_nested | 0.1091ms | 69.7483μs | 14.3373 KOps/s | 12.7161 KOps/s | |
test_tc_first_layer_tensor | 12.0016μs | 0.6996μs | 1.4294 MOps/s | 1.4261 MOps/s | |
test_tc_first_layer_nontensor | 19.4500μs | 2.4005μs | 416.5882 KOps/s | 422.6868 KOps/s | |
test_tc_second_layer_tensor | 11.0400μs | 1.4376μs | 695.5907 KOps/s | 688.7028 KOps/s | |
test_tc_second_layer_nontensor | 27.2400μs | 3.1424μs | 318.2321 KOps/s | 322.8610 KOps/s | |
test_unbind | 0.2004s | 9.4916ms | 105.3560 Ops/s | 90.8897 Ops/s | |
test_full_like | 0.7667ms | 0.5731ms | 1.7449 KOps/s | 1.7418 KOps/s | |
test_zeros_like | 0.2585ms | 0.1980ms | 5.0514 KOps/s | 5.0551 KOps/s | |
test_ones_like | 0.2299ms | 0.1977ms | 5.0581 KOps/s | 5.0578 KOps/s | |
test_clone | 0.4830ms | 0.4146ms | 2.4117 KOps/s | 2.4106 KOps/s | |
test_squeeze | 0.1704ms | 9.4166μs | 106.1953 KOps/s | 101.4378 KOps/s | |
test_unsqueeze | 0.2071ms | 72.6606μs | 13.7626 KOps/s | 13.3691 KOps/s | |
test_split | 0.3995ms | 0.1668ms | 5.9959 KOps/s | 5.9235 KOps/s | |
test_permute | 0.2392ms | 0.1887ms | 5.2996 KOps/s | 5.5478 KOps/s | |
test_stack | 1.2491ms | 0.8458ms | 1.1823 KOps/s | 1.1659 KOps/s | |
test_cat | 1.2498ms | 1.2312ms | 812.1882 Ops/s | 812.1508 Ops/s |
vmoens
added a commit
that referenced
this pull request
Oct 24, 2024
ghstack-source-id: be0c0928c387a35ef859bb479d86027d09d2dd9d Pull Request resolved: #1058
vmoens
added a commit
that referenced
this pull request
Oct 24, 2024
ghstack-source-id: be0c0928c387a35ef859bb479d86027d09d2dd9d Pull Request resolved: #1058
vmoens
added a commit
that referenced
this pull request
Oct 25, 2024
ghstack-source-id: f8deadf3ebac5636580f4b2ca6e4748103098ac1 Pull Request resolved: #1058
vmoens
added a commit
that referenced
this pull request
Oct 25, 2024
ghstack-source-id: 9d5436c6bbc743e3c754d5fe5f6d87b005dde014 Pull Request resolved: #1058
vmoens
added a commit
that referenced
this pull request
Oct 25, 2024
ghstack-source-id: 9d5436c6bbc743e3c754d5fe5f6d87b005dde014 Pull Request resolved: #1058
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Refactor
Refactoring code - not a new feature
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):