-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Quality] Better error for mismatching TDs #964
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Aug 13, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 46.4570μs | 22.4274μs | 44.5883 KOps/s | 44.9646 KOps/s | |
test_plain_set_stack_nested | 61.4550μs | 22.5034μs | 44.4377 KOps/s | 43.7251 KOps/s | |
test_plain_set_nested_inplace | 67.8470μs | 24.5862μs | 40.6732 KOps/s | 40.7158 KOps/s | |
test_plain_set_stack_nested_inplace | 64.6410μs | 24.7990μs | 40.3242 KOps/s | 41.0665 KOps/s | |
test_items | 44.8540μs | 2.6480μs | 377.6495 KOps/s | 387.6159 KOps/s | |
test_items_nested | 1.7829ms | 0.3581ms | 2.7924 KOps/s | 2.9754 KOps/s | |
test_items_nested_locked | 0.5316ms | 0.3569ms | 2.8017 KOps/s | 2.9585 KOps/s | |
test_items_nested_leaf | 0.1612ms | 88.1465μs | 11.3447 KOps/s | 11.8801 KOps/s | |
test_items_stack_nested | 0.5419ms | 0.3540ms | 2.8252 KOps/s | 2.9512 KOps/s | |
test_items_stack_nested_leaf | 0.1610ms | 88.5212μs | 11.2967 KOps/s | 11.5688 KOps/s | |
test_items_stack_nested_locked | 0.5530ms | 0.3580ms | 2.7933 KOps/s | 2.9546 KOps/s | |
test_keys | 43.7760μs | 3.9963μs | 250.2345 KOps/s | 256.3429 KOps/s | |
test_keys_nested | 0.2450ms | 0.1496ms | 6.6841 KOps/s | 6.8968 KOps/s | |
test_keys_nested_locked | 0.7029ms | 0.1546ms | 6.4675 KOps/s | 6.6851 KOps/s | |
test_keys_nested_leaf | 0.1964ms | 0.1302ms | 7.6825 KOps/s | 7.8640 KOps/s | |
test_keys_stack_nested | 0.2760ms | 0.1489ms | 6.7166 KOps/s | 6.7562 KOps/s | |
test_keys_stack_nested_leaf | 0.2499ms | 0.1301ms | 7.6892 KOps/s | 7.9154 KOps/s | |
test_keys_stack_nested_locked | 0.2957ms | 0.1569ms | 6.3717 KOps/s | 6.6349 KOps/s | |
test_values | 10.4412μs | 1.1993μs | 833.8118 KOps/s | 861.1039 KOps/s | |
test_values_nested | 0.1141ms | 53.4591μs | 18.7059 KOps/s | 19.8265 KOps/s | |
test_values_nested_locked | 0.1087ms | 53.1041μs | 18.8309 KOps/s | 19.8022 KOps/s | |
test_values_nested_leaf | 91.3010μs | 48.7138μs | 20.5281 KOps/s | 21.8851 KOps/s | |
test_values_stack_nested | 97.7230μs | 54.1110μs | 18.4805 KOps/s | 19.6698 KOps/s | |
test_values_stack_nested_leaf | 96.8410μs | 47.8682μs | 20.8907 KOps/s | 21.9176 KOps/s | |
test_values_stack_nested_locked | 0.1084ms | 54.8540μs | 18.2302 KOps/s | 19.7165 KOps/s | |
test_membership | 8.3733μs | 0.8814μs | 1.1346 MOps/s | 1.0789 MOps/s | |
test_membership_nested | 22.3120μs | 2.7466μs | 364.0874 KOps/s | 380.9265 KOps/s | |
test_membership_nested_leaf | 20.1180μs | 2.7656μs | 361.5890 KOps/s | 381.8003 KOps/s | |
test_membership_stacked_nested | 33.3320μs | 2.7173μs | 368.0130 KOps/s | 383.3030 KOps/s | |
test_membership_stacked_nested_leaf | 18.8560μs | 2.7510μs | 363.4986 KOps/s | 353.7375 KOps/s | |
test_membership_nested_last | 22.9730μs | 4.1283μs | 242.2321 KOps/s | 251.0795 KOps/s | |
test_membership_nested_leaf_last | 23.6740μs | 4.1340μs | 241.8945 KOps/s | 242.8555 KOps/s | |
test_membership_stacked_nested_last | 36.9300μs | 12.7018μs | 78.7288 KOps/s | 257.0163 KOps/s | |
test_membership_stacked_nested_leaf_last | 38.3720μs | 12.7828μs | 78.2302 KOps/s | 255.7027 KOps/s | |
test_nested_getleaf | 35.9070μs | 10.3290μs | 96.8148 KOps/s | 93.8414 KOps/s | |
test_nested_get | 36.2580μs | 9.8072μs | 101.9659 KOps/s | 98.7601 KOps/s | |
test_stacked_getleaf | 37.2400μs | 10.3719μs | 96.4144 KOps/s | 93.8652 KOps/s | |
test_stacked_get | 42.3690μs | 9.7447μs | 102.6202 KOps/s | 99.9041 KOps/s | |
test_nested_getitemleaf | 40.1650μs | 11.0184μs | 90.7569 KOps/s | 92.2220 KOps/s | |
test_nested_getitem | 42.3200μs | 10.0667μs | 99.3374 KOps/s | 99.1851 KOps/s | |
test_stacked_getitemleaf | 31.2590μs | 10.8992μs | 91.7495 KOps/s | 91.7767 KOps/s | |
test_stacked_getitem | 34.9560μs | 10.0673μs | 99.3317 KOps/s | 100.3910 KOps/s | |
test_lock_nested | 82.6419ms | 0.5860ms | 1.7064 KOps/s | 1.9816 KOps/s | |
test_lock_stack_nested | 0.7873ms | 0.4504ms | 2.2202 KOps/s | 2.1058 KOps/s | |
test_unlock_nested | 89.1255ms | 0.5068ms | 1.9730 KOps/s | 2.3737 KOps/s | |
test_unlock_stack_nested | 0.6862ms | 0.3682ms | 2.7159 KOps/s | 2.5468 KOps/s | |
test_flatten_speed | 0.2109ms | 0.1098ms | 9.1051 KOps/s | 9.5788 KOps/s | |
test_unflatten_speed | 1.0262ms | 0.4831ms | 2.0701 KOps/s | 2.1423 KOps/s | |
test_common_ops | 3.1823ms | 1.0927ms | 915.1408 Ops/s | 874.3161 Ops/s | |
test_creation | 0.2646ms | 2.0317μs | 492.1911 KOps/s | 490.4885 KOps/s | |
test_creation_empty | 0.1454ms | 17.8636μs | 55.9796 KOps/s | 52.6947 KOps/s | |
test_creation_nested_1 | 56.4760μs | 21.0419μs | 47.5243 KOps/s | 44.6032 KOps/s | |
test_creation_nested_2 | 78.6170μs | 25.2994μs | 39.5267 KOps/s | 37.6212 KOps/s | |
test_clone | 0.1043ms | 16.5984μs | 60.2469 KOps/s | 59.4235 KOps/s | |
test_getitem[int] | 1.4737ms | 16.6690μs | 59.9915 KOps/s | 59.4742 KOps/s | |
test_getitem[slice_int] | 0.1208ms | 31.3951μs | 31.8521 KOps/s | 31.3130 KOps/s | |
test_getitem[range] | 0.3824ms | 58.1026μs | 17.2109 KOps/s | 17.5517 KOps/s | |
test_getitem[tuple] | 0.1159ms | 25.2990μs | 39.5273 KOps/s | 39.0654 KOps/s | |
test_getitem[list] | 0.2073ms | 52.3860μs | 19.0891 KOps/s | 19.1244 KOps/s | |
test_setitem_dim[int] | 81.1420μs | 41.7766μs | 23.9368 KOps/s | 22.9557 KOps/s | |
test_setitem_dim[slice_int] | 0.1242ms | 72.5869μs | 13.7766 KOps/s | 13.6427 KOps/s | |
test_setitem_dim[range] | 0.1554ms | 94.6453μs | 10.5658 KOps/s | 10.5989 KOps/s | |
test_setitem_dim[tuple] | 0.1076ms | 60.1467μs | 16.6260 KOps/s | 16.5046 KOps/s | |
test_setitem | 0.1452ms | 29.8367μs | 33.5158 KOps/s | 32.3806 KOps/s | |
test_set | 0.1557ms | 29.1942μs | 34.2534 KOps/s | 33.6318 KOps/s | |
test_set_shared | 1.4669ms | 0.2195ms | 4.5555 KOps/s | 4.5917 KOps/s | |
test_update | 0.1953ms | 35.3480μs | 28.2902 KOps/s | 26.5082 KOps/s | |
test_update_nested | 0.2409ms | 45.5026μs | 21.9768 KOps/s | 20.8547 KOps/s | |
test_update__nested | 0.2186ms | 33.8594μs | 29.5339 KOps/s | 28.7102 KOps/s | |
test_set_nested | 0.1785ms | 30.6571μs | 32.6189 KOps/s | 30.4970 KOps/s | |
test_set_nested_new | 0.1054ms | 35.9582μs | 27.8101 KOps/s | 26.7141 KOps/s | |
test_select | 0.1705ms | 53.5159μs | 18.6860 KOps/s | 18.2865 KOps/s | |
test_select_nested | 0.1773ms | 61.9080μs | 16.1530 KOps/s | 16.7822 KOps/s | |
test_exclude_nested | 0.1511ms | 82.4927μs | 12.1223 KOps/s | 12.9699 KOps/s | |
test_empty[True] | 0.4612ms | 0.3422ms | 2.9227 KOps/s | 3.0752 KOps/s | |
test_empty[False] | 11.3662μs | 1.2391μs | 807.0617 KOps/s | 834.1814 KOps/s | |
test_unbind_speed | 0.4687ms | 0.3130ms | 3.1952 KOps/s | 3.1367 KOps/s | |
test_unbind_speed_stack0 | 0.5002ms | 0.2987ms | 3.3476 KOps/s | 3.2726 KOps/s | |
test_unbind_speed_stack1 | 91.3617ms | 0.7832ms | 1.2769 KOps/s | 1.3582 KOps/s | |
test_split | 90.2281ms | 2.2234ms | 449.7535 Ops/s | 472.6353 Ops/s | |
test_chunk | 89.5225ms | 2.2356ms | 447.3027 Ops/s | 469.8147 Ops/s | |
test_creation[device0] | 4.1553ms | 0.1250ms | 7.9995 KOps/s | 8.1473 KOps/s | |
test_creation_from_tensor | 0.2319ms | 0.1232ms | 8.1138 KOps/s | 8.2640 KOps/s | |
test_add_one[memmap_tensor0] | 0.2173ms | 7.5937μs | 131.6879 KOps/s | 126.4042 KOps/s | |
test_contiguous[memmap_tensor0] | 30.5270μs | 2.0319μs | 492.1517 KOps/s | 485.7138 KOps/s | |
test_stack[memmap_tensor0] | 51.3160μs | 5.8211μs | 171.7897 KOps/s | 177.1161 KOps/s | |
test_memmaptd_index | 1.1578ms | 0.4125ms | 2.4245 KOps/s | 2.4447 KOps/s | |
test_memmaptd_index_astensor | 0.7419ms | 0.4844ms | 2.0642 KOps/s | 2.0552 KOps/s | |
test_memmaptd_index_op | 1.4006ms | 1.0332ms | 967.8675 Ops/s | 933.6744 Ops/s | |
test_serialize_model | 0.1358s | 0.1198s | 8.3447 Ops/s | 7.5248 Ops/s | |
test_serialize_model_pickle | 0.4540s | 0.3935s | 2.5411 Ops/s | 2.4528 Ops/s | |
test_serialize_weights | 0.2153s | 0.1296s | 7.7149 Ops/s | 8.6837 Ops/s | |
test_serialize_weights_returnearly | 0.1703s | 0.1583s | 6.3171 Ops/s | 6.2643 Ops/s | |
test_serialize_weights_pickle | 0.4984s | 0.4180s | 2.3923 Ops/s | 2.4541 Ops/s | |
test_serialize_weights_filesystem | 0.1461s | 0.1443s | 6.9309 Ops/s | 7.1339 Ops/s | |
test_serialize_model_filesystem | 0.2316s | 0.1637s | 6.1104 Ops/s | 6.7486 Ops/s | |
test_reshape_pytree | 85.6210μs | 40.0136μs | 24.9915 KOps/s | 24.7431 KOps/s | |
test_reshape_td | 0.1346ms | 47.2936μs | 21.1445 KOps/s | 21.0095 KOps/s | |
test_view_pytree | 98.3450μs | 39.8157μs | 25.1157 KOps/s | 24.4655 KOps/s | |
test_view_td | 0.1153ms | 54.6061μs | 18.3130 KOps/s | 18.2276 KOps/s | |
test_unbind_pytree | 84.5780μs | 37.7988μs | 26.4559 KOps/s | 26.8062 KOps/s | |
test_unbind_td | 0.3608ms | 45.3884μs | 22.0320 KOps/s | 21.3764 KOps/s | |
test_split_pytree | 97.2130μs | 40.2271μs | 24.8589 KOps/s | 24.8557 KOps/s | |
test_split_td | 0.4976ms | 58.6766μs | 17.0426 KOps/s | 17.2384 KOps/s | |
test_add_pytree | 0.1107ms | 46.9503μs | 21.2991 KOps/s | 21.3103 KOps/s | |
test_add_td | 0.1772ms | 81.2052μs | 12.3145 KOps/s | 11.3275 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1204ms | 54.3463μs | 18.4005 KOps/s | 18.2688 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3026ms | 0.1908ms | 5.2413 KOps/s | 5.1043 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.3050ms | 55.2439μs | 18.1015 KOps/s | 17.9568 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2557ms | 0.1478ms | 6.7673 KOps/s | 6.8629 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 69.5300μs | 21.1543μs | 47.2717 KOps/s | 45.7911 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1272ms | 66.2804μs | 15.0874 KOps/s | 15.2866 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1723ms | 81.7289μs | 12.2356 KOps/s | 12.3743 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1389ms | 73.4030μs | 13.6234 KOps/s | 13.8043 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2785ms | 0.1746ms | 5.7268 KOps/s | 5.6546 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3338ms | 0.1937ms | 5.1624 KOps/s | 5.0803 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 93.5560μs | 39.7062μs | 25.1850 KOps/s | 25.1329 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4469ms | 73.1557μs | 13.6695 KOps/s | 14.2678 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3557ms | 0.1744ms | 5.7334 KOps/s | 5.5927 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4357ms | 0.2882ms | 3.4694 KOps/s | 3.3953 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4178ms | 0.2097ms | 4.7687 KOps/s | 4.7565 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4067ms | 0.1779ms | 5.6212 KOps/s | 5.7271 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1894ms | 63.8267μs | 15.6674 KOps/s | 15.7131 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 96.4010μs | 39.9618μs | 25.0239 KOps/s | 24.8830 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4312ms | 0.2385ms | 4.1923 KOps/s | 4.1773 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3490ms | 0.1744ms | 5.7328 KOps/s | 5.6575 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1980ms | 0.1089ms | 9.1798 KOps/s | 9.1987 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1278ms | 61.9128μs | 16.1518 KOps/s | 17.4655 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1749ms | 81.9913μs | 12.1964 KOps/s | 12.1345 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1449ms | 72.2554μs | 13.8398 KOps/s | 13.7010 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2884ms | 0.1903ms | 5.2547 KOps/s | 5.1571 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.3849ms | 1.6362ms | 611.1780 Ops/s | 603.6685 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3328ms | 0.1867ms | 5.3567 KOps/s | 5.1888 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.8326ms | 1.0788ms | 926.9899 Ops/s | 920.8686 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.6464ms | 0.4085ms | 2.4481 KOps/s | 2.3843 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.2054ms | 3.9385ms | 253.9064 Ops/s | 249.8461 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 79.2890μs | 32.5319μs | 30.7390 KOps/s | 28.8577 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.9875ms | 48.3390μs | 20.6872 KOps/s | 20.1579 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 89.2370μs | 28.6829μs | 34.8640 KOps/s | 34.4685 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 87.5940μs | 31.3974μs | 31.8497 KOps/s | 31.6730 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 85.8510μs | 29.1700μs | 34.2818 KOps/s | 34.0071 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 86.1420μs | 31.0333μs | 32.2235 KOps/s | 31.8303 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1704ms | 72.3039μs | 13.8305 KOps/s | 13.5411 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4940ms | 28.7847μs | 34.7406 KOps/s | 34.6742 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1354ms | 68.0545μs | 14.6941 KOps/s | 14.2769 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 68.4780μs | 24.9471μs | 40.0848 KOps/s | 39.9310 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1541ms | 67.9084μs | 14.7257 KOps/s | 14.7297 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 62.2060μs | 24.9184μs | 40.1311 KOps/s | 40.7051 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1386ms | 71.6375μs | 13.9592 KOps/s | 13.8687 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8580ms | 28.9190μs | 34.5794 KOps/s | 35.1168 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1445ms | 68.0422μs | 14.6968 KOps/s | 14.6902 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 72.2860μs | 24.8339μs | 40.2675 KOps/s | 41.0547 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1789ms | 66.6050μs | 15.0139 KOps/s | 14.8510 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.4428ms | 24.7788μs | 40.3570 KOps/s | 40.6536 KOps/s | |
test_mod_add[eager] | 0.2888ms | 25.2595μs | 39.5891 KOps/s | 38.7724 KOps/s | |
test_mod_add[compile] | 92.4240μs | 35.8888μs | 27.8638 KOps/s | 27.0030 KOps/s | |
test_mod_add[compile-overhead] | 94.3270μs | 36.1249μs | 27.6817 KOps/s | 28.4852 KOps/s | |
test_mod_wrap[eager] | 0.4093ms | 0.2113ms | 4.7325 KOps/s | 4.8548 KOps/s | |
test_mod_wrap[compile] | 1.0306ms | 0.2337ms | 4.2787 KOps/s | 4.3958 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4983ms | 0.2273ms | 4.4001 KOps/s | 4.4126 KOps/s | |
test_mod_wrap_and_backward[eager] | 13.6765ms | 10.8810ms | 91.9034 Ops/s | 90.5199 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.5369ms | 12.0158ms | 83.2240 Ops/s | 89.8924 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 14.8567ms | 11.4544ms | 87.3026 Ops/s | 92.0163 Ops/s | |
test_seq_add[eager] | 0.1608ms | 89.5765μs | 11.1636 KOps/s | 11.0623 KOps/s | |
test_seq_add[compile] | 0.1414ms | 58.3846μs | 17.1278 KOps/s | 16.1662 KOps/s | |
test_seq_add[compile-overhead] | 0.1539ms | 58.8825μs | 16.9830 KOps/s | 16.3571 KOps/s | |
test_seq_wrap[eager] | 0.6411ms | 0.3808ms | 2.6263 KOps/s | 2.6164 KOps/s | |
test_seq_wrap[compile] | 0.4568ms | 0.2567ms | 3.8957 KOps/s | 3.7070 KOps/s | |
test_seq_wrap[compile-overhead] | 1.0698ms | 0.2677ms | 3.7357 KOps/s | 3.8076 KOps/s | |
test_func_call_runtime[False-eager] | 0.8431ms | 0.5274ms | 1.8962 KOps/s | 1.9239 KOps/s | |
test_func_call_runtime[False-compile] | 0.9704ms | 0.4938ms | 2.0249 KOps/s | 2.0329 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 1.0941ms | 0.4951ms | 2.0199 KOps/s | 2.0327 KOps/s | |
test_func_call_runtime[True-eager] | 0.8989ms | 0.7550ms | 1.3246 KOps/s | 1.3419 KOps/s | |
test_func_call_runtime[True-compile] | 0.9909ms | 0.5070ms | 1.9726 KOps/s | 1.9541 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8436ms | 0.5031ms | 1.9875 KOps/s | 1.9593 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8856ms | 0.5289ms | 1.8907 KOps/s | 1.9383 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9358ms | 0.4964ms | 2.0146 KOps/s | 2.0215 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7689ms | 0.5172ms | 1.9335 KOps/s | 2.0051 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0510ms | 0.9203ms | 1.0866 KOps/s | 1.1361 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.1799ms | 0.8512ms | 1.1748 KOps/s | 1.1811 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.4960ms | 0.8560ms | 1.1682 KOps/s | 1.1878 KOps/s | |
test_distributed | 0.2384ms | 0.1319ms | 7.5787 KOps/s | 7.5372 KOps/s | |
test_tdmodule | 0.1354ms | 17.8907μs | 55.8951 KOps/s | 55.2061 KOps/s | |
test_tdmodule_dispatch | 66.1140μs | 37.1635μs | 26.9081 KOps/s | 26.0004 KOps/s | |
test_tdseq | 36.3180μs | 19.8382μs | 50.4079 KOps/s | 45.7706 KOps/s | |
test_tdseq_dispatch | 70.6720μs | 41.3316μs | 24.1946 KOps/s | 23.4668 KOps/s | |
test_instantiation_functorch | 1.9208ms | 1.6820ms | 594.5338 Ops/s | 586.6105 Ops/s | |
test_instantiation_td | 1.9874ms | 1.2276ms | 814.5760 Ops/s | 829.1160 Ops/s | |
test_exec_functorch | 0.3367ms | 0.1871ms | 5.3444 KOps/s | 5.4681 KOps/s | |
test_exec_functional_call | 0.3099ms | 0.1752ms | 5.7090 KOps/s | 5.8025 KOps/s | |
test_exec_td | 0.3383ms | 0.1749ms | 5.7162 KOps/s | 5.6970 KOps/s | |
test_exec_td_decorator | 0.5871ms | 0.2281ms | 4.3849 KOps/s | 4.3939 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8435ms | 0.5716ms | 1.7493 KOps/s | 1.7002 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8806ms | 0.5706ms | 1.7524 KOps/s | 1.7217 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.8497ms | 0.4739ms | 2.1103 KOps/s | 2.0994 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7512ms | 0.4721ms | 2.1182 KOps/s | 2.0947 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.4239ms | 0.6250ms | 1.6000 KOps/s | 1.5648 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0635ms | 0.6317ms | 1.5829 KOps/s | 1.5711 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7876ms | 0.5218ms | 1.9163 KOps/s | 1.9087 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6913ms | 0.5173ms | 1.9332 KOps/s | 1.9151 KOps/s | |
test_to_module_speed[True] | 2.1794ms | 1.3506ms | 740.4229 Ops/s | 738.5252 Ops/s | |
test_to_module_speed[False] | 1.4184ms | 1.3051ms | 766.2067 Ops/s | 757.2280 Ops/s | |
test_tc_init | 94.3170μs | 45.8717μs | 21.7999 KOps/s | 22.1316 KOps/s | |
test_tc_init_nested | 0.1920ms | 91.2379μs | 10.9604 KOps/s | 10.9145 KOps/s | |
test_tc_first_layer_tensor | 19.6670μs | 1.6134μs | 619.8174 KOps/s | 701.5692 KOps/s | |
test_tc_first_layer_nontensor | 22.6920μs | 4.6227μs | 216.3252 KOps/s | 237.4797 KOps/s | |
test_tc_second_layer_tensor | 21.2000μs | 3.0121μs | 331.9923 KOps/s | 370.8664 KOps/s | |
test_tc_second_layer_nontensor | 28.0820μs | 6.0950μs | 164.0690 KOps/s | 184.5699 KOps/s | |
test_unbind | 0.4508s | 13.8701ms | 72.0974 Ops/s | 73.1857 Ops/s | |
test_full_like | 8.3638ms | 7.1940ms | 139.0057 Ops/s | 135.1964 Ops/s | |
test_zeros_like | 17.3081ms | 7.6128ms | 131.3572 Ops/s | 140.2380 Ops/s | |
test_ones_like | 12.8873ms | 7.8461ms | 127.4511 Ops/s | 135.6040 Ops/s | |
test_clone | 13.6361ms | 9.3158ms | 107.3450 Ops/s | 104.7805 Ops/s | |
test_squeeze | 61.2250μs | 13.1213μs | 76.2118 KOps/s | 77.3302 KOps/s | |
test_unsqueeze | 0.2076ms | 96.9662μs | 10.3129 KOps/s | 10.3546 KOps/s | |
test_split | 0.5104ms | 0.2069ms | 4.8330 KOps/s | 4.9113 KOps/s | |
test_permute | 0.3407ms | 0.2192ms | 4.5619 KOps/s | 4.5234 KOps/s | |
test_stack | 29.8207ms | 25.3823ms | 39.3976 Ops/s | 39.0834 Ops/s | |
test_cat | 26.8120ms | 25.3348ms | 39.4713 Ops/s | 39.0649 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 52.3610μs | 16.5410μs | 60.4557 KOps/s | 58.2886 KOps/s | |
test_plain_set_stack_nested | 38.3210μs | 16.9060μs | 59.1508 KOps/s | 58.0171 KOps/s | |
test_plain_set_nested_inplace | 41.8120μs | 17.7374μs | 56.3780 KOps/s | 54.6104 KOps/s | |
test_plain_set_stack_nested_inplace | 32.7000μs | 17.8291μs | 56.0882 KOps/s | 54.4469 KOps/s | |
test_items | 18.7510μs | 4.5928μs | 217.7341 KOps/s | 214.8627 KOps/s | |
test_items_nested | 0.3851ms | 0.3657ms | 2.7348 KOps/s | 2.7287 KOps/s | |
test_items_nested_locked | 0.3876ms | 0.3683ms | 2.7152 KOps/s | 2.7295 KOps/s | |
test_items_nested_leaf | 0.1074ms | 83.8980μs | 11.9192 KOps/s | 12.0097 KOps/s | |
test_items_stack_nested | 0.4248ms | 0.3704ms | 2.7000 KOps/s | 2.7473 KOps/s | |
test_items_stack_nested_leaf | 0.1077ms | 86.3489μs | 11.5809 KOps/s | 11.9067 KOps/s | |
test_items_stack_nested_locked | 0.3924ms | 0.3741ms | 2.6732 KOps/s | 2.7644 KOps/s | |
test_keys | 21.1500μs | 4.3787μs | 228.3767 KOps/s | 227.6312 KOps/s | |
test_keys_nested | 92.1520μs | 67.4586μs | 14.8239 KOps/s | 15.1677 KOps/s | |
test_keys_nested_locked | 0.7620ms | 73.3268μs | 13.6376 KOps/s | 13.8937 KOps/s | |
test_keys_nested_leaf | 72.6610μs | 58.1840μs | 17.1869 KOps/s | 17.9330 KOps/s | |
test_keys_stack_nested | 85.0910μs | 67.5274μs | 14.8088 KOps/s | 14.8692 KOps/s | |
test_keys_stack_nested_leaf | 79.1310μs | 57.6146μs | 17.3567 KOps/s | 17.3896 KOps/s | |
test_keys_stack_nested_locked | 0.1063ms | 72.9024μs | 13.7170 KOps/s | 13.7928 KOps/s | |
test_values | 7.4567μs | 1.7596μs | 568.3189 KOps/s | 568.1145 KOps/s | |
test_values_nested | 51.8910μs | 33.7753μs | 29.6074 KOps/s | 29.8778 KOps/s | |
test_values_nested_locked | 49.9410μs | 35.7851μs | 27.9446 KOps/s | 28.1138 KOps/s | |
test_values_nested_leaf | 51.9710μs | 30.0872μs | 33.2367 KOps/s | 33.3762 KOps/s | |
test_values_stack_nested | 50.7610μs | 34.3679μs | 29.0969 KOps/s | 29.7825 KOps/s | |
test_values_stack_nested_leaf | 45.6210μs | 30.5099μs | 32.7763 KOps/s | 33.0551 KOps/s | |
test_values_stack_nested_locked | 53.3710μs | 36.2995μs | 27.5486 KOps/s | 27.9949 KOps/s | |
test_membership | 1.2140μs | 0.5449μs | 1.8350 MOps/s | 1.8114 MOps/s | |
test_membership_nested | 13.6350μs | 1.9228μs | 520.0773 KOps/s | 483.8987 KOps/s | |
test_membership_nested_leaf | 10.4455μs | 1.9367μs | 516.3511 KOps/s | 503.3977 KOps/s | |
test_membership_stacked_nested | 25.5600μs | 2.0149μs | 496.3055 KOps/s | 497.5716 KOps/s | |
test_membership_stacked_nested_leaf | 16.6600μs | 2.0551μs | 486.5924 KOps/s | 487.6151 KOps/s | |
test_membership_nested_last | 21.6200μs | 2.9221μs | 342.2151 KOps/s | 336.5493 KOps/s | |
test_membership_nested_leaf_last | 16.7210μs | 2.9231μs | 342.0981 KOps/s | 342.4547 KOps/s | |
test_membership_stacked_nested_last | 37.9500μs | 2.9547μs | 338.4415 KOps/s | 335.9822 KOps/s | |
test_membership_stacked_nested_leaf_last | 21.4500μs | 2.9180μs | 342.7018 KOps/s | 338.9170 KOps/s | |
test_nested_getleaf | 30.3300μs | 7.8539μs | 127.3261 KOps/s | 126.7642 KOps/s | |
test_nested_get | 22.4200μs | 7.3729μs | 135.6314 KOps/s | 134.4351 KOps/s | |
test_stacked_getleaf | 24.1110μs | 7.9038μs | 126.5216 KOps/s | 126.4118 KOps/s | |
test_stacked_get | 23.5810μs | 7.3878μs | 135.3582 KOps/s | 134.2370 KOps/s | |
test_nested_getitemleaf | 23.9900μs | 8.1413μs | 122.8300 KOps/s | 123.0083 KOps/s | |
test_nested_getitem | 22.6700μs | 7.6604μs | 130.5413 KOps/s | 131.2229 KOps/s | |
test_stacked_getitemleaf | 30.5700μs | 8.1748μs | 122.3278 KOps/s | 121.7395 KOps/s | |
test_stacked_getitem | 25.2800μs | 7.6698μs | 130.3808 KOps/s | 130.5933 KOps/s | |
test_lock_nested | 9.7904ms | 0.4962ms | 2.0152 KOps/s | 2.0669 KOps/s | |
test_lock_stack_nested | 0.4779ms | 0.4473ms | 2.2356 KOps/s | 2.2709 KOps/s | |
test_unlock_nested | 0.8921ms | 0.4039ms | 2.4759 KOps/s | 2.4805 KOps/s | |
test_unlock_stack_nested | 0.3931ms | 0.3625ms | 2.7589 KOps/s | 2.7650 KOps/s | |
test_flatten_speed | 0.4958ms | 0.1056ms | 9.4667 KOps/s | 9.5421 KOps/s | |
test_unflatten_speed | 0.3545ms | 0.3164ms | 3.1602 KOps/s | 3.1292 KOps/s | |
test_common_ops | 1.5999ms | 1.3850ms | 722.0038 Ops/s | 741.9575 Ops/s | |
test_creation | 13.4100μs | 1.6495μs | 606.2435 KOps/s | 596.1473 KOps/s | |
test_creation_empty | 44.1310μs | 16.8756μs | 59.2572 KOps/s | 57.9580 KOps/s | |
test_creation_nested_1 | 38.7510μs | 18.8746μs | 52.9814 KOps/s | 52.9034 KOps/s | |
test_creation_nested_2 | 38.3810μs | 21.8391μs | 45.7895 KOps/s | 45.8288 KOps/s | |
test_clone | 60.2110μs | 32.1759μs | 31.0792 KOps/s | 31.5981 KOps/s | |
test_getitem[int] | 1.2037ms | 18.6218μs | 53.7005 KOps/s | 54.5760 KOps/s | |
test_getitem[slice_int] | 0.1609ms | 31.3407μs | 31.9073 KOps/s | 31.8617 KOps/s | |
test_getitem[range] | 0.2972ms | 0.1191ms | 8.3985 KOps/s | 8.5324 KOps/s | |
test_getitem[tuple] | 0.1546ms | 27.1780μs | 36.7944 KOps/s | 36.8931 KOps/s | |
test_getitem[list] | 0.2335ms | 0.1067ms | 9.3719 KOps/s | 9.3899 KOps/s | |
test_setitem_dim[int] | 83.9320μs | 57.2523μs | 17.4665 KOps/s | 17.9478 KOps/s | |
test_setitem_dim[slice_int] | 0.1093ms | 81.3223μs | 12.2967 KOps/s | 12.5157 KOps/s | |
test_setitem_dim[range] | 0.2059ms | 0.1466ms | 6.8224 KOps/s | 6.9355 KOps/s | |
test_setitem_dim[tuple] | 94.2220μs | 73.8686μs | 13.5375 KOps/s | 13.7100 KOps/s | |
test_setitem | 71.1910μs | 45.5004μs | 21.9778 KOps/s | 21.9703 KOps/s | |
test_set | 70.1210μs | 44.3314μs | 22.5574 KOps/s | 22.4551 KOps/s | |
test_set_shared | 0.3911ms | 57.0786μs | 17.5197 KOps/s | 17.7722 KOps/s | |
test_update | 85.0810μs | 53.4664μs | 18.7033 KOps/s | 18.6167 KOps/s | |
test_update_nested | 90.2610μs | 61.0734μs | 16.3737 KOps/s | 16.3321 KOps/s | |
test_update__nested | 0.1034ms | 65.4647μs | 15.2754 KOps/s | 15.5599 KOps/s | |
test_set_nested | 0.1069ms | 46.5776μs | 21.4696 KOps/s | 21.4633 KOps/s | |
test_set_nested_new | 80.2510μs | 50.5077μs | 19.7990 KOps/s | 19.5711 KOps/s | |
test_select | 88.1710μs | 65.9364μs | 15.1661 KOps/s | 15.2390 KOps/s | |
test_select_nested | 70.4310μs | 51.6221μs | 19.3715 KOps/s | 19.2938 KOps/s | |
test_exclude_nested | 0.1006ms | 70.0585μs | 14.2738 KOps/s | 14.3492 KOps/s | |
test_empty[True] | 0.3142ms | 0.2840ms | 3.5215 KOps/s | 3.5167 KOps/s | |
test_empty[False] | 2.2760μs | 0.8716μs | 1.1474 MOps/s | 1.1182 MOps/s | |
test_to | 48.6510μs | 28.4447μs | 35.1559 KOps/s | 34.8728 KOps/s | |
test_to_nonblocking | 52.9910μs | 28.0597μs | 35.6383 KOps/s | 36.8104 KOps/s | |
test_unbind_speed | 1.5759ms | 0.3195ms | 3.1300 KOps/s | 3.1627 KOps/s | |
test_unbind_speed_stack0 | 0.3525ms | 0.3140ms | 3.1844 KOps/s | 3.2273 KOps/s | |
test_unbind_speed_stack1 | 90.2440ms | 0.7866ms | 1.2712 KOps/s | 1.3897 KOps/s | |
test_split | 93.4632ms | 2.5035ms | 399.4395 Ops/s | 404.7783 Ops/s | |
test_chunk | 92.8786ms | 2.5115ms | 398.1696 Ops/s | 406.3475 Ops/s | |
test_creation[device0] | 0.1596ms | 0.1072ms | 9.3263 KOps/s | 9.0437 KOps/s | |
test_creation_from_tensor | 0.1661ms | 0.1033ms | 9.6841 KOps/s | 9.4908 KOps/s | |
test_add_one[memmap_tensor0] | 0.1725ms | 10.3655μs | 96.4735 KOps/s | 96.1143 KOps/s | |
test_contiguous[memmap_tensor0] | 27.7210μs | 2.2854μs | 437.5658 KOps/s | 431.1966 KOps/s | |
test_stack[memmap_tensor0] | 31.9010μs | 7.7787μs | 128.5555 KOps/s | 129.7355 KOps/s | |
test_memmaptd_index | 1.2071ms | 0.4837ms | 2.0675 KOps/s | 2.1053 KOps/s | |
test_memmaptd_index_astensor | 0.8130ms | 0.5487ms | 1.8224 KOps/s | 1.8541 KOps/s | |
test_memmaptd_index_op | 1.5799ms | 1.1542ms | 866.4276 Ops/s | 870.4611 Ops/s | |
test_serialize_model | 91.7710ms | 88.5814ms | 11.2890 Ops/s | 11.0260 Ops/s | |
test_serialize_model_pickle | 1.3495s | 1.2398s | 0.8066 Ops/s | 0.8063 Ops/s | |
test_serialize_weights | 0.1839s | 96.5571ms | 10.3566 Ops/s | 9.9762 Ops/s | |
test_serialize_weights_returnearly | 0.2709s | 67.0222ms | 14.9204 Ops/s | 14.9810 Ops/s | |
test_serialize_weights_pickle | 1.3441s | 1.2369s | 0.8084 Ops/s | 0.8079 Ops/s | |
test_reshape_pytree | 0.2630ms | 40.0117μs | 24.9927 KOps/s | 24.5048 KOps/s | |
test_reshape_td | 70.1320μs | 46.8801μs | 21.3310 KOps/s | 20.2839 KOps/s | |
test_view_pytree | 63.2810μs | 40.1074μs | 24.9331 KOps/s | 25.3316 KOps/s | |
test_view_td | 0.2562ms | 52.8031μs | 18.9383 KOps/s | 18.2377 KOps/s | |
test_unbind_pytree | 65.5110μs | 39.2515μs | 25.4767 KOps/s | 25.6033 KOps/s | |
test_unbind_td | 0.4267ms | 48.1278μs | 20.7780 KOps/s | 19.6113 KOps/s | |
test_split_pytree | 81.5210μs | 53.1060μs | 18.8302 KOps/s | 18.0808 KOps/s | |
test_split_td | 0.4881ms | 64.4689μs | 15.5114 KOps/s | 14.1423 KOps/s | |
test_add_pytree | 0.2564ms | 64.6266μs | 15.4735 KOps/s | 14.8805 KOps/s | |
test_add_td | 0.2659ms | 0.1032ms | 9.6894 KOps/s | 9.3220 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4287ms | 0.2211ms | 4.5237 KOps/s | 4.4370 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2615ms | 0.1769ms | 5.6543 KOps/s | 5.6412 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1954ms | 0.1541ms | 6.4887 KOps/s | 6.4358 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2686ms | 0.2021ms | 4.9479 KOps/s | 4.9076 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 48.9210μs | 23.6460μs | 42.2904 KOps/s | 43.1622 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 78.1920μs | 49.5595μs | 20.1778 KOps/s | 20.7579 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1144ms | 73.3449μs | 13.6342 KOps/s | 13.5999 KOps/s | |
test_compile_copy_nested[pytree-eager] | 80.0610μs | 59.7330μs | 16.7412 KOps/s | 16.7624 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4960ms | 0.3441ms | 2.9061 KOps/s | 2.8659 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2729ms | 0.2247ms | 4.4498 KOps/s | 4.4537 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1775ms | 0.1379ms | 7.2533 KOps/s | 7.1761 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1327ms | 65.2753μs | 15.3197 KOps/s | 15.5154 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4152ms | 0.3416ms | 2.9272 KOps/s | 2.8788 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7212ms | 0.6673ms | 1.4987 KOps/s | 1.4884 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3143ms | 0.2718ms | 3.6788 KOps/s | 3.6417 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3923ms | 0.3447ms | 2.9007 KOps/s | 2.8501 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1689ms | 76.1347μs | 13.1346 KOps/s | 13.2211 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1771ms | 0.1397ms | 7.1594 KOps/s | 7.1389 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6886ms | 0.5704ms | 1.7531 KOps/s | 1.7450 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3833ms | 0.3418ms | 2.9255 KOps/s | 2.8827 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 44.1310μs | 20.1386μs | 49.6560 KOps/s | 50.7204 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1513ms | 32.5220μs | 30.7484 KOps/s | 31.2298 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1153ms | 76.9525μs | 12.9950 KOps/s | 13.1294 KOps/s | |
test_compile_copy_flat[pytree-eager] | 93.6120μs | 60.3857μs | 16.5602 KOps/s | 16.4536 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.5641ms | 0.8775ms | 1.1395 KOps/s | 1.0423 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.7968ms | 3.4812ms | 287.2551 Ops/s | 287.4600 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.4884ms | 0.8594ms | 1.1636 KOps/s | 1.0525 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.7469ms | 3.5208ms | 284.0233 Ops/s | 287.0479 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1513ms | 0.1184ms | 8.4440 KOps/s | 8.4085 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2144ms | 69.5715μs | 14.3737 KOps/s | 14.9342 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1521ms | 0.1101ms | 9.0843 KOps/s | 8.9699 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1435ms | 47.4631μs | 21.0690 KOps/s | 20.8584 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1550ms | 0.1088ms | 9.1903 KOps/s | 9.0400 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 71.5510μs | 47.1426μs | 21.2122 KOps/s | 20.9632 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1913ms | 0.1488ms | 6.7225 KOps/s | 6.7211 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1798ms | 27.7174μs | 36.0784 KOps/s | 35.6058 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1713ms | 0.1408ms | 7.1033 KOps/s | 7.0814 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 45.6000μs | 23.8710μs | 41.8918 KOps/s | 41.9993 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1799ms | 0.1397ms | 7.1595 KOps/s | 7.0615 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 51.3700μs | 23.7816μs | 42.0494 KOps/s | 40.9548 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1940ms | 0.1474ms | 6.7837 KOps/s | 6.7661 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4699ms | 27.8904μs | 35.8546 KOps/s | 35.4663 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2340ms | 0.1403ms | 7.1301 KOps/s | 7.1206 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 56.3610μs | 24.0548μs | 41.5717 KOps/s | 41.9564 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1760ms | 0.1402ms | 7.1343 KOps/s | 7.0995 KOps/s | |
test_compile_indexing[int-pytree-eager] | 45.0910μs | 23.7715μs | 42.0671 KOps/s | 41.8696 KOps/s | |
test_mod_add[eager] | 64.5210μs | 34.2983μs | 29.1560 KOps/s | 28.2100 KOps/s | |
test_mod_add[compile] | 0.1361ms | 82.1726μs | 12.1695 KOps/s | 13.0444 KOps/s | |
test_mod_add[compile-overhead] | 0.2684ms | 0.1421ms | 7.0356 KOps/s | 6.2675 KOps/s | |
test_mod_wrap[eager] | 0.3488ms | 0.2492ms | 4.0127 KOps/s | 3.9896 KOps/s | |
test_mod_wrap[compile] | 1.2070ms | 0.3147ms | 3.1774 KOps/s | 3.1970 KOps/s | |
test_mod_wrap[compile-overhead] | 8.4453ms | 4.3254ms | 231.1934 Ops/s | 228.9345 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5033ms | 1.3896ms | 719.6333 Ops/s | 711.1381 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5712ms | 1.3941ms | 717.2874 Ops/s | 669.0471 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3571ms | 0.9286ms | 1.0769 KOps/s | 964.0786 Ops/s | |
test_seq_add[eager] | 0.1557ms | 0.1020ms | 9.8070 KOps/s | 9.6154 KOps/s | |
test_seq_add[compile] | 0.1343ms | 88.2075μs | 11.3369 KOps/s | 11.1208 KOps/s | |
test_seq_add[compile-overhead] | 0.1658ms | 0.1262ms | 7.9210 KOps/s | 8.0334 KOps/s | |
test_seq_wrap[eager] | 0.4525ms | 0.3943ms | 2.5364 KOps/s | 2.5207 KOps/s | |
test_seq_wrap[compile] | 0.3931ms | 0.3434ms | 2.9123 KOps/s | 3.0044 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2811ms | 0.2416ms | 4.1387 KOps/s | 4.1932 KOps/s | |
test_func_call_runtime[False-eager] | 0.8315ms | 0.7544ms | 1.3256 KOps/s | 1.3293 KOps/s | |
test_func_call_runtime[False-compile] | 1.0037ms | 0.8362ms | 1.1959 KOps/s | 1.1865 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4522ms | 0.3815ms | 2.6209 KOps/s | 2.5819 KOps/s | |
test_func_call_runtime[True-eager] | 1.0761ms | 0.9423ms | 1.0612 KOps/s | 1.0312 KOps/s | |
test_func_call_runtime[True-compile] | 0.9331ms | 0.8857ms | 1.1290 KOps/s | 1.0842 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4835ms | 0.4290ms | 2.3312 KOps/s | 2.3255 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8842ms | 0.7440ms | 1.3441 KOps/s | 1.3436 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8875ms | 0.8420ms | 1.1877 KOps/s | 1.1844 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4443ms | 0.3837ms | 2.6062 KOps/s | 2.5909 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1728ms | 1.0609ms | 942.5778 Ops/s | 944.1563 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1105ms | 1.0277ms | 973.0276 Ops/s | 962.7084 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2343ms | 1.0368ms | 964.4729 Ops/s | 963.6128 Ops/s | |
test_distributed | 2.4805ms | 73.2549μs | 13.6510 KOps/s | 13.8861 KOps/s | |
test_tdmodule | 39.3900μs | 15.8952μs | 62.9122 KOps/s | 60.6438 KOps/s | |
test_tdmodule_dispatch | 52.3610μs | 32.0309μs | 31.2199 KOps/s | 30.7030 KOps/s | |
test_tdseq | 31.3210μs | 16.4376μs | 60.8362 KOps/s | 58.5805 KOps/s | |
test_tdseq_dispatch | 66.3910μs | 34.1068μs | 29.3197 KOps/s | 28.8463 KOps/s | |
test_instantiation_functorch | 2.3482ms | 2.0388ms | 490.4964 Ops/s | 481.0264 Ops/s | |
test_instantiation_td | 2.0068ms | 1.3340ms | 749.6434 Ops/s | 744.5129 Ops/s | |
test_exec_functorch | 0.2497ms | 0.2245ms | 4.4542 KOps/s | 4.4750 KOps/s | |
test_exec_functional_call | 0.2441ms | 0.2176ms | 4.5952 KOps/s | 4.5750 KOps/s | |
test_exec_td | 0.2563ms | 0.2256ms | 4.4321 KOps/s | 4.4443 KOps/s | |
test_exec_td_decorator | 0.6190ms | 0.2779ms | 3.5980 KOps/s | 3.5747 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.7187ms | 0.6556ms | 1.5253 KOps/s | 1.5262 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.6774ms | 0.6506ms | 1.5371 KOps/s | 1.5300 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7192ms | 0.5718ms | 1.7489 KOps/s | 1.7590 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6692ms | 0.5696ms | 1.7556 KOps/s | 1.7480 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3202ms | 0.7060ms | 1.4164 KOps/s | 1.4216 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8398ms | 0.7063ms | 1.4159 KOps/s | 1.4236 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7413ms | 0.6170ms | 1.6208 KOps/s | 1.6331 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7487ms | 0.6146ms | 1.6271 KOps/s | 1.6238 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.8630ms | 8.7906ms | 113.7581 Ops/s | 113.0776 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.8234ms | 8.7750ms | 113.9601 Ops/s | 113.7765 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.8218ms | 8.7128ms | 114.7740 Ops/s | 114.7009 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.7451ms | 8.6837ms | 115.1585 Ops/s | 114.3497 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 21.4952ms | 20.8572ms | 47.9451 Ops/s | 48.0999 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.9223ms | 20.8570ms | 47.9456 Ops/s | 47.9779 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 20.8234ms | 20.7621ms | 48.1648 Ops/s | 48.5177 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.7776ms | 20.6805ms | 48.3548 Ops/s | 48.3153 Ops/s | |
test_to_module_speed[True] | 2.3279ms | 1.1414ms | 876.0853 Ops/s | 869.6012 Ops/s | |
test_to_module_speed[False] | 1.5806ms | 1.1276ms | 886.8425 Ops/s | 901.3051 Ops/s | |
test_tc_init | 57.8110μs | 39.3928μs | 25.3854 KOps/s | 25.4355 KOps/s | |
test_tc_init_nested | 98.3110μs | 77.0079μs | 12.9857 KOps/s | 12.9535 KOps/s | |
test_tc_first_layer_tensor | 5.8102μs | 0.7897μs | 1.2664 MOps/s | 1.2651 MOps/s | |
test_tc_first_layer_nontensor | 21.7800μs | 2.5293μs | 395.3630 KOps/s | 397.7465 KOps/s | |
test_tc_second_layer_tensor | 6.7167μs | 1.6243μs | 615.6366 KOps/s | 627.7405 KOps/s | |
test_tc_second_layer_nontensor | 18.4210μs | 3.3884μs | 295.1205 KOps/s | 300.5481 KOps/s | |
test_unbind | 0.1845s | 10.4854ms | 95.3703 Ops/s | 63.3532 Ops/s | |
test_full_like | 0.6568ms | 0.5775ms | 1.7315 KOps/s | 1.7336 KOps/s | |
test_zeros_like | 0.2701ms | 0.1979ms | 5.0536 KOps/s | 5.0563 KOps/s | |
test_ones_like | 0.2233ms | 0.1976ms | 5.0606 KOps/s | 5.0634 KOps/s | |
test_clone | 0.4577ms | 0.4145ms | 2.4124 KOps/s | 2.4209 KOps/s | |
test_squeeze | 28.0900μs | 11.0665μs | 90.3629 KOps/s | 88.3244 KOps/s | |
test_unsqueeze | 0.2440ms | 80.1677μs | 12.4738 KOps/s | 11.7321 KOps/s | |
test_split | 0.1699s | 0.2216ms | 4.5124 KOps/s | 5.5895 KOps/s | |
test_permute | 0.2887ms | 0.1945ms | 5.1420 KOps/s | 5.0706 KOps/s | |
test_stack | 1.2502ms | 0.9054ms | 1.1045 KOps/s | 1.0915 KOps/s | |
test_cat | 1.2515ms | 1.2310ms | 812.3657 Ops/s | 811.7371 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Quality
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.