-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Fix select in tensorclass #936
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Jul 31, 2024
3 tasks
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 52.0770μs | 21.9470μs | 45.5643 KOps/s | 44.4902 KOps/s | |
test_plain_set_stack_nested | 56.1750μs | 21.9587μs | 45.5400 KOps/s | 44.1335 KOps/s | |
test_plain_set_nested_inplace | 98.1170μs | 23.8015μs | 42.0141 KOps/s | 40.8042 KOps/s | |
test_plain_set_stack_nested_inplace | 78.7240μs | 23.7970μs | 42.0222 KOps/s | 40.9643 KOps/s | |
test_items | 29.6150μs | 2.6374μs | 379.1660 KOps/s | 363.7008 KOps/s | |
test_items_nested | 0.5018ms | 0.3337ms | 2.9966 KOps/s | 2.9950 KOps/s | |
test_items_nested_locked | 0.4243ms | 0.3350ms | 2.9855 KOps/s | 2.9687 KOps/s | |
test_items_nested_leaf | 1.6059ms | 83.4125μs | 11.9886 KOps/s | 12.0789 KOps/s | |
test_items_stack_nested | 0.4804ms | 0.3355ms | 2.9807 KOps/s | 2.9591 KOps/s | |
test_items_stack_nested_leaf | 0.1660ms | 84.4181μs | 11.8458 KOps/s | 11.7074 KOps/s | |
test_items_stack_nested_locked | 0.4856ms | 0.3363ms | 2.9735 KOps/s | 2.9888 KOps/s | |
test_keys | 26.8410μs | 3.9002μs | 256.3990 KOps/s | 253.2525 KOps/s | |
test_keys_nested | 0.2803ms | 0.1461ms | 6.8432 KOps/s | 6.9978 KOps/s | |
test_keys_nested_locked | 0.7157ms | 0.1523ms | 6.5678 KOps/s | 6.7137 KOps/s | |
test_keys_nested_leaf | 0.2812ms | 0.1266ms | 7.8974 KOps/s | 8.1158 KOps/s | |
test_keys_stack_nested | 0.2536ms | 0.1461ms | 6.8447 KOps/s | 6.9732 KOps/s | |
test_keys_stack_nested_leaf | 0.2654ms | 0.1255ms | 7.9666 KOps/s | 8.0695 KOps/s | |
test_keys_stack_nested_locked | 0.2942ms | 0.1528ms | 6.5450 KOps/s | 6.7420 KOps/s | |
test_values | 9.1130μs | 1.1386μs | 878.2471 KOps/s | 855.6443 KOps/s | |
test_values_nested | 89.4970μs | 50.3334μs | 19.8675 KOps/s | 19.9923 KOps/s | |
test_values_nested_locked | 0.1067ms | 50.1823μs | 19.9273 KOps/s | 19.8553 KOps/s | |
test_values_nested_leaf | 0.1186ms | 45.0117μs | 22.2164 KOps/s | 22.2823 KOps/s | |
test_values_stack_nested | 93.4740μs | 51.2065μs | 19.5288 KOps/s | 19.5882 KOps/s | |
test_values_stack_nested_leaf | 93.0030μs | 45.1436μs | 22.1515 KOps/s | 22.2407 KOps/s | |
test_values_stack_nested_locked | 0.1034ms | 51.3128μs | 19.4883 KOps/s | 19.7985 KOps/s | |
test_membership | 4.9450μs | 0.7640μs | 1.3089 MOps/s | 1.3275 MOps/s | |
test_membership_nested | 44.4330μs | 2.6112μs | 382.9719 KOps/s | 371.2301 KOps/s | |
test_membership_nested_leaf | 30.0250μs | 2.6472μs | 377.7621 KOps/s | 373.2733 KOps/s | |
test_membership_stacked_nested | 40.7660μs | 2.6698μs | 374.5591 KOps/s | 378.3120 KOps/s | |
test_membership_stacked_nested_leaf | 39.1630μs | 2.6222μs | 381.3585 KOps/s | 371.8770 KOps/s | |
test_membership_nested_last | 47.5490μs | 3.8450μs | 260.0808 KOps/s | 254.3660 KOps/s | |
test_membership_nested_leaf_last | 30.7780μs | 3.8860μs | 257.3335 KOps/s | 251.9334 KOps/s | |
test_membership_stacked_nested_last | 34.1840μs | 3.8442μs | 260.1315 KOps/s | 146.7899 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.2260μs | 3.8473μs | 259.9225 KOps/s | 146.9112 KOps/s | |
test_nested_getleaf | 50.8250μs | 10.6105μs | 94.2461 KOps/s | 96.6404 KOps/s | |
test_nested_get | 55.2740μs | 10.1074μs | 98.9372 KOps/s | 102.4839 KOps/s | |
test_stacked_getleaf | 39.5840μs | 10.6024μs | 94.3182 KOps/s | 96.0577 KOps/s | |
test_stacked_get | 55.4240μs | 10.0384μs | 99.6174 KOps/s | 102.5996 KOps/s | |
test_nested_getitemleaf | 54.6520μs | 11.1021μs | 90.0727 KOps/s | 92.3723 KOps/s | |
test_nested_getitem | 53.6400μs | 10.3103μs | 96.9902 KOps/s | 101.3782 KOps/s | |
test_stacked_getitemleaf | 57.6570μs | 11.1377μs | 89.7853 KOps/s | 92.3567 KOps/s | |
test_stacked_getitem | 56.8660μs | 10.2006μs | 98.0332 KOps/s | 99.9158 KOps/s | |
test_lock_nested | 7.5928ms | 0.5031ms | 1.9876 KOps/s | 1.9792 KOps/s | |
test_lock_stack_nested | 0.9355ms | 0.4616ms | 2.1665 KOps/s | 2.1482 KOps/s | |
test_unlock_nested | 0.8792ms | 0.4172ms | 2.3970 KOps/s | 2.4001 KOps/s | |
test_unlock_stack_nested | 0.6952ms | 0.3766ms | 2.6555 KOps/s | 2.6328 KOps/s | |
test_flatten_speed | 0.5604ms | 0.1041ms | 9.6056 KOps/s | 9.5520 KOps/s | |
test_unflatten_speed | 0.5109ms | 0.4334ms | 2.3074 KOps/s | 2.3235 KOps/s | |
test_common_ops | 4.9600ms | 1.0782ms | 927.4601 Ops/s | 899.5678 Ops/s | |
test_creation | 15.7400μs | 2.0577μs | 485.9883 KOps/s | 471.1176 KOps/s | |
test_creation_empty | 60.7530μs | 18.0094μs | 55.5267 KOps/s | 52.2279 KOps/s | |
test_creation_nested_1 | 52.8180μs | 21.1579μs | 47.2636 KOps/s | 44.5479 KOps/s | |
test_creation_nested_2 | 67.6360μs | 24.8934μs | 40.1712 KOps/s | 38.4087 KOps/s | |
test_clone | 0.1031ms | 17.1212μs | 58.4071 KOps/s | 60.5747 KOps/s | |
test_getitem[int] | 1.0941ms | 16.8475μs | 59.3561 KOps/s | 59.6849 KOps/s | |
test_getitem[slice_int] | 0.1733ms | 32.6220μs | 30.6542 KOps/s | 31.3711 KOps/s | |
test_getitem[range] | 0.1789ms | 57.7364μs | 17.3201 KOps/s | 17.4206 KOps/s | |
test_getitem[tuple] | 0.1409ms | 26.9514μs | 37.1038 KOps/s | 38.4088 KOps/s | |
test_getitem[list] | 0.2900ms | 52.9845μs | 18.8734 KOps/s | 19.1434 KOps/s | |
test_setitem_dim[int] | 71.9940μs | 40.3816μs | 24.7638 KOps/s | 23.3980 KOps/s | |
test_setitem_dim[slice_int] | 0.1329ms | 71.9975μs | 13.8894 KOps/s | 13.7743 KOps/s | |
test_setitem_dim[range] | 0.1483ms | 93.2305μs | 10.7261 KOps/s | 10.6661 KOps/s | |
test_setitem_dim[tuple] | 0.1068ms | 57.7218μs | 17.3245 KOps/s | 16.5685 KOps/s | |
test_setitem | 85.6090μs | 29.8013μs | 33.5556 KOps/s | 33.0867 KOps/s | |
test_set | 96.2990μs | 28.8985μs | 34.6039 KOps/s | 33.7125 KOps/s | |
test_set_shared | 1.2603ms | 0.2147ms | 4.6567 KOps/s | 4.6826 KOps/s | |
test_update | 0.1414ms | 35.3911μs | 28.2557 KOps/s | 26.7912 KOps/s | |
test_update_nested | 0.1649ms | 45.8014μs | 21.8334 KOps/s | 21.2995 KOps/s | |
test_update__nested | 0.1038ms | 33.9428μs | 29.4613 KOps/s | 29.6550 KOps/s | |
test_set_nested | 0.1155ms | 31.3770μs | 31.8704 KOps/s | 31.7580 KOps/s | |
test_set_nested_new | 0.1438ms | 36.1729μs | 27.6450 KOps/s | 27.3931 KOps/s | |
test_select | 0.2044ms | 52.7849μs | 18.9448 KOps/s | 18.9482 KOps/s | |
test_select_nested | 0.1142ms | 58.9906μs | 16.9519 KOps/s | 17.0946 KOps/s | |
test_exclude_nested | 0.1445ms | 76.9004μs | 13.0038 KOps/s | 13.0698 KOps/s | |
test_empty[True] | 0.5447ms | 0.3191ms | 3.1335 KOps/s | 3.1195 KOps/s | |
test_empty[False] | 10.5572μs | 1.1621μs | 860.5225 KOps/s | 864.1645 KOps/s | |
test_unbind_speed | 0.5000ms | 0.3059ms | 3.2686 KOps/s | 3.1427 KOps/s | |
test_unbind_speed_stack0 | 0.4811ms | 0.3007ms | 3.3255 KOps/s | 3.2990 KOps/s | |
test_unbind_speed_stack1 | 83.5105ms | 0.7812ms | 1.2801 KOps/s | 1.3920 KOps/s | |
test_split | 2.2463ms | 2.0087ms | 497.8464 Ops/s | 469.1801 Ops/s | |
test_chunk | 86.9611ms | 2.3682ms | 422.2582 Ops/s | 463.7531 Ops/s | |
test_creation[device0] | 0.2291ms | 0.1175ms | 8.5101 KOps/s | 8.4264 KOps/s | |
test_creation_from_tensor | 4.8958ms | 0.1216ms | 8.2252 KOps/s | 8.4146 KOps/s | |
test_add_one[memmap_tensor0] | 0.1517ms | 7.7904μs | 128.3624 KOps/s | 125.8023 KOps/s | |
test_contiguous[memmap_tensor0] | 30.1460μs | 1.9828μs | 504.3427 KOps/s | 501.7016 KOps/s | |
test_stack[memmap_tensor0] | 53.7800μs | 5.7753μs | 173.1511 KOps/s | 172.6876 KOps/s | |
test_memmaptd_index | 1.0164ms | 0.4098ms | 2.4405 KOps/s | 2.3810 KOps/s | |
test_memmaptd_index_astensor | 0.7880ms | 0.4880ms | 2.0491 KOps/s | 1.9995 KOps/s | |
test_memmaptd_index_op | 1.8972ms | 1.0328ms | 968.2363 Ops/s | 923.0122 Ops/s | |
test_serialize_model | 0.1218s | 0.1174s | 8.5200 Ops/s | 7.4970 Ops/s | |
test_serialize_model_pickle | 0.4431s | 0.4002s | 2.4986 Ops/s | 2.4895 Ops/s | |
test_serialize_weights | 0.1967s | 0.1285s | 7.7823 Ops/s | 8.5795 Ops/s | |
test_serialize_weights_returnearly | 0.1801s | 0.1612s | 6.2024 Ops/s | 6.2320 Ops/s | |
test_serialize_weights_pickle | 1.2007s | 0.7112s | 1.4062 Ops/s | 2.5264 Ops/s | |
test_serialize_weights_filesystem | 0.1604s | 0.1421s | 7.0349 Ops/s | 6.9258 Ops/s | |
test_serialize_model_filesystem | 0.2154s | 0.1552s | 6.4444 Ops/s | 6.5181 Ops/s | |
test_reshape_pytree | 87.1730μs | 39.5317μs | 25.2962 KOps/s | 24.5560 KOps/s | |
test_reshape_td | 94.0850μs | 46.6770μs | 21.4238 KOps/s | 20.8300 KOps/s | |
test_view_pytree | 89.3560μs | 39.4436μs | 25.3526 KOps/s | 24.9554 KOps/s | |
test_view_td | 0.1059ms | 53.6221μs | 18.6490 KOps/s | 18.7984 KOps/s | |
test_unbind_pytree | 81.0210μs | 37.4985μs | 26.6677 KOps/s | 26.9451 KOps/s | |
test_unbind_td | 0.3841ms | 46.4682μs | 21.5201 KOps/s | 21.3411 KOps/s | |
test_split_pytree | 85.3690μs | 40.3958μs | 24.7550 KOps/s | 24.8594 KOps/s | |
test_split_td | 0.4822ms | 58.9614μs | 16.9603 KOps/s | 16.9095 KOps/s | |
test_add_pytree | 99.1950μs | 47.0014μs | 21.2760 KOps/s | 21.2116 KOps/s | |
test_add_td | 0.2707ms | 82.7190μs | 12.0891 KOps/s | 11.6020 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1879ms | 52.7556μs | 18.9553 KOps/s | 18.3173 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3768ms | 0.1887ms | 5.2986 KOps/s | 5.2590 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1323ms | 54.2072μs | 18.4477 KOps/s | 18.0467 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3571ms | 0.1486ms | 6.7302 KOps/s | 6.8297 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 54.4010μs | 20.7055μs | 48.2963 KOps/s | 49.0944 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1298ms | 64.4300μs | 15.5207 KOps/s | 15.3481 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1567ms | 79.4685μs | 12.5836 KOps/s | 12.5298 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1375ms | 72.8390μs | 13.7289 KOps/s | 13.8542 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2678ms | 0.1724ms | 5.8004 KOps/s | 5.6944 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4178ms | 0.1968ms | 5.0809 KOps/s | 5.1975 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 90.7990μs | 38.2330μs | 26.1554 KOps/s | 25.8701 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 1.0348ms | 69.7114μs | 14.3449 KOps/s | 14.0493 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2730ms | 0.1700ms | 5.8808 KOps/s | 5.8242 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3916ms | 0.2974ms | 3.3630 KOps/s | 3.3480 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4205ms | 0.2079ms | 4.8111 KOps/s | 4.8692 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3591ms | 0.1762ms | 5.6751 KOps/s | 5.6592 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.7622ms | 62.7760μs | 15.9296 KOps/s | 15.6483 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 82.7940μs | 39.5446μs | 25.2879 KOps/s | 24.9795 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4608ms | 0.2441ms | 4.0963 KOps/s | 4.0742 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2756ms | 0.1710ms | 5.8487 KOps/s | 5.7789 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1983ms | 0.1077ms | 9.2841 KOps/s | 9.2368 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1101ms | 56.6783μs | 17.6434 KOps/s | 17.5806 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1574ms | 80.8622μs | 12.3667 KOps/s | 12.4302 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1385ms | 71.0541μs | 14.0738 KOps/s | 13.7370 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3777ms | 0.1932ms | 5.1750 KOps/s | 5.2632 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.9078ms | 1.6498ms | 606.1255 Ops/s | 613.0225 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2576ms | 0.1887ms | 5.3007 KOps/s | 5.2829 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.8331ms | 1.1121ms | 899.1773 Ops/s | 909.4193 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.8267ms | 0.4216ms | 2.3718 KOps/s | 2.4009 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.0831ms | 3.7513ms | 266.5773 Ops/s | 255.6329 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 99.0750μs | 32.3318μs | 30.9293 KOps/s | 30.5503 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.7690ms | 49.1497μs | 20.3460 KOps/s | 20.2534 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2547ms | 27.9760μs | 35.7449 KOps/s | 34.5138 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2647ms | 31.3632μs | 31.8845 KOps/s | 31.8910 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1189ms | 28.2223μs | 35.4330 KOps/s | 34.1811 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1015ms | 30.9619μs | 32.2978 KOps/s | 31.8775 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1599ms | 70.8865μs | 14.1071 KOps/s | 13.8297 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3365ms | 28.1483μs | 35.5261 KOps/s | 36.0949 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1494ms | 66.0699μs | 15.1355 KOps/s | 14.8397 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1038ms | 25.5757μs | 39.0996 KOps/s | 41.5236 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1301ms | 66.3608μs | 15.0691 KOps/s | 14.8164 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 68.7990μs | 24.9557μs | 40.0709 KOps/s | 40.6654 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1763ms | 71.7145μs | 13.9442 KOps/s | 13.7050 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0644ms | 28.5164μs | 35.0675 KOps/s | 35.6623 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1997ms | 66.8737μs | 14.9536 KOps/s | 14.8128 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 81.0510μs | 25.3042μs | 39.5191 KOps/s | 41.4795 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1574ms | 66.5733μs | 15.0210 KOps/s | 14.9483 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.4620ms | 25.2161μs | 39.6571 KOps/s | 41.3872 KOps/s | |
test_mod_add[eager] | 64.6710μs | 25.5448μs | 39.1469 KOps/s | 39.3116 KOps/s | |
test_mod_add[compile] | 78.7270μs | 36.9414μs | 27.0699 KOps/s | 26.3771 KOps/s | |
test_mod_add[compile-overhead] | 84.1970μs | 35.7988μs | 27.9339 KOps/s | 26.5504 KOps/s | |
test_mod_wrap[eager] | 0.3441ms | 0.2083ms | 4.8015 KOps/s | 4.8590 KOps/s | |
test_mod_wrap[compile] | 1.4388ms | 0.2302ms | 4.3447 KOps/s | 4.2991 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4550ms | 0.2266ms | 4.4129 KOps/s | 4.3802 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.5926ms | 11.9062ms | 83.9896 Ops/s | 91.8072 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.1735ms | 11.9139ms | 83.9359 Ops/s | 88.9956 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 13.3237ms | 11.0477ms | 90.5167 Ops/s | 90.8803 Ops/s | |
test_seq_add[eager] | 0.1586ms | 88.2155μs | 11.3359 KOps/s | 11.3219 KOps/s | |
test_seq_add[compile] | 0.1933ms | 58.9782μs | 16.9554 KOps/s | 15.9865 KOps/s | |
test_seq_add[compile-overhead] | 0.1509ms | 58.1977μs | 17.1828 KOps/s | 16.6293 KOps/s | |
test_seq_wrap[eager] | 0.5567ms | 0.3685ms | 2.7140 KOps/s | 2.6111 KOps/s | |
test_seq_wrap[compile] | 0.3545ms | 0.2595ms | 3.8536 KOps/s | 3.7845 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3660ms | 0.2591ms | 3.8593 KOps/s | 3.7923 KOps/s | |
test_func_call_runtime[False-eager] | 0.9355ms | 0.5276ms | 1.8952 KOps/s | 1.9242 KOps/s | |
test_func_call_runtime[False-compile] | 0.6613ms | 0.4899ms | 2.0412 KOps/s | 1.9807 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8840ms | 0.4898ms | 2.0416 KOps/s | 1.9936 KOps/s | |
test_func_call_runtime[True-eager] | 1.5483ms | 0.7476ms | 1.3377 KOps/s | 1.3537 KOps/s | |
test_func_call_runtime[True-compile] | 0.6687ms | 0.5025ms | 1.9900 KOps/s | 1.9331 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 1.0320ms | 0.5059ms | 1.9767 KOps/s | 1.9256 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8333ms | 0.5194ms | 1.9252 KOps/s | 1.9151 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6667ms | 0.4888ms | 2.0460 KOps/s | 1.9725 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6524ms | 0.4922ms | 2.0319 KOps/s | 1.9693 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4105ms | 0.8747ms | 1.1432 KOps/s | 1.1391 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.1563ms | 0.8338ms | 1.1993 KOps/s | 1.1966 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9915ms | 0.8332ms | 1.2003 KOps/s | 1.1893 KOps/s | |
test_distributed | 0.3580ms | 0.1313ms | 7.6168 KOps/s | 7.4996 KOps/s | |
test_tdmodule | 31.2380μs | 17.2556μs | 57.9522 KOps/s | 57.2297 KOps/s | |
test_tdmodule_dispatch | 60.5130μs | 36.5755μs | 27.3407 KOps/s | 26.9453 KOps/s | |
test_tdseq | 37.5500μs | 18.5971μs | 53.7719 KOps/s | 50.5934 KOps/s | |
test_tdseq_dispatch | 69.8700μs | 39.5153μs | 25.3067 KOps/s | 24.4101 KOps/s | |
test_instantiation_functorch | 1.8927ms | 1.6327ms | 612.4757 Ops/s | 611.4943 Ops/s | |
test_instantiation_td | 1.7956ms | 1.1844ms | 844.2859 Ops/s | 843.8227 Ops/s | |
test_exec_functorch | 0.3189ms | 0.1806ms | 5.5380 KOps/s | 5.5130 KOps/s | |
test_exec_functional_call | 0.3104ms | 0.1686ms | 5.9321 KOps/s | 5.7319 KOps/s | |
test_exec_td | 0.2541ms | 0.1695ms | 5.8996 KOps/s | 5.8590 KOps/s | |
test_exec_td_decorator | 0.4789ms | 0.2204ms | 4.5368 KOps/s | 4.4273 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.7942ms | 0.5679ms | 1.7608 KOps/s | 1.7356 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8809ms | 0.5664ms | 1.7656 KOps/s | 1.7429 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6660ms | 0.4685ms | 2.1346 KOps/s | 2.1170 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.8409ms | 0.4723ms | 2.1173 KOps/s | 2.1255 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.4417ms | 0.6183ms | 1.6175 KOps/s | 1.5895 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9359ms | 0.6239ms | 1.6027 KOps/s | 1.5474 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7058ms | 0.5146ms | 1.9433 KOps/s | 1.9155 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8245ms | 0.5161ms | 1.9375 KOps/s | 1.9370 KOps/s | |
test_to_module_speed[True] | 1.9938ms | 1.3230ms | 755.8719 Ops/s | 760.2576 Ops/s | |
test_to_module_speed[False] | 2.1656ms | 1.3168ms | 759.4327 Ops/s | 770.3819 Ops/s | |
test_tc_init | 93.2540μs | 45.5445μs | 21.9566 KOps/s | 22.5715 KOps/s | |
test_tc_init_nested | 0.1624ms | 92.1920μs | 10.8469 KOps/s | 11.2361 KOps/s | |
test_tc_first_layer_tensor | 37.7800μs | 1.4615μs | 684.2091 KOps/s | 706.8873 KOps/s | |
test_tc_first_layer_nontensor | 19.4870μs | 4.2976μs | 232.6859 KOps/s | 236.2360 KOps/s | |
test_tc_second_layer_tensor | 42.9000μs | 2.7395μs | 365.0294 KOps/s | 372.9140 KOps/s | |
test_tc_second_layer_nontensor | 28.9950μs | 5.5368μs | 180.6095 KOps/s | 183.1077 KOps/s | |
test_unbind | 0.4410s | 13.6270ms | 73.3840 Ops/s | 73.7560 Ops/s | |
test_full_like | 9.0091ms | 7.1748ms | 139.3765 Ops/s | 139.3740 Ops/s | |
test_zeros_like | 16.3914ms | 6.5801ms | 151.9734 Ops/s | 140.1306 Ops/s | |
test_ones_like | 17.1997ms | 7.6742ms | 130.3072 Ops/s | 130.8723 Ops/s | |
test_clone | 13.5129ms | 9.1482ms | 109.3112 Ops/s | 105.4852 Ops/s | |
test_squeeze | 68.3570μs | 13.1840μs | 75.8494 KOps/s | 75.7340 KOps/s | |
test_unsqueeze | 0.2115ms | 96.5907μs | 10.3530 KOps/s | 10.6330 KOps/s | |
test_split | 0.4476ms | 0.2017ms | 4.9576 KOps/s | 4.9804 KOps/s | |
test_permute | 0.4515ms | 0.2249ms | 4.4454 KOps/s | 4.5871 KOps/s | |
test_stack | 27.7125ms | 24.5815ms | 40.6811 Ops/s | 39.9907 Ops/s | |
test_cat | 32.1648ms | 24.6465ms | 40.5736 Ops/s | 40.2926 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.1499ms | 17.4741μs | 57.2277 KOps/s | 55.9820 KOps/s | |
test_plain_set_stack_nested | 37.5500μs | 17.6734μs | 56.5823 KOps/s | 56.0095 KOps/s | |
test_plain_set_nested_inplace | 34.1500μs | 18.8217μs | 53.1302 KOps/s | 52.8210 KOps/s | |
test_plain_set_stack_nested_inplace | 43.9210μs | 18.8053μs | 53.1766 KOps/s | 52.7128 KOps/s | |
test_items | 18.7000μs | 4.5977μs | 217.5019 KOps/s | 214.5303 KOps/s | |
test_items_nested | 0.4284ms | 0.3644ms | 2.7443 KOps/s | 2.7531 KOps/s | |
test_items_nested_locked | 0.4103ms | 0.3660ms | 2.7324 KOps/s | 2.7176 KOps/s | |
test_items_nested_leaf | 0.1067ms | 84.0407μs | 11.8990 KOps/s | 11.8223 KOps/s | |
test_items_stack_nested | 0.4174ms | 0.3693ms | 2.7080 KOps/s | 2.7191 KOps/s | |
test_items_stack_nested_leaf | 0.1097ms | 84.9758μs | 11.7681 KOps/s | 11.7580 KOps/s | |
test_items_stack_nested_locked | 0.4112ms | 0.3669ms | 2.7256 KOps/s | 2.7090 KOps/s | |
test_keys | 20.2200μs | 4.3610μs | 229.3027 KOps/s | 228.8720 KOps/s | |
test_keys_nested | 93.9820μs | 67.0612μs | 14.9118 KOps/s | 14.7033 KOps/s | |
test_keys_nested_locked | 0.6211ms | 73.2586μs | 13.6503 KOps/s | 13.6973 KOps/s | |
test_keys_nested_leaf | 81.2920μs | 57.3078μs | 17.4496 KOps/s | 17.1329 KOps/s | |
test_keys_stack_nested | 87.0620μs | 65.9474μs | 15.1636 KOps/s | 14.7671 KOps/s | |
test_keys_stack_nested_leaf | 72.5210μs | 57.2697μs | 17.4612 KOps/s | 16.9600 KOps/s | |
test_keys_stack_nested_locked | 89.0820μs | 72.2676μs | 13.8375 KOps/s | 13.7170 KOps/s | |
test_values | 6.5267μs | 1.7649μs | 566.6190 KOps/s | 567.7419 KOps/s | |
test_values_nested | 54.8510μs | 33.8972μs | 29.5010 KOps/s | 29.3252 KOps/s | |
test_values_nested_locked | 47.8210μs | 36.0146μs | 27.7665 KOps/s | 27.9277 KOps/s | |
test_values_nested_leaf | 44.6910μs | 30.0549μs | 33.2724 KOps/s | 33.0407 KOps/s | |
test_values_stack_nested | 54.1810μs | 34.6009μs | 28.9010 KOps/s | 28.8637 KOps/s | |
test_values_stack_nested_leaf | 49.8410μs | 30.6392μs | 32.6379 KOps/s | 32.5197 KOps/s | |
test_values_stack_nested_locked | 60.3410μs | 36.8923μs | 27.1059 KOps/s | 27.3168 KOps/s | |
test_membership | 1.2325μs | 0.5396μs | 1.8531 MOps/s | 1.8075 MOps/s | |
test_membership_nested | 17.1700μs | 2.0423μs | 489.6516 KOps/s | 519.0749 KOps/s | |
test_membership_nested_leaf | 9.9050μs | 1.9729μs | 506.8754 KOps/s | 510.5553 KOps/s | |
test_membership_stacked_nested | 22.3710μs | 2.0479μs | 488.3074 KOps/s | 493.5483 KOps/s | |
test_membership_stacked_nested_leaf | 17.8610μs | 2.0455μs | 488.8725 KOps/s | 499.6166 KOps/s | |
test_membership_nested_last | 25.9700μs | 3.0037μs | 332.9262 KOps/s | 344.7014 KOps/s | |
test_membership_nested_leaf_last | 20.9900μs | 2.9779μs | 335.8123 KOps/s | 344.2658 KOps/s | |
test_membership_stacked_nested_last | 23.0310μs | 3.0282μs | 330.2308 KOps/s | 233.2529 KOps/s | |
test_membership_stacked_nested_leaf_last | 21.8800μs | 2.9870μs | 334.7890 KOps/s | 230.9982 KOps/s | |
test_nested_getleaf | 21.1710μs | 7.9443μs | 125.8765 KOps/s | 125.5772 KOps/s | |
test_nested_get | 23.2600μs | 7.4874μs | 133.5572 KOps/s | 134.0722 KOps/s | |
test_stacked_getleaf | 25.6000μs | 7.9892μs | 125.1695 KOps/s | 125.6395 KOps/s | |
test_stacked_get | 24.5010μs | 7.5290μs | 132.8192 KOps/s | 133.8948 KOps/s | |
test_nested_getitemleaf | 30.0910μs | 8.1535μs | 122.6462 KOps/s | 123.0989 KOps/s | |
test_nested_getitem | 23.1210μs | 7.6161μs | 131.3006 KOps/s | 131.0138 KOps/s | |
test_stacked_getitemleaf | 35.2010μs | 8.1139μs | 123.2452 KOps/s | 122.3016 KOps/s | |
test_stacked_getitem | 24.5010μs | 7.6355μs | 130.9668 KOps/s | 131.0389 KOps/s | |
test_lock_nested | 10.1515ms | 0.4856ms | 2.0595 KOps/s | 2.1122 KOps/s | |
test_lock_stack_nested | 0.4898ms | 0.4387ms | 2.2792 KOps/s | 2.3222 KOps/s | |
test_unlock_nested | 0.8753ms | 0.3942ms | 2.5367 KOps/s | 2.5607 KOps/s | |
test_unlock_stack_nested | 0.3868ms | 0.3508ms | 2.8505 KOps/s | 2.8706 KOps/s | |
test_flatten_speed | 0.4712ms | 0.1040ms | 9.6183 KOps/s | 9.4698 KOps/s | |
test_unflatten_speed | 0.3248ms | 0.2830ms | 3.5331 KOps/s | 3.4415 KOps/s | |
test_common_ops | 1.5469ms | 1.3219ms | 756.4849 Ops/s | 751.1764 Ops/s | |
test_creation | 17.2400μs | 1.6445μs | 608.0979 KOps/s | 608.4865 KOps/s | |
test_creation_empty | 34.7610μs | 18.8816μs | 52.9616 KOps/s | 53.4345 KOps/s | |
test_creation_nested_1 | 37.9210μs | 20.4871μs | 48.8112 KOps/s | 47.3388 KOps/s | |
test_creation_nested_2 | 42.8610μs | 23.0359μs | 43.4106 KOps/s | 43.8259 KOps/s | |
test_clone | 47.1910μs | 29.0995μs | 34.3649 KOps/s | 34.6372 KOps/s | |
test_getitem[int] | 1.3512ms | 16.9172μs | 59.1113 KOps/s | 59.2984 KOps/s | |
test_getitem[slice_int] | 0.1613ms | 29.3969μs | 34.0172 KOps/s | 34.6710 KOps/s | |
test_getitem[range] | 0.2697ms | 0.1154ms | 8.6630 KOps/s | 8.7639 KOps/s | |
test_getitem[tuple] | 0.1539ms | 25.5274μs | 39.1735 KOps/s | 39.8931 KOps/s | |
test_getitem[list] | 92.5205ms | 0.1189ms | 8.4082 KOps/s | 9.6451 KOps/s | |
test_setitem_dim[int] | 76.0010μs | 55.2298μs | 18.1062 KOps/s | 17.6225 KOps/s | |
test_setitem_dim[slice_int] | 0.1109ms | 79.5622μs | 12.5688 KOps/s | 12.4873 KOps/s | |
test_setitem_dim[range] | 0.1701ms | 0.1428ms | 7.0020 KOps/s | 6.9660 KOps/s | |
test_setitem_dim[tuple] | 91.3720μs | 72.2342μs | 13.8439 KOps/s | 13.8378 KOps/s | |
test_setitem | 71.3620μs | 43.7273μs | 22.8690 KOps/s | 23.3722 KOps/s | |
test_set | 65.7110μs | 42.6761μs | 23.4323 KOps/s | 23.6475 KOps/s | |
test_set_shared | 0.3614ms | 53.0004μs | 18.8678 KOps/s | 18.5677 KOps/s | |
test_update | 82.2910μs | 53.4836μs | 18.6973 KOps/s | 18.5845 KOps/s | |
test_update_nested | 90.3520μs | 59.9166μs | 16.6899 KOps/s | 16.1499 KOps/s | |
test_update__nested | 83.4720μs | 60.0065μs | 16.6649 KOps/s | 16.6352 KOps/s | |
test_set_nested | 69.6910μs | 45.4732μs | 21.9910 KOps/s | 22.5033 KOps/s | |
test_set_nested_new | 74.0410μs | 48.9397μs | 20.4333 KOps/s | 20.3172 KOps/s | |
test_select | 0.2068ms | 64.7622μs | 15.4411 KOps/s | 15.3643 KOps/s | |
test_select_nested | 0.4866ms | 52.8514μs | 18.9210 KOps/s | 19.1444 KOps/s | |
test_exclude_nested | 85.0920μs | 68.6936μs | 14.5574 KOps/s | 14.0242 KOps/s | |
test_empty[True] | 0.2978ms | 0.2799ms | 3.5723 KOps/s | 3.5357 KOps/s | |
test_empty[False] | 2.4490μs | 0.8822μs | 1.1336 MOps/s | 1.1431 MOps/s | |
test_to | 63.4620μs | 41.3600μs | 24.1779 KOps/s | 24.4576 KOps/s | |
test_to_nonblocking | 44.6210μs | 25.4695μs | 39.2626 KOps/s | 38.3071 KOps/s | |
test_unbind_speed | 1.2810ms | 0.3057ms | 3.2716 KOps/s | 3.3312 KOps/s | |
test_unbind_speed_stack0 | 0.3386ms | 0.3019ms | 3.3125 KOps/s | 3.3612 KOps/s | |
test_unbind_speed_stack1 | 90.8352ms | 0.7703ms | 1.2982 KOps/s | 1.4266 KOps/s | |
test_split | 91.9636ms | 2.3627ms | 423.2377 Ops/s | 425.0512 Ops/s | |
test_chunk | 2.3626ms | 2.1608ms | 462.7924 Ops/s | 423.0125 Ops/s | |
test_creation[device0] | 0.1656ms | 0.1059ms | 9.4389 KOps/s | 9.4724 KOps/s | |
test_creation_from_tensor | 0.1601ms | 0.1054ms | 9.4903 KOps/s | 9.7299 KOps/s | |
test_add_one[memmap_tensor0] | 0.1017ms | 8.8901μs | 112.4843 KOps/s | 115.3541 KOps/s | |
test_contiguous[memmap_tensor0] | 18.8910μs | 2.2455μs | 445.3255 KOps/s | 452.4062 KOps/s | |
test_stack[memmap_tensor0] | 23.9800μs | 6.5940μs | 151.6520 KOps/s | 149.7723 KOps/s | |
test_memmaptd_index | 1.1230ms | 0.4376ms | 2.2852 KOps/s | 2.2792 KOps/s | |
test_memmaptd_index_astensor | 94.5487ms | 0.5808ms | 1.7218 KOps/s | 1.9709 KOps/s | |
test_memmaptd_index_op | 1.5083ms | 1.0922ms | 915.6062 Ops/s | 906.8555 Ops/s | |
test_serialize_model | 92.4336ms | 89.0335ms | 11.2317 Ops/s | 10.8196 Ops/s | |
test_serialize_model_pickle | 1.3468s | 1.2364s | 0.8088 Ops/s | 0.8068 Ops/s | |
test_serialize_weights | 0.1828s | 96.4963ms | 10.3631 Ops/s | 9.8727 Ops/s | |
test_serialize_weights_returnearly | 70.7002ms | 56.3049ms | 17.7604 Ops/s | 14.6225 Ops/s | |
test_serialize_weights_pickle | 1.4118s | 1.1825s | 0.8457 Ops/s | 0.8081 Ops/s | |
test_reshape_pytree | 86.3220μs | 38.9519μs | 25.6727 KOps/s | 26.5679 KOps/s | |
test_reshape_td | 80.6620μs | 45.1282μs | 22.1591 KOps/s | 22.4542 KOps/s | |
test_view_pytree | 71.5820μs | 38.3302μs | 26.0891 KOps/s | 25.8361 KOps/s | |
test_view_td | 86.8510μs | 50.5880μs | 19.7675 KOps/s | 19.6833 KOps/s | |
test_unbind_pytree | 71.8410μs | 37.4368μs | 26.7117 KOps/s | 26.9370 KOps/s | |
test_unbind_td | 0.4057ms | 46.0428μs | 21.7189 KOps/s | 21.4309 KOps/s | |
test_split_pytree | 87.4120μs | 51.5234μs | 19.4087 KOps/s | 19.5540 KOps/s | |
test_split_td | 0.5132ms | 61.9861μs | 16.1326 KOps/s | 16.3964 KOps/s | |
test_add_pytree | 99.3420μs | 61.5114μs | 16.2571 KOps/s | 16.7677 KOps/s | |
test_add_td | 0.1416ms | 98.5422μs | 10.1479 KOps/s | 10.0082 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4185ms | 0.2179ms | 4.5890 KOps/s | 4.6122 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2687ms | 0.1748ms | 5.7218 KOps/s | 5.7120 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2230ms | 0.1498ms | 6.6771 KOps/s | 6.6864 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2622ms | 0.1903ms | 5.2540 KOps/s | 5.1439 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 57.0820μs | 23.0528μs | 43.3787 KOps/s | 42.7418 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 78.0910μs | 47.6462μs | 20.9881 KOps/s | 20.6420 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1203ms | 72.7823μs | 13.7396 KOps/s | 13.8214 KOps/s | |
test_compile_copy_nested[pytree-eager] | 88.6920μs | 59.3325μs | 16.8542 KOps/s | 16.9617 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.5224ms | 0.3345ms | 2.9897 KOps/s | 2.9719 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2835ms | 0.2215ms | 4.5137 KOps/s | 4.5150 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1858ms | 0.1319ms | 7.5789 KOps/s | 7.4831 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1206ms | 63.1224μs | 15.8422 KOps/s | 15.8968 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3789ms | 0.3347ms | 2.9876 KOps/s | 2.9713 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6761ms | 0.6275ms | 1.5937 KOps/s | 1.5681 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3186ms | 0.2687ms | 3.7217 KOps/s | 3.6863 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3770ms | 0.3367ms | 2.9701 KOps/s | 2.9581 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1518ms | 74.4014μs | 13.4406 KOps/s | 13.4592 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1791ms | 0.1339ms | 7.4680 KOps/s | 7.4621 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6776ms | 0.5295ms | 1.8885 KOps/s | 1.8366 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3922ms | 0.3350ms | 2.9847 KOps/s | 2.9858 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 44.0110μs | 18.9850μs | 52.6731 KOps/s | 52.2414 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 66.8610μs | 31.8502μs | 31.3970 KOps/s | 30.9612 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1037ms | 76.9411μs | 12.9970 KOps/s | 13.0107 KOps/s | |
test_compile_copy_flat[pytree-eager] | 92.9020μs | 60.3061μs | 16.5821 KOps/s | 16.5414 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.5302ms | 0.8669ms | 1.1535 KOps/s | 1.0528 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.4087ms | 3.3113ms | 301.9922 Ops/s | 299.5616 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.4631ms | 0.8483ms | 1.1789 KOps/s | 1.0694 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.3944ms | 3.3320ms | 300.1168 Ops/s | 302.3017 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1526ms | 0.1133ms | 8.8234 KOps/s | 8.4980 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2068ms | 61.7818μs | 16.1860 KOps/s | 14.8395 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2684ms | 0.1057ms | 9.4592 KOps/s | 8.9486 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 98.3720μs | 45.0875μs | 22.1791 KOps/s | 20.6069 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1523ms | 0.1059ms | 9.4389 KOps/s | 9.1081 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 87.4320μs | 45.1335μs | 22.1565 KOps/s | 20.8189 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1902ms | 0.1417ms | 7.0561 KOps/s | 6.9837 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1777ms | 26.5305μs | 37.6924 KOps/s | 36.6108 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1641ms | 0.1333ms | 7.5007 KOps/s | 7.1658 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 61.2920μs | 23.0892μs | 43.3103 KOps/s | 43.2080 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1755ms | 0.1334ms | 7.4981 KOps/s | 7.1653 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 63.6110μs | 23.1174μs | 43.2574 KOps/s | 43.3284 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2039ms | 0.1403ms | 7.1288 KOps/s | 6.8340 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4896ms | 26.3987μs | 37.8806 KOps/s | 35.4818 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2902ms | 0.1330ms | 7.5167 KOps/s | 7.1375 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 55.8310μs | 22.6372μs | 44.1751 KOps/s | 42.8677 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1789ms | 0.1330ms | 7.5183 KOps/s | 7.1945 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.3435ms | 22.9129μs | 43.6435 KOps/s | 41.9699 KOps/s | |
test_mod_add[eager] | 76.1520μs | 38.7596μs | 25.8000 KOps/s | 23.6527 KOps/s | |
test_mod_add[compile] | 0.1194ms | 70.0402μs | 14.2775 KOps/s | 13.5313 KOps/s | |
test_mod_add[compile-overhead] | 0.2605ms | 0.1372ms | 7.2864 KOps/s | 6.6044 KOps/s | |
test_mod_wrap[eager] | 0.3256ms | 0.2517ms | 3.9735 KOps/s | 3.6513 KOps/s | |
test_mod_wrap[compile] | 1.0767ms | 0.2904ms | 3.4430 KOps/s | 3.2206 KOps/s | |
test_mod_wrap[compile-overhead] | 8.6587ms | 4.5998ms | 217.4025 Ops/s | 221.6111 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.9295ms | 1.3484ms | 741.5996 Ops/s | 688.2880 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4531ms | 1.3438ms | 744.1368 Ops/s | 734.4777 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3497ms | 0.9021ms | 1.1085 KOps/s | 1.0848 KOps/s | |
test_seq_add[eager] | 0.1695ms | 0.1160ms | 8.6218 KOps/s | 8.7607 KOps/s | |
test_seq_add[compile] | 0.1347ms | 87.8789μs | 11.3793 KOps/s | 11.4606 KOps/s | |
test_seq_add[compile-overhead] | 0.1818ms | 0.1247ms | 8.0174 KOps/s | 7.9745 KOps/s | |
test_seq_wrap[eager] | 0.4998ms | 0.4351ms | 2.2984 KOps/s | 2.3176 KOps/s | |
test_seq_wrap[compile] | 0.4008ms | 0.3224ms | 3.1018 KOps/s | 3.0671 KOps/s | |
test_seq_wrap[compile-overhead] | 0.1931s | 89.4611ms | 11.1780 Ops/s | 7.8815 Ops/s | |
test_func_call_runtime[False-eager] | 0.8153ms | 0.7344ms | 1.3617 KOps/s | 1.3263 KOps/s | |
test_func_call_runtime[False-compile] | 0.8563ms | 0.8063ms | 1.2402 KOps/s | 1.2216 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4300ms | 0.3671ms | 2.7242 KOps/s | 2.6850 KOps/s | |
test_func_call_runtime[True-eager] | 1.0268ms | 0.9416ms | 1.0620 KOps/s | 1.0460 KOps/s | |
test_func_call_runtime[True-compile] | 0.9662ms | 0.8826ms | 1.1330 KOps/s | 1.1553 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5089ms | 0.4151ms | 2.4089 KOps/s | 2.4086 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9608ms | 0.7776ms | 1.2860 KOps/s | 1.2685 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9232ms | 0.8399ms | 1.1906 KOps/s | 1.1911 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4193ms | 0.3687ms | 2.7120 KOps/s | 2.6945 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.3385ms | 1.0567ms | 946.3568 Ops/s | 941.8371 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.2708ms | 1.0460ms | 956.0113 Ops/s | 969.2348 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2896ms | 1.0591ms | 944.2031 Ops/s | 969.1661 Ops/s | |
test_distributed | 0.2311ms | 73.2436μs | 13.6531 KOps/s | 13.8513 KOps/s | |
test_tdmodule | 45.7310μs | 17.2849μs | 57.8540 KOps/s | 59.4348 KOps/s | |
test_tdmodule_dispatch | 54.8920μs | 35.1378μs | 28.4594 KOps/s | 28.7113 KOps/s | |
test_tdseq | 35.2210μs | 18.0490μs | 55.4047 KOps/s | 56.1456 KOps/s | |
test_tdseq_dispatch | 58.3210μs | 37.0883μs | 26.9627 KOps/s | 27.4965 KOps/s | |
test_instantiation_functorch | 2.1753ms | 2.0518ms | 487.3878 Ops/s | 488.5806 Ops/s | |
test_instantiation_td | 1.9705ms | 1.3059ms | 765.7325 Ops/s | 762.7968 Ops/s | |
test_exec_functorch | 0.2812ms | 0.2185ms | 4.5771 KOps/s | 4.4835 KOps/s | |
test_exec_functional_call | 0.2782ms | 0.2249ms | 4.4457 KOps/s | 4.5586 KOps/s | |
test_exec_td | 0.2905ms | 0.2360ms | 4.2381 KOps/s | 4.4067 KOps/s | |
test_exec_td_decorator | 1.0356ms | 0.2791ms | 3.5829 KOps/s | 3.5711 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8867ms | 0.6525ms | 1.5325 KOps/s | 1.5013 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8048ms | 0.6679ms | 1.4973 KOps/s | 1.4846 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6224ms | 0.5664ms | 1.7656 KOps/s | 1.7339 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6497ms | 0.5733ms | 1.7444 KOps/s | 1.7580 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3045ms | 0.7022ms | 1.4242 KOps/s | 1.4147 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8951ms | 0.7036ms | 1.4212 KOps/s | 1.4098 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7731ms | 0.6103ms | 1.6385 KOps/s | 1.6222 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7359ms | 0.6104ms | 1.6382 KOps/s | 1.6225 KOps/s | |
test_vmap_transformer_speed[True-True] | 9.0724ms | 8.6720ms | 115.3140 Ops/s | 114.2865 Ops/s | |
test_vmap_transformer_speed[True-False] | 9.8977ms | 8.6291ms | 115.8869 Ops/s | 114.5828 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.6399ms | 8.5514ms | 116.9400 Ops/s | 115.8787 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.8039ms | 8.5467ms | 117.0045 Ops/s | 115.6200 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 21.3312ms | 20.7869ms | 48.1073 Ops/s | 48.7263 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.6194ms | 20.5393ms | 48.6873 Ops/s | 48.5864 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 22.0380ms | 20.3970ms | 49.0269 Ops/s | 48.9890 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 21.0909ms | 20.3981ms | 49.0242 Ops/s | 49.0886 Ops/s | |
test_to_module_speed[True] | 1.2445ms | 1.1413ms | 876.1691 Ops/s | 866.8869 Ops/s | |
test_to_module_speed[False] | 1.2240ms | 1.1153ms | 896.6115 Ops/s | 900.4922 Ops/s | |
test_tc_init | 65.2320μs | 41.8784μs | 23.8786 KOps/s | 24.9033 KOps/s | |
test_tc_init_nested | 0.1237ms | 83.7994μs | 11.9333 KOps/s | 12.5145 KOps/s | |
test_tc_first_layer_tensor | 19.4988μs | 0.7856μs | 1.2729 MOps/s | 1.2633 MOps/s | |
test_tc_first_layer_nontensor | 23.2110μs | 2.5729μs | 388.6695 KOps/s | 397.5664 KOps/s | |
test_tc_second_layer_tensor | 9.3403μs | 1.5998μs | 625.0773 KOps/s | 614.2890 KOps/s | |
test_tc_second_layer_nontensor | 21.2410μs | 3.3970μs | 294.3742 KOps/s | 294.6410 KOps/s | |
test_unbind | 0.1820s | 12.2722ms | 81.4852 Ops/s | 63.5884 Ops/s | |
test_full_like | 0.6558ms | 0.5765ms | 1.7347 KOps/s | 1.7291 KOps/s | |
test_zeros_like | 0.2704ms | 0.1977ms | 5.0575 KOps/s | 5.0550 KOps/s | |
test_ones_like | 0.3655ms | 0.1975ms | 5.0622 KOps/s | 5.0575 KOps/s | |
test_clone | 0.4350ms | 0.4138ms | 2.4165 KOps/s | 2.4151 KOps/s | |
test_squeeze | 31.7810μs | 10.8864μs | 91.8580 KOps/s | 87.8315 KOps/s | |
test_unsqueeze | 0.2342ms | 79.4674μs | 12.5838 KOps/s | 12.6087 KOps/s | |
test_split | 0.4464ms | 0.1769ms | 5.6531 KOps/s | 5.6520 KOps/s | |
test_permute | 0.2312ms | 0.1908ms | 5.2418 KOps/s | 5.2853 KOps/s | |
test_stack | 1.2481ms | 0.9080ms | 1.1013 KOps/s | 1.1034 KOps/s | |
test_cat | 1.2564ms | 1.2314ms | 812.1009 Ops/s | 811.6734 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.