[BugFix] Resilient deterministic_sample for CompositeDist #1000
Merged
vmoens added a commit that referenced this pull request on Sep 17, 2024
ghstack-source-id: 9844f2db8f2c20928ec17e6f822b9729e82eced7
Pull Request resolved: #1000
facebook-github-bot added the CLA Signed label on Sep 17, 2024. This label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed.
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 39.9860μs | 20.8496μs | 47.9625 KOps/s | 47.8771 KOps/s | |
test_plain_set_stack_nested | 57.7190μs | 20.9416μs | 47.7519 KOps/s | 47.3602 KOps/s | |
test_plain_set_nested_inplace | 54.2530μs | 22.5223μs | 44.4005 KOps/s | 44.1538 KOps/s | |
test_plain_set_stack_nested_inplace | 53.7710μs | 22.4959μs | 44.4526 KOps/s | 44.0131 KOps/s | |
test_items | 45.0550μs | 4.1649μs | 240.1012 KOps/s | 242.7182 KOps/s | |
test_items_nested | 0.6800ms | 0.3564ms | 2.8059 KOps/s | 2.8005 KOps/s | |
test_items_nested_locked | 0.7138ms | 0.3593ms | 2.7830 KOps/s | 2.7907 KOps/s | |
test_items_nested_leaf | 0.1324ms | 69.1162μs | 14.4684 KOps/s | 14.5957 KOps/s | |
test_items_stack_nested | 0.6415ms | 0.3584ms | 2.7902 KOps/s | 2.7546 KOps/s | |
test_items_stack_nested_leaf | 0.1479ms | 70.7606μs | 14.1322 KOps/s | 14.1005 KOps/s | |
test_items_stack_nested_locked | 0.4950ms | 0.3602ms | 2.7760 KOps/s | 2.7662 KOps/s | |
test_keys | 30.7980μs | 3.5169μs | 284.3407 KOps/s | 275.7602 KOps/s | |
test_keys_nested | 0.1968ms | 0.1032ms | 9.6901 KOps/s | 10.0341 KOps/s | |
test_keys_nested_locked | 1.6080ms | 0.1086ms | 9.2075 KOps/s | 9.4064 KOps/s | |
test_keys_nested_leaf | 0.1635ms | 85.8466μs | 11.6487 KOps/s | 11.8947 KOps/s | |
test_keys_stack_nested | 0.1744ms | 0.1033ms | 9.6833 KOps/s | 10.0150 KOps/s | |
test_keys_stack_nested_leaf | 0.1425ms | 85.1189μs | 11.7483 KOps/s | 12.0688 KOps/s | |
test_keys_stack_nested_locked | 0.2282ms | 0.1093ms | 9.1458 KOps/s | 9.4354 KOps/s | |
test_values | 6.0614μs | 1.0881μs | 918.9916 KOps/s | 890.3339 KOps/s | |
test_values_nested | 0.1569ms | 74.3594μs | 13.4482 KOps/s | 13.6877 KOps/s | |
test_values_nested_locked | 0.1292ms | 73.5556μs | 13.5952 KOps/s | 13.7356 KOps/s | |
test_values_nested_leaf | 0.1260ms | 62.9920μs | 15.8750 KOps/s | 15.9441 KOps/s | |
test_values_stack_nested | 0.1336ms | 75.1354μs | 13.3093 KOps/s | 13.5951 KOps/s | |
test_values_stack_nested_leaf | 0.1081ms | 62.8241μs | 15.9175 KOps/s | 16.4448 KOps/s | |
test_values_stack_nested_locked | 0.1362ms | 74.9968μs | 13.3339 KOps/s | 13.2781 KOps/s | |
test_membership | 4.2094μs | 0.7312μs | 1.3677 MOps/s | 1.4141 MOps/s | |
test_membership_nested | 28.5740μs | 2.7107μs | 368.9067 KOps/s | 372.0468 KOps/s | |
test_membership_nested_leaf | 27.4020μs | 2.7398μs | 364.9963 KOps/s | 371.5449 KOps/s | |
test_membership_stacked_nested | 30.8180μs | 2.7596μs | 362.3691 KOps/s | 371.1698 KOps/s | |
test_membership_stacked_nested_leaf | 23.7340μs | 2.7554μs | 362.9197 KOps/s | 374.2152 KOps/s | |
test_membership_nested_last | 39.5040μs | 3.8905μs | 257.0354 KOps/s | 257.9925 KOps/s | |
test_membership_nested_leaf_last | 33.1620μs | 3.8901μs | 257.0634 KOps/s | 259.0334 KOps/s | |
test_membership_stacked_nested_last | 32.3010μs | 3.9084μs | 255.8571 KOps/s | 202.3137 KOps/s | |
test_membership_stacked_nested_leaf_last | 29.2850μs | 3.9497μs | 253.1832 KOps/s | 202.9056 KOps/s | |
test_nested_getleaf | 54.4330μs | 10.6264μs | 94.1055 KOps/s | 94.3567 KOps/s | |
test_nested_get | 35.1660μs | 10.0959μs | 99.0499 KOps/s | 99.8085 KOps/s | |
test_stacked_getleaf | 39.9750μs | 10.7579μs | 92.9548 KOps/s | 95.2756 KOps/s | |
test_stacked_get | 38.4920μs | 10.0133μs | 99.8676 KOps/s | 99.6220 KOps/s | |
test_nested_getitemleaf | 37.2000μs | 10.9959μs | 90.9430 KOps/s | 89.9824 KOps/s | |
test_nested_getitem | 33.2130μs | 10.2755μs | 97.3191 KOps/s | 96.6527 KOps/s | |
test_stacked_getitemleaf | 37.2000μs | 11.0285μs | 90.6739 KOps/s | 90.4387 KOps/s | |
test_stacked_getitem | 29.5160μs | 10.3715μs | 96.4181 KOps/s | 96.3673 KOps/s | |
test_lock_nested | 88.4119ms | 0.5875ms | 1.7021 KOps/s | 2.1079 KOps/s | |
test_lock_stack_nested | 0.5949ms | 0.4721ms | 2.1184 KOps/s | 2.2402 KOps/s | |
test_unlock_nested | 91.0526ms | 0.5154ms | 1.9404 KOps/s | 2.5024 KOps/s | |
test_unlock_stack_nested | 0.4704ms | 0.3852ms | 2.5960 KOps/s | 2.7090 KOps/s | |
test_flatten_speed | 0.3407ms | 90.3682μs | 11.0658 KOps/s | 11.5566 KOps/s | |
test_unflatten_speed | 0.7017ms | 0.4776ms | 2.0938 KOps/s | 2.1260 KOps/s | |
test_common_ops | 4.7622ms | 1.1228ms | 890.5986 Ops/s | 897.5745 Ops/s | |
test_creation | 21.9320μs | 2.0372μs | 490.8773 KOps/s | 479.9235 KOps/s | |
test_creation_empty | 48.5520μs | 18.4888μs | 54.0869 KOps/s | 51.9914 KOps/s | |
test_creation_nested_1 | 61.8370μs | 21.6195μs | 46.2544 KOps/s | 44.5408 KOps/s | |
test_creation_nested_2 | 67.5770μs | 25.5297μs | 39.1701 KOps/s | 37.3068 KOps/s | |
test_clone | 0.1195ms | 17.8729μs | 55.9506 KOps/s | 58.5768 KOps/s | |
test_getitem[int] | 1.0787ms | 16.7413μs | 59.7326 KOps/s | 62.0941 KOps/s | |
test_getitem[slice_int] | 0.1324ms | 31.6370μs | 31.6086 KOps/s | 33.4927 KOps/s | |
test_getitem[range] | 0.2552ms | 65.4262μs | 15.2844 KOps/s | 17.2242 KOps/s | |
test_getitem[tuple] | 0.1263ms | 25.5911μs | 39.0760 KOps/s | 40.9522 KOps/s | |
test_getitem[list] | 0.1792ms | 54.7957μs | 18.2496 KOps/s | 18.6404 KOps/s | |
test_setitem_dim[int] | 60.0030μs | 32.7785μs | 30.5078 KOps/s | 32.6646 KOps/s | |
test_setitem_dim[slice_int] | 0.1099ms | 62.1751μs | 16.0836 KOps/s | 16.8442 KOps/s | |
test_setitem_dim[range] | 0.1338ms | 84.6283μs | 11.8164 KOps/s | 12.0311 KOps/s | |
test_setitem_dim[tuple] | 77.2850μs | 49.2333μs | 20.3115 KOps/s | 21.0338 KOps/s | |
test_setitem | 0.1693ms | 30.2953μs | 33.0085 KOps/s | 32.9598 KOps/s | |
test_set | 0.1010ms | 29.4194μs | 33.9911 KOps/s | 33.6858 KOps/s | |
test_set_shared | 1.3139ms | 0.2112ms | 4.7344 KOps/s | 4.7347 KOps/s | |
test_update | 94.4580μs | 36.5394μs | 27.3678 KOps/s | 26.5144 KOps/s | |
test_update_nested | 0.1405ms | 47.1691μs | 21.2003 KOps/s | 21.3804 KOps/s | |
test_update__nested | 83.2460μs | 35.2324μs | 28.3829 KOps/s | 29.6430 KOps/s | |
test_set_nested | 81.2630μs | 31.8969μs | 31.3510 KOps/s | 31.8117 KOps/s | |
test_set_nested_new | 0.1138ms | 37.1580μs | 26.9121 KOps/s | 26.6134 KOps/s | |
test_select | 0.1168ms | 55.8706μs | 17.8985 KOps/s | 18.5569 KOps/s | |
test_select_nested | 0.1333ms | 59.7237μs | 16.7438 KOps/s | 16.9373 KOps/s | |
test_exclude_nested | 0.1476ms | 75.7170μs | 13.2071 KOps/s | 13.2276 KOps/s | |
test_empty[True] | 0.4578ms | 0.3187ms | 3.1376 KOps/s | 3.1507 KOps/s | |
test_empty[False] | 7.3288μs | 1.2157μs | 822.5639 KOps/s | 848.1301 KOps/s | |
test_unbind_speed | 0.4790ms | 0.3095ms | 3.2310 KOps/s | 3.3100 KOps/s | |
test_unbind_speed_stack0 | 0.5239ms | 0.3068ms | 3.2593 KOps/s | 3.3602 KOps/s | |
test_unbind_speed_stack1 | 91.1446ms | 0.8247ms | 1.2126 KOps/s | 1.3565 KOps/s | |
test_split | 89.6945ms | 2.2059ms | 453.3225 Ops/s | 464.8477 Ops/s | |
test_chunk | 3.2221ms | 2.0393ms | 490.3625 Ops/s | 469.4891 Ops/s | |
test_creation[device0] | 4.0637ms | 0.1220ms | 8.1961 KOps/s | 8.6296 KOps/s | |
test_creation_from_tensor | 0.2394ms | 0.1182ms | 8.4626 KOps/s | 8.5033 KOps/s | |
test_add_one[memmap_tensor0] | 0.2092ms | 7.6192μs | 131.2478 KOps/s | 142.1256 KOps/s | |
test_contiguous[memmap_tensor0] | 16.8020μs | 1.9435μs | 514.5393 KOps/s | 540.1375 KOps/s | |
test_stack[memmap_tensor0] | 37.3500μs | 5.8959μs | 169.6091 KOps/s | 179.2993 KOps/s | |
test_memmaptd_index | 0.6683ms | 0.4115ms | 2.4300 KOps/s | 2.5386 KOps/s | |
test_memmaptd_index_astensor | 0.9328ms | 0.4886ms | 2.0466 KOps/s | 2.1260 KOps/s | |
test_memmaptd_index_op | 1.7115ms | 1.0381ms | 963.3139 Ops/s | 991.4408 Ops/s | |
test_serialize_model | 0.2142s | 0.1290s | 7.7515 Ops/s | 8.6695 Ops/s | |
test_serialize_model_pickle | 0.4797s | 0.3999s | 2.5007 Ops/s | 2.4842 Ops/s | |
test_serialize_weights | 0.1234s | 0.1148s | 8.7111 Ops/s | 7.7078 Ops/s | |
test_serialize_weights_returnearly | 0.1750s | 0.1603s | 6.2371 Ops/s | 6.2897 Ops/s | |
test_serialize_weights_pickle | 0.4989s | 0.4250s | 2.3528 Ops/s | 2.2805 Ops/s | |
test_serialize_weights_filesystem | 0.1441s | 0.1407s | 7.1050 Ops/s | 7.1546 Ops/s | |
test_serialize_model_filesystem | 0.1507s | 0.1446s | 6.9155 Ops/s | 6.1282 Ops/s | |
test_reshape_pytree | 0.1321ms | 38.7750μs | 25.7898 KOps/s | 25.9475 KOps/s | |
test_reshape_td | 0.1113ms | 45.6655μs | 21.8984 KOps/s | 22.1637 KOps/s | |
test_view_pytree | 85.4710μs | 38.1534μs | 26.2100 KOps/s | 25.5653 KOps/s | |
test_view_td | 0.1200ms | 52.7386μs | 18.9615 KOps/s | 19.5330 KOps/s | |
test_unbind_pytree | 91.0420μs | 35.9817μs | 27.7919 KOps/s | 27.7389 KOps/s | |
test_unbind_td | 0.3346ms | 46.0682μs | 21.7069 KOps/s | 22.2288 KOps/s | |
test_split_pytree | 89.6990μs | 38.3357μs | 26.0854 KOps/s | 26.3789 KOps/s | |
test_split_td | 0.4764ms | 58.8632μs | 16.9886 KOps/s | 17.9070 KOps/s | |
test_add_pytree | 86.5030μs | 45.3033μs | 22.0734 KOps/s | 22.3240 KOps/s | |
test_add_td | 0.2168ms | 82.5085μs | 12.1200 KOps/s | 12.1073 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1155ms | 57.7556μs | 17.3144 KOps/s | 17.9871 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3248ms | 0.1763ms | 5.6720 KOps/s | 5.7081 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1174ms | 57.0515μs | 17.5280 KOps/s | 17.8373 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2992ms | 0.1439ms | 6.9505 KOps/s | 7.2055 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 64.5610μs | 20.8398μs | 47.9852 KOps/s | 48.2811 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1408ms | 67.9395μs | 14.7190 KOps/s | 15.0136 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1673ms | 76.0677μs | 13.1462 KOps/s | 13.3217 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1355ms | 68.0301μs | 14.6994 KOps/s | 14.5310 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2859ms | 0.1751ms | 5.7123 KOps/s | 5.7511 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3404ms | 0.1899ms | 5.2666 KOps/s | 5.3623 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1443ms | 47.6179μs | 21.0005 KOps/s | 20.2145 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1248ms | 67.9813μs | 14.7099 KOps/s | 15.1592 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3094ms | 0.1753ms | 5.7045 KOps/s | 5.7721 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5355ms | 0.2963ms | 3.3748 KOps/s | 3.5714 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3301ms | 0.2019ms | 4.9530 KOps/s | 5.0454 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3564ms | 0.1765ms | 5.6672 KOps/s | 5.7351 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1263ms | 61.3895μs | 16.2894 KOps/s | 16.4937 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1164ms | 48.5277μs | 20.6068 KOps/s | 21.5698 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4225ms | 0.2370ms | 4.2192 KOps/s | 4.3328 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3949ms | 0.1754ms | 5.7008 KOps/s | 5.6567 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1816ms | 0.1028ms | 9.7269 KOps/s | 9.8007 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1508ms | 59.4259μs | 16.8277 KOps/s | 16.9368 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1417ms | 76.0870μs | 13.1429 KOps/s | 12.9197 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1439ms | 68.4719μs | 14.6045 KOps/s | 14.6955 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4153ms | 0.2000ms | 5.0010 KOps/s | 5.1219 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.9725ms | 1.7065ms | 585.9957 Ops/s | 619.3588 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2479ms | 0.1934ms | 5.1708 KOps/s | 5.1610 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.2075ms | 1.1229ms | 890.5871 Ops/s | 928.4919 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.5529ms | 0.4179ms | 2.3930 KOps/s | 2.3620 KOps/s | |
test_compile_assign_and_add_stack[eager] | 5.0074ms | 3.8256ms | 261.3989 Ops/s | 265.6817 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1107ms | 35.2803μs | 28.3445 KOps/s | 29.6127 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5522ms | 48.5760μs | 20.5863 KOps/s | 21.0464 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 75.0020μs | 30.4485μs | 32.8424 KOps/s | 34.3025 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 98.6960μs | 29.6953μs | 33.6753 KOps/s | 35.1701 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 95.8800μs | 30.4357μs | 32.8561 KOps/s | 33.1787 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 76.9150μs | 30.1221μs | 33.1983 KOps/s | 34.8930 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1314ms | 74.8342μs | 13.3629 KOps/s | 13.9383 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5440ms | 28.7800μs | 34.7464 KOps/s | 37.1575 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1336ms | 70.2438μs | 14.2361 KOps/s | 14.7552 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 90.8920μs | 23.5548μs | 42.4542 KOps/s | 42.9015 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1415ms | 69.7387μs | 14.3392 KOps/s | 14.7369 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 77.5480μs | 23.4290μs | 42.6821 KOps/s | 42.7628 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1449ms | 74.9377μs | 13.3444 KOps/s | 13.7623 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8321ms | 28.1266μs | 35.5536 KOps/s | 36.9511 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1401ms | 70.2949μs | 14.2258 KOps/s | 14.6954 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.2977ms | 23.7379μs | 42.1266 KOps/s | 43.5166 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1817ms | 70.6838μs | 14.1475 KOps/s | 14.7765 KOps/s | |
test_compile_indexing[int-pytree-eager] | 59.1210μs | 23.1283μs | 43.2370 KOps/s | 43.5921 KOps/s | |
test_mod_add[eager] | 95.8110μs | 25.4116μs | 39.3521 KOps/s | 38.7877 KOps/s | |
test_mod_add[compile] | 0.1084ms | 39.4683μs | 25.3368 KOps/s | 25.6405 KOps/s | |
test_mod_add[compile-overhead] | 92.4040μs | 39.2437μs | 25.4818 KOps/s | 25.5146 KOps/s | |
test_mod_wrap[eager] | 0.4320ms | 0.2111ms | 4.7364 KOps/s | 4.8900 KOps/s | |
test_mod_wrap[compile] | 0.3931ms | 0.2329ms | 4.2942 KOps/s | 4.3276 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4282ms | 0.2315ms | 4.3197 KOps/s | 4.3349 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.1959ms | 10.9949ms | 90.9513 Ops/s | 91.4349 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.1007ms | 10.8819ms | 91.8955 Ops/s | 93.3756 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.3760ms | 10.9014ms | 91.7312 Ops/s | 93.1510 Ops/s | |
test_seq_add[eager] | 0.1823ms | 92.6395μs | 10.7945 KOps/s | 10.8124 KOps/s | |
test_seq_add[compile] | 0.1322ms | 65.6215μs | 15.2389 KOps/s | 15.3815 KOps/s | |
test_seq_add[compile-overhead] | 0.1341ms | 64.5872μs | 15.4829 KOps/s | 15.9781 KOps/s | |
test_seq_wrap[eager] | 0.6455ms | 0.3883ms | 2.5752 KOps/s | 2.6119 KOps/s | |
test_seq_wrap[compile] | 1.3145ms | 0.2708ms | 3.6929 KOps/s | 3.7121 KOps/s | |
test_seq_wrap[compile-overhead] | 1.3907ms | 0.2706ms | 3.6949 KOps/s | 3.7443 KOps/s | |
test_func_call_runtime[False-eager] | 0.9428ms | 0.5314ms | 1.8819 KOps/s | 2.0004 KOps/s | |
test_func_call_runtime[False-compile] | 0.9167ms | 0.4989ms | 2.0043 KOps/s | 2.0024 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.9259ms | 0.4979ms | 2.0085 KOps/s | 1.9938 KOps/s | |
test_func_call_runtime[True-eager] | 1.5963ms | 0.7378ms | 1.3553 KOps/s | 1.4024 KOps/s | |
test_func_call_runtime[True-compile] | 0.8920ms | 0.5133ms | 1.9481 KOps/s | 1.9632 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8707ms | 0.5126ms | 1.9507 KOps/s | 1.9717 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.6734ms | 0.5233ms | 1.9111 KOps/s | 2.0365 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8067ms | 0.5009ms | 1.9963 KOps/s | 2.0155 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.9336ms | 0.5003ms | 1.9988 KOps/s | 2.0282 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0277ms | 0.8686ms | 1.1513 KOps/s | 1.1996 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9077ms | 0.7428ms | 1.3462 KOps/s | 1.4150 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1788ms | 0.7517ms | 1.3304 KOps/s | 1.4116 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4678ms | 1.8695ms | 534.8919 Ops/s | 545.4042 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 2.6412ms | 1.9321ms | 517.5808 Ops/s | 527.8031 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 2.6807ms | 1.9194ms | 521.0031 Ops/s | 530.5067 Ops/s | |
test_distributed | 0.2614ms | 0.1242ms | 8.0530 KOps/s | 7.9608 KOps/s | |
test_tdmodule | 72.5460μs | 18.6130μs | 53.7259 KOps/s | 51.8212 KOps/s | |
test_tdmodule_dispatch | 63.6700μs | 36.3179μs | 27.5346 KOps/s | 26.2556 KOps/s | |
test_tdseq | 36.2180μs | 20.2160μs | 49.4657 KOps/s | 47.6994 KOps/s | |
test_tdseq_dispatch | 58.4000μs | 41.1974μs | 24.2734 KOps/s | 23.7526 KOps/s | |
test_instantiation_functorch | 2.4101ms | 1.6369ms | 610.9271 Ops/s | 634.9073 Ops/s | |
test_instantiation_td | 1.9301ms | 1.2041ms | 830.5188 Ops/s | 839.2915 Ops/s | |
test_exec_functorch | 0.3488ms | 0.1901ms | 5.2604 KOps/s | 5.4211 KOps/s | |
test_exec_functional_call | 0.3360ms | 0.1821ms | 5.4910 KOps/s | 5.9842 KOps/s | |
test_exec_td | 0.3055ms | 0.1766ms | 5.6633 KOps/s | 6.1464 KOps/s | |
test_exec_td_decorator | 1.1334ms | 0.2289ms | 4.3685 KOps/s | 4.6704 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.9959ms | 0.6518ms | 1.5341 KOps/s | 1.5626 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8795ms | 0.6460ms | 1.5479 KOps/s | 1.5622 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.8074ms | 0.5033ms | 1.9870 KOps/s | 2.0336 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7431ms | 0.5019ms | 1.9922 KOps/s | 2.0477 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.5179ms | 0.6250ms | 1.6000 KOps/s | 1.5657 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1114ms | 0.6305ms | 1.5860 KOps/s | 1.6266 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.9110ms | 0.5143ms | 1.9444 KOps/s | 1.9715 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7165ms | 0.5127ms | 1.9504 KOps/s | 1.9854 KOps/s | |
test_to_module_speed[True] | 1.9095ms | 1.3099ms | 763.4241 Ops/s | 774.9126 Ops/s | |
test_to_module_speed[False] | 2.1150ms | 1.2667ms | 789.4354 Ops/s | 805.1605 Ops/s | |
test_tc_init | 91.8730μs | 43.4079μs | 23.0373 KOps/s | 21.9321 KOps/s | |
test_tc_init_nested | 0.1625ms | 85.6268μs | 11.6786 KOps/s | 10.8194 KOps/s | |
test_tc_first_layer_tensor | 32.7010μs | 1.5692μs | 637.2598 KOps/s | 628.0657 KOps/s | |
test_tc_first_layer_nontensor | 20.5690μs | 4.8114μs | 207.8395 KOps/s | 214.0838 KOps/s | |
test_tc_second_layer_tensor | 22.5330μs | 2.8583μs | 349.8638 KOps/s | 349.4909 KOps/s | |
test_tc_second_layer_nontensor | 27.4620μs | 6.1227μs | 163.3263 KOps/s | 166.2319 KOps/s | |
test_unbind | 0.4694s | 13.1548ms | 76.0180 Ops/s | 68.3321 Ops/s | |
test_full_like | 7.7732ms | 6.7935ms | 147.1992 Ops/s | 146.9483 Ops/s | |
test_zeros_like | 3.1752ms | 2.6768ms | 373.5873 Ops/s | 366.5454 Ops/s | |
test_ones_like | 11.7562ms | 5.7572ms | 173.6968 Ops/s | 160.7539 Ops/s | |
test_clone | 14.2594ms | 7.5580ms | 132.3103 Ops/s | 127.4759 Ops/s | |
test_squeeze | 70.4420μs | 12.6801μs | 78.8636 KOps/s | 80.2592 KOps/s | |
test_unsqueeze | 0.2204ms | 95.6556μs | 10.4542 KOps/s | 11.0601 KOps/s | |
test_split | 0.4858ms | 0.2012ms | 4.9693 KOps/s | 5.2172 KOps/s | |
test_permute | 0.3864ms | 0.2266ms | 4.4128 KOps/s | 4.5571 KOps/s | |
test_stack | 30.6994ms | 24.3538ms | 41.0613 Ops/s | 39.2619 Ops/s | |
test_cat | 29.2335ms | 24.1307ms | 41.4409 Ops/s | 39.4813 Ops/s |
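The Change column is empty in the table as captured here. As a rough aid for reading the rows, the sketch below derives a relative change from the two throughput columns. The formula (PR throughput minus HEAD throughput, divided by HEAD throughput) and the helper names `parse_ops`, `relative_change`, and `row_change` are assumptions made for illustration; they are not taken from the benchmark tooling that produced the table.

```python
import re

# Multipliers for the throughput units that appear in the Ops columns.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell):
    """Convert a cell such as '47.9625 KOps/s' to operations per second."""
    match = re.fullmatch(r"\s*([0-9.]+)\s*([KM]?Ops/s)\s*", cell)
    if match is None:
        raise ValueError(f"unrecognised throughput cell: {cell!r}")
    value, unit = match.groups()
    return float(value) * UNIT_SCALE[unit]


def relative_change(ops_pr, ops_head):
    """Percent change of the PR throughput relative to repo HEAD.

    Assumed definition: (PR - HEAD) / HEAD * 100. Positive means the PR
    run was faster than HEAD in this measurement.
    """
    pr, head = parse_ops(ops_pr), parse_ops(ops_head)
    return (pr - head) / head * 100.0


def row_change(row):
    """Split one pipe-separated table row and return (name, percent change)."""
    cells = [c.strip() for c in row.split("|")]
    # Column order: Name, Max, Mean, Ops, Ops on Repo HEAD, Change.
    return cells[0], relative_change(cells[3], cells[4])


if __name__ == "__main__":
    rows = [
        "test_membership_stacked_nested_last | 32.3010μs | 3.9084μs | 255.8571 KOps/s | 202.3137 KOps/s | |",
        "test_lock_nested | 88.4119ms | 0.5875ms | 1.7021 KOps/s | 2.1079 KOps/s | |",
    ]
    for row in rows:
        name, change = row_change(row)
        print(f"{name}: {change:+.2f}%")
```

Single-run differences of a few percent are within the noise visible in the Max column, so a value computed this way is best read as a hint for which rows deserve a rerun, not as a verdict on the PR.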
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.6122ms | 14.0154μs | 71.3499 KOps/s | 72.5335 KOps/s | |
test_plain_set_stack_nested | 43.0230μs | 14.2309μs | 70.2695 KOps/s | 73.0826 KOps/s | |
test_plain_set_nested_inplace | 39.5020μs | 14.9875μs | 66.7222 KOps/s | 67.6447 KOps/s | |
test_plain_set_stack_nested_inplace | 59.0140μs | 15.0612μs | 66.3958 KOps/s | 68.9144 KOps/s | |
test_items | 43.3720μs | 2.9215μs | 342.2860 KOps/s | 347.4679 KOps/s | |
test_items_nested | 0.3812ms | 0.3292ms | 3.0380 KOps/s | 3.0584 KOps/s | |
test_items_nested_locked | 0.3979ms | 0.3280ms | 3.0490 KOps/s | 3.0590 KOps/s | |
test_items_nested_leaf | 86.8550μs | 55.9081μs | 17.8865 KOps/s | 17.8563 KOps/s | |
test_items_stack_nested | 0.3911ms | 0.3325ms | 3.0071 KOps/s | 3.0666 KOps/s | |
test_items_stack_nested_leaf | 97.1660μs | 57.2390μs | 17.4706 KOps/s | 17.9142 KOps/s | |
test_items_stack_nested_locked | 0.3957ms | 0.3299ms | 3.0317 KOps/s | 3.0569 KOps/s | |
test_keys | 27.3320μs | 3.3918μs | 294.8245 KOps/s | 275.7192 KOps/s | |
test_keys_nested | 93.0160μs | 56.1907μs | 17.7965 KOps/s | 18.2234 KOps/s | |
test_keys_nested_locked | 2.3082ms | 62.9927μs | 15.8749 KOps/s | 16.1561 KOps/s | |
test_keys_nested_leaf | 75.8350μs | 47.8694μs | 20.8902 KOps/s | 21.5464 KOps/s | |
test_keys_stack_nested | 99.1060μs | 56.7505μs | 17.6210 KOps/s | 17.6837 KOps/s | |
test_keys_stack_nested_leaf | 0.1077ms | 47.6105μs | 21.0038 KOps/s | 21.0290 KOps/s | |
test_keys_stack_nested_locked | 95.6660μs | 61.7696μs | 16.1892 KOps/s | 16.2700 KOps/s | |
test_values | 5.9488μs | 0.8547μs | 1.1700 MOps/s | 1.1945 MOps/s | |
test_values_nested | 69.4440μs | 41.0710μs | 24.3481 KOps/s | 24.5416 KOps/s | |
test_values_nested_locked | 68.9640μs | 43.0224μs | 23.2437 KOps/s | 23.4472 KOps/s | |
test_values_nested_leaf | 67.1140μs | 35.3950μs | 28.2525 KOps/s | 28.2278 KOps/s | |
test_values_stack_nested | 79.4650μs | 42.0714μs | 23.7691 KOps/s | 24.3444 KOps/s | |
test_values_stack_nested_leaf | 59.6040μs | 36.1297μs | 27.6781 KOps/s | 28.1920 KOps/s | |
test_values_stack_nested_locked | 67.5940μs | 43.8104μs | 22.8256 KOps/s | 23.3611 KOps/s | |
test_membership | 1.8261μs | 0.5050μs | 1.9804 MOps/s | 1.9614 MOps/s | |
test_membership_nested | 14.7110μs | 1.8465μs | 541.5749 KOps/s | 542.4229 KOps/s | |
test_membership_nested_leaf | 14.7143μs | 1.8208μs | 549.2215 KOps/s | 557.6604 KOps/s | |
test_membership_stacked_nested | 26.0220μs | 1.8910μs | 528.8250 KOps/s | 543.4608 KOps/s | |
test_membership_stacked_nested_leaf | 31.5520μs | 1.8709μs | 534.5089 KOps/s | 541.2167 KOps/s | |
test_membership_nested_last | 30.5110μs | 2.7220μs | 367.3771 KOps/s | 372.4263 KOps/s | |
test_membership_nested_leaf_last | 29.2820μs | 2.7329μs | 365.9156 KOps/s | 369.0482 KOps/s | |
test_membership_stacked_nested_last | 24.2310μs | 2.6984μs | 370.5857 KOps/s | 324.9700 KOps/s | |
test_membership_stacked_nested_leaf_last | 32.7020μs | 2.7520μs | 363.3786 KOps/s | 322.6647 KOps/s | |
test_nested_getleaf | 34.2720μs | 6.0526μs | 165.2191 KOps/s | 165.9491 KOps/s | |
test_nested_get | 40.2720μs | 5.7215μs | 174.7803 KOps/s | 176.6353 KOps/s | |
test_stacked_getleaf | 29.9420μs | 6.1024μs | 163.8702 KOps/s | 166.6252 KOps/s | |
test_stacked_get | 43.6130μs | 5.7084μs | 175.1802 KOps/s | 176.9800 KOps/s | |
test_nested_getitemleaf | 36.3520μs | 6.1867μs | 161.6375 KOps/s | 162.7892 KOps/s | |
test_nested_getitem | 31.5620μs | 5.7545μs | 173.7768 KOps/s | 174.0689 KOps/s | |
test_stacked_getitemleaf | 29.9620μs | 6.1701μs | 162.0730 KOps/s | 163.0150 KOps/s | |
test_stacked_getitem | 0.5348ms | 5.5850μs | 179.0496 KOps/s | 175.3827 KOps/s | |
test_lock_nested | 4.4613ms | 0.4159ms | 2.4042 KOps/s | 2.4265 KOps/s | |
test_lock_stack_nested | 0.4244ms | 0.3788ms | 2.6400 KOps/s | 2.6687 KOps/s | |
test_unlock_nested | 0.7281ms | 0.3512ms | 2.8471 KOps/s | 2.8528 KOps/s | |
test_unlock_stack_nested | 0.3723ms | 0.3146ms | 3.1785 KOps/s | 3.1949 KOps/s | |
test_flatten_speed | 0.1070ms | 69.0028μs | 14.4922 KOps/s | 14.5990 KOps/s | |
test_unflatten_speed | 0.3387ms | 0.2803ms | 3.5671 KOps/s | 3.5759 KOps/s | |
test_common_ops | 1.5266ms | 1.2264ms | 815.4100 Ops/s | 818.7383 Ops/s | |
test_creation | 27.0220μs | 1.4324μs | 698.1369 KOps/s | 693.4942 KOps/s | |
test_creation_empty | 38.9420μs | 15.2707μs | 65.4847 KOps/s | 66.3708 KOps/s | |
test_creation_nested_1 | 45.7030μs | 16.8019μs | 59.5170 KOps/s | 58.7242 KOps/s | |
test_creation_nested_2 | 50.9630μs | 19.6283μs | 50.9468 KOps/s | 51.3388 KOps/s | |
test_clone | 62.7130μs | 28.4544μs | 35.1439 KOps/s | 35.5200 KOps/s | |
test_getitem[int] | 92.2593ms | 22.2559μs | 44.9320 KOps/s | 64.9886 KOps/s | |
test_getitem[slice_int] | 0.1183ms | 25.6500μs | 38.9864 KOps/s | 38.2965 KOps/s | |
test_getitem[range] | 0.2717ms | 0.1079ms | 9.2719 KOps/s | 9.1604 KOps/s | |
test_getitem[tuple] | 0.1202ms | 22.7614μs | 43.9340 KOps/s | 43.6797 KOps/s | |
test_getitem[list] | 0.1906ms | 99.7969μs | 10.0204 KOps/s | 10.4338 KOps/s | |
test_setitem_dim[int] | 68.1340μs | 43.2212μs | 23.1368 KOps/s | 22.9760 KOps/s | |
test_setitem_dim[slice_int] | 0.1072ms | 66.2013μs | 15.1055 KOps/s | 15.4680 KOps/s | |
test_setitem_dim[range] | 0.1593ms | 0.1266ms | 7.8981 KOps/s | 8.0840 KOps/s | |
test_setitem_dim[tuple] | 86.3550μs | 58.7187μs | 17.0303 KOps/s | 17.0055 KOps/s | |
test_setitem | 79.3140μs | 41.4809μs | 24.1075 KOps/s | 24.8879 KOps/s | |
test_set | 87.3250μs | 40.5186μs | 24.6800 KOps/s | 25.0822 KOps/s | |
test_set_shared | 0.3623ms | 49.8376μs | 20.0652 KOps/s | 20.0563 KOps/s | |
test_update | 94.2750μs | 49.3080μs | 20.2807 KOps/s | 20.9661 KOps/s | |
test_update_nested | 95.4150μs | 55.2936μs | 18.0853 KOps/s | 18.4257 KOps/s | |
test_update__nested | 0.1114ms | 58.0806μs | 17.2175 KOps/s | 17.2234 KOps/s | |
test_set_nested | 81.0550μs | 43.3358μs | 23.0756 KOps/s | 23.7690 KOps/s | |
test_set_nested_new | 87.9950μs | 46.6385μs | 21.4415 KOps/s | 21.9183 KOps/s | |
test_select | 94.3860μs | 60.3537μs | 16.5690 KOps/s | 17.0528 KOps/s | |
test_select_nested | 0.3693ms | 42.8170μs | 23.3552 KOps/s | 24.1576 KOps/s | |
test_exclude_nested | 93.8560μs | 57.4755μs | 17.3987 KOps/s | 17.4063 KOps/s | |
test_empty[True] | 0.3037ms | 0.2422ms | 4.1284 KOps/s | 4.1729 KOps/s | |
test_empty[False] | 3.5502μs | 0.7381μs | 1.3548 MOps/s | 1.3400 MOps/s | |
test_to | 51.9230μs | 25.9162μs | 38.5859 KOps/s | 38.4391 KOps/s | |
test_to_nonblocking | 61.7140μs | 24.6578μs | 40.5551 KOps/s | 41.1793 KOps/s | |
test_unbind_speed | 0.3289ms | 0.2734ms | 3.6571 KOps/s | 3.6655 KOps/s | |
test_unbind_speed_stack0 | 0.3394ms | 0.2731ms | 3.6618 KOps/s | 3.7072 KOps/s | |
test_unbind_speed_stack1 | 92.4047ms | 0.6989ms | 1.4308 KOps/s | 1.4129 KOps/s | |
test_split | 94.3258ms | 2.0908ms | 478.2809 Ops/s | 470.5431 Ops/s | |
test_chunk | 95.0343ms | 2.0668ms | 483.8434 Ops/s | 474.2487 Ops/s | |
test_creation[device0] | 0.3361ms | 0.1230ms | 8.1313 KOps/s | 8.1007 KOps/s | |
test_creation_from_tensor | 0.3666ms | 0.1262ms | 7.9222 KOps/s | 7.9708 KOps/s | |
test_add_one[memmap_tensor0] | 0.2227ms | 8.3897μs | 119.1944 KOps/s | 119.1913 KOps/s | |
test_contiguous[memmap_tensor0] | 36.1520μs | 2.1040μs | 475.2789 KOps/s | 482.8811 KOps/s | |
test_stack[memmap_tensor0] | 36.0920μs | 6.5874μs | 151.8048 KOps/s | 154.5608 KOps/s | |
test_memmaptd_index | 1.2282ms | 0.4055ms | 2.4662 KOps/s | 2.4824 KOps/s | |
test_memmaptd_index_astensor | 0.7510ms | 0.4593ms | 2.1773 KOps/s | 2.2085 KOps/s | |
test_memmaptd_index_op | 1.3803ms | 0.9803ms | 1.0201 KOps/s | 1.0375 KOps/s | |
test_serialize_model | 0.1310s | 0.1289s | 7.7568 Ops/s | 7.7502 Ops/s | |
test_serialize_model_pickle | 1.3518s | 1.2120s | 0.8251 Ops/s | 0.8254 Ops/s | |
test_serialize_weights | 0.1296s | 0.1283s | 7.7930 Ops/s | 7.7916 Ops/s | |
test_serialize_weights_returnearly | 48.4091ms | 45.4859ms | 21.9848 Ops/s | 18.1426 Ops/s | |
test_serialize_weights_pickle | 1.3511s | 1.2126s | 0.8247 Ops/s | 0.8210 Ops/s | |
test_reshape_pytree | 81.6150μs | 35.9242μs | 27.8364 KOps/s | 29.1201 KOps/s | |
test_reshape_td | 77.4650μs | 40.9150μs | 24.4409 KOps/s | 24.0996 KOps/s | |
test_view_pytree | 72.1440μs | 33.4386μs | 29.9056 KOps/s | 29.5789 KOps/s | |
test_view_td | 86.6450μs | 43.8848μs | 22.7869 KOps/s | 21.8568 KOps/s | |
test_unbind_pytree | 62.9340μs | 33.5923μs | 29.7687 KOps/s | 30.2480 KOps/s | |
test_unbind_td | 0.7032ms | 40.6134μs | 24.6224 KOps/s | 23.6717 KOps/s | |
test_split_pytree | 88.5760μs | 43.7380μs | 22.8634 KOps/s | 21.8507 KOps/s | |
test_split_td | 0.1742ms | 52.7224μs | 18.9673 KOps/s | 16.1301 KOps/s | |
test_add_pytree | 0.1057ms | 54.4163μs | 18.3768 KOps/s | 18.2329 KOps/s | |
test_add_td | 0.1239ms | 85.4759μs | 11.6992 KOps/s | 11.2836 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.3959ms | 0.2028ms | 4.9306 KOps/s | 4.8587 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.1981ms | 0.1482ms | 6.7474 KOps/s | 6.7106 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1803ms | 0.1404ms | 7.1210 KOps/s | 7.1953 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2283ms | 0.1778ms | 5.6258 KOps/s | 5.5805 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 67.3440μs | 20.7010μs | 48.3068 KOps/s | 47.8023 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 87.1950μs | 43.0912μs | 23.2066 KOps/s | 23.4213 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1135ms | 63.8253μs | 15.6678 KOps/s | 15.8098 KOps/s | |
test_compile_copy_nested[pytree-eager] | 93.3150μs | 49.3561μs | 20.2609 KOps/s | 20.2912 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3668ms | 0.3040ms | 3.2898 KOps/s | 3.2808 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2764ms | 0.2062ms | 4.8488 KOps/s | 4.8191 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1720ms | 0.1241ms | 8.0611 KOps/s | 8.1005 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1145ms | 57.6238μs | 17.3539 KOps/s | 17.3589 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3488ms | 0.3034ms | 3.2963 KOps/s | 3.2773 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6831ms | 0.6051ms | 1.6526 KOps/s | 1.6592 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3070ms | 0.2457ms | 4.0701 KOps/s | 4.0432 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3546ms | 0.3046ms | 3.2831 KOps/s | 3.2822 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1054ms | 68.3924μs | 14.6215 KOps/s | 14.2304 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1662ms | 0.1246ms | 8.0234 KOps/s | 8.0647 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5784ms | 0.5203ms | 1.9221 KOps/s | 1.9323 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3596ms | 0.3030ms | 3.3000 KOps/s | 3.2949 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 44.0930μs | 17.9060μs | 55.8473 KOps/s | 54.5305 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 67.1350μs | 27.2787μs | 36.6587 KOps/s | 36.5251 KOps/s | |
test_compile_copy_flat[pytree-compile] | 96.9660μs | 68.5310μs | 14.5919 KOps/s | 14.6456 KOps/s | |
test_compile_copy_flat[pytree-eager] | 81.8650μs | 50.8914μs | 19.6497 KOps/s | 19.5759 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.2958ms | 0.7949ms | 1.2580 KOps/s | 1.1548 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.2743ms | 3.1167ms | 320.8476 Ops/s | 325.8290 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.2300ms | 0.8010ms | 1.2485 KOps/s | 1.1695 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.2710ms | 3.1406ms | 318.4069 Ops/s | 327.3765 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1576ms | 0.1064ms | 9.3944 KOps/s | 9.3856 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.1828ms | 57.5755μs | 17.3685 KOps/s | 15.8285 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1454ms | 0.1006ms | 9.9443 KOps/s | 9.9450 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1562ms | 41.7227μs | 23.9678 KOps/s | 22.8352 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1423ms | 0.1013ms | 9.8729 KOps/s | 9.4573 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 83.3050μs | 41.4171μs | 24.1446 KOps/s | 23.6388 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1808ms | 0.1337ms | 7.4802 KOps/s | 7.4978 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1604ms | 23.8210μs | 41.9797 KOps/s | 41.2324 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1760ms | 0.1280ms | 7.8127 KOps/s | 7.8806 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 49.2930μs | 19.9281μs | 50.1804 KOps/s | 50.1279 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1948ms | 0.1285ms | 7.7835 KOps/s | 7.8347 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 52.2630μs | 20.0929μs | 49.7689 KOps/s | 50.0492 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2411ms | 0.1349ms | 7.4105 KOps/s | 7.4767 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5547ms | 23.4072μs | 42.7220 KOps/s | 41.1332 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1875ms | 0.1281ms | 7.8074 KOps/s | 7.8269 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1577ms | 21.6913μs | 46.1013 KOps/s | 49.1206 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1778ms | 0.1280ms | 7.8133 KOps/s | 7.8169 KOps/s | |
test_compile_indexing[int-pytree-eager] | 48.3530μs | 19.9936μs | 50.0160 KOps/s | 50.2891 KOps/s | |
test_mod_add[eager] | 72.5850μs | 30.8562μs | 32.4084 KOps/s | 32.2111 KOps/s | |
test_mod_add[compile] | 0.2989ms | 67.9645μs | 14.7136 KOps/s | 14.3133 KOps/s | |
test_mod_add[compile-overhead] | 0.2622ms | 0.1342ms | 7.4507 KOps/s | 7.2259 KOps/s | |
test_mod_wrap[eager] | 0.3408ms | 0.2325ms | 4.3012 KOps/s | 4.1180 KOps/s | |
test_mod_wrap[compile] | 0.3701ms | 0.2833ms | 3.5298 KOps/s | 3.4591 KOps/s | |
test_mod_wrap[compile-overhead] | 7.7772ms | 4.0662ms | 245.9276 Ops/s | 247.7028 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6003ms | 1.3237ms | 755.4392 Ops/s | 705.4024 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5777ms | 1.2814ms | 780.3674 Ops/s | 717.6850 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.2975ms | 0.8792ms | 1.1374 KOps/s | 961.4397 Ops/s | |
test_seq_add[eager] | 0.1386ms | 94.8847μs | 10.5391 KOps/s | 10.5377 KOps/s | |
test_seq_add[compile] | 0.1377ms | 77.9566μs | 12.8276 KOps/s | 12.8912 KOps/s | |
test_seq_add[compile-overhead] | 0.1711ms | 0.1129ms | 8.8552 KOps/s | 8.9496 KOps/s | |
test_seq_wrap[eager] | 0.7664ms | 0.3710ms | 2.6955 KOps/s | 2.6725 KOps/s | |
test_seq_wrap[compile] | 0.7349ms | 0.3106ms | 3.2200 KOps/s | 3.2687 KOps/s | |
test_seq_wrap[compile-overhead] | 0.6187ms | 0.2162ms | 4.6251 KOps/s | 4.6289 KOps/s | |
test_func_call_runtime[False-eager] | 1.1506ms | 0.7305ms | 1.3688 KOps/s | 1.3887 KOps/s | |
test_func_call_runtime[False-compile] | 1.1609ms | 0.7678ms | 1.3025 KOps/s | 1.3070 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7723ms | 0.3471ms | 2.8809 KOps/s | 2.8849 KOps/s | |
test_func_call_runtime[True-eager] | 0.9925ms | 0.8585ms | 1.1648 KOps/s | 1.1403 KOps/s | |
test_func_call_runtime[True-compile] | 0.8898ms | 0.7838ms | 1.2758 KOps/s | 1.2779 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4838ms | 0.3659ms | 2.7331 KOps/s | 2.7273 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8927ms | 0.7037ms | 1.4210 KOps/s | 1.3769 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9113ms | 0.7580ms | 1.3193 KOps/s | 1.2930 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4935ms | 0.3491ms | 2.8646 KOps/s | 2.8602 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1198ms | 0.9575ms | 1.0443 KOps/s | 1.0227 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0005ms | 0.8074ms | 1.2386 KOps/s | 1.2311 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4399ms | 0.3913ms | 2.5556 KOps/s | 2.5546 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4815ms | 2.0362ms | 491.1076 Ops/s | 488.5219 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9997ms | 0.8264ms | 1.2100 KOps/s | 1.2077 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7916ms | 0.3941ms | 2.5371 KOps/s | 2.5267 KOps/s | |
test_distributed | 0.6727ms | 0.1517ms | 6.5926 KOps/s | 8.9282 KOps/s | |
test_tdmodule | 33.4320μs | 14.7600μs | 67.7505 KOps/s | 67.8616 KOps/s | |
test_tdmodule_dispatch | 51.6830μs | 28.6045μs | 34.9596 KOps/s | 34.9386 KOps/s | |
test_tdseq | 34.2920μs | 15.1300μs | 66.0940 KOps/s | 64.9431 KOps/s | |
test_tdseq_dispatch | 59.4030μs | 31.0521μs | 32.2040 KOps/s | 32.6141 KOps/s | |
test_instantiation_functorch | 1.8780ms | 1.7873ms | 559.5175 Ops/s | 565.4873 Ops/s | |
test_instantiation_td | 1.7512ms | 1.1575ms | 863.9637 Ops/s | 854.9924 Ops/s | |
test_exec_functorch | 0.2920ms | 0.1991ms | 5.0228 KOps/s | 4.6454 KOps/s | |
test_exec_functional_call | 0.2845ms | 0.1999ms | 5.0018 KOps/s | 4.6220 KOps/s | |
test_exec_td | 0.2482ms | 0.2030ms | 4.9270 KOps/s | 4.4764 KOps/s | |
test_exec_td_decorator | 0.8244ms | 0.2444ms | 4.0913 KOps/s | 3.8094 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8503ms | 0.6738ms | 1.4841 KOps/s | 1.4648 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7611ms | 0.6724ms | 1.4871 KOps/s | 1.4688 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6830ms | 0.5678ms | 1.7612 KOps/s | 1.7657 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6662ms | 0.5642ms | 1.7725 KOps/s | 1.7441 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.1763ms | 0.6601ms | 1.5149 KOps/s | 1.4908 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7895ms | 0.6592ms | 1.5169 KOps/s | 1.5108 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6997ms | 0.5764ms | 1.7348 KOps/s | 1.7298 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7581ms | 0.5782ms | 1.7295 KOps/s | 1.6845 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.3390ms | 8.1944ms | 122.0345 Ops/s | 120.7300 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.2841ms | 8.1866ms | 122.1510 Ops/s | 120.7509 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.1288ms | 8.0093ms | 124.8541 Ops/s | 123.4672 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.2228ms | 8.0031ms | 124.9517 Ops/s | 122.6404 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.9352ms | 19.4184ms | 51.4975 Ops/s | 51.8376 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.8120ms | 19.2196ms | 52.0302 Ops/s | 51.8431 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 20.2125ms | 19.0341ms | 52.5374 Ops/s | 52.3598 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.1409ms | 19.0257ms | 52.5606 Ops/s | 52.3969 Ops/s | |
test_to_module_speed[True] | 1.2207ms | 0.9326ms | 1.0723 KOps/s | 1.0855 KOps/s | |
test_to_module_speed[False] | 1.3185ms | 0.9078ms | 1.1016 KOps/s | 1.1056 KOps/s | |
test_tc_init | 79.2640μs | 34.9581μs | 28.6057 KOps/s | 29.7331 KOps/s | |
test_tc_init_nested | 0.1133ms | 66.5241μs | 15.0321 KOps/s | 14.7109 KOps/s | |
test_tc_first_layer_tensor | 5.3617μs | 0.6603μs | 1.5145 MOps/s | 1.4940 MOps/s | |
test_tc_first_layer_nontensor | 30.2420μs | 2.1831μs | 458.0707 KOps/s | 460.4723 KOps/s | |
test_tc_second_layer_tensor | 7.5505μs | 1.3466μs | 742.6165 KOps/s | 700.4684 KOps/s | |
test_tc_second_layer_nontensor | 30.6120μs | 2.8808μs | 347.1309 KOps/s | 345.9700 KOps/s | |
test_unbind | 0.1925s | 12.1511ms | 82.2970 Ops/s | 101.1991 Ops/s | |
test_full_like | 0.7036ms | 0.5742ms | 1.7416 KOps/s | 1.7413 KOps/s | |
test_zeros_like | 0.5240ms | 0.1978ms | 5.0551 KOps/s | 5.0532 KOps/s | |
test_ones_like | 0.5286ms | 0.1978ms | 5.0543 KOps/s | 5.0576 KOps/s | |
test_clone | 0.6060ms | 0.4150ms | 2.4098 KOps/s | 2.4124 KOps/s | |
test_squeeze | 29.3320μs | 9.8475μs | 101.5484 KOps/s | 101.4470 KOps/s | |
test_unsqueeze | 0.4490ms | 74.5397μs | 13.4157 KOps/s | 13.6359 KOps/s | |
test_split | 0.2533ms | 0.1497ms | 6.6799 KOps/s | 6.5653 KOps/s | |
test_permute | 0.5621ms | 0.1710ms | 5.8480 KOps/s | 5.7499 KOps/s | |
test_stack | 1.2546ms | 0.8597ms | 1.1632 KOps/s | 1.1593 KOps/s | |
test_cat | 1.2608ms | 1.2313ms | 812.1299 Ops/s | 811.6914 Ops/s |
Labels: bug, CLA Signed
Stack from ghstack (oldest at bottom):