[Versioning] 0.6.2 #1089
Merged
vmoens added a commit that referenced this pull request on Nov 14, 2024
ghstack-source-id: 929ad7fb89e0bd6d25a70ed64b340ad7245fd693
Pull Request resolved: #1089
facebook-github-bot added the CLA Signed label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Nov 14, 2024
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 34.0330μs | 17.2155μs | 58.0873 KOps/s | 55.2824 KOps/s | |
test_plain_set_stack_nested | 44.9340μs | 17.3571μs | 57.6132 KOps/s | 53.4492 KOps/s | |
test_plain_set_nested_inplace | 69.0390μs | 18.9931μs | 52.6506 KOps/s | 49.1327 KOps/s | |
test_plain_set_stack_nested_inplace | 48.0400μs | 18.7374μs | 53.3691 KOps/s | 49.9726 KOps/s | |
test_items | 39.8940μs | 4.1019μs | 243.7924 KOps/s | 233.5284 KOps/s | |
test_items_nested | 0.6590ms | 0.3425ms | 2.9199 KOps/s | 2.9658 KOps/s | |
test_items_nested_locked | 0.5674ms | 0.3422ms | 2.9223 KOps/s | 2.9470 KOps/s | |
test_items_nested_leaf | 0.1429ms | 71.5416μs | 13.9779 KOps/s | 14.1360 KOps/s | |
test_items_stack_nested | 0.6603ms | 0.3472ms | 2.8805 KOps/s | 2.9228 KOps/s | |
test_items_stack_nested_leaf | 0.1290ms | 74.2431μs | 13.4693 KOps/s | 13.5762 KOps/s | |
test_items_stack_nested_locked | 0.6497ms | 0.3437ms | 2.9091 KOps/s | 2.9044 KOps/s | |
test_keys | 37.3200μs | 3.5116μs | 284.7721 KOps/s | 280.2664 KOps/s | |
test_keys_nested | 0.2037ms | 0.1382ms | 7.2337 KOps/s | 7.4003 KOps/s | |
test_keys_nested_locked | 1.8223ms | 0.1439ms | 6.9500 KOps/s | 7.0457 KOps/s | |
test_keys_nested_leaf | 0.1941ms | 0.1158ms | 8.6331 KOps/s | 8.4954 KOps/s | |
test_keys_stack_nested | 0.2604ms | 0.1369ms | 7.3065 KOps/s | 7.3153 KOps/s | |
test_keys_stack_nested_leaf | 0.2053ms | 0.1157ms | 8.6420 KOps/s | 8.5152 KOps/s | |
test_keys_stack_nested_locked | 0.2586ms | 0.1416ms | 7.0623 KOps/s | 7.1081 KOps/s | |
test_values | 8.7984μs | 1.0463μs | 955.7487 KOps/s | 976.0191 KOps/s | |
test_values_nested | 0.1111ms | 55.9917μs | 17.8598 KOps/s | 17.9787 KOps/s | |
test_values_nested_locked | 0.1022ms | 55.5160μs | 18.0128 KOps/s | 17.4470 KOps/s | |
test_values_nested_leaf | 0.1174ms | 59.8479μs | 16.7090 KOps/s | 16.6513 KOps/s | |
test_values_stack_nested | 0.1099ms | 57.0915μs | 17.5157 KOps/s | 17.7004 KOps/s | |
test_values_stack_nested_leaf | 0.1222ms | 59.6029μs | 16.7777 KOps/s | 16.3464 KOps/s | |
test_values_stack_nested_locked | 0.1147ms | 57.1298μs | 17.5040 KOps/s | 17.7087 KOps/s | |
test_membership | 36.2080μs | 0.8856μs | 1.1292 MOps/s | 1.1462 MOps/s | |
test_membership_nested | 17.6330μs | 2.7299μs | 366.3095 KOps/s | 361.3660 KOps/s | |
test_membership_nested_leaf | 43.4810μs | 2.7621μs | 362.0422 KOps/s | 361.6734 KOps/s | |
test_membership_stacked_nested | 36.3370μs | 2.7168μs | 368.0760 KOps/s | 370.6455 KOps/s | |
test_membership_stacked_nested_leaf | 33.1020μs | 2.7466μs | 364.0831 KOps/s | 370.2253 KOps/s | |
test_membership_nested_last | 66.3540μs | 4.0223μs | 248.6155 KOps/s | 249.2027 KOps/s | |
test_membership_nested_leaf_last | 46.7560μs | 4.0279μs | 248.2711 KOps/s | 248.8903 KOps/s | |
test_membership_stacked_nested_last | 30.3370μs | 3.9601μs | 252.5162 KOps/s | 252.0878 KOps/s | |
test_membership_stacked_nested_leaf_last | 41.9480μs | 3.9306μs | 254.4125 KOps/s | 250.3836 KOps/s | |
test_nested_getleaf | 55.6940μs | 10.5469μs | 94.8149 KOps/s | 93.4902 KOps/s | |
test_nested_get | 49.0010μs | 10.2675μs | 97.3942 KOps/s | 95.6813 KOps/s | |
test_stacked_getleaf | 39.9740μs | 10.7300μs | 93.1964 KOps/s | 93.8445 KOps/s | |
test_stacked_get | 54.9520μs | 9.9539μs | 100.4636 KOps/s | 99.1376 KOps/s | |
test_nested_getitemleaf | 52.4480μs | 11.0076μs | 90.8462 KOps/s | 85.3475 KOps/s | |
test_nested_getitem | 36.5580μs | 10.4120μs | 96.0433 KOps/s | 97.4754 KOps/s | |
test_stacked_getitemleaf | 54.0010μs | 11.2190μs | 89.1347 KOps/s | 88.5959 KOps/s | |
test_stacked_getitem | 54.3500μs | 10.4133μs | 96.0308 KOps/s | 95.6772 KOps/s | |
test_lock_nested | 3.1251ms | 0.4386ms | 2.2801 KOps/s | 1.8538 KOps/s | |
test_lock_stack_nested | 0.7402ms | 0.4048ms | 2.4705 KOps/s | 2.4496 KOps/s | |
test_unlock_nested | 0.7514ms | 0.3507ms | 2.8515 KOps/s | 2.7762 KOps/s | |
test_unlock_stack_nested | 0.5941ms | 0.3226ms | 3.0995 KOps/s | 3.0407 KOps/s | |
test_flatten_speed | 0.2124ms | 90.7289μs | 11.0218 KOps/s | 10.9802 KOps/s | |
test_unflatten_speed | 0.9748ms | 0.4787ms | 2.0889 KOps/s | 2.1344 KOps/s | |
test_common_ops | 1.5057ms | 0.7397ms | 1.3520 KOps/s | 1.2696 KOps/s | |
test_creation | 26.5700μs | 2.0368μs | 490.9595 KOps/s | 481.4325 KOps/s | |
test_creation_empty | 39.4830μs | 9.5871μs | 104.3071 KOps/s | 86.8033 KOps/s | |
test_creation_nested_1 | 42.5690μs | 12.4216μs | 80.5052 KOps/s | 69.9928 KOps/s | |
test_creation_nested_2 | 79.0570μs | 16.5546μs | 60.4060 KOps/s | 54.3300 KOps/s | |
test_clone | 0.1600ms | 13.1744μs | 75.9050 KOps/s | 76.8560 KOps/s | |
test_getitem[int] | 1.1415ms | 12.4547μs | 80.2910 KOps/s | 81.3166 KOps/s | |
test_getitem[slice_int] | 0.1414ms | 23.7836μs | 42.0458 KOps/s | 43.5816 KOps/s | |
test_getitem[range] | 0.1936ms | 47.3263μs | 21.1299 KOps/s | 21.3250 KOps/s | |
test_getitem[tuple] | 0.1321ms | 19.5599μs | 51.1250 KOps/s | 52.7701 KOps/s | |
test_getitem[list] | 0.2159ms | 43.2474μs | 23.1228 KOps/s | 23.5607 KOps/s | |
test_setitem_dim[int] | 61.6350μs | 24.7116μs | 40.4668 KOps/s | 39.8387 KOps/s | |
test_setitem_dim[slice_int] | 93.5540μs | 48.8392μs | 20.4753 KOps/s | 19.7550 KOps/s | |
test_setitem_dim[range] | 0.1604ms | 71.5831μs | 13.9698 KOps/s | 13.6699 KOps/s | |
test_setitem_dim[tuple] | 71.2520μs | 38.8808μs | 25.7197 KOps/s | 25.1520 KOps/s | |
test_setitem | 0.1628ms | 19.7541μs | 50.6224 KOps/s | 48.4607 KOps/s | |
test_set | 68.3470μs | 18.7899μs | 53.2200 KOps/s | 49.3624 KOps/s | |
test_set_shared | 4.1942ms | 0.1670ms | 5.9880 KOps/s | 5.9583 KOps/s | |
test_update | 0.1316ms | 21.0453μs | 47.5165 KOps/s | 42.6369 KOps/s | |
test_update_nested | 96.0290μs | 29.8465μs | 33.5048 KOps/s | 30.8369 KOps/s | |
test_update__nested | 0.1466ms | 32.5254μs | 30.7452 KOps/s | 31.3182 KOps/s | |
test_set_nested | 0.1330ms | 21.1756μs | 47.2242 KOps/s | 45.1861 KOps/s | |
test_set_nested_new | 83.9750μs | 25.6713μs | 38.9540 KOps/s | 37.1309 KOps/s | |
test_select | 0.2035ms | 42.3380μs | 23.6195 KOps/s | 23.7492 KOps/s | |
test_select_nested | 0.1303ms | 60.4908μs | 16.5314 KOps/s | 16.8990 KOps/s | |
test_exclude_nested | 0.1383ms | 75.2388μs | 13.2910 KOps/s | 13.4896 KOps/s | |
test_empty[True] | 0.4335ms | 0.3536ms | 2.8283 KOps/s | 2.8719 KOps/s | |
test_empty[False] | 9.2397μs | 1.2165μs | 822.0412 KOps/s | 815.5702 KOps/s | |
test_unbind_speed | 0.3048ms | 0.2592ms | 3.8578 KOps/s | 3.8999 KOps/s | |
test_unbind_speed_stack0 | 0.3168ms | 0.2540ms | 3.9376 KOps/s | 3.9233 KOps/s | |
test_unbind_speed_stack1 | 0.1074s | 0.7507ms | 1.3322 KOps/s | 1.4055 KOps/s | |
test_split | 0.1032s | 1.7199ms | 581.4234 Ops/s | 576.0875 Ops/s | |
test_chunk | 0.1038s | 1.7317ms | 577.4727 Ops/s | 571.2454 Ops/s | |
test_consolidate_njt[False-None] | 10.2681ms | 8.1167ms | 123.2025 Ops/s | 122.3270 Ops/s | |
test_creation[device0] | 0.2515ms | 88.5869μs | 11.2884 KOps/s | 10.0688 KOps/s | |
test_creation_from_tensor | 4.4472ms | 93.6161μs | 10.6819 KOps/s | 10.5141 KOps/s | |
test_add_one[memmap_tensor0] | 0.1579ms | 4.8670μs | 205.4640 KOps/s | 212.4853 KOps/s | |
test_contiguous[memmap_tensor0] | 21.6710μs | 0.5216μs | 1.9171 MOps/s | 1.9577 MOps/s | |
test_stack[memmap_tensor0] | 33.1010μs | 3.4137μs | 292.9336 KOps/s | 304.1745 KOps/s | |
test_memmaptd_index | 1.0519ms | 0.2323ms | 4.3053 KOps/s | 4.2958 KOps/s | |
test_memmaptd_index_astensor | 0.7123ms | 0.3100ms | 3.2260 KOps/s | 3.2308 KOps/s | |
test_memmaptd_index_op | 1.2933ms | 0.5450ms | 1.8350 KOps/s | 1.7070 KOps/s | |
test_serialize_model | 0.1239s | 0.1141s | 8.7660 Ops/s | 7.5202 Ops/s | |
test_serialize_model_pickle | 0.4783s | 0.3926s | 2.5474 Ops/s | 2.5102 Ops/s | |
test_serialize_weights | 0.1174s | 0.1129s | 8.8559 Ops/s | 8.7591 Ops/s | |
test_serialize_weights_returnearly | 0.1800s | 0.1551s | 6.4475 Ops/s | 6.2093 Ops/s | |
test_serialize_weights_pickle | 1.1830s | 0.7425s | 1.3468 Ops/s | 2.2347 Ops/s | |
test_serialize_weights_filesystem | 0.1518s | 0.1436s | 6.9619 Ops/s | 6.4377 Ops/s | |
test_serialize_model_filesystem | 0.2505s | 0.1565s | 6.3916 Ops/s | 6.6370 Ops/s | |
test_reshape_pytree | 70.6320μs | 26.9656μs | 37.0843 KOps/s | 37.7827 KOps/s | |
test_reshape_td | 66.3030μs | 32.3685μs | 30.8942 KOps/s | 31.6640 KOps/s | |
test_view_pytree | 68.7980μs | 27.1409μs | 36.8448 KOps/s | 38.1008 KOps/s | |
test_view_td | 79.9090μs | 38.5424μs | 25.9454 KOps/s | 27.1742 KOps/s | |
test_unbind_pytree | 86.0200μs | 30.1755μs | 33.1394 KOps/s | 34.1677 KOps/s | |
test_unbind_td | 0.3546ms | 38.2840μs | 26.1206 KOps/s | 26.3039 KOps/s | |
test_split_pytree | 65.5120μs | 30.0858μs | 33.2383 KOps/s | 34.5185 KOps/s | |
test_split_td | 0.2000ms | 43.6209μs | 22.9248 KOps/s | 22.5244 KOps/s | |
test_add_pytree | 80.7600μs | 35.4442μs | 28.2133 KOps/s | 27.9739 KOps/s | |
test_add_td | 0.1838ms | 53.9451μs | 18.5374 KOps/s | 18.1559 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1180ms | 61.0436μs | 16.3817 KOps/s | 16.3273 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3767ms | 0.1602ms | 6.2406 KOps/s | 6.2716 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1186ms | 45.4754μs | 21.9899 KOps/s | 22.5049 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2555ms | 0.1197ms | 8.3560 KOps/s | 8.5603 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 71.4930μs | 25.9722μs | 38.5027 KOps/s | 37.7693 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1158ms | 53.5867μs | 18.6613 KOps/s | 18.5441 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1700ms | 78.3562μs | 12.7622 KOps/s | 12.7709 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1334ms | 67.5519μs | 14.8034 KOps/s | 14.8080 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1926ms | 0.1045ms | 9.5722 KOps/s | 9.7278 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2732ms | 0.2001ms | 4.9981 KOps/s | 5.0351 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1228ms | 44.4211μs | 22.5118 KOps/s | 22.6403 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4950ms | 60.8647μs | 16.4299 KOps/s | 16.4551 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1815ms | 0.1030ms | 9.7132 KOps/s | 9.8997 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3680ms | 0.2038ms | 4.9074 KOps/s | 4.9526 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3982ms | 0.2099ms | 4.7642 KOps/s | 4.7882 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2084ms | 0.1069ms | 9.3558 KOps/s | 9.4292 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2345ms | 52.6583μs | 18.9904 KOps/s | 18.8650 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1076ms | 45.4351μs | 22.0094 KOps/s | 22.0112 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5911ms | 0.1612ms | 6.2050 KOps/s | 6.3089 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1907ms | 0.1041ms | 9.6060 KOps/s | 9.8073 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 54.7720μs | 20.9857μs | 47.6515 KOps/s | 48.6865 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1140ms | 58.3190μs | 17.1471 KOps/s | 16.7317 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1512ms | 80.9231μs | 12.3574 KOps/s | 12.3844 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1278ms | 70.5547μs | 14.1734 KOps/s | 14.6065 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3006ms | 0.2099ms | 4.7653 KOps/s | 4.9228 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.3608ms | 1.2426ms | 804.7553 Ops/s | 794.1810 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2822ms | 0.2027ms | 4.9339 KOps/s | 5.0445 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.8619ms | 0.7767ms | 1.2875 KOps/s | 1.2963 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.8087ms | 0.4555ms | 2.1953 KOps/s | 2.2377 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.7034ms | 2.4826ms | 402.8013 Ops/s | 380.4091 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 98.5630μs | 35.4647μs | 28.1971 KOps/s | 27.7918 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.7250ms | 32.3670μs | 30.8957 KOps/s | 30.6879 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1000ms | 28.6510μs | 34.9028 KOps/s | 35.1230 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 86.0700μs | 23.0782μs | 43.3309 KOps/s | 43.5187 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 78.1150μs | 29.5036μs | 33.8941 KOps/s | 34.2947 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 99.7360μs | 23.1433μs | 43.2090 KOps/s | 43.5740 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1268ms | 50.8813μs | 19.6536 KOps/s | 19.6054 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5874ms | 19.4789μs | 51.3375 KOps/s | 52.3084 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 87.2120μs | 44.5315μs | 22.4560 KOps/s | 22.9969 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 93.5440μs | 19.0228μs | 52.5685 KOps/s | 53.0186 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1202ms | 45.6145μs | 21.9229 KOps/s | 22.4914 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 55.8640μs | 18.9196μs | 52.8552 KOps/s | 54.0902 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1249ms | 51.5426μs | 19.4014 KOps/s | 19.2942 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9919ms | 19.2224μs | 52.0228 KOps/s | 52.6676 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 93.7750μs | 45.2785μs | 22.0856 KOps/s | 22.3264 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 83.9170μs | 18.7238μs | 53.4081 KOps/s | 54.1108 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1236ms | 45.3106μs | 22.0699 KOps/s | 22.2616 KOps/s | |
test_compile_indexing[int-pytree-eager] | 87.4930μs | 18.9627μs | 52.7351 KOps/s | 54.3177 KOps/s | |
test_mod_add[eager] | 79.7190μs | 24.9932μs | 40.0108 KOps/s | 38.1178 KOps/s | |
test_mod_add[compile] | 93.4840μs | 44.8017μs | 22.3206 KOps/s | 22.5489 KOps/s | |
test_mod_add[compile-overhead] | 96.7000μs | 44.4562μs | 22.4941 KOps/s | 22.6184 KOps/s | |
test_mod_wrap[eager] | 0.3646ms | 0.2077ms | 4.8137 KOps/s | 4.7064 KOps/s | |
test_mod_wrap[compile] | 1.9884ms | 0.2014ms | 4.9661 KOps/s | 4.9556 KOps/s | |
test_mod_wrap[compile-overhead] | 1.9420ms | 0.2031ms | 4.9231 KOps/s | 4.9007 KOps/s | |
test_mod_wrap_and_backward[eager] | 15.8178ms | 12.0110ms | 83.2574 Ops/s | 80.4709 Ops/s | |
test_mod_wrap_and_backward[compile] | 18.4565ms | 13.0104ms | 76.8617 Ops/s | 76.3013 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 19.3093ms | 13.5721ms | 73.6804 Ops/s | 82.8470 Ops/s | |
test_seq_add[eager] | 0.1739ms | 89.8614μs | 11.1282 KOps/s | 10.4934 KOps/s | |
test_seq_add[compile] | 0.1200ms | 59.7173μs | 16.7456 KOps/s | 16.6452 KOps/s | |
test_seq_add[compile-overhead] | 0.1691ms | 58.8444μs | 16.9940 KOps/s | 16.8985 KOps/s | |
test_seq_wrap[eager] | 0.6373ms | 0.3717ms | 2.6905 KOps/s | 2.5546 KOps/s | |
test_seq_wrap[compile] | 0.4100ms | 0.2223ms | 4.4983 KOps/s | 4.5248 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3943ms | 0.2220ms | 4.5039 KOps/s | 4.4812 KOps/s | |
test_func_call_runtime[False-eager] | 0.8542ms | 0.5480ms | 1.8248 KOps/s | 1.8508 KOps/s | |
test_func_call_runtime[False-compile] | 0.8585ms | 0.4243ms | 2.3568 KOps/s | 2.3884 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5416ms | 0.4248ms | 2.3541 KOps/s | 2.3759 KOps/s | |
test_func_call_runtime[True-eager] | 1.0400ms | 0.7579ms | 1.3194 KOps/s | 1.3159 KOps/s | |
test_func_call_runtime[True-compile] | 0.6152ms | 0.4592ms | 2.1776 KOps/s | 2.1719 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6362ms | 0.4696ms | 2.1293 KOps/s | 2.1629 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7171ms | 0.5430ms | 1.8415 KOps/s | 1.8462 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5478ms | 0.4266ms | 2.3441 KOps/s | 2.3851 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5906ms | 0.4270ms | 2.3420 KOps/s | 2.3944 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0721ms | 0.8916ms | 1.1216 KOps/s | 1.1247 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6600ms | 0.4895ms | 2.0430 KOps/s | 2.0708 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.7783ms | 0.4927ms | 2.0295 KOps/s | 2.0754 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.7004ms | 1.9425ms | 514.8063 Ops/s | 521.7696 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8765ms | 0.5175ms | 1.9322 KOps/s | 1.9401 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 1.0200ms | 0.5205ms | 1.9214 KOps/s | 1.9618 KOps/s | |
test_distributed | 0.4228ms | 0.1295ms | 7.7216 KOps/s | 7.7573 KOps/s | |
test_tdmodule | 62.3780μs | 17.4953μs | 57.1581 KOps/s | 50.7500 KOps/s | |
test_tdmodule_dispatch | 70.1110μs | 34.7485μs | 28.7782 KOps/s | 26.4395 KOps/s | |
test_tdseq | 58.4290μs | 20.0946μs | 49.7647 KOps/s | 44.9112 KOps/s | |
test_tdseq_dispatch | 0.1191ms | 40.6443μs | 24.6037 KOps/s | 23.3835 KOps/s | |
test_instantiation_functorch | 2.1139ms | 1.5317ms | 652.8832 Ops/s | 661.1570 Ops/s | |
test_exec_functorch | 0.3121ms | 0.1791ms | 5.5824 KOps/s | 5.6675 KOps/s | |
test_exec_functional_call | 0.3894ms | 0.1704ms | 5.8701 KOps/s | 5.7528 KOps/s | |
test_exec_td_decorator | 0.5618ms | 0.2262ms | 4.4214 KOps/s | 4.4221 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9589ms | 0.6350ms | 1.5748 KOps/s | 1.5098 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9503ms | 0.6262ms | 1.5969 KOps/s | 1.5525 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7506ms | 0.5150ms | 1.9416 KOps/s | 1.9226 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7433ms | 0.5162ms | 1.9374 KOps/s | 1.9033 KOps/s | |
test_to_module_speed[True] | 1.4509ms | 1.2893ms | 775.6408 Ops/s | 763.3252 Ops/s | |
test_to_module_speed[False] | 1.3952ms | 1.2610ms | 793.0327 Ops/s | 796.8724 Ops/s | |
test_tc_init | 85.5790μs | 45.8882μs | 21.7921 KOps/s | 21.7004 KOps/s | |
test_tc_init_nested | 0.1683ms | 89.4738μs | 11.1765 KOps/s | 10.6694 KOps/s | |
test_tc_first_layer_tensor | 33.8330μs | 1.4759μs | 677.5609 KOps/s | 627.1249 KOps/s | |
test_tc_first_layer_nontensor | 41.4370μs | 4.6490μs | 215.0997 KOps/s | 210.1343 KOps/s | |
test_tc_second_layer_tensor | 41.7780μs | 2.7740μs | 360.4916 KOps/s | 352.0511 KOps/s | |
test_tc_second_layer_nontensor | 28.8830μs | 6.0326μs | 165.7648 KOps/s | 162.5791 KOps/s | |
test_unbind | 0.2767s | 14.4453ms | 69.2267 Ops/s | 82.1446 Ops/s | |
test_full_like | 13.1051ms | 10.5778ms | 94.5376 Ops/s | 127.6903 Ops/s | |
test_zeros_like | 5.4033ms | 4.0175ms | 248.9124 Ops/s | 351.9781 Ops/s | |
test_ones_like | 5.3278ms | 4.3190ms | 231.5372 Ops/s | 305.2165 Ops/s | |
test_clone | 9.6131ms | 6.4196ms | 155.7736 Ops/s | 196.4489 Ops/s | |
test_squeeze | 67.9860μs | 11.7408μs | 85.1730 KOps/s | 84.7960 KOps/s | |
test_unsqueeze | 0.1606ms | 86.9902μs | 11.4955 KOps/s | 11.6183 KOps/s | |
test_split | 0.5586ms | 0.1867ms | 5.3554 KOps/s | 5.4530 KOps/s | |
test_permute | 0.3716ms | 0.2165ms | 4.6199 KOps/s | 4.5542 KOps/s | |
test_stack | 37.3730ms | 30.3290ms | 32.9717 Ops/s | 40.9682 Ops/s | |
test_cat | 33.6046ms | 30.0310ms | 33.2990 Ops/s | 41.3347 Ops/s |
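
The Change column is empty in this copy of the report. As a rough sketch, and assuming Change is meant to be the relative difference between Ops and Ops on Repo HEAD (an assumption; the report itself does not define it), a row can be parsed and compared as shown here. The helper names `parse_row` and `relative_change` are hypothetical and do not belong to any benchmark tooling. The last line also checks the observation that Ops is approximately the reciprocal of Mean.

```python
import re

# Multipliers for the unit suffixes that appear in the tables.
# Both the Greek mu and the micro sign are accepted for microseconds.
_OPS_UNITS = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}
_TIME_UNITS = {"s": 1.0, "ms": 1e-3, "μs": 1e-6, "µs": 1e-6}


def _parse(cell, units):
    """Convert a cell such as '58.0873 KOps/s' or '17.2155μs' to base units."""
    match = re.fullmatch(r"\s*([0-9.]+)\s*(\D+?)\s*", cell)
    if match is None or match.group(2) not in units:
        raise ValueError(f"unrecognised cell: {cell!r}")
    return float(match.group(1)) * units[match.group(2)]


def parse_row(line):
    """Parse one table row into (name, mean_seconds, ops, ops_on_head)."""
    cells = [c.strip() for c in line.split("|")]
    name, _max, mean, ops, head = cells[:5]
    return (
        name,
        _parse(mean, _TIME_UNITS),
        _parse(ops, _OPS_UNITS),
        _parse(head, _OPS_UNITS),
    )


def relative_change(ops, ops_on_head):
    """Fractional throughput change relative to repo HEAD (assumed definition)."""
    return (ops - ops_on_head) / ops_on_head


# Example using the first row of the table.
row = "test_plain_set_nested | 34.0330μs | 17.2155μs | 58.0873 KOps/s | 55.2824 KOps/s | |"
name, mean_s, ops, head = parse_row(row)
print(f"{name}: {relative_change(ops, head):+.1%} vs HEAD")
print(f"1/mean = {1 / mean_s / 1e3:.1f} KOps/s, reported = {ops / 1e3:.1f} KOps/s")
```

A positive value means the PR run was faster than HEAD for that benchmark. Single-run differences of a few percent on shared CI hardware are often noise, so a threshold would be needed before treating any row as a regression.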

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 32.2310μs | 10.3500μs | 96.6182 KOps/s | 97.1439 KOps/s | |
test_plain_set_stack_nested | 34.3510μs | 10.4009μs | 96.1459 KOps/s | 97.0080 KOps/s | |
test_plain_set_nested_inplace | 42.4710μs | 11.2227μs | 89.1054 KOps/s | 89.6670 KOps/s | |
test_plain_set_stack_nested_inplace | 63.1310μs | 11.2195μs | 89.1309 KOps/s | 89.9676 KOps/s | |
test_items | 29.1000μs | 2.9216μs | 342.2834 KOps/s | 342.1267 KOps/s | |
test_items_nested | 0.3680ms | 0.3221ms | 3.1042 KOps/s | 3.1299 KOps/s | |
test_items_nested_locked | 0.3588ms | 0.3245ms | 3.0817 KOps/s | 3.0995 KOps/s | |
test_items_nested_leaf | 0.1000ms | 58.5510μs | 17.0791 KOps/s | 17.1668 KOps/s | |
test_items_stack_nested | 0.3660ms | 0.3237ms | 3.0895 KOps/s | 3.1421 KOps/s | |
test_items_stack_nested_leaf | 86.2220μs | 58.2802μs | 17.1585 KOps/s | 16.8202 KOps/s | |
test_items_stack_nested_locked | 0.3992ms | 0.3248ms | 3.0790 KOps/s | 3.1058 KOps/s | |
test_keys | 26.0810μs | 3.4911μs | 286.4395 KOps/s | 291.0012 KOps/s | |
test_keys_nested | 99.8720μs | 70.6850μs | 14.1473 KOps/s | 14.2529 KOps/s | |
test_keys_nested_locked | 0.7585ms | 75.7069μs | 13.2088 KOps/s | 13.2263 KOps/s | |
test_keys_nested_leaf | 0.1729ms | 60.9861μs | 16.3972 KOps/s | 16.3420 KOps/s | |
test_keys_stack_nested | 0.1196ms | 69.5354μs | 14.3812 KOps/s | 14.2470 KOps/s | |
test_keys_stack_nested_leaf | 93.3320μs | 61.1799μs | 16.3452 KOps/s | 16.2869 KOps/s | |
test_keys_stack_nested_locked | 0.1139ms | 74.8776μs | 13.3551 KOps/s | 13.2617 KOps/s | |
test_values | 11.5985μs | 0.8474μs | 1.1801 MOps/s | 1.1874 MOps/s | |
test_values_nested | 61.0710μs | 31.1214μs | 32.1322 KOps/s | 31.9960 KOps/s | |
test_values_nested_locked | 74.2620μs | 32.7650μs | 30.5204 KOps/s | 30.5716 KOps/s | |
test_values_nested_leaf | 69.0520μs | 33.8453μs | 29.5462 KOps/s | 29.6020 KOps/s | |
test_values_stack_nested | 62.8710μs | 31.4036μs | 31.8435 KOps/s | 31.3535 KOps/s | |
test_values_stack_nested_leaf | 63.1310μs | 33.9859μs | 29.4240 KOps/s | 28.8930 KOps/s | |
test_values_stack_nested_locked | 67.1620μs | 33.4673μs | 29.8799 KOps/s | 29.9873 KOps/s | |
test_membership | 1.6181μs | 0.5118μs | 1.9540 MOps/s | 1.9716 MOps/s | |
test_membership_nested | 20.5305μs | 1.9332μs | 517.2874 KOps/s | 512.6604 KOps/s | |
test_membership_nested_leaf | 27.1305μs | 1.9133μs | 522.6499 KOps/s | 526.0789 KOps/s | |
test_membership_stacked_nested | 20.0900μs | 2.0087μs | 497.8388 KOps/s | 500.1970 KOps/s | |
test_membership_stacked_nested_leaf | 43.5910μs | 2.0194μs | 495.2010 KOps/s | 500.0540 KOps/s | |
test_membership_nested_last | 32.3610μs | 2.8376μs | 352.4112 KOps/s | 352.5223 KOps/s | |
test_membership_nested_leaf_last | 43.0300μs | 2.8416μs | 351.9121 KOps/s | 349.8289 KOps/s | |
test_membership_stacked_nested_last | 35.3100μs | 2.8641μs | 349.1451 KOps/s | 287.7714 KOps/s | |
test_membership_stacked_nested_leaf_last | 27.3300μs | 2.9061μs | 344.1049 KOps/s | 288.0541 KOps/s | |
test_nested_getleaf | 42.6800μs | 6.0566μs | 165.1083 KOps/s | 166.1058 KOps/s | |
test_nested_get | 34.3300μs | 5.7287μs | 174.5608 KOps/s | 175.5863 KOps/s | |
test_stacked_getleaf | 29.5510μs | 6.0539μs | 165.1839 KOps/s | 167.1651 KOps/s | |
test_stacked_get | 35.6300μs | 5.7177μs | 174.8968 KOps/s | 176.0139 KOps/s | |
test_nested_getitemleaf | 36.1800μs | 6.1008μs | 163.9131 KOps/s | 164.3498 KOps/s | |
test_nested_getitem | 32.4300μs | 5.7893μs | 172.7334 KOps/s | 173.3899 KOps/s | |
test_stacked_getitemleaf | 29.1000μs | 6.0709μs | 164.7207 KOps/s | 164.9075 KOps/s | |
test_stacked_getitem | 69.2510μs | 5.7448μs | 174.0708 KOps/s | 174.1637 KOps/s | |
test_lock_nested | 9.5457ms | 0.3791ms | 2.6379 KOps/s | 2.6935 KOps/s | |
test_lock_stack_nested | 0.3768ms | 0.3395ms | 2.9459 KOps/s | 2.9989 KOps/s | |
test_unlock_nested | 0.6752ms | 0.3093ms | 3.2328 KOps/s | 3.2793 KOps/s | |
test_unlock_stack_nested | 0.3086ms | 0.2770ms | 3.6103 KOps/s | 3.6726 KOps/s | |
test_flatten_speed | 0.1081ms | 74.1789μs | 13.4809 KOps/s | 13.7964 KOps/s | |
test_unflatten_speed | 0.3450ms | 0.3009ms | 3.3231 KOps/s | 3.3505 KOps/s | |
test_common_ops | 1.7346ms | 0.5890ms | 1.6978 KOps/s | 1.7160 KOps/s | |
test_creation | 0.1873ms | 1.5005μs | 666.4657 KOps/s | 670.3365 KOps/s | |
test_creation_empty | 35.6110μs | 6.9255μs | 144.3930 KOps/s | 144.5336 KOps/s | |
test_creation_nested_1 | 1.6604ms | 8.5138μs | 117.4562 KOps/s | 118.0231 KOps/s | |
test_creation_nested_2 | 41.6510μs | 11.0968μs | 90.1161 KOps/s | 90.7160 KOps/s | |
test_clone | 56.6610μs | 10.8333μs | 92.3079 KOps/s | 93.6760 KOps/s | |
test_getitem[int] | 93.0885ms | 16.6152μs | 60.1859 KOps/s | 93.9971 KOps/s | |
test_getitem[slice_int] | 0.1050ms | 20.8453μs | 47.9724 KOps/s | 48.7979 KOps/s | |
test_getitem[range] | 0.1308ms | 38.0212μs | 26.3011 KOps/s | 26.3155 KOps/s | |
test_getitem[tuple] | 0.1039ms | 18.4282μs | 54.2648 KOps/s | 55.8936 KOps/s | |
test_getitem[list] | 0.2133ms | 33.8039μs | 29.5823 KOps/s | 29.8347 KOps/s | |
test_setitem_dim[int] | 37.7410μs | 19.0237μs | 52.5659 KOps/s | 53.2697 KOps/s | |
test_setitem_dim[slice_int] | 68.2710μs | 37.8610μs | 26.4124 KOps/s | 26.3485 KOps/s | |
test_setitem_dim[range] | 78.3920μs | 54.0727μs | 18.4936 KOps/s | 18.5517 KOps/s | |
test_setitem_dim[tuple] | 55.4310μs | 32.1995μs | 31.0563 KOps/s | 31.6621 KOps/s | |
test_setitem | 0.1124ms | 14.7890μs | 67.6179 KOps/s | 67.2338 KOps/s | |
test_set | 90.8020μs | 14.4249μs | 69.3245 KOps/s | 70.2046 KOps/s | |
test_set_shared | 1.4624ms | 0.1464ms | 6.8327 KOps/s | 6.7860 KOps/s | |
test_update | 0.6642ms | 16.3444μs | 61.1831 KOps/s | 60.5520 KOps/s | |
test_update_nested | 92.2110μs | 21.6734μs | 46.1395 KOps/s | 46.6154 KOps/s | |
test_update__nested | 0.9515ms | 25.1611μs | 39.7438 KOps/s | 40.6233 KOps/s | |
test_set_nested | 80.3620μs | 15.5695μs | 64.2281 KOps/s | 65.4209 KOps/s | |
test_set_nested_new | 90.9510μs | 18.3208μs | 54.5826 KOps/s | 56.7252 KOps/s | |
test_select | 89.8220μs | 30.5272μs | 32.7576 KOps/s | 33.7320 KOps/s | |
test_select_nested | 69.0210μs | 43.7904μs | 22.8361 KOps/s | 23.1244 KOps/s | |
test_exclude_nested | 93.8720μs | 60.9021μs | 16.4198 KOps/s | 16.5164 KOps/s | |
test_empty[True] | 0.3056ms | 0.2657ms | 3.7633 KOps/s | 3.8199 KOps/s | |
test_empty[False] | 4.1701μs | 0.7900μs | 1.2658 MOps/s | 1.2861 MOps/s | |
test_to | 88.0910μs | 60.8981μs | 16.4209 KOps/s | 18.3387 KOps/s | |
test_to_nonblocking | 0.1413ms | 46.9530μs | 21.2979 KOps/s | 21.7349 KOps/s | |
test_unbind_speed | 1.4043ms | 0.2339ms | 4.2749 KOps/s | 4.2850 KOps/s | |
test_unbind_speed_stack0 | 0.2840ms | 0.2314ms | 4.3213 KOps/s | 4.3627 KOps/s | |
test_unbind_speed_stack1 | 92.4879ms | 0.6489ms | 1.5412 KOps/s | 1.5583 KOps/s | |
test_split | 94.9225ms | 1.7302ms | 577.9564 Ops/s | 570.0969 Ops/s | |
test_chunk | 95.4961ms | 1.6014ms | 624.4380 Ops/s | 677.5376 Ops/s | |
test_consolidate[False-None] | 2.6789ms | 2.6148ms | 382.4375 Ops/s | 347.3850 Ops/s | |
test_consolidate[default-None] | 1.7554ms | 1.6507ms | 605.7921 Ops/s | 592.0299 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8142ms | 1.6946ms | 590.1220 Ops/s | 582.6146 Ops/s | |
test_consolidate_njt[False-None] | 6.7737ms | 6.6155ms | 151.1593 Ops/s | 150.5161 Ops/s | |
test_to[False-False-None] | 1.8296ms | 1.7550ms | 569.8071 Ops/s | 574.6702 Ops/s | |
test_to[True-False-None] | 1.5543ms | 1.3208ms | 757.1037 Ops/s | 765.3564 Ops/s | |
test_to[within-False-None] | 0.2951s | 5.2068ms | 192.0570 Ops/s | 246.4909 Ops/s | |
test_to[True-default-None] | 5.4639ms | 5.3109ms | 188.2914 Ops/s | 186.6891 Ops/s | |
test_to_njt[False-False-None] | 8.0129ms | 7.0549ms | 141.7460 Ops/s | 140.9537 Ops/s | |
test_to_njt[True-False-None] | 5.9617ms | 5.6243ms | 177.7984 Ops/s | 178.1951 Ops/s | |
test_to_njt[within-False-None] | 12.4829ms | 12.3731ms | 80.8204 Ops/s | 80.0367 Ops/s | |
test_creation[device0] | 0.4568ms | 78.8810μs | 12.6773 KOps/s | 12.5432 KOps/s | |
test_creation_from_tensor | 0.5177ms | 85.4996μs | 11.6960 KOps/s | 11.9726 KOps/s | |
test_add_one[memmap_tensor0] | 0.6161ms | 6.8816μs | 145.3143 KOps/s | 145.3415 KOps/s | |
test_contiguous[memmap_tensor0] | 1.7600μs | 0.4177μs | 2.3941 MOps/s | 2.4038 MOps/s | |
test_stack[memmap_tensor0] | 39.2610μs | 4.4769μs | 223.3669 KOps/s | 226.6126 KOps/s | |
test_memmaptd_index | 1.8065ms | 0.2484ms | 4.0253 KOps/s | 4.0214 KOps/s | |
test_memmaptd_index_astensor | 0.5543ms | 0.3081ms | 3.2456 KOps/s | 3.2563 KOps/s | |
test_memmaptd_index_op | 0.9646ms | 0.5786ms | 1.7282 KOps/s | 1.7185 KOps/s | |
test_serialize_model | 0.1315s | 0.1306s | 7.6558 Ops/s | 7.6741 Ops/s | |
test_serialize_model_pickle | 1.3460s | 1.1845s | 0.8442 Ops/s | 0.8218 Ops/s | |
test_serialize_weights | 0.1303s | 0.1296s | 7.7163 Ops/s | 7.7017 Ops/s | |
test_serialize_weights_returnearly | 0.2592s | 62.3979ms | 16.0262 Ops/s | 23.2122 Ops/s | |
test_serialize_weights_pickle | 1.3772s | 1.2165s | 0.8220 Ops/s | 0.8225 Ops/s | |
test_reshape_pytree | 50.0010μs | 22.8245μs | 43.8127 KOps/s | 43.4044 KOps/s | |
test_reshape_td | 59.5510μs | 26.8734μs | 37.2115 KOps/s | 35.8204 KOps/s | |
test_view_pytree | 62.4610μs | 22.7171μs | 44.0197 KOps/s | 43.5067 KOps/s | |
test_view_td | 65.8820μs | 30.4287μs | 32.8637 KOps/s | 30.9220 KOps/s | |
test_unbind_pytree | 61.8110μs | 28.5560μs | 35.0189 KOps/s | 34.8319 KOps/s | |
test_unbind_td | 0.5436ms | 35.4693μs | 28.1934 KOps/s | 27.0957 KOps/s | |
test_split_pytree | 62.8610μs | 30.7845μs | 32.4839 KOps/s | 32.0312 KOps/s | |
test_split_td | 0.6426ms | 39.0180μs | 25.6292 KOps/s | 25.4945 KOps/s | |
test_add_pytree | 80.6910μs | 35.2154μs | 28.3967 KOps/s | 28.4277 KOps/s | |
test_add_td | 84.9110μs | 47.9794μs | 20.8423 KOps/s | 22.1682 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1700ms | 0.1187ms | 8.4217 KOps/s | 7.9639 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2189ms | 0.1267ms | 7.8948 KOps/s | 7.8842 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1458ms | 97.0497μs | 10.3040 KOps/s | 10.0585 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.7738ms | 0.1531ms | 6.5321 KOps/s | 6.4911 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 50.3410μs | 23.3917μs | 42.7503 KOps/s | 32.4287 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 56.8410μs | 27.7052μs | 36.0942 KOps/s | 34.8009 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2799ms | 65.9043μs | 15.1735 KOps/s | 15.1316 KOps/s | |
test_compile_copy_nested[pytree-eager] | 83.6220μs | 50.7019μs | 19.7231 KOps/s | 19.8810 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1823ms | 0.1433ms | 6.9768 KOps/s | 6.8178 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2955ms | 0.2083ms | 4.8005 KOps/s | 4.7902 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1627ms | 98.9514μs | 10.1060 KOps/s | 9.9306 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1078ms | 52.1493μs | 19.1757 KOps/s | 19.3985 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1856ms | 0.1343ms | 7.4438 KOps/s | 7.1377 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6445ms | 0.4979ms | 2.0085 KOps/s | 2.0310 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3631ms | 0.2473ms | 4.0429 KOps/s | 3.9858 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1881ms | 0.1426ms | 7.0132 KOps/s | 6.8787 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1390ms | 63.8347μs | 15.6655 KOps/s | 15.9192 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1620ms | 98.8160μs | 10.1198 KOps/s | 9.9274 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4637ms | 0.4222ms | 2.3683 KOps/s | 2.4094 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1799ms | 0.1360ms | 7.3541 KOps/s | 7.2381 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1455ms | 18.5325μs | 53.9592 KOps/s | 38.9116 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1425ms | 28.3132μs | 35.3192 KOps/s | 33.7854 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1034ms | 70.4592μs | 14.1926 KOps/s | 14.1176 KOps/s | |
test_compile_copy_flat[pytree-eager] | 85.6310μs | 51.8378μs | 19.2909 KOps/s | 19.5292 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.7292ms | 0.4145ms | 2.4124 KOps/s | 2.1998 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.2696ms | 2.6291ms | 380.3556 Ops/s | 373.1304 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6631ms | 0.3951ms | 2.5310 KOps/s | 2.2278 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8326ms | 2.7236ms | 367.1668 Ops/s | 364.0755 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.7022ms | 0.1200ms | 8.3328 KOps/s | 8.4031 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5583ms | 82.5714μs | 12.1107 KOps/s | 11.3429 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.6553ms | 0.1135ms | 8.8113 KOps/s | 8.3911 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1783ms | 70.6300μs | 14.1583 KOps/s | 13.1629 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2427ms | 0.1129ms | 8.8545 KOps/s | 8.9218 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1258ms | 70.7552μs | 14.1332 KOps/s | 14.3222 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1612ms | 0.1061ms | 9.4237 KOps/s | 9.6146 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1430ms | 17.2841μs | 57.8567 KOps/s | 54.3894 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1562ms | 0.1003ms | 9.9742 KOps/s | 9.9989 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 57.3210μs | 15.9305μs | 62.7725 KOps/s | 62.1915 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2154ms | 0.1019ms | 9.8171 KOps/s | 9.9680 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 46.3700μs | 15.9402μs | 62.7345 KOps/s | 61.4314 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1705ms | 0.1045ms | 9.5666 KOps/s | 9.5939 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6249ms | 16.9876μs | 58.8665 KOps/s | 56.1519 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2514ms | 98.4203μs | 10.1605 KOps/s | 9.9760 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 47.9110μs | 15.9304μs | 62.7729 KOps/s | 62.5164 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2263ms | 98.4201μs | 10.1605 KOps/s | 9.9621 KOps/s | |
test_compile_indexing[int-pytree-eager] | 67.0720μs | 16.0605μs | 62.2646 KOps/s | 62.7801 KOps/s | |
test_mod_add[eager] | 0.1719ms | 31.6542μs | 31.5914 KOps/s | 31.1057 KOps/s | |
test_mod_add[compile] | 0.2104ms | 77.8398μs | 12.8469 KOps/s | 12.6486 KOps/s | |
test_mod_add[compile-overhead] | 0.3125ms | 0.1635ms | 6.1163 KOps/s | 5.6153 KOps/s | |
test_mod_wrap[eager] | 0.3281ms | 0.2479ms | 4.0346 KOps/s | 3.8507 KOps/s | |
test_mod_wrap[compile] | 1.6322ms | 0.2888ms | 3.4623 KOps/s | 3.4500 KOps/s | |
test_mod_wrap[compile-overhead] | 7.1532ms | 3.8028ms | 262.9620 Ops/s | 265.7953 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.7308ms | 1.3820ms | 723.5758 Ops/s | 676.1047 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.6895ms | 1.2958ms | 771.7347 Ops/s | 716.2925 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4159ms | 0.9953ms | 1.0047 KOps/s | 931.5088 Ops/s | |
test_seq_add[eager] | 0.1521ms | 96.6033μs | 10.3516 KOps/s | 10.0610 KOps/s | |
test_seq_add[compile] | 0.2776ms | 88.9885μs | 11.2374 KOps/s | 11.1853 KOps/s | |
test_seq_add[compile-overhead] | 0.2807ms | 0.1309ms | 7.6366 KOps/s | 7.5761 KOps/s | |
test_seq_wrap[eager] | 0.5311ms | 0.3807ms | 2.6266 KOps/s | 2.5759 KOps/s | |
test_seq_wrap[compile] | 0.3562ms | 0.3062ms | 3.2656 KOps/s | 3.2505 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2721ms | 0.2271ms | 4.4036 KOps/s | 4.3733 KOps/s | |
test_func_call_runtime[False-eager] | 1.0293ms | 0.7907ms | 1.2647 KOps/s | 1.2971 KOps/s | |
test_func_call_runtime[False-compile] | 1.0011ms | 0.7717ms | 1.2959 KOps/s | 1.3031 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4273ms | 0.3675ms | 2.7208 KOps/s | 2.7254 KOps/s | |
test_func_call_runtime[True-eager] | 1.2039ms | 0.9153ms | 1.0925 KOps/s | 1.0216 KOps/s | |
test_func_call_runtime[True-compile] | 0.8339ms | 0.7815ms | 1.2796 KOps/s | 1.2487 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4368ms | 0.3896ms | 2.5665 KOps/s | 2.5780 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8337ms | 0.7534ms | 1.3272 KOps/s | 1.2409 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8455ms | 0.7654ms | 1.3065 KOps/s | 1.2608 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4292ms | 0.3699ms | 2.7034 KOps/s | 2.7090 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1682ms | 1.0196ms | 980.7826 Ops/s | 950.0709 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.0293ms | 0.8169ms | 1.2241 KOps/s | 1.2430 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4809ms | 0.4159ms | 2.4042 KOps/s | 2.3893 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6272ms | 2.1160ms | 472.5935 Ops/s | 469.9653 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9748ms | 0.8263ms | 1.2102 KOps/s | 1.2163 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4810ms | 0.4178ms | 2.3932 KOps/s | 2.3925 KOps/s | |
test_distributed | 3.2001ms | 0.1723ms | 5.8030 KOps/s | 8.7105 KOps/s | |
test_tdmodule | 32.3510μs | 13.5113μs | 74.0123 KOps/s | 73.3333 KOps/s | |
test_tdmodule_dispatch | 0.6263ms | 26.7055μs | 37.4454 KOps/s | 36.8491 KOps/s | |
test_tdseq | 50.1410μs | 15.0267μs | 66.5484 KOps/s | 65.8946 KOps/s | |
test_tdseq_dispatch | 48.2410μs | 29.3849μs | 34.0310 KOps/s | 33.2672 KOps/s | |
test_instantiation_functorch | 1.6370ms | 1.5615ms | 640.4117 Ops/s | 636.7610 Ops/s | |
test_exec_functorch | 0.1773ms | 0.1446ms | 6.9162 KOps/s | 6.8336 KOps/s | |
test_exec_functional_call | 0.1893ms | 0.1404ms | 7.1211 KOps/s | 6.9502 KOps/s | |
test_exec_td_decorator | 0.3801ms | 0.1857ms | 5.3840 KOps/s | 5.3395 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8207ms | 0.6829ms | 1.4644 KOps/s | 1.4589 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8412ms | 0.6828ms | 1.4646 KOps/s | 1.4560 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7197ms | 0.6021ms | 1.6610 KOps/s | 1.6557 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7116ms | 0.6016ms | 1.6623 KOps/s | 1.6532 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.5453ms | 19.4750ms | 51.3479 Ops/s | 51.4195 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.2083ms | 19.4854ms | 51.3206 Ops/s | 51.1419 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.5006ms | 19.4003ms | 51.5457 Ops/s | 51.8681 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.4380ms | 19.3602ms | 51.6524 Ops/s | 51.5775 Ops/s | |
test_to_module_speed[True] | 1.0541ms | 0.9441ms | 1.0593 KOps/s | 1.0544 KOps/s | |
test_to_module_speed[False] | 1.4018ms | 0.9316ms | 1.0735 KOps/s | 1.0730 KOps/s | |
test_tc_init | 70.6010μs | 36.1690μs | 27.6480 KOps/s | 28.4691 KOps/s | |
test_tc_init_nested | 0.2285ms | 72.1450μs | 13.8610 KOps/s | 13.8736 KOps/s | |
test_tc_first_layer_tensor | 4.4344μs | 0.6998μs | 1.4291 MOps/s | 1.2431 MOps/s | |
test_tc_first_layer_nontensor | 24.8700μs | 2.3690μs | 422.1225 KOps/s | 422.9950 KOps/s | |
test_tc_second_layer_tensor | 7.1628μs | 1.4251μs | 701.6986 KOps/s | 694.4544 KOps/s | |
test_tc_second_layer_nontensor | 28.7710μs | 3.1113μs | 321.4061 KOps/s | 325.4738 KOps/s | |
test_unbind | 0.2194s | 9.6705ms | 103.4077 Ops/s | 150.1627 Ops/s | |
test_full_like | 9.8562ms | 9.4424ms | 105.9048 Ops/s | 104.8956 Ops/s | |
test_zeros_like | 4.9233ms | 4.3394ms | 230.4441 Ops/s | 113.5344 Ops/s | |
test_ones_like | 9.2680ms | 7.2738ms | 137.4790 Ops/s | 229.5147 Ops/s | |
test_clone | 7.2199ms | 6.6442ms | 150.5073 Ops/s | 151.1462 Ops/s | |
test_squeeze | 59.3910μs | 9.3824μs | 106.5830 KOps/s | 105.1477 KOps/s | |
test_unsqueeze | 0.1210ms | 70.9858μs | 14.0873 KOps/s | 13.4441 KOps/s | |
test_split | 0.3853ms | 0.1564ms | 6.3926 KOps/s | 6.0868 KOps/s | |
test_permute | 0.2258ms | 0.1772ms | 5.6444 KOps/s | 5.6125 KOps/s | |
test_stack | 51.8320ms | 51.3084ms | 19.4900 Ops/s | 19.4603 Ops/s | |
test_cat | 51.5221ms | 51.0868ms | 19.5745 Ops/s | 19.3468 Ops/s |
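
The Change column in the table above is empty, so the difference between the PR and Repo HEAD has to be read off the two Ops columns by hand. A minimal sketch of that arithmetic follows, assuming each row keeps the column order Name | Max | Mean | Ops | Ops on Repo HEAD | Change and that throughput units are one of `Ops/s`, `KOps/s`, or `MOps/s`. The helper names `parse_ops` and `relative_change` are hypothetical and not part of the benchmark tooling.

```python
# Scale factors for the throughput units that appear in the table.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell):
    """Convert a cell such as '5.8030 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * UNIT_SCALE[unit]


def relative_change(row):
    """Return (test name, fractional change of PR throughput vs. Repo HEAD)."""
    cells = [c.strip() for c in row.split("|")]
    name = cells[0]
    pr_ops = parse_ops(cells[3])    # "Ops" column (this PR)
    head_ops = parse_ops(cells[4])  # "Ops on Repo HEAD" column
    return name, (pr_ops - head_ops) / head_ops


# Rows copied verbatim from the table above.
rows = [
    "test_distributed | 3.2001ms | 0.1723ms | 5.8030 KOps/s | 8.7105 KOps/s | |",
    "test_zeros_like | 4.9233ms | 4.3394ms | 230.4441 Ops/s | 113.5344 Ops/s | |",
    "test_mod_wrap_and_backward[compile-overhead] | 1.4159ms | 0.9953ms | 1.0047 KOps/s | 931.5088 Ops/s | |",
]

for row in rows:
    name, change = relative_change(row)
    print(f"{name}: {change:+.1%}")
```

A positive value means the PR ran more operations per second than Repo HEAD. These are single CI runs, so large swings on individual tests may reflect runner noise rather than a real change.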
Stack from ghstack (oldest at bottom):