[BugFix] Resilient deterministic_sample for CompositeDist #1000

Merged
merged 1 commit into from
Sep 17, 2024


@vmoens (Contributor) commented Sep 17, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: 9844f2db8f2c20928ec17e6f822b9729e82eced7
Pull Request resolved: #1000
@facebook-github-bot added the CLA Signed label Sep 17, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}8$. Worsened: $\large\color{#d91a1a}22$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 39.9860μs 20.8496μs 47.9625 KOps/s 47.8771 KOps/s $\color{#35bf28}+0.18\%$
test_plain_set_stack_nested 57.7190μs 20.9416μs 47.7519 KOps/s 47.3602 KOps/s $\color{#35bf28}+0.83\%$
test_plain_set_nested_inplace 54.2530μs 22.5223μs 44.4005 KOps/s 44.1538 KOps/s $\color{#35bf28}+0.56\%$
test_plain_set_stack_nested_inplace 53.7710μs 22.4959μs 44.4526 KOps/s 44.0131 KOps/s $\color{#35bf28}+1.00\%$
test_items 45.0550μs 4.1649μs 240.1012 KOps/s 242.7182 KOps/s $\color{#d91a1a}-1.08\%$
test_items_nested 0.6800ms 0.3564ms 2.8059 KOps/s 2.8005 KOps/s $\color{#35bf28}+0.19\%$
test_items_nested_locked 0.7138ms 0.3593ms 2.7830 KOps/s 2.7907 KOps/s $\color{#d91a1a}-0.28\%$
test_items_nested_leaf 0.1324ms 69.1162μs 14.4684 KOps/s 14.5957 KOps/s $\color{#d91a1a}-0.87\%$
test_items_stack_nested 0.6415ms 0.3584ms 2.7902 KOps/s 2.7546 KOps/s $\color{#35bf28}+1.29\%$
test_items_stack_nested_leaf 0.1479ms 70.7606μs 14.1322 KOps/s 14.1005 KOps/s $\color{#35bf28}+0.22\%$
test_items_stack_nested_locked 0.4950ms 0.3602ms 2.7760 KOps/s 2.7662 KOps/s $\color{#35bf28}+0.35\%$
test_keys 30.7980μs 3.5169μs 284.3407 KOps/s 275.7602 KOps/s $\color{#35bf28}+3.11\%$
test_keys_nested 0.1968ms 0.1032ms 9.6901 KOps/s 10.0341 KOps/s $\color{#d91a1a}-3.43\%$
test_keys_nested_locked 1.6080ms 0.1086ms 9.2075 KOps/s 9.4064 KOps/s $\color{#d91a1a}-2.11\%$
test_keys_nested_leaf 0.1635ms 85.8466μs 11.6487 KOps/s 11.8947 KOps/s $\color{#d91a1a}-2.07\%$
test_keys_stack_nested 0.1744ms 0.1033ms 9.6833 KOps/s 10.0150 KOps/s $\color{#d91a1a}-3.31\%$
test_keys_stack_nested_leaf 0.1425ms 85.1189μs 11.7483 KOps/s 12.0688 KOps/s $\color{#d91a1a}-2.66\%$
test_keys_stack_nested_locked 0.2282ms 0.1093ms 9.1458 KOps/s 9.4354 KOps/s $\color{#d91a1a}-3.07\%$
test_values 6.0614μs 1.0881μs 918.9916 KOps/s 890.3339 KOps/s $\color{#35bf28}+3.22\%$
test_values_nested 0.1569ms 74.3594μs 13.4482 KOps/s 13.6877 KOps/s $\color{#d91a1a}-1.75\%$
test_values_nested_locked 0.1292ms 73.5556μs 13.5952 KOps/s 13.7356 KOps/s $\color{#d91a1a}-1.02\%$
test_values_nested_leaf 0.1260ms 62.9920μs 15.8750 KOps/s 15.9441 KOps/s $\color{#d91a1a}-0.43\%$
test_values_stack_nested 0.1336ms 75.1354μs 13.3093 KOps/s 13.5951 KOps/s $\color{#d91a1a}-2.10\%$
test_values_stack_nested_leaf 0.1081ms 62.8241μs 15.9175 KOps/s 16.4448 KOps/s $\color{#d91a1a}-3.21\%$
test_values_stack_nested_locked 0.1362ms 74.9968μs 13.3339 KOps/s 13.2781 KOps/s $\color{#35bf28}+0.42\%$
test_membership 4.2094μs 0.7312μs 1.3677 MOps/s 1.4141 MOps/s $\color{#d91a1a}-3.28\%$
test_membership_nested 28.5740μs 2.7107μs 368.9067 KOps/s 372.0468 KOps/s $\color{#d91a1a}-0.84\%$
test_membership_nested_leaf 27.4020μs 2.7398μs 364.9963 KOps/s 371.5449 KOps/s $\color{#d91a1a}-1.76\%$
test_membership_stacked_nested 30.8180μs 2.7596μs 362.3691 KOps/s 371.1698 KOps/s $\color{#d91a1a}-2.37\%$
test_membership_stacked_nested_leaf 23.7340μs 2.7554μs 362.9197 KOps/s 374.2152 KOps/s $\color{#d91a1a}-3.02\%$
test_membership_nested_last 39.5040μs 3.8905μs 257.0354 KOps/s 257.9925 KOps/s $\color{#d91a1a}-0.37\%$
test_membership_nested_leaf_last 33.1620μs 3.8901μs 257.0634 KOps/s 259.0334 KOps/s $\color{#d91a1a}-0.76\%$
test_membership_stacked_nested_last 32.3010μs 3.9084μs 255.8571 KOps/s 202.3137 KOps/s $\textbf{\color{#35bf28}+26.47\%}$
test_membership_stacked_nested_leaf_last 29.2850μs 3.9497μs 253.1832 KOps/s 202.9056 KOps/s $\textbf{\color{#35bf28}+24.78\%}$
test_nested_getleaf 54.4330μs 10.6264μs 94.1055 KOps/s 94.3567 KOps/s $\color{#d91a1a}-0.27\%$
test_nested_get 35.1660μs 10.0959μs 99.0499 KOps/s 99.8085 KOps/s $\color{#d91a1a}-0.76\%$
test_stacked_getleaf 39.9750μs 10.7579μs 92.9548 KOps/s 95.2756 KOps/s $\color{#d91a1a}-2.44\%$
test_stacked_get 38.4920μs 10.0133μs 99.8676 KOps/s 99.6220 KOps/s $\color{#35bf28}+0.25\%$
test_nested_getitemleaf 37.2000μs 10.9959μs 90.9430 KOps/s 89.9824 KOps/s $\color{#35bf28}+1.07\%$
test_nested_getitem 33.2130μs 10.2755μs 97.3191 KOps/s 96.6527 KOps/s $\color{#35bf28}+0.69\%$
test_stacked_getitemleaf 37.2000μs 11.0285μs 90.6739 KOps/s 90.4387 KOps/s $\color{#35bf28}+0.26\%$
test_stacked_getitem 29.5160μs 10.3715μs 96.4181 KOps/s 96.3673 KOps/s $\color{#35bf28}+0.05\%$
test_lock_nested 88.4119ms 0.5875ms 1.7021 KOps/s 2.1079 KOps/s $\textbf{\color{#d91a1a}-19.25\%}$
test_lock_stack_nested 0.5949ms 0.4721ms 2.1184 KOps/s 2.2402 KOps/s $\textbf{\color{#d91a1a}-5.44\%}$
test_unlock_nested 91.0526ms 0.5154ms 1.9404 KOps/s 2.5024 KOps/s $\textbf{\color{#d91a1a}-22.46\%}$
test_unlock_stack_nested 0.4704ms 0.3852ms 2.5960 KOps/s 2.7090 KOps/s $\color{#d91a1a}-4.17\%$
test_flatten_speed 0.3407ms 90.3682μs 11.0658 KOps/s 11.5566 KOps/s $\color{#d91a1a}-4.25\%$
test_unflatten_speed 0.7017ms 0.4776ms 2.0938 KOps/s 2.1260 KOps/s $\color{#d91a1a}-1.51\%$
test_common_ops 4.7622ms 1.1228ms 890.5986 Ops/s 897.5745 Ops/s $\color{#d91a1a}-0.78\%$
test_creation 21.9320μs 2.0372μs 490.8773 KOps/s 479.9235 KOps/s $\color{#35bf28}+2.28\%$
test_creation_empty 48.5520μs 18.4888μs 54.0869 KOps/s 51.9914 KOps/s $\color{#35bf28}+4.03\%$
test_creation_nested_1 61.8370μs 21.6195μs 46.2544 KOps/s 44.5408 KOps/s $\color{#35bf28}+3.85\%$
test_creation_nested_2 67.5770μs 25.5297μs 39.1701 KOps/s 37.3068 KOps/s $\color{#35bf28}+4.99\%$
test_clone 0.1195ms 17.8729μs 55.9506 KOps/s 58.5768 KOps/s $\color{#d91a1a}-4.48\%$
test_getitem[int] 1.0787ms 16.7413μs 59.7326 KOps/s 62.0941 KOps/s $\color{#d91a1a}-3.80\%$
test_getitem[slice_int] 0.1324ms 31.6370μs 31.6086 KOps/s 33.4927 KOps/s $\textbf{\color{#d91a1a}-5.63\%}$
test_getitem[range] 0.2552ms 65.4262μs 15.2844 KOps/s 17.2242 KOps/s $\textbf{\color{#d91a1a}-11.26\%}$
test_getitem[tuple] 0.1263ms 25.5911μs 39.0760 KOps/s 40.9522 KOps/s $\color{#d91a1a}-4.58\%$
test_getitem[list] 0.1792ms 54.7957μs 18.2496 KOps/s 18.6404 KOps/s $\color{#d91a1a}-2.10\%$
test_setitem_dim[int] 60.0030μs 32.7785μs 30.5078 KOps/s 32.6646 KOps/s $\textbf{\color{#d91a1a}-6.60\%}$
test_setitem_dim[slice_int] 0.1099ms 62.1751μs 16.0836 KOps/s 16.8442 KOps/s $\color{#d91a1a}-4.52\%$
test_setitem_dim[range] 0.1338ms 84.6283μs 11.8164 KOps/s 12.0311 KOps/s $\color{#d91a1a}-1.79\%$
test_setitem_dim[tuple] 77.2850μs 49.2333μs 20.3115 KOps/s 21.0338 KOps/s $\color{#d91a1a}-3.43\%$
test_setitem 0.1693ms 30.2953μs 33.0085 KOps/s 32.9598 KOps/s $\color{#35bf28}+0.15\%$
test_set 0.1010ms 29.4194μs 33.9911 KOps/s 33.6858 KOps/s $\color{#35bf28}+0.91\%$
test_set_shared 1.3139ms 0.2112ms 4.7344 KOps/s 4.7347 KOps/s $-0.01\%$
test_update 94.4580μs 36.5394μs 27.3678 KOps/s 26.5144 KOps/s $\color{#35bf28}+3.22\%$
test_update_nested 0.1405ms 47.1691μs 21.2003 KOps/s 21.3804 KOps/s $\color{#d91a1a}-0.84\%$
test_update__nested 83.2460μs 35.2324μs 28.3829 KOps/s 29.6430 KOps/s $\color{#d91a1a}-4.25\%$
test_set_nested 81.2630μs 31.8969μs 31.3510 KOps/s 31.8117 KOps/s $\color{#d91a1a}-1.45\%$
test_set_nested_new 0.1138ms 37.1580μs 26.9121 KOps/s 26.6134 KOps/s $\color{#35bf28}+1.12\%$
test_select 0.1168ms 55.8706μs 17.8985 KOps/s 18.5569 KOps/s $\color{#d91a1a}-3.55\%$
test_select_nested 0.1333ms 59.7237μs 16.7438 KOps/s 16.9373 KOps/s $\color{#d91a1a}-1.14\%$
test_exclude_nested 0.1476ms 75.7170μs 13.2071 KOps/s 13.2276 KOps/s $\color{#d91a1a}-0.16\%$
test_empty[True] 0.4578ms 0.3187ms 3.1376 KOps/s 3.1507 KOps/s $\color{#d91a1a}-0.42\%$
test_empty[False] 7.3288μs 1.2157μs 822.5639 KOps/s 848.1301 KOps/s $\color{#d91a1a}-3.01\%$
test_unbind_speed 0.4790ms 0.3095ms 3.2310 KOps/s 3.3100 KOps/s $\color{#d91a1a}-2.39\%$
test_unbind_speed_stack0 0.5239ms 0.3068ms 3.2593 KOps/s 3.3602 KOps/s $\color{#d91a1a}-3.00\%$
test_unbind_speed_stack1 91.1446ms 0.8247ms 1.2126 KOps/s 1.3565 KOps/s $\textbf{\color{#d91a1a}-10.61\%}$
test_split 89.6945ms 2.2059ms 453.3225 Ops/s 464.8477 Ops/s $\color{#d91a1a}-2.48\%$
test_chunk 3.2221ms 2.0393ms 490.3625 Ops/s 469.4891 Ops/s $\color{#35bf28}+4.45\%$
test_creation[device0] 4.0637ms 0.1220ms 8.1961 KOps/s 8.6296 KOps/s $\textbf{\color{#d91a1a}-5.02\%}$
test_creation_from_tensor 0.2394ms 0.1182ms 8.4626 KOps/s 8.5033 KOps/s $\color{#d91a1a}-0.48\%$
test_add_one[memmap_tensor0] 0.2092ms 7.6192μs 131.2478 KOps/s 142.1256 KOps/s $\textbf{\color{#d91a1a}-7.65\%}$
test_contiguous[memmap_tensor0] 16.8020μs 1.9435μs 514.5393 KOps/s 540.1375 KOps/s $\color{#d91a1a}-4.74\%$
test_stack[memmap_tensor0] 37.3500μs 5.8959μs 169.6091 KOps/s 179.2993 KOps/s $\textbf{\color{#d91a1a}-5.40\%}$
test_memmaptd_index 0.6683ms 0.4115ms 2.4300 KOps/s 2.5386 KOps/s $\color{#d91a1a}-4.28\%$
test_memmaptd_index_astensor 0.9328ms 0.4886ms 2.0466 KOps/s 2.1260 KOps/s $\color{#d91a1a}-3.73\%$
test_memmaptd_index_op 1.7115ms 1.0381ms 963.3139 Ops/s 991.4408 Ops/s $\color{#d91a1a}-2.84\%$
test_serialize_model 0.2142s 0.1290s 7.7515 Ops/s 8.6695 Ops/s $\textbf{\color{#d91a1a}-10.59\%}$
test_serialize_model_pickle 0.4797s 0.3999s 2.5007 Ops/s 2.4842 Ops/s $\color{#35bf28}+0.66\%$
test_serialize_weights 0.1234s 0.1148s 8.7111 Ops/s 7.7078 Ops/s $\textbf{\color{#35bf28}+13.02\%}$
test_serialize_weights_returnearly 0.1750s 0.1603s 6.2371 Ops/s 6.2897 Ops/s $\color{#d91a1a}-0.84\%$
test_serialize_weights_pickle 0.4989s 0.4250s 2.3528 Ops/s 2.2805 Ops/s $\color{#35bf28}+3.17\%$
test_serialize_weights_filesystem 0.1441s 0.1407s 7.1050 Ops/s 7.1546 Ops/s $\color{#d91a1a}-0.69\%$
test_serialize_model_filesystem 0.1507s 0.1446s 6.9155 Ops/s 6.1282 Ops/s $\textbf{\color{#35bf28}+12.85\%}$
test_reshape_pytree 0.1321ms 38.7750μs 25.7898 KOps/s 25.9475 KOps/s $\color{#d91a1a}-0.61\%$
test_reshape_td 0.1113ms 45.6655μs 21.8984 KOps/s 22.1637 KOps/s $\color{#d91a1a}-1.20\%$
test_view_pytree 85.4710μs 38.1534μs 26.2100 KOps/s 25.5653 KOps/s $\color{#35bf28}+2.52\%$
test_view_td 0.1200ms 52.7386μs 18.9615 KOps/s 19.5330 KOps/s $\color{#d91a1a}-2.93\%$
test_unbind_pytree 91.0420μs 35.9817μs 27.7919 KOps/s 27.7389 KOps/s $\color{#35bf28}+0.19\%$
test_unbind_td 0.3346ms 46.0682μs 21.7069 KOps/s 22.2288 KOps/s $\color{#d91a1a}-2.35\%$
test_split_pytree 89.6990μs 38.3357μs 26.0854 KOps/s 26.3789 KOps/s $\color{#d91a1a}-1.11\%$
test_split_td 0.4764ms 58.8632μs 16.9886 KOps/s 17.9070 KOps/s $\textbf{\color{#d91a1a}-5.13\%}$
test_add_pytree 86.5030μs 45.3033μs 22.0734 KOps/s 22.3240 KOps/s $\color{#d91a1a}-1.12\%$
test_add_td 0.2168ms 82.5085μs 12.1200 KOps/s 12.1073 KOps/s $\color{#35bf28}+0.10\%$
test_compile_add_one_nested[tensordict-compile] 0.1155ms 57.7556μs 17.3144 KOps/s 17.9871 KOps/s $\color{#d91a1a}-3.74\%$
test_compile_add_one_nested[tensordict-eager] 0.3248ms 0.1763ms 5.6720 KOps/s 5.7081 KOps/s $\color{#d91a1a}-0.63\%$
test_compile_add_one_nested[pytree-compile] 0.1174ms 57.0515μs 17.5280 KOps/s 17.8373 KOps/s $\color{#d91a1a}-1.73\%$
test_compile_add_one_nested[pytree-eager] 0.2992ms 0.1439ms 6.9505 KOps/s 7.2055 KOps/s $\color{#d91a1a}-3.54\%$
test_compile_copy_nested[tensordict-compile] 64.5610μs 20.8398μs 47.9852 KOps/s 48.2811 KOps/s $\color{#d91a1a}-0.61\%$
test_compile_copy_nested[tensordict-eager] 0.1408ms 67.9395μs 14.7190 KOps/s 15.0136 KOps/s $\color{#d91a1a}-1.96\%$
test_compile_copy_nested[pytree-compile] 0.1673ms 76.0677μs 13.1462 KOps/s 13.3217 KOps/s $\color{#d91a1a}-1.32\%$
test_compile_copy_nested[pytree-eager] 0.1355ms 68.0301μs 14.6994 KOps/s 14.5310 KOps/s $\color{#35bf28}+1.16\%$
test_compile_add_one_flat[tensordict-compile] 0.2859ms 0.1751ms 5.7123 KOps/s 5.7511 KOps/s $\color{#d91a1a}-0.67\%$
test_compile_add_one_flat[tensordict-eager] 0.3404ms 0.1899ms 5.2666 KOps/s 5.3623 KOps/s $\color{#d91a1a}-1.78\%$
test_compile_add_one_flat[tensorclass-compile] 0.1443ms 47.6179μs 21.0005 KOps/s 20.2145 KOps/s $\color{#35bf28}+3.89\%$
test_compile_add_one_flat[tensorclass-eager] 0.1248ms 67.9813μs 14.7099 KOps/s 15.1592 KOps/s $\color{#d91a1a}-2.96\%$
test_compile_add_one_flat[pytree-compile] 0.3094ms 0.1753ms 5.7045 KOps/s 5.7721 KOps/s $\color{#d91a1a}-1.17\%$
test_compile_add_one_flat[pytree-eager] 0.5355ms 0.2963ms 3.3748 KOps/s 3.5714 KOps/s $\textbf{\color{#d91a1a}-5.50\%}$
test_compile_add_self_flat[tensordict-eager] 0.3301ms 0.2019ms 4.9530 KOps/s 5.0454 KOps/s $\color{#d91a1a}-1.83\%$
test_compile_add_self_flat[tensordict-compile] 0.3564ms 0.1765ms 5.6672 KOps/s 5.7351 KOps/s $\color{#d91a1a}-1.18\%$
test_compile_add_self_flat[tensorclass-eager] 0.1263ms 61.3895μs 16.2894 KOps/s 16.4937 KOps/s $\color{#d91a1a}-1.24\%$
test_compile_add_self_flat[tensorclass-compile] 0.1164ms 48.5277μs 20.6068 KOps/s 21.5698 KOps/s $\color{#d91a1a}-4.46\%$
test_compile_add_self_flat[pytree-eager] 0.4225ms 0.2370ms 4.2192 KOps/s 4.3328 KOps/s $\color{#d91a1a}-2.62\%$
test_compile_add_self_flat[pytree-compile] 0.3949ms 0.1754ms 5.7008 KOps/s 5.6567 KOps/s $\color{#35bf28}+0.78\%$
test_compile_copy_flat[tensordict-compile] 0.1816ms 0.1028ms 9.7269 KOps/s 9.8007 KOps/s $\color{#d91a1a}-0.75\%$
test_compile_copy_flat[tensordict-eager] 0.1508ms 59.4259μs 16.8277 KOps/s 16.9368 KOps/s $\color{#d91a1a}-0.64\%$
test_compile_copy_flat[pytree-compile] 0.1417ms 76.0870μs 13.1429 KOps/s 12.9197 KOps/s $\color{#35bf28}+1.73\%$
test_compile_copy_flat[pytree-eager] 0.1439ms 68.4719μs 14.6045 KOps/s 14.6955 KOps/s $\color{#d91a1a}-0.62\%$
test_compile_assign_and_add[tensordict-compile] 0.4153ms 0.2000ms 5.0010 KOps/s 5.1219 KOps/s $\color{#d91a1a}-2.36\%$
test_compile_assign_and_add[tensordict-eager] 1.9725ms 1.7065ms 585.9957 Ops/s 619.3588 Ops/s $\textbf{\color{#d91a1a}-5.39\%}$
test_compile_assign_and_add[pytree-compile] 0.2479ms 0.1934ms 5.1708 KOps/s 5.1610 KOps/s $\color{#35bf28}+0.19\%$
test_compile_assign_and_add[pytree-eager] 1.2075ms 1.1229ms 890.5871 Ops/s 928.4919 Ops/s $\color{#d91a1a}-4.08\%$
test_compile_assign_and_add_stack[compile] 0.5529ms 0.4179ms 2.3930 KOps/s 2.3620 KOps/s $\color{#35bf28}+1.31\%$
test_compile_assign_and_add_stack[eager] 5.0074ms 3.8256ms 261.3989 Ops/s 265.6817 Ops/s $\color{#d91a1a}-1.61\%$
test_compile_indexing[tensor-tensordict-compile] 0.1107ms 35.2803μs 28.3445 KOps/s 29.6127 KOps/s $\color{#d91a1a}-4.28\%$
test_compile_indexing[tensor-tensordict-eager] 0.5522ms 48.5760μs 20.5863 KOps/s 21.0464 KOps/s $\color{#d91a1a}-2.19\%$
test_compile_indexing[tensor-tensorclass-compile] 75.0020μs 30.4485μs 32.8424 KOps/s 34.3025 KOps/s $\color{#d91a1a}-4.26\%$
test_compile_indexing[tensor-tensorclass-eager] 98.6960μs 29.6953μs 33.6753 KOps/s 35.1701 KOps/s $\color{#d91a1a}-4.25\%$
test_compile_indexing[tensor-pytree-compile] 95.8800μs 30.4357μs 32.8561 KOps/s 33.1787 KOps/s $\color{#d91a1a}-0.97\%$
test_compile_indexing[tensor-pytree-eager] 76.9150μs 30.1221μs 33.1983 KOps/s 34.8930 KOps/s $\color{#d91a1a}-4.86\%$
test_compile_indexing[slice-tensordict-compile] 0.1314ms 74.8342μs 13.3629 KOps/s 13.9383 KOps/s $\color{#d91a1a}-4.13\%$
test_compile_indexing[slice-tensordict-eager] 0.5440ms 28.7800μs 34.7464 KOps/s 37.1575 KOps/s $\textbf{\color{#d91a1a}-6.49\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1336ms 70.2438μs 14.2361 KOps/s 14.7552 KOps/s $\color{#d91a1a}-3.52\%$
test_compile_indexing[slice-tensorclass-eager] 90.8920μs 23.5548μs 42.4542 KOps/s 42.9015 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_indexing[slice-pytree-compile] 0.1415ms 69.7387μs 14.3392 KOps/s 14.7369 KOps/s $\color{#d91a1a}-2.70\%$
test_compile_indexing[slice-pytree-eager] 77.5480μs 23.4290μs 42.6821 KOps/s 42.7628 KOps/s $\color{#d91a1a}-0.19\%$
test_compile_indexing[int-tensordict-compile] 0.1449ms 74.9377μs 13.3444 KOps/s 13.7623 KOps/s $\color{#d91a1a}-3.04\%$
test_compile_indexing[int-tensordict-eager] 0.8321ms 28.1266μs 35.5536 KOps/s 36.9511 KOps/s $\color{#d91a1a}-3.78\%$
test_compile_indexing[int-tensorclass-compile] 0.1401ms 70.2949μs 14.2258 KOps/s 14.6954 KOps/s $\color{#d91a1a}-3.20\%$
test_compile_indexing[int-tensorclass-eager] 0.2977ms 23.7379μs 42.1266 KOps/s 43.5166 KOps/s $\color{#d91a1a}-3.19\%$
test_compile_indexing[int-pytree-compile] 0.1817ms 70.6838μs 14.1475 KOps/s 14.7765 KOps/s $\color{#d91a1a}-4.26\%$
test_compile_indexing[int-pytree-eager] 59.1210μs 23.1283μs 43.2370 KOps/s 43.5921 KOps/s $\color{#d91a1a}-0.81\%$
test_mod_add[eager] 95.8110μs 25.4116μs 39.3521 KOps/s 38.7877 KOps/s $\color{#35bf28}+1.45\%$
test_mod_add[compile] 0.1084ms 39.4683μs 25.3368 KOps/s 25.6405 KOps/s $\color{#d91a1a}-1.18\%$
test_mod_add[compile-overhead] 92.4040μs 39.2437μs 25.4818 KOps/s 25.5146 KOps/s $\color{#d91a1a}-0.13\%$
test_mod_wrap[eager] 0.4320ms 0.2111ms 4.7364 KOps/s 4.8900 KOps/s $\color{#d91a1a}-3.14\%$
test_mod_wrap[compile] 0.3931ms 0.2329ms 4.2942 KOps/s 4.3276 KOps/s $\color{#d91a1a}-0.77\%$
test_mod_wrap[compile-overhead] 0.4282ms 0.2315ms 4.3197 KOps/s 4.3349 KOps/s $\color{#d91a1a}-0.35\%$
test_mod_wrap_and_backward[eager] 14.1959ms 10.9949ms 90.9513 Ops/s 91.4349 Ops/s $\color{#d91a1a}-0.53\%$
test_mod_wrap_and_backward[compile] 12.1007ms 10.8819ms 91.8955 Ops/s 93.3756 Ops/s $\color{#d91a1a}-1.59\%$
test_mod_wrap_and_backward[compile-overhead] 12.3760ms 10.9014ms 91.7312 Ops/s 93.1510 Ops/s $\color{#d91a1a}-1.52\%$
test_seq_add[eager] 0.1823ms 92.6395μs 10.7945 KOps/s 10.8124 KOps/s $\color{#d91a1a}-0.17\%$
test_seq_add[compile] 0.1322ms 65.6215μs 15.2389 KOps/s 15.3815 KOps/s $\color{#d91a1a}-0.93\%$
test_seq_add[compile-overhead] 0.1341ms 64.5872μs 15.4829 KOps/s 15.9781 KOps/s $\color{#d91a1a}-3.10\%$
test_seq_wrap[eager] 0.6455ms 0.3883ms 2.5752 KOps/s 2.6119 KOps/s $\color{#d91a1a}-1.40\%$
test_seq_wrap[compile] 1.3145ms 0.2708ms 3.6929 KOps/s 3.7121 KOps/s $\color{#d91a1a}-0.52\%$
test_seq_wrap[compile-overhead] 1.3907ms 0.2706ms 3.6949 KOps/s 3.7443 KOps/s $\color{#d91a1a}-1.32\%$
test_func_call_runtime[False-eager] 0.9428ms 0.5314ms 1.8819 KOps/s 2.0004 KOps/s $\textbf{\color{#d91a1a}-5.93\%}$
test_func_call_runtime[False-compile] 0.9167ms 0.4989ms 2.0043 KOps/s 2.0024 KOps/s $\color{#35bf28}+0.09\%$
test_func_call_runtime[False-compile-overhead] 0.9259ms 0.4979ms 2.0085 KOps/s 1.9938 KOps/s $\color{#35bf28}+0.74\%$
test_func_call_runtime[True-eager] 1.5963ms 0.7378ms 1.3553 KOps/s 1.4024 KOps/s $\color{#d91a1a}-3.36\%$
test_func_call_runtime[True-compile] 0.8920ms 0.5133ms 1.9481 KOps/s 1.9632 KOps/s $\color{#d91a1a}-0.77\%$
test_func_call_runtime[True-compile-overhead] 0.8707ms 0.5126ms 1.9507 KOps/s 1.9717 KOps/s $\color{#d91a1a}-1.07\%$
test_func_call_cm_runtime[False-eager] 0.6734ms 0.5233ms 1.9111 KOps/s 2.0365 KOps/s $\textbf{\color{#d91a1a}-6.16\%}$
test_func_call_cm_runtime[False-compile] 0.8067ms 0.5009ms 1.9963 KOps/s 2.0155 KOps/s $\color{#d91a1a}-0.95\%$
test_func_call_cm_runtime[False-compile-overhead] 0.9336ms 0.5003ms 1.9988 KOps/s 2.0282 KOps/s $\color{#d91a1a}-1.45\%$
test_func_call_cm_runtime[True-eager] 1.0277ms 0.8686ms 1.1513 KOps/s 1.1996 KOps/s $\color{#d91a1a}-4.03\%$
test_func_call_cm_runtime[True-compile] 0.9077ms 0.7428ms 1.3462 KOps/s 1.4150 KOps/s $\color{#d91a1a}-4.86\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1788ms 0.7517ms 1.3304 KOps/s 1.4116 KOps/s $\textbf{\color{#d91a1a}-5.75\%}$
test_vmap_func_call_cm_runtime[eager] 2.4678ms 1.8695ms 534.8919 Ops/s 545.4042 Ops/s $\color{#d91a1a}-1.93\%$
test_vmap_func_call_cm_runtime[compile] 2.6412ms 1.9321ms 517.5808 Ops/s 527.8031 Ops/s $\color{#d91a1a}-1.94\%$
test_vmap_func_call_cm_runtime[compile-overhead] 2.6807ms 1.9194ms 521.0031 Ops/s 530.5067 Ops/s $\color{#d91a1a}-1.79\%$
test_distributed 0.2614ms 0.1242ms 8.0530 KOps/s 7.9608 KOps/s $\color{#35bf28}+1.16\%$
test_tdmodule 72.5460μs 18.6130μs 53.7259 KOps/s 51.8212 KOps/s $\color{#35bf28}+3.68\%$
test_tdmodule_dispatch 63.6700μs 36.3179μs 27.5346 KOps/s 26.2556 KOps/s $\color{#35bf28}+4.87\%$
test_tdseq 36.2180μs 20.2160μs 49.4657 KOps/s 47.6994 KOps/s $\color{#35bf28}+3.70\%$
test_tdseq_dispatch 58.4000μs 41.1974μs 24.2734 KOps/s 23.7526 KOps/s $\color{#35bf28}+2.19\%$
test_instantiation_functorch 2.4101ms 1.6369ms 610.9271 Ops/s 634.9073 Ops/s $\color{#d91a1a}-3.78\%$
test_instantiation_td 1.9301ms 1.2041ms 830.5188 Ops/s 839.2915 Ops/s $\color{#d91a1a}-1.05\%$
test_exec_functorch 0.3488ms 0.1901ms 5.2604 KOps/s 5.4211 KOps/s $\color{#d91a1a}-2.97\%$
test_exec_functional_call 0.3360ms 0.1821ms 5.4910 KOps/s 5.9842 KOps/s $\textbf{\color{#d91a1a}-8.24\%}$
test_exec_td 0.3055ms 0.1766ms 5.6633 KOps/s 6.1464 KOps/s $\textbf{\color{#d91a1a}-7.86\%}$
test_exec_td_decorator 1.1334ms 0.2289ms 4.3685 KOps/s 4.6704 KOps/s $\textbf{\color{#d91a1a}-6.46\%}$
test_vmap_mlp_speed[True-True] 0.9959ms 0.6518ms 1.5341 KOps/s 1.5626 KOps/s $\color{#d91a1a}-1.82\%$
test_vmap_mlp_speed[True-False] 0.8795ms 0.6460ms 1.5479 KOps/s 1.5622 KOps/s $\color{#d91a1a}-0.91\%$
test_vmap_mlp_speed[False-True] 0.8074ms 0.5033ms 1.9870 KOps/s 2.0336 KOps/s $\color{#d91a1a}-2.29\%$
test_vmap_mlp_speed[False-False] 0.7431ms 0.5019ms 1.9922 KOps/s 2.0477 KOps/s $\color{#d91a1a}-2.71\%$
test_vmap_mlp_speed_decorator[True-True] 1.5179ms 0.6250ms 1.6000 KOps/s 1.5657 KOps/s $\color{#35bf28}+2.19\%$
test_vmap_mlp_speed_decorator[True-False] 1.1114ms 0.6305ms 1.5860 KOps/s 1.6266 KOps/s $\color{#d91a1a}-2.49\%$
test_vmap_mlp_speed_decorator[False-True] 0.9110ms 0.5143ms 1.9444 KOps/s 1.9715 KOps/s $\color{#d91a1a}-1.37\%$
test_vmap_mlp_speed_decorator[False-False] 0.7165ms 0.5127ms 1.9504 KOps/s 1.9854 KOps/s $\color{#d91a1a}-1.77\%$
test_to_module_speed[True] 1.9095ms 1.3099ms 763.4241 Ops/s 774.9126 Ops/s $\color{#d91a1a}-1.48\%$
test_to_module_speed[False] 2.1150ms 1.2667ms 789.4354 Ops/s 805.1605 Ops/s $\color{#d91a1a}-1.95\%$
test_tc_init 91.8730μs 43.4079μs 23.0373 KOps/s 21.9321 KOps/s $\textbf{\color{#35bf28}+5.04\%}$
test_tc_init_nested 0.1625ms 85.6268μs 11.6786 KOps/s 10.8194 KOps/s $\textbf{\color{#35bf28}+7.94\%}$
test_tc_first_layer_tensor 32.7010μs 1.5692μs 637.2598 KOps/s 628.0657 KOps/s $\color{#35bf28}+1.46\%$
test_tc_first_layer_nontensor 20.5690μs 4.8114μs 207.8395 KOps/s 214.0838 KOps/s $\color{#d91a1a}-2.92\%$
test_tc_second_layer_tensor 22.5330μs 2.8583μs 349.8638 KOps/s 349.4909 KOps/s $\color{#35bf28}+0.11\%$
test_tc_second_layer_nontensor 27.4620μs 6.1227μs 163.3263 KOps/s 166.2319 KOps/s $\color{#d91a1a}-1.75\%$
test_unbind 0.4694s 13.1548ms 76.0180 Ops/s 68.3321 Ops/s $\textbf{\color{#35bf28}+11.25\%}$
test_full_like 7.7732ms 6.7935ms 147.1992 Ops/s 146.9483 Ops/s $\color{#35bf28}+0.17\%$
test_zeros_like 3.1752ms 2.6768ms 373.5873 Ops/s 366.5454 Ops/s $\color{#35bf28}+1.92\%$
test_ones_like 11.7562ms 5.7572ms 173.6968 Ops/s 160.7539 Ops/s $\textbf{\color{#35bf28}+8.05\%}$
test_clone 14.2594ms 7.5580ms 132.3103 Ops/s 127.4759 Ops/s $\color{#35bf28}+3.79\%$
test_squeeze 70.4420μs 12.6801μs 78.8636 KOps/s 80.2592 KOps/s $\color{#d91a1a}-1.74\%$
test_unsqueeze 0.2204ms 95.6556μs 10.4542 KOps/s 11.0601 KOps/s $\textbf{\color{#d91a1a}-5.48\%}$
test_split 0.4858ms 0.2012ms 4.9693 KOps/s 5.2172 KOps/s $\color{#d91a1a}-4.75\%$
test_permute 0.3864ms 0.2266ms 4.4128 KOps/s 4.5571 KOps/s $\color{#d91a1a}-3.17\%$
test_stack 30.6994ms 24.3538ms 41.0613 Ops/s 39.2619 Ops/s $\color{#35bf28}+4.58\%$
test_cat 29.2335ms 24.1307ms 41.4409 Ops/s 39.4813 Ops/s $\color{#35bf28}+4.96\%$
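
For reading the table: the Change column appears to be the relative difference between the Ops and Ops on Repo HEAD columns, and the bold entries (the ones the Improved/Worsened totals seem to count) look like changes beyond roughly ±5%. Both points are inferred from the numbers above, not from the benchmark tooling itself, and the function names below are hypothetical. A minimal sketch:

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change in throughput of this PR relative to repo HEAD."""
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption read off the table."""
    if change_pct > threshold_pct:
        return "improved"
    if change_pct < -threshold_pct:
        return "worsened"
    return "within noise"


# (Ops, Ops on Repo HEAD) in KOps/s, copied from three rows of the CPU table.
rows = {
    "test_plain_set_nested": (47.9625, 47.8771),
    "test_lock_nested": (1.7021, 2.1079),
    "test_membership_stacked_nested_last": (255.8571, 202.3137),
}

results = {}
for name, (ops_pr, ops_head) in rows.items():
    change = relative_change(ops_pr, ops_head)
    results[name] = (change, classify(change))
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Because the inputs are the rounded values printed in the table, the recomputed percentages can differ from the reported ones in the last digit.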


$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}16$. Worsened: $\large\color{#d91a1a}4$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.6122ms 14.0154μs 71.3499 KOps/s 72.5335 KOps/s $\color{#d91a1a}-1.63\%$
test_plain_set_stack_nested 43.0230μs 14.2309μs 70.2695 KOps/s 73.0826 KOps/s $\color{#d91a1a}-3.85\%$
test_plain_set_nested_inplace 39.5020μs 14.9875μs 66.7222 KOps/s 67.6447 KOps/s $\color{#d91a1a}-1.36\%$
test_plain_set_stack_nested_inplace 59.0140μs 15.0612μs 66.3958 KOps/s 68.9144 KOps/s $\color{#d91a1a}-3.65\%$
test_items 43.3720μs 2.9215μs 342.2860 KOps/s 347.4679 KOps/s $\color{#d91a1a}-1.49\%$
test_items_nested 0.3812ms 0.3292ms 3.0380 KOps/s 3.0584 KOps/s $\color{#d91a1a}-0.67\%$
test_items_nested_locked 0.3979ms 0.3280ms 3.0490 KOps/s 3.0590 KOps/s $\color{#d91a1a}-0.32\%$
test_items_nested_leaf 86.8550μs 55.9081μs 17.8865 KOps/s 17.8563 KOps/s $\color{#35bf28}+0.17\%$
test_items_stack_nested 0.3911ms 0.3325ms 3.0071 KOps/s 3.0666 KOps/s $\color{#d91a1a}-1.94\%$
test_items_stack_nested_leaf 97.1660μs 57.2390μs 17.4706 KOps/s 17.9142 KOps/s $\color{#d91a1a}-2.48\%$
test_items_stack_nested_locked 0.3957ms 0.3299ms 3.0317 KOps/s 3.0569 KOps/s $\color{#d91a1a}-0.83\%$
test_keys 27.3320μs 3.3918μs 294.8245 KOps/s 275.7192 KOps/s $\textbf{\color{#35bf28}+6.93\%}$
test_keys_nested 93.0160μs 56.1907μs 17.7965 KOps/s 18.2234 KOps/s $\color{#d91a1a}-2.34\%$
test_keys_nested_locked 2.3082ms 62.9927μs 15.8749 KOps/s 16.1561 KOps/s $\color{#d91a1a}-1.74\%$
test_keys_nested_leaf 75.8350μs 47.8694μs 20.8902 KOps/s 21.5464 KOps/s $\color{#d91a1a}-3.05\%$
test_keys_stack_nested 99.1060μs 56.7505μs 17.6210 KOps/s 17.6837 KOps/s $\color{#d91a1a}-0.35\%$
test_keys_stack_nested_leaf 0.1077ms 47.6105μs 21.0038 KOps/s 21.0290 KOps/s $\color{#d91a1a}-0.12\%$
test_keys_stack_nested_locked 95.6660μs 61.7696μs 16.1892 KOps/s 16.2700 KOps/s $\color{#d91a1a}-0.50\%$
test_values 5.9488μs 0.8547μs 1.1700 MOps/s 1.1945 MOps/s $\color{#d91a1a}-2.05\%$
test_values_nested 69.4440μs 41.0710μs 24.3481 KOps/s 24.5416 KOps/s $\color{#d91a1a}-0.79\%$
test_values_nested_locked 68.9640μs 43.0224μs 23.2437 KOps/s 23.4472 KOps/s $\color{#d91a1a}-0.87\%$
test_values_nested_leaf 67.1140μs 35.3950μs 28.2525 KOps/s 28.2278 KOps/s $\color{#35bf28}+0.09\%$
test_values_stack_nested 79.4650μs 42.0714μs 23.7691 KOps/s 24.3444 KOps/s $\color{#d91a1a}-2.36\%$
test_values_stack_nested_leaf 59.6040μs 36.1297μs 27.6781 KOps/s 28.1920 KOps/s $\color{#d91a1a}-1.82\%$
test_values_stack_nested_locked 67.5940μs 43.8104μs 22.8256 KOps/s 23.3611 KOps/s $\color{#d91a1a}-2.29\%$
test_membership 1.8261μs 0.5050μs 1.9804 MOps/s 1.9614 MOps/s $\color{#35bf28}+0.97\%$
test_membership_nested 14.7110μs 1.8465μs 541.5749 KOps/s 542.4229 KOps/s $\color{#d91a1a}-0.16\%$
test_membership_nested_leaf 14.7143μs 1.8208μs 549.2215 KOps/s 557.6604 KOps/s $\color{#d91a1a}-1.51\%$
test_membership_stacked_nested 26.0220μs 1.8910μs 528.8250 KOps/s 543.4608 KOps/s $\color{#d91a1a}-2.69\%$
test_membership_stacked_nested_leaf 31.5520μs 1.8709μs 534.5089 KOps/s 541.2167 KOps/s $\color{#d91a1a}-1.24\%$
test_membership_nested_last 30.5110μs 2.7220μs 367.3771 KOps/s 372.4263 KOps/s $\color{#d91a1a}-1.36\%$
test_membership_nested_leaf_last 29.2820μs 2.7329μs 365.9156 KOps/s 369.0482 KOps/s $\color{#d91a1a}-0.85\%$
test_membership_stacked_nested_last 24.2310μs 2.6984μs 370.5857 KOps/s 324.9700 KOps/s $\textbf{\color{#35bf28}+14.04\%}$
test_membership_stacked_nested_leaf_last 32.7020μs 2.7520μs 363.3786 KOps/s 322.6647 KOps/s $\textbf{\color{#35bf28}+12.62\%}$
test_nested_getleaf 34.2720μs 6.0526μs 165.2191 KOps/s 165.9491 KOps/s $\color{#d91a1a}-0.44\%$
test_nested_get 40.2720μs 5.7215μs 174.7803 KOps/s 176.6353 KOps/s $\color{#d91a1a}-1.05\%$
test_stacked_getleaf 29.9420μs 6.1024μs 163.8702 KOps/s 166.6252 KOps/s $\color{#d91a1a}-1.65\%$
test_stacked_get 43.6130μs 5.7084μs 175.1802 KOps/s 176.9800 KOps/s $\color{#d91a1a}-1.02\%$
test_nested_getitemleaf 36.3520μs 6.1867μs 161.6375 KOps/s 162.7892 KOps/s $\color{#d91a1a}-0.71\%$
test_nested_getitem 31.5620μs 5.7545μs 173.7768 KOps/s 174.0689 KOps/s $\color{#d91a1a}-0.17\%$
test_stacked_getitemleaf 29.9620μs 6.1701μs 162.0730 KOps/s 163.0150 KOps/s $\color{#d91a1a}-0.58\%$
test_stacked_getitem 0.5348ms 5.5850μs 179.0496 KOps/s 175.3827 KOps/s $\color{#35bf28}+2.09\%$
test_lock_nested 4.4613ms 0.4159ms 2.4042 KOps/s 2.4265 KOps/s $\color{#d91a1a}-0.92\%$
test_lock_stack_nested 0.4244ms 0.3788ms 2.6400 KOps/s 2.6687 KOps/s $\color{#d91a1a}-1.08\%$
test_unlock_nested 0.7281ms 0.3512ms 2.8471 KOps/s 2.8528 KOps/s $\color{#d91a1a}-0.20\%$
test_unlock_stack_nested 0.3723ms 0.3146ms 3.1785 KOps/s 3.1949 KOps/s $\color{#d91a1a}-0.51\%$
test_flatten_speed 0.1070ms 69.0028μs 14.4922 KOps/s 14.5990 KOps/s $\color{#d91a1a}-0.73\%$
test_unflatten_speed 0.3387ms 0.2803ms 3.5671 KOps/s 3.5759 KOps/s $\color{#d91a1a}-0.25\%$
test_common_ops 1.5266ms 1.2264ms 815.4100 Ops/s 818.7383 Ops/s $\color{#d91a1a}-0.41\%$
test_creation 27.0220μs 1.4324μs 698.1369 KOps/s 693.4942 KOps/s $\color{#35bf28}+0.67\%$
test_creation_empty 38.9420μs 15.2707μs 65.4847 KOps/s 66.3708 KOps/s $\color{#d91a1a}-1.34\%$
test_creation_nested_1 45.7030μs 16.8019μs 59.5170 KOps/s 58.7242 KOps/s $\color{#35bf28}+1.35\%$
test_creation_nested_2 50.9630μs 19.6283μs 50.9468 KOps/s 51.3388 KOps/s $\color{#d91a1a}-0.76\%$
test_clone 62.7130μs 28.4544μs 35.1439 KOps/s 35.5200 KOps/s $\color{#d91a1a}-1.06\%$
test_getitem[int] 92.2593ms 22.2559μs 44.9320 KOps/s 64.9886 KOps/s $\textbf{\color{#d91a1a}-30.86\%}$
test_getitem[slice_int] 0.1183ms 25.6500μs 38.9864 KOps/s 38.2965 KOps/s $\color{#35bf28}+1.80\%$
test_getitem[range] 0.2717ms 0.1079ms 9.2719 KOps/s 9.1604 KOps/s $\color{#35bf28}+1.22\%$
test_getitem[tuple] 0.1202ms 22.7614μs 43.9340 KOps/s 43.6797 KOps/s $\color{#35bf28}+0.58\%$
test_getitem[list] 0.1906ms 99.7969μs 10.0204 KOps/s 10.4338 KOps/s $\color{#d91a1a}-3.96\%$
test_setitem_dim[int] 68.1340μs 43.2212μs 23.1368 KOps/s 22.9760 KOps/s $\color{#35bf28}+0.70\%$
test_setitem_dim[slice_int] 0.1072ms 66.2013μs 15.1055 KOps/s 15.4680 KOps/s $\color{#d91a1a}-2.34\%$
test_setitem_dim[range] 0.1593ms 0.1266ms 7.8981 KOps/s 8.0840 KOps/s $\color{#d91a1a}-2.30\%$
test_setitem_dim[tuple] 86.3550μs 58.7187μs 17.0303 KOps/s 17.0055 KOps/s $\color{#35bf28}+0.15\%$
test_setitem 79.3140μs 41.4809μs 24.1075 KOps/s 24.8879 KOps/s $\color{#d91a1a}-3.14\%$
test_set 87.3250μs 40.5186μs 24.6800 KOps/s 25.0822 KOps/s $\color{#d91a1a}-1.60\%$
test_set_shared 0.3623ms 49.8376μs 20.0652 KOps/s 20.0563 KOps/s $\color{#35bf28}+0.04\%$
test_update 94.2750μs 49.3080μs 20.2807 KOps/s 20.9661 KOps/s $\color{#d91a1a}-3.27\%$
test_update_nested 95.4150μs 55.2936μs 18.0853 KOps/s 18.4257 KOps/s $\color{#d91a1a}-1.85\%$
test_update__nested 0.1114ms 58.0806μs 17.2175 KOps/s 17.2234 KOps/s $\color{#d91a1a}-0.03\%$
test_set_nested 81.0550μs 43.3358μs 23.0756 KOps/s 23.7690 KOps/s $\color{#d91a1a}-2.92\%$
test_set_nested_new 87.9950μs 46.6385μs 21.4415 KOps/s 21.9183 KOps/s $\color{#d91a1a}-2.18\%$
test_select 94.3860μs 60.3537μs 16.5690 KOps/s 17.0528 KOps/s $\color{#d91a1a}-2.84\%$
test_select_nested 0.3693ms 42.8170μs 23.3552 KOps/s 24.1576 KOps/s $\color{#d91a1a}-3.32\%$
test_exclude_nested 93.8560μs 57.4755μs 17.3987 KOps/s 17.4063 KOps/s $\color{#d91a1a}-0.04\%$
test_empty[True] 0.3037ms 0.2422ms 4.1284 KOps/s 4.1729 KOps/s $\color{#d91a1a}-1.06\%$
test_empty[False] 3.5502μs 0.7381μs 1.3548 MOps/s 1.3400 MOps/s $\color{#35bf28}+1.10\%$
test_to 51.9230μs 25.9162μs 38.5859 KOps/s 38.4391 KOps/s $\color{#35bf28}+0.38\%$
test_to_nonblocking 61.7140μs 24.6578μs 40.5551 KOps/s 41.1793 KOps/s $\color{#d91a1a}-1.52\%$
test_unbind_speed 0.3289ms 0.2734ms 3.6571 KOps/s 3.6655 KOps/s $\color{#d91a1a}-0.23\%$
test_unbind_speed_stack0 0.3394ms 0.2731ms 3.6618 KOps/s 3.7072 KOps/s $\color{#d91a1a}-1.23\%$
test_unbind_speed_stack1 92.4047ms 0.6989ms 1.4308 KOps/s 1.4129 KOps/s $\color{#35bf28}+1.26\%$
test_split 94.3258ms 2.0908ms 478.2809 Ops/s 470.5431 Ops/s $\color{#35bf28}+1.64\%$
test_chunk 95.0343ms 2.0668ms 483.8434 Ops/s 474.2487 Ops/s $\color{#35bf28}+2.02\%$
test_creation[device0] 0.3361ms 0.1230ms 8.1313 KOps/s 8.1007 KOps/s $\color{#35bf28}+0.38\%$
test_creation_from_tensor 0.3666ms 0.1262ms 7.9222 KOps/s 7.9708 KOps/s $\color{#d91a1a}-0.61\%$
test_add_one[memmap_tensor0] 0.2227ms 8.3897μs 119.1944 KOps/s 119.1913 KOps/s $+0.00\%$
test_contiguous[memmap_tensor0] 36.1520μs 2.1040μs 475.2789 KOps/s 482.8811 KOps/s $\color{#d91a1a}-1.57\%$
test_stack[memmap_tensor0] 36.0920μs 6.5874μs 151.8048 KOps/s 154.5608 KOps/s $\color{#d91a1a}-1.78\%$
test_memmaptd_index 1.2282ms 0.4055ms 2.4662 KOps/s 2.4824 KOps/s $\color{#d91a1a}-0.65\%$
test_memmaptd_index_astensor 0.7510ms 0.4593ms 2.1773 KOps/s 2.2085 KOps/s $\color{#d91a1a}-1.41\%$
test_memmaptd_index_op 1.3803ms 0.9803ms 1.0201 KOps/s 1.0375 KOps/s $\color{#d91a1a}-1.68\%$
test_serialize_model 0.1310s 0.1289s 7.7568 Ops/s 7.7502 Ops/s $\color{#35bf28}+0.09\%$
test_serialize_model_pickle 1.3518s 1.2120s 0.8251 Ops/s 0.8254 Ops/s $\color{#d91a1a}-0.04\%$
test_serialize_weights 0.1296s 0.1283s 7.7930 Ops/s 7.7916 Ops/s $\color{#35bf28}+0.02\%$
test_serialize_weights_returnearly 48.4091ms 45.4859ms 21.9848 Ops/s 18.1426 Ops/s $\textbf{\color{#35bf28}+21.18\%}$
test_serialize_weights_pickle 1.3511s 1.2126s 0.8247 Ops/s 0.8210 Ops/s $\color{#35bf28}+0.45\%$
test_reshape_pytree 81.6150μs 35.9242μs 27.8364 KOps/s 29.1201 KOps/s $\color{#d91a1a}-4.41\%$
test_reshape_td 77.4650μs 40.9150μs 24.4409 KOps/s 24.0996 KOps/s $\color{#35bf28}+1.42\%$
test_view_pytree 72.1440μs 33.4386μs 29.9056 KOps/s 29.5789 KOps/s $\color{#35bf28}+1.10\%$
test_view_td 86.6450μs 43.8848μs 22.7869 KOps/s 21.8568 KOps/s $\color{#35bf28}+4.26\%$
test_unbind_pytree 62.9340μs 33.5923μs 29.7687 KOps/s 30.2480 KOps/s $\color{#d91a1a}-1.58\%$
test_unbind_td 0.7032ms 40.6134μs 24.6224 KOps/s 23.6717 KOps/s $\color{#35bf28}+4.02\%$
test_split_pytree 88.5760μs 43.7380μs 22.8634 KOps/s 21.8507 KOps/s $\color{#35bf28}+4.63\%$
test_split_td 0.1742ms 52.7224μs 18.9673 KOps/s 16.1301 KOps/s $\textbf{\color{#35bf28}+17.59\%}$
test_add_pytree 0.1057ms 54.4163μs 18.3768 KOps/s 18.2329 KOps/s $\color{#35bf28}+0.79\%$
test_add_td 0.1239ms 85.4759μs 11.6992 KOps/s 11.2836 KOps/s $\color{#35bf28}+3.68\%$
test_compile_add_one_nested[tensordict-compile] 0.3959ms 0.2028ms 4.9306 KOps/s 4.8587 KOps/s $\color{#35bf28}+1.48\%$
test_compile_add_one_nested[tensordict-eager] 0.1981ms 0.1482ms 6.7474 KOps/s 6.7106 KOps/s $\color{#35bf28}+0.55\%$
test_compile_add_one_nested[pytree-compile] 0.1803ms 0.1404ms 7.1210 KOps/s 7.1953 KOps/s $\color{#d91a1a}-1.03\%$
test_compile_add_one_nested[pytree-eager] 0.2283ms 0.1778ms 5.6258 KOps/s 5.5805 KOps/s $\color{#35bf28}+0.81\%$
test_compile_copy_nested[tensordict-compile] 67.3440μs 20.7010μs 48.3068 KOps/s 47.8023 KOps/s $\color{#35bf28}+1.06\%$
test_compile_copy_nested[tensordict-eager] 87.1950μs 43.0912μs 23.2066 KOps/s 23.4213 KOps/s $\color{#d91a1a}-0.92\%$
test_compile_copy_nested[pytree-compile] 0.1135ms 63.8253μs 15.6678 KOps/s 15.8098 KOps/s $\color{#d91a1a}-0.90\%$
test_compile_copy_nested[pytree-eager] 93.3150μs 49.3561μs 20.2609 KOps/s 20.2912 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_add_one_flat[tensordict-compile] 0.3668ms 0.3040ms 3.2898 KOps/s 3.2808 KOps/s $\color{#35bf28}+0.27\%$
test_compile_add_one_flat[tensordict-eager] 0.2764ms 0.2062ms 4.8488 KOps/s 4.8191 KOps/s $\color{#35bf28}+0.62\%$
test_compile_add_one_flat[tensorclass-compile] 0.1720ms 0.1241ms 8.0611 KOps/s 8.1005 KOps/s $\color{#d91a1a}-0.49\%$
test_compile_add_one_flat[tensorclass-eager] 0.1145ms 57.6238μs 17.3539 KOps/s 17.3589 KOps/s $\color{#d91a1a}-0.03\%$
test_compile_add_one_flat[pytree-compile] 0.3488ms 0.3034ms 3.2963 KOps/s 3.2773 KOps/s $\color{#35bf28}+0.58\%$
test_compile_add_one_flat[pytree-eager] 0.6831ms 0.6051ms 1.6526 KOps/s 1.6592 KOps/s $\color{#d91a1a}-0.40\%$
test_compile_add_self_flat[tensordict-eager] 0.3070ms 0.2457ms 4.0701 KOps/s 4.0432 KOps/s $\color{#35bf28}+0.67\%$
test_compile_add_self_flat[tensordict-compile] 0.3546ms 0.3046ms 3.2831 KOps/s 3.2822 KOps/s $\color{#35bf28}+0.03\%$
test_compile_add_self_flat[tensorclass-eager] 0.1054ms 68.3924μs 14.6215 KOps/s 14.2304 KOps/s $\color{#35bf28}+2.75\%$
test_compile_add_self_flat[tensorclass-compile] 0.1662ms 0.1246ms 8.0234 KOps/s 8.0647 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_add_self_flat[pytree-eager] 0.5784ms 0.5203ms 1.9221 KOps/s 1.9323 KOps/s $\color{#d91a1a}-0.53\%$
test_compile_add_self_flat[pytree-compile] 0.3596ms 0.3030ms 3.3000 KOps/s 3.2949 KOps/s $\color{#35bf28}+0.15\%$
test_compile_copy_flat[tensordict-compile] 44.0930μs 17.9060μs 55.8473 KOps/s 54.5305 KOps/s $\color{#35bf28}+2.41\%$
test_compile_copy_flat[tensordict-eager] 67.1350μs 27.2787μs 36.6587 KOps/s 36.5251 KOps/s $\color{#35bf28}+0.37\%$
test_compile_copy_flat[pytree-compile] 96.9660μs 68.5310μs 14.5919 KOps/s 14.6456 KOps/s $\color{#d91a1a}-0.37\%$
test_compile_copy_flat[pytree-eager] 81.8650μs 50.8914μs 19.6497 KOps/s 19.5759 KOps/s $\color{#35bf28}+0.38\%$
test_compile_assign_and_add[tensordict-compile] 2.2958ms 0.7949ms 1.2580 KOps/s 1.1548 KOps/s $\textbf{\color{#35bf28}+8.94\%}$
test_compile_assign_and_add[tensordict-eager] 3.2743ms 3.1167ms 320.8476 Ops/s 325.8290 Ops/s $\color{#d91a1a}-1.53\%$
test_compile_assign_and_add[pytree-compile] 2.2300ms 0.8010ms 1.2485 KOps/s 1.1695 KOps/s $\textbf{\color{#35bf28}+6.76\%}$
test_compile_assign_and_add[pytree-eager] 3.2710ms 3.1406ms 318.4069 Ops/s 327.3765 Ops/s $\color{#d91a1a}-2.74\%$
test_compile_indexing[tensor-tensordict-compile] 0.1576ms 0.1064ms 9.3944 KOps/s 9.3856 KOps/s $\color{#35bf28}+0.09\%$
test_compile_indexing[tensor-tensordict-eager] 0.1828ms 57.5755μs 17.3685 KOps/s 15.8285 KOps/s $\textbf{\color{#35bf28}+9.73\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.1454ms 0.1006ms 9.9443 KOps/s 9.9450 KOps/s $-0.01\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1562ms 41.7227μs 23.9678 KOps/s 22.8352 KOps/s $\color{#35bf28}+4.96\%$
test_compile_indexing[tensor-pytree-compile] 0.1423ms 0.1013ms 9.8729 KOps/s 9.4573 KOps/s $\color{#35bf28}+4.39\%$
test_compile_indexing[tensor-pytree-eager] 83.3050μs 41.4171μs 24.1446 KOps/s 23.6388 KOps/s $\color{#35bf28}+2.14\%$
test_compile_indexing[slice-tensordict-compile] 0.1808ms 0.1337ms 7.4802 KOps/s 7.4978 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_indexing[slice-tensordict-eager] 0.1604ms 23.8210μs 41.9797 KOps/s 41.2324 KOps/s $\color{#35bf28}+1.81\%$
test_compile_indexing[slice-tensorclass-compile] 0.1760ms 0.1280ms 7.8127 KOps/s 7.8806 KOps/s $\color{#d91a1a}-0.86\%$
test_compile_indexing[slice-tensorclass-eager] 49.2930μs 19.9281μs 50.1804 KOps/s 50.1279 KOps/s $\color{#35bf28}+0.10\%$
test_compile_indexing[slice-pytree-compile] 0.1948ms 0.1285ms 7.7835 KOps/s 7.8347 KOps/s $\color{#d91a1a}-0.65\%$
test_compile_indexing[slice-pytree-eager] 52.2630μs 20.0929μs 49.7689 KOps/s 50.0492 KOps/s $\color{#d91a1a}-0.56\%$
test_compile_indexing[int-tensordict-compile] 0.2411ms 0.1349ms 7.4105 KOps/s 7.4767 KOps/s $\color{#d91a1a}-0.88\%$
test_compile_indexing[int-tensordict-eager] 0.5547ms 23.4072μs 42.7220 KOps/s 41.1332 KOps/s $\color{#35bf28}+3.86\%$
test_compile_indexing[int-tensorclass-compile] 0.1875ms 0.1281ms 7.8074 KOps/s 7.8269 KOps/s $\color{#d91a1a}-0.25\%$
test_compile_indexing[int-tensorclass-eager] 0.1577ms 21.6913μs 46.1013 KOps/s 49.1206 KOps/s $\textbf{\color{#d91a1a}-6.15\%}$
test_compile_indexing[int-pytree-compile] 0.1778ms 0.1280ms 7.8133 KOps/s 7.8169 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_indexing[int-pytree-eager] 48.3530μs 19.9936μs 50.0160 KOps/s 50.2891 KOps/s $\color{#d91a1a}-0.54\%$
test_mod_add[eager] 72.5850μs 30.8562μs 32.4084 KOps/s 32.2111 KOps/s $\color{#35bf28}+0.61\%$
test_mod_add[compile] 0.2989ms 67.9645μs 14.7136 KOps/s 14.3133 KOps/s $\color{#35bf28}+2.80\%$
test_mod_add[compile-overhead] 0.2622ms 0.1342ms 7.4507 KOps/s 7.2259 KOps/s $\color{#35bf28}+3.11\%$
test_mod_wrap[eager] 0.3408ms 0.2325ms 4.3012 KOps/s 4.1180 KOps/s $\color{#35bf28}+4.45\%$
test_mod_wrap[compile] 0.3701ms 0.2833ms 3.5298 KOps/s 3.4591 KOps/s $\color{#35bf28}+2.04\%$
test_mod_wrap[compile-overhead] 7.7772ms 4.0662ms 245.9276 Ops/s 247.7028 Ops/s $\color{#d91a1a}-0.72\%$
test_mod_wrap_and_backward[eager] 1.6003ms 1.3237ms 755.4392 Ops/s 705.4024 Ops/s $\textbf{\color{#35bf28}+7.09\%}$
test_mod_wrap_and_backward[compile] 1.5777ms 1.2814ms 780.3674 Ops/s 717.6850 Ops/s $\textbf{\color{#35bf28}+8.73\%}$
test_mod_wrap_and_backward[compile-overhead] 1.2975ms 0.8792ms 1.1374 KOps/s 961.4397 Ops/s $\textbf{\color{#35bf28}+18.30\%}$
test_seq_add[eager] 0.1386ms 94.8847μs 10.5391 KOps/s 10.5377 KOps/s $\color{#35bf28}+0.01\%$
test_seq_add[compile] 0.1377ms 77.9566μs 12.8276 KOps/s 12.8912 KOps/s $\color{#d91a1a}-0.49\%$
test_seq_add[compile-overhead] 0.1711ms 0.1129ms 8.8552 KOps/s 8.9496 KOps/s $\color{#d91a1a}-1.05\%$
test_seq_wrap[eager] 0.7664ms 0.3710ms 2.6955 KOps/s 2.6725 KOps/s $\color{#35bf28}+0.86\%$
test_seq_wrap[compile] 0.7349ms 0.3106ms 3.2200 KOps/s 3.2687 KOps/s $\color{#d91a1a}-1.49\%$
test_seq_wrap[compile-overhead] 0.6187ms 0.2162ms 4.6251 KOps/s 4.6289 KOps/s $\color{#d91a1a}-0.08\%$
test_func_call_runtime[False-eager] 1.1506ms 0.7305ms 1.3688 KOps/s 1.3887 KOps/s $\color{#d91a1a}-1.43\%$
test_func_call_runtime[False-compile] 1.1609ms 0.7678ms 1.3025 KOps/s 1.3070 KOps/s $\color{#d91a1a}-0.34\%$
test_func_call_runtime[False-compile-overhead] 0.7723ms 0.3471ms 2.8809 KOps/s 2.8849 KOps/s $\color{#d91a1a}-0.14\%$
test_func_call_runtime[True-eager] 0.9925ms 0.8585ms 1.1648 KOps/s 1.1403 KOps/s $\color{#35bf28}+2.14\%$
test_func_call_runtime[True-compile] 0.8898ms 0.7838ms 1.2758 KOps/s 1.2779 KOps/s $\color{#d91a1a}-0.17\%$
test_func_call_runtime[True-compile-overhead] 0.4838ms 0.3659ms 2.7331 KOps/s 2.7273 KOps/s $\color{#35bf28}+0.21\%$
test_func_call_cm_runtime[False-eager] 0.8927ms 0.7037ms 1.4210 KOps/s 1.3769 KOps/s $\color{#35bf28}+3.20\%$
test_func_call_cm_runtime[False-compile] 0.9113ms 0.7580ms 1.3193 KOps/s 1.2930 KOps/s $\color{#35bf28}+2.04\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4935ms 0.3491ms 2.8646 KOps/s 2.8602 KOps/s $\color{#35bf28}+0.15\%$
test_func_call_cm_runtime[True-eager] 1.1198ms 0.9575ms 1.0443 KOps/s 1.0227 KOps/s $\color{#35bf28}+2.12\%$
test_func_call_cm_runtime[True-compile] 1.0005ms 0.8074ms 1.2386 KOps/s 1.2311 KOps/s $\color{#35bf28}+0.61\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4399ms 0.3913ms 2.5556 KOps/s 2.5546 KOps/s $\color{#35bf28}+0.04\%$
test_vmap_func_call_cm_runtime[eager] 2.4815ms 2.0362ms 491.1076 Ops/s 488.5219 Ops/s $\color{#35bf28}+0.53\%$
test_vmap_func_call_cm_runtime[compile] 0.9997ms 0.8264ms 1.2100 KOps/s 1.2077 KOps/s $\color{#35bf28}+0.19\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.7916ms 0.3941ms 2.5371 KOps/s 2.5267 KOps/s $\color{#35bf28}+0.41\%$
test_distributed 0.6727ms 0.1517ms 6.5926 KOps/s 8.9282 KOps/s $\textbf{\color{#d91a1a}-26.16\%}$
test_tdmodule 33.4320μs 14.7600μs 67.7505 KOps/s 67.8616 KOps/s $\color{#d91a1a}-0.16\%$
test_tdmodule_dispatch 51.6830μs 28.6045μs 34.9596 KOps/s 34.9386 KOps/s $\color{#35bf28}+0.06\%$
test_tdseq 34.2920μs 15.1300μs 66.0940 KOps/s 64.9431 KOps/s $\color{#35bf28}+1.77\%$
test_tdseq_dispatch 59.4030μs 31.0521μs 32.2040 KOps/s 32.6141 KOps/s $\color{#d91a1a}-1.26\%$
test_instantiation_functorch 1.8780ms 1.7873ms 559.5175 Ops/s 565.4873 Ops/s $\color{#d91a1a}-1.06\%$
test_instantiation_td 1.7512ms 1.1575ms 863.9637 Ops/s 854.9924 Ops/s $\color{#35bf28}+1.05\%$
test_exec_functorch 0.2920ms 0.1991ms 5.0228 KOps/s 4.6454 KOps/s $\textbf{\color{#35bf28}+8.12\%}$
test_exec_functional_call 0.2845ms 0.1999ms 5.0018 KOps/s 4.6220 KOps/s $\textbf{\color{#35bf28}+8.22\%}$
test_exec_td 0.2482ms 0.2030ms 4.9270 KOps/s 4.4764 KOps/s $\textbf{\color{#35bf28}+10.07\%}$
test_exec_td_decorator 0.8244ms 0.2444ms 4.0913 KOps/s 3.8094 KOps/s $\textbf{\color{#35bf28}+7.40\%}$
test_vmap_mlp_speed[True-True] 0.8503ms 0.6738ms 1.4841 KOps/s 1.4648 KOps/s $\color{#35bf28}+1.31\%$
test_vmap_mlp_speed[True-False] 0.7611ms 0.6724ms 1.4871 KOps/s 1.4688 KOps/s $\color{#35bf28}+1.25\%$
test_vmap_mlp_speed[False-True] 0.6830ms 0.5678ms 1.7612 KOps/s 1.7657 KOps/s $\color{#d91a1a}-0.26\%$
test_vmap_mlp_speed[False-False] 0.6662ms 0.5642ms 1.7725 KOps/s 1.7441 KOps/s $\color{#35bf28}+1.63\%$
test_vmap_mlp_speed_decorator[True-True] 1.1763ms 0.6601ms 1.5149 KOps/s 1.4908 KOps/s $\color{#35bf28}+1.62\%$
test_vmap_mlp_speed_decorator[True-False] 0.7895ms 0.6592ms 1.5169 KOps/s 1.5108 KOps/s $\color{#35bf28}+0.40\%$
test_vmap_mlp_speed_decorator[False-True] 0.6997ms 0.5764ms 1.7348 KOps/s 1.7298 KOps/s $\color{#35bf28}+0.29\%$
test_vmap_mlp_speed_decorator[False-False] 0.7581ms 0.5782ms 1.7295 KOps/s 1.6845 KOps/s $\color{#35bf28}+2.67\%$
test_vmap_transformer_speed[True-True] 8.3390ms 8.1944ms 122.0345 Ops/s 120.7300 Ops/s $\color{#35bf28}+1.08\%$
test_vmap_transformer_speed[True-False] 8.2841ms 8.1866ms 122.1510 Ops/s 120.7509 Ops/s $\color{#35bf28}+1.16\%$
test_vmap_transformer_speed[False-True] 8.1288ms 8.0093ms 124.8541 Ops/s 123.4672 Ops/s $\color{#35bf28}+1.12\%$
test_vmap_transformer_speed[False-False] 8.2228ms 8.0031ms 124.9517 Ops/s 122.6404 Ops/s $\color{#35bf28}+1.88\%$
test_vmap_transformer_speed_decorator[True-True] 19.9352ms 19.4184ms 51.4975 Ops/s 51.8376 Ops/s $\color{#d91a1a}-0.66\%$
test_vmap_transformer_speed_decorator[True-False] 19.8120ms 19.2196ms 52.0302 Ops/s 51.8431 Ops/s $\color{#35bf28}+0.36\%$
test_vmap_transformer_speed_decorator[False-True] 20.2125ms 19.0341ms 52.5374 Ops/s 52.3598 Ops/s $\color{#35bf28}+0.34\%$
test_vmap_transformer_speed_decorator[False-False] 19.1409ms 19.0257ms 52.5606 Ops/s 52.3969 Ops/s $\color{#35bf28}+0.31\%$
test_to_module_speed[True] 1.2207ms 0.9326ms 1.0723 KOps/s 1.0855 KOps/s $\color{#d91a1a}-1.21\%$
test_to_module_speed[False] 1.3185ms 0.9078ms 1.1016 KOps/s 1.1056 KOps/s $\color{#d91a1a}-0.36\%$
test_tc_init 79.2640μs 34.9581μs 28.6057 KOps/s 29.7331 KOps/s $\color{#d91a1a}-3.79\%$
test_tc_init_nested 0.1133ms 66.5241μs 15.0321 KOps/s 14.7109 KOps/s $\color{#35bf28}+2.18\%$
test_tc_first_layer_tensor 5.3617μs 0.6603μs 1.5145 MOps/s 1.4940 MOps/s $\color{#35bf28}+1.37\%$
test_tc_first_layer_nontensor 30.2420μs 2.1831μs 458.0707 KOps/s 460.4723 KOps/s $\color{#d91a1a}-0.52\%$
test_tc_second_layer_tensor 7.5505μs 1.3466μs 742.6165 KOps/s 700.4684 KOps/s $\textbf{\color{#35bf28}+6.02\%}$
test_tc_second_layer_nontensor 30.6120μs 2.8808μs 347.1309 KOps/s 345.9700 KOps/s $\color{#35bf28}+0.34\%$
test_unbind 0.1925s 12.1511ms 82.2970 Ops/s 101.1991 Ops/s $\textbf{\color{#d91a1a}-18.68\%}$
test_full_like 0.7036ms 0.5742ms 1.7416 KOps/s 1.7413 KOps/s $\color{#35bf28}+0.02\%$
test_zeros_like 0.5240ms 0.1978ms 5.0551 KOps/s 5.0532 KOps/s $\color{#35bf28}+0.04\%$
test_ones_like 0.5286ms 0.1978ms 5.0543 KOps/s 5.0576 KOps/s $\color{#d91a1a}-0.06\%$
test_clone 0.6060ms 0.4150ms 2.4098 KOps/s 2.4124 KOps/s $\color{#d91a1a}-0.11\%$
test_squeeze 29.3320μs 9.8475μs 101.5484 KOps/s 101.4470 KOps/s $\color{#35bf28}+0.10\%$
test_unsqueeze 0.4490ms 74.5397μs 13.4157 KOps/s 13.6359 KOps/s $\color{#d91a1a}-1.61\%$
test_split 0.2533ms 0.1497ms 6.6799 KOps/s 6.5653 KOps/s $\color{#35bf28}+1.75\%$
test_permute 0.5621ms 0.1710ms 5.8480 KOps/s 5.7499 KOps/s $\color{#35bf28}+1.71\%$
test_stack 1.2546ms 0.8597ms 1.1632 KOps/s 1.1593 KOps/s $\color{#35bf28}+0.33\%$
test_cat 1.2608ms 1.2313ms 812.1299 Ops/s 811.6914 Ops/s $\color{#35bf28}+0.05\%$

@vmoens vmoens added the bug Something isn't working label Sep 17, 2024
@vmoens vmoens merged commit ed837ea into gh/vmoens/22/base Sep 17, 2024
34 of 39 checks passed
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: 9844f2db8f2c20928ec17e6f822b9729e82eced7
Pull Request resolved: #1000
@vmoens vmoens deleted the gh/vmoens/22/head branch September 17, 2024 21:11