Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Fix select in tensorclass #936

Merged
merged 2 commits into from
Jul 31, 2024
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Jul 31, 2024

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 31, 2024
@vmoens vmoens linked an issue Jul 31, 2024 that may be closed by this pull request
3 tasks
@vmoens vmoens added the bug Something isn't working label Jul 31, 2024
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 219. Improved: $\large\color{#35bf28}12$. Worsened: $\large\color{#d91a1a}7$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 52.0770μs 21.9470μs 45.5643 KOps/s 44.4902 KOps/s $\color{#35bf28}+2.41\%$
test_plain_set_stack_nested 56.1750μs 21.9587μs 45.5400 KOps/s 44.1335 KOps/s $\color{#35bf28}+3.19\%$
test_plain_set_nested_inplace 98.1170μs 23.8015μs 42.0141 KOps/s 40.8042 KOps/s $\color{#35bf28}+2.97\%$
test_plain_set_stack_nested_inplace 78.7240μs 23.7970μs 42.0222 KOps/s 40.9643 KOps/s $\color{#35bf28}+2.58\%$
test_items 29.6150μs 2.6374μs 379.1660 KOps/s 363.7008 KOps/s $\color{#35bf28}+4.25\%$
test_items_nested 0.5018ms 0.3337ms 2.9966 KOps/s 2.9950 KOps/s $\color{#35bf28}+0.06\%$
test_items_nested_locked 0.4243ms 0.3350ms 2.9855 KOps/s 2.9687 KOps/s $\color{#35bf28}+0.57\%$
test_items_nested_leaf 1.6059ms 83.4125μs 11.9886 KOps/s 12.0789 KOps/s $\color{#d91a1a}-0.75\%$
test_items_stack_nested 0.4804ms 0.3355ms 2.9807 KOps/s 2.9591 KOps/s $\color{#35bf28}+0.73\%$
test_items_stack_nested_leaf 0.1660ms 84.4181μs 11.8458 KOps/s 11.7074 KOps/s $\color{#35bf28}+1.18\%$
test_items_stack_nested_locked 0.4856ms 0.3363ms 2.9735 KOps/s 2.9888 KOps/s $\color{#d91a1a}-0.51\%$
test_keys 26.8410μs 3.9002μs 256.3990 KOps/s 253.2525 KOps/s $\color{#35bf28}+1.24\%$
test_keys_nested 0.2803ms 0.1461ms 6.8432 KOps/s 6.9978 KOps/s $\color{#d91a1a}-2.21\%$
test_keys_nested_locked 0.7157ms 0.1523ms 6.5678 KOps/s 6.7137 KOps/s $\color{#d91a1a}-2.17\%$
test_keys_nested_leaf 0.2812ms 0.1266ms 7.8974 KOps/s 8.1158 KOps/s $\color{#d91a1a}-2.69\%$
test_keys_stack_nested 0.2536ms 0.1461ms 6.8447 KOps/s 6.9732 KOps/s $\color{#d91a1a}-1.84\%$
test_keys_stack_nested_leaf 0.2654ms 0.1255ms 7.9666 KOps/s 8.0695 KOps/s $\color{#d91a1a}-1.28\%$
test_keys_stack_nested_locked 0.2942ms 0.1528ms 6.5450 KOps/s 6.7420 KOps/s $\color{#d91a1a}-2.92\%$
test_values 9.1130μs 1.1386μs 878.2471 KOps/s 855.6443 KOps/s $\color{#35bf28}+2.64\%$
test_values_nested 89.4970μs 50.3334μs 19.8675 KOps/s 19.9923 KOps/s $\color{#d91a1a}-0.62\%$
test_values_nested_locked 0.1067ms 50.1823μs 19.9273 KOps/s 19.8553 KOps/s $\color{#35bf28}+0.36\%$
test_values_nested_leaf 0.1186ms 45.0117μs 22.2164 KOps/s 22.2823 KOps/s $\color{#d91a1a}-0.30\%$
test_values_stack_nested 93.4740μs 51.2065μs 19.5288 KOps/s 19.5882 KOps/s $\color{#d91a1a}-0.30\%$
test_values_stack_nested_leaf 93.0030μs 45.1436μs 22.1515 KOps/s 22.2407 KOps/s $\color{#d91a1a}-0.40\%$
test_values_stack_nested_locked 0.1034ms 51.3128μs 19.4883 KOps/s 19.7985 KOps/s $\color{#d91a1a}-1.57\%$
test_membership 4.9450μs 0.7640μs 1.3089 MOps/s 1.3275 MOps/s $\color{#d91a1a}-1.40\%$
test_membership_nested 44.4330μs 2.6112μs 382.9719 KOps/s 371.2301 KOps/s $\color{#35bf28}+3.16\%$
test_membership_nested_leaf 30.0250μs 2.6472μs 377.7621 KOps/s 373.2733 KOps/s $\color{#35bf28}+1.20\%$
test_membership_stacked_nested 40.7660μs 2.6698μs 374.5591 KOps/s 378.3120 KOps/s $\color{#d91a1a}-0.99\%$
test_membership_stacked_nested_leaf 39.1630μs 2.6222μs 381.3585 KOps/s 371.8770 KOps/s $\color{#35bf28}+2.55\%$
test_membership_nested_last 47.5490μs 3.8450μs 260.0808 KOps/s 254.3660 KOps/s $\color{#35bf28}+2.25\%$
test_membership_nested_leaf_last 30.7780μs 3.8860μs 257.3335 KOps/s 251.9334 KOps/s $\color{#35bf28}+2.14\%$
test_membership_stacked_nested_last 34.1840μs 3.8442μs 260.1315 KOps/s 146.7899 KOps/s $\textbf{\color{#35bf28}+77.21\%}$
test_membership_stacked_nested_leaf_last 30.2260μs 3.8473μs 259.9225 KOps/s 146.9112 KOps/s $\textbf{\color{#35bf28}+76.92\%}$
test_nested_getleaf 50.8250μs 10.6105μs 94.2461 KOps/s 96.6404 KOps/s $\color{#d91a1a}-2.48\%$
test_nested_get 55.2740μs 10.1074μs 98.9372 KOps/s 102.4839 KOps/s $\color{#d91a1a}-3.46\%$
test_stacked_getleaf 39.5840μs 10.6024μs 94.3182 KOps/s 96.0577 KOps/s $\color{#d91a1a}-1.81\%$
test_stacked_get 55.4240μs 10.0384μs 99.6174 KOps/s 102.5996 KOps/s $\color{#d91a1a}-2.91\%$
test_nested_getitemleaf 54.6520μs 11.1021μs 90.0727 KOps/s 92.3723 KOps/s $\color{#d91a1a}-2.49\%$
test_nested_getitem 53.6400μs 10.3103μs 96.9902 KOps/s 101.3782 KOps/s $\color{#d91a1a}-4.33\%$
test_stacked_getitemleaf 57.6570μs 11.1377μs 89.7853 KOps/s 92.3567 KOps/s $\color{#d91a1a}-2.78\%$
test_stacked_getitem 56.8660μs 10.2006μs 98.0332 KOps/s 99.9158 KOps/s $\color{#d91a1a}-1.88\%$
test_lock_nested 7.5928ms 0.5031ms 1.9876 KOps/s 1.9792 KOps/s $\color{#35bf28}+0.42\%$
test_lock_stack_nested 0.9355ms 0.4616ms 2.1665 KOps/s 2.1482 KOps/s $\color{#35bf28}+0.85\%$
test_unlock_nested 0.8792ms 0.4172ms 2.3970 KOps/s 2.4001 KOps/s $\color{#d91a1a}-0.13\%$
test_unlock_stack_nested 0.6952ms 0.3766ms 2.6555 KOps/s 2.6328 KOps/s $\color{#35bf28}+0.86\%$
test_flatten_speed 0.5604ms 0.1041ms 9.6056 KOps/s 9.5520 KOps/s $\color{#35bf28}+0.56\%$
test_unflatten_speed 0.5109ms 0.4334ms 2.3074 KOps/s 2.3235 KOps/s $\color{#d91a1a}-0.69\%$
test_common_ops 4.9600ms 1.0782ms 927.4601 Ops/s 899.5678 Ops/s $\color{#35bf28}+3.10\%$
test_creation 15.7400μs 2.0577μs 485.9883 KOps/s 471.1176 KOps/s $\color{#35bf28}+3.16\%$
test_creation_empty 60.7530μs 18.0094μs 55.5267 KOps/s 52.2279 KOps/s $\textbf{\color{#35bf28}+6.32\%}$
test_creation_nested_1 52.8180μs 21.1579μs 47.2636 KOps/s 44.5479 KOps/s $\textbf{\color{#35bf28}+6.10\%}$
test_creation_nested_2 67.6360μs 24.8934μs 40.1712 KOps/s 38.4087 KOps/s $\color{#35bf28}+4.59\%$
test_clone 0.1031ms 17.1212μs 58.4071 KOps/s 60.5747 KOps/s $\color{#d91a1a}-3.58\%$
test_getitem[int] 1.0941ms 16.8475μs 59.3561 KOps/s 59.6849 KOps/s $\color{#d91a1a}-0.55\%$
test_getitem[slice_int] 0.1733ms 32.6220μs 30.6542 KOps/s 31.3711 KOps/s $\color{#d91a1a}-2.29\%$
test_getitem[range] 0.1789ms 57.7364μs 17.3201 KOps/s 17.4206 KOps/s $\color{#d91a1a}-0.58\%$
test_getitem[tuple] 0.1409ms 26.9514μs 37.1038 KOps/s 38.4088 KOps/s $\color{#d91a1a}-3.40\%$
test_getitem[list] 0.2900ms 52.9845μs 18.8734 KOps/s 19.1434 KOps/s $\color{#d91a1a}-1.41\%$
test_setitem_dim[int] 71.9940μs 40.3816μs 24.7638 KOps/s 23.3980 KOps/s $\textbf{\color{#35bf28}+5.84\%}$
test_setitem_dim[slice_int] 0.1329ms 71.9975μs 13.8894 KOps/s 13.7743 KOps/s $\color{#35bf28}+0.84\%$
test_setitem_dim[range] 0.1483ms 93.2305μs 10.7261 KOps/s 10.6661 KOps/s $\color{#35bf28}+0.56\%$
test_setitem_dim[tuple] 0.1068ms 57.7218μs 17.3245 KOps/s 16.5685 KOps/s $\color{#35bf28}+4.56\%$
test_setitem 85.6090μs 29.8013μs 33.5556 KOps/s 33.0867 KOps/s $\color{#35bf28}+1.42\%$
test_set 96.2990μs 28.8985μs 34.6039 KOps/s 33.7125 KOps/s $\color{#35bf28}+2.64\%$
test_set_shared 1.2603ms 0.2147ms 4.6567 KOps/s 4.6826 KOps/s $\color{#d91a1a}-0.55\%$
test_update 0.1414ms 35.3911μs 28.2557 KOps/s 26.7912 KOps/s $\textbf{\color{#35bf28}+5.47\%}$
test_update_nested 0.1649ms 45.8014μs 21.8334 KOps/s 21.2995 KOps/s $\color{#35bf28}+2.51\%$
test_update__nested 0.1038ms 33.9428μs 29.4613 KOps/s 29.6550 KOps/s $\color{#d91a1a}-0.65\%$
test_set_nested 0.1155ms 31.3770μs 31.8704 KOps/s 31.7580 KOps/s $\color{#35bf28}+0.35\%$
test_set_nested_new 0.1438ms 36.1729μs 27.6450 KOps/s 27.3931 KOps/s $\color{#35bf28}+0.92\%$
test_select 0.2044ms 52.7849μs 18.9448 KOps/s 18.9482 KOps/s $\color{#d91a1a}-0.02\%$
test_select_nested 0.1142ms 58.9906μs 16.9519 KOps/s 17.0946 KOps/s $\color{#d91a1a}-0.84\%$
test_exclude_nested 0.1445ms 76.9004μs 13.0038 KOps/s 13.0698 KOps/s $\color{#d91a1a}-0.50\%$
test_empty[True] 0.5447ms 0.3191ms 3.1335 KOps/s 3.1195 KOps/s $\color{#35bf28}+0.45\%$
test_empty[False] 10.5572μs 1.1621μs 860.5225 KOps/s 864.1645 KOps/s $\color{#d91a1a}-0.42\%$
test_unbind_speed 0.5000ms 0.3059ms 3.2686 KOps/s 3.1427 KOps/s $\color{#35bf28}+4.01\%$
test_unbind_speed_stack0 0.4811ms 0.3007ms 3.3255 KOps/s 3.2990 KOps/s $\color{#35bf28}+0.80\%$
test_unbind_speed_stack1 83.5105ms 0.7812ms 1.2801 KOps/s 1.3920 KOps/s $\textbf{\color{#d91a1a}-8.04\%}$
test_split 2.2463ms 2.0087ms 497.8464 Ops/s 469.1801 Ops/s $\textbf{\color{#35bf28}+6.11\%}$
test_chunk 86.9611ms 2.3682ms 422.2582 Ops/s 463.7531 Ops/s $\textbf{\color{#d91a1a}-8.95\%}$
test_creation[device0] 0.2291ms 0.1175ms 8.5101 KOps/s 8.4264 KOps/s $\color{#35bf28}+0.99\%$
test_creation_from_tensor 4.8958ms 0.1216ms 8.2252 KOps/s 8.4146 KOps/s $\color{#d91a1a}-2.25\%$
test_add_one[memmap_tensor0] 0.1517ms 7.7904μs 128.3624 KOps/s 125.8023 KOps/s $\color{#35bf28}+2.04\%$
test_contiguous[memmap_tensor0] 30.1460μs 1.9828μs 504.3427 KOps/s 501.7016 KOps/s $\color{#35bf28}+0.53\%$
test_stack[memmap_tensor0] 53.7800μs 5.7753μs 173.1511 KOps/s 172.6876 KOps/s $\color{#35bf28}+0.27\%$
test_memmaptd_index 1.0164ms 0.4098ms 2.4405 KOps/s 2.3810 KOps/s $\color{#35bf28}+2.50\%$
test_memmaptd_index_astensor 0.7880ms 0.4880ms 2.0491 KOps/s 1.9995 KOps/s $\color{#35bf28}+2.48\%$
test_memmaptd_index_op 1.8972ms 1.0328ms 968.2363 Ops/s 923.0122 Ops/s $\color{#35bf28}+4.90\%$
test_serialize_model 0.1218s 0.1174s 8.5200 Ops/s 7.4970 Ops/s $\textbf{\color{#35bf28}+13.65\%}$
test_serialize_model_pickle 0.4431s 0.4002s 2.4986 Ops/s 2.4895 Ops/s $\color{#35bf28}+0.36\%$
test_serialize_weights 0.1967s 0.1285s 7.7823 Ops/s 8.5795 Ops/s $\textbf{\color{#d91a1a}-9.29\%}$
test_serialize_weights_returnearly 0.1801s 0.1612s 6.2024 Ops/s 6.2320 Ops/s $\color{#d91a1a}-0.47\%$
test_serialize_weights_pickle 1.2007s 0.7112s 1.4062 Ops/s 2.5264 Ops/s $\textbf{\color{#d91a1a}-44.34\%}$
test_serialize_weights_filesystem 0.1604s 0.1421s 7.0349 Ops/s 6.9258 Ops/s $\color{#35bf28}+1.58\%$
test_serialize_model_filesystem 0.2154s 0.1552s 6.4444 Ops/s 6.5181 Ops/s $\color{#d91a1a}-1.13\%$
test_reshape_pytree 87.1730μs 39.5317μs 25.2962 KOps/s 24.5560 KOps/s $\color{#35bf28}+3.01\%$
test_reshape_td 94.0850μs 46.6770μs 21.4238 KOps/s 20.8300 KOps/s $\color{#35bf28}+2.85\%$
test_view_pytree 89.3560μs 39.4436μs 25.3526 KOps/s 24.9554 KOps/s $\color{#35bf28}+1.59\%$
test_view_td 0.1059ms 53.6221μs 18.6490 KOps/s 18.7984 KOps/s $\color{#d91a1a}-0.79\%$
test_unbind_pytree 81.0210μs 37.4985μs 26.6677 KOps/s 26.9451 KOps/s $\color{#d91a1a}-1.03\%$
test_unbind_td 0.3841ms 46.4682μs 21.5201 KOps/s 21.3411 KOps/s $\color{#35bf28}+0.84\%$
test_split_pytree 85.3690μs 40.3958μs 24.7550 KOps/s 24.8594 KOps/s $\color{#d91a1a}-0.42\%$
test_split_td 0.4822ms 58.9614μs 16.9603 KOps/s 16.9095 KOps/s $\color{#35bf28}+0.30\%$
test_add_pytree 99.1950μs 47.0014μs 21.2760 KOps/s 21.2116 KOps/s $\color{#35bf28}+0.30\%$
test_add_td 0.2707ms 82.7190μs 12.0891 KOps/s 11.6020 KOps/s $\color{#35bf28}+4.20\%$
test_compile_add_one_nested[tensordict-compile] 0.1879ms 52.7556μs 18.9553 KOps/s 18.3173 KOps/s $\color{#35bf28}+3.48\%$
test_compile_add_one_nested[tensordict-eager] 0.3768ms 0.1887ms 5.2986 KOps/s 5.2590 KOps/s $\color{#35bf28}+0.75\%$
test_compile_add_one_nested[pytree-compile] 0.1323ms 54.2072μs 18.4477 KOps/s 18.0467 KOps/s $\color{#35bf28}+2.22\%$
test_compile_add_one_nested[pytree-eager] 0.3571ms 0.1486ms 6.7302 KOps/s 6.8297 KOps/s $\color{#d91a1a}-1.46\%$
test_compile_copy_nested[tensordict-compile] 54.4010μs 20.7055μs 48.2963 KOps/s 49.0944 KOps/s $\color{#d91a1a}-1.63\%$
test_compile_copy_nested[tensordict-eager] 0.1298ms 64.4300μs 15.5207 KOps/s 15.3481 KOps/s $\color{#35bf28}+1.12\%$
test_compile_copy_nested[pytree-compile] 0.1567ms 79.4685μs 12.5836 KOps/s 12.5298 KOps/s $\color{#35bf28}+0.43\%$
test_compile_copy_nested[pytree-eager] 0.1375ms 72.8390μs 13.7289 KOps/s 13.8542 KOps/s $\color{#d91a1a}-0.90\%$
test_compile_add_one_flat[tensordict-compile] 0.2678ms 0.1724ms 5.8004 KOps/s 5.6944 KOps/s $\color{#35bf28}+1.86\%$
test_compile_add_one_flat[tensordict-eager] 0.4178ms 0.1968ms 5.0809 KOps/s 5.1975 KOps/s $\color{#d91a1a}-2.24\%$
test_compile_add_one_flat[tensorclass-compile] 90.7990μs 38.2330μs 26.1554 KOps/s 25.8701 KOps/s $\color{#35bf28}+1.10\%$
test_compile_add_one_flat[tensorclass-eager] 1.0348ms 69.7114μs 14.3449 KOps/s 14.0493 KOps/s $\color{#35bf28}+2.10\%$
test_compile_add_one_flat[pytree-compile] 0.2730ms 0.1700ms 5.8808 KOps/s 5.8242 KOps/s $\color{#35bf28}+0.97\%$
test_compile_add_one_flat[pytree-eager] 0.3916ms 0.2974ms 3.3630 KOps/s 3.3480 KOps/s $\color{#35bf28}+0.45\%$
test_compile_add_self_flat[tensordict-eager] 0.4205ms 0.2079ms 4.8111 KOps/s 4.8692 KOps/s $\color{#d91a1a}-1.19\%$
test_compile_add_self_flat[tensordict-compile] 0.3591ms 0.1762ms 5.6751 KOps/s 5.6592 KOps/s $\color{#35bf28}+0.28\%$
test_compile_add_self_flat[tensorclass-eager] 0.7622ms 62.7760μs 15.9296 KOps/s 15.6483 KOps/s $\color{#35bf28}+1.80\%$
test_compile_add_self_flat[tensorclass-compile] 82.7940μs 39.5446μs 25.2879 KOps/s 24.9795 KOps/s $\color{#35bf28}+1.23\%$
test_compile_add_self_flat[pytree-eager] 0.4608ms 0.2441ms 4.0963 KOps/s 4.0742 KOps/s $\color{#35bf28}+0.54\%$
test_compile_add_self_flat[pytree-compile] 0.2756ms 0.1710ms 5.8487 KOps/s 5.7789 KOps/s $\color{#35bf28}+1.21\%$
test_compile_copy_flat[tensordict-compile] 0.1983ms 0.1077ms 9.2841 KOps/s 9.2368 KOps/s $\color{#35bf28}+0.51\%$
test_compile_copy_flat[tensordict-eager] 0.1101ms 56.6783μs 17.6434 KOps/s 17.5806 KOps/s $\color{#35bf28}+0.36\%$
test_compile_copy_flat[pytree-compile] 0.1574ms 80.8622μs 12.3667 KOps/s 12.4302 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_copy_flat[pytree-eager] 0.1385ms 71.0541μs 14.0738 KOps/s 13.7370 KOps/s $\color{#35bf28}+2.45\%$
test_compile_assign_and_add[tensordict-compile] 0.3777ms 0.1932ms 5.1750 KOps/s 5.2632 KOps/s $\color{#d91a1a}-1.68\%$
test_compile_assign_and_add[tensordict-eager] 1.9078ms 1.6498ms 606.1255 Ops/s 613.0225 Ops/s $\color{#d91a1a}-1.13\%$
test_compile_assign_and_add[pytree-compile] 0.2576ms 0.1887ms 5.3007 KOps/s 5.2829 KOps/s $\color{#35bf28}+0.34\%$
test_compile_assign_and_add[pytree-eager] 1.8331ms 1.1121ms 899.1773 Ops/s 909.4193 Ops/s $\color{#d91a1a}-1.13\%$
test_compile_assign_and_add_stack[compile] 0.8267ms 0.4216ms 2.3718 KOps/s 2.4009 KOps/s $\color{#d91a1a}-1.21\%$
test_compile_assign_and_add_stack[eager] 4.0831ms 3.7513ms 266.5773 Ops/s 255.6329 Ops/s $\color{#35bf28}+4.28\%$
test_compile_indexing[tensor-tensordict-compile] 99.0750μs 32.3318μs 30.9293 KOps/s 30.5503 KOps/s $\color{#35bf28}+1.24\%$
test_compile_indexing[tensor-tensordict-eager] 0.7690ms 49.1497μs 20.3460 KOps/s 20.2534 KOps/s $\color{#35bf28}+0.46\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2547ms 27.9760μs 35.7449 KOps/s 34.5138 KOps/s $\color{#35bf28}+3.57\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2647ms 31.3632μs 31.8845 KOps/s 31.8910 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_indexing[tensor-pytree-compile] 0.1189ms 28.2223μs 35.4330 KOps/s 34.1811 KOps/s $\color{#35bf28}+3.66\%$
test_compile_indexing[tensor-pytree-eager] 0.1015ms 30.9619μs 32.2978 KOps/s 31.8775 KOps/s $\color{#35bf28}+1.32\%$
test_compile_indexing[slice-tensordict-compile] 0.1599ms 70.8865μs 14.1071 KOps/s 13.8297 KOps/s $\color{#35bf28}+2.01\%$
test_compile_indexing[slice-tensordict-eager] 0.3365ms 28.1483μs 35.5261 KOps/s 36.0949 KOps/s $\color{#d91a1a}-1.58\%$
test_compile_indexing[slice-tensorclass-compile] 0.1494ms 66.0699μs 15.1355 KOps/s 14.8397 KOps/s $\color{#35bf28}+1.99\%$
test_compile_indexing[slice-tensorclass-eager] 0.1038ms 25.5757μs 39.0996 KOps/s 41.5236 KOps/s $\textbf{\color{#d91a1a}-5.84\%}$
test_compile_indexing[slice-pytree-compile] 0.1301ms 66.3608μs 15.0691 KOps/s 14.8164 KOps/s $\color{#35bf28}+1.71\%$
test_compile_indexing[slice-pytree-eager] 68.7990μs 24.9557μs 40.0709 KOps/s 40.6654 KOps/s $\color{#d91a1a}-1.46\%$
test_compile_indexing[int-tensordict-compile] 0.1763ms 71.7145μs 13.9442 KOps/s 13.7050 KOps/s $\color{#35bf28}+1.75\%$
test_compile_indexing[int-tensordict-eager] 1.0644ms 28.5164μs 35.0675 KOps/s 35.6623 KOps/s $\color{#d91a1a}-1.67\%$
test_compile_indexing[int-tensorclass-compile] 0.1997ms 66.8737μs 14.9536 KOps/s 14.8128 KOps/s $\color{#35bf28}+0.95\%$
test_compile_indexing[int-tensorclass-eager] 81.0510μs 25.3042μs 39.5191 KOps/s 41.4795 KOps/s $\color{#d91a1a}-4.73\%$
test_compile_indexing[int-pytree-compile] 0.1574ms 66.5733μs 15.0210 KOps/s 14.9483 KOps/s $\color{#35bf28}+0.49\%$
test_compile_indexing[int-pytree-eager] 0.4620ms 25.2161μs 39.6571 KOps/s 41.3872 KOps/s $\color{#d91a1a}-4.18\%$
test_mod_add[eager] 64.6710μs 25.5448μs 39.1469 KOps/s 39.3116 KOps/s $\color{#d91a1a}-0.42\%$
test_mod_add[compile] 78.7270μs 36.9414μs 27.0699 KOps/s 26.3771 KOps/s $\color{#35bf28}+2.63\%$
test_mod_add[compile-overhead] 84.1970μs 35.7988μs 27.9339 KOps/s 26.5504 KOps/s $\textbf{\color{#35bf28}+5.21\%}$
test_mod_wrap[eager] 0.3441ms 0.2083ms 4.8015 KOps/s 4.8590 KOps/s $\color{#d91a1a}-1.18\%$
test_mod_wrap[compile] 1.4388ms 0.2302ms 4.3447 KOps/s 4.2991 KOps/s $\color{#35bf28}+1.06\%$
test_mod_wrap[compile-overhead] 0.4550ms 0.2266ms 4.4129 KOps/s 4.3802 KOps/s $\color{#35bf28}+0.75\%$
test_mod_wrap_and_backward[eager] 14.5926ms 11.9062ms 83.9896 Ops/s 91.8072 Ops/s $\textbf{\color{#d91a1a}-8.52\%}$
test_mod_wrap_and_backward[compile] 15.1735ms 11.9139ms 83.9359 Ops/s 88.9956 Ops/s $\textbf{\color{#d91a1a}-5.69\%}$
test_mod_wrap_and_backward[compile-overhead] 13.3237ms 11.0477ms 90.5167 Ops/s 90.8803 Ops/s $\color{#d91a1a}-0.40\%$
test_seq_add[eager] 0.1586ms 88.2155μs 11.3359 KOps/s 11.3219 KOps/s $\color{#35bf28}+0.12\%$
test_seq_add[compile] 0.1933ms 58.9782μs 16.9554 KOps/s 15.9865 KOps/s $\textbf{\color{#35bf28}+6.06\%}$
test_seq_add[compile-overhead] 0.1509ms 58.1977μs 17.1828 KOps/s 16.6293 KOps/s $\color{#35bf28}+3.33\%$
test_seq_wrap[eager] 0.5567ms 0.3685ms 2.7140 KOps/s 2.6111 KOps/s $\color{#35bf28}+3.94\%$
test_seq_wrap[compile] 0.3545ms 0.2595ms 3.8536 KOps/s 3.7845 KOps/s $\color{#35bf28}+1.83\%$
test_seq_wrap[compile-overhead] 0.3660ms 0.2591ms 3.8593 KOps/s 3.7923 KOps/s $\color{#35bf28}+1.77\%$
test_func_call_runtime[False-eager] 0.9355ms 0.5276ms 1.8952 KOps/s 1.9242 KOps/s $\color{#d91a1a}-1.51\%$
test_func_call_runtime[False-compile] 0.6613ms 0.4899ms 2.0412 KOps/s 1.9807 KOps/s $\color{#35bf28}+3.05\%$
test_func_call_runtime[False-compile-overhead] 0.8840ms 0.4898ms 2.0416 KOps/s 1.9936 KOps/s $\color{#35bf28}+2.41\%$
test_func_call_runtime[True-eager] 1.5483ms 0.7476ms 1.3377 KOps/s 1.3537 KOps/s $\color{#d91a1a}-1.18\%$
test_func_call_runtime[True-compile] 0.6687ms 0.5025ms 1.9900 KOps/s 1.9331 KOps/s $\color{#35bf28}+2.95\%$
test_func_call_runtime[True-compile-overhead] 1.0320ms 0.5059ms 1.9767 KOps/s 1.9256 KOps/s $\color{#35bf28}+2.65\%$
test_func_call_cm_runtime[False-eager] 0.8333ms 0.5194ms 1.9252 KOps/s 1.9151 KOps/s $\color{#35bf28}+0.53\%$
test_func_call_cm_runtime[False-compile] 0.6667ms 0.4888ms 2.0460 KOps/s 1.9725 KOps/s $\color{#35bf28}+3.73\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6524ms 0.4922ms 2.0319 KOps/s 1.9693 KOps/s $\color{#35bf28}+3.18\%$
test_func_call_cm_runtime[True-eager] 1.4105ms 0.8747ms 1.1432 KOps/s 1.1391 KOps/s $\color{#35bf28}+0.36\%$
test_func_call_cm_runtime[True-compile] 1.1563ms 0.8338ms 1.1993 KOps/s 1.1966 KOps/s $\color{#35bf28}+0.23\%$
test_func_call_cm_runtime[True-compile-overhead] 0.9915ms 0.8332ms 1.2003 KOps/s 1.1893 KOps/s $\color{#35bf28}+0.92\%$
test_distributed 0.3580ms 0.1313ms 7.6168 KOps/s 7.4996 KOps/s $\color{#35bf28}+1.56\%$
test_tdmodule 31.2380μs 17.2556μs 57.9522 KOps/s 57.2297 KOps/s $\color{#35bf28}+1.26\%$
test_tdmodule_dispatch 60.5130μs 36.5755μs 27.3407 KOps/s 26.9453 KOps/s $\color{#35bf28}+1.47\%$
test_tdseq 37.5500μs 18.5971μs 53.7719 KOps/s 50.5934 KOps/s $\textbf{\color{#35bf28}+6.28\%}$
test_tdseq_dispatch 69.8700μs 39.5153μs 25.3067 KOps/s 24.4101 KOps/s $\color{#35bf28}+3.67\%$
test_instantiation_functorch 1.8927ms 1.6327ms 612.4757 Ops/s 611.4943 Ops/s $\color{#35bf28}+0.16\%$
test_instantiation_td 1.7956ms 1.1844ms 844.2859 Ops/s 843.8227 Ops/s $\color{#35bf28}+0.05\%$
test_exec_functorch 0.3189ms 0.1806ms 5.5380 KOps/s 5.5130 KOps/s $\color{#35bf28}+0.45\%$
test_exec_functional_call 0.3104ms 0.1686ms 5.9321 KOps/s 5.7319 KOps/s $\color{#35bf28}+3.49\%$
test_exec_td 0.2541ms 0.1695ms 5.8996 KOps/s 5.8590 KOps/s $\color{#35bf28}+0.69\%$
test_exec_td_decorator 0.4789ms 0.2204ms 4.5368 KOps/s 4.4273 KOps/s $\color{#35bf28}+2.47\%$
test_vmap_mlp_speed[True-True] 0.7942ms 0.5679ms 1.7608 KOps/s 1.7356 KOps/s $\color{#35bf28}+1.46\%$
test_vmap_mlp_speed[True-False] 0.8809ms 0.5664ms 1.7656 KOps/s 1.7429 KOps/s $\color{#35bf28}+1.30\%$
test_vmap_mlp_speed[False-True] 0.6660ms 0.4685ms 2.1346 KOps/s 2.1170 KOps/s $\color{#35bf28}+0.83\%$
test_vmap_mlp_speed[False-False] 0.8409ms 0.4723ms 2.1173 KOps/s 2.1255 KOps/s $\color{#d91a1a}-0.39\%$
test_vmap_mlp_speed_decorator[True-True] 1.4417ms 0.6183ms 1.6175 KOps/s 1.5895 KOps/s $\color{#35bf28}+1.76\%$
test_vmap_mlp_speed_decorator[True-False] 0.9359ms 0.6239ms 1.6027 KOps/s 1.5474 KOps/s $\color{#35bf28}+3.57\%$
test_vmap_mlp_speed_decorator[False-True] 0.7058ms 0.5146ms 1.9433 KOps/s 1.9155 KOps/s $\color{#35bf28}+1.45\%$
test_vmap_mlp_speed_decorator[False-False] 0.8245ms 0.5161ms 1.9375 KOps/s 1.9370 KOps/s $\color{#35bf28}+0.03\%$
test_to_module_speed[True] 1.9938ms 1.3230ms 755.8719 Ops/s 760.2576 Ops/s $\color{#d91a1a}-0.58\%$
test_to_module_speed[False] 2.1656ms 1.3168ms 759.4327 Ops/s 770.3819 Ops/s $\color{#d91a1a}-1.42\%$
test_tc_init 93.2540μs 45.5445μs 21.9566 KOps/s 22.5715 KOps/s $\color{#d91a1a}-2.72\%$
test_tc_init_nested 0.1624ms 92.1920μs 10.8469 KOps/s 11.2361 KOps/s $\color{#d91a1a}-3.46\%$
test_tc_first_layer_tensor 37.7800μs 1.4615μs 684.2091 KOps/s 706.8873 KOps/s $\color{#d91a1a}-3.21\%$
test_tc_first_layer_nontensor 19.4870μs 4.2976μs 232.6859 KOps/s 236.2360 KOps/s $\color{#d91a1a}-1.50\%$
test_tc_second_layer_tensor 42.9000μs 2.7395μs 365.0294 KOps/s 372.9140 KOps/s $\color{#d91a1a}-2.11\%$
test_tc_second_layer_nontensor 28.9950μs 5.5368μs 180.6095 KOps/s 183.1077 KOps/s $\color{#d91a1a}-1.36\%$
test_unbind 0.4410s 13.6270ms 73.3840 Ops/s 73.7560 Ops/s $\color{#d91a1a}-0.50\%$
test_full_like 9.0091ms 7.1748ms 139.3765 Ops/s 139.3740 Ops/s $+0.00\%$
test_zeros_like 16.3914ms 6.5801ms 151.9734 Ops/s 140.1306 Ops/s $\textbf{\color{#35bf28}+8.45\%}$
test_ones_like 17.1997ms 7.6742ms 130.3072 Ops/s 130.8723 Ops/s $\color{#d91a1a}-0.43\%$
test_clone 13.5129ms 9.1482ms 109.3112 Ops/s 105.4852 Ops/s $\color{#35bf28}+3.63\%$
test_squeeze 68.3570μs 13.1840μs 75.8494 KOps/s 75.7340 KOps/s $\color{#35bf28}+0.15\%$
test_unsqueeze 0.2115ms 96.5907μs 10.3530 KOps/s 10.6330 KOps/s $\color{#d91a1a}-2.63\%$
test_split 0.4476ms 0.2017ms 4.9576 KOps/s 4.9804 KOps/s $\color{#d91a1a}-0.46\%$
test_permute 0.4515ms 0.2249ms 4.4454 KOps/s 4.5871 KOps/s $\color{#d91a1a}-3.09\%$
test_stack 27.7125ms 24.5815ms 40.6811 Ops/s 39.9907 Ops/s $\color{#35bf28}+1.73\%$
test_cat 32.1648ms 24.6465ms 40.5736 Ops/s 40.2926 Ops/s $\color{#35bf28}+0.70\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 225. Improved: $\large\color{#35bf28}20$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1499ms 17.4741μs 57.2277 KOps/s 55.9820 KOps/s $\color{#35bf28}+2.23\%$
test_plain_set_stack_nested 37.5500μs 17.6734μs 56.5823 KOps/s 56.0095 KOps/s $\color{#35bf28}+1.02\%$
test_plain_set_nested_inplace 34.1500μs 18.8217μs 53.1302 KOps/s 52.8210 KOps/s $\color{#35bf28}+0.59\%$
test_plain_set_stack_nested_inplace 43.9210μs 18.8053μs 53.1766 KOps/s 52.7128 KOps/s $\color{#35bf28}+0.88\%$
test_items 18.7000μs 4.5977μs 217.5019 KOps/s 214.5303 KOps/s $\color{#35bf28}+1.39\%$
test_items_nested 0.4284ms 0.3644ms 2.7443 KOps/s 2.7531 KOps/s $\color{#d91a1a}-0.32\%$
test_items_nested_locked 0.4103ms 0.3660ms 2.7324 KOps/s 2.7176 KOps/s $\color{#35bf28}+0.54\%$
test_items_nested_leaf 0.1067ms 84.0407μs 11.8990 KOps/s 11.8223 KOps/s $\color{#35bf28}+0.65\%$
test_items_stack_nested 0.4174ms 0.3693ms 2.7080 KOps/s 2.7191 KOps/s $\color{#d91a1a}-0.41\%$
test_items_stack_nested_leaf 0.1097ms 84.9758μs 11.7681 KOps/s 11.7580 KOps/s $\color{#35bf28}+0.09\%$
test_items_stack_nested_locked 0.4112ms 0.3669ms 2.7256 KOps/s 2.7090 KOps/s $\color{#35bf28}+0.61\%$
test_keys 20.2200μs 4.3610μs 229.3027 KOps/s 228.8720 KOps/s $\color{#35bf28}+0.19\%$
test_keys_nested 93.9820μs 67.0612μs 14.9118 KOps/s 14.7033 KOps/s $\color{#35bf28}+1.42\%$
test_keys_nested_locked 0.6211ms 73.2586μs 13.6503 KOps/s 13.6973 KOps/s $\color{#d91a1a}-0.34\%$
test_keys_nested_leaf 81.2920μs 57.3078μs 17.4496 KOps/s 17.1329 KOps/s $\color{#35bf28}+1.85\%$
test_keys_stack_nested 87.0620μs 65.9474μs 15.1636 KOps/s 14.7671 KOps/s $\color{#35bf28}+2.69\%$
test_keys_stack_nested_leaf 72.5210μs 57.2697μs 17.4612 KOps/s 16.9600 KOps/s $\color{#35bf28}+2.96\%$
test_keys_stack_nested_locked 89.0820μs 72.2676μs 13.8375 KOps/s 13.7170 KOps/s $\color{#35bf28}+0.88\%$
test_values 6.5267μs 1.7649μs 566.6190 KOps/s 567.7419 KOps/s $\color{#d91a1a}-0.20\%$
test_values_nested 54.8510μs 33.8972μs 29.5010 KOps/s 29.3252 KOps/s $\color{#35bf28}+0.60\%$
test_values_nested_locked 47.8210μs 36.0146μs 27.7665 KOps/s 27.9277 KOps/s $\color{#d91a1a}-0.58\%$
test_values_nested_leaf 44.6910μs 30.0549μs 33.2724 KOps/s 33.0407 KOps/s $\color{#35bf28}+0.70\%$
test_values_stack_nested 54.1810μs 34.6009μs 28.9010 KOps/s 28.8637 KOps/s $\color{#35bf28}+0.13\%$
test_values_stack_nested_leaf 49.8410μs 30.6392μs 32.6379 KOps/s 32.5197 KOps/s $\color{#35bf28}+0.36\%$
test_values_stack_nested_locked 60.3410μs 36.8923μs 27.1059 KOps/s 27.3168 KOps/s $\color{#d91a1a}-0.77\%$
test_membership 1.2325μs 0.5396μs 1.8531 MOps/s 1.8075 MOps/s $\color{#35bf28}+2.52\%$
test_membership_nested 17.1700μs 2.0423μs 489.6516 KOps/s 519.0749 KOps/s $\textbf{\color{#d91a1a}-5.67\%}$
test_membership_nested_leaf 9.9050μs 1.9729μs 506.8754 KOps/s 510.5553 KOps/s $\color{#d91a1a}-0.72\%$
test_membership_stacked_nested 22.3710μs 2.0479μs 488.3074 KOps/s 493.5483 KOps/s $\color{#d91a1a}-1.06\%$
test_membership_stacked_nested_leaf 17.8610μs 2.0455μs 488.8725 KOps/s 499.6166 KOps/s $\color{#d91a1a}-2.15\%$
test_membership_nested_last 25.9700μs 3.0037μs 332.9262 KOps/s 344.7014 KOps/s $\color{#d91a1a}-3.42\%$
test_membership_nested_leaf_last 20.9900μs 2.9779μs 335.8123 KOps/s 344.2658 KOps/s $\color{#d91a1a}-2.46\%$
test_membership_stacked_nested_last 23.0310μs 3.0282μs 330.2308 KOps/s 233.2529 KOps/s $\textbf{\color{#35bf28}+41.58\%}$
test_membership_stacked_nested_leaf_last 21.8800μs 2.9870μs 334.7890 KOps/s 230.9982 KOps/s $\textbf{\color{#35bf28}+44.93\%}$
test_nested_getleaf 21.1710μs 7.9443μs 125.8765 KOps/s 125.5772 KOps/s $\color{#35bf28}+0.24\%$
test_nested_get 23.2600μs 7.4874μs 133.5572 KOps/s 134.0722 KOps/s $\color{#d91a1a}-0.38\%$
test_stacked_getleaf 25.6000μs 7.9892μs 125.1695 KOps/s 125.6395 KOps/s $\color{#d91a1a}-0.37\%$
test_stacked_get 24.5010μs 7.5290μs 132.8192 KOps/s 133.8948 KOps/s $\color{#d91a1a}-0.80\%$
test_nested_getitemleaf 30.0910μs 8.1535μs 122.6462 KOps/s 123.0989 KOps/s $\color{#d91a1a}-0.37\%$
test_nested_getitem 23.1210μs 7.6161μs 131.3006 KOps/s 131.0138 KOps/s $\color{#35bf28}+0.22\%$
test_stacked_getitemleaf 35.2010μs 8.1139μs 123.2452 KOps/s 122.3016 KOps/s $\color{#35bf28}+0.77\%$
test_stacked_getitem 24.5010μs 7.6355μs 130.9668 KOps/s 131.0389 KOps/s $\color{#d91a1a}-0.06\%$
test_lock_nested 10.1515ms 0.4856ms 2.0595 KOps/s 2.1122 KOps/s $\color{#d91a1a}-2.50\%$
test_lock_stack_nested 0.4898ms 0.4387ms 2.2792 KOps/s 2.3222 KOps/s $\color{#d91a1a}-1.85\%$
test_unlock_nested 0.8753ms 0.3942ms 2.5367 KOps/s 2.5607 KOps/s $\color{#d91a1a}-0.94\%$
test_unlock_stack_nested 0.3868ms 0.3508ms 2.8505 KOps/s 2.8706 KOps/s $\color{#d91a1a}-0.70\%$
test_flatten_speed 0.4712ms 0.1040ms 9.6183 KOps/s 9.4698 KOps/s $\color{#35bf28}+1.57\%$
test_unflatten_speed 0.3248ms 0.2830ms 3.5331 KOps/s 3.4415 KOps/s $\color{#35bf28}+2.66\%$
test_common_ops 1.5469ms 1.3219ms 756.4849 Ops/s 751.1764 Ops/s $\color{#35bf28}+0.71\%$
test_creation 17.2400μs 1.6445μs 608.0979 KOps/s 608.4865 KOps/s $\color{#d91a1a}-0.06\%$
test_creation_empty 34.7610μs 18.8816μs 52.9616 KOps/s 53.4345 KOps/s $\color{#d91a1a}-0.88\%$
test_creation_nested_1 37.9210μs 20.4871μs 48.8112 KOps/s 47.3388 KOps/s $\color{#35bf28}+3.11\%$
test_creation_nested_2 42.8610μs 23.0359μs 43.4106 KOps/s 43.8259 KOps/s $\color{#d91a1a}-0.95\%$
test_clone 47.1910μs 29.0995μs 34.3649 KOps/s 34.6372 KOps/s $\color{#d91a1a}-0.79\%$
test_getitem[int] 1.3512ms 16.9172μs 59.1113 KOps/s 59.2984 KOps/s $\color{#d91a1a}-0.32\%$
test_getitem[slice_int] 0.1613ms 29.3969μs 34.0172 KOps/s 34.6710 KOps/s $\color{#d91a1a}-1.89\%$
test_getitem[range] 0.2697ms 0.1154ms 8.6630 KOps/s 8.7639 KOps/s $\color{#d91a1a}-1.15\%$
test_getitem[tuple] 0.1539ms 25.5274μs 39.1735 KOps/s 39.8931 KOps/s $\color{#d91a1a}-1.80\%$
test_getitem[list] 92.5205ms 0.1189ms 8.4082 KOps/s 9.6451 KOps/s $\textbf{\color{#d91a1a}-12.82\%}$
test_setitem_dim[int] 76.0010μs 55.2298μs 18.1062 KOps/s 17.6225 KOps/s $\color{#35bf28}+2.74\%$
test_setitem_dim[slice_int] 0.1109ms 79.5622μs 12.5688 KOps/s 12.4873 KOps/s $\color{#35bf28}+0.65\%$
test_setitem_dim[range] 0.1701ms 0.1428ms 7.0020 KOps/s 6.9660 KOps/s $\color{#35bf28}+0.52\%$
test_setitem_dim[tuple] 91.3720μs 72.2342μs 13.8439 KOps/s 13.8378 KOps/s $\color{#35bf28}+0.04\%$
test_setitem 71.3620μs 43.7273μs 22.8690 KOps/s 23.3722 KOps/s $\color{#d91a1a}-2.15\%$
test_set 65.7110μs 42.6761μs 23.4323 KOps/s 23.6475 KOps/s $\color{#d91a1a}-0.91\%$
test_set_shared 0.3614ms 53.0004μs 18.8678 KOps/s 18.5677 KOps/s $\color{#35bf28}+1.62\%$
test_update 82.2910μs 53.4836μs 18.6973 KOps/s 18.5845 KOps/s $\color{#35bf28}+0.61\%$
test_update_nested 90.3520μs 59.9166μs 16.6899 KOps/s 16.1499 KOps/s $\color{#35bf28}+3.34\%$
test_update__nested 83.4720μs 60.0065μs 16.6649 KOps/s 16.6352 KOps/s $\color{#35bf28}+0.18\%$
test_set_nested 69.6910μs 45.4732μs 21.9910 KOps/s 22.5033 KOps/s $\color{#d91a1a}-2.28\%$
test_set_nested_new 74.0410μs 48.9397μs 20.4333 KOps/s 20.3172 KOps/s $\color{#35bf28}+0.57\%$
test_select 0.2068ms 64.7622μs 15.4411 KOps/s 15.3643 KOps/s $\color{#35bf28}+0.50\%$
test_select_nested 0.4866ms 52.8514μs 18.9210 KOps/s 19.1444 KOps/s $\color{#d91a1a}-1.17\%$
test_exclude_nested 85.0920μs 68.6936μs 14.5574 KOps/s 14.0242 KOps/s $\color{#35bf28}+3.80\%$
test_empty[True] 0.2978ms 0.2799ms 3.5723 KOps/s 3.5357 KOps/s $\color{#35bf28}+1.04\%$
test_empty[False] 2.4490μs 0.8822μs 1.1336 MOps/s 1.1431 MOps/s $\color{#d91a1a}-0.84\%$
test_to 63.4620μs 41.3600μs 24.1779 KOps/s 24.4576 KOps/s $\color{#d91a1a}-1.14\%$
test_to_nonblocking 44.6210μs 25.4695μs 39.2626 KOps/s 38.3071 KOps/s $\color{#35bf28}+2.49\%$
test_unbind_speed 1.2810ms 0.3057ms 3.2716 KOps/s 3.3312 KOps/s $\color{#d91a1a}-1.79\%$
test_unbind_speed_stack0 0.3386ms 0.3019ms 3.3125 KOps/s 3.3612 KOps/s $\color{#d91a1a}-1.45\%$
test_unbind_speed_stack1 90.8352ms 0.7703ms 1.2982 KOps/s 1.4266 KOps/s $\textbf{\color{#d91a1a}-9.00\%}$
test_split 91.9636ms 2.3627ms 423.2377 Ops/s 425.0512 Ops/s $\color{#d91a1a}-0.43\%$
test_chunk 2.3626ms 2.1608ms 462.7924 Ops/s 423.0125 Ops/s $\textbf{\color{#35bf28}+9.40\%}$
test_creation[device0] 0.1656ms 0.1059ms 9.4389 KOps/s 9.4724 KOps/s $\color{#d91a1a}-0.35\%$
test_creation_from_tensor 0.1601ms 0.1054ms 9.4903 KOps/s 9.7299 KOps/s $\color{#d91a1a}-2.46\%$
test_add_one[memmap_tensor0] 0.1017ms 8.8901μs 112.4843 KOps/s 115.3541 KOps/s $\color{#d91a1a}-2.49\%$
test_contiguous[memmap_tensor0] 18.8910μs 2.2455μs 445.3255 KOps/s 452.4062 KOps/s $\color{#d91a1a}-1.57\%$
test_stack[memmap_tensor0] 23.9800μs 6.5940μs 151.6520 KOps/s 149.7723 KOps/s $\color{#35bf28}+1.26\%$
test_memmaptd_index 1.1230ms 0.4376ms 2.2852 KOps/s 2.2792 KOps/s $\color{#35bf28}+0.26\%$
test_memmaptd_index_astensor 94.5487ms 0.5808ms 1.7218 KOps/s 1.9709 KOps/s $\textbf{\color{#d91a1a}-12.64\%}$
test_memmaptd_index_op 1.5083ms 1.0922ms 915.6062 Ops/s 906.8555 Ops/s $\color{#35bf28}+0.96\%$
test_serialize_model 92.4336ms 89.0335ms 11.2317 Ops/s 10.8196 Ops/s $\color{#35bf28}+3.81\%$
test_serialize_model_pickle 1.3468s 1.2364s 0.8088 Ops/s 0.8068 Ops/s $\color{#35bf28}+0.24\%$
test_serialize_weights 0.1828s 96.4963ms 10.3631 Ops/s 9.8727 Ops/s $\color{#35bf28}+4.97\%$
test_serialize_weights_returnearly 70.7002ms 56.3049ms 17.7604 Ops/s 14.6225 Ops/s $\textbf{\color{#35bf28}+21.46\%}$
test_serialize_weights_pickle 1.4118s 1.1825s 0.8457 Ops/s 0.8081 Ops/s $\color{#35bf28}+4.65\%$
test_reshape_pytree 86.3220μs 38.9519μs 25.6727 KOps/s 26.5679 KOps/s $\color{#d91a1a}-3.37\%$
test_reshape_td 80.6620μs 45.1282μs 22.1591 KOps/s 22.4542 KOps/s $\color{#d91a1a}-1.31\%$
test_view_pytree 71.5820μs 38.3302μs 26.0891 KOps/s 25.8361 KOps/s $\color{#35bf28}+0.98\%$
test_view_td 86.8510μs 50.5880μs 19.7675 KOps/s 19.6833 KOps/s $\color{#35bf28}+0.43\%$
test_unbind_pytree 71.8410μs 37.4368μs 26.7117 KOps/s 26.9370 KOps/s $\color{#d91a1a}-0.84\%$
test_unbind_td 0.4057ms 46.0428μs 21.7189 KOps/s 21.4309 KOps/s $\color{#35bf28}+1.34\%$
test_split_pytree 87.4120μs 51.5234μs 19.4087 KOps/s 19.5540 KOps/s $\color{#d91a1a}-0.74\%$
test_split_td 0.5132ms 61.9861μs 16.1326 KOps/s 16.3964 KOps/s $\color{#d91a1a}-1.61\%$
test_add_pytree 99.3420μs 61.5114μs 16.2571 KOps/s 16.7677 KOps/s $\color{#d91a1a}-3.04\%$
test_add_td 0.1416ms 98.5422μs 10.1479 KOps/s 10.0082 KOps/s $\color{#35bf28}+1.40\%$
test_compile_add_one_nested[tensordict-compile] 0.4185ms 0.2179ms 4.5890 KOps/s 4.6122 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_add_one_nested[tensordict-eager] 0.2687ms 0.1748ms 5.7218 KOps/s 5.7120 KOps/s $\color{#35bf28}+0.17\%$
test_compile_add_one_nested[pytree-compile] 0.2230ms 0.1498ms 6.6771 KOps/s 6.6864 KOps/s $\color{#d91a1a}-0.14\%$
test_compile_add_one_nested[pytree-eager] 0.2622ms 0.1903ms 5.2540 KOps/s 5.1439 KOps/s $\color{#35bf28}+2.14\%$
test_compile_copy_nested[tensordict-compile] 57.0820μs 23.0528μs 43.3787 KOps/s 42.7418 KOps/s $\color{#35bf28}+1.49\%$
test_compile_copy_nested[tensordict-eager] 78.0910μs 47.6462μs 20.9881 KOps/s 20.6420 KOps/s $\color{#35bf28}+1.68\%$
test_compile_copy_nested[pytree-compile] 0.1203ms 72.7823μs 13.7396 KOps/s 13.8214 KOps/s $\color{#d91a1a}-0.59\%$
test_compile_copy_nested[pytree-eager] 88.6920μs 59.3325μs 16.8542 KOps/s 16.9617 KOps/s $\color{#d91a1a}-0.63\%$
test_compile_add_one_flat[tensordict-compile] 0.5224ms 0.3345ms 2.9897 KOps/s 2.9719 KOps/s $\color{#35bf28}+0.60\%$
test_compile_add_one_flat[tensordict-eager] 0.2835ms 0.2215ms 4.5137 KOps/s 4.5150 KOps/s $\color{#d91a1a}-0.03\%$
test_compile_add_one_flat[tensorclass-compile] 0.1858ms 0.1319ms 7.5789 KOps/s 7.4831 KOps/s $\color{#35bf28}+1.28\%$
test_compile_add_one_flat[tensorclass-eager] 0.1206ms 63.1224μs 15.8422 KOps/s 15.8968 KOps/s $\color{#d91a1a}-0.34\%$
test_compile_add_one_flat[pytree-compile] 0.3789ms 0.3347ms 2.9876 KOps/s 2.9713 KOps/s $\color{#35bf28}+0.55\%$
test_compile_add_one_flat[pytree-eager] 0.6761ms 0.6275ms 1.5937 KOps/s 1.5681 KOps/s $\color{#35bf28}+1.63\%$
test_compile_add_self_flat[tensordict-eager] 0.3186ms 0.2687ms 3.7217 KOps/s 3.6863 KOps/s $\color{#35bf28}+0.96\%$
test_compile_add_self_flat[tensordict-compile] 0.3770ms 0.3367ms 2.9701 KOps/s 2.9581 KOps/s $\color{#35bf28}+0.41\%$
test_compile_add_self_flat[tensorclass-eager] 0.1518ms 74.4014μs 13.4406 KOps/s 13.4592 KOps/s $\color{#d91a1a}-0.14\%$
test_compile_add_self_flat[tensorclass-compile] 0.1791ms 0.1339ms 7.4680 KOps/s 7.4621 KOps/s $\color{#35bf28}+0.08\%$
test_compile_add_self_flat[pytree-eager] 0.6776ms 0.5295ms 1.8885 KOps/s 1.8366 KOps/s $\color{#35bf28}+2.82\%$
test_compile_add_self_flat[pytree-compile] 0.3922ms 0.3350ms 2.9847 KOps/s 2.9858 KOps/s $\color{#d91a1a}-0.04\%$
test_compile_copy_flat[tensordict-compile] 44.0110μs 18.9850μs 52.6731 KOps/s 52.2414 KOps/s $\color{#35bf28}+0.83\%$
test_compile_copy_flat[tensordict-eager] 66.8610μs 31.8502μs 31.3970 KOps/s 30.9612 KOps/s $\color{#35bf28}+1.41\%$
test_compile_copy_flat[pytree-compile] 0.1037ms 76.9411μs 12.9970 KOps/s 13.0107 KOps/s $\color{#d91a1a}-0.11\%$
test_compile_copy_flat[pytree-eager] 92.9020μs 60.3061μs 16.5821 KOps/s 16.5414 KOps/s $\color{#35bf28}+0.25\%$
test_compile_assign_and_add[tensordict-compile] 2.5302ms 0.8669ms 1.1535 KOps/s 1.0528 KOps/s $\textbf{\color{#35bf28}+9.57\%}$
test_compile_assign_and_add[tensordict-eager] 3.4087ms 3.3113ms 301.9922 Ops/s 299.5616 Ops/s $\color{#35bf28}+0.81\%$
test_compile_assign_and_add[pytree-compile] 2.4631ms 0.8483ms 1.1789 KOps/s 1.0694 KOps/s $\textbf{\color{#35bf28}+10.23\%}$
test_compile_assign_and_add[pytree-eager] 3.3944ms 3.3320ms 300.1168 Ops/s 302.3017 Ops/s $\color{#d91a1a}-0.72\%$
test_compile_indexing[tensor-tensordict-compile] 0.1526ms 0.1133ms 8.8234 KOps/s 8.4980 KOps/s $\color{#35bf28}+3.83\%$
test_compile_indexing[tensor-tensordict-eager] 0.2068ms 61.7818μs 16.1860 KOps/s 14.8395 KOps/s $\textbf{\color{#35bf28}+9.07\%}$
test_compile_indexing[tensor-tensorclass-compile] 0.2684ms 0.1057ms 9.4592 KOps/s 8.9486 KOps/s $\textbf{\color{#35bf28}+5.71\%}$
test_compile_indexing[tensor-tensorclass-eager] 98.3720μs 45.0875μs 22.1791 KOps/s 20.6069 KOps/s $\textbf{\color{#35bf28}+7.63\%}$
test_compile_indexing[tensor-pytree-compile] 0.1523ms 0.1059ms 9.4389 KOps/s 9.1081 KOps/s $\color{#35bf28}+3.63\%$
test_compile_indexing[tensor-pytree-eager] 87.4320μs 45.1335μs 22.1565 KOps/s 20.8189 KOps/s $\textbf{\color{#35bf28}+6.43\%}$
test_compile_indexing[slice-tensordict-compile] 0.1902ms 0.1417ms 7.0561 KOps/s 6.9837 KOps/s $\color{#35bf28}+1.04\%$
test_compile_indexing[slice-tensordict-eager] 0.1777ms 26.5305μs 37.6924 KOps/s 36.6108 KOps/s $\color{#35bf28}+2.95\%$
test_compile_indexing[slice-tensorclass-compile] 0.1641ms 0.1333ms 7.5007 KOps/s 7.1658 KOps/s $\color{#35bf28}+4.67\%$
test_compile_indexing[slice-tensorclass-eager] 61.2920μs 23.0892μs 43.3103 KOps/s 43.2080 KOps/s $\color{#35bf28}+0.24\%$
test_compile_indexing[slice-pytree-compile] 0.1755ms 0.1334ms 7.4981 KOps/s 7.1653 KOps/s $\color{#35bf28}+4.64\%$
test_compile_indexing[slice-pytree-eager] 63.6110μs 23.1174μs 43.2574 KOps/s 43.3284 KOps/s $\color{#d91a1a}-0.16\%$
test_compile_indexing[int-tensordict-compile] 0.2039ms 0.1403ms 7.1288 KOps/s 6.8340 KOps/s $\color{#35bf28}+4.31\%$
test_compile_indexing[int-tensordict-eager] 0.4896ms 26.3987μs 37.8806 KOps/s 35.4818 KOps/s $\textbf{\color{#35bf28}+6.76\%}$
test_compile_indexing[int-tensorclass-compile] 0.2902ms 0.1330ms 7.5167 KOps/s 7.1375 KOps/s $\textbf{\color{#35bf28}+5.31\%}$
test_compile_indexing[int-tensorclass-eager] 55.8310μs 22.6372μs 44.1751 KOps/s 42.8677 KOps/s $\color{#35bf28}+3.05\%$
test_compile_indexing[int-pytree-compile] 0.1789ms 0.1330ms 7.5183 KOps/s 7.1945 KOps/s $\color{#35bf28}+4.50\%$
test_compile_indexing[int-pytree-eager] 0.3435ms 22.9129μs 43.6435 KOps/s 41.9699 KOps/s $\color{#35bf28}+3.99\%$
test_mod_add[eager] 76.1520μs 38.7596μs 25.8000 KOps/s 23.6527 KOps/s $\textbf{\color{#35bf28}+9.08\%}$
test_mod_add[compile] 0.1194ms 70.0402μs 14.2775 KOps/s 13.5313 KOps/s $\textbf{\color{#35bf28}+5.51\%}$
test_mod_add[compile-overhead] 0.2605ms 0.1372ms 7.2864 KOps/s 6.6044 KOps/s $\textbf{\color{#35bf28}+10.33\%}$
test_mod_wrap[eager] 0.3256ms 0.2517ms 3.9735 KOps/s 3.6513 KOps/s $\textbf{\color{#35bf28}+8.82\%}$
test_mod_wrap[compile] 1.0767ms 0.2904ms 3.4430 KOps/s 3.2206 KOps/s $\textbf{\color{#35bf28}+6.90\%}$
test_mod_wrap[compile-overhead] 8.6587ms 4.5998ms 217.4025 Ops/s 221.6111 Ops/s $\color{#d91a1a}-1.90\%$
test_mod_wrap_and_backward[eager] 1.9295ms 1.3484ms 741.5996 Ops/s 688.2880 Ops/s $\textbf{\color{#35bf28}+7.75\%}$
test_mod_wrap_and_backward[compile] 1.4531ms 1.3438ms 744.1368 Ops/s 734.4777 Ops/s $\color{#35bf28}+1.32\%$
test_mod_wrap_and_backward[compile-overhead] 1.3497ms 0.9021ms 1.1085 KOps/s 1.0848 KOps/s $\color{#35bf28}+2.18\%$
test_seq_add[eager] 0.1695ms 0.1160ms 8.6218 KOps/s 8.7607 KOps/s $\color{#d91a1a}-1.59\%$
test_seq_add[compile] 0.1347ms 87.8789μs 11.3793 KOps/s 11.4606 KOps/s $\color{#d91a1a}-0.71\%$
test_seq_add[compile-overhead] 0.1818ms 0.1247ms 8.0174 KOps/s 7.9745 KOps/s $\color{#35bf28}+0.54\%$
test_seq_wrap[eager] 0.4998ms 0.4351ms 2.2984 KOps/s 2.3176 KOps/s $\color{#d91a1a}-0.83\%$
test_seq_wrap[compile] 0.4008ms 0.3224ms 3.1018 KOps/s 3.0671 KOps/s $\color{#35bf28}+1.13\%$
test_seq_wrap[compile-overhead] 0.1931s 89.4611ms 11.1780 Ops/s 7.8815 Ops/s $\textbf{\color{#35bf28}+41.83\%}$
test_func_call_runtime[False-eager] 0.8153ms 0.7344ms 1.3617 KOps/s 1.3263 KOps/s $\color{#35bf28}+2.66\%$
test_func_call_runtime[False-compile] 0.8563ms 0.8063ms 1.2402 KOps/s 1.2216 KOps/s $\color{#35bf28}+1.52\%$
test_func_call_runtime[False-compile-overhead] 0.4300ms 0.3671ms 2.7242 KOps/s 2.6850 KOps/s $\color{#35bf28}+1.46\%$
test_func_call_runtime[True-eager] 1.0268ms 0.9416ms 1.0620 KOps/s 1.0460 KOps/s $\color{#35bf28}+1.53\%$
test_func_call_runtime[True-compile] 0.9662ms 0.8826ms 1.1330 KOps/s 1.1553 KOps/s $\color{#d91a1a}-1.93\%$
test_func_call_runtime[True-compile-overhead] 0.5089ms 0.4151ms 2.4089 KOps/s 2.4086 KOps/s $\color{#35bf28}+0.01\%$
test_func_call_cm_runtime[False-eager] 0.9608ms 0.7776ms 1.2860 KOps/s 1.2685 KOps/s $\color{#35bf28}+1.38\%$
test_func_call_cm_runtime[False-compile] 0.9232ms 0.8399ms 1.1906 KOps/s 1.1911 KOps/s $\color{#d91a1a}-0.05\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4193ms 0.3687ms 2.7120 KOps/s 2.6945 KOps/s $\color{#35bf28}+0.65\%$
test_func_call_cm_runtime[True-eager] 1.3385ms 1.0567ms 946.3568 Ops/s 941.8371 Ops/s $\color{#35bf28}+0.48\%$
test_func_call_cm_runtime[True-compile] 1.2708ms 1.0460ms 956.0113 Ops/s 969.2348 Ops/s $\color{#d91a1a}-1.36\%$
test_func_call_cm_runtime[True-compile-overhead] 1.2896ms 1.0591ms 944.2031 Ops/s 969.1661 Ops/s $\color{#d91a1a}-2.58\%$
test_distributed 0.2311ms 73.2436μs 13.6531 KOps/s 13.8513 KOps/s $\color{#d91a1a}-1.43\%$
test_tdmodule 45.7310μs 17.2849μs 57.8540 KOps/s 59.4348 KOps/s $\color{#d91a1a}-2.66\%$
test_tdmodule_dispatch 54.8920μs 35.1378μs 28.4594 KOps/s 28.7113 KOps/s $\color{#d91a1a}-0.88\%$
test_tdseq 35.2210μs 18.0490μs 55.4047 KOps/s 56.1456 KOps/s $\color{#d91a1a}-1.32\%$
test_tdseq_dispatch 58.3210μs 37.0883μs 26.9627 KOps/s 27.4965 KOps/s $\color{#d91a1a}-1.94\%$
test_instantiation_functorch 2.1753ms 2.0518ms 487.3878 Ops/s 488.5806 Ops/s $\color{#d91a1a}-0.24\%$
test_instantiation_td 1.9705ms 1.3059ms 765.7325 Ops/s 762.7968 Ops/s $\color{#35bf28}+0.38\%$
test_exec_functorch 0.2812ms 0.2185ms 4.5771 KOps/s 4.4835 KOps/s $\color{#35bf28}+2.09\%$
test_exec_functional_call 0.2782ms 0.2249ms 4.4457 KOps/s 4.5586 KOps/s $\color{#d91a1a}-2.48\%$
test_exec_td 0.2905ms 0.2360ms 4.2381 KOps/s 4.4067 KOps/s $\color{#d91a1a}-3.83\%$
test_exec_td_decorator 1.0356ms 0.2791ms 3.5829 KOps/s 3.5711 KOps/s $\color{#35bf28}+0.33\%$
test_vmap_mlp_speed[True-True] 0.8867ms 0.6525ms 1.5325 KOps/s 1.5013 KOps/s $\color{#35bf28}+2.08\%$
test_vmap_mlp_speed[True-False] 0.8048ms 0.6679ms 1.4973 KOps/s 1.4846 KOps/s $\color{#35bf28}+0.86\%$
test_vmap_mlp_speed[False-True] 0.6224ms 0.5664ms 1.7656 KOps/s 1.7339 KOps/s $\color{#35bf28}+1.82\%$
test_vmap_mlp_speed[False-False] 0.6497ms 0.5733ms 1.7444 KOps/s 1.7580 KOps/s $\color{#d91a1a}-0.78\%$
test_vmap_mlp_speed_decorator[True-True] 1.3045ms 0.7022ms 1.4242 KOps/s 1.4147 KOps/s $\color{#35bf28}+0.67\%$
test_vmap_mlp_speed_decorator[True-False] 0.8951ms 0.7036ms 1.4212 KOps/s 1.4098 KOps/s $\color{#35bf28}+0.81\%$
test_vmap_mlp_speed_decorator[False-True] 0.7731ms 0.6103ms 1.6385 KOps/s 1.6222 KOps/s $\color{#35bf28}+1.00\%$
test_vmap_mlp_speed_decorator[False-False] 0.7359ms 0.6104ms 1.6382 KOps/s 1.6225 KOps/s $\color{#35bf28}+0.97\%$
test_vmap_transformer_speed[True-True] 9.0724ms 8.6720ms 115.3140 Ops/s 114.2865 Ops/s $\color{#35bf28}+0.90\%$
test_vmap_transformer_speed[True-False] 9.8977ms 8.6291ms 115.8869 Ops/s 114.5828 Ops/s $\color{#35bf28}+1.14\%$
test_vmap_transformer_speed[False-True] 8.6399ms 8.5514ms 116.9400 Ops/s 115.8787 Ops/s $\color{#35bf28}+0.92\%$
test_vmap_transformer_speed[False-False] 8.8039ms 8.5467ms 117.0045 Ops/s 115.6200 Ops/s $\color{#35bf28}+1.20\%$
test_vmap_transformer_speed_decorator[True-True] 21.3312ms 20.7869ms 48.1073 Ops/s 48.7263 Ops/s $\color{#d91a1a}-1.27\%$
test_vmap_transformer_speed_decorator[True-False] 20.6194ms 20.5393ms 48.6873 Ops/s 48.5864 Ops/s $\color{#35bf28}+0.21\%$
test_vmap_transformer_speed_decorator[False-True] 22.0380ms 20.3970ms 49.0269 Ops/s 48.9890 Ops/s $\color{#35bf28}+0.08\%$
test_vmap_transformer_speed_decorator[False-False] 21.0909ms 20.3981ms 49.0242 Ops/s 49.0886 Ops/s $\color{#d91a1a}-0.13\%$
test_to_module_speed[True] 1.2445ms 1.1413ms 876.1691 Ops/s 866.8869 Ops/s $\color{#35bf28}+1.07\%$
test_to_module_speed[False] 1.2240ms 1.1153ms 896.6115 Ops/s 900.4922 Ops/s $\color{#d91a1a}-0.43\%$
test_tc_init 65.2320μs 41.8784μs 23.8786 KOps/s 24.9033 KOps/s $\color{#d91a1a}-4.11\%$
test_tc_init_nested 0.1237ms 83.7994μs 11.9333 KOps/s 12.5145 KOps/s $\color{#d91a1a}-4.64\%$
test_tc_first_layer_tensor 19.4988μs 0.7856μs 1.2729 MOps/s 1.2633 MOps/s $\color{#35bf28}+0.76\%$
test_tc_first_layer_nontensor 23.2110μs 2.5729μs 388.6695 KOps/s 397.5664 KOps/s $\color{#d91a1a}-2.24\%$
test_tc_second_layer_tensor 9.3403μs 1.5998μs 625.0773 KOps/s 614.2890 KOps/s $\color{#35bf28}+1.76\%$
test_tc_second_layer_nontensor 21.2410μs 3.3970μs 294.3742 KOps/s 294.6410 KOps/s $\color{#d91a1a}-0.09\%$
test_unbind 0.1820s 12.2722ms 81.4852 Ops/s 63.5884 Ops/s $\textbf{\color{#35bf28}+28.14\%}$
test_full_like 0.6558ms 0.5765ms 1.7347 KOps/s 1.7291 KOps/s $\color{#35bf28}+0.32\%$
test_zeros_like 0.2704ms 0.1977ms 5.0575 KOps/s 5.0550 KOps/s $\color{#35bf28}+0.05\%$
test_ones_like 0.3655ms 0.1975ms 5.0622 KOps/s 5.0575 KOps/s $\color{#35bf28}+0.09\%$
test_clone 0.4350ms 0.4138ms 2.4165 KOps/s 2.4151 KOps/s $\color{#35bf28}+0.06\%$
test_squeeze 31.7810μs 10.8864μs 91.8580 KOps/s 87.8315 KOps/s $\color{#35bf28}+4.58\%$
test_unsqueeze 0.2342ms 79.4674μs 12.5838 KOps/s 12.6087 KOps/s $\color{#d91a1a}-0.20\%$
test_split 0.4464ms 0.1769ms 5.6531 KOps/s 5.6520 KOps/s $\color{#35bf28}+0.02\%$
test_permute 0.2312ms 0.1908ms 5.2418 KOps/s 5.2853 KOps/s $\color{#d91a1a}-0.82\%$
test_stack 1.2481ms 0.9080ms 1.1013 KOps/s 1.1034 KOps/s $\color{#d91a1a}-0.19\%$
test_cat 1.2564ms 1.2314ms 812.1009 Ops/s 811.6734 Ops/s $\color{#35bf28}+0.05\%$

@vmoens vmoens merged commit 37feb13 into main Jul 31, 2024
51 checks passed
@vmoens vmoens deleted the fix-non-tensordict-copy-in-select branch October 21, 2024 14:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] tensorclass select breaks non-selected keys
2 participants