[BE] Better errors for TensorDictSequential #1227
Merged
This was referenced Feb 20, 2025
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 44.9040μs | 20.6183μs | 48.5006 KOps/s | 48.0451 KOps/s | |
test_plain_set_stack_nested | 50.0930μs | 20.8251μs | 48.0189 KOps/s | 47.1825 KOps/s | |
test_plain_set_nested_inplace | 73.8580μs | 22.3674μs | 44.7080 KOps/s | 43.9260 KOps/s | |
test_plain_set_stack_nested_inplace | 72.5360μs | 22.2985μs | 44.8460 KOps/s | 43.5392 KOps/s | |
test_items | 29.6260μs | 4.2503μs | 235.2791 KOps/s | 239.4424 KOps/s | |
test_items_nested | 0.8445ms | 0.4089ms | 2.4454 KOps/s | 2.4624 KOps/s | |
test_items_nested_locked | 0.5880ms | 0.4082ms | 2.4500 KOps/s | 2.4535 KOps/s | |
test_items_nested_leaf | 0.1380ms | 76.6331μs | 13.0492 KOps/s | 12.8726 KOps/s | |
test_items_stack_nested | 0.7321ms | 0.4097ms | 2.4411 KOps/s | 2.4621 KOps/s | |
test_items_stack_nested_leaf | 0.1486ms | 79.2734μs | 12.6146 KOps/s | 12.8534 KOps/s | |
test_items_stack_nested_locked | 0.5368ms | 0.4069ms | 2.4573 KOps/s | 2.4481 KOps/s | |
test_keys | 28.5330μs | 3.5047μs | 285.3315 KOps/s | 287.1990 KOps/s | |
test_keys_nested | 0.2479ms | 0.1655ms | 6.0424 KOps/s | 6.0451 KOps/s | |
test_keys_nested_locked | 1.8070ms | 0.1717ms | 5.8232 KOps/s | 5.7521 KOps/s | |
test_keys_nested_leaf | 0.2385ms | 0.1440ms | 6.9430 KOps/s | 6.8331 KOps/s | |
test_keys_stack_nested | 0.2540ms | 0.1650ms | 6.0616 KOps/s | 5.9659 KOps/s | |
test_keys_stack_nested_leaf | 0.2317ms | 0.1427ms | 7.0068 KOps/s | 6.8188 KOps/s | |
test_keys_stack_nested_locked | 0.2760ms | 0.1700ms | 5.8810 KOps/s | 5.8857 KOps/s | |
test_values | 9.2720μs | 1.0353μs | 965.8705 KOps/s | 950.4657 KOps/s | |
test_values_nested | 0.1079ms | 62.3558μs | 16.0370 KOps/s | 15.9905 KOps/s | |
test_values_nested_locked | 0.1165ms | 62.2284μs | 16.0698 KOps/s | 15.5498 KOps/s | |
test_values_nested_leaf | 0.1546ms | 70.4406μs | 14.1964 KOps/s | 14.0145 KOps/s | |
test_values_stack_nested | 0.1072ms | 62.7686μs | 15.9315 KOps/s | 15.9672 KOps/s | |
test_values_stack_nested_leaf | 0.1236ms | 70.8400μs | 14.1163 KOps/s | 14.0293 KOps/s | |
test_values_stack_nested_locked | 0.1139ms | 63.0794μs | 15.8530 KOps/s | 15.9764 KOps/s | |
test_membership | 2.2737μs | 0.6908μs | 1.4476 MOps/s | 1.4294 MOps/s | |
test_membership_nested | 43.9620μs | 2.8814μs | 347.0520 KOps/s | 336.1479 KOps/s | |
test_membership_nested_leaf | 27.9120μs | 2.8955μs | 345.3649 KOps/s | 329.7320 KOps/s | |
test_membership_stacked_nested | 22.1010μs | 2.9211μs | 342.3340 KOps/s | 328.9154 KOps/s | |
test_membership_stacked_nested_leaf | 33.8430μs | 2.8874μs | 346.3283 KOps/s | 335.5847 KOps/s | |
test_membership_nested_last | 42.6100μs | 4.2317μs | 236.3131 KOps/s | 226.5062 KOps/s | |
test_membership_nested_leaf_last | 30.8180μs | 4.2613μs | 234.6688 KOps/s | 222.8099 KOps/s | |
test_membership_stacked_nested_last | 45.7360μs | 4.2648μs | 234.4760 KOps/s | 226.7860 KOps/s | |
test_membership_stacked_nested_leaf_last | 26.5200μs | 4.2408μs | 235.8019 KOps/s | 229.3549 KOps/s | |
test_nested_getleaf | 35.0750μs | 10.3958μs | 96.1927 KOps/s | 94.1478 KOps/s | |
test_nested_get | 53.4200μs | 9.8121μs | 101.9151 KOps/s | 98.6896 KOps/s | |
test_stacked_getleaf | 41.1570μs | 10.0452μs | 99.5497 KOps/s | 94.1335 KOps/s | |
test_stacked_get | 52.0670μs | 9.8530μs | 101.4915 KOps/s | 100.6518 KOps/s | |
test_nested_getitemleaf | 36.2880μs | 11.0173μs | 90.7664 KOps/s | 89.8193 KOps/s | |
test_nested_getitem | 43.6510μs | 10.4868μs | 95.3580 KOps/s | 94.8115 KOps/s | |
test_stacked_getitemleaf | 38.0120μs | 10.9690μs | 91.1656 KOps/s | 90.2752 KOps/s | |
test_stacked_getitem | 36.6290μs | 10.4055μs | 96.1029 KOps/s | 93.5242 KOps/s | |
test_lock_nested | 0.5668ms | 0.4121ms | 2.4268 KOps/s | 2.4546 KOps/s | |
test_lock_stack_nested | 0.7587ms | 0.4182ms | 2.3910 KOps/s | 2.3952 KOps/s | |
test_unlock_nested | 0.5279ms | 0.3340ms | 2.9943 KOps/s | 3.0378 KOps/s | |
test_unlock_stack_nested | 0.5065ms | 0.3353ms | 2.9823 KOps/s | 2.9654 KOps/s | |
test_flatten_speed | 0.2002ms | 0.1012ms | 9.8774 KOps/s | 9.9545 KOps/s | |
test_unflatten_speed | 0.6909ms | 0.5205ms | 1.9212 KOps/s | 1.9295 KOps/s | |
test_common_ops | 0.9231ms | 0.7961ms | 1.2562 KOps/s | 1.2354 KOps/s | |
test_creation | 26.6200μs | 2.4901μs | 401.5935 KOps/s | 404.7869 KOps/s | |
test_creation_empty | 27.7420μs | 12.5051μs | 79.9672 KOps/s | 79.8019 KOps/s | |
test_creation_nested_1 | 72.8460μs | 15.3738μs | 65.0459 KOps/s | 65.1277 KOps/s | |
test_creation_nested_2 | 64.2800μs | 19.9485μs | 50.1290 KOps/s | 50.4151 KOps/s | |
test_clone | 59.3110μs | 13.1681μs | 75.9409 KOps/s | 75.5198 KOps/s | |
test_getitem[int] | 0.8339ms | 12.6925μs | 78.7866 KOps/s | 79.5982 KOps/s | |
test_getitem[slice_int] | 0.1300ms | 24.4167μs | 40.9556 KOps/s | 41.6432 KOps/s | |
test_getitem[range] | 0.1602ms | 49.4926μs | 20.2050 KOps/s | 19.6816 KOps/s | |
test_getitem[tuple] | 0.1226ms | 20.0549μs | 49.8632 KOps/s | 50.8067 KOps/s | |
test_getitem[list] | 0.1553ms | 45.8901μs | 21.7912 KOps/s | 22.2317 KOps/s | |
test_setitem_dim[int] | 52.8690μs | 25.2152μs | 39.6586 KOps/s | 39.9659 KOps/s | |
test_setitem_dim[slice_int] | 88.2050μs | 49.9254μs | 20.0299 KOps/s | 19.9670 KOps/s | |
test_setitem_dim[range] | 0.1213ms | 74.8983μs | 13.3514 KOps/s | 13.0774 KOps/s | |
test_setitem_dim[tuple] | 76.7840μs | 40.1292μs | 24.9195 KOps/s | 25.2807 KOps/s | |
test_setitem | 57.8280μs | 20.6709μs | 48.3771 KOps/s | 48.1554 KOps/s | |
test_set | 68.1580μs | 20.0325μs | 49.9189 KOps/s | 49.6001 KOps/s | |
test_set_shared | 4.1748ms | 0.1786ms | 5.5983 KOps/s | 5.4910 KOps/s | |
test_update | 0.1132ms | 23.1932μs | 43.1160 KOps/s | 42.5700 KOps/s | |
test_update_nested | 81.2220μs | 33.7041μs | 29.6700 KOps/s | 29.0648 KOps/s | |
test_update__nested | 0.4487ms | 32.9033μs | 30.3920 KOps/s | 29.9174 KOps/s | |
test_set_nested | 64.7810μs | 22.0285μs | 45.3956 KOps/s | 44.7993 KOps/s | |
test_set_nested_new | 85.8900μs | 26.2262μs | 38.1298 KOps/s | 37.7904 KOps/s | |
test_select | 99.4160μs | 41.6933μs | 23.9847 KOps/s | 23.0451 KOps/s | |
test_select_nested | 0.1214ms | 62.5416μs | 15.9894 KOps/s | 15.9079 KOps/s | |
test_exclude_nested | 0.1747ms | 79.9517μs | 12.5076 KOps/s | 12.3976 KOps/s | |
test_empty[True] | 0.8131ms | 0.4092ms | 2.4438 KOps/s | 2.4440 KOps/s | |
test_empty[False] | 12.0375μs | 1.4168μs | 705.8228 KOps/s | 726.1368 KOps/s | |
test_unbind_speed | 0.4481ms | 0.2731ms | 3.6617 KOps/s | 3.7506 KOps/s | |
test_unbind_speed_stack0 | 0.5589ms | 0.2683ms | 3.7277 KOps/s | 3.7679 KOps/s | |
test_unbind_speed_stack1 | 96.3906ms | 0.7174ms | 1.3939 KOps/s | 1.2518 KOps/s | |
test_split | 97.3003ms | 1.7096ms | 584.9163 Ops/s | 576.8414 Ops/s | |
test_chunk | 0.1005s | 1.7232ms | 580.3251 Ops/s | 631.3160 Ops/s | |
test_consolidate_njt[False-None] | 10.1360ms | 8.2719ms | 120.8912 Ops/s | 123.4783 Ops/s | |
test_creation[device0] | 0.2271ms | 89.3813μs | 11.1880 KOps/s | 11.0347 KOps/s | |
test_creation_from_tensor | 3.5363ms | 93.8978μs | 10.6499 KOps/s | 10.4299 KOps/s | |
test_add_one[memmap_tensor0] | 0.1340ms | 4.8815μs | 204.8546 KOps/s | 196.4578 KOps/s | |
test_contiguous[memmap_tensor0] | 9.6680μs | 0.5246μs | 1.9062 MOps/s | 1.9626 MOps/s | |
test_stack[memmap_tensor0] | 28.4940μs | 3.4052μs | 293.6718 KOps/s | 289.1680 KOps/s | |
test_memmaptd_index | 0.3280ms | 0.2239ms | 4.4670 KOps/s | 4.4639 KOps/s | |
test_memmaptd_index_astensor | 1.0755ms | 0.3091ms | 3.2354 KOps/s | 3.2121 KOps/s | |
test_memmaptd_index_op | 1.3265ms | 0.5813ms | 1.7203 KOps/s | 1.6866 KOps/s | |
test_serialize_model | 0.2147s | 0.1289s | 7.7585 Ops/s | 8.7747 Ops/s | |
test_serialize_model_pickle | 0.4617s | 0.3944s | 2.5353 Ops/s | 2.5024 Ops/s | |
test_serialize_weights | 0.1204s | 0.1141s | 8.7634 Ops/s | 8.6956 Ops/s | |
test_serialize_weights_returnearly | 0.1767s | 0.1612s | 6.2052 Ops/s | 6.4392 Ops/s | |
test_serialize_weights_pickle | 0.5552s | 0.4371s | 2.2876 Ops/s | 1.1423 Ops/s | |
test_serialize_weights_filesystem | 0.1534s | 0.1417s | 7.0582 Ops/s | 6.9788 Ops/s | |
test_serialize_model_filesystem | 0.1518s | 0.1459s | 6.8538 Ops/s | 6.9537 Ops/s | |
test_reshape_pytree | 73.4870μs | 26.5993μs | 37.5949 KOps/s | 38.4827 KOps/s | |
test_reshape_td | 69.8810μs | 32.0955μs | 31.1570 KOps/s | 31.8247 KOps/s | |
test_view_pytree | 0.1124ms | 26.1852μs | 38.1895 KOps/s | 38.7790 KOps/s | |
test_view_td | 80.6810μs | 39.8026μs | 25.1240 KOps/s | 25.7007 KOps/s | |
test_unbind_pytree | 63.4490μs | 29.3641μs | 34.0552 KOps/s | 34.2419 KOps/s | |
test_unbind_td | 0.3081ms | 39.9243μs | 25.0474 KOps/s | 25.5689 KOps/s | |
test_split_pytree | 82.4240μs | 29.2410μs | 34.1985 KOps/s | 34.8775 KOps/s | |
test_split_td | 0.2151ms | 44.3822μs | 22.5316 KOps/s | 22.3785 KOps/s | |
test_add_pytree | 81.8840μs | 34.8837μs | 28.6667 KOps/s | 28.1466 KOps/s | |
test_add_td | 0.1305ms | 53.6162μs | 18.6511 KOps/s | 18.0671 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1467ms | 66.5933μs | 15.0165 KOps/s | 15.0646 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3165ms | 0.1711ms | 5.8444 KOps/s | 5.8702 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1271ms | 45.6914μs | 21.8860 KOps/s | 21.7508 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2236ms | 0.1176ms | 8.5047 KOps/s | 8.3819 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 74.6190μs | 27.6899μs | 36.1142 KOps/s | 36.0790 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1610ms | 59.6756μs | 16.7573 KOps/s | 16.9811 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1556ms | 79.4723μs | 12.5830 KOps/s | 12.2989 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1146ms | 66.7051μs | 14.9914 KOps/s | 14.9541 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1907ms | 0.1083ms | 9.2316 KOps/s | 9.5120 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4389ms | 0.2147ms | 4.6572 KOps/s | 4.6170 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1008ms | 46.8039μs | 21.3658 KOps/s | 22.1392 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1901ms | 65.2608μs | 15.3231 KOps/s | 14.9374 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1945ms | 99.7689μs | 10.0232 KOps/s | 9.9902 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4255ms | 0.2002ms | 4.9951 KOps/s | 4.9257 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4129ms | 0.2289ms | 4.3682 KOps/s | 4.3053 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2248ms | 0.1124ms | 8.8933 KOps/s | 9.2939 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2422ms | 61.7260μs | 16.2006 KOps/s | 16.3588 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1043ms | 48.8763μs | 20.4598 KOps/s | 21.1761 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3458ms | 0.1577ms | 6.3402 KOps/s | 6.2911 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2197ms | 0.1011ms | 9.8864 KOps/s | 9.8217 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 59.3310μs | 20.9987μs | 47.6219 KOps/s | 47.0910 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1412ms | 68.5182μs | 14.5947 KOps/s | 14.7516 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1933ms | 81.2244μs | 12.3116 KOps/s | 12.2781 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1365ms | 66.8638μs | 14.9558 KOps/s | 14.8951 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3102ms | 0.2176ms | 4.5960 KOps/s | 4.6246 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.4832ms | 1.3582ms | 736.2644 Ops/s | 721.7519 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4204ms | 0.2110ms | 4.7394 KOps/s | 4.7783 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.1127ms | 0.8200ms | 1.2195 KOps/s | 1.2054 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5741ms | 0.4534ms | 2.2054 KOps/s | 2.1554 KOps/s | |
test_compile_assign_and_add_stack[eager] | 6.1328ms | 2.7276ms | 366.6200 Ops/s | 360.8514 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 87.0630μs | 37.4009μs | 26.7373 KOps/s | 26.3286 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5873ms | 33.0984μs | 30.2129 KOps/s | 30.5597 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 79.4690μs | 29.9007μs | 33.4441 KOps/s | 31.9850 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 84.8790μs | 22.5117μs | 44.4214 KOps/s | 43.4748 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 99.3360μs | 30.3309μs | 32.9696 KOps/s | 31.9191 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1091ms | 23.2977μs | 42.9228 KOps/s | 42.7183 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1315ms | 53.4909μs | 18.6948 KOps/s | 19.3294 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3560ms | 19.9351μs | 50.1627 KOps/s | 49.5202 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 97.5020μs | 45.5591μs | 21.9495 KOps/s | 22.0157 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 51.7060μs | 18.6174μs | 53.7133 KOps/s | 53.0627 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1011ms | 46.4853μs | 21.5122 KOps/s | 21.8187 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 52.0370μs | 18.4630μs | 54.1625 KOps/s | 53.9135 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1211ms | 53.7281μs | 18.6122 KOps/s | 18.8043 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8876ms | 19.8212μs | 50.4511 KOps/s | 50.4631 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 92.5340μs | 45.9779μs | 21.7496 KOps/s | 21.9369 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 53.8800μs | 18.5485μs | 53.9128 KOps/s | 54.6381 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1389ms | 45.9042μs | 21.7845 KOps/s | 21.7498 KOps/s | |
test_compile_indexing[int-pytree-eager] | 45.7660μs | 18.2309μs | 54.8520 KOps/s | 54.3315 KOps/s | |
test_mod_add[eager] | 80.9020μs | 35.8613μs | 27.8852 KOps/s | 27.5738 KOps/s | |
test_mod_add[compile] | 0.1400ms | 63.9983μs | 15.6254 KOps/s | 15.0761 KOps/s | |
test_mod_add[compile-overhead] | 0.1519ms | 62.4097μs | 16.0231 KOps/s | 15.3934 KOps/s | |
test_mod_wrap[eager] | 0.3205ms | 0.2249ms | 4.4469 KOps/s | 4.4260 KOps/s | |
test_mod_wrap[compile] | 2.2197ms | 0.2271ms | 4.4036 KOps/s | 4.3834 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3583ms | 0.2242ms | 4.4612 KOps/s | 4.4768 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.1314ms | 11.0197ms | 90.7467 Ops/s | 92.5662 Ops/s | |
test_mod_wrap_and_backward[compile] | 13.8644ms | 11.2975ms | 88.5154 Ops/s | 93.6987 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 11.6876ms | 10.7746ms | 92.8104 Ops/s | 93.6447 Ops/s | |
test_seq_add[eager] | 0.2491ms | 0.1161ms | 8.6097 KOps/s | 8.3684 KOps/s | |
test_seq_add[compile] | 0.1709ms | 75.5009μs | 13.2449 KOps/s | 13.1908 KOps/s | |
test_seq_add[compile-overhead] | 0.1705ms | 73.2983μs | 13.6429 KOps/s | 13.4448 KOps/s | |
test_seq_wrap[eager] | 0.6684ms | 0.4414ms | 2.2657 KOps/s | 2.2204 KOps/s | |
test_seq_wrap[compile] | 0.3750ms | 0.2424ms | 4.1250 KOps/s | 4.1236 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3769ms | 0.2424ms | 4.1254 KOps/s | 4.1409 KOps/s | |
test_func_call_runtime[False-eager] | 0.9692ms | 0.5492ms | 1.8208 KOps/s | 1.8716 KOps/s | |
test_func_call_runtime[False-compile] | 0.5918ms | 0.4404ms | 2.2708 KOps/s | 2.2593 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5342ms | 0.4390ms | 2.2779 KOps/s | 2.2557 KOps/s | |
test_func_call_runtime[True-eager] | 1.2202ms | 0.7524ms | 1.3290 KOps/s | 1.3706 KOps/s | |
test_func_call_runtime[True-compile] | 0.6175ms | 0.4614ms | 2.1674 KOps/s | 2.1623 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5764ms | 0.4642ms | 2.1541 KOps/s | 2.1563 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.0134ms | 0.5409ms | 1.8487 KOps/s | 1.8842 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6118ms | 0.4402ms | 2.2716 KOps/s | 2.2441 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5846ms | 0.4429ms | 2.2579 KOps/s | 2.2476 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4371ms | 0.8995ms | 1.1117 KOps/s | 1.1260 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.2900ms | 0.8027ms | 1.2458 KOps/s | 1.2419 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.0903ms | 0.8150ms | 1.2269 KOps/s | 1.2377 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6356ms | 1.9191ms | 521.0704 Ops/s | 522.7162 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.7252ms | 0.5382ms | 1.8579 KOps/s | 1.7964 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7153ms | 0.5363ms | 1.8647 KOps/s | 1.7838 KOps/s | |
test_distributed | 0.2606ms | 0.1260ms | 7.9385 KOps/s | 7.8941 KOps/s | |
test_tdmodule | 55.5240μs | 27.4217μs | 36.4675 KOps/s | 36.8144 KOps/s | |
test_tdmodule_dispatch | 0.1507ms | 52.6343μs | 18.9990 KOps/s | 20.3490 KOps/s | |
test_tdseq | 46.9870μs | 29.3925μs | 34.0223 KOps/s | 33.9509 KOps/s | |
test_tdseq_dispatch | 75.2510μs | 55.2241μs | 18.1080 KOps/s | 18.0779 KOps/s | |
test_instantiation_functorch | 1.6384ms | 1.5277ms | 654.5802 Ops/s | 665.8667 Ops/s | |
test_exec_functorch | 0.3166ms | 0.1784ms | 5.6063 KOps/s | 5.6113 KOps/s | |
test_exec_functional_call | 0.3681ms | 0.1697ms | 5.8930 KOps/s | 5.6871 KOps/s | |
test_exec_td_decorator | 0.4664ms | 0.2306ms | 4.3365 KOps/s | 4.3763 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8183ms | 0.6582ms | 1.5192 KOps/s | 1.5116 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8159ms | 0.6553ms | 1.5260 KOps/s | 1.5176 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7795ms | 0.5288ms | 1.8909 KOps/s | 1.8603 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7945ms | 0.5305ms | 1.8849 KOps/s | 1.8618 KOps/s | |
test_to_module_speed[True] | 1.8837ms | 1.3263ms | 753.9915 Ops/s | 754.0072 Ops/s | |
test_to_module_speed[False] | 2.1201ms | 1.3004ms | 768.9720 Ops/s | 771.6081 Ops/s | |
test_tc_init | 99.6370μs | 46.8400μs | 21.3493 KOps/s | 21.7316 KOps/s | |
test_tc_init_nested | 0.1566ms | 94.7170μs | 10.5578 KOps/s | 10.7001 KOps/s | |
test_tc_first_layer_tensor | 18.9750μs | 1.5258μs | 655.3815 KOps/s | 649.6538 KOps/s | |
test_tc_first_layer_nontensor | 29.7350μs | 4.7204μs | 211.8449 KOps/s | 214.8195 KOps/s | |
test_tc_second_layer_tensor | 32.4010μs | 2.8766μs | 347.6380 KOps/s | 350.1737 KOps/s | |
test_tc_second_layer_nontensor | 41.1670μs | 6.0813μs | 164.4375 KOps/s | 165.8513 KOps/s | |
test_unbind | 0.2266s | 13.2016ms | 75.7485 Ops/s | 63.6129 Ops/s | |
test_full_like | 8.6136ms | 7.3444ms | 136.1578 Ops/s | 125.8276 Ops/s | |
test_zeros_like | 4.4420ms | 2.8220ms | 354.3644 Ops/s | 221.6055 Ops/s | |
test_ones_like | 4.0181ms | 3.2018ms | 312.3243 Ops/s | 304.6481 Ops/s | |
test_clone | 8.0177ms | 5.0150ms | 199.4015 Ops/s | 194.3764 Ops/s | |
test_squeeze | 61.5950μs | 12.7380μs | 78.5053 KOps/s | 76.2639 KOps/s | |
test_unsqueeze | 0.2972ms | 93.5178μs | 10.6932 KOps/s | 10.9263 KOps/s | |
test_split | 0.3281ms | 0.1923ms | 5.1996 KOps/s | 5.0876 KOps/s | |
test_permute | 0.3419ms | 0.1999ms | 5.0032 KOps/s | 5.0546 KOps/s | |
test_stack | 30.5661ms | 23.8908ms | 41.8571 Ops/s | 39.9568 Ops/s | |
test_cat | 30.1946ms | 23.6893ms | 42.2131 Ops/s | 40.1996 Ops/s |
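
The Change column in the table above is empty in this capture. As a rough aid for reading the rows, here is a minimal sketch of how the relative difference between the Ops and Ops on Repo HEAD columns could be computed. The helper names `parse_ops` and `relative_change` are hypothetical, and the formula (PR throughput minus HEAD throughput, divided by HEAD throughput) is an assumption, not necessarily what the benchmark bot reports.

```python
import re

# Multipliers for the unit prefixes that appear in the Ops columns.
_UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '48.5006 KOps/s' to operations per second."""
    match = re.fullmatch(r"\s*([0-9.]+)\s*([KM]?Ops/s)\s*", cell)
    if match is None:
        raise ValueError(f"unrecognized throughput cell: {cell!r}")
    value, unit = match.groups()
    return float(value) * _UNIT_SCALE[unit]


def relative_change(ops_pr: str, ops_head: str) -> float:
    """Assumed definition: (PR - HEAD) / HEAD, so positive means the PR is faster."""
    pr, head = parse_ops(ops_pr), parse_ops(ops_head)
    return (pr - head) / head


# Row test_plain_set_nested from the table above.
change = relative_change("48.5006 KOps/s", "48.0451 KOps/s")
print(f"test_plain_set_nested: {change:+.2%}")
```

Parsing the unit first matters because the two columns of a row do not always share a prefix across the table (Ops/s, KOps/s, and MOps/s all occur), so comparing the raw numbers alone would be misleading.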

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 30.4710μs | 12.9132μs | 77.4401 KOps/s | 77.7608 KOps/s | |
test_plain_set_stack_nested | 39.1900μs | 13.0410μs | 76.6815 KOps/s | 77.2322 KOps/s | |
test_plain_set_nested_inplace | 56.3810μs | 14.0129μs | 71.3629 KOps/s | 71.3895 KOps/s | |
test_plain_set_stack_nested_inplace | 0.3930ms | 14.0287μs | 71.2823 KOps/s | 72.0217 KOps/s | |
test_items | 38.6400μs | 2.9037μs | 344.3938 KOps/s | 341.5541 KOps/s | |
test_items_nested | 0.7470ms | 0.3585ms | 2.7892 KOps/s | 2.7533 KOps/s | |
test_items_nested_locked | 0.7476ms | 0.3609ms | 2.7712 KOps/s | 2.7443 KOps/s | |
test_items_nested_leaf | 0.4420ms | 60.2762μs | 16.5903 KOps/s | 16.5938 KOps/s | |
test_items_stack_nested | 0.4748ms | 0.3571ms | 2.8006 KOps/s | 2.7647 KOps/s | |
test_items_stack_nested_leaf | 0.1413ms | 60.0443μs | 16.6544 KOps/s | 16.2632 KOps/s | |
test_items_stack_nested_locked | 0.5415ms | 0.3642ms | 2.7461 KOps/s | 2.7470 KOps/s | |
test_keys | 0.4988ms | 3.4129μs | 293.0068 KOps/s | 269.6456 KOps/s | |
test_keys_nested | 0.1653ms | 87.9001μs | 11.3765 KOps/s | 11.4140 KOps/s | |
test_keys_nested_locked | 0.8721ms | 93.2028μs | 10.7293 KOps/s | 10.8135 KOps/s | |
test_keys_nested_leaf | 0.1680ms | 77.9796μs | 12.8239 KOps/s | 12.7262 KOps/s | |
test_keys_stack_nested | 0.4757ms | 88.0130μs | 11.3620 KOps/s | 11.5220 KOps/s | |
test_keys_stack_nested_leaf | 0.4854ms | 78.9756μs | 12.6621 KOps/s | 12.6823 KOps/s | |
test_keys_stack_nested_locked | 0.4869ms | 93.4436μs | 10.7016 KOps/s | 10.8704 KOps/s | |
test_values | 67.0590μs | 0.8572μs | 1.1666 MOps/s | 1.1657 MOps/s | |
test_values_nested | 0.4275ms | 37.1110μs | 26.9462 KOps/s | 27.0647 KOps/s | |
test_values_nested_locked | 0.4224ms | 39.3083μs | 25.4399 KOps/s | 25.7446 KOps/s | |
test_values_nested_leaf | 0.1224ms | 42.0440μs | 23.7846 KOps/s | 23.6724 KOps/s | |
test_values_stack_nested | 63.1100μs | 37.3008μs | 26.8091 KOps/s | 26.5742 KOps/s | |
test_values_stack_nested_leaf | 85.5110μs | 42.7594μs | 23.3867 KOps/s | 23.5275 KOps/s | |
test_values_stack_nested_locked | 74.5500μs | 39.4567μs | 25.3442 KOps/s | 25.2797 KOps/s | |
test_membership | 1.8935μs | 0.5037μs | 1.9852 MOps/s | 1.9917 MOps/s | |
test_membership_nested | 18.2150μs | 1.9234μs | 519.9093 KOps/s | 510.5427 KOps/s | |
test_membership_nested_leaf | 0.1961ms | 1.9187μs | 521.1860 KOps/s | 508.5987 KOps/s | |
test_membership_stacked_nested | 24.6300μs | 2.0531μs | 487.0637 KOps/s | 486.8383 KOps/s | |
test_membership_stacked_nested_leaf | 25.1710μs | 2.0365μs | 491.0274 KOps/s | 488.5397 KOps/s | |
test_membership_nested_last | 29.0500μs | 3.0126μs | 331.9425 KOps/s | 330.6212 KOps/s | |
test_membership_nested_leaf_last | 0.3973ms | 3.0102μs | 332.2005 KOps/s | 328.2290 KOps/s | |
test_membership_stacked_nested_last | 26.2910μs | 2.9893μs | 334.5260 KOps/s | 123.8808 KOps/s | |
test_membership_stacked_nested_leaf_last | 25.8400μs | 2.9751μs | 336.1262 KOps/s | 121.8234 KOps/s | |
test_nested_getleaf | 0.3904ms | 6.1679μs | 162.1300 KOps/s | 160.5015 KOps/s | |
test_nested_get | 34.3700μs | 5.8923μs | 169.7137 KOps/s | 169.7219 KOps/s | |
test_stacked_getleaf | 31.2410μs | 6.1736μs | 161.9789 KOps/s | 163.5340 KOps/s | |
test_stacked_get | 0.3882ms | 5.8232μs | 171.7284 KOps/s | 173.5251 KOps/s | |
test_nested_getitemleaf | 85.5010μs | 6.3891μs | 156.5169 KOps/s | 155.6491 KOps/s | |
test_nested_getitem | 0.3982ms | 6.0795μs | 164.4865 KOps/s | 163.0187 KOps/s | |
test_stacked_getitemleaf | 33.4410μs | 6.3574μs | 157.2978 KOps/s | 157.3996 KOps/s | |
test_stacked_getitem | 0.4034ms | 5.9966μs | 166.7614 KOps/s | 167.1001 KOps/s | |
test_lock_nested | 9.6023ms | 0.3420ms | 2.9241 KOps/s | 2.9283 KOps/s | |
test_lock_stack_nested | 0.3896ms | 0.3375ms | 2.9627 KOps/s | 2.9789 KOps/s | |
test_unlock_nested | 0.3733ms | 0.2774ms | 3.6047 KOps/s | 3.6104 KOps/s | |
test_unlock_stack_nested | 0.3285ms | 0.2749ms | 3.6383 KOps/s | 3.6754 KOps/s | |
test_flatten_speed | 0.4514ms | 77.0461μs | 12.9792 KOps/s | 12.8632 KOps/s | |
test_unflatten_speed | 0.7307ms | 0.3109ms | 3.2160 KOps/s | 3.1522 KOps/s | |
test_common_ops | 1.0107ms | 0.6197ms | 1.6137 KOps/s | 1.6372 KOps/s | |
test_creation | 68.2000μs | 1.6881μs | 592.3701 KOps/s | 588.0705 KOps/s | |
test_creation_empty | 0.1021ms | 9.2300μs | 108.3419 KOps/s | 108.3151 KOps/s | |
test_creation_nested_1 | 46.5900μs | 11.2319μs | 89.0322 KOps/s | 91.5566 KOps/s | |
test_creation_nested_2 | 37.8300μs | 13.7557μs | 72.6970 KOps/s | 73.9425 KOps/s | |
test_clone | 44.9110μs | 10.3541μs | 96.5798 KOps/s | 98.4696 KOps/s | |
test_getitem[int] | 1.2898ms | 10.7435μs | 93.0798 KOps/s | 92.8960 KOps/s | |
test_getitem[slice_int] | 0.1165ms | 20.5756μs | 48.6012 KOps/s | 48.1119 KOps/s | |
test_getitem[range] | 0.1336ms | 36.3290μs | 27.5262 KOps/s | 28.1796 KOps/s | |
test_getitem[tuple] | 0.1124ms | 17.6820μs | 56.5546 KOps/s | 55.0282 KOps/s | |
test_getitem[list] | 0.1237ms | 31.9257μs | 31.3228 KOps/s | 31.4753 KOps/s | |
test_setitem_dim[int] | 46.9400μs | 17.9221μs | 55.7969 KOps/s | 52.6913 KOps/s | |
test_setitem_dim[slice_int] | 58.4510μs | 36.4514μs | 27.4338 KOps/s | 27.2638 KOps/s | |
test_setitem_dim[range] | 72.4710μs | 49.4067μs | 20.2402 KOps/s | 19.7015 KOps/s | |
test_setitem_dim[tuple] | 52.9310μs | 29.8775μs | 33.4701 KOps/s | 32.0598 KOps/s | |
test_setitem | 54.5300μs | 15.0479μs | 66.4547 KOps/s | 67.4498 KOps/s | |
test_set | 46.7210μs | 14.6014μs | 68.4864 KOps/s | 68.9504 KOps/s | |
test_set_shared | 0.6956ms | 0.1548ms | 6.4605 KOps/s | 6.4671 KOps/s | |
test_update | 0.4179ms | 17.9586μs | 55.6838 KOps/s | 55.1800 KOps/s | |
test_update_nested | 0.4149ms | 23.3708μs | 42.7884 KOps/s | 42.6058 KOps/s | |
test_update__nested | 0.5314ms | 23.9462μs | 41.7604 KOps/s | 41.3067 KOps/s | |
test_set_nested | 0.4138ms | 16.1767μs | 61.8173 KOps/s | 63.9150 KOps/s | |
test_set_nested_new | 60.3300μs | 18.1098μs | 55.2188 KOps/s | 55.4795 KOps/s | |
test_select | 55.6910μs | 29.0305μs | 34.4466 KOps/s | 33.7594 KOps/s | |
test_select_nested | 68.6910μs | 43.8039μs | 22.8290 KOps/s | 23.1506 KOps/s | |
test_exclude_nested | 97.7310μs | 62.3198μs | 16.0463 KOps/s | 16.5921 KOps/s | |
test_empty[True] | 0.6797ms | 0.2916ms | 3.4289 KOps/s | 3.4727 KOps/s | |
test_empty[False] | 41.1284μs | 0.8163μs | 1.2251 MOps/s | 1.2374 MOps/s | |
test_to | 91.7310μs | 55.9663μs | 17.8679 KOps/s | 18.6500 KOps/s | |
test_to_nonblocking | 0.1042ms | 46.0474μs | 21.7167 KOps/s | 22.1836 KOps/s | |
test_unbind_speed | 0.2866ms | 0.2344ms | 4.2660 KOps/s | 4.2333 KOps/s | |
test_unbind_speed_stack0 | 0.3310ms | 0.2356ms | 4.2451 KOps/s | 4.2750 KOps/s | |
test_unbind_speed_stack1 | 94.4231ms | 0.7289ms | 1.3720 KOps/s | 1.3961 KOps/s | |
test_split | 96.1342ms | 1.6610ms | 602.0387 Ops/s | 627.4095 Ops/s | |
test_chunk | 97.0732ms | 1.6566ms | 603.6348 Ops/s | 629.5887 Ops/s | |
test_consolidate[False-None] | 2.8645ms | 2.7205ms | 367.5844 Ops/s | 376.7285 Ops/s | |
test_consolidate[default-None] | 2.1027ms | 1.7109ms | 584.4853 Ops/s | 587.6700 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8414ms | 1.7103ms | 584.6989 Ops/s | 566.3525 Ops/s | |
test_consolidate_njt[False-None] | 6.6313ms | 6.4195ms | 155.7749 Ops/s | 152.6605 Ops/s | |
test_to[False-False-None] | 2.0419ms | 1.6700ms | 598.8060 Ops/s | 595.1390 Ops/s | |
test_to[True-False-None] | 1.7397ms | 1.3737ms | 727.9666 Ops/s | 752.7428 Ops/s | |
test_to[within-False-None] | 4.5986ms | 4.1690ms | 239.8641 Ops/s | 239.1201 Ops/s | |
test_to[True-default-None] | 5.5278ms | 5.1362ms | 194.6956 Ops/s | 128.3486 Ops/s | |
test_to_njt[False-False-None] | 7.0577ms | 6.7385ms | 148.4007 Ops/s | 139.6318 Ops/s | |
test_to_njt[True-False-None] | 5.5669ms | 5.3506ms | 186.8937 Ops/s | 171.8492 Ops/s | |
test_to_njt[within-False-None] | 12.3745ms | 11.9387ms | 83.7611 Ops/s | 82.3961 Ops/s | |
test_creation[device0] | 0.5043ms | 81.7563μs | 12.2315 KOps/s | 12.2281 KOps/s | |
test_creation_from_tensor | 0.5158ms | 82.4739μs | 12.1251 KOps/s | 11.6812 KOps/s | |
test_add_one[memmap_tensor0] | 0.4310ms | 6.3834μs | 156.6561 KOps/s | 157.1389 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8425μs | 0.3936μs | 2.5406 MOps/s | 2.4577 MOps/s | |
test_stack[memmap_tensor0] | 23.1810μs | 4.6049μs | 217.1599 KOps/s | 212.1378 KOps/s | |
test_memmaptd_index | 1.7465ms | 0.2462ms | 4.0618 KOps/s | 3.7467 KOps/s | |
test_memmaptd_index_astensor | 0.4320ms | 0.3083ms | 3.2437 KOps/s | 3.1683 KOps/s | |
test_memmaptd_index_op | 0.7418ms | 0.5791ms | 1.7268 KOps/s | 1.6760 KOps/s | |
test_serialize_model | 0.1312s | 0.1300s | 7.6924 Ops/s | 7.6714 Ops/s | |
test_serialize_model_pickle | 1.3464s | 1.2112s | 0.8256 Ops/s | 0.8252 Ops/s | |
test_serialize_weights | 0.1309s | 0.1295s | 7.7245 Ops/s | 7.7155 Ops/s | |
test_serialize_weights_returnearly | 0.3327s | 53.6093ms | 18.6535 Ops/s | 15.4710 Ops/s | |
test_serialize_weights_pickle | 1.3794s | 1.2186s | 0.8206 Ops/s | 0.8143 Ops/s | |
test_reshape_pytree | 52.4710μs | 22.3554μs | 44.7318 KOps/s | 45.4708 KOps/s | |
test_reshape_td | 54.6500μs | 26.7203μs | 37.4247 KOps/s | 37.5657 KOps/s | |
test_view_pytree | 47.2910μs | 21.9165μs | 45.6277 KOps/s | 46.6216 KOps/s | |
test_view_td | 65.6710μs | 31.4750μs | 31.7712 KOps/s | 30.6609 KOps/s | |
test_unbind_pytree | 61.6200μs | 27.9115μs | 35.8275 KOps/s | 36.2507 KOps/s | |
test_unbind_td | 0.6795ms | 36.6699μs | 27.2703 KOps/s | 26.7048 KOps/s | |
test_split_pytree | 68.2410μs | 30.3725μs | 32.9245 KOps/s | 33.2774 KOps/s | |
test_split_td | 0.8355ms | 40.5299μs | 24.6731 KOps/s | 25.5808 KOps/s | |
test_add_pytree | 82.0110μs | 33.1637μs | 30.1535 KOps/s | 30.4119 KOps/s | |
test_add_td | 89.3010μs | 48.8630μs | 20.4654 KOps/s | 21.5066 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1703ms | 0.1244ms | 8.0405 KOps/s | 8.0002 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2213ms | 0.1296ms | 7.7165 KOps/s | 7.5279 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2101ms | 92.3928μs | 10.8234 KOps/s | 10.4766 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2455ms | 0.1465ms | 6.8268 KOps/s | 6.8567 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1788ms | 24.4056μs | 40.9742 KOps/s | 41.7232 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1071ms | 27.9924μs | 35.7240 KOps/s | 34.1774 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3276ms | 62.7211μs | 15.9436 KOps/s | 15.7210 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1193ms | 48.8767μs | 20.4596 KOps/s | 20.5628 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2081ms | 0.1433ms | 6.9783 KOps/s | 7.1710 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3280ms | 0.2148ms | 4.6566 KOps/s | 4.6599 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1408ms | 94.0810μs | 10.6291 KOps/s | 10.4748 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1206ms | 54.1855μs | 18.4551 KOps/s | 18.1752 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2154ms | 0.1304ms | 7.6692 KOps/s | 7.3036 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5386ms | 0.4742ms | 2.1089 KOps/s | 2.1021 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3690ms | 0.2541ms | 3.9347 KOps/s | 3.7807 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1942ms | 0.1384ms | 7.2232 KOps/s | 6.7439 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1689ms | 68.8423μs | 14.5259 KOps/s | 14.3824 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1497ms | 0.1005ms | 9.9532 KOps/s | 9.7085 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4839ms | 0.3990ms | 2.5064 KOps/s | 2.5161 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2594ms | 0.1312ms | 7.6237 KOps/s | 7.1397 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 69.6010μs | 18.9707μs | 52.7128 KOps/s | 51.5856 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 70.4810μs | 31.1132μs | 32.1407 KOps/s | 31.3662 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1710ms | 68.8740μs | 14.5193 KOps/s | 14.2650 KOps/s | |
test_compile_copy_flat[pytree-eager] | 91.3700μs | 51.8286μs | 19.2944 KOps/s | 18.9091 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6142ms | 0.3878ms | 2.5786 KOps/s | 2.2028 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7235ms | 2.5787ms | 387.7898 Ops/s | 389.7268 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6383ms | 0.4459ms | 2.2426 KOps/s | 2.2918 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8078ms | 2.6116ms | 382.9085 Ops/s | 393.3774 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.3364ms | 0.1148ms | 8.7114 KOps/s | 8.9890 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5430ms | 77.0361μs | 12.9809 KOps/s | 13.0227 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.5200ms | 0.1074ms | 9.3070 KOps/s | 9.8739 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1297ms | 68.4096μs | 14.6178 KOps/s | 14.3616 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1830ms | 0.1102ms | 9.0730 KOps/s | 9.2488 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1523ms | 68.6691μs | 14.5626 KOps/s | 14.4606 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1422ms | 98.6118μs | 10.1408 KOps/s | 9.9392 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1505ms | 17.2954μs | 57.8189 KOps/s | 57.1420 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1739ms | 95.0637μs | 10.5193 KOps/s | 10.5260 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 45.8400μs | 16.0035μs | 62.4864 KOps/s | 62.7075 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1739ms | 99.5598μs | 10.0442 KOps/s | 10.5061 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 77.8710μs | 16.0169μs | 62.4341 KOps/s | 63.3749 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1649ms | 0.1041ms | 9.6079 KOps/s | 10.0777 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6034ms | 17.2362μs | 58.0173 KOps/s | 58.4293 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1545ms | 99.9098μs | 10.0090 KOps/s | 10.2149 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 62.7510μs | 15.9717μs | 62.6107 KOps/s | 63.1664 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1448ms | 94.9692μs | 10.5297 KOps/s | 10.5468 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1480ms | 19.3852μs | 51.5858 KOps/s | 64.0057 KOps/s | |
test_mod_add[eager] | 97.4510μs | 37.9733μs | 26.3343 KOps/s | 26.6094 KOps/s | |
test_mod_add[compile] | 0.3502ms | 78.9903μs | 12.6598 KOps/s | 12.5336 KOps/s | |
test_mod_add[compile-overhead] | 0.3235ms | 0.1642ms | 6.0906 KOps/s | 5.4777 KOps/s | |
test_mod_wrap[eager] | 0.3324ms | 0.2385ms | 4.1921 KOps/s | 4.0854 KOps/s | |
test_mod_wrap[compile] | 0.3902ms | 0.2759ms | 3.6249 KOps/s | 3.5225 KOps/s | |
test_mod_wrap[compile-overhead] | 7.2295ms | 3.8185ms | 261.8858 Ops/s | 270.4672 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5138ms | 1.3195ms | 757.8639 Ops/s | 701.2853 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3820ms | 1.2384ms | 807.5182 Ops/s | 738.0317 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3371ms | 0.9014ms | 1.1094 KOps/s | 986.5262 Ops/s | |
test_seq_add[eager] | 0.1736ms | 0.1157ms | 8.6394 KOps/s | 8.6018 KOps/s | |
test_seq_add[compile] | 0.2201ms | 87.6307μs | 11.4115 KOps/s | 11.2190 KOps/s | |
test_seq_add[compile-overhead] | 0.1742ms | 0.1331ms | 7.5106 KOps/s | 7.8631 KOps/s | |
test_seq_wrap[eager] | 0.5429ms | 0.4357ms | 2.2950 KOps/s | 2.4049 KOps/s | |
test_seq_wrap[compile] | 0.3724ms | 0.2966ms | 3.3720 KOps/s | 3.3349 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2941ms | 0.2253ms | 4.4384 KOps/s | 4.4512 KOps/s | |
test_func_call_runtime[False-eager] | 0.7945ms | 0.7124ms | 1.4037 KOps/s | 1.4100 KOps/s | |
test_func_call_runtime[False-compile] | 0.9193ms | 0.7349ms | 1.3606 KOps/s | 1.3513 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4107ms | 0.3561ms | 2.8084 KOps/s | 2.7893 KOps/s | |
test_func_call_runtime[True-eager] | 0.9360ms | 0.8695ms | 1.1500 KOps/s | 1.1343 KOps/s | |
test_func_call_runtime[True-compile] | 0.8900ms | 0.7706ms | 1.2977 KOps/s | 1.2629 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5191ms | 0.3759ms | 2.6605 KOps/s | 2.6245 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.0145ms | 0.7050ms | 1.4185 KOps/s | 1.3155 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9496ms | 0.7308ms | 1.3684 KOps/s | 1.3163 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4122ms | 0.3565ms | 2.8047 KOps/s | 2.7751 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.3249ms | 0.9786ms | 1.0218 KOps/s | 1.0115 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.5195ms | 0.9624ms | 1.0391 KOps/s | 1.0327 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.0878ms | 0.9622ms | 1.0392 KOps/s | 1.0259 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4107ms | 1.9942ms | 501.4663 Ops/s | 491.7777 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0017ms | 0.8081ms | 1.2375 KOps/s | 1.2360 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4880ms | 0.4076ms | 2.4536 KOps/s | 2.4266 KOps/s | |
test_distributed | 3.0777ms | 0.1242ms | 8.0544 KOps/s | 8.7423 KOps/s | |
test_tdmodule | 0.5917ms | 21.3975μs | 46.7343 KOps/s | 48.5314 KOps/s | |
test_tdmodule_dispatch | 59.0710μs | 37.1758μs | 26.8992 KOps/s | 27.5870 KOps/s | |
test_tdseq | 50.6310μs | 21.4831μs | 46.5482 KOps/s | 47.5277 KOps/s | |
test_tdseq_dispatch | 60.3010μs | 40.0313μs | 24.9805 KOps/s | 25.1863 KOps/s | |
test_instantiation_functorch | 1.9322ms | 1.5103ms | 662.1090 Ops/s | 655.6165 Ops/s | |
test_exec_functorch | 0.5295ms | 0.1388ms | 7.2057 KOps/s | 7.2241 KOps/s | |
test_exec_functional_call | 0.5564ms | 0.1311ms | 7.6276 KOps/s | 7.6957 KOps/s | |
test_exec_td_decorator | 0.3913ms | 0.1852ms | 5.3998 KOps/s | 5.5756 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.0867ms | 0.6760ms | 1.4793 KOps/s | 1.5054 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7933ms | 0.6616ms | 1.5116 KOps/s | 1.5008 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7158ms | 0.5701ms | 1.7540 KOps/s | 1.7413 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6919ms | 0.5678ms | 1.7613 KOps/s | 1.7391 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 18.7843ms | 18.3524ms | 54.4888 Ops/s | 53.8731 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.1300ms | 18.5221ms | 53.9895 Ops/s | 53.9078 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 18.9682ms | 18.2282ms | 54.8601 Ops/s | 54.1649 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.0803ms | 18.3768ms | 54.4166 Ops/s | 54.3288 Ops/s | |
test_to_module_speed[True] | 1.4014ms | 0.9529ms | 1.0494 KOps/s | 1.0325 KOps/s | |
test_to_module_speed[False] | 1.1454ms | 0.9356ms | 1.0688 KOps/s | 1.0536 KOps/s | |
test_tc_init | 64.8810μs | 37.3403μs | 26.7807 KOps/s | 26.2177 KOps/s | |
test_tc_init_nested | 0.4512ms | 73.5735μs | 13.5918 KOps/s | 13.1815 KOps/s | |
test_tc_first_layer_tensor | 23.9600μs | 0.7861μs | 1.2720 MOps/s | 1.4407 MOps/s | |
test_tc_first_layer_nontensor | 18.6100μs | 2.2060μs | 453.3135 KOps/s | 453.9305 KOps/s | |
test_tc_second_layer_tensor | 98.6982μs | 1.4010μs | 713.7562 KOps/s | 709.2506 KOps/s | |
test_tc_second_layer_nontensor | 0.3938ms | 2.9142μs | 343.1471 KOps/s | 337.5734 KOps/s | |
test_unbind | 0.2168s | 11.8083ms | 84.6862 Ops/s | 145.4373 Ops/s | |
test_full_like | 9.8052ms | 9.3137ms | 107.3687 Ops/s | 103.7984 Ops/s | |
test_zeros_like | 4.8944ms | 4.3339ms | 230.7371 Ops/s | 114.3572 Ops/s | |
test_ones_like | 9.4157ms | 7.2155ms | 138.5903 Ops/s | 230.2769 Ops/s | |
test_clone | 7.1233ms | 6.6164ms | 151.1395 Ops/s | 151.2322 Ops/s | |
test_squeeze | 58.2700μs | 9.6312μs | 103.8295 KOps/s | 99.6206 KOps/s | |
test_unsqueeze | 0.1353ms | 72.4014μs | 13.8119 KOps/s | 13.0040 KOps/s | |
test_split | 0.6190ms | 0.1567ms | 6.3826 KOps/s | 6.2174 KOps/s | |
test_permute | 0.3153ms | 0.1773ms | 5.6405 KOps/s | 5.3093 KOps/s | |
test_stack | 51.0300ms | 50.1688ms | 19.9327 Ops/s | 19.6818 Ops/s | |
test_cat | 51.1098ms | 50.5102ms | 19.7980 Ops/s | 19.7563 Ops/s |
vmoens added a commit that referenced this pull request on Feb 20, 2025
ghstack-source-id: 2f2098db6ccf16a61f37c6e911fb921e305a96ec
Pull Request resolved: #1227
Labels
BE: Better errors, logs, docs or test utils
CLA Signed: This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Stack from ghstack (oldest at bottom):