-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] from_dataclass with dest_cls arg #1250
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Mar 5, 2025
ghstack-source-id: 14079f32d9270dcdabf4aa3f0d614e8c83a46876 Pull Request resolved: #1250
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 40.5750μs | 20.7606μs | 48.1683 KOps/s | 48.0171 KOps/s | |
test_plain_set_stack_nested | 46.4060μs | 20.8546μs | 47.9510 KOps/s | 47.7608 KOps/s | |
test_plain_set_nested_inplace | 68.6380μs | 22.6499μs | 44.1503 KOps/s | 43.8413 KOps/s | |
test_plain_set_stack_nested_inplace | 59.4510μs | 22.6018μs | 44.2443 KOps/s | 43.8127 KOps/s | |
test_items | 41.2880μs | 4.2082μs | 237.6334 KOps/s | 241.2324 KOps/s | |
test_items_nested | 0.5439ms | 0.4090ms | 2.4450 KOps/s | 2.4182 KOps/s | |
test_items_nested_locked | 0.5486ms | 0.4083ms | 2.4492 KOps/s | 2.4064 KOps/s | |
test_items_nested_leaf | 0.1503ms | 76.9189μs | 13.0007 KOps/s | 12.9700 KOps/s | |
test_items_stack_nested | 0.8367ms | 0.4151ms | 2.4093 KOps/s | 2.3702 KOps/s | |
test_items_stack_nested_leaf | 0.1524ms | 76.7000μs | 13.0378 KOps/s | 12.9484 KOps/s | |
test_items_stack_nested_locked | 0.5913ms | 0.4090ms | 2.4450 KOps/s | 2.3757 KOps/s | |
test_keys | 43.6510μs | 3.4558μs | 289.3716 KOps/s | 286.3559 KOps/s | |
test_keys_nested | 0.2874ms | 0.1670ms | 5.9869 KOps/s | 5.9940 KOps/s | |
test_keys_nested_locked | 0.7025ms | 0.1733ms | 5.7693 KOps/s | 5.7863 KOps/s | |
test_keys_nested_leaf | 0.2588ms | 0.1470ms | 6.8034 KOps/s | 6.7884 KOps/s | |
test_keys_stack_nested | 0.2780ms | 0.1693ms | 5.9066 KOps/s | 5.9405 KOps/s | |
test_keys_stack_nested_leaf | 0.2620ms | 0.1465ms | 6.8241 KOps/s | 6.7647 KOps/s | |
test_keys_stack_nested_locked | 0.2333ms | 0.1732ms | 5.7753 KOps/s | 5.7616 KOps/s | |
test_values | 9.5238μs | 1.0545μs | 948.3264 KOps/s | 900.5473 KOps/s | |
test_values_nested | 0.1154ms | 63.1789μs | 15.8281 KOps/s | 15.8478 KOps/s | |
test_values_nested_locked | 0.1187ms | 62.7351μs | 15.9400 KOps/s | 15.8765 KOps/s | |
test_values_nested_leaf | 0.1260ms | 72.1113μs | 13.8675 KOps/s | 13.7902 KOps/s | |
test_values_stack_nested | 0.1153ms | 62.2975μs | 16.0520 KOps/s | 16.0598 KOps/s | |
test_values_stack_nested_leaf | 0.1372ms | 72.2708μs | 13.8369 KOps/s | 13.4061 KOps/s | |
test_values_stack_nested_locked | 0.1225ms | 63.2144μs | 15.8192 KOps/s | 15.9577 KOps/s | |
test_membership | 2.0333μs | 0.6919μs | 1.4453 MOps/s | 1.4469 MOps/s | |
test_membership_nested | 20.7190μs | 2.8978μs | 345.0848 KOps/s | 344.1723 KOps/s | |
test_membership_nested_leaf | 45.5450μs | 2.8994μs | 344.9009 KOps/s | 345.1008 KOps/s | |
test_membership_stacked_nested | 22.9120μs | 2.8972μs | 345.1590 KOps/s | 344.2580 KOps/s | |
test_membership_stacked_nested_leaf | 47.3380μs | 2.9173μs | 342.7787 KOps/s | 346.1410 KOps/s | |
test_membership_nested_last | 28.0920μs | 4.3222μs | 231.3652 KOps/s | 229.7165 KOps/s | |
test_membership_nested_leaf_last | 46.8370μs | 4.3690μs | 228.8858 KOps/s | 232.8407 KOps/s | |
test_membership_stacked_nested_last | 27.7010μs | 4.3853μs | 228.0371 KOps/s | 231.7050 KOps/s | |
test_membership_stacked_nested_leaf_last | 25.9980μs | 4.3354μs | 230.6592 KOps/s | 233.0949 KOps/s | |
test_nested_getleaf | 35.8770μs | 10.6878μs | 93.5644 KOps/s | 93.6143 KOps/s | |
test_nested_get | 54.9820μs | 10.1156μs | 98.8571 KOps/s | 99.2710 KOps/s | |
test_stacked_getleaf | 54.2710μs | 10.5659μs | 94.6445 KOps/s | 96.1913 KOps/s | |
test_stacked_get | 54.3110μs | 10.1072μs | 98.9391 KOps/s | 98.8315 KOps/s | |
test_nested_getitemleaf | 60.2320μs | 11.2314μs | 89.0364 KOps/s | 89.3209 KOps/s | |
test_nested_getitem | 53.1490μs | 10.6438μs | 93.9512 KOps/s | 93.3433 KOps/s | |
test_stacked_getitemleaf | 41.5080μs | 11.2036μs | 89.2572 KOps/s | 88.5391 KOps/s | |
test_stacked_getitem | 52.5290μs | 10.7611μs | 92.9276 KOps/s | 94.4259 KOps/s | |
test_lock_nested | 0.6276ms | 0.4039ms | 2.4757 KOps/s | 2.4286 KOps/s | |
test_lock_stack_nested | 0.6467ms | 0.4170ms | 2.3979 KOps/s | 2.3528 KOps/s | |
test_unlock_nested | 0.5490ms | 0.3339ms | 2.9949 KOps/s | 2.9692 KOps/s | |
test_unlock_stack_nested | 0.4997ms | 0.3371ms | 2.9664 KOps/s | 2.9005 KOps/s | |
test_flatten_speed | 0.2019ms | 99.8683μs | 10.0132 KOps/s | 9.9765 KOps/s | |
test_unflatten_speed | 0.7419ms | 0.5278ms | 1.8948 KOps/s | 1.9043 KOps/s | |
test_common_ops | 6.2712ms | 0.7965ms | 1.2554 KOps/s | 1.1834 KOps/s | |
test_creation | 24.0040μs | 2.5163μs | 397.4018 KOps/s | 398.5103 KOps/s | |
test_creation_empty | 60.4030μs | 11.5904μs | 86.2783 KOps/s | 79.8977 KOps/s | |
test_creation_nested_1 | 40.8660μs | 14.4484μs | 69.2120 KOps/s | 64.3728 KOps/s | |
test_creation_nested_2 | 69.7000μs | 19.0555μs | 52.4782 KOps/s | 49.5360 KOps/s | |
test_clone | 34.4440μs | 13.2359μs | 75.5522 KOps/s | 72.9165 KOps/s | |
test_getitem[int] | 0.8563ms | 12.5929μs | 79.4095 KOps/s | 78.2263 KOps/s | |
test_getitem[slice_int] | 0.1684ms | 24.4153μs | 40.9580 KOps/s | 38.5239 KOps/s | |
test_getitem[range] | 0.1783ms | 49.4971μs | 20.2032 KOps/s | 19.7463 KOps/s | |
test_getitem[tuple] | 0.1236ms | 19.9462μs | 50.1349 KOps/s | 49.5966 KOps/s | |
test_getitem[list] | 0.1635ms | 45.0366μs | 22.2042 KOps/s | 21.6098 KOps/s | |
test_setitem_dim[int] | 51.0550μs | 25.3128μs | 39.5057 KOps/s | 38.9238 KOps/s | |
test_setitem_dim[slice_int] | 96.4300μs | 51.5984μs | 19.3804 KOps/s | 18.9278 KOps/s | |
test_setitem_dim[range] | 0.1044ms | 76.0653μs | 13.1466 KOps/s | 13.0704 KOps/s | |
test_setitem_dim[tuple] | 83.7860μs | 40.5610μs | 24.6542 KOps/s | 23.8912 KOps/s | |
test_setitem | 79.6890μs | 19.8286μs | 50.4321 KOps/s | 47.0548 KOps/s | |
test_set | 88.0540μs | 19.4483μs | 51.4185 KOps/s | 48.5489 KOps/s | |
test_set_shared | 0.3233ms | 0.1795ms | 5.5721 KOps/s | 5.4543 KOps/s | |
test_update | 0.2017ms | 25.1045μs | 39.8335 KOps/s | 36.9885 KOps/s | |
test_update_nested | 0.4410ms | 40.6891μs | 24.5766 KOps/s | 23.3636 KOps/s | |
test_update__nested | 97.5420μs | 33.3316μs | 30.0016 KOps/s | 28.6732 KOps/s | |
test_set_nested | 0.1047ms | 21.4802μs | 46.5546 KOps/s | 42.6588 KOps/s | |
test_set_nested_new | 0.1050ms | 26.2523μs | 38.0919 KOps/s | 35.6260 KOps/s | |
test_select | 0.1365ms | 42.8456μs | 23.3396 KOps/s | 22.5940 KOps/s | |
test_select_nested | 0.1374ms | 63.1917μs | 15.8249 KOps/s | 15.6619 KOps/s | |
test_exclude_nested | 0.1651ms | 80.8371μs | 12.3706 KOps/s | 12.2088 KOps/s | |
test_empty[True] | 0.7391ms | 0.4129ms | 2.4219 KOps/s | 2.4072 KOps/s | |
test_empty[False] | 12.8162μs | 1.3639μs | 733.1932 KOps/s | 707.0360 KOps/s | |
test_unbind_speed | 0.6290ms | 0.2732ms | 3.6604 KOps/s | 3.6400 KOps/s | |
test_unbind_speed_stack0 | 0.5514ms | 0.2690ms | 3.7175 KOps/s | 3.6626 KOps/s | |
test_unbind_speed_stack1 | 0.1105s | 0.7386ms | 1.3538 KOps/s | 1.1778 KOps/s | |
test_split | 0.1170s | 1.7662ms | 566.1718 Ops/s | 559.1399 Ops/s | |
test_chunk | 0.1102s | 1.7690ms | 565.2822 Ops/s | 620.1914 Ops/s | |
test_consolidate_njt[False-None] | 8.3464ms | 8.1010ms | 123.4418 Ops/s | 107.4907 Ops/s | |
test_creation[device0] | 0.1934ms | 91.6681μs | 10.9089 KOps/s | 10.5445 KOps/s | |
test_creation_from_tensor | 3.1719ms | 96.2867μs | 10.3857 KOps/s | 10.5419 KOps/s | |
test_add_one[memmap_tensor0] | 0.1085ms | 4.8566μs | 205.9048 KOps/s | 199.6840 KOps/s | |
test_contiguous[memmap_tensor0] | 21.4400μs | 0.5161μs | 1.9376 MOps/s | 1.9714 MOps/s | |
test_stack[memmap_tensor0] | 32.5500μs | 3.2566μs | 307.0727 KOps/s | 304.6447 KOps/s | |
test_memmaptd_index | 1.3059ms | 0.2332ms | 4.2875 KOps/s | 4.3862 KOps/s | |
test_memmaptd_index_astensor | 0.6725ms | 0.3198ms | 3.1267 KOps/s | 3.1399 KOps/s | |
test_memmaptd_index_op | 1.0441ms | 0.5739ms | 1.7426 KOps/s | 1.6817 KOps/s | |
test_serialize_model | 0.2311s | 0.1340s | 7.4636 Ops/s | 8.4716 Ops/s | |
test_serialize_model_pickle | 0.4697s | 0.3931s | 2.5439 Ops/s | 2.5257 Ops/s | |
test_serialize_weights | 0.1288s | 0.1156s | 8.6521 Ops/s | 8.5029 Ops/s | |
test_serialize_weights_returnearly | 0.1843s | 0.1611s | 6.2067 Ops/s | 5.5792 Ops/s | |
test_serialize_weights_pickle | 0.4942s | 0.4476s | 2.2343 Ops/s | 2.5564 Ops/s | |
test_serialize_weights_filesystem | 0.2680s | 0.1593s | 6.2777 Ops/s | 6.9367 Ops/s | |
test_serialize_model_filesystem | 0.1573s | 0.1498s | 6.6766 Ops/s | 6.5144 Ops/s | |
test_reshape_pytree | 63.2480μs | 26.1375μs | 38.2592 KOps/s | 37.1505 KOps/s | |
test_reshape_td | 73.4270μs | 32.4424μs | 30.8239 KOps/s | 29.9784 KOps/s | |
test_view_pytree | 75.7520μs | 26.1134μs | 38.2945 KOps/s | 37.7776 KOps/s | |
test_view_td | 89.0170μs | 40.0573μs | 24.9642 KOps/s | 24.0392 KOps/s | |
test_unbind_pytree | 71.3230μs | 28.9814μs | 34.5049 KOps/s | 33.0655 KOps/s | |
test_unbind_td | 0.3383ms | 39.9941μs | 25.0037 KOps/s | 24.6108 KOps/s | |
test_split_pytree | 64.4800μs | 28.9929μs | 34.4913 KOps/s | 32.9212 KOps/s | |
test_split_td | 0.5225ms | 46.3074μs | 21.5948 KOps/s | 21.9544 KOps/s | |
test_add_pytree | 73.3360μs | 35.6919μs | 28.0176 KOps/s | 27.1857 KOps/s | |
test_add_td | 0.1097ms | 55.0903μs | 18.1520 KOps/s | 17.4371 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1436ms | 65.7592μs | 15.2070 KOps/s | 14.9625 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.5368ms | 0.1726ms | 5.7941 KOps/s | 5.7587 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1124ms | 45.6227μs | 21.9189 KOps/s | 21.9285 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2877ms | 0.1182ms | 8.4607 KOps/s | 8.2553 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 85.3800μs | 28.3905μs | 35.2231 KOps/s | 35.1768 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1113ms | 59.5516μs | 16.7922 KOps/s | 16.4548 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1888ms | 78.8443μs | 12.6832 KOps/s | 12.2894 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1516ms | 67.0326μs | 14.9181 KOps/s | 14.7533 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2038ms | 0.1070ms | 9.3486 KOps/s | 9.2958 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4668ms | 0.2168ms | 4.6131 KOps/s | 4.6401 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1549ms | 46.3268μs | 21.5858 KOps/s | 21.6539 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1917ms | 67.4929μs | 14.8164 KOps/s | 14.9660 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1791ms | 99.8767μs | 10.0123 KOps/s | 10.1023 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3626ms | 0.1995ms | 5.0117 KOps/s | 4.9607 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3919ms | 0.2313ms | 4.3237 KOps/s | 4.3105 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1825ms | 0.1072ms | 9.3283 KOps/s | 9.3179 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.3394ms | 65.5951μs | 15.2450 KOps/s | 16.0638 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.4323ms | 49.1160μs | 20.3600 KOps/s | 20.7042 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2732ms | 0.1571ms | 6.3671 KOps/s | 6.3358 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1767ms | 99.4348μs | 10.0568 KOps/s | 10.0226 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 80.8310μs | 21.1323μs | 47.3209 KOps/s | 45.1897 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1662ms | 68.9571μs | 14.5018 KOps/s | 14.8102 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1580ms | 80.2138μs | 12.4667 KOps/s | 12.1538 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1266ms | 67.0998μs | 14.9032 KOps/s | 14.7363 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2856ms | 0.2143ms | 4.6654 KOps/s | 4.6911 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5340ms | 1.3900ms | 719.4369 Ops/s | 718.4855 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3258ms | 0.2094ms | 4.7751 KOps/s | 4.7345 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.0406ms | 0.8280ms | 1.2077 KOps/s | 1.2202 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.7051ms | 0.4616ms | 2.1663 KOps/s | 2.2064 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9551ms | 2.6576ms | 376.2789 Ops/s | 360.9779 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 93.1730μs | 38.9464μs | 25.6763 KOps/s | 26.3739 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5800ms | 32.6493μs | 30.6286 KOps/s | 30.8601 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 80.3200μs | 30.3454μs | 32.9539 KOps/s | 32.4716 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 68.4480μs | 23.0346μs | 43.4130 KOps/s | 43.5505 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 71.1520μs | 31.9966μs | 31.2534 KOps/s | 31.2695 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 94.8470μs | 22.6779μs | 44.0958 KOps/s | 43.1772 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1311ms | 53.9486μs | 18.5362 KOps/s | 18.6499 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3625ms | 19.9386μs | 50.1539 KOps/s | 48.1874 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1491ms | 45.9235μs | 21.7754 KOps/s | 21.3557 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 51.3360μs | 18.5130μs | 54.0160 KOps/s | 53.4094 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1014ms | 47.0027μs | 21.2754 KOps/s | 20.7722 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1033ms | 19.2697μs | 51.8950 KOps/s | 53.4230 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1265ms | 55.0194μs | 18.1754 KOps/s | 18.1635 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9055ms | 19.7639μs | 50.5972 KOps/s | 48.5296 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1335ms | 47.6798μs | 20.9732 KOps/s | 20.7587 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 63.7590μs | 18.6748μs | 53.5481 KOps/s | 53.8684 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1166ms | 47.4281μs | 21.0845 KOps/s | 20.9517 KOps/s | |
test_compile_indexing[int-pytree-eager] | 68.8880μs | 18.6564μs | 53.6008 KOps/s | 53.8840 KOps/s | |
test_mod_add[eager] | 92.3720μs | 37.2341μs | 26.8571 KOps/s | 26.6750 KOps/s | |
test_mod_add[compile] | 0.1434ms | 67.4380μs | 14.8284 KOps/s | 15.1898 KOps/s | |
test_mod_add[compile-overhead] | 0.1342ms | 66.4209μs | 15.0555 KOps/s | 15.4614 KOps/s | |
test_mod_wrap[eager] | 0.4569ms | 0.2224ms | 4.4960 KOps/s | 4.2904 KOps/s | |
test_mod_wrap[compile] | 1.8192ms | 0.2306ms | 4.3357 KOps/s | 4.2786 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4071ms | 0.2253ms | 4.4391 KOps/s | 4.3331 KOps/s | |
test_mod_wrap_and_backward[eager] | 17.3828ms | 12.6656ms | 78.9538 Ops/s | 73.3056 Ops/s | |
test_mod_wrap_and_backward[compile] | 16.9148ms | 13.6171ms | 73.4369 Ops/s | 83.3490 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 15.4097ms | 13.1962ms | 75.7791 Ops/s | 85.7164 Ops/s | |
test_seq_add[eager] | 0.2033ms | 0.1205ms | 8.2960 KOps/s | 8.3400 KOps/s | |
test_seq_add[compile] | 0.1483ms | 78.6919μs | 12.7078 KOps/s | 12.6091 KOps/s | |
test_seq_add[compile-overhead] | 0.1457ms | 78.2465μs | 12.7801 KOps/s | 13.1106 KOps/s | |
test_seq_wrap[eager] | 0.6833ms | 0.4514ms | 2.2153 KOps/s | 2.1462 KOps/s | |
test_seq_wrap[compile] | 0.4167ms | 0.2445ms | 4.0907 KOps/s | 3.9955 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3790ms | 0.2442ms | 4.0953 KOps/s | 4.0823 KOps/s | |
test_func_call_runtime[False-eager] | 0.7353ms | 0.5292ms | 1.8897 KOps/s | 1.7483 KOps/s | |
test_func_call_runtime[False-compile] | 0.5795ms | 0.4488ms | 2.2279 KOps/s | 2.2068 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5818ms | 0.4480ms | 2.2320 KOps/s | 2.1996 KOps/s | |
test_func_call_runtime[True-eager] | 1.0771ms | 0.7387ms | 1.3538 KOps/s | 1.2713 KOps/s | |
test_func_call_runtime[True-compile] | 0.5740ms | 0.4699ms | 2.1281 KOps/s | 2.1008 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5839ms | 0.4718ms | 2.1197 KOps/s | 2.0966 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7514ms | 0.5253ms | 1.9038 KOps/s | 1.7565 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5568ms | 0.4475ms | 2.2348 KOps/s | 2.2042 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7534ms | 0.4492ms | 2.2263 KOps/s | 2.2043 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1839ms | 0.8897ms | 1.1239 KOps/s | 1.0600 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.6323ms | 0.7976ms | 1.2538 KOps/s | 1.2170 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9996ms | 0.7928ms | 1.2614 KOps/s | 1.2006 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5539ms | 1.9059ms | 524.6803 Ops/s | 517.6729 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.7975ms | 0.5361ms | 1.8655 KOps/s | 1.8333 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7105ms | 0.5316ms | 1.8810 KOps/s | 1.8476 KOps/s | |
test_distributed | 0.2674ms | 0.1251ms | 7.9911 KOps/s | 7.9289 KOps/s | |
test_tdmodule | 55.1230μs | 28.5838μs | 34.9848 KOps/s | 34.4668 KOps/s | |
test_tdmodule_dispatch | 72.3750μs | 51.3517μs | 19.4736 KOps/s | 19.0105 KOps/s | |
test_tdseq | 51.6460μs | 29.3995μs | 34.0142 KOps/s | 32.2695 KOps/s | |
test_tdseq_dispatch | 91.3100μs | 55.8901μs | 17.8922 KOps/s | 17.8342 KOps/s | |
test_instantiation_functorch | 1.8548ms | 1.5401ms | 649.3048 Ops/s | 635.9829 Ops/s | |
test_exec_functorch | 0.4111ms | 0.1779ms | 5.6199 KOps/s | 5.3596 KOps/s | |
test_exec_functional_call | 0.2678ms | 0.1686ms | 5.9308 KOps/s | 5.5756 KOps/s | |
test_exec_td_decorator | 0.5210ms | 0.2305ms | 4.3387 KOps/s | 4.1192 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2217ms | 0.6752ms | 1.4810 KOps/s | 1.4997 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8892ms | 0.6552ms | 1.5263 KOps/s | 1.4487 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8607ms | 0.5290ms | 1.8902 KOps/s | 1.8649 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7211ms | 0.5291ms | 1.8899 KOps/s | 1.8687 KOps/s | |
test_to_module_speed[True] | 2.1223ms | 1.3287ms | 752.6056 Ops/s | 746.2825 Ops/s | |
test_to_module_speed[False] | 1.8075ms | 1.2988ms | 769.9648 Ops/s | 768.8240 Ops/s | |
test_tc_init | 92.2920μs | 46.4107μs | 21.5467 KOps/s | 21.1815 KOps/s | |
test_tc_init_nested | 0.1566ms | 90.1032μs | 11.0984 KOps/s | 10.6133 KOps/s | |
test_tc_first_layer_tensor | 41.2170μs | 1.5184μs | 658.6024 KOps/s | 640.5723 KOps/s | |
test_tc_first_layer_nontensor | 20.7490μs | 4.7094μs | 212.3393 KOps/s | 203.7800 KOps/s | |
test_tc_second_layer_tensor | 20.1280μs | 2.8332μs | 352.9601 KOps/s | 348.4751 KOps/s | |
test_tc_second_layer_nontensor | 48.4270μs | 5.9732μs | 167.4147 KOps/s | 164.4088 KOps/s | |
test_unbind | 0.2526s | 14.0519ms | 71.1646 Ops/s | 66.5919 Ops/s | |
test_full_like | 9.6803ms | 8.1258ms | 123.0646 Ops/s | 113.6074 Ops/s | |
test_zeros_like | 5.2544ms | 3.2426ms | 308.3975 Ops/s | 294.1530 Ops/s | |
test_ones_like | 4.2547ms | 3.5762ms | 279.6296 Ops/s | 274.3985 Ops/s | |
test_clone | 7.6227ms | 6.0229ms | 166.0322 Ops/s | 158.8713 Ops/s | |
test_squeeze | 63.6480μs | 12.9216μs | 77.3900 KOps/s | 80.8970 KOps/s | |
test_unsqueeze | 0.2866ms | 94.9194μs | 10.5352 KOps/s | 10.5847 KOps/s | |
test_split | 0.3351ms | 0.2002ms | 4.9949 KOps/s | 5.1515 KOps/s | |
test_permute | 0.3393ms | 0.2044ms | 4.8921 KOps/s | 4.9755 KOps/s | |
test_stack | 34.1060ms | 28.3357ms | 35.2911 Ops/s | 35.5628 Ops/s | |
test_cat | 32.4771ms | 27.7521ms | 36.0333 Ops/s | 35.8983 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.2310μs | 11.4406μs | 87.4077 KOps/s | 81.5809 KOps/s | |
test_plain_set_stack_nested | 34.8210μs | 11.4903μs | 87.0296 KOps/s | 80.8901 KOps/s | |
test_plain_set_nested_inplace | 0.1142ms | 12.5417μs | 79.7337 KOps/s | 75.5789 KOps/s | |
test_plain_set_stack_nested_inplace | 0.2054ms | 12.4023μs | 80.6299 KOps/s | 75.2230 KOps/s | |
test_items | 36.9100μs | 2.8660μs | 348.9241 KOps/s | 347.7772 KOps/s | |
test_items_nested | 0.4339ms | 0.3605ms | 2.7740 KOps/s | 2.7515 KOps/s | |
test_items_nested_locked | 0.4505ms | 0.3620ms | 2.7626 KOps/s | 2.7621 KOps/s | |
test_items_nested_leaf | 0.1173ms | 60.5751μs | 16.5084 KOps/s | 16.5507 KOps/s | |
test_items_stack_nested | 0.4403ms | 0.3638ms | 2.7491 KOps/s | 2.7627 KOps/s | |
test_items_stack_nested_leaf | 92.2420μs | 61.0378μs | 16.3833 KOps/s | 16.5533 KOps/s | |
test_items_stack_nested_locked | 0.4335ms | 0.3647ms | 2.7422 KOps/s | 2.7444 KOps/s | |
test_keys | 26.0400μs | 3.4114μs | 293.1373 KOps/s | 289.8681 KOps/s | |
test_keys_nested | 0.1336ms | 87.2895μs | 11.4561 KOps/s | 11.4104 KOps/s | |
test_keys_nested_locked | 0.7868ms | 93.1768μs | 10.7323 KOps/s | 10.7182 KOps/s | |
test_keys_nested_leaf | 0.1079ms | 78.9390μs | 12.6680 KOps/s | 12.6779 KOps/s | |
test_keys_stack_nested | 0.1410ms | 87.2305μs | 11.4639 KOps/s | 11.4604 KOps/s | |
test_keys_stack_nested_leaf | 0.1155ms | 78.8853μs | 12.6766 KOps/s | 12.7364 KOps/s | |
test_keys_stack_nested_locked | 0.2647ms | 93.2072μs | 10.7288 KOps/s | 10.6713 KOps/s | |
test_values | 29.1605μs | 0.8490μs | 1.1778 MOps/s | 1.1717 MOps/s | |
test_values_nested | 0.1033ms | 37.1464μs | 26.9205 KOps/s | 26.9649 KOps/s | |
test_values_nested_locked | 68.3410μs | 38.7113μs | 25.8322 KOps/s | 25.7750 KOps/s | |
test_values_nested_leaf | 0.2285ms | 42.0965μs | 23.7550 KOps/s | 23.7014 KOps/s | |
test_values_stack_nested | 76.4320μs | 36.9091μs | 27.0936 KOps/s | 26.9024 KOps/s | |
test_values_stack_nested_leaf | 91.9610μs | 42.2471μs | 23.6702 KOps/s | 23.6941 KOps/s | |
test_values_stack_nested_locked | 0.1867ms | 38.7134μs | 25.8309 KOps/s | 25.7226 KOps/s | |
test_membership | 2.1240μs | 0.4994μs | 2.0025 MOps/s | 1.9905 MOps/s | |
test_membership_nested | 16.7905μs | 2.0053μs | 498.6846 KOps/s | 491.7575 KOps/s | |
test_membership_nested_leaf | 14.3805μs | 2.0103μs | 497.4304 KOps/s | 485.2954 KOps/s | |
test_membership_stacked_nested | 26.0810μs | 2.0486μs | 488.1354 KOps/s | 471.0617 KOps/s | |
test_membership_stacked_nested_leaf | 31.9500μs | 2.0752μs | 481.8813 KOps/s | 481.4770 KOps/s | |
test_membership_nested_last | 35.6410μs | 3.0291μs | 330.1341 KOps/s | 329.2344 KOps/s | |
test_membership_nested_leaf_last | 31.3210μs | 3.0137μs | 331.8159 KOps/s | 326.0945 KOps/s | |
test_membership_stacked_nested_last | 66.3620μs | 3.0303μs | 329.9988 KOps/s | 329.6665 KOps/s | |
test_membership_stacked_nested_leaf_last | 25.2700μs | 3.0006μs | 333.2702 KOps/s | 329.5924 KOps/s | |
test_nested_getleaf | 42.3010μs | 6.2154μs | 160.8908 KOps/s | 160.5505 KOps/s | |
test_nested_get | 0.1164ms | 5.9626μs | 167.7132 KOps/s | 168.2630 KOps/s | |
test_stacked_getleaf | 31.9510μs | 6.1610μs | 162.3106 KOps/s | 161.5803 KOps/s | |
test_stacked_get | 36.0410μs | 5.8336μs | 171.4208 KOps/s | 173.7489 KOps/s | |
test_nested_getitemleaf | 0.1255ms | 6.4735μs | 154.4766 KOps/s | 155.9692 KOps/s | |
test_nested_getitem | 35.9210μs | 6.1001μs | 163.9313 KOps/s | 164.5481 KOps/s | |
test_stacked_getitemleaf | 33.4200μs | 6.3868μs | 156.5719 KOps/s | 156.6567 KOps/s | |
test_stacked_getitem | 39.9700μs | 5.9833μs | 167.1307 KOps/s | 167.7430 KOps/s | |
test_lock_nested | 0.4747ms | 0.3308ms | 3.0229 KOps/s | 2.9183 KOps/s | |
test_lock_stack_nested | 0.3941ms | 0.3366ms | 2.9706 KOps/s | 2.8648 KOps/s | |
test_unlock_nested | 0.3573ms | 0.2767ms | 3.6140 KOps/s | 3.5022 KOps/s | |
test_unlock_stack_nested | 0.4217ms | 0.2777ms | 3.6013 KOps/s | 3.4836 KOps/s | |
test_flatten_speed | 0.1062ms | 77.4138μs | 12.9176 KOps/s | 12.9673 KOps/s | |
test_unflatten_speed | 0.3678ms | 0.3191ms | 3.1341 KOps/s | 3.1163 KOps/s | |
test_common_ops | 0.7724ms | 0.5824ms | 1.7169 KOps/s | 1.6307 KOps/s | |
test_creation | 0.1266ms | 1.7263μs | 579.2671 KOps/s | 577.6470 KOps/s | |
test_creation_empty | 27.9500μs | 6.4062μs | 156.0980 KOps/s | 125.1888 KOps/s | |
test_creation_nested_1 | 62.9810μs | 8.0094μs | 124.8540 KOps/s | 103.6756 KOps/s | |
test_creation_nested_2 | 77.9320μs | 10.7271μs | 93.2215 KOps/s | 81.2825 KOps/s | |
test_clone | 0.1939ms | 10.3964μs | 96.1873 KOps/s | 91.5009 KOps/s | |
test_getitem[int] | 1.5392ms | 10.4054μs | 96.1039 KOps/s | 91.8392 KOps/s | |
test_getitem[slice_int] | 0.1085ms | 20.3750μs | 49.0796 KOps/s | 46.2434 KOps/s | |
test_getitem[range] | 0.1735ms | 37.5607μs | 26.6236 KOps/s | 25.8219 KOps/s | |
test_getitem[tuple] | 0.1559ms | 17.6096μs | 56.7873 KOps/s | 54.3665 KOps/s | |
test_getitem[list] | 0.1907ms | 32.6837μs | 30.5962 KOps/s | 29.8704 KOps/s | |
test_setitem_dim[int] | 39.9500μs | 18.6644μs | 53.5780 KOps/s | 50.9895 KOps/s | |
test_setitem_dim[slice_int] | 60.6110μs | 37.3433μs | 26.7785 KOps/s | 25.8724 KOps/s | |
test_setitem_dim[range] | 89.1220μs | 52.6939μs | 18.9775 KOps/s | 18.7532 KOps/s | |
test_setitem_dim[tuple] | 71.9420μs | 31.4483μs | 31.7982 KOps/s | 30.9763 KOps/s | |
test_setitem | 71.2310μs | 14.0465μs | 71.1920 KOps/s | 65.5619 KOps/s | |
test_set | 71.9720μs | 13.3586μs | 74.8582 KOps/s | 68.0937 KOps/s | |
test_set_shared | 0.5102ms | 0.1570ms | 6.3700 KOps/s | 6.3121 KOps/s | |
test_update | 0.2414ms | 16.7603μs | 59.6649 KOps/s | 52.1121 KOps/s | |
test_update_nested | 0.1231ms | 25.5204μs | 39.1844 KOps/s | 35.7283 KOps/s | |
test_update__nested | 0.6402ms | 25.1350μs | 39.7851 KOps/s | 40.3965 KOps/s | |
test_set_nested | 0.1261ms | 14.6855μs | 68.0944 KOps/s | 62.3071 KOps/s | |
test_set_nested_new | 73.6920μs | 16.9853μs | 58.8745 KOps/s | 54.5333 KOps/s | |
test_select | 0.2039ms | 27.4517μs | 36.4276 KOps/s | 33.4798 KOps/s | |
test_select_nested | 0.2386ms | 43.3372μs | 23.0748 KOps/s | 23.0424 KOps/s | |
test_exclude_nested | 0.2587ms | 63.0478μs | 15.8610 KOps/s | 15.8224 KOps/s | |
test_empty[True] | 0.3352ms | 0.2932ms | 3.4108 KOps/s | 3.3717 KOps/s | |
test_empty[False] | 8.9502μs | 0.8170μs | 1.2240 MOps/s | 1.2080 MOps/s | |
test_to | 91.2220μs | 56.9881μs | 17.5475 KOps/s | 17.5806 KOps/s | |
test_to_nonblocking | 0.1898ms | 46.9671μs | 21.2915 KOps/s | 21.4436 KOps/s | |
test_unbind_speed | 0.3784ms | 0.2384ms | 4.1942 KOps/s | 4.1107 KOps/s | |
test_unbind_speed_stack0 | 0.2792ms | 0.2319ms | 4.3124 KOps/s | 4.0833 KOps/s | |
test_unbind_speed_stack1 | 0.1070s | 0.7545ms | 1.3253 KOps/s | 1.3199 KOps/s | |
test_split | 0.1107s | 1.6020ms | 624.2306 Ops/s | 611.0602 Ops/s | |
test_chunk | 0.1169s | 1.6036ms | 623.6107 Ops/s | 610.8381 Ops/s | |
test_consolidate[False-None] | 0.1134s | 3.0418ms | 328.7571 Ops/s | 365.0420 Ops/s | |
test_consolidate[default-None] | 1.8684ms | 1.7219ms | 580.7535 Ops/s | 570.5995 Ops/s | |
test_consolidate[reduce-overhead-None] | 2.8340ms | 1.7500ms | 571.4328 Ops/s | 562.7783 Ops/s | |
test_consolidate_njt[False-None] | 6.8068ms | 6.4009ms | 156.2279 Ops/s | 109.5951 Ops/s | |
test_to[False-False-None] | 1.8865ms | 1.7030ms | 587.2136 Ops/s | 578.0506 Ops/s | |
test_to[True-False-None] | 1.6015ms | 1.3164ms | 759.6647 Ops/s | 739.2369 Ops/s | |
test_to[within-False-None] | 4.2656ms | 4.1141ms | 243.0689 Ops/s | 237.4226 Ops/s | |
test_to[True-default-None] | 5.3621ms | 5.0972ms | 196.1866 Ops/s | 193.7092 Ops/s | |
test_to_njt[False-False-None] | 7.1235ms | 6.9165ms | 144.5819 Ops/s | 142.8386 Ops/s | |
test_to_njt[True-False-None] | 5.7122ms | 5.4549ms | 183.3228 Ops/s | 180.9926 Ops/s | |
test_to_njt[within-False-None] | 12.4901ms | 11.9364ms | 83.7771 Ops/s | 82.4518 Ops/s | |
test_creation[device0] | 0.4556ms | 80.1877μs | 12.4707 KOps/s | 12.4291 KOps/s | |
test_creation_from_tensor | 0.4992ms | 85.7858μs | 11.6569 KOps/s | 11.8301 KOps/s | |
test_add_one[memmap_tensor0] | 0.4474ms | 6.6643μs | 150.0535 KOps/s | 142.4109 KOps/s | |
test_contiguous[memmap_tensor0] | 2.4610μs | 0.4442μs | 2.2511 MOps/s | 2.3342 MOps/s | |
test_stack[memmap_tensor0] | 0.1516ms | 4.3729μs | 228.6821 KOps/s | 214.1648 KOps/s | |
test_memmaptd_index | 1.5841ms | 0.2364ms | 4.2308 KOps/s | 4.0145 KOps/s | |
test_memmaptd_index_astensor | 0.4612ms | 0.2981ms | 3.3548 KOps/s | 3.2172 KOps/s | |
test_memmaptd_index_op | 0.6842ms | 0.5366ms | 1.8635 KOps/s | 1.7124 KOps/s | |
test_serialize_model | 0.1344s | 0.1330s | 7.5196 Ops/s | 7.5375 Ops/s | |
test_serialize_model_pickle | 1.3498s | 1.2165s | 0.8221 Ops/s | 0.8400 Ops/s | |
test_serialize_weights | 0.1326s | 0.1316s | 7.5981 Ops/s | 7.5379 Ops/s | |
test_serialize_weights_returnearly | 0.3452s | 55.1876ms | 18.1200 Ops/s | 13.8965 Ops/s | |
test_serialize_weights_pickle | 1.3739s | 1.2184s | 0.8208 Ops/s | 0.8223 Ops/s | |
test_reshape_pytree | 0.1022ms | 22.6110μs | 44.2263 KOps/s | 43.4829 KOps/s | |
test_reshape_td | 0.1138ms | 26.3021μs | 38.0197 KOps/s | 37.5655 KOps/s | |
test_view_pytree | 0.1176ms | 22.1067μs | 45.2351 KOps/s | 44.8090 KOps/s | |
test_view_td | 84.1920μs | 31.4744μs | 31.7719 KOps/s | 32.5001 KOps/s | |
test_unbind_pytree | 0.1677ms | 27.6734μs | 36.1358 KOps/s | 34.7086 KOps/s | |
test_unbind_td | 0.1830s | 49.3244μs | 20.2739 KOps/s | 26.5537 KOps/s | |
test_split_pytree | 0.1227ms | 29.5328μs | 33.8607 KOps/s | 33.2205 KOps/s | |
test_split_td | 0.1691ms | 37.4952μs | 26.6701 KOps/s | 25.1986 KOps/s | |
test_add_pytree | 0.1463ms | 34.7028μs | 28.8161 KOps/s | 28.0096 KOps/s | |
test_add_td | 0.1887ms | 44.2048μs | 22.6220 KOps/s | 20.3591 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2971ms | 0.1193ms | 8.3806 KOps/s | 7.7390 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3184ms | 0.1314ms | 7.6078 KOps/s | 7.5332 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2869ms | 96.4798μs | 10.3649 KOps/s | 10.2908 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.0965ms | 0.1479ms | 6.7627 KOps/s | 6.5653 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1591ms | 23.5990μs | 42.3748 KOps/s | 39.7015 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.2135ms | 29.2823μs | 34.1503 KOps/s | 33.8458 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2541ms | 63.2925μs | 15.7997 KOps/s | 15.1468 KOps/s | |
test_compile_copy_nested[pytree-eager] | 91.6310μs | 50.4169μs | 19.8346 KOps/s | 19.7016 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2902ms | 0.1416ms | 7.0624 KOps/s | 7.0869 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3741ms | 0.2170ms | 4.6089 KOps/s | 4.6173 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2523ms | 97.7013μs | 10.2353 KOps/s | 10.2404 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2161ms | 55.3723μs | 18.0596 KOps/s | 17.8864 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2797ms | 0.1358ms | 7.3626 KOps/s | 7.3878 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6518ms | 0.4737ms | 2.1110 KOps/s | 2.0570 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4060ms | 0.2613ms | 3.8270 KOps/s | 3.8424 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2882ms | 0.1431ms | 6.9882 KOps/s | 7.0422 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2145ms | 67.7597μs | 14.7580 KOps/s | 14.6023 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2474ms | 98.8313μs | 10.1183 KOps/s | 10.1166 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5688ms | 0.4023ms | 2.4855 KOps/s | 2.4736 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2791ms | 0.1356ms | 7.3731 KOps/s | 7.4351 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1564ms | 18.3789μs | 54.4102 KOps/s | 55.2159 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1333ms | 31.2873μs | 31.9618 KOps/s | 32.0326 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2105ms | 69.6066μs | 14.3665 KOps/s | 14.4083 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.2193ms | 52.3956μs | 19.0856 KOps/s | 19.2167 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6689ms | 0.4510ms | 2.2173 KOps/s | 2.1469 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.9773ms | 2.6024ms | 384.2574 Ops/s | 375.7479 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6066ms | 0.4344ms | 2.3020 KOps/s | 2.1513 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8019ms | 2.5695ms | 389.1845 Ops/s | 376.1965 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.6989ms | 0.1156ms | 8.6528 KOps/s | 8.5993 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5836ms | 78.9920μs | 12.6595 KOps/s | 12.2800 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.4404ms | 0.1082ms | 9.2432 KOps/s | 9.3474 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2834ms | 67.8405μs | 14.7405 KOps/s | 13.8884 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2766ms | 0.1088ms | 9.1879 KOps/s | 8.8016 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2500ms | 68.6665μs | 14.5632 KOps/s | 13.9613 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2516ms | 0.1004ms | 9.9641 KOps/s | 9.7330 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1627ms | 16.7924μs | 59.5509 KOps/s | 56.3356 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2425ms | 95.4046μs | 10.4817 KOps/s | 10.1415 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1454ms | 15.5368μs | 64.3632 KOps/s | 61.1226 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2469ms | 96.2756μs | 10.3869 KOps/s | 10.1040 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1982ms | 15.5527μs | 64.2976 KOps/s | 61.5900 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2690ms | 0.1038ms | 9.6377 KOps/s | 9.7135 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.7036ms | 16.8507μs | 59.3446 KOps/s | 56.8098 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2486ms | 98.3763μs | 10.1650 KOps/s | 10.2787 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1495ms | 15.5141μs | 64.4574 KOps/s | 62.3353 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2869ms | 97.7696μs | 10.2281 KOps/s | 10.2777 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1422ms | 15.9988μs | 62.5047 KOps/s | 62.4621 KOps/s | |
test_mod_add[eager] | 0.2246ms | 37.2329μs | 26.8580 KOps/s | 25.4531 KOps/s | |
test_mod_add[compile] | 0.3412ms | 80.9524μs | 12.3529 KOps/s | 12.3318 KOps/s | |
test_mod_add[compile-overhead] | 0.3619ms | 0.1830ms | 5.4632 KOps/s | 5.2541 KOps/s | |
test_mod_wrap[eager] | 0.4244ms | 0.2529ms | 3.9543 KOps/s | 3.7882 KOps/s | |
test_mod_wrap[compile] | 0.4591ms | 0.2843ms | 3.5179 KOps/s | 3.4565 KOps/s | |
test_mod_wrap[compile-overhead] | 7.6453ms | 3.8057ms | 262.7642 Ops/s | 262.1880 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6279ms | 1.3916ms | 718.6190 Ops/s | 677.7352 Ops/s | |
test_mod_wrap_and_backward[compile] | 2.0797ms | 1.3769ms | 726.2538 Ops/s | 713.0560 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.6051ms | 1.0667ms | 937.5009 Ops/s | 836.3832 Ops/s | |
test_seq_add[eager] | 0.2672ms | 0.1130ms | 8.8489 KOps/s | 8.4933 KOps/s | |
test_seq_add[compile] | 0.2474ms | 89.9250μs | 11.1204 KOps/s | 11.1823 KOps/s | |
test_seq_add[compile-overhead] | 0.3169ms | 0.1292ms | 7.7410 KOps/s | 7.5706 KOps/s | |
test_seq_wrap[eager] | 1.8089ms | 0.4269ms | 2.3422 KOps/s | 2.2579 KOps/s | |
test_seq_wrap[compile] | 0.4838ms | 0.3120ms | 3.2048 KOps/s | 3.2427 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3648ms | 0.2280ms | 4.3859 KOps/s | 4.3154 KOps/s | |
test_func_call_runtime[False-eager] | 0.9704ms | 0.7839ms | 1.2756 KOps/s | 1.2604 KOps/s | |
test_func_call_runtime[False-compile] | 1.2072ms | 0.7488ms | 1.3355 KOps/s | 1.2827 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5336ms | 0.3632ms | 2.7531 KOps/s | 2.7003 KOps/s | |
test_func_call_runtime[True-eager] | 1.0857ms | 0.9150ms | 1.0929 KOps/s | 1.0885 KOps/s | |
test_func_call_runtime[True-compile] | 0.9269ms | 0.7745ms | 1.2912 KOps/s | 1.2875 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5196ms | 0.3881ms | 2.5770 KOps/s | 2.5628 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.0107ms | 0.8003ms | 1.2496 KOps/s | 1.3227 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9501ms | 0.7533ms | 1.3275 KOps/s | 1.3156 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5677ms | 0.3663ms | 2.7302 KOps/s | 2.6835 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1953ms | 1.0209ms | 979.5592 Ops/s | 961.4674 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.2524ms | 1.0357ms | 965.5459 Ops/s | 977.6814 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2114ms | 1.0288ms | 971.9859 Ops/s | 958.1894 Ops/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5069ms | 2.1049ms | 475.0757 Ops/s | 470.5979 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9697ms | 0.8113ms | 1.2326 KOps/s | 1.2083 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5809ms | 0.4222ms | 2.3683 KOps/s | 2.3553 KOps/s | |
test_distributed | 2.5170ms | 0.1824ms | 5.4825 KOps/s | 8.4820 KOps/s | |
test_tdmodule | 0.1697ms | 20.1735μs | 49.5700 KOps/s | 47.1064 KOps/s | |
test_tdmodule_dispatch | 74.1310μs | 32.9283μs | 30.3690 KOps/s | 28.1470 KOps/s | |
test_tdseq | 42.1910μs | 19.2230μs | 52.0209 KOps/s | 46.9865 KOps/s | |
test_tdseq_dispatch | 68.9210μs | 35.2773μs | 28.3468 KOps/s | 25.7104 KOps/s | |
test_instantiation_functorch | 1.7045ms | 1.5396ms | 649.5159 Ops/s | 643.1221 Ops/s | |
test_exec_functorch | 0.3021ms | 0.1440ms | 6.9430 KOps/s | 6.8332 KOps/s | |
test_exec_functional_call | 0.2438ms | 0.1376ms | 7.2699 KOps/s | 7.1574 KOps/s | |
test_exec_td_decorator | 0.3752ms | 0.1867ms | 5.3576 KOps/s | 5.2678 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8779ms | 0.6854ms | 1.4590 KOps/s | 1.4505 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8852ms | 0.6859ms | 1.4579 KOps/s | 1.4487 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7573ms | 0.6002ms | 1.6660 KOps/s | 1.6685 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7807ms | 0.6002ms | 1.6662 KOps/s | 1.6696 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.1787ms | 19.3692ms | 51.6284 Ops/s | 51.5075 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.6449ms | 19.3316ms | 51.7287 Ops/s | 51.6116 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.5940ms | 19.2216ms | 52.0248 Ops/s | 51.9914 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.4219ms | 19.2002ms | 52.0827 Ops/s | 52.0644 Ops/s | |
test_to_module_speed[True] | 1.5253ms | 0.9608ms | 1.0408 KOps/s | 1.0470 KOps/s | |
test_to_module_speed[False] | 1.0022ms | 0.9357ms | 1.0687 KOps/s | 1.0663 KOps/s | |
test_tc_init | 54.6210μs | 33.1583μs | 30.1584 KOps/s | 27.1418 KOps/s | |
test_tc_init_nested | 0.1585ms | 68.2771μs | 14.6462 KOps/s | 13.8952 KOps/s | |
test_tc_first_layer_tensor | 21.6200μs | 0.8080μs | 1.2376 MOps/s | 1.2461 MOps/s | |
test_tc_first_layer_nontensor | 21.8310μs | 2.2219μs | 450.0637 KOps/s | 451.0852 KOps/s | |
test_tc_second_layer_tensor | 22.8810μs | 1.5274μs | 654.6923 KOps/s | 704.2706 KOps/s | |
test_tc_second_layer_nontensor | 27.6610μs | 2.9679μs | 336.9350 KOps/s | 338.3516 KOps/s | |
test_unbind | 7.3247ms | 7.0033ms | 142.7905 Ops/s | 143.6961 Ops/s | |
test_full_like | 14.4592ms | 11.1026ms | 90.0686 Ops/s | 88.9879 Ops/s | |
test_zeros_like | 6.5990ms | 4.6984ms | 212.8383 Ops/s | 110.5146 Ops/s | |
test_ones_like | 5.7099ms | 4.6866ms | 213.3730 Ops/s | 208.4638 Ops/s | |
test_clone | 8.9898ms | 7.6825ms | 130.1666 Ops/s | 128.5378 Ops/s | |
test_squeeze | 0.1193ms | 9.5409μs | 104.8114 KOps/s | 100.6283 KOps/s | |
test_unsqueeze | 0.1661ms | 71.6593μs | 13.9549 KOps/s | 13.7146 KOps/s | |
test_split | 0.2491s | 0.2364ms | 4.2305 KOps/s | 6.1933 KOps/s | |
test_permute | 0.3655ms | 0.1789ms | 5.5894 KOps/s | 5.4790 KOps/s | |
test_stack | 55.0638ms | 53.6348ms | 18.6446 Ops/s | 18.8363 Ops/s | |
test_cat | 54.6280ms | 53.6243ms | 18.6483 Ops/s | 18.8704 Ops/s |
vmoens
added a commit
that referenced
this pull request
Mar 5, 2025
ghstack-source-id: 14079f32d9270dcdabf4aa3f0d614e8c83a46876 Pull Request resolved: #1250
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):