[Doc] Update readme for v0.3 #635

Merged: vmoens merged 7 commits into main from update-readme on Jan 31, 2024

Conversation

@vmoens (Contributor) commented Jan 24, 2024

No description provided.

@facebook-github-bot added the "CLA Signed" label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Jan 24, 2024

github-actions bot commented Jan 24, 2024

⚠ Warning: Result of CPU Benchmark Tests

Total Benchmarks: 124. Improved: 5. Worsened: 18.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 44.0620μs 16.0914μs 62.1451 KOps/s 61.8507 KOps/s $\color{#35bf28}+0.48\%$
test_plain_set_stack_nested 0.2777ms 0.1441ms 6.9385 KOps/s 6.8534 KOps/s $\color{#35bf28}+1.24\%$
test_plain_set_nested_inplace 51.7560μs 18.7827μs 53.2404 KOps/s 54.4500 KOps/s $\color{#d91a1a}-2.22\%$
test_plain_set_stack_nested_inplace 0.3498ms 0.1779ms 5.6227 KOps/s 5.6463 KOps/s $\color{#d91a1a}-0.42\%$
test_items 14.3260μs 2.4554μs 407.2655 KOps/s 404.0374 KOps/s $\color{#35bf28}+0.80\%$
test_items_nested 1.2936ms 0.2700ms 3.7039 KOps/s 3.7037 KOps/s $+0.01\%$
test_items_nested_locked 0.4024ms 0.2715ms 3.6834 KOps/s 3.7115 KOps/s $\color{#d91a1a}-0.76\%$
test_items_nested_leaf 0.7143ms 0.1688ms 5.9235 KOps/s 6.0302 KOps/s $\color{#d91a1a}-1.77\%$
test_items_stack_nested 1.5636ms 1.3178ms 758.8301 Ops/s 759.0550 Ops/s $\color{#d91a1a}-0.03\%$
test_items_stack_nested_leaf 1.8313ms 1.1902ms 840.1887 Ops/s 848.6024 Ops/s $\color{#d91a1a}-0.99\%$
test_items_stack_nested_locked 1.0943ms 0.8731ms 1.1453 KOps/s 1.1473 KOps/s $\color{#d91a1a}-0.17\%$
test_keys 19.2460μs 3.8679μs 258.5358 KOps/s 258.4692 KOps/s $\color{#35bf28}+0.03\%$
test_keys_nested 1.4823ms 0.1477ms 6.7693 KOps/s 6.6380 KOps/s $\color{#35bf28}+1.98\%$
test_keys_nested_locked 0.2601ms 0.1519ms 6.5811 KOps/s 6.5425 KOps/s $\color{#35bf28}+0.59\%$
test_keys_nested_leaf 0.2451ms 0.1307ms 7.6536 KOps/s 7.7170 KOps/s $\color{#d91a1a}-0.82\%$
test_keys_stack_nested 1.7316ms 1.2720ms 786.1626 Ops/s 794.5093 Ops/s $\color{#d91a1a}-1.05\%$
test_keys_stack_nested_leaf 1.4168ms 1.2645ms 790.8430 Ops/s 795.0786 Ops/s $\color{#d91a1a}-0.53\%$
test_keys_stack_nested_locked 1.3843ms 0.8272ms 1.2089 KOps/s 1.2517 KOps/s $\color{#d91a1a}-3.42\%$
test_values 12.2777μs 1.1596μs 862.3610 KOps/s 873.2414 KOps/s $\color{#d91a1a}-1.25\%$
test_values_nested 0.1273ms 51.8651μs 19.2808 KOps/s 19.0550 KOps/s $\color{#35bf28}+1.18\%$
test_values_nested_locked 0.1132ms 51.4983μs 19.4181 KOps/s 18.6942 KOps/s $\color{#35bf28}+3.87\%$
test_values_nested_leaf 0.1311ms 46.1388μs 21.6737 KOps/s 21.5807 KOps/s $\color{#35bf28}+0.43\%$
test_values_stack_nested 6.0561ms 1.0439ms 957.9124 Ops/s 984.7614 Ops/s $\color{#d91a1a}-2.73\%$
test_values_stack_nested_leaf 1.2304ms 1.0123ms 987.8414 Ops/s 981.3342 Ops/s $\color{#35bf28}+0.66\%$
test_values_stack_nested_locked 1.0464ms 0.5993ms 1.6685 KOps/s 1.6715 KOps/s $\color{#d91a1a}-0.18\%$
test_membership 35.1260μs 1.3459μs 742.9978 KOps/s 753.0834 KOps/s $\color{#d91a1a}-1.34\%$
test_membership_nested 30.6070μs 3.5522μs 281.5180 KOps/s 293.0716 KOps/s $\color{#d91a1a}-3.94\%$
test_membership_nested_leaf 50.4940μs 3.5053μs 285.2799 KOps/s 293.5575 KOps/s $\color{#d91a1a}-2.82\%$
test_membership_stacked_nested 44.1820μs 11.6169μs 86.0817 KOps/s 84.8055 KOps/s $\color{#35bf28}+1.50\%$
test_membership_stacked_nested_leaf 58.2580μs 11.5804μs 86.3526 KOps/s 84.0917 KOps/s $\color{#35bf28}+2.69\%$
test_membership_nested_last 42.8600μs 6.5868μs 151.8199 KOps/s 148.8809 KOps/s $\color{#35bf28}+1.97\%$
test_membership_nested_leaf_last 31.8100μs 6.6441μs 150.5091 KOps/s 151.2113 KOps/s $\color{#d91a1a}-0.46\%$
test_membership_stacked_nested_last 0.3659ms 0.1768ms 5.6547 KOps/s 5.6106 KOps/s $\color{#35bf28}+0.79\%$
test_membership_stacked_nested_leaf_last 73.6380μs 13.7734μs 72.6036 KOps/s 72.3438 KOps/s $\color{#35bf28}+0.36\%$
test_nested_getleaf 32.0000μs 10.6540μs 93.8614 KOps/s 92.2736 KOps/s $\color{#35bf28}+1.72\%$
test_nested_get 34.0930μs 10.0101μs 99.8995 KOps/s 97.8015 KOps/s $\color{#35bf28}+2.15\%$
test_stacked_getleaf 0.6642ms 0.4019ms 2.4879 KOps/s 2.5248 KOps/s $\color{#d91a1a}-1.46\%$
test_stacked_get 0.6892ms 0.3658ms 2.7334 KOps/s 2.7683 KOps/s $\color{#d91a1a}-1.26\%$
test_nested_getitemleaf 60.3420μs 12.0594μs 82.9231 KOps/s 81.8461 KOps/s $\color{#35bf28}+1.32\%$
test_nested_getitem 47.7480μs 11.5382μs 86.6689 KOps/s 86.1000 KOps/s $\color{#35bf28}+0.66\%$
test_stacked_getitemleaf 0.5921ms 0.4000ms 2.4998 KOps/s 2.4745 KOps/s $\color{#35bf28}+1.02\%$
test_stacked_getitem 0.7873ms 0.3678ms 2.7189 KOps/s 2.7332 KOps/s $\color{#d91a1a}-0.52\%$
test_lock_nested 2.8812ms 0.3420ms 2.9242 KOps/s 2.9445 KOps/s $\color{#d91a1a}-0.69\%$
test_lock_stack_nested 89.9747ms 6.3665ms 157.0720 Ops/s 155.9929 Ops/s $\color{#35bf28}+0.69\%$
test_unlock_nested 80.4876ms 0.4218ms 2.3709 KOps/s 2.9382 KOps/s $\textbf{\color{#d91a1a}-19.31\%}$
test_unlock_stack_nested 94.0990ms 6.5630ms 152.3704 Ops/s 157.4136 Ops/s $\color{#d91a1a}-3.20\%$
test_flatten_speed 1.2570ms 0.3656ms 2.7354 KOps/s 2.7221 KOps/s $\color{#35bf28}+0.49\%$
test_unflatten_speed 0.7786ms 0.4568ms 2.1891 KOps/s 2.1610 KOps/s $\color{#35bf28}+1.30\%$
test_common_ops 3.8778ms 0.6676ms 1.4978 KOps/s 1.5651 KOps/s $\color{#d91a1a}-4.30\%$
test_creation 24.6950μs 1.8512μs 540.1841 KOps/s 547.1910 KOps/s $\color{#d91a1a}-1.28\%$
test_creation_empty 30.3870μs 8.4522μs 118.3126 KOps/s 129.7338 KOps/s $\textbf{\color{#d91a1a}-8.80\%}$
test_creation_nested_1 67.5450μs 11.1462μs 89.7168 KOps/s 96.3932 KOps/s $\textbf{\color{#d91a1a}-6.93\%}$
test_creation_nested_2 47.3080μs 14.1958μs 70.4433 KOps/s 73.4363 KOps/s $\color{#d91a1a}-4.08\%$
test_clone 70.2110μs 13.0507μs 76.6244 KOps/s 77.6126 KOps/s $\color{#d91a1a}-1.27\%$
test_getitem[int] 56.6250μs 11.0983μs 90.1041 KOps/s 88.9744 KOps/s $\color{#35bf28}+1.27\%$
test_getitem[slice_int] 58.3480μs 21.9645μs 45.5280 KOps/s 45.9402 KOps/s $\color{#d91a1a}-0.90\%$
test_getitem[range] 0.1133ms 41.7264μs 23.9657 KOps/s 24.5531 KOps/s $\color{#d91a1a}-2.39\%$
test_getitem[tuple] 48.3900μs 18.1983μs 54.9502 KOps/s 55.2585 KOps/s $\color{#d91a1a}-0.56\%$
test_getitem[list] 0.2147ms 38.2825μs 26.1216 KOps/s 27.6632 KOps/s $\textbf{\color{#d91a1a}-5.57\%}$
test_setitem_dim[int] 56.2350μs 28.6522μs 34.9013 KOps/s 36.4145 KOps/s $\color{#d91a1a}-4.16\%$
test_setitem_dim[slice_int] 0.1161ms 53.8047μs 18.5857 KOps/s 18.8318 KOps/s $\color{#d91a1a}-1.31\%$
test_setitem_dim[range] 0.1477ms 74.1737μs 13.4819 KOps/s 13.9170 KOps/s $\color{#d91a1a}-3.13\%$
test_setitem_dim[tuple] 99.5030μs 42.8666μs 23.3282 KOps/s 23.7067 KOps/s $\color{#d91a1a}-1.60\%$
test_setitem 0.1844ms 19.6067μs 51.0030 KOps/s 56.6004 KOps/s $\textbf{\color{#d91a1a}-9.89\%}$
test_set 98.9420μs 18.3775μs 54.4144 KOps/s 59.0471 KOps/s $\textbf{\color{#d91a1a}-7.85\%}$
test_set_shared 1.8698ms 0.1387ms 7.2089 KOps/s 6.4241 KOps/s $\textbf{\color{#35bf28}+12.22\%}$
test_update 0.2223ms 20.5116μs 48.7529 KOps/s 54.0120 KOps/s $\textbf{\color{#d91a1a}-9.74\%}$
test_update_nested 0.1193ms 28.8006μs 34.7214 KOps/s 38.6302 KOps/s $\textbf{\color{#d91a1a}-10.12\%}$
test_set_nested 79.3470μs 21.0006μs 47.6177 KOps/s 53.2538 KOps/s $\textbf{\color{#d91a1a}-10.58\%}$
test_set_nested_new 0.1104ms 24.9306μs 40.1114 KOps/s 44.5897 KOps/s $\textbf{\color{#d91a1a}-10.04\%}$
test_select 0.1056ms 37.8069μs 26.4502 KOps/s 27.8924 KOps/s $\textbf{\color{#d91a1a}-5.17\%}$
test_select_nested 0.1268ms 58.0693μs 17.2208 KOps/s 17.2920 KOps/s $\color{#d91a1a}-0.41\%$
test_exclude_nested 0.2178ms 0.1178ms 8.4894 KOps/s 8.6158 KOps/s $\color{#d91a1a}-1.47\%$
test_empty[True] 0.5756ms 0.4067ms 2.4587 KOps/s 2.5070 KOps/s $\color{#d91a1a}-1.93\%$
test_empty[False] 9.3594μs 1.0195μs 980.8850 KOps/s 974.7900 KOps/s $\color{#35bf28}+0.63\%$
test_unbind_speed 0.5465ms 0.2470ms 4.0480 KOps/s 4.1075 KOps/s $\color{#d91a1a}-1.45\%$
test_unbind_speed_stack0 87.1101ms 3.3954ms 294.5153 Ops/s 326.9593 Ops/s $\textbf{\color{#d91a1a}-9.92\%}$
test_unbind_speed_stack1 22.4310μs 1.9771μs 505.8022 KOps/s 517.6808 KOps/s $\color{#d91a1a}-2.29\%$
test_split 2.4998ms 1.4540ms 687.7598 Ops/s 596.1657 Ops/s $\textbf{\color{#35bf28}+15.36\%}$
test_chunk 76.2209ms 1.5492ms 645.5063 Ops/s 635.5897 Ops/s $\color{#35bf28}+1.56\%$
test_creation[device0] 0.2147ms 0.1017ms 9.8364 KOps/s 9.2352 KOps/s $\textbf{\color{#35bf28}+6.51\%}$
test_creation_from_tensor 3.4217ms 83.6868μs 11.9493 KOps/s 12.2285 KOps/s $\color{#d91a1a}-2.28\%$
test_add_one[memmap_tensor0] 0.2759ms 5.1594μs 193.8201 KOps/s 189.4496 KOps/s $\color{#35bf28}+2.31\%$
test_contiguous[memmap_tensor0] 20.7690μs 0.6326μs 1.5807 MOps/s 1.5542 MOps/s $\color{#35bf28}+1.71\%$
test_stack[memmap_tensor0] 44.4020μs 3.5709μs 280.0445 KOps/s 277.3504 KOps/s $\color{#35bf28}+0.97\%$
test_memmaptd_index 0.9854ms 0.2368ms 4.2221 KOps/s 4.3078 KOps/s $\color{#d91a1a}-1.99\%$
test_memmaptd_index_astensor 0.7529ms 0.2956ms 3.3831 KOps/s 3.4007 KOps/s $\color{#d91a1a}-0.52\%$
test_memmaptd_index_op 1.3108ms 0.5528ms 1.8088 KOps/s 1.8098 KOps/s $\color{#d91a1a}-0.06\%$
test_serialize_model 0.1747s 0.1064s 9.3948 Ops/s 8.8661 Ops/s $\textbf{\color{#35bf28}+5.96\%}$
test_serialize_model_pickle 0.4531s 0.3777s 2.6476 Ops/s 2.6085 Ops/s $\color{#35bf28}+1.50\%$
test_serialize_weights 0.1786s 0.1083s 9.2353 Ops/s 8.8315 Ops/s $\color{#35bf28}+4.57\%$
test_serialize_weights_returnearly 0.2049s 0.1347s 7.4218 Ops/s 7.5769 Ops/s $\color{#d91a1a}-2.05\%$
test_serialize_weights_pickle 0.9952s 0.5680s 1.7606 Ops/s 2.4389 Ops/s $\textbf{\color{#d91a1a}-27.81\%}$
test_serialize_weights_filesystem 94.3029ms 90.4905ms 11.0509 Ops/s 9.7870 Ops/s $\textbf{\color{#35bf28}+12.91\%}$
test_serialize_model_filesystem 0.1612s 96.6345ms 10.3483 Ops/s 10.4037 Ops/s $\color{#d91a1a}-0.53\%$
test_reshape_pytree 48.9110μs 20.8201μs 48.0306 KOps/s 47.2901 KOps/s $\color{#35bf28}+1.57\%$
test_reshape_td 66.3730μs 29.9257μs 33.4161 KOps/s 32.8736 KOps/s $\color{#35bf28}+1.65\%$
test_view_pytree 61.9650μs 20.7126μs 48.2799 KOps/s 47.1112 KOps/s $\color{#35bf28}+2.48\%$
test_view_td 82.2953ms 11.6791μs 85.6233 KOps/s 87.4824 KOps/s $\color{#d91a1a}-2.13\%$
test_unbind_pytree 84.0770μs 24.8651μs 40.2171 KOps/s 40.9801 KOps/s $\color{#d91a1a}-1.86\%$
test_unbind_td 0.1375ms 35.5673μs 28.1157 KOps/s 27.6783 KOps/s $\color{#35bf28}+1.58\%$
test_split_pytree 59.3200μs 23.7823μs 42.0482 KOps/s 41.4078 KOps/s $\color{#35bf28}+1.55\%$
test_split_td 0.4985ms 39.0137μs 25.6320 KOps/s 24.9290 KOps/s $\color{#35bf28}+2.82\%$
test_add_pytree 75.9010μs 29.2995μs 34.1302 KOps/s 33.2919 KOps/s $\color{#35bf28}+2.52\%$
test_add_td 0.1083ms 46.6559μs 21.4335 KOps/s 20.7229 KOps/s $\color{#35bf28}+3.43\%$
test_distributed 0.2359ms 97.4453μs 10.2622 KOps/s 9.8395 KOps/s $\color{#35bf28}+4.30\%$
test_tdmodule 0.1023ms 21.9450μs 45.5684 KOps/s 48.0062 KOps/s $\textbf{\color{#d91a1a}-5.08\%}$
test_tdmodule_dispatch 0.2119ms 42.0833μs 23.7624 KOps/s 25.0547 KOps/s $\textbf{\color{#d91a1a}-5.16\%}$
test_tdseq 0.1167ms 24.5833μs 40.6781 KOps/s 42.9147 KOps/s $\textbf{\color{#d91a1a}-5.21\%}$
test_tdseq_dispatch 0.1582ms 46.2949μs 21.6007 KOps/s 22.8926 KOps/s $\textbf{\color{#d91a1a}-5.64\%}$
test_instantiation_functorch 1.7130ms 1.3148ms 760.5445 Ops/s 757.3362 Ops/s $\color{#35bf28}+0.42\%$
test_instantiation_td 1.6828ms 1.0190ms 981.3519 Ops/s 987.0151 Ops/s $\color{#d91a1a}-0.57\%$
test_exec_functorch 0.3376ms 0.1563ms 6.3978 KOps/s 6.2754 KOps/s $\color{#35bf28}+1.95\%$
test_exec_functional_call 0.2809ms 0.1442ms 6.9338 KOps/s 6.7301 KOps/s $\color{#35bf28}+3.03\%$
test_exec_td 0.4226ms 0.1521ms 6.5741 KOps/s 6.9869 KOps/s $\textbf{\color{#d91a1a}-5.91\%}$
test_exec_td_decorator 0.9048ms 0.1760ms 5.6814 KOps/s 5.6009 KOps/s $\color{#35bf28}+1.44\%$
test_vmap_mlp_speed[True-True] 1.2122ms 0.8740ms 1.1441 KOps/s 1.1308 KOps/s $\color{#35bf28}+1.17\%$
test_vmap_mlp_speed[True-False] 0.6189ms 0.4636ms 2.1573 KOps/s 2.1638 KOps/s $\color{#d91a1a}-0.30\%$
test_vmap_mlp_speed[False-True] 1.4387ms 0.8071ms 1.2390 KOps/s 1.2813 KOps/s $\color{#d91a1a}-3.30\%$
test_vmap_mlp_speed[False-False] 0.6723ms 0.3886ms 2.5736 KOps/s 2.5829 KOps/s $\color{#d91a1a}-0.36\%$
test_vmap_mlp_speed_decorator[True-True] 3.6914ms 2.2878ms 437.0925 Ops/s 446.1726 Ops/s $\color{#d91a1a}-2.04\%$
test_vmap_mlp_speed_decorator[True-False] 1.0840ms 0.5192ms 1.9259 KOps/s 1.9615 KOps/s $\color{#d91a1a}-1.81\%$
test_vmap_mlp_speed_decorator[False-True] 2.4881ms 1.8475ms 541.2587 Ops/s 549.2117 Ops/s $\color{#d91a1a}-1.45\%$
test_vmap_mlp_speed_decorator[False-False] 0.8722ms 0.4005ms 2.4969 KOps/s 2.5275 KOps/s $\color{#d91a1a}-1.21\%$
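
The Change column in the table above appears to be the relative difference between "Ops" and "Ops on Repo HEAD", and the Improved/Worsened totals match the number of bolded rows, which all exceed 5% in magnitude. The sketch below reproduces that reading for three rows taken from the table. The function names are hypothetical, and the 5% cutoff is inferred from which rows are bolded in this report, not taken from the benchmark workflow's configuration.

```python
def relative_change(ops: float, ops_head: float) -> float:
    """Percent change in throughput of the PR relative to repo HEAD.

    Both arguments must be in the same unit (e.g. both KOps/s).
    """
    return (ops / ops_head - 1.0) * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption inferred from the report."""
    if change_pct >= threshold:
        return "improved"
    if change_pct <= -threshold:
        return "worsened"
    return "unchanged"


# (Ops, Ops on Repo HEAD) copied from three rows of the CPU table.
rows = {
    "test_plain_set_nested": (62.1451, 61.8507),
    "test_unlock_nested": (2.3709, 2.9382),
    "test_split": (687.7598, 596.1657),
}

for name, (ops, ops_head) in rows.items():
    change = relative_change(ops, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Under this reading, most of the red entries in the table are small fluctuations that fall below the cutoff and are not counted as worsened in the summary line.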

github-actions bot commented Jan 24, 2024

⚠ Warning: Result of GPU Benchmark Tests

Total Benchmarks: 132. Improved: 9. Worsened: 11.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 62.1512ms 19.0278μs 52.5546 KOps/s 70.9133 KOps/s $\textbf{\color{#d91a1a}-25.89\%}$
test_plain_set_stack_nested 0.1526ms 0.1209ms 8.2696 KOps/s 8.3658 KOps/s $\color{#d91a1a}-1.15\%$
test_plain_set_nested_inplace 45.3100μs 15.3117μs 65.3094 KOps/s 64.6070 KOps/s $\color{#35bf28}+1.09\%$
test_plain_set_stack_nested_inplace 0.1867ms 0.1506ms 6.6383 KOps/s 6.7854 KOps/s $\color{#d91a1a}-2.17\%$
test_items 33.7410μs 4.8453μs 206.3870 KOps/s 200.6073 KOps/s $\color{#35bf28}+2.88\%$
test_items_nested 0.3780ms 0.3384ms 2.9555 KOps/s 2.9280 KOps/s $\color{#35bf28}+0.94\%$
test_items_nested_locked 0.3799ms 0.3445ms 2.9030 KOps/s 2.8965 KOps/s $\color{#35bf28}+0.23\%$
test_items_nested_leaf 0.2531ms 0.2012ms 4.9697 KOps/s 4.9403 KOps/s $\color{#35bf28}+0.60\%$
test_items_stack_nested 1.4281ms 1.3313ms 751.1346 Ops/s 761.7092 Ops/s $\color{#d91a1a}-1.39\%$
test_items_stack_nested_leaf 1.2245ms 1.1703ms 854.4908 Ops/s 864.1900 Ops/s $\color{#d91a1a}-1.12\%$
test_items_stack_nested_locked 1.8906ms 0.9018ms 1.1089 KOps/s 1.1104 KOps/s $\color{#d91a1a}-0.13\%$
test_keys 19.6600μs 4.5581μs 219.3887 KOps/s 218.3960 KOps/s $\color{#35bf28}+0.45\%$
test_keys_nested 0.5495ms 93.9794μs 10.6406 KOps/s 10.4841 KOps/s $\color{#35bf28}+1.49\%$
test_keys_nested_locked 0.1243ms 97.2183μs 10.2861 KOps/s 10.1053 KOps/s $\color{#35bf28}+1.79\%$
test_keys_nested_leaf 0.1816ms 77.6574μs 12.8771 KOps/s 12.6914 KOps/s $\color{#35bf28}+1.46\%$
test_keys_stack_nested 1.2389ms 1.1669ms 856.9635 Ops/s 862.3317 Ops/s $\color{#d91a1a}-0.62\%$
test_keys_stack_nested_leaf 1.2109ms 1.1598ms 862.1960 Ops/s 866.4746 Ops/s $\color{#d91a1a}-0.49\%$
test_keys_stack_nested_locked 0.7988ms 0.7349ms 1.3606 KOps/s 1.2989 KOps/s $\color{#35bf28}+4.75\%$
test_values 9.9433μs 1.8906μs 528.9449 KOps/s 523.3251 KOps/s $\color{#35bf28}+1.07\%$
test_values_nested 69.5600μs 45.8021μs 21.8331 KOps/s 21.8785 KOps/s $\color{#d91a1a}-0.21\%$
test_values_nested_locked 71.5710μs 48.0700μs 20.8030 KOps/s 20.8253 KOps/s $\color{#d91a1a}-0.11\%$
test_values_nested_leaf 54.9100μs 39.9417μs 25.0365 KOps/s 25.0181 KOps/s $\color{#35bf28}+0.07\%$
test_values_stack_nested 1.0436ms 0.9738ms 1.0269 KOps/s 1.0482 KOps/s $\color{#d91a1a}-2.03\%$
test_values_stack_nested_leaf 1.0489ms 0.9747ms 1.0260 KOps/s 1.0543 KOps/s $\color{#d91a1a}-2.69\%$
test_values_stack_nested_locked 0.6294ms 0.5789ms 1.7276 KOps/s 1.7715 KOps/s $\color{#d91a1a}-2.48\%$
test_membership 5.1462μs 0.9333μs 1.0714 MOps/s 1.0629 MOps/s $\color{#35bf28}+0.80\%$
test_membership_nested 31.9710μs 2.8976μs 345.1122 KOps/s 341.5622 KOps/s $\color{#35bf28}+1.04\%$
test_membership_nested_leaf 19.5700μs 2.9200μs 342.4637 KOps/s 340.4904 KOps/s $\color{#35bf28}+0.58\%$
test_membership_stacked_nested 44.8010μs 11.3306μs 88.2567 KOps/s 89.5284 KOps/s $\color{#d91a1a}-1.42\%$
test_membership_stacked_nested_leaf 34.4600μs 11.2480μs 88.9048 KOps/s 89.4454 KOps/s $\color{#d91a1a}-0.60\%$
test_membership_nested_last 34.1600μs 5.3429μs 187.1647 KOps/s 186.0623 KOps/s $\color{#35bf28}+0.59\%$
test_membership_nested_leaf_last 33.1000μs 5.3126μs 188.2308 KOps/s 185.3249 KOps/s $\color{#35bf28}+1.57\%$
test_membership_stacked_nested_last 0.1950ms 0.1602ms 6.2427 KOps/s 6.2893 KOps/s $\color{#d91a1a}-0.74\%$
test_membership_stacked_nested_leaf_last 48.9010μs 13.1778μs 75.8854 KOps/s 75.7040 KOps/s $\color{#35bf28}+0.24\%$
test_nested_getleaf 51.1900μs 8.6018μs 116.2544 KOps/s 118.2736 KOps/s $\color{#d91a1a}-1.71\%$
test_nested_get 31.7910μs 8.1112μs 123.2862 KOps/s 125.3681 KOps/s $\color{#d91a1a}-1.66\%$
test_stacked_getleaf 0.3768ms 0.3320ms 3.0117 KOps/s 3.0000 KOps/s $\color{#35bf28}+0.39\%$
test_stacked_get 0.3478ms 0.3004ms 3.3288 KOps/s 3.3243 KOps/s $\color{#35bf28}+0.14\%$
test_nested_getitemleaf 32.1610μs 10.0224μs 99.7766 KOps/s 101.6623 KOps/s $\color{#d91a1a}-1.85\%$
test_nested_getitem 23.5010μs 9.5316μs 104.9139 KOps/s 106.5136 KOps/s $\color{#d91a1a}-1.50\%$
test_stacked_getitemleaf 0.3818ms 0.3373ms 2.9646 KOps/s 2.9987 KOps/s $\color{#d91a1a}-1.14\%$
test_stacked_getitem 0.3402ms 0.3004ms 3.3291 KOps/s 3.2968 KOps/s $\color{#35bf28}+0.98\%$
test_lock_nested 0.8782ms 0.3596ms 2.7808 KOps/s 2.8304 KOps/s $\color{#d91a1a}-1.75\%$
test_lock_stack_nested 87.5078ms 6.3753ms 156.8560 Ops/s 155.8288 Ops/s $\color{#35bf28}+0.66\%$
test_unlock_nested 80.3152ms 0.4416ms 2.2646 KOps/s 2.8351 KOps/s $\textbf{\color{#d91a1a}-20.12\%}$
test_unlock_stack_nested 88.1902ms 6.4677ms 154.6135 Ops/s 154.0475 Ops/s $\color{#35bf28}+0.37\%$
test_flatten_speed 0.6545ms 0.2653ms 3.7698 KOps/s 3.7893 KOps/s $\color{#d91a1a}-0.51\%$
test_unflatten_speed 0.4092ms 0.3653ms 2.7374 KOps/s 2.7830 KOps/s $\color{#d91a1a}-1.64\%$
test_common_ops 1.1001ms 0.6221ms 1.6076 KOps/s 1.6341 KOps/s $\color{#d91a1a}-1.63\%$
test_creation 13.2600μs 1.5653μs 638.8440 KOps/s 633.3516 KOps/s $\color{#35bf28}+0.87\%$
test_creation_empty 27.8800μs 8.8276μs 113.2809 KOps/s 107.7537 KOps/s $\textbf{\color{#35bf28}+5.13\%}$
test_creation_nested_1 53.5110μs 10.6429μs 93.9597 KOps/s 90.2633 KOps/s $\color{#35bf28}+4.10\%$
test_creation_nested_2 1.5211ms 13.2254μs 75.6118 KOps/s 74.2286 KOps/s $\color{#35bf28}+1.86\%$
test_clone 54.9610μs 14.0593μs 71.1275 KOps/s 73.9736 KOps/s $\color{#d91a1a}-3.85\%$
test_getitem[int] 32.4700μs 11.2406μs 88.9636 KOps/s 90.6969 KOps/s $\color{#d91a1a}-1.91\%$
test_getitem[slice_int] 57.2410μs 22.0036μs 45.4472 KOps/s 47.4600 KOps/s $\color{#d91a1a}-4.24\%$
test_getitem[range] 64.0310μs 37.5728μs 26.6150 KOps/s 27.5555 KOps/s $\color{#d91a1a}-3.41\%$
test_getitem[tuple] 40.5900μs 19.2662μs 51.9044 KOps/s 53.5653 KOps/s $\color{#d91a1a}-3.10\%$
test_getitem[list] 0.1877ms 36.8745μs 27.1190 KOps/s 29.5973 KOps/s $\textbf{\color{#d91a1a}-8.37\%}$
test_setitem_dim[int] 44.4500μs 28.3067μs 35.3274 KOps/s 34.0770 KOps/s $\color{#35bf28}+3.67\%$
test_setitem_dim[slice_int] 68.7210μs 48.7323μs 20.5203 KOps/s 19.3932 KOps/s $\textbf{\color{#35bf28}+5.81\%}$
test_setitem_dim[range] 91.1610μs 67.0512μs 14.9140 KOps/s 15.0448 KOps/s $\color{#d91a1a}-0.87\%$
test_setitem_dim[tuple] 72.7710μs 42.9173μs 23.3006 KOps/s 22.1577 KOps/s $\textbf{\color{#35bf28}+5.16\%}$
test_setitem 94.6910μs 19.5662μs 51.1086 KOps/s 48.3072 KOps/s $\textbf{\color{#35bf28}+5.80\%}$
test_set 77.2010μs 18.9954μs 52.6443 KOps/s 50.0405 KOps/s $\textbf{\color{#35bf28}+5.20\%}$
test_set_shared 2.8326ms 0.1039ms 9.6203 KOps/s 9.8350 KOps/s $\color{#d91a1a}-2.18\%$
test_update 81.9910μs 21.8985μs 45.6653 KOps/s 47.8184 KOps/s $\color{#d91a1a}-4.50\%$
test_update_nested 64.4810μs 28.0657μs 35.6307 KOps/s 34.9455 KOps/s $\color{#35bf28}+1.96\%$
test_set_nested 68.1610μs 20.6557μs 48.4128 KOps/s 50.9026 KOps/s $\color{#d91a1a}-4.89\%$
test_set_nested_new 87.2210μs 23.1432μs 43.2093 KOps/s 42.9411 KOps/s $\color{#35bf28}+0.62\%$
test_select 77.2510μs 37.1117μs 26.9457 KOps/s 26.4280 KOps/s $\color{#35bf28}+1.96\%$
test_select_nested 70.4310μs 53.0584μs 18.8472 KOps/s 18.7358 KOps/s $\color{#35bf28}+0.59\%$
test_exclude_nested 0.1434ms 0.1156ms 8.6539 KOps/s 8.6346 KOps/s $\color{#35bf28}+0.22\%$
test_empty[True] 0.4467ms 0.3935ms 2.5415 KOps/s 2.5673 KOps/s $\color{#d91a1a}-1.00\%$
test_empty[False] 3.0430μs 0.8505μs 1.1757 MOps/s 1.1574 MOps/s $\color{#35bf28}+1.58\%$
test_to 74.9820μs 53.2928μs 18.7643 KOps/s 19.2023 KOps/s $\color{#d91a1a}-2.28\%$
test_to_nonblocking 58.3710μs 35.8790μs 27.8715 KOps/s 30.9606 KOps/s $\textbf{\color{#d91a1a}-9.98\%}$
test_unbind_speed 0.3012ms 0.2738ms 3.6529 KOps/s 3.7313 KOps/s $\color{#d91a1a}-2.10\%$
test_unbind_speed_stack0 90.6567ms 3.8476ms 259.9007 Ops/s 285.9657 Ops/s $\textbf{\color{#d91a1a}-9.11\%}$
test_unbind_speed_stack1 11.2067μs 1.7154μs 582.9448 KOps/s 588.6338 KOps/s $\color{#d91a1a}-0.97\%$
test_split 1.6443ms 1.5686ms 637.5220 Ops/s 586.1429 Ops/s $\textbf{\color{#35bf28}+8.77\%}$
test_chunk 82.1400ms 1.6981ms 588.8915 Ops/s 605.3488 Ops/s $\color{#d91a1a}-2.72\%$
test_creation[device0] 0.1350ms 71.7666μs 13.9341 KOps/s 14.1362 KOps/s $\color{#d91a1a}-1.43\%$
test_creation_from_tensor 0.1366ms 56.6599μs 17.6492 KOps/s 18.1713 KOps/s $\color{#d91a1a}-2.87\%$
test_add_one[memmap_tensor0] 0.2180ms 6.8554μs 145.8707 KOps/s 158.0335 KOps/s $\textbf{\color{#d91a1a}-7.70\%}$
test_contiguous[memmap_tensor0] 23.8910μs 0.6401μs 1.5622 MOps/s 1.5484 MOps/s $\color{#35bf28}+0.89\%$
test_stack[memmap_tensor0] 42.8700μs 4.5192μs 221.2765 KOps/s 230.2343 KOps/s $\color{#d91a1a}-3.89\%$
test_memmaptd_index 1.0398ms 0.2795ms 3.5778 KOps/s 3.6776 KOps/s $\color{#d91a1a}-2.71\%$
test_memmaptd_index_astensor 0.6520ms 0.3367ms 2.9703 KOps/s 3.0321 KOps/s $\color{#d91a1a}-2.04\%$
test_memmaptd_index_op 1.0247ms 0.6442ms 1.5523 KOps/s 1.6140 KOps/s $\color{#d91a1a}-3.82\%$
test_serialize_model 0.1761s 98.3521ms 10.1675 Ops/s 9.6857 Ops/s $\color{#35bf28}+4.97\%$
test_serialize_model_pickle 1.3483s 1.2370s 0.8084 Ops/s 0.8082 Ops/s $\color{#35bf28}+0.03\%$
test_serialize_weights 0.1749s 96.8007ms 10.3305 Ops/s 9.5949 Ops/s $\textbf{\color{#35bf28}+7.67\%}$
test_serialize_weights_returnearly 0.2621s 83.0076ms 12.0471 Ops/s 14.0028 Ops/s $\textbf{\color{#d91a1a}-13.97\%}$
test_serialize_weights_pickle 1.4175s 1.2454s 0.8029 Ops/s 0.8028 Ops/s $\color{#35bf28}+0.02\%$
test_reshape_pytree 47.4800μs 25.8082μs 38.7474 KOps/s 39.6622 KOps/s $\color{#d91a1a}-2.31\%$
test_reshape_td 52.6010μs 30.6335μs 32.6440 KOps/s 32.2332 KOps/s $\color{#35bf28}+1.27\%$
test_view_pytree 41.4500μs 25.2542μs 39.5973 KOps/s 39.2699 KOps/s $\color{#35bf28}+0.83\%$
test_view_td 0.3685ms 6.8927μs 145.0811 KOps/s 99.2542 KOps/s $\textbf{\color{#35bf28}+46.17\%}$
test_unbind_pytree 51.6210μs 30.7306μs 32.5409 KOps/s 33.0231 KOps/s $\color{#d91a1a}-1.46\%$
test_unbind_td 0.1119ms 41.4815μs 24.1071 KOps/s 24.7085 KOps/s $\color{#d91a1a}-2.43\%$
test_split_pytree 44.8310μs 28.9508μs 34.5414 KOps/s 34.9638 KOps/s $\color{#d91a1a}-1.21\%$
test_split_td 0.4091ms 40.7433μs 24.5439 KOps/s 26.0268 KOps/s $\textbf{\color{#d91a1a}-5.70\%}$
test_add_pytree 0.1728ms 36.4860μs 27.4078 KOps/s 28.9595 KOps/s $\textbf{\color{#d91a1a}-5.36\%}$
test_add_td 0.2059ms 53.7290μs 18.6119 KOps/s 20.4562 KOps/s $\textbf{\color{#d91a1a}-9.02\%}$
test_distributed 5.5110ms 87.5635μs 11.4203 KOps/s 14.1601 KOps/s $\textbf{\color{#d91a1a}-19.35\%}$
test_tdmodule 33.2920μs 18.5752μs 53.8352 KOps/s 53.6514 KOps/s $\color{#35bf28}+0.34\%$
test_tdmodule_dispatch 0.2652ms 38.2606μs 26.1365 KOps/s 25.8378 KOps/s $\color{#35bf28}+1.16\%$
test_tdseq 37.5510μs 21.4539μs 46.6116 KOps/s 46.7014 KOps/s $\color{#d91a1a}-0.19\%$
test_tdseq_dispatch 56.5600μs 40.5896μs 24.6369 KOps/s 24.3105 KOps/s $\color{#35bf28}+1.34\%$
test_instantiation_functorch 1.7588ms 1.7080ms 585.4812 Ops/s 600.6023 Ops/s $\color{#d91a1a}-2.52\%$
test_instantiation_td 1.7231ms 1.1766ms 849.9097 Ops/s 769.6718 Ops/s $\textbf{\color{#35bf28}+10.42\%}$
test_exec_functorch 0.2197ms 0.1613ms 6.2000 KOps/s 6.3449 KOps/s $\color{#d91a1a}-2.28\%$
test_exec_functional_call 0.2528ms 0.1594ms 6.2745 KOps/s 6.5111 KOps/s $\color{#d91a1a}-3.63\%$
test_exec_td 0.1777ms 0.1494ms 6.6918 KOps/s 6.9457 KOps/s $\color{#d91a1a}-3.65\%$
test_exec_td_decorator 0.6312ms 0.1863ms 5.3682 KOps/s 5.4922 KOps/s $\color{#d91a1a}-2.26\%$
test_vmap_mlp_speed[True-True] 1.2138ms 1.0293ms 971.5428 Ops/s 974.6090 Ops/s $\color{#d91a1a}-0.31\%$
test_vmap_mlp_speed[True-False] 0.6202ms 0.5842ms 1.7116 KOps/s 1.7037 KOps/s $\color{#35bf28}+0.47\%$
test_vmap_mlp_speed[False-True] 1.0468ms 0.9369ms 1.0673 KOps/s 1.0711 KOps/s $\color{#d91a1a}-0.35\%$
test_vmap_mlp_speed[False-False] 0.5545ms 0.5142ms 1.9447 KOps/s 1.9485 KOps/s $\color{#d91a1a}-0.20\%$
test_vmap_mlp_speed_decorator[True-True] 3.1133ms 2.3502ms 425.4993 Ops/s 427.3596 Ops/s $\color{#d91a1a}-0.44\%$
test_vmap_mlp_speed_decorator[True-False] 1.1174ms 0.6368ms 1.5705 KOps/s 1.5740 KOps/s $\color{#d91a1a}-0.23\%$
test_vmap_mlp_speed_decorator[False-True] 2.3872ms 1.9602ms 510.1476 Ops/s 509.7830 Ops/s $\color{#35bf28}+0.07\%$
test_vmap_mlp_speed_decorator[False-False] 0.8784ms 0.5327ms 1.8774 KOps/s 1.8293 KOps/s $\color{#35bf28}+2.63\%$
test_vmap_transformer_speed[True-True] 12.4988ms 12.0988ms 82.6530 Ops/s 81.5397 Ops/s $\color{#35bf28}+1.37\%$
test_vmap_transformer_speed[True-False] 7.9826ms 7.8549ms 127.3091 Ops/s 125.2471 Ops/s $\color{#35bf28}+1.65\%$
test_vmap_transformer_speed[False-True] 12.2031ms 11.9603ms 83.6100 Ops/s 82.3823 Ops/s $\color{#35bf28}+1.49\%$
test_vmap_transformer_speed[False-False] 8.0521ms 7.7713ms 128.6794 Ops/s 126.3899 Ops/s $\color{#35bf28}+1.81\%$
test_vmap_transformer_speed_decorator[True-True] 73.8711ms 72.7613ms 13.7436 Ops/s 13.4538 Ops/s $\color{#35bf28}+2.15\%$
test_vmap_transformer_speed_decorator[True-False] 20.2736ms 18.7621ms 53.2990 Ops/s 52.2086 Ops/s $\color{#35bf28}+2.09\%$
test_vmap_transformer_speed_decorator[False-True] 65.8271ms 65.4660ms 15.2751 Ops/s 14.9430 Ops/s $\color{#35bf28}+2.22\%$
test_vmap_transformer_speed_decorator[False-False] 20.1154ms 18.3547ms 54.4820 Ops/s 53.6247 Ops/s $\color{#35bf28}+1.60\%$
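
In both tables the Ops column is consistent with being the reciprocal of the Mean column (operations per second = 1 / mean time per call). A minimal sketch of that conversion, using two Mean values from the GPU table, follows; the helper names are hypothetical and the parser only handles the three duration suffixes that appear in this report.

```python
# Duration suffixes used in the Max and Mean columns. The two-character
# suffixes come first so that "ms" and "μs" are matched before the bare "s".
_SUFFIXES = (("ms", 1e-3), ("μs", 1e-6), ("s", 1.0))


def parse_duration(text: str) -> float:
    """Convert a string such as '6.8927μs' or '1.2370s' to seconds."""
    for suffix, scale in _SUFFIXES:
        if text.endswith(suffix):
            return float(text[: -len(suffix)]) * scale
    raise ValueError(f"unrecognized duration: {text!r}")


def ops_per_second(mean_seconds: float) -> float:
    """Throughput implied by a mean time per call."""
    return 1.0 / mean_seconds


# Mean values copied from the GPU table.
for name, mean in [
    ("test_view_td", "6.8927μs"),
    ("test_serialize_model_pickle", "1.2370s"),
]:
    print(f"{name}: {ops_per_second(parse_duration(mean)):.4f} Ops/s")
```

Because Ops is derived from Mean, a single slow outlier (visible in the Max column) can shift the reported change noticeably for benchmarks that run in a few microseconds.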

@vmoens added the "documentation" label (improvements or additions to documentation) on Jan 31, 2024
@vmoens merged commit 2c73daa into main on Jan 31, 2024
23 of 32 checks passed
@vmoens deleted the update-readme branch on January 31, 2024 at 11:21
3 participants