Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[NOMERG] v0.3.0 release wheels #644

Closed
wants to merge 7 commits into from
Closed

[NOMERG] v0.3.0 release wheels #644

wants to merge 7 commits into from

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Jan 30, 2024

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 30, 2024
@vmoens vmoens added the ciflow/binaries/all Build all wheels label Jan 30, 2024
Copy link

github-actions bot commented Jan 30, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 124. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}21$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 40.7860μs 18.2742μs 54.7218 KOps/s 58.2764 KOps/s $\textbf{\color{#d91a1a}-6.10\%}$
test_plain_set_stack_nested 0.1964ms 0.1562ms 6.4008 KOps/s 6.6349 KOps/s $\color{#d91a1a}-3.53\%$
test_plain_set_nested_inplace 57.8980μs 20.8990μs 47.8491 KOps/s 49.6985 KOps/s $\color{#d91a1a}-3.72\%$
test_plain_set_stack_nested_inplace 0.6416ms 0.1957ms 5.1101 KOps/s 5.4063 KOps/s $\textbf{\color{#d91a1a}-5.48\%}$
test_items 0.1391ms 2.6284μs 380.4553 KOps/s 402.3506 KOps/s $\textbf{\color{#d91a1a}-5.44\%}$
test_items_nested 1.2891ms 0.2782ms 3.5947 KOps/s 3.5608 KOps/s $\color{#35bf28}+0.95\%$
test_items_nested_locked 0.5869ms 0.2803ms 3.5673 KOps/s 3.6086 KOps/s $\color{#d91a1a}-1.14\%$
test_items_nested_leaf 0.7278ms 0.1711ms 5.8437 KOps/s 5.7689 KOps/s $\color{#35bf28}+1.30\%$
test_items_stack_nested 2.1888ms 1.3845ms 722.2893 Ops/s 731.8985 Ops/s $\color{#d91a1a}-1.31\%$
test_items_stack_nested_leaf 2.2832ms 1.2445ms 803.5211 Ops/s 818.8684 Ops/s $\color{#d91a1a}-1.87\%$
test_items_stack_nested_locked 2.1767ms 0.9251ms 1.0809 KOps/s 1.1185 KOps/s $\color{#d91a1a}-3.35\%$
test_keys 59.3210μs 3.9849μs 250.9480 KOps/s 263.4512 KOps/s $\color{#d91a1a}-4.75\%$
test_keys_nested 1.6625ms 0.1484ms 6.7392 KOps/s 6.6534 KOps/s $\color{#35bf28}+1.29\%$
test_keys_nested_locked 0.2580ms 0.1538ms 6.5009 KOps/s 6.4833 KOps/s $\color{#35bf28}+0.27\%$
test_keys_nested_leaf 0.2502ms 0.1312ms 7.6193 KOps/s 7.5143 KOps/s $\color{#35bf28}+1.40\%$
test_keys_stack_nested 1.6626ms 1.3327ms 750.3292 Ops/s 768.8269 Ops/s $\color{#d91a1a}-2.41\%$
test_keys_stack_nested_leaf 1.6106ms 1.3186ms 758.3885 Ops/s 772.5573 Ops/s $\color{#d91a1a}-1.83\%$
test_keys_stack_nested_locked 1.5909ms 0.8394ms 1.1913 KOps/s 1.2308 KOps/s $\color{#d91a1a}-3.21\%$
test_values 8.9482μs 1.1665μs 857.2466 KOps/s 854.6003 KOps/s $\color{#35bf28}+0.31\%$
test_values_nested 98.4440μs 53.0158μs 18.8623 KOps/s 19.0084 KOps/s $\color{#d91a1a}-0.77\%$
test_values_nested_locked 0.1162ms 53.2173μs 18.7909 KOps/s 18.9756 KOps/s $\color{#d91a1a}-0.97\%$
test_values_nested_leaf 83.2560μs 47.3512μs 21.1188 KOps/s 21.3655 KOps/s $\color{#d91a1a}-1.15\%$
test_values_stack_nested 1.2919ms 1.0606ms 942.8733 Ops/s 950.0792 Ops/s $\color{#d91a1a}-0.76\%$
test_values_stack_nested_leaf 1.2215ms 1.0560ms 946.9836 Ops/s 958.2350 Ops/s $\color{#d91a1a}-1.17\%$
test_values_stack_nested_locked 0.8721ms 0.6364ms 1.5714 KOps/s 1.6345 KOps/s $\color{#d91a1a}-3.86\%$
test_membership 12.5740μs 1.3329μs 750.2561 KOps/s 753.2158 KOps/s $\color{#d91a1a}-0.39\%$
test_membership_nested 41.4580μs 3.4534μs 289.5679 KOps/s 288.5539 KOps/s $\color{#35bf28}+0.35\%$
test_membership_nested_leaf 23.6330μs 3.4619μs 288.8622 KOps/s 290.7510 KOps/s $\color{#d91a1a}-0.65\%$
test_membership_stacked_nested 51.2760μs 13.0722μs 76.4982 KOps/s 78.1572 KOps/s $\color{#d91a1a}-2.12\%$
test_membership_stacked_nested_leaf 27.8720μs 11.6746μs 85.6560 KOps/s 83.7765 KOps/s $\color{#35bf28}+2.24\%$
test_membership_nested_last 41.4470μs 6.6634μs 150.0738 KOps/s 149.3954 KOps/s $\color{#35bf28}+0.45\%$
test_membership_nested_leaf_last 45.6190μs 6.5693μs 152.2241 KOps/s 151.2769 KOps/s $\color{#35bf28}+0.63\%$
test_membership_stacked_nested_last 0.3238ms 0.1808ms 5.5313 KOps/s 5.6257 KOps/s $\color{#d91a1a}-1.68\%$
test_membership_stacked_nested_leaf_last 38.9930μs 13.8506μs 72.1990 KOps/s 70.1303 KOps/s $\color{#35bf28}+2.95\%$
test_nested_getleaf 30.1860μs 10.6231μs 94.1349 KOps/s 91.4004 KOps/s $\color{#35bf28}+2.99\%$
test_nested_get 27.1000μs 10.0991μs 99.0183 KOps/s 97.0896 KOps/s $\color{#35bf28}+1.99\%$
test_stacked_getleaf 0.9275ms 0.4152ms 2.4084 KOps/s 2.4485 KOps/s $\color{#d91a1a}-1.64\%$
test_stacked_get 1.1772ms 0.3954ms 2.5291 KOps/s 2.6781 KOps/s $\textbf{\color{#d91a1a}-5.56\%}$
test_nested_getitemleaf 30.9180μs 12.0893μs 82.7176 KOps/s 80.4729 KOps/s $\color{#35bf28}+2.79\%$
test_nested_getitem 42.2190μs 11.6220μs 86.0436 KOps/s 83.2695 KOps/s $\color{#35bf28}+3.33\%$
test_stacked_getitemleaf 0.6110ms 0.4201ms 2.3806 KOps/s 2.4081 KOps/s $\color{#d91a1a}-1.14\%$
test_stacked_getitem 0.6807ms 0.3851ms 2.5966 KOps/s 2.6294 KOps/s $\color{#d91a1a}-1.25\%$
test_lock_nested 2.7234ms 0.3377ms 2.9615 KOps/s 2.9988 KOps/s $\color{#d91a1a}-1.24\%$
test_lock_stack_nested 0.1018s 5.9295ms 168.6470 Ops/s 177.2270 Ops/s $\color{#d91a1a}-4.84\%$
test_unlock_nested 73.1396ms 0.4084ms 2.4485 KOps/s 2.9877 KOps/s $\textbf{\color{#d91a1a}-18.05\%}$
test_unlock_stack_nested 0.1101s 6.1250ms 163.2657 Ops/s 170.1963 Ops/s $\color{#d91a1a}-4.07\%$
test_flatten_speed 3.2451ms 0.3717ms 2.6906 KOps/s 2.6579 KOps/s $\color{#35bf28}+1.23\%$
test_unflatten_speed 0.7642ms 0.4720ms 2.1188 KOps/s 2.1344 KOps/s $\color{#d91a1a}-0.73\%$
test_common_ops 1.3406ms 0.7108ms 1.4069 KOps/s 1.4895 KOps/s $\textbf{\color{#d91a1a}-5.54\%}$
test_creation 47.3090μs 1.8336μs 545.3736 KOps/s 555.1959 KOps/s $\color{#d91a1a}-1.77\%$
test_creation_empty 29.8150μs 11.1846μs 89.4084 KOps/s 105.4934 KOps/s $\textbf{\color{#d91a1a}-15.25\%}$
test_creation_nested_1 38.2620μs 14.0768μs 71.0391 KOps/s 82.4515 KOps/s $\textbf{\color{#d91a1a}-13.84\%}$
test_creation_nested_2 0.2747ms 17.1988μs 58.1437 KOps/s 64.0381 KOps/s $\textbf{\color{#d91a1a}-9.20\%}$
test_clone 70.6720μs 13.2498μs 75.4729 KOps/s 78.7317 KOps/s $\color{#d91a1a}-4.14\%$
test_getitem[int] 26.9800μs 11.1280μs 89.8631 KOps/s 90.2546 KOps/s $\color{#d91a1a}-0.43\%$
test_getitem[slice_int] 64.9310μs 22.6284μs 44.1922 KOps/s 43.7763 KOps/s $\color{#35bf28}+0.95\%$
test_getitem[range] 0.1453ms 42.6997μs 23.4194 KOps/s 23.4217 KOps/s $\color{#d91a1a}-0.01\%$
test_getitem[tuple] 51.6460μs 18.1372μs 55.1352 KOps/s 54.1617 KOps/s $\color{#35bf28}+1.80\%$
test_getitem[list] 0.1521ms 36.9482μs 27.0649 KOps/s 25.8024 KOps/s $\color{#35bf28}+4.89\%$
test_setitem_dim[int] 73.9680μs 30.4542μs 32.8362 KOps/s 33.2959 KOps/s $\color{#d91a1a}-1.38\%$
test_setitem_dim[slice_int] 0.1023ms 57.0860μs 17.5174 KOps/s 17.3923 KOps/s $\color{#35bf28}+0.72\%$
test_setitem_dim[range] 0.1449ms 78.0047μs 12.8197 KOps/s 12.8740 KOps/s $\color{#d91a1a}-0.42\%$
test_setitem_dim[tuple] 69.2290μs 46.1032μs 21.6905 KOps/s 22.8036 KOps/s $\color{#d91a1a}-4.88\%$
test_setitem 86.2120μs 20.8553μs 47.9495 KOps/s 52.6528 KOps/s $\textbf{\color{#d91a1a}-8.93\%}$
test_set 66.0640μs 19.9620μs 50.0953 KOps/s 53.8130 KOps/s $\textbf{\color{#d91a1a}-6.91\%}$
test_set_shared 1.8969ms 0.1404ms 7.1228 KOps/s 7.1861 KOps/s $\color{#d91a1a}-0.88\%$
test_update 0.1200ms 23.3110μs 42.8982 KOps/s 47.4040 KOps/s $\textbf{\color{#d91a1a}-9.51\%}$
test_update_nested 0.1266ms 31.9272μs 31.3213 KOps/s 34.0162 KOps/s $\textbf{\color{#d91a1a}-7.92\%}$
test_set_nested 69.3000μs 22.3557μs 44.7313 KOps/s 48.1206 KOps/s $\textbf{\color{#d91a1a}-7.04\%}$
test_set_nested_new 0.2069ms 26.1056μs 38.3059 KOps/s 40.0878 KOps/s $\color{#d91a1a}-4.44\%$
test_select 4.4180ms 38.9566μs 25.6696 KOps/s 26.1527 KOps/s $\color{#d91a1a}-1.85\%$
test_select_nested 0.1121ms 58.7930μs 17.0088 KOps/s 17.0408 KOps/s $\color{#d91a1a}-0.19\%$
test_exclude_nested 0.2464ms 0.1187ms 8.4272 KOps/s 8.4136 KOps/s $\color{#35bf28}+0.16\%$
test_empty[True] 0.6519ms 0.4119ms 2.4279 KOps/s 2.4575 KOps/s $\color{#d91a1a}-1.21\%$
test_empty[False] 5.5102μs 1.0620μs 941.6577 KOps/s 970.7992 KOps/s $\color{#d91a1a}-3.00\%$
test_unbind_speed 0.4366ms 0.2487ms 4.0205 KOps/s 3.8001 KOps/s $\textbf{\color{#35bf28}+5.80\%}$
test_unbind_speed_stack0 78.6013ms 3.4202ms 292.3815 Ops/s 273.4698 Ops/s $\textbf{\color{#35bf28}+6.92\%}$
test_unbind_speed_stack1 19.6670μs 1.9962μs 500.9481 KOps/s 517.7423 KOps/s $\color{#d91a1a}-3.24\%$
test_split 2.5338ms 1.4595ms 685.1734 Ops/s 677.5603 Ops/s $\color{#35bf28}+1.12\%$
test_chunk 71.0895ms 1.5567ms 642.3829 Ops/s 630.9017 Ops/s $\color{#35bf28}+1.82\%$
test_creation[device0] 0.1752ms 0.1010ms 9.9007 KOps/s 9.8501 KOps/s $\color{#35bf28}+0.51\%$
test_creation_from_tensor 3.5822ms 81.8113μs 12.2233 KOps/s 12.2600 KOps/s $\color{#d91a1a}-0.30\%$
test_add_one[memmap_tensor0] 0.1707ms 5.4840μs 182.3487 KOps/s 179.6237 KOps/s $\color{#35bf28}+1.52\%$
test_contiguous[memmap_tensor0] 15.8100μs 0.6458μs 1.5485 MOps/s 1.5735 MOps/s $\color{#d91a1a}-1.59\%$
test_stack[memmap_tensor0] 50.0340μs 3.6754μs 272.0829 KOps/s 278.1979 KOps/s $\color{#d91a1a}-2.20\%$
test_memmaptd_index 1.0444ms 0.2420ms 4.1327 KOps/s 4.1584 KOps/s $\color{#d91a1a}-0.62\%$
test_memmaptd_index_astensor 0.6385ms 0.3041ms 3.2887 KOps/s 3.3330 KOps/s $\color{#d91a1a}-1.33\%$
test_memmaptd_index_op 1.0547ms 0.6230ms 1.6050 KOps/s 1.6745 KOps/s $\color{#d91a1a}-4.15\%$
test_serialize_model 0.1713s 0.1071s 9.3381 Ops/s 8.8566 Ops/s $\textbf{\color{#35bf28}+5.44\%}$
test_serialize_model_pickle 0.4601s 0.3792s 2.6372 Ops/s 2.6200 Ops/s $\color{#35bf28}+0.66\%$
test_serialize_weights 0.1738s 0.1078s 9.2743 Ops/s 8.9916 Ops/s $\color{#35bf28}+3.14\%$
test_serialize_weights_returnearly 0.2025s 0.1343s 7.4446 Ops/s 8.1089 Ops/s $\textbf{\color{#d91a1a}-8.19\%}$
test_serialize_weights_pickle 0.8595s 0.4845s 2.0638 Ops/s 2.3111 Ops/s $\textbf{\color{#d91a1a}-10.70\%}$
test_serialize_weights_filesystem 98.6212ms 92.0720ms 10.8611 Ops/s 10.7882 Ops/s $\color{#35bf28}+0.68\%$
test_serialize_model_filesystem 0.1573s 97.5853ms 10.2474 Ops/s 9.9215 Ops/s $\color{#35bf28}+3.29\%$
test_reshape_pytree 66.8350μs 21.1245μs 47.3385 KOps/s 47.9737 KOps/s $\color{#d91a1a}-1.32\%$
test_reshape_td 65.2720μs 30.6938μs 32.5799 KOps/s 33.3969 KOps/s $\color{#d91a1a}-2.45\%$
test_view_pytree 57.2080μs 20.9453μs 47.7435 KOps/s 47.4086 KOps/s $\color{#35bf28}+0.71\%$
test_view_td 79.3412ms 11.3637μs 87.9999 KOps/s 88.8251 KOps/s $\color{#d91a1a}-0.93\%$
test_unbind_pytree 56.7670μs 24.0533μs 41.5743 KOps/s 41.5214 KOps/s $\color{#35bf28}+0.13\%$
test_unbind_td 0.4469ms 35.6066μs 28.0847 KOps/s 27.8525 KOps/s $\color{#35bf28}+0.83\%$
test_split_pytree 52.5480μs 23.6190μs 42.3388 KOps/s 42.4079 KOps/s $\color{#d91a1a}-0.16\%$
test_split_td 0.1241ms 38.7925μs 25.7782 KOps/s 24.9567 KOps/s $\color{#35bf28}+3.29\%$
test_add_pytree 65.8830μs 29.7141μs 33.6541 KOps/s 33.4145 KOps/s $\color{#35bf28}+0.72\%$
test_add_td 0.1664ms 52.8761μs 18.9121 KOps/s 19.3079 KOps/s $\color{#d91a1a}-2.05\%$
test_distributed 0.1880ms 98.2555μs 10.1775 KOps/s 9.7957 KOps/s $\color{#35bf28}+3.90\%$
test_tdmodule 0.1745ms 24.0733μs 41.5398 KOps/s 45.4758 KOps/s $\textbf{\color{#d91a1a}-8.66\%}$
test_tdmodule_dispatch 0.2057ms 47.1820μs 21.1945 KOps/s 23.8814 KOps/s $\textbf{\color{#d91a1a}-11.25\%}$
test_tdseq 45.0240μs 26.8563μs 37.2352 KOps/s 40.0860 KOps/s $\textbf{\color{#d91a1a}-7.11\%}$
test_tdseq_dispatch 0.1435ms 49.9552μs 20.0179 KOps/s 21.5399 KOps/s $\textbf{\color{#d91a1a}-7.07\%}$
test_instantiation_functorch 2.1224ms 1.3419ms 745.1995 Ops/s 765.5226 Ops/s $\color{#d91a1a}-2.65\%$
test_instantiation_td 1.5515ms 1.0162ms 984.1023 Ops/s 898.5118 Ops/s $\textbf{\color{#35bf28}+9.53\%}$
test_exec_functorch 0.3015ms 0.1612ms 6.2043 KOps/s 6.4097 KOps/s $\color{#d91a1a}-3.20\%$
test_exec_functional_call 0.3646ms 0.1505ms 6.6426 KOps/s 6.8245 KOps/s $\color{#d91a1a}-2.66\%$
test_exec_td 0.2541ms 0.1474ms 6.7836 KOps/s 7.0469 KOps/s $\color{#d91a1a}-3.74\%$
test_exec_td_decorator 0.6718ms 0.1803ms 5.5448 KOps/s 5.6774 KOps/s $\color{#d91a1a}-2.34\%$
test_vmap_mlp_speed[True-True] 1.8494ms 0.9804ms 1.0200 KOps/s 1.1197 KOps/s $\textbf{\color{#d91a1a}-8.91\%}$
test_vmap_mlp_speed[True-False] 0.7338ms 0.4822ms 2.0740 KOps/s 2.1462 KOps/s $\color{#d91a1a}-3.36\%$
test_vmap_mlp_speed[False-True] 1.0702ms 0.8063ms 1.2403 KOps/s 1.2931 KOps/s $\color{#d91a1a}-4.08\%$
test_vmap_mlp_speed[False-False] 0.6136ms 0.3966ms 2.5217 KOps/s 2.6193 KOps/s $\color{#d91a1a}-3.73\%$
test_vmap_mlp_speed_decorator[True-True] 3.1850ms 2.3967ms 417.2402 Ops/s 431.2966 Ops/s $\color{#d91a1a}-3.26\%$
test_vmap_mlp_speed_decorator[True-False] 1.0527ms 0.5341ms 1.8722 KOps/s 1.9409 KOps/s $\color{#d91a1a}-3.54\%$
test_vmap_mlp_speed_decorator[False-True] 2.6618ms 1.9680ms 508.1285 Ops/s 528.7872 Ops/s $\color{#d91a1a}-3.91\%$
test_vmap_mlp_speed_decorator[False-False] 0.7588ms 0.4060ms 2.4629 KOps/s 2.5180 KOps/s $\color{#d91a1a}-2.19\%$

Copy link

github-actions bot commented Jan 30, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 132. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}31$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 62.4173ms 17.5330μs 57.0353 KOps/s 80.2296 KOps/s $\textbf{\color{#d91a1a}-28.91\%}$
test_plain_set_stack_nested 0.1632ms 0.1191ms 8.3935 KOps/s 8.5647 KOps/s $\color{#d91a1a}-2.00\%$
test_plain_set_nested_inplace 44.6410μs 15.7267μs 63.5863 KOps/s 72.0484 KOps/s $\textbf{\color{#d91a1a}-11.75\%}$
test_plain_set_stack_nested_inplace 0.1765ms 0.1474ms 6.7832 KOps/s 6.8792 KOps/s $\color{#d91a1a}-1.40\%$
test_items 24.6600μs 4.8489μs 206.2308 KOps/s 209.5878 KOps/s $\color{#d91a1a}-1.60\%$
test_items_nested 0.3680ms 0.3400ms 2.9415 KOps/s 2.9385 KOps/s $\color{#35bf28}+0.10\%$
test_items_nested_locked 0.3741ms 0.3434ms 2.9117 KOps/s 2.9018 KOps/s $\color{#35bf28}+0.34\%$
test_items_nested_leaf 0.2400ms 0.2002ms 4.9958 KOps/s 4.9747 KOps/s $\color{#35bf28}+0.42\%$
test_items_stack_nested 1.4200ms 1.3058ms 765.8249 Ops/s 760.4850 Ops/s $\color{#35bf28}+0.70\%$
test_items_stack_nested_leaf 1.2964ms 1.1505ms 869.1679 Ops/s 867.7691 Ops/s $\color{#35bf28}+0.16\%$
test_items_stack_nested_locked 1.9425ms 0.9023ms 1.1082 KOps/s 1.1241 KOps/s $\color{#d91a1a}-1.41\%$
test_keys 15.5710μs 4.8081μs 207.9843 KOps/s 219.0968 KOps/s $\textbf{\color{#d91a1a}-5.07\%}$
test_keys_nested 0.4914ms 94.3319μs 10.6009 KOps/s 10.5926 KOps/s $\color{#35bf28}+0.08\%$
test_keys_nested_locked 0.1225ms 97.6383μs 10.2419 KOps/s 10.1900 KOps/s $\color{#35bf28}+0.51\%$
test_keys_nested_leaf 0.2435ms 77.6476μs 12.8787 KOps/s 12.8291 KOps/s $\color{#35bf28}+0.39\%$
test_keys_stack_nested 1.3376ms 1.1519ms 868.0973 Ops/s 868.6870 Ops/s $\color{#d91a1a}-0.07\%$
test_keys_stack_nested_leaf 1.3687ms 1.1538ms 866.6868 Ops/s 890.8122 Ops/s $\color{#d91a1a}-2.71\%$
test_keys_stack_nested_locked 0.8023ms 0.7210ms 1.3870 KOps/s 1.4117 KOps/s $\color{#d91a1a}-1.74\%$
test_values 9.1533μs 1.8897μs 529.1840 KOps/s 531.1883 KOps/s $\color{#d91a1a}-0.38\%$
test_values_nested 77.3120μs 45.4976μs 21.9792 KOps/s 22.0894 KOps/s $\color{#d91a1a}-0.50\%$
test_values_nested_locked 74.3510μs 47.8214μs 20.9111 KOps/s 21.0310 KOps/s $\color{#d91a1a}-0.57\%$
test_values_nested_leaf 67.9910μs 39.9142μs 25.0537 KOps/s 25.3563 KOps/s $\color{#d91a1a}-1.19\%$
test_values_stack_nested 1.0193ms 0.9551ms 1.0470 KOps/s 1.0484 KOps/s $\color{#d91a1a}-0.14\%$
test_values_stack_nested_leaf 1.0194ms 0.9591ms 1.0427 KOps/s 1.0511 KOps/s $\color{#d91a1a}-0.80\%$
test_values_stack_nested_locked 0.7020ms 0.5736ms 1.7433 KOps/s 1.7755 KOps/s $\color{#d91a1a}-1.81\%$
test_membership 5.8460μs 0.9362μs 1.0681 MOps/s 1.0495 MOps/s $\color{#35bf28}+1.78\%$
test_membership_nested 16.6500μs 2.8950μs 345.4193 KOps/s 347.7092 KOps/s $\color{#d91a1a}-0.66\%$
test_membership_nested_leaf 35.1710μs 2.8881μs 346.2504 KOps/s 346.0747 KOps/s $\color{#35bf28}+0.05\%$
test_membership_stacked_nested 36.5610μs 11.3497μs 88.1080 KOps/s 89.5503 KOps/s $\color{#d91a1a}-1.61\%$
test_membership_stacked_nested_leaf 44.2110μs 11.4198μs 87.5674 KOps/s 88.8030 KOps/s $\color{#d91a1a}-1.39\%$
test_membership_nested_last 31.8400μs 5.2517μs 190.4163 KOps/s 186.6369 KOps/s $\color{#35bf28}+2.03\%$
test_membership_nested_leaf_last 20.1710μs 5.3128μs 188.2258 KOps/s 186.9613 KOps/s $\color{#35bf28}+0.68\%$
test_membership_stacked_nested_last 0.2433ms 0.1573ms 6.3581 KOps/s 6.3673 KOps/s $\color{#d91a1a}-0.14\%$
test_membership_stacked_nested_leaf_last 0.1549ms 13.1621μs 75.9760 KOps/s 76.9027 KOps/s $\color{#d91a1a}-1.21\%$
test_nested_getleaf 29.7410μs 8.4444μs 118.4218 KOps/s 118.7393 KOps/s $\color{#d91a1a}-0.27\%$
test_nested_get 22.7100μs 7.9536μs 125.7299 KOps/s 125.6110 KOps/s $\color{#35bf28}+0.09\%$
test_stacked_getleaf 0.4600ms 0.3322ms 3.0098 KOps/s 3.0214 KOps/s $\color{#d91a1a}-0.39\%$
test_stacked_get 0.3391ms 0.2969ms 3.3685 KOps/s 3.3917 KOps/s $\color{#d91a1a}-0.68\%$
test_nested_getitemleaf 34.6600μs 9.8193μs 101.8406 KOps/s 102.1938 KOps/s $\color{#d91a1a}-0.35\%$
test_nested_getitem 32.8710μs 9.3744μs 106.6734 KOps/s 106.7205 KOps/s $\color{#d91a1a}-0.04\%$
test_stacked_getitemleaf 0.4139ms 0.3319ms 3.0128 KOps/s 3.0175 KOps/s $\color{#d91a1a}-0.16\%$
test_stacked_getitem 0.3654ms 0.2998ms 3.3357 KOps/s 3.3054 KOps/s $\color{#35bf28}+0.92\%$
test_lock_nested 0.8777ms 0.3487ms 2.8674 KOps/s 2.8433 KOps/s $\color{#35bf28}+0.85\%$
test_lock_stack_nested 89.1620ms 6.3091ms 158.5013 Ops/s 155.1055 Ops/s $\color{#35bf28}+2.19\%$
test_unlock_nested 82.0870ms 0.4298ms 2.3269 KOps/s 2.8804 KOps/s $\textbf{\color{#d91a1a}-19.22\%}$
test_unlock_stack_nested 89.2402ms 6.3797ms 156.7468 Ops/s 153.2328 Ops/s $\color{#35bf28}+2.29\%$
test_flatten_speed 0.6591ms 0.2602ms 3.8439 KOps/s 3.8157 KOps/s $\color{#35bf28}+0.74\%$
test_unflatten_speed 0.4026ms 0.3620ms 2.7627 KOps/s 2.7497 KOps/s $\color{#35bf28}+0.47\%$
test_common_ops 1.1036ms 0.6538ms 1.5294 KOps/s 1.7986 KOps/s $\textbf{\color{#d91a1a}-14.97\%}$
test_creation 43.2810μs 1.5772μs 634.0214 KOps/s 650.6510 KOps/s $\color{#d91a1a}-2.56\%$
test_creation_empty 0.1244ms 10.0957μs 99.0522 KOps/s 159.4419 KOps/s $\textbf{\color{#d91a1a}-37.88\%}$
test_creation_nested_1 39.4310μs 11.7216μs 85.3123 KOps/s 124.2031 KOps/s $\textbf{\color{#d91a1a}-31.31\%}$
test_creation_nested_2 99.5620μs 14.2722μs 70.0660 KOps/s 95.6479 KOps/s $\textbf{\color{#d91a1a}-26.75\%}$
test_clone 73.3610μs 13.4890μs 74.1345 KOps/s 71.0157 KOps/s $\color{#35bf28}+4.39\%$
test_getitem[int] 25.3400μs 10.5673μs 94.6316 KOps/s 93.9409 KOps/s $\color{#35bf28}+0.74\%$
test_getitem[slice_int] 0.1336ms 21.0614μs 47.4801 KOps/s 48.3358 KOps/s $\color{#d91a1a}-1.77\%$
test_getitem[range] 0.1703ms 34.4621μs 29.0174 KOps/s 28.8627 KOps/s $\color{#35bf28}+0.54\%$
test_getitem[tuple] 43.2000μs 18.5149μs 54.0107 KOps/s 55.9397 KOps/s $\color{#d91a1a}-3.45\%$
test_getitem[list] 0.1629ms 32.2992μs 30.9605 KOps/s 32.0107 KOps/s $\color{#d91a1a}-3.28\%$
test_setitem_dim[int] 51.1010μs 28.5504μs 35.0258 KOps/s 43.4271 KOps/s $\textbf{\color{#d91a1a}-19.35\%}$
test_setitem_dim[slice_int] 83.1720μs 49.5080μs 20.1988 KOps/s 23.4704 KOps/s $\textbf{\color{#d91a1a}-13.94\%}$
test_setitem_dim[range] 0.2062ms 65.5061μs 15.2657 KOps/s 17.5395 KOps/s $\textbf{\color{#d91a1a}-12.96\%}$
test_setitem_dim[tuple] 71.5310μs 45.3294μs 22.0607 KOps/s 26.1670 KOps/s $\textbf{\color{#d91a1a}-15.69\%}$
test_setitem 0.1742ms 20.7495μs 48.1939 KOps/s 57.8071 KOps/s $\textbf{\color{#d91a1a}-16.63\%}$
test_set 0.1550ms 19.9230μs 50.1934 KOps/s 62.0811 KOps/s $\textbf{\color{#d91a1a}-19.15\%}$
test_set_shared 2.7546ms 0.1020ms 9.8014 KOps/s 9.9958 KOps/s $\color{#d91a1a}-1.95\%$
test_update 0.1317ms 21.6314μs 46.2290 KOps/s 56.4017 KOps/s $\textbf{\color{#d91a1a}-18.04\%}$
test_update_nested 93.3120μs 28.4345μs 35.1685 KOps/s 41.5710 KOps/s $\textbf{\color{#d91a1a}-15.40\%}$
test_set_nested 58.6210μs 20.2443μs 49.3965 KOps/s 55.6477 KOps/s $\textbf{\color{#d91a1a}-11.23\%}$
test_set_nested_new 69.5610μs 23.1308μs 43.2324 KOps/s 49.3857 KOps/s $\textbf{\color{#d91a1a}-12.46\%}$
test_select 0.1353ms 35.7625μs 27.9623 KOps/s 30.1840 KOps/s $\textbf{\color{#d91a1a}-7.36\%}$
test_select_nested 94.5320μs 53.7101μs 18.6185 KOps/s 18.7228 KOps/s $\color{#d91a1a}-0.56\%$
test_exclude_nested 0.1478ms 0.1133ms 8.8297 KOps/s 8.7386 KOps/s $\color{#35bf28}+1.04\%$
test_empty[True] 0.4532ms 0.3862ms 2.5892 KOps/s 2.5898 KOps/s $\color{#d91a1a}-0.02\%$
test_empty[False] 2.9920μs 0.8689μs 1.1508 MOps/s 1.1908 MOps/s $\color{#d91a1a}-3.36\%$
test_to 72.0810μs 52.5328μs 19.0357 KOps/s 19.1085 KOps/s $\color{#d91a1a}-0.38\%$
test_to_nonblocking 0.1812ms 32.1824μs 31.0729 KOps/s 30.5778 KOps/s $\color{#35bf28}+1.62\%$
test_unbind_speed 0.3224ms 0.2631ms 3.8006 KOps/s 3.7540 KOps/s $\color{#35bf28}+1.24\%$
test_unbind_speed_stack0 90.1892ms 3.4433ms 290.4179 Ops/s 239.5716 Ops/s $\textbf{\color{#35bf28}+21.22\%}$
test_unbind_speed_stack1 18.8000μs 1.7790μs 562.1202 KOps/s 539.0534 KOps/s $\color{#35bf28}+4.28\%$
test_split 82.1803ms 1.7166ms 582.5547 Ops/s 653.5949 Ops/s $\textbf{\color{#d91a1a}-10.87\%}$
test_chunk 82.1649ms 1.6532ms 604.8787 Ops/s 602.4681 Ops/s $\color{#35bf28}+0.40\%$
test_creation[device0] 0.2427ms 73.9254μs 13.5271 KOps/s 14.2620 KOps/s $\textbf{\color{#d91a1a}-5.15\%}$
test_creation_from_tensor 0.2279ms 56.6466μs 17.6533 KOps/s 19.0040 KOps/s $\textbf{\color{#d91a1a}-7.11\%}$
test_add_one[memmap_tensor0] 0.2357ms 6.3454μs 157.5944 KOps/s 159.4823 KOps/s $\color{#d91a1a}-1.18\%$
test_contiguous[memmap_tensor0] 33.4300μs 0.6260μs 1.5974 MOps/s 1.5926 MOps/s $\color{#35bf28}+0.30\%$
test_stack[memmap_tensor0] 37.0310μs 4.3358μs 230.6371 KOps/s 233.8354 KOps/s $\color{#d91a1a}-1.37\%$
test_memmaptd_index 1.0155ms 0.2568ms 3.8936 KOps/s 3.8466 KOps/s $\color{#35bf28}+1.22\%$
test_memmaptd_index_astensor 0.6330ms 0.3134ms 3.1904 KOps/s 3.1532 KOps/s $\color{#35bf28}+1.18\%$
test_memmaptd_index_op 0.9274ms 0.6205ms 1.6117 KOps/s 1.7643 KOps/s $\textbf{\color{#d91a1a}-8.65\%}$
test_serialize_model 0.1734s 97.9246ms 10.2119 Ops/s 9.6350 Ops/s $\textbf{\color{#35bf28}+5.99\%}$
test_serialize_model_pickle 1.8702s 1.3133s 0.7614 Ops/s 0.7999 Ops/s $\color{#d91a1a}-4.81\%$
test_serialize_weights 0.1733s 96.1166ms 10.4040 Ops/s 10.0333 Ops/s $\color{#35bf28}+3.69\%$
test_serialize_weights_returnearly 0.2750s 73.0992ms 13.6800 Ops/s 14.5913 Ops/s $\textbf{\color{#d91a1a}-6.25\%}$
test_serialize_weights_pickle 1.3882s 1.2426s 0.8048 Ops/s 0.8001 Ops/s $\color{#35bf28}+0.58\%$
test_reshape_pytree 0.1280ms 24.6492μs 40.5692 KOps/s 40.2907 KOps/s $\color{#35bf28}+0.69\%$
test_reshape_td 0.1298ms 29.5413μs 33.8509 KOps/s 34.0858 KOps/s $\color{#d91a1a}-0.69\%$
test_view_pytree 0.1553ms 24.3366μs 41.0904 KOps/s 40.4891 KOps/s $\color{#35bf28}+1.49\%$
test_view_td 85.4327ms 9.9880μs 100.1199 KOps/s 144.9312 KOps/s $\textbf{\color{#d91a1a}-30.92\%}$
test_unbind_pytree 0.1565ms 30.1892μs 33.1245 KOps/s 32.8939 KOps/s $\color{#35bf28}+0.70\%$
test_unbind_td 0.5622ms 40.3254μs 24.7983 KOps/s 25.1225 KOps/s $\color{#d91a1a}-1.29\%$
test_split_pytree 59.8810μs 28.3418μs 35.2836 KOps/s 35.0244 KOps/s $\color{#35bf28}+0.74\%$
test_split_td 0.1102ms 38.4256μs 26.0243 KOps/s 25.7914 KOps/s $\color{#35bf28}+0.90\%$
test_add_pytree 0.1002ms 34.6157μs 28.8887 KOps/s 28.9300 KOps/s $\color{#d91a1a}-0.14\%$
test_add_td 87.3810μs 50.6068μs 19.7602 KOps/s 23.4766 KOps/s $\textbf{\color{#d91a1a}-15.83\%}$
test_distributed 0.2244ms 69.3996μs 14.4093 KOps/s 13.0449 KOps/s $\textbf{\color{#35bf28}+10.46\%}$
test_tdmodule 0.1522ms 18.9778μs 52.6931 KOps/s 59.7125 KOps/s $\textbf{\color{#d91a1a}-11.76\%}$
test_tdmodule_dispatch 0.2475ms 38.9969μs 25.6431 KOps/s 29.2545 KOps/s $\textbf{\color{#d91a1a}-12.34\%}$
test_tdseq 50.4410μs 21.9222μs 45.6160 KOps/s 52.0546 KOps/s $\textbf{\color{#d91a1a}-12.37\%}$
test_tdseq_dispatch 0.1643ms 41.7902μs 23.9290 KOps/s 27.7456 KOps/s $\textbf{\color{#d91a1a}-13.76\%}$
test_instantiation_functorch 1.9493ms 1.6485ms 606.6295 Ops/s 606.6282 Ops/s $+0.00\%$
test_instantiation_td 1.6842ms 1.1541ms 866.5092 Ops/s 877.0856 Ops/s $\color{#d91a1a}-1.21\%$
test_exec_functorch 0.2969ms 0.1544ms 6.4758 KOps/s 6.5581 KOps/s $\color{#d91a1a}-1.26\%$
test_exec_functional_call 0.2232ms 0.1527ms 6.5497 KOps/s 6.6087 KOps/s $\color{#d91a1a}-0.89\%$
test_exec_td 0.2629ms 0.1429ms 6.9989 KOps/s 7.1438 KOps/s $\color{#d91a1a}-2.03\%$
test_exec_td_decorator 0.6029ms 0.1807ms 5.5342 KOps/s 5.5988 KOps/s $\color{#d91a1a}-1.15\%$
test_vmap_mlp_speed[True-True] 1.2033ms 1.0138ms 986.4302 Ops/s 988.2429 Ops/s $\color{#d91a1a}-0.18\%$
test_vmap_mlp_speed[True-False] 0.7508ms 0.5827ms 1.7160 KOps/s 1.7287 KOps/s $\color{#d91a1a}-0.73\%$
test_vmap_mlp_speed[False-True] 1.0878ms 0.9202ms 1.0867 KOps/s 1.0730 KOps/s $\color{#35bf28}+1.27\%$
test_vmap_mlp_speed[False-False] 0.6740ms 0.5089ms 1.9650 KOps/s 1.9342 KOps/s $\color{#35bf28}+1.59\%$
test_vmap_mlp_speed_decorator[True-True] 3.0253ms 2.3299ms 429.2046 Ops/s 439.8851 Ops/s $\color{#d91a1a}-2.43\%$
test_vmap_mlp_speed_decorator[True-False] 1.0708ms 0.6276ms 1.5933 KOps/s 1.3905 KOps/s $\textbf{\color{#35bf28}+14.58\%}$
test_vmap_mlp_speed_decorator[False-True] 0.1195s 2.1853ms 457.6132 Ops/s 524.3404 Ops/s $\textbf{\color{#d91a1a}-12.73\%}$
test_vmap_mlp_speed_decorator[False-False] 0.9041ms 0.5249ms 1.9051 KOps/s 1.8144 KOps/s $\color{#35bf28}+5.00\%$
test_vmap_transformer_speed[True-True] 12.7436ms 12.0126ms 83.2459 Ops/s 84.0837 Ops/s $\color{#d91a1a}-1.00\%$
test_vmap_transformer_speed[True-False] 8.4591ms 7.9846ms 125.2413 Ops/s 126.3768 Ops/s $\color{#d91a1a}-0.90\%$
test_vmap_transformer_speed[False-True] 12.6464ms 12.0337ms 83.1000 Ops/s 84.8331 Ops/s $\color{#d91a1a}-2.04\%$
test_vmap_transformer_speed[False-False] 8.2044ms 7.8196ms 127.8836 Ops/s 125.3573 Ops/s $\color{#35bf28}+2.02\%$
test_vmap_transformer_speed_decorator[True-True] 76.8457ms 74.3253ms 13.4544 Ops/s 14.1231 Ops/s $\color{#d91a1a}-4.73\%$
test_vmap_transformer_speed_decorator[True-False] 20.3982ms 19.0263ms 52.5589 Ops/s 53.3380 Ops/s $\color{#d91a1a}-1.46\%$
test_vmap_transformer_speed_decorator[False-True] 68.4116ms 65.5268ms 15.2609 Ops/s 15.7182 Ops/s $\color{#d91a1a}-2.91\%$
test_vmap_transformer_speed_decorator[False-False] 0.1525s 20.6943ms 48.3226 Ops/s 47.9643 Ops/s $\color{#35bf28}+0.75\%$

@vmoens vmoens added the release label Jan 31, 2024
@vmoens vmoens changed the title v0.3.0 release wheels [NOMErg] v0.3.0 release wheels Jan 31, 2024
@vmoens vmoens changed the title [NOMErg] v0.3.0 release wheels [NOMERG] v0.3.0 release wheels Jan 31, 2024
@vmoens vmoens closed this Feb 4, 2024
@vmoens vmoens deleted the release/0.3.0 branch October 21, 2024 14:02
@vmoens vmoens restored the release/0.3.0 branch October 21, 2024 14:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ciflow/binaries/all Build all wheels CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. release
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants