-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
is pyo3_branchwater much slower than sra_search?? #9
Comments
first cut, I see that mean load looks substantially lower in pyo3_branchwater than in sra_search - see the first original: https://github.com/dib-lab/2022-branchwater-benchmarking/blob/main/01.basic-benchmarks.ipynb left is sra_search, right is pyo3_branchwater. cpu_time is ~comparable (lower in pyo3_branchwater!) but mean load is MUCH smaller in the newer code base. sigh. ![]() ![]() |
ok, with @mr-eyes latest,
|
with sourmash-bio/sourmash_plugin_branchwater@ae0b26a, multithreading also seemed to work fine - I accidentally ran it with 64 threads tho ;)
|
hmm. definitely something funky going on up above tho. user time seems whack. parallelization is not good? |
Could you please get the same 64 cores run output using sourmash-bio/pyo3_branchwater@ |
with sourmash-bio/sourmash_plugin_branchwater@d7086d3:
|
I now have a strong suspicion that the above benchmarks were done with "unoptimized" or dev mode builds - I wasn't installing wheels from So I'm rerunning the last benchmark to see if there's a difference :) |
hmm. no difference I think. hmm.
|
so frustrating - the original code is super fast! and low memory! 😭 I feel like this has to be at least partly an optimization thing.
|
ok, I think I figured it out - after I got
we are now faster than the original searcher binary, but consume a bit more memory. anyway, I think this means we should not be calculating jaccard 😆 . |
old code,
|
(no! it is faster!) |
with pyo3_branchwater 0.7.0:
|
branchwater benchmarking for 5 searches of 1,000 genomes x 10,000 metagenomes
old code,
sra_search
new code,
pyo3_branchwater
v ~0.5.1first cut observations
time is ...a LOT slower. Not sure why that would be!? maybe need to look at other stats too, around efficiency and so on.
why would memory decrease?!
and esp why would I/O decrease?
maybe threading isn't working properly in pyo3_Branchwater?
The text was updated successfully, but these errors were encountered: