pharokka protein crashed after completing mmseqs searches #300
Comments
Hi @luisalbertoc95 , Thanks for reporting this bug and using Pharokka! I see you're using Phables too :) I'm pretty sure this has to do with the VFDB naming (it's annoying :) ). Would you be able to do a few things:
George
Hi George, Thanks a lot for your suggestions. Running the code with --hmm_only worked! I'll send the vfdb_results.tsv to you. Thank you, Luis
Hi @luisalbertoc95, It took a while, but I solved this error: it was a bug in pharokka in how VFDB results were matched with the other outputs. If you re-run pharokka now it should work (though it seems you were happy enough with --hmm_only, so maybe you've moved on). George
Hello! I'm running pharokka 1.6.1 (fresh env and database install) and still receiving the same error (below). Running in --fast mode fixes the problem, so it seems to be related to the VFDB/CARD databases. Pharokka version: 1.6.1 Command:
Hi, I am having this issue as well on a fresh mamba+pharokka (1.7.1) install.
Same error. Adding
Hi @fluhus, how big is your input? Is it very small? I have a feeling this error may be because MMseqs2 found no hits at all. I’ll try and replicate later this week and put in a fix if so. george
Thanks for the quick response! Here is the input file (111K unzipped):
Hi @fluhus, I have narrowed down your error to the '#' in the header. If you remove this it will work. I'll put in a bug fix at some point :) George |
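For anyone hitting the same thing before a fix lands, here is a minimal sketch of the workaround, assuming the input is a plain-text FASTA file. The function names `strip_hash_from_headers` and `clean_fasta` are made up for illustration and are not part of pharokka; only header lines (those starting with `>`) are touched, and sequence lines are passed through unchanged.

```python
def strip_hash_from_headers(lines, replacement=""):
    """Yield FASTA lines with '#' replaced in header lines only.

    Hypothetical helper, not part of pharokka. By default the '#'
    is simply removed; pass e.g. replacement="_" to substitute it.
    """
    for line in lines:
        if line.startswith(">"):
            # Header line: drop (or substitute) every '#'.
            yield line.replace("#", replacement)
        else:
            # Sequence line: leave untouched.
            yield line


def clean_fasta(in_path, out_path, replacement=""):
    """Write a copy of the FASTA at in_path with '#' stripped from headers."""
    with open(in_path) as src, open(out_path, "w") as dst:
        dst.writelines(strip_hash_from_headers(src, replacement))
```

Usage would be something like `clean_fasta("input.faa", "input_clean.faa")`, then pointing pharokka at the cleaned file (both file names here are placeholders).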
Thanks for looking into this! I removed the |
Description
Hi @gbouras13, When running pharokka_proteins.py on a set of 755001 ORFs, I get an error caused by a length mismatch between the keys and the columns of a pandas DataFrame. According to the log file, all mmseqs searches completed.
Thank you!
What I Did
pharokka_proteins_1698789518.5425682.log