-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Large difference in quality of the assembly using different filtering methods #46
Comments
Dear Valery, Happy new year too! Wow that's a huge difference! The tormes version does not matter it is not the last one, as the code for those tools did not change.
If there are specific options you would like to test, let me know and we can arrange so ;) You can also use already quality-trimmed reads in tormes. Just include them in your metadata file and reduce the I would recommend you to use some read quality control software, such as FastQC and MultiQC to visualize the quality of your reads. These two tools generate very nice html reports with all the quality information of your reads, adapter content, etc. This will be included in the next release of tormes, but you can run it on your system as:
Hope this helps! |
I could send the reads of this sample, before I had some strange samples with more that 700 contigs too but didn't check them again with shovill for example. |
Hi Valery, Sorry for the late reply.... we are under some deadlines and my time is limited these days... Just curious, have you checked the taxonomy of your reads to discard is a contaminated isolate?
Did you also check the quality with the FASTQC and MULTIQC tools I told you before? We are working in a big release of TORMES, and will definitely investigate this issue with prinseq in detail. In any case, if you get better QC with other tools not in tormes, such as TrimGallore and/or shovill, you could always import the genome to tormes for the downstream analysis. Let me know if I can help you with this! |
Hello, Narciso,
The tool is great! thank you again)
One question to discuss.
When I'm using filtering by default (prinseq), I have near thousand of contigs and the warning from Spades:
Too many erroneous kmers, the estimates might be unreliable
I I use --filtering trimmomatic, I have 111 contigs and no warnings.
How could be so? What is the exact difference between these methods?
The isolate is the same, the version is old 1.2.1
And Happy New Year!
Best regards,
Valery
The text was updated successfully, but these errors were encountered: