-
Notifications
You must be signed in to change notification settings - Fork 354
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance of methylation pipeline #3304
Comments
Some samples failed extractor step: bismark_methylation_extractor \
--no_overlap \
--comprehensive \
--cytosine_report \
--genome_folder /genomes/Hsapiens/hg38/bismark/ \
--merge_non_CpG \
--multicore 1 \
--buffer_size 5G \
--bedGraph \
--gzip
/path/work/dedup/sample/sample.nsorted.deduplicated.bam [FATAL ERROR:] The IDs of Read 1 and Read 2 are not the same. |
it seems to happen in some samples because we do
step 4 fails with sorted reads, but for 3 we have to sort. the solution is to skip sorting and deduplication or do step4 in a single-end mode (-s). |
Lambda phage discussion: FelixKrueger/Bismark#361 |
Finished the test cohort + vs Lambda genome. |
Users reported slowdowns of methylation pipeline on trim_galore and bismark steps.
solution: with
bcbio_nextgen.py -n 4
trim_galore runs cutadapt with 1 thread and it is very slow. Increasing bcbio treads speeds up trim_galore.https://bcbio-nextgen.readthedocs.io/en/latest/contents/methylation.html#benchmarking
The currently recommended combination is 16/2/100G.
#3303
#3301
S
The text was updated successfully, but these errors were encountered: