-
Notifications
You must be signed in to change notification settings - Fork 717
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Subsampling with fastp - document and test #1096
Comments
Also @drpatelh notes:
|
We need to test if this really works and document if so. |
This works for me, when applied to the test profile, and I see it suggested in various places as an approach to down-sampling. I don't think paired-end should be an issue. Down-sampling might mean something slightly different to some people than what FASTP is doing- just taking the specified number of reads off the top of the FASTQ file(s) - but I'll add a line to the docs anyway. |
Fixed in #1309 |
Description of feature
It is very helpful when debugging pipelines to run on only a subset of reads, so that it fails fast.
@drpatelh suggested
This has advantages:
Disadvantages:
--reads_to_process
of reads so if there is anything weirdly sorted about the input .fastq file it could be unrepresentativeI'm going to create a separate issue ticket about including an explicit subsampling step in the pipeline, because I think it's a separate issue and something that I would routinely use (and tell all my students to use).
The text was updated successfully, but these errors were encountered: