-
Notifications
You must be signed in to change notification settings - Fork 371
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MarkDuplicates strategy of flow based reads that looks only at the qualities close to the end of the read #1942
MarkDuplicates strategy of flow based reads that looks only at the qualities close to the end of the read #1942
Conversation
public enum FLOW_DUPLICATE_SELECTION_STRATEGY { | ||
FLOW_QUALITY_SUM_STRATEGY, | ||
FLOW_END_QUALITY_STRATEGY | ||
} | ||
@Argument(doc = "enable parameters and behavior specific to flow based reads.", optional = true) | ||
public boolean FLOW_MODE = false; | ||
|
||
@Argument(doc = "Use specific quality summing strategy for flow based reads. The strategy ensures that the same " + |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this doc still up to date with the new enum?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated
src/main/java/picard/sam/markduplicates/MarkDuplicatesForFlowHelper.java
Outdated
Show resolved
Hide resolved
@@ -236,8 +273,8 @@ static protected int getFlowSumOfBaseQualities(final SAMRecord rec, final int th | |||
final byte[] bases = rec.getReadBases(); | |||
|
|||
// create iteration range and direction | |||
final int startingOffset = !rec.getReadNegativeStrandFlag() ? 0 : bases.length; | |||
final int endOffset = !rec.getReadNegativeStrandFlag() ? bases.length : 0; | |||
final int startingOffset = !rec.getReadNegativeStrandFlag() ? 0 : bases.length - 1; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
was this previously a bug? I don't see how this change is related to allowing a range in the start position or changing the quality score calculation.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Was a bug indeed
@@ -59,6 +60,7 @@ public class MarkDuplicatesForFlowHelper implements MarkDuplicatesHelper { | |||
public static final char[] CLIPPING_TAG_CONTAINS_AQ = {'A', 'Q'}; | |||
public static final char[] CLIPPING_TAG_CONTAINS_QZ = {'Q', 'Z'}; | |||
|
|||
public static final int DIST_FROM_END = 10; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's worth mentioning that the distance from the end is hard coded as 10 bases for the FLOW_END_QUALITY_STRATEGY
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mentioned in the parameter description. Is this OK?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yup, perfect
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@meganshand - comments addressed, please take another look
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍
@@ -59,6 +60,7 @@ public class MarkDuplicatesForFlowHelper implements MarkDuplicatesHelper { | |||
public static final char[] CLIPPING_TAG_CONTAINS_AQ = {'A', 'Q'}; | |||
public static final char[] CLIPPING_TAG_CONTAINS_QZ = {'Q', 'Z'}; | |||
|
|||
public static final int DIST_FROM_END = 10; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yup, perfect
Description
UNPAIRED_START_UNCERTAINTY
in flow mode.FLOW_END_QUALITY_STRATEGY
.Checklist (never delete this)
Never delete this, it is our record that procedure was followed. If you find that for whatever reason one of the checklist points doesn't apply to your PR, you can leave it unchecked but please add an explanation below.
Content
Review
For more detailed guidelines, see https://github.com/broadinstitute/picard/wiki/Guidelines-for-pull-requests