DRAFT of AuxAlignment for parsing secondary/supplementary alignments #208

clintval · 2024-12-31T23:00:58Z

This is a draft PR to explore a common hierarchy for "alignments from SAM tags".

Motivated by:

Add a class for parsing secondary alignments from XA/XB #206 (comment)

The diff is ugly but there are only 3 classes. I suggest reviewing like:

AuxAlignment: abstract parent with some common concrete shared functions
SecondaryAlignment: functionality specific to secondary alignments
SupplementaryAlignment: functionality specific to supplementary alignments

I was able to make almost everything backwards compatible except:

SupplementaryAlignment is now a dataclass vs attrs
SupplementaryAlignment had a few init fields renamed

@tfenne what do you think?

I'm looking for a quick 👍 👎 so I can abandon the idea, or merge it into my actual feature branch for additional polish and a real round of review.

codecov · 2024-12-31T23:01:13Z

Codecov Report

Attention: Patch coverage is 88.88889% with 20 lines in your changes missing coverage. Please review.

Project coverage is 90.84%. Comparing base (b0b4227) to head (6d887a1).

Files with missing lines	Patch %	Lines
fgpyo/sam/__init__.py	88.88%	16 Missing and 4 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #208      +/-   ##
==========================================
- Coverage   91.06%   90.84%   -0.22%     
==========================================
  Files          18       18              
  Lines        2283     2437     +154     
  Branches      337      355      +18     
==========================================
+ Hits         2079     2214     +135     
- Misses        133      149      +16     
- Partials       71       74       +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

nh13

I did not duplicate my comments from #206, so please incorporate those too as appropriate.

Instead of two sub-classes of AuxAlignment, what about storing an optional enum for if the alignment is secondary or supplementary (or None if neither or unknown)? Then for the from_tag(value: str) method, you could condition on if it's 4 values (XA), or 6 values with strand as the third item (SA), or 6 values with the cigar as the third item (XB). Alternatively, could you explain the motivation for the three classes beyond how they parse differently (which I think is resolved above)? Do we want to match on the class/type?

fgpyo/sam/__init__.py

nh13 · 2025-01-10T04:12:03Z

fgpyo/sam/__init__.py

+        if self.alignment_score is not None and self.alignment_score < 0:
+            errors.append(f"Alignment score cannot be less than 0! Found: {self.alignment_score}")


why not? I think a negative alignment score is aligned, especially in a global alignment.

I'd like to learn more about this concern! I haven't seen a score below zero for any output from bwa so I figured it would be a sign that data was corrupt. Couple questions:

Is there a way to run bwa in which the score can go below zero?

Knowing that this code will probably parse bwa tags for the forseeable future, do you think keeping the strict check is better for safety-sake or would hinder someone in the future with a popular or custom global aligner putting auxiliary alignments in SAM tags?

clintval · 2025-01-10T14:33:25Z

Comments from Nils in the other PR:

…M tags

clintval · 2025-01-10T16:51:33Z

Closing this and will open a clean PR with a refactor.

clintval assigned tfenne Dec 31, 2024

clintval mentioned this pull request Jan 6, 2025

Add a class for parsing secondary alignments from XA/XB #206

Closed

nh13 reviewed Jan 10, 2025

View reviewed changes

clintval changed the title ~~feat: secondary/supplementary alignments inherit from AuxAlignment~~ feat: add AuxAlignment for parsing secondary/supplementary alignments in SAM tags Jan 10, 2025

clintval changed the title ~~feat: add AuxAlignment for parsing secondary/supplementary alignments in SAM tags~~ Add AuxAlignment for parsing secondary/supplementary alignments in SAM tags Jan 10, 2025

clintval force-pushed the cv_aux_alignment_hierarchy branch from 5277047 to 6d887a1 Compare January 10, 2025 14:53

clintval changed the base branch from cv_secondary_from_xb to main January 10, 2025 14:54

coderabbitai bot approved these changes Jan 10, 2025

View reviewed changes

clintval force-pushed the cv_aux_alignment_hierarchy branch from 6d887a1 to 2369449 Compare January 10, 2025 16:48

Add AuxAlignment for parsing secondary/supplementary alignments in SA…

302bf2a

…M tags

clintval force-pushed the cv_aux_alignment_hierarchy branch from 2369449 to 302bf2a Compare January 10, 2025 16:51

clintval changed the title ~~Add AuxAlignment for parsing secondary/supplementary alignments in SAM tags~~ DRAFT of AuxAlignment for parsing secondary/supplementary alignments Jan 10, 2025

clintval closed this Jan 10, 2025

clintval mentioned this pull request Jan 10, 2025

Add AuxAlignment for parsing secondary/supplementary alignments in SAM tags #209

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DRAFT of AuxAlignment for parsing secondary/supplementary alignments #208

DRAFT of AuxAlignment for parsing secondary/supplementary alignments #208

clintval commented Dec 31, 2024 •

edited

Loading

codecov bot commented Dec 31, 2024 •

edited

Loading

nh13 left a comment

nh13 Jan 10, 2025

clintval Jan 10, 2025

clintval commented Jan 10, 2025 •

edited

Loading

clintval commented Jan 10, 2025

		if self.alignment_score is not None and self.alignment_score < 0:
		errors.append(f"Alignment score cannot be less than 0! Found: {self.alignment_score}")

DRAFT of AuxAlignment for parsing secondary/supplementary alignments #208

DRAFT of AuxAlignment for parsing secondary/supplementary alignments #208

Conversation

clintval commented Dec 31, 2024 • edited Loading

codecov bot commented Dec 31, 2024 • edited Loading

Codecov Report

nh13 left a comment

Choose a reason for hiding this comment

nh13 Jan 10, 2025

Choose a reason for hiding this comment

clintval Jan 10, 2025

Choose a reason for hiding this comment

clintval commented Jan 10, 2025 • edited Loading

clintval commented Jan 10, 2025

clintval commented Dec 31, 2024 •

edited

Loading

codecov bot commented Dec 31, 2024 •

edited

Loading

clintval commented Jan 10, 2025 •

edited

Loading