seqkit subseq
with bed file does properly handle uppercase and lowercase accession (not case-sensitive?)
#400
Labels
Hi Wei,
I am using seqkit v2.4.0 subseq to extract some sequences from NBCI NR, but I notice that some accessions are consistently missed (tried with
--id-ncbi
). A more detailed look shows that if two records share an accession (in lower-case) then one of them will be ignored (upper-case). Is it a bug?Update: I suspect this is caused by indexing the fasta, and the result is somehow random (with 50% ignoring the uppercase and 50% the lower, not always the uppercase).
seqtk subseq
does not indexing the fasta and seems not having this issue.bed:
output:
input:
Prerequisites
seqkit version
Describe your issue
The text was updated successfully, but these errors were encountered: