Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fail to build FMIndex for sequence #479

Closed
nsyzrantsev opened this issue Jul 13, 2024 · 1 comment
Closed

Fail to build FMIndex for sequence #479

nsyzrantsev opened this issue Jul 13, 2024 · 1 comment

Comments

@nsyzrantsev
Copy link

nsyzrantsev commented Jul 13, 2024

Hi! Thank you very much for your tool! Seqkit always makes me happy :)

But I found a small bug.

Prerequisites

OS: ubuntu20.04, amd64 (personal computer)
Command:

seqkit grep -s -f barcodes.txt input.fastq -o output.fastq -m 1 -R -30:-1

barcodes.txt file contains such sequences:

AGTAGGCT
GTAGGCTC
CTGTACGA
TGTACGAC
GCACCAAG
CACCAAGC
TACGTTTC
ACGTTTCC

input.fastq contains such read:

@1
NNNNNNNNNNNNNNNNNNNNNNNNNNNN
+
############################

Bug

This command successfully be done without any problems:

seqkit grep -s -f barcodes.txt input.fq -o output.fq -m 1 -R 1:30

But this command fails:

seqkit grep -s -f barcodes.txt input.fq -o out.fq -m 1 -R -30:-1

[INFO] 8 patterns loaded from file
[ERRO] fail to build FMIndex for sequence: 1
@shenwei356
Copy link
Owner

Actually, it's a bug in sequence region parsing.

For -R -30:-1, it returns an empty sequence if the sequence is shorter than 30 (28 for sequence 1).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants