-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Restore sequence retrieval under mouse cursor #62
Comments
TransposonLayout and MultipleAlignmentLayout (MSA) currently not supported. |
Putting the mouse in the upper left corner gets you the first letter of the sequence with a look-back of the contig name. All whitespace has been scrubbed, which in this case looks a bit off. This is based on file coordinates. Contig coordinates are the next priority. |
…s. Still needs smoother newline handling.
…of sequence, verified +1 index. Fighting with CSS.
…s. Updating to DNASkittleUtils 1.0.11
…asta sources and origins of layouts
…or, fetch that file, find the contig, and show the sequence in that contig. Each fasta source has it's own coordinate frame listed under ContigSpacingJSON, fasta_sources, and each_layout.
…ent files have the same contig name.
…acking. Should work with mouse.
… vertical orientation when advantageous
…ayout now migrated as a subclass of LayoutFrame
…ray code logic to track reversing coordinate frames on odd numbers.
DDV originally had the ability to load the file from the server, then click on a particular spot and get the sequence from that position. This code was disabled during the design of TileLayout, but the inactivated code is all still there inside
nucleotideNumber.js
. This code needs to be reactivated and updated for the new FluentDNA feature set.The main use case to optimize for is retrieving a 300bp sequence from a 1GB genome with multiple scaffolds. Changes might need to be made to handle multiple scaffolds, though currently the "nucleotide number" being output does match the character file position under the mouse cursor. Ideally, grabbing a small sequence should not require requesting the entire genome from the server (this is why the feature was removed in the first place). Instead, loading only the scaffold in question might be a healthy middle ground.
One static file solution would be to create a fasta directory and chunk the genome into 10MB chunks which could be retrieved independently. This would scale well with large genomes as well as draft genomes with millions of contigs. In Python, all the staging for this happens in TileLayout.output_fasta()
Unfortunately in my understanding, it's not possible to submit a dynamic query to the static file server we're using for FluentDNA. In terms of IO speeds, the use running the server locally and browsing a new private genome will be a fairly common use case.
Notes: It might help that in
ContigSpacingJSON
acontig.name
will contain the name of the cursor contig found innucleotide_coordinates_to_sequence_index()
. #39 is relevant in that we are going to need to changeContigSpacingJSON
formatting to list the name of the contig, then the local (rather than file) nucleotide index.The text was updated successfully, but these errors were encountered: