Restore sequence retrieval under mouse cursor #62

josiahseaman · 2018-10-29T12:57:07Z

DDV originally had the ability to load the file from the server, then click on a particular spot and get the sequence from that position. This code was disabled during the design of TileLayout, but the inactivated code is all still there inside nucleotideNumber.js. This code needs to be reactivated and updated for the new FluentDNA feature set.

The main use case to optimize for is retrieving a 300bp sequence from a 1GB genome with multiple scaffolds. Changes might need to be made to handle multiple scaffolds, though currently the "nucleotide number" being output does match the character file position under the mouse cursor. Ideally, grabbing a small sequence should not require requesting the entire genome from the server (this is why the feature was removed in the first place). Instead, loading only the scaffold in question might be a healthy middle ground.

One static file solution would be to create a fasta directory and chunk the genome into 10MB chunks which could be retrieved independently. This would scale well with large genomes as well as draft genomes with millions of contigs. In Python, all the staging for this happens in TileLayout.output_fasta()

Unfortunately in my understanding, it's not possible to submit a dynamic query to the static file server we're using for FluentDNA. In terms of IO speeds, the use running the server locally and browsing a new private genome will be a fairly common use case.

Notes: It might help that in ContigSpacingJSON a contig.name will contain the name of the cursor contig found in nucleotide_coordinates_to_sequence_index(). #39 is relevant in that we are going to need to change ContigSpacingJSON formatting to list the name of the contig, then the local (rather than file) nucleotide index.

Old functionality re enabled
Fetching less than a full genome
Chunking large chromosomes or dynamic query?

The text was updated successfully, but these errors were encountered:

josiahseaman · 2018-11-14T10:18:56Z

TransposonLayout and MultipleAlignmentLayout (MSA) currently not supported.
TransposonLayout composes a source from many fragments.
MultipleAlignmentLayout source is multiple files (feature pending).

josiahseaman · 2018-11-14T13:43:36Z

5602fd0

Putting the mouse in the upper left corner gets you the first letter of the sequence with a look-back of the contig name. All whitespace has been scrubbed, which in this case looks a bit off. This is based on file coordinates. Contig coordinates are the next priority.

…s. Still needs smoother newline handling.

…ove junk labels

…of sequence, verified +1 index. Fighting with CSS.

…s. Updating to DNASkittleUtils 1.0.11

…asta sources and origins of layouts

…ated origin references yet.

…or, fetch that file, find the contig, and show the sequence in that contig. Each fasta source has it's own coordinate frame listed under ContigSpacingJSON, fasta_sources, and each_layout.

…ent files have the same contig name.

… columns in the current layout

…ngle a web of merges

…acking. Should work with mouse.

… vertical orientation when advantageous

…ayout now migrated as a subclass of LayoutFrame

…ray code logic to track reversing coordinate frames on odd numbers.

josiahseaman added the enhancement label Oct 29, 2018

josiahseaman assigned photomedia Oct 29, 2018

josiahseaman added this to the Publication milestone Oct 29, 2018

josiahseaman added a commit that referenced this issue Nov 15, 2018

#62 Brought back old js files for sequence retrieval.

999c961

josiahseaman added a commit that referenced this issue Nov 15, 2018

#62 First working version of sequence display from nucleotideNumber.j…

5602fd0

…s. Still needs smoother newline handling.

josiahseaman added a commit that referenced this issue Nov 15, 2018

#62 read_contigs to a dictionary of sequences. Updated example to rem…

69d08bf

…ove junk labels

josiahseaman added a commit that referenced this issue Nov 15, 2018

#62 providing contig_name in data, cleaning up code

38a1666

josiahseaman added a commit that referenced this issue Nov 15, 2018

#62 Showing Contig_name in sequence display, special cased beginning …

14283ec

…of sequence, verified +1 index. Fighting with CSS.

josiahseaman added a commit that referenced this issue Nov 15, 2018

#62 Smoother handling of titles mouse over #66

72b94c0

josiahseaman added a commit that referenced this issue Nov 15, 2018

#62 Breaking fasta file into chunks files and retrieving all sequence…

21e127a

…s. Updating to DNASkittleUtils 1.0.11

josiahseaman added a commit that referenced this issue Nov 15, 2018

#62 Streaming in full contig chunk files as you mouse over them.

c0e499d

josiahseaman added a commit that referenced this issue Nov 16, 2018

#62 Downloading one contig at a time.

c12aace

josiahseaman added a commit that referenced this issue Nov 16, 2018

#62 WIP: Working on AnnotatedTrack sequence retreival with multiple f…

3e7280b

…asta sources and origins of layouts

josiahseaman added a commit that referenced this issue Nov 21, 2018

#62 WIP Switched ParallelGenomeLayout to LayoutFrame but haven't migr…

5d77344

…ated origin references yet.

josiahseaman added a commit that referenced this issue Nov 21, 2018

#62 js.contigs now has an array position for each file in case differ…

2db0ff4

…ent files have the same contig name.

josiahseaman added a commit that referenced this issue Nov 23, 2018

#62 BioJS sequence display dynamically changes to match the number of…

3e0f750

… columns in the current layout

josiahseaman added a commit that referenced this issue Nov 23, 2018

#62 #64 WIP: AnnotatedTrackLayout combined with custom_layout to unta…

632c201

…ngle a web of merges

josiahseaman added a commit that referenced this issue Nov 23, 2018

#64 #39 #62: MSA layout now using Layouts refactor with new origin tr…

78ab00f

…acking. Should work with mouse.

josiahseaman added a commit that referenced this issue Nov 29, 2018

#64 #39 #62: Reinstated tiltes in AnnotationTrackLayout. Switching to…

d43c006

… vertical orientation when advantageous

josiahseaman added a commit that referenced this issue Dec 4, 2018

#64 #39 #62: Fixed handle_multi_column labels in HighlightedAnnotation

376f748

josiahseaman added a commit that referenced this issue Dec 4, 2018

#64 #39 #62: Fixed handle_multi_column labels in Ideogram. Ideogram L…

665bcc8

…ayout now migrated as a subclass of LayoutFrame

josiahseaman added a commit that referenced this issue Dec 4, 2018

#64 #39 #62: Mouse over functionality for Ideogram. Javascript uses g…

44c664e

…ray code logic to track reversing coordinate frames on odd numbers.

josiahseaman mentioned this issue Dec 4, 2018

Sequence Retreival for Multiple Sequence Alignment #71

Closed

josiahseaman closed this as completed Dec 12, 2018

josiahseaman mentioned this issue Jan 25, 2019

Chromosome Streaming #78

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restore sequence retrieval under mouse cursor #62

Restore sequence retrieval under mouse cursor #62

josiahseaman commented Oct 29, 2018 •

edited

Loading

josiahseaman commented Nov 14, 2018

josiahseaman commented Nov 14, 2018 •

edited

Loading

Restore sequence retrieval under mouse cursor #62

Restore sequence retrieval under mouse cursor #62

Comments

josiahseaman commented Oct 29, 2018 • edited Loading

josiahseaman commented Nov 14, 2018

josiahseaman commented Nov 14, 2018 • edited Loading

josiahseaman commented Oct 29, 2018 •

edited

Loading

josiahseaman commented Nov 14, 2018 •

edited

Loading