Deprecated Functions in Annotations_Pipeline & New Ensembl Notation #317

jscaber · 2017-03-14T21:41:52Z

I have rerun the annotations pipeline on the newest annotations.
Recent Ensembl releases introduce a change in identifier notation: the GTF still uses the unversioned "ENSPXXX" notation, whereas the .pep. and .cdna. files now carry transcript/protein identifiers with a version suffix, e.g. "ENSPXXX.1". Because these tables are combined at multiple steps, the identifiers no longer match and we need a solution:
Options:

Remove the ".1" version suffix when loading these databases (I have implemented this)
Create additional columns
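
For illustration, the first option could look roughly like the sketch below. This is not the code from the pull request; the names `strip_version` and `parse_fasta_ids` are hypothetical, and it assumes the suffix is always a dot followed by digits at the end of the identifier.

```python
import re

# Assumption: the version suffix is a trailing "." followed by digits,
# e.g. the ".1" in "ENSP00000354687.1".
_VERSION_SUFFIX = re.compile(r"\.\d+$")


def strip_version(identifier):
    """Return the identifier without a trailing version suffix (hypothetical helper)."""
    return _VERSION_SUFFIX.sub("", identifier)


def parse_fasta_ids(lines):
    """Yield unversioned identifiers from the header lines of a FASTA file.

    Only the first whitespace-delimited token of each ">" line is used;
    sequence lines and empty headers are skipped.
    """
    for line in lines:
        if not line.startswith(">"):
            continue
        fields = line[1:].split()
        if fields:
            yield strip_version(fields[0])
```

Applied while loading the .pep. and .cdna. files, this leaves identifiers that can be joined against the unversioned ones in the GTF. Identifiers without a suffix pass through unchanged.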

While looking at this I found another error that used to fail the pipeline. Pipeline_annotations relies on peptide2cdna methods, which in turn rely on an ancient library written by Andreas: alignment_light. The library still exists but has been massively rewritten. Given that the cdna/peptide functionality has not been used, I have taken all of these functions and methods out, as they would need to be rewritten from scratch if they are needed at all (what was their purpose?). The cdna fasta is now made from Ensembl cdna data only.

See Pull Request #318.
Also see the pull request in CGATPipelines: CGATOxford/CGATPipelines#312

AndreasHeger · 2017-03-14T22:00:22Z

Hi @jscaber, thanks, taking these out is fine. They were mostly useful for gene-prediction tasks and comparative genomics, which are not so much our focus now.

sebastian-luna-valero · 2017-03-15T08:53:23Z

Many thanks @jscaber and @AndreasHeger !

jscaber mentioned this issue Mar 14, 2017

Annotations Pipeline Fixes CGATOxford/CGATPipelines#312

Merged

jscaber added the bug label Mar 14, 2017

sebastian-luna-valero closed this as completed Mar 15, 2017
