-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
A few issues #33
Comments
@crj32 Thank you for your suggestions. I really like those people giving feedbacks. Please don't hesitate to do so. And for 3 and 4, I think what you need is an option for output_gtf but not an output folder, right? I will start to cracking these issues. It will probably take a few days since I am also busy with other stuff. |
Yeah, just a custom named output .gtf would be nice. Right you are. |
What if when no annotation, do you prefer something like |
I don't mind because we are always using a reference, but I'd suggest making the names for these IDs the same as Stringtie uses incase any other tools require specific names for compatibility. I think without a reference, it is probably just going to be 'gene_id' and 'transcript_id'? I think you had that already. |
I've just noticed something, gene_id "ERR188044.1"; transcript_id "ERR188044.1.1", from Stringtie include the sample name in the syntax. You may have recognised this already. This is important as when it comes to merging the gtf's, as I have an error with strawberry using Taco to merge the .gtf files, because the .gtf format is not the same as Stringtie (https://tacorna.github.io/). This is an issue for me, and I expect other people, potentially. |
@crj32 I think I have completed all your suggested items. Can you have a try of the current master, commit id |
My VM I do my work on is down today unfortunately so we will have to wait and is it OK to get a precompiled binary instead of the source code? |
This is good, my computer is working again, I have compiled from source and will test the new version for you. |
Ruolin
That is useful with the adding the gene_id from the ensembl annotation, much faster than my python script to do it afterwards. I'll let you know how this pans out with the downstream tools I use. There are a few other minor issues though that would help make the tool better IMO:
gene_id "ERR188044.1"; transcript_id "ERR188044.1.1"; reference_id "NM_018390"; ref_gene_id "NM_018390"; ref_gene_name "PLCXD1"
So we have your tool's gene_id and transcript_id, plus the ref_gene_name and ref_gene_id from ensembl/ reference annotation.
;FPKM "NA";Frac "NA";TPM "NA";
Usually I run stringtie in 1 directory and it outputs its .gtf files into the same directory, each named according to the original file ID. This is easier than having them all in different directories, then having to rename and move them all afterwards, before I run cuffmerge. I actually use taco (https://www.nature.com/articles/nmeth.4078) instead of cuffmerge, it is supposed to be a lot better, and my results made more sense when using this.
I don't really want any log files to be outputted, I only need the .gtf file. I have to include extra code to clean all this up. Is it possible to have a parameter to get a single .gtf and nothing else?
Thanks for your time. These are just some ideas for you to review, I am keen on using strawberry in our work.
Chris
The text was updated successfully, but these errors were encountered: