Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add OrthoSearch of reference files #76

Closed
Tonny-zhou opened this issue Sep 6, 2021 · 3 comments
Closed

Add OrthoSearch of reference files #76

Tonny-zhou opened this issue Sep 6, 2021 · 3 comments
Assignees
Labels
Milestone

Comments

@Tonny-zhou
Copy link

Dear all,
Thank you for this amazing annotation tool for bacterial genomes! I want to known if you can add an option for orthosearch if we have Genbank or Protein FASTA file(s) that we want to annotate genes from as the first priority.
This option has been supplied in softwares dfast-core as "--references" (https://github.com/nigyta/dfast_core) and prokka as "--proteins".

Thanks and regards,
Tonny_Z

@Tonny-zhou Tonny-zhou added the enhancement New feature or request label Sep 6, 2021
@oschwengers oschwengers self-assigned this Sep 6, 2021
@oschwengers oschwengers added this to the v1.2 milestone Sep 6, 2021
@oschwengers oschwengers added feature and removed enhancement New feature or request labels Sep 6, 2021
@oschwengers
Copy link
Owner

Hi @Tonny-zhou , thanks a lot for your kind feedback!
Indeed, I've already thought about this sort of feature and I'll address this in the next minor release (1.2). However, that might take a while.

oschwengers added a commit that referenced this issue Sep 8, 2021
oschwengers added a commit that referenced this issue Sep 8, 2021
oschwengers added a commit that referenced this issue Sep 8, 2021
oschwengers added a commit that referenced this issue Sep 9, 2021
oschwengers added a commit that referenced this issue Sep 9, 2021
oschwengers added a commit that referenced this issue Sep 9, 2021
@oschwengers
Copy link
Owner

oschwengers commented Sep 9, 2021

Hi @Tonny-zhou , I've implemented a --proteins option to allow users to provide a trusted set of protein sequences as a single Fasta file.

Sequences therein can be annotated with additional information using 2 formats: short and long as described here: https://github.com/oschwengers/bakta#input-and-output

Could you please try if this is working for you as expected? The homology search is implemented as an alignment using Diamond.

I might also implement an accompanying GenBank/EMBL converter somewhen in the future.

I hope that works for you.
Best regards!

@Tonny-zhou
Copy link
Author

Thank you so much!!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants