Allow inclusion of multiple species in a single genome note #148

Open
BethYates opened this issue Oct 14, 2024 · 1 comment
Labels
enhancement (Improvement of the existing features), question (Further information is requested)

Comments

@BethYates
Collaborator

Description of feature

A question following the BGA genomenote session:

Stephania Sandoval: I had to leave the session a bit early today but I wanted to ask if the pipeline can be used for more than one species at a time. For example, if you have several species from the same genus and want to put them all in one genome note instead of individual notes, can that be incorporated into this pipeline? thank you!

@BethYates added the enhancement (Improvement of the existing features) label Oct 14, 2024
@BethYates
Collaborator Author

Hi Stephania, that's not something the pipeline can do directly at the moment, although we had been considering how to implement a similar use case: running the pipeline on different haplotypes of the same species and including the statistics and figures for both in a single genome note. I don't think it would be difficult to add the ability to process multiple species in one pipeline run. We would just need to run some of the subworkflows for each species, then combine all the information before passing it to a template whose placeholder names include the species each placeholder relates to.
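To make the "species-named placeholders" idea concrete, here is a minimal sketch in Python. It is not the pipeline's actual templating code: the `$species_NAME` placeholder syntax, the `combine_by_species` helper, and the values are all hypothetical, chosen only to show how per-species results could be merged under species-prefixed names and rendered into one document.

```python
from string import Template


def combine_by_species(per_species):
    """Flatten {species: {name: value}} into {"<species>_<name>": value}.

    Hypothetical helper: the naming scheme is an assumption, not the
    pipeline's real convention.
    """
    combined = {}
    for species, params in per_species.items():
        for name, value in params.items():
            combined[f"{species}_{name}"] = value
    return combined


# Illustrative stand-in values only; not real pipeline output.
per_species = {
    "a": {"ASSEMBLY_LENGTH": "LEN_A"},
    "b": {"ASSEMBLY_LENGTH": "LEN_B"},
}

# One template whose placeholder names carry the species they relate to.
template = Template(
    "Species a: $a_ASSEMBLY_LENGTH. Species b: $b_ASSEMBLY_LENGTH."
)

# substitute() raises KeyError if any species' value is missing,
# which makes an incomplete combined note fail loudly.
note = template.substitute(combine_by_species(per_species))
print(note)
```

Prefixing every key with the species keeps identically named statistics from different species from overwriting each other when they are merged into one parameter set.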

A possible workaround using the existing pipeline would be to run it sequentially for each species, each time passing it a template that already includes the output from the previous runs. So for species "a", "b" and "c": run the pipeline for species "a" using the base template, then run it for species "b" using the output document for species "a" as the template. Finally, for species "c", run the pipeline one last time using the output document from the species "b" run as the template. However, it would be much more straightforward to expand the pipeline to allow multiple species as inputs.
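The chaining described above can be sketched in Python as follows. This only simulates the idea and does not invoke the pipeline. It assumes the template uses distinct, species-prefixed placeholders and that each pass leaves placeholders it cannot fill untouched. The `fill_for_species` function, the `$species_NAME` syntax, and the values are hypothetical.

```python
from string import Template


def fill_for_species(document, species, params):
    """Fill only this species' placeholders; leave all others in place.

    Stands in for one pipeline run: the output document is reused as
    the template for the next species.
    """
    namespaced = {f"{species}_{name}": value for name, value in params.items()}
    # safe_substitute() keeps unknown placeholders instead of raising.
    return Template(document).safe_substitute(namespaced)


base_template = "a: $a_LENGTH; b: $b_LENGTH; c: $c_LENGTH"

# Illustrative stand-in values only; not real pipeline output.
runs = [
    ("a", {"LENGTH": "LEN_A"}),
    ("b", {"LENGTH": "LEN_B"}),
    ("c", {"LENGTH": "LEN_C"}),
]

document = base_template
for species, params in runs:
    # Output of the previous run becomes the template for this one.
    document = fill_for_species(document, species, params)

print(document)
```

The approach only works if an earlier run does not consume or strip the placeholders meant for later species, which is why the sketch uses `safe_substitute` rather than `substitute`.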

@muffato moved this from Todo to Triage in Genome After Party Oct 14, 2024
@muffato added the question (Further information is requested) label Dec 11, 2024
Projects
Status: Triage