-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update FunctionalAnnotationAggMember
class for compatibility with MetaP Aggregation tables and implement migrator
#2203
Conversation
…AnnotationAggMember
|
berkeley
-MERGER] Update FunctionalAnnotationAggMember
class for compatibility with MetaP Aggregation tables
Prior to merging / deploying this, I wanted to make sure that the needed changes also get rolled into Possibly related? microbiomedata/nmdc-aggregator#24 |
@shreddd - Yes that is the companion issue to keep the aggregator in step with this change. |
berkeley
-MERGER] Update FunctionalAnnotationAggMember
class for compatibility with MetaP Aggregation tablesFunctionalAnnotationAggMember
class for compatibility with MetaP Aggregation tables and implement migrator
@eecavanna or @brynnz22 - Do either of you have guidance for how the migrator should be named? It is currently |
Maybe |
Sorry about the delay! In a "workshop" all day today and Wednesday. I agree with @brynnz22's recommendation. The "from" version is the currently released schema version. The "to" version is still TBD. The value @brynnz22 suggested makes sense to me as a placeholder (sufficient for merging, in my opinion, given that we don't know what the next schema version will be). The requirement of putting the "to" version in the migrator puts us in this chicken-and-egg situation. I do plan to change this part of the procedure "soon" ™. |
@kheal @eecavanna Since I am working on a migrator right now for the |
Updated the migrator to be a partial for the 11.1 release. |
@turbomam - do you want to review this before we merge it in? |
yes, thanks. looking now. |
src/schema/nmdc.yaml
Outdated
count: | ||
range: integer | ||
description: The number of sequences (for a metagenome or metatranscriptome) or spectra (for metaproteomics) associated with the specified function. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
<3
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would like to move that definition for count to slot_usage
. The count
slot could easily be used with different meanings in other class contexts.
@kheal If you have tests that show valid and invalid re-use of either of the identifying slots, then I approve for merging. Update: I get the sense that illegally duplicated id combinations aren't tested yet. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This will be a great PR with a few tweaks. Let me know if you think any of my requests will be difficult (or harmful!)
src/schema/nmdc.yaml
Outdated
count: | ||
range: integer | ||
description: The number of sequences (for a metagenome or metatranscriptome) or spectra (for metaproteomics) associated with the specified function. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would like to move that definition for count to slot_usage
. The count
slot could easily be used with different meanings in other class contexts.
@turbomam. Thanks for the input. @turbomam or @sierra-moxon I could use some help with the test and/or the I've moved the description for the |
After consulting with @sierra-moxon, we've determined that uniqueness across objects is not supported in JSONSchema, and because linkml-validate uses the jsonschema serialization of the model to do validation, it is not checking the unique keys constraint. See #2155 for more discussion Because of the lack of verifiability on the |
This was moved from the berkeley fork to this repo. Original PR found here: microbiomedata#242
Description
This PR will
FunctionalAnnotationAggMember
class to be able to accomodate both annotations from metagenomics and metaproteomics analyses. This is accomplished by exchanging the slotmetagenome_anlaysis_id
with the existing slotwas_generated_by
.metagenome_anlaysis_id
slot as there are now no classes using this slot.count
slotAdd a unique_keys toSee Add unique_keys toFunctionalAnnotationAggMember
class that is a combination ofwas_generated_by
andgene_function_id
FunctionalAnnotationAggMember
class #2155 for why this was removed from this PR.This PR will close 1253
PR Information
What type of PR is this? (check all applicable)
slot
Related Issues
Did you add/update any tests?
Could this schema change make it so any valid data becomes invalid?
If you answered "Yes", does this PR branch include that migrator?
Does this PR have any downstream implications?
We will need to update the aggregation scripts (issue filed here: Update generate_metap_agg.py script to source ids from new slot nmdc-aggregator#13) and the data portal to use the new slot