Skip to content

Commit

Permalink
Merge pull request #385 from monarch-initiative/petty-doc-fixes
Browse files Browse the repository at this point in the history
Small documentation updates
  • Loading branch information
ielis authored Jan 6, 2025
2 parents f5c81e8 + ec0cd4d commit 53f14df
Show file tree
Hide file tree
Showing 2 changed files with 21 additions and 7 deletions.
26 changes: 20 additions & 6 deletions docs/tutorial.rst
Original file line number Diff line number Diff line change
Expand Up @@ -216,7 +216,7 @@ The diagram also highlights the protein features (domains, repeats, etc.).
Summarize all variant alleles
-----------------------------

We can prepare a table of all variant alleles that occurr in the cohort.
We can prepare a table of all variant alleles that occur in the cohort.

Each table row corresponds to a single allele and lists the variant key,
the predicted effect on the transcript (*cDNA*) and protein of interest,
Expand All @@ -240,11 +240,25 @@ with one or more variant alleles (*Count*):
Partition the cohort by genotype and phenotype
==============================================

To test for genotype-phenotype associations, we need to divide the cohort into classes.
In GPSEA, we always assign a cohort member into a genotype class,
where each individual is assigned into a single class and the classes do not overlap.
The phenotype is then used to either assign an individual into a class,
or to calculate a numeric score or survival.
Testing for a genotype-phenotype association uses genotype and phenotype as variables.
In GPSEA, the variable value for an individual is computed
either by a :class:`~gpsea.analysis.clf.Classifier`
or by a :class:`~gpsea.analysis.pscore.PhenotypeScorer`.
A `Classifier` assigns the individual into a class,
whereas a `PhenotypeScorer` computes a continuous score.
The classifiers and scorers are applied on all individuals of the cohort
and the resulting variable distributions are then assessed by a statistical test.

In GPSEA, genotype is always treated as a class
and a genotype `Classifier` is a prerequisite for each analysis.
However, there is much more flexibility on the phenotype part,
where either a `Classifier` or a `PhenotypeScorer` can be used to compute the values,
depending on the analysis goals.

In this tutorial section, we first configure a `Classifier` for assigning
the individuals into a genotype class,
and we follow with generating classifiers for testing the presence or exclusion
of HPO terms in the individuals.


Partition by genotype
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -71,7 +71,7 @@ function cuts down the tedium.
Example
-------

For a phenopacket collection (e.g. 156 patients with mutations in *WWOX* gene included in Phenopacket Store version `0.1.18`)
For a phenopacket collection (e.g. 156 patients with mutations in *TBX5* gene included in Phenopacket Store version `0.1.18`)

>>> from ppktstore.registry import configure_phenopacket_registry
>>> registry = configure_phenopacket_registry()
Expand Down

0 comments on commit 53f14df

Please sign in to comment.