Implement `PP3` and `BP4` criteria #77

gromdimon · 2024-04-26T07:54:09Z

Is your feature request related to a problem? Please describe.
Implement PP3 and BP4 criteria

Describe the solution you'd like

Investigate
Make plan

Describe alternatives you've considered
N/A

Additional context

PP3

PP3 (in silico predictions)

Original Definition

Multiple lines of computational evidence support a deleterious effect on the gene or gene product (conservation, evolutionary, splicing impact, etc).

Caveats:

- As many in silico algorithms use the same or very similar input for their predictions, each algorithm should not be counted as an independent criterion.
- PP3 can be used only once in any evaluation of a variant.

-- Richards et al. (2015); Table 4

Preconditions / Precomputations

If the criterion PVS1 was triggered then this criterion is skipped.
If the variant is on chrMT then it is skipped, as we don't have calibration for chrMT yet.
If the variant is not found in dbNSFP or CADD precomputed scores then it is skipped as we don't have calibration for chrMT yet.

Implemented Criterion

An initial prediction is fist done using the general purpose pathogenicity predictors.

If we have a score from the following, then the prediction is used (in descending order of priority):
- REVEL, MutPred2, CADD, BayesDel, VEST4, ..., PhyloP
- we will use the modifiers from :footcite:t:pejaver:2022
If predictions are missing then then PhyloP of the position of the variant is used as a fallback.

Then, for splicing the following is done.

If a SpliceAI prediction is performed then it is interpreted according to :footcite:t:walker:2023.

The highest-scoring variant is used for the final prediction.

User Report

The scores and predictions from the predictors.

Caveats

As described in :ref:acmg_seqvars_criteria-patho-predictions, we are currently limited to the precomputed threshold from the literature.
This hinders us in adopting AlphaMissense effectively, for example.
We need to compute accuracy to rank the implemented methods.
We need our own calibration for chrMT.

Notes

This criterion is similar to :ref:acmg_seqvars_criteria-bp4

BP4

BP4 (in silico predictions)

.. note::

- we have not implemented MitoTip or MitImpact yet
- we are lacking phylop scores yet
- we don't have live CADD scores yet

Original Definition

Multiple lines of computational evidence suggest no impact on gene or gene product (conservation, evolutionary, splicing impact, etc).

Caveat: As many in silico algorithms use the same or very similar input for their predictions, each algorithm cannot be counted as an independent criterion.
BP4 can be used only once in any evaluation of a variant.

-- Richards et al. (2015); Table 4

Preconditions / Precomputations

If the criterion BA1 triggered then this criterion is skipped.
If the variant is on chrMT then it is skipped, as we don't have calibration for chrMT yet.
If the variant is not found in dbNSFP or CADD precomputed scores then it is skipped as we don't have calibration for chrMT yet.

Implemented Criterion

See :ref:acmg_seqvars_criteria-pp3 for details.

User Report

See :ref:acmg_seqvars_criteria-pp3 for details.

Literature

See :ref:acmg_seqvars_criteria-pp3 for details.

Caveats

See :ref:acmg_seqvars_criteria-pp3 for details.

Notes

This criterion is similar to :ref:acmg_seqvars_criteria-pp3

Intervar

PP3 and BP4 by Automated Scoring
When multiple pieces of computational evidence support a deleterious effect on the gene or gene product (conservation, evolutionary, splicing impact, etc.), then the supporting pathogenic evidence of PP3 will be assigned as 1. In comparison, when multiple pieces of computational evidence suggest no impact on the gene or gene product, then supporting benign evidence of BP4 will be assigned as 1. All sets of in silico results must agree when PP3 or BP4 is assigned.
These multiple pieces of computational evidence can be provided by ANNOVAR from the “dbnsfp30a” database, where the MetaSVM score16 is used for deleteriousness prediction and GERP++ is used for evolutionary conservation. The splicing impacts can be inferred by ANNOVAR from the “dbscsnv11” database. For the evidence of PP3 and BP4, we set the cutoff to 0.0 for MetaSVM scores (greater scores indicate more likely deleterious effects), 2.0 for GERP++_RS (smaller scores indicate less conservation), and 0.6 for adaptive boosting (ADA) and random forest (RF) scores of dbscSNV as splicing impact (larger scores indicate more likely splice altering).

The text was updated successfully, but these errors were encountered:

gromdimon added the enhancement New feature or request label Apr 26, 2024

gromdimon added this to the 0.1.0 milestone Apr 26, 2024

gromdimon self-assigned this Apr 26, 2024

gromdimon added the automatable label Apr 26, 2024

gromdimon mentioned this issue May 28, 2024

Init pp3 and bp4 implementation #115

Merged

gromdimon linked a pull request May 28, 2024 that will close this issue

Init pp3 and bp4 implementation #115

Merged

gromdimon closed this as completed in #115 May 28, 2024

gromdimon mentioned this issue May 30, 2024

Finish AutoPP3BP4 #118

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement `PP3` and `BP4` criteria #77

Implement `PP3` and `BP4` criteria #77

gromdimon commented Apr 26, 2024

Implement PP3 and BP4 criteria #77

Implement PP3 and BP4 criteria #77

Comments

gromdimon commented Apr 26, 2024

PP3

PP3 (in silico predictions)

Original Definition

Preconditions / Precomputations

Implemented Criterion

User Report

Caveats

Notes

BP4

BP4 (in silico predictions)

Original Definition

Preconditions / Precomputations

Implemented Criterion

User Report

Literature

Caveats

Notes

Intervar

Intervar

Implement `PP3` and `BP4` criteria #77

Implement `PP3` and `BP4` criteria #77