Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement PP3 and BP4 criteria #77

Closed
2 tasks
gromdimon opened this issue Apr 26, 2024 · 0 comments · Fixed by #115
Closed
2 tasks

Implement PP3 and BP4 criteria #77

gromdimon opened this issue Apr 26, 2024 · 0 comments · Fixed by #115
Assignees
Labels
automatable enhancement New feature or request
Milestone

Comments

@gromdimon
Copy link
Collaborator

Is your feature request related to a problem? Please describe.
Implement PP3 and BP4 criteria

Describe the solution you'd like

  • Investigate
  • Make plan

Describe alternatives you've considered
N/A

Additional context

PP3

PP3 (in silico predictions)

Original Definition

Multiple lines of computational evidence support a deleterious effect on the gene or gene product (conservation, evolutionary, splicing impact, etc).

Caveats:

- As many in silico algorithms use the same or very similar input for their predictions, each algorithm should not be counted as an independent criterion.
- PP3 can be used only once in any evaluation of a variant.

-- Richards et al. (2015); Table 4

Preconditions / Precomputations

  • If the criterion PVS1 was triggered then this criterion is skipped.
  • If the variant is on chrMT then it is skipped, as we don't have calibration for chrMT yet.
  • If the variant is not found in dbNSFP or CADD precomputed scores then it is skipped as we don't have calibration for chrMT yet.

Implemented Criterion

An initial prediction is fist done using the general purpose pathogenicity predictors.

  • If we have a score from the following, then the prediction is used (in descending order of priority):
    • REVEL, MutPred2, CADD, BayesDel, VEST4, ..., PhyloP
    • we will use the modifiers from :footcite:t:pejaver:2022
  • If predictions are missing then then PhyloP of the position of the variant is used as a fallback.

Then, for splicing the following is done.

  • If a SpliceAI prediction is performed then it is interpreted according to :footcite:t:walker:2023.

The highest-scoring variant is used for the final prediction.

User Report

  • The scores and predictions from the predictors.

Caveats

  • As described in :ref:acmg_seqvars_criteria-patho-predictions, we are currently limited to the precomputed threshold from the literature.
    This hinders us in adopting AlphaMissense effectively, for example.
  • We need to compute accuracy to rank the implemented methods.
  • We need our own calibration for chrMT.

Notes

  • This criterion is similar to :ref:acmg_seqvars_criteria-bp4

BP4

BP4 (in silico predictions)

.. note::

- we have not implemented MitoTip or MitImpact yet
- we are lacking phylop scores yet
- we don't have live CADD scores yet

Original Definition

Multiple lines of computational evidence suggest no impact on gene or gene product (conservation, evolutionary, splicing impact, etc).

Caveat: As many in silico algorithms use the same or very similar input for their predictions, each algorithm cannot be counted as an independent criterion.
BP4 can be used only once in any evaluation of a variant.

-- Richards et al. (2015); Table 4

Preconditions / Precomputations

  • If the criterion BA1 triggered then this criterion is skipped.
  • If the variant is on chrMT then it is skipped, as we don't have calibration for chrMT yet.
  • If the variant is not found in dbNSFP or CADD precomputed scores then it is skipped as we don't have calibration for chrMT yet.

Implemented Criterion

See :ref:acmg_seqvars_criteria-pp3 for details.

User Report

See :ref:acmg_seqvars_criteria-pp3 for details.

Literature

See :ref:acmg_seqvars_criteria-pp3 for details.

Caveats

See :ref:acmg_seqvars_criteria-pp3 for details.

Notes

  • This criterion is similar to :ref:acmg_seqvars_criteria-pp3

Intervar

Intervar

PP3 and BP4 by Automated Scoring
When multiple pieces of computational evidence support a deleterious effect on the gene or gene product (conservation, evolutionary, splicing impact, etc.), then the supporting pathogenic evidence of PP3 will be assigned as 1. In comparison, when multiple pieces of computational evidence suggest no impact on the gene or gene product, then supporting benign evidence of BP4 will be assigned as 1. All sets of in silico results must agree when PP3 or BP4 is assigned.
These multiple pieces of computational evidence can be provided by ANNOVAR from the “dbnsfp30a” database, where the MetaSVM score16 is used for deleteriousness prediction and GERP++ is used for evolutionary conservation. The splicing impacts can be inferred by ANNOVAR from the “dbscsnv11” database. For the evidence of PP3 and BP4, we set the cutoff to 0.0 for MetaSVM scores (greater scores indicate more likely deleterious effects), 2.0 for GERP++_RS (smaller scores indicate less conservation), and 0.6 for adaptive boosting (ADA) and random forest (RF) scores of dbscSNV as splicing impact (larger scores indicate more likely splice altering).

@gromdimon gromdimon added the enhancement New feature or request label Apr 26, 2024
@gromdimon gromdimon added this to the 0.1.0 milestone Apr 26, 2024
@gromdimon gromdimon self-assigned this Apr 26, 2024
@gromdimon gromdimon linked a pull request May 28, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
automatable enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant