Skip to content
This repository has been archived by the owner on May 5, 2021. It is now read-only.

Drug Information

Gautier Koscielny edited this page Nov 12, 2018 · 1 revision

Index of files

ChEMBL Drug Evidence

Open Targets gets evidence from ChEMBL that connects drugs to protein targets and to disease indications based on clinical trials data.

The tsv and JSON files both have 399,301 lines and the column/field names are:

Column name Description
disease.id EFO ID for the disease
disease.efo_info.label Disease name
drug.molecule_name Drug name
drug.molecule_type Drug type
drug.id ChEMBL ID for drug
evidence.drug2clinic.max_phase_for_disease.numeric_index Maximum clinical trial phase for this drug-disease combination
evidence.target2drug.action_type Drug action
evidence.target2drug.mechanism_of_action Drug mechanism of action
target.id Ensembl gene ID
target.gene_info.symbol Gene symbol

Drug Adverse Events

Drug adverse events are available from the openFDA website for 1812 of the 2403 drugs in the Open Targets Platform. This data is provided in tab-separated and JSON format. The files contain up to 100 most frequent adverse events for each drug. The tsv file contains 145,132 rows and the columns in the tsv file are:

Column name Description
Drug name Compound name from ChEMBL
Drug ChEMBL ID ChEMBL ID of the drug
Adverse event Name of the adverse event from FAERS/openFDA
Adverse event count Number of times the adverse event has been reported for this drug

The JSON file contains 1812 lines, one for each drug. This is the structure of the JSON lines:

{
  "drug_name": "VINCRISTINE",
  "drug_id": "CHEMBL90555",
  "adverse_events": [
    {
      "term": "FEBRILE NEUTROPENIA",
      "count": 3781
    },
    {
      "term": "NEUTROPENIA",
      "count": 2685
    },
    {
      "term": "PYREXIA",
      "count": 2238
    }
   ]
}

Repurposed Drugs

Small dataset of curated repurposed drugs from the publications listed on the Wiki homepage.

Number of rows x columns: 66 rows x 9 columns

Column name Description
drug_name drug name
MOA mode of action (if known)
trade_name commercial name of the drug
original_indication original drug indication
originator originator (company name)
repurposed_indication repurposed drug indication
new_name new drug name (if known)
repurposed_innovator if known
comments from publication or Orphan disease