wolfderby edited this page Sep 7, 2022 · 44 revisions

Getting Started

Instructions for running the Pentaho ETL process that standardizes FAERS data and generates safety signals

System Prerequisites

Reference Data Prerequisites

  • a license for the latest available OHDSI CDM V5 Vocabulary tables from the OHDSI Athena website

Your machine's file layout:

image

  • data_from_s3 (created by Pentaho)

  • faersdbstats (created by cloning the repo)

  • logs (created by Pentaho in stage_1_setup.kjb)

  • my_config.conf (you create this from example_conf.conf)
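
As a minimal sketch of the last item, the config file can be created by copying the example file. The helper name `make_config` is hypothetical, and the location of example_conf.conf is an assumption (this page does not say where it lives), so both paths are passed in explicitly:

```shell
# Hypothetical helper: create my_config.conf from example_conf.conf.
# $1 = path to example_conf.conf (assumed location, adjust to your clone)
# $2 = path of the config file to create
make_config() {
  src="$1"
  dest="$2"
  # Do not overwrite a config file you have already edited.
  if [ -e "$dest" ]; then
    echo "keeping existing $dest"
    return 0
  fi
  cp "$src" "$dest"
}
```

For example, `make_config faersdbstats/example_conf.conf my_config.conf` would apply if the example file sits at the top level of the clone, which is an assumption here. Edit the copy afterwards to set the variables for your use case.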

AWS S3 FAERS ('12 Q4 and newer) file layout

image

AWS S3 LAERS ('12 Q3 and older) file layout

image

Use cases:

All time

  • config var
    • LOAD_ALL_TIME=1
  • puts all FAERS and LAERS data into your S3 bucket
  • downloads all S3 data to your machine
  • loads it into your local Postgres database

Add one new quarter of data to local database

  • config vars
    • LOAD_NEW_YR=2022
    • LOAD_NEW_QTR=Q4
    • LOAD_ALL_TIME=0
  • puts the new quarter of data into your S3 bucket
  • downloads the new quarter of data to your machine
  • loads the new quarter of data into your local Postgres database
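
To check which of the two use cases a config file selects before running anything, a small script along these lines may help. It is a sketch, not part of the repo: the function names `get_conf` and `describe_load_mode` are hypothetical, and it assumes the config file uses plain `KEY=VALUE` lines with no spaces around `=`, as in the variables listed above.

```shell
# Hypothetical helper: print the value of one KEY=VALUE setting.
# $1 = config file, $2 = key. If the key appears twice, the last one wins.
get_conf() {
  awk -F= -v k="$2" '$1 == k { v = substr($0, length(k) + 2) } END { print v }' "$1"
}

# Hypothetical helper: report which use case the config file selects.
describe_load_mode() {
  conf="$1"
  all_time=$(get_conf "$conf" LOAD_ALL_TIME)
  if [ "$all_time" = "1" ]; then
    echo "all-time load"
    return 0
  fi
  yr=$(get_conf "$conf" LOAD_NEW_YR)
  qtr=$(get_conf "$conf" LOAD_NEW_QTR)
  if [ -n "$yr" ] && [ -n "$qtr" ]; then
    echo "new-quarter load: $yr $qtr"
  else
    echo "incomplete config: set LOAD_NEW_YR and LOAD_NEW_QTR" >&2
    return 1
  fi
}
```

Calling `describe_load_mode my_config.conf` prints either `all-time load` or a line such as `new-quarter load: 2022 Q4`. It only reads the file; the actual loading is still done by the Pentaho jobs.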

1. Clone the repo

git clone https://github.com/dbmi-pitt/faersdbstats.git

2. Set up your config file

3. Open ./meta.kjb in Pentaho

You can right-click within meta.kjb to easily open the other stages

image

4. Run the stage_# jobs in order

  • See the wiki pages for additional documentation on each stage