Skip to content

A short tutorial to using DL and keras for genomics with sequence data.

Notifications You must be signed in to change notification settings

bh9/tutorial_dl_for_genomics

 
 

Repository files navigation

DL for Genomics Tutorial

A short tutorial to using DL and keras for genomics with sequence data.

Requirements

  • python 3.5+
    • modules:
      • keras
      • numpy
    • I recommend anaconda https://docs.anaconda.com/ or miniconda to manage environments
      • to create a self contained environment after conda install:
        • conda create --name dl_intro
        • source activate dl_intro
        • conda install pip
        • pip install numpy keras
        • to leave type: source deactivate
  • R v3+
    • packages:
      • ggplot2

Description

This is a short tutorial running through some principles of using DL for genomics, specifically for sequence classification. We will use simulated sequences from 4 different classes that are populated with transcption factor motifs and other DNA patterns. We will build and train a small convolutional neuronal network using keras to learn to classify from the sequence only. We will also apply this network for new predictions and inspect what the network has learned to be important features.

  • Notebook running through the tutorial
  • python script to adjust and run the entire analysis
  • input data
  • helper scripts for simulating data and visualizing kernel weights

About

A short tutorial to using DL and keras for genomics with sequence data.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 82.5%
  • R 8.3%
  • Jupyter Notebook 7.9%
  • Python 1.3%