Data sources and organisation

This guide provides an overview on how to download and organise most of the data sources used in the pipeline.

The folder tree used in this pipeline and stated in the config file is the following:

data/
├── figures
├── gem
├── genomes
│   ├── aci
│   ├── bme
│   ├── dmi
│   ├── pse
│   ├── rhi1
│   ├── rhi2
│   ├── rho
│   ├── shw
│   ├── tel
│   └── tez
├── metanetx
├── micom
│   └── amils2023
│       ├── equal_abundances
│       └── paper_abundances
├── modelseed
├── papers
│   └── amils2023
├── retropath
│   ├── interesting_metabolites
│   │   └── sources
│   └── amils2023
│       └── sources
├── retropath_classes
└── retrorules

Figures

Contains all the generated figures as well as some images used in the repo READMEs.

GEM

Contains all GEMs generated by the GEM notebook.

Genomes

Contains all annotated genomes manually downloaded from their respective sources. See the table below for more information.

MetaNetX

Contains compound (chem_prop.tsv) and reaction (reac_prop.tsv) data downloaded from MetaNetX.

MICOM

Contains the results for all MICOM analyses (check the MICOM notebook). Ideally, one folder per study should be created (amils2023 in our case). Subfolders separate different experiments performed using data from the same study.

ModelSEED

Contains compound (compounds.tsv) and reaction (reactions.tsv) data downloaded from ModelSEED.

Papers

Contains all data for the different studies being analysed (each in a subfolder). In our case, we used information from Amils et al. 2023.

RetroPath

Constains the results for all RetroPath analyses (check the RetroPath notebook). Ideally, one folder per study should be created (amils2023 and interesting_metabolites in our case).

RetroPath classes

Contains metadata for exploring compound classes in RetroPath results. This data is still not public and, thus, is not provided.

RetroRules

Contrains the rules downloaded from RetroRules. The version used in this work is the initial release (rr01).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Data sources and organisation

Figures

GEM

Genomes

MetaNetX

MICOM

ModelSEED

Papers

RetroPath

RetroPath classes

RetroRules

Files

README.md

Latest commit

History

README.md

File metadata and controls

Data sources and organisation

Figures

GEM

Genomes

MetaNetX

MICOM

ModelSEED

Papers

RetroPath

RetroPath classes

RetroRules