1.generate-profiles

History

Name		Name	Last commit message	Last commit date
parent directory ..
data		data
figures		figures
results		results
scripts		scripts
tables		tables
0.process-labels.ipynb		0.process-labels.ipynb
1.normalize-labels.ipynb		1.normalize-labels.ipynb
2.build-consensus-signatures.ipynb		2.build-consensus-signatures.ipynb
3.perturbation-table.ipynb		3.perturbation-table.ipynb
4.load-example-cell-painting-image.ipynb		4.load-example-cell-painting-image.ipynb
5.cell-count-summary.ipynb		5.cell-count-summary.ipynb
README.md		README.md
generate_profiles.py		generate_profiles.py
profile-pipeline.sh		profile-pipeline.sh

README.md

Processing Data

Gregory Way, 2020

In this module, we present our pipeline for generating image-based profiles from Cell Painting data. We also process Cell Health readouts.

Generating image-based profiles

We primarily use the pycytominer tool for data processing.

See generate_profiles.py for a complete description of our data processing pipeline.

Briefly, our pipeline is as follows:

Step	Notes
Aggregate single cells	Operation: median
Annotate profiles	Merge platemaps with metadata
Normalize profiles	Operation: mad_robustize; using only EMPTY control wells
Feature select profiles	Operations: drop_na_columns, blacklist, variance_threshold, drop_outliers
Audit profiles	Determine quality of the data by pairwise replicate correlations

Processing cell health readouts

We also normalize the output cell health readouts from the Cell Health assay. We simply take the z-score across features.

Generating consensus signatures

We acquire consensus signatures for both Cell Painting and Cell Health assay readouts. We generate two different types of consensus signatures: moderated z score (MODZ) and median consensus.

We use the MODZ operation in all downstream applications and interpretations. MODZ was first introduced in Subramanian et al., 2017 and we use the pycytominer implementation.

This procedure results in a total of 357 profiles with matched Cell Painting and Cell Health data.

Execution

To reprocess the profiles, execute the following command:

# Activate environment
conda activate cell-health

# Perform full profiling pipeline
# Note that step 6 of this pipeline is not currently executed,
# since raw images are required and not included in this repo.
python profile-pipeline.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

1.generate-profiles

1.generate-profiles

README.md

Processing Data

Generating image-based profiles

Processing cell health readouts

Generating consensus signatures

Execution

Files

1.generate-profiles

Directory actions

More options

Directory actions

More options

Latest commit

History

1.generate-profiles

Folders and files

parent directory

README.md

Processing Data

Generating image-based profiles

Processing cell health readouts

Generating consensus signatures

Execution