This repository contains the code to reproduce the results in the paper Optimized Conformal Selection: Powerful Selective Inference After Conformity Score Optimization.
simulation/
: Simulation experimentssimulation/Msel/
: Conformity score selection with pre-trained models (Section 5.1)simulation/Full/
: Conformal selection without data splitting (Section 5.2)simulation/Full-Msel/
: Model training and selection with full data (Section 5.3)
real/
: Real-data appliationsreal/drug/
: Drug discovery with model selection (Section 6.1)real/llm/
: Boosting LLM Alignment (Section 6.2)
requirements.txt
: A list of required python packages
For simulation, a single job submission is sufficient to run the experiment.
For the drug discovery application:
- Use
modelpred.py
to generate and save model predictions based on different drug encodings. - Evaluate the performance of various methods using
evaluate.py
.
The code for the LLM alignment application is largely adapted from this repository. After performing report generation and score extraction as described in the linked repository:
- Use
collect.py
to compile all uncertainty/confidence scores and labels into a single.csv
file. - Conduct experiments using different combinations of models via
llm_set1.py
andllm_set2.py
.