Skip to content

zctao/omnifoldTop

Repository files navigation

omnifoldTop

Omnifold for ttbar l+jets final states.

See https://github.com/ericmetodiev/OmniFold.git for the original repo from the OmniFold authors.

Install

git clone https://github.com/zctao/omnifoldTop.git

A singularity container to run the application can be made using the definition file provided in image/. The container is based on a tensorflow image. Sudo privilage is required.

sudo singularity build <some_image_name.sif> image/topOmniFold.def

To start running in the container

singularity run --nv --bind <directories_needed_for_inputs_and_outputs> <some_image_name.sif>

Dependencies

See the singularity definition file

image/topOmniFold.def

Run

Set up environment and Python path:

source setup.sh

The main application to run is

python scripts/unfoldv2.py -d DATA_FILES -s SIMULATION_FILES \
                  [-o OUTPUT_DIRECTORY] \
                  [-i NUMBER_OF_ITERATIONS] \
                  [--observables LIST_OF_VARRIABLES_TO_UNFOLD]

To see all argument options:

python scripts/unfoldv2.py -h

Input files are expected to be ROOT files produced by ntuplerTT. In case the DATA_FILES are actually pseudo data from MC simulations, a flag -t can be added to indicate the "MC truth" is known and can be used to evaluate the performance.

In case one wishes to reuse the previously trained results, an option --unfolded-weights PATH_TO_UNFOLDED_WEIGHTS_FILES can be used to read the event weights from the specified files and apply them directly to other variables. Or --load-models MODELS_DIR can be used to load the trained models for reweighting directly in iterations.

[NEW] Unfolding can also be run from

./run_unfold.py RUN_CONFIG

RUN_CONFIG is a JSON config file that are used to set the arguments for scripts/unfoldv2.py. If the value of an argument is a dictionary, the keys of the dictionary are used as labels to create sub-directory in OUTPUT_DIRECTORY, and unfoldings are run with all combinations of the arguments. See one example run config file in configs/run/basic_test.json.

To rerun plotting using the unfolded weights from previous result:

./replot_unfold.py <path_to_result_dir> [<new_output_dir>]

To resume unfolding previously interrupted or unfinished runs:

./resume_unfold.py <path_to_result_dir>

This will load the trained models to reweight for the finished runs/iterations, and continue to train new models and run unfolding for more runs if needed.

Other useful scripts

  • evaluate_systematics.py [TODO]

[DEPRECATED]

  • evaluateModels.py: if input data files are pseudo data, i.e. MC truth is available, evaluateModels.py can be used to reweight simulation truth directly to the pseudo data truth and compare the reweighted distribution with the actual MC truth in the pseudo data. The goal is to evaluate the performance of the classifier model without running OmniFold iterations.

      python3 evaluateModels.py -d DATA_FILES -s SIMULATION_FILES \
                                [-o OUTPUT_DIRECTORY] \
                                [--observables LIST_OF_VARIABLES_TO_UNFOLD]
    

    To see all argument options:

      python3 evaluateModels.py -h
    
  • scripts/rootReader.py: convert root files into npz files containing numpy structure arrays.

      python3 scripts/rootReader.py ROOT_FILES \
                                    --truth-level parton|particle \
                                    [-o OUTPUT_NAME]
    
  • scripts/makeRunScript.py: generate a bash script to run multiple unfolding iterations with different parameters given a run configuration file.

      python3 scripts/makeRunScript.py RUNCONFIG [-o OUTPUT_NAME]
    

    An example of the run config file is provided in configs/run/basic_tests.json.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •