First implementation of an nT-like likelihood #32

hammannr · 2023-07-12T16:53:21Z

This PR adds the nT-like, blueice-based likelihood structure to alea (solves #23) and also introduces a suggestion for split configs (#15).
The model works in my first basic tests. I'll test it again with the changes on the current master, then add docstrings, clean up, and add some examples of how to use it 😊 But of course, if you already spot something you would do differently, let me know! 😄

github-actions · 2023-07-12T17:04:28Z

Pull Request Test Coverage Report for Build 5534394288

0 of 149 (0.0%) changed or added relevant lines in 5 files are covered.
2 unchanged lines in 2 files lost coverage.
Overall coverage remained the same at 0.0%

Changes Missing Coverage	Changed/Added Lines	%
alea/statistical_model.py	7	0.0%
alea/simulators.py	8	0.0%
alea/parameters.py	10	0.0%
alea/utils.py	17	0.0%
alea/blueice_extended_model.py	107	0.0%

Files with Coverage Reduction	New Missed Lines	%
alea/simulators.py	1	0%
alea/utils.py	1	0%

Totals
Change from base Build 5529398966:	0.0%
Covered Lines:	0
Relevant Lines:	2801

💛 - Coveralls

github-actions

Remaining comments which cannot be posted as a review comment to avoid GitHub Rate Limit

pep8

alea/blueice_extended_model.py|142 col 1| S101 Use of assert detected. The enclosed code will be removed when compiling to optimised byte code.
alea/blueice_extended_model.py|151 col 1| D102 Missing docstring in public method
alea/blueice_extended_model.py|154 col 1| D102 Missing docstring in public method
alea/blueice_extended_model.py|157 col 1| S101 Use of assert detected. The enclosed code will be removed when compiling to optimised byte code.
alea/blueice_extended_model.py|177 col 1| D102 Missing docstring in public method
alea/blueice_extended_model.py|178 col 9| WPS221 Found line with high Jones Complexity: 17 > 14
alea/utils.py|266 col 1| D103 Missing docstring in public function
alea/utils.py|744 col 1| D103 Missing docstring in public function
alea/utils.py|750 col 1| S307 Use of possibly insecure function - consider using safer ast.literal_eval.
alea/statistical_model.py|152 col 1| D102 Missing docstring in public method
alea/statistical_model.py|168 col 22| E231 missing whitespace after ':'
alea/statistical_model.py|168 col 30| E231 missing whitespace after ','
alea/parameters.py|218 col 1| D200 One-line docstring should fit on one line with quotes
alea/parameters.py|221 col 9| WPS221 Found line with high Jones Complexity: 16 > 14
alea/parameters.py|229 col 9| WPS221 Found line with high Jones Complexity: 16 > 14

github-actions · 2023-07-12T17:08:13Z

Pull Request Test Coverage Report for Build 5572804010

0 of 213 (0.0%) changed or added relevant lines in 5 files are covered.
3 unchanged lines in 3 files lost coverage.
Overall coverage remained the same at 0.0%

Changes Missing Coverage	Changed/Added Lines	%
alea/simulators.py	9	0.0%
alea/parameters.py	13	0.0%
alea/statistical_model.py	20	0.0%
alea/utils.py	31	0.0%
alea/blueice_extended_model.py	140	0.0%

Files with Coverage Reduction	New Missed Lines	%
alea/simulators.py	1	0%
alea/statistical_model.py	1	0%
alea/utils.py	1	0%

Totals
Change from base Build 5529398966:	0.0%
Covered Lines:	0
Relevant Lines:	2852

💛 - Coveralls

kdund · 2023-07-12T18:53:30Z

Tried it briefly :)!

I hit some path issues (it could not find the templates) that led me to this:

https://github.com/XENONnT/alea/blob/0faf6a304ef2bae89ac35152b48b9d97d70f26bc/alea/utils.py#L273C18-L273C18

This will fail unless you're running it from the base alea folder.
I think in most cases these paths will be absolute paths, or paths relative to the location of the yaml file rather than to alea, so I would remove the prefixed alea (and possibly allow a global template_folder argument instead?).
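
One way to read this suggestion is to resolve relative template paths against the folder that holds the yaml file and to pass absolute paths through unchanged. The sketch below only illustrates that idea and is not code from this PR; `resolve_template_path` and its arguments are hypothetical names.

```python
from pathlib import Path


def resolve_template_path(template: str, config_path: str) -> Path:
    """Resolve a template path relative to the config file (hypothetical helper)."""
    template_path = Path(template)
    if template_path.is_absolute():
        # Absolute paths are used as given.
        return template_path
    # Relative paths are anchored at the folder containing the yaml file,
    # not at the current working directory or the alea package folder.
    return Path(config_path).parent / template_path
```

With this behaviour a config can be moved together with its templates, and the result no longer depends on where the script is started.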

hammannr · 2023-07-13T14:27:10Z

The basic functionality of the blueice-based likelihood is now fully implemented and cleaned up. A few open points remain, marked with TODO or IDEA. I would implement those in a dedicated PR: I don't think they are crucial for now, and this way we can continue implementing things collaboratively.
If everyone agrees, I'll then add those open items as issues.

Below you can find an example code snippet on how to run the likelihood and what one can do with it.
Looking forward to your comments! 😊

from alea.blueice_extended_model import BlueiceExtendedModel
import numpy as np
import scipy.stats as sps
import matplotlib.pyplot as plt
import os
import alea
from tqdm import tqdm

alea_dir = os.path.dirname(alea.__file__)
config_path = os.path.join(alea_dir, "examples/unbinned_wimp_statistical_model.yaml")

# Initialize the statistical model
statistical_model = BlueiceExtendedModel.from_config(config_path)

##### 1) Simple example #####
# Generate and assign some data
data = statistical_model.generate_data()
statistical_model.data = data

# Fit the model
best_result, best_ll = statistical_model.fit(verbose=True)

# Plot the data
fig, axes = plt.subplots(1, 2, figsize=(10, 5))
for i, ax in enumerate(axes):
    # Source names of the i-th likelihood term
    source_names = statistical_model._likelihood.likelihood_list[i].source_name_list
    for source in range(2):
        source_selection = data[i]["source"] == source
        ax.plot(data[i]["cs1"][source_selection],
                data[i]["cs2"][source_selection],
                "o", label=source_names[source])
    # Format each axis once, after all sources are drawn
    ax.legend()
    ax.semilogy()
    ax.set_xlabel("cs1 [PE]")
    ax.set_ylabel("cs2 [PE]")
    ax.set_ylim(1e2, 1e4)
    ax.set_title(f"{statistical_model.likelihood_names[i]}")
plt.show()

##### 2) Compute the confidence interval #####
dl, ul = statistical_model.confidence_interval("wimp_rate_multiplier",
                                      parameter_interval_bounds=[0, 50],
                                      )

# Check the result
rate_vals = np.linspace(max(0, dl - .5), ul + .5, 200)
lls = []
for rate in rate_vals:
    res, ll = statistical_model.fit(wimp_rate_multiplier=rate)
    lls.append(ll)
lls = np.array(lls)

plt.plot(rate_vals, 2 * (best_ll - lls))
plt.axhline(sps.chi2(1).ppf(.9), c="orange", ls="--")
plt.axvline(best_result["wimp_rate_multiplier"], c="r")
plt.axvline(dl, c="orange")
plt.axvline(ul, c="orange")
plt.ylim(0, 4)
plt.xlim(0)
plt.xlabel("wimp_rate_multiplier")
plt.ylabel("-2 * LLR")
plt.show()

##### 3) Compute median upper limit distribution for b-only toys #####
n_mc = 200
dls = []
uls = []
for i in tqdm(range(n_mc), desc="Computing confidence intervals"):
    # generate background only data
    data = statistical_model.generate_data(wimp_rate_multiplier=0)
    statistical_model.data = data
    dl, ul = statistical_model.confidence_interval(
        "wimp_rate_multiplier",
        parameter_interval_bounds=[0, 50]
        )
    dls.append(dl)
    uls.append(ul)
dls = np.array(dls)
uls = np.array(uls)

fig, axes = plt.subplots(1, 2, figsize=(10, 5))
dls[dls == -np.inf] = 0
axes[0].hist(dls, bins=20)
axes[0].set_xlabel("dl [wimp_rate_multiplier]")

axes[1].hist(uls, bins=20)
axes[1].set_xlabel("ul [wimp_rate_multiplier]")
axes[1].axvline(np.median(uls), c="crimson")

for ax in axes:
    ax.semilogy()
    ax.set_xlim(0)
plt.show()

github-actions · 2023-07-13T14:29:08Z

alea/utils.py

+            eval_analysis_space.append(eval_element)
+    return eval_analysis_space
+
+def find_file_resource(file: str, possible_locations: list) -> str:


[pep8] _{reported by reviewdog 🐶}
E302 expected 2 blank lines, found 1

github-actions · 2023-07-13T14:29:08Z

alea/utils.py

+
+def find_file_resource(file: str, possible_locations: list) -> str:
+    """
+    Function to look for a file in a list of folders (the file name may include folder structure too).


[pep8] _{reported by reviewdog 🐶}
E501 line too long (102 > 100 characters)

github-actions · 2023-07-13T14:29:08Z

alea/utils.py

+        fname = Path(loc, file)
+        if fname.is_file():
+            return str(fname)
+    raise FileNotFoundError("{:s} not found in any of {:s}".format(file, ",".join(possible_locations)))


[pep8] _{reported by reviewdog 🐶}
E501 line too long (103 > 100 characters)
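
Pieced together from the diff fragments quoted in these comments, the helper might look roughly like the sketch below, with the error message built on its own line so that every line stays under the 100-character limit. The loop header and the docstring wording are assumptions, since the excerpts do not show them. The message also uses an f-string with `str(loc)` instead of the `{:s}` formatting from the diff, so that locations passed as `Path` objects would work too.

```python
from pathlib import Path
from typing import List


def find_file_resource(file: str, possible_locations: List[str]) -> str:
    """Look for a file in a list of folders and return the first match.

    The file name may itself include a folder structure.
    """
    for loc in possible_locations:  # assumed loop; not visible in the diff excerpt
        fname = Path(loc, file)
        if fname.is_file():
            return str(fname)
    locations = ", ".join(str(loc) for loc in possible_locations)
    raise FileNotFoundError(f"{file} not found in any of {locations}")
```

Because the folders are searched in order, earlier entries in `possible_locations` take precedence over later ones.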

kdund

Runs sensibly, code sensible, thanks!

kdund · 2023-07-13T15:01:07Z

alea/blueice_extended_model.py

+            if isinstance(likelihood_config["template_folder"], str):
+                template_folder_list = [likelihood_config["template_folder"]]
+            elif isinstance(likelihood_config["template_folder"], list):
+                template_folder_list = likelihood_config["template_folder"]


What do you think about adding the alea example data folder as a default search folder here?

So you mean always adding "alea/examples/templates/wimp_templates" to the end of the list? It probably doesn't hurt to add it, though I'm not sure how often it will be used. For the examples, I think it is good to write things explicitly so that people know what to change when implementing their own model.
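
As a rough illustration of the option discussed here, the sketch below normalises a `template_folder` entry to a list and optionally appends a fallback folder that is searched last. `build_template_folder_list` and `default_folder` are hypothetical names, and the `TypeError` branch is an assumption: the diff excerpt only shows the `str` and `list` cases.

```python
from typing import List, Optional, Union


def build_template_folder_list(
    template_folder: Union[str, List[str]],
    default_folder: Optional[str] = None,
) -> List[str]:
    """Normalise a template_folder entry to a list (hypothetical helper)."""
    if isinstance(template_folder, str):
        folders = [template_folder]
    elif isinstance(template_folder, list):
        folders = list(template_folder)  # copy so the config is not mutated
    else:
        raise TypeError("template_folder must be a str or a list of str")
    # Optionally append a fallback folder; it is searched after the explicit ones.
    if default_folder is not None and default_folder not in folders:
        folders.append(default_folder)
    return folders
```

Keeping the default optional leaves the example configs explicit, which is the concern raised in the reply.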

Removes all hash for parameters not used for each source, and for all efficiency parameters. This way, blueice will not cache pdfs for different values of these parameters

github-actions · 2023-07-17T07:14:37Z

alea/blueice_extended_model.py

+        ret = dict()
+        for ll in self._likelihood.likelihood_list[:-1]:  # ancillary likelihood does not contribute
+
+            ll_pars = list(ll.rate_parameters.keys()) + list(ll.shape_parameters.keys())


[pep8] _{reported by reviewdog 🐶}
WPS221 Found line with high Jones Complexity: 15 > 14
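
One possible way to shorten the flagged expression is to unpack both mappings directly, since iterating a dict yields its keys. The snippet below is a sketch with a stand-in object; the parameter names are made up, and it assumes `rate_parameters` and `shape_parameters` behave like dicts, as the `.keys()` calls in the diff suggest. Whether this is enough to satisfy the linter threshold has not been checked.

```python
from types import SimpleNamespace

# Stand-in for one likelihood term; only the two attributes used in the
# flagged line are mimicked here (names and values are made up).
ll = SimpleNamespace(
    rate_parameters={"wimp_rate_multiplier": None, "er_rate_multiplier": None},
    shape_parameters={"er_band_shift": None},
)

# Iterating a dict yields its keys, so unpacking both dicts gives the same
# list as list(a.keys()) + list(b.keys()) with fewer nested calls.
ll_pars = [*ll.rate_parameters, *ll.shape_parameters]
```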

github-actions · 2023-07-17T07:14:37Z

alea/blueice_extended_model.py

+
+            # add all parameters to extra_dont_hash for each source unless it is used:
+            for i, source in enumerate(config["sources"]):
+                parameters_to_ignore: List[str] = [p.name for p in self.parameters if (p.ptype == "shape")


[pep8] _{reported by reviewdog 🐶}
WPS441 Found control variable used after block: p

github-actions · 2023-07-17T07:14:37Z

alea/blueice_extended_model.py

+
+            # add all parameters to extra_dont_hash for each source unless it is used:
+            for i, source in enumerate(config["sources"]):
+                parameters_to_ignore: List[str] = [p.name for p in self.parameters if (p.ptype == "shape")


[pep8] _{reported by reviewdog 🐶}
WPS465 Found likely bitwise and boolean operation mixup

github-actions · 2023-07-17T07:14:37Z

alea/blueice_extended_model.py

+
+            # add all parameters to extra_dont_hash for each source unless it is used:
+            for i, source in enumerate(config["sources"]):
+                parameters_to_ignore: List[str] = [p.name for p in self.parameters if (p.ptype == "shape")


[pep8] _{reported by reviewdog 🐶}
WPS441 Found control variable used after block: p

github-actions · 2023-07-17T07:14:37Z

alea/blueice_extended_model.py

+
+            # add all parameters to extra_dont_hash for each source unless it is used:
+            for i, source in enumerate(config["sources"]):
+                parameters_to_ignore: List[str] = [p.name for p in self.parameters if (p.ptype == "shape")


[pep8] _{reported by reviewdog 🐶}
E501 line too long (106 > 100 characters)
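
The flagged comprehension is cut off in the excerpts, so its full condition is not known. The sketch below assumes, based on the code comment about `extra_dont_hash`, that a shape parameter is ignored for a source unless that source uses it. `Parameter` and `shape_parameters_to_ignore` are hypothetical stand-ins. Writing the filter as a short loop with `and` avoids both the long line and the bitwise/boolean mixup that WPS465 warns about.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Parameter:
    """Minimal stand-in for an alea parameter (hypothetical)."""
    name: str
    ptype: str


def shape_parameters_to_ignore(parameters: List[Parameter], used: List[str]) -> List[str]:
    """Names of shape parameters that a source does not use."""
    ignore = []
    for par in parameters:
        # `and` short-circuits on booleans; `&` is the bitwise operator and
        # binds more tightly than `==`, so mixing them needs extra parentheses.
        if par.ptype == "shape" and par.name not in used:
            ignore.append(par.name)
    return ignore
```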

github-actions · 2023-07-17T07:14:40Z

alea/blueice_extended_model.py

+        assert efficiency_parameter.ptype == "efficiency", "The parameter {:s} must" \
+            " be an efficiency".format(efficiency_name)
+        limits = efficiency_parameter.fit_limits
+        assert 0 <= limits[0], 'Efficiency parameters including {:s} must be' \


[pep8] _{reported by reviewdog 🐶}
N400: Found backslash that is used for line breaking

github-actions · 2023-07-17T07:14:40Z

alea/blueice_extended_model.py

+        assert efficiency_parameter.ptype == "efficiency", "The parameter {:s} must" \
+            " be an efficiency".format(efficiency_name)
+        limits = efficiency_parameter.fit_limits
+        assert 0 <= limits[0], 'Efficiency parameters including {:s} must be' \


[pep8] _{reported by reviewdog 🐶}
WPS309 Found reversed compare order

github-actions · 2023-07-17T07:14:40Z

alea/blueice_extended_model.py

+        limits = efficiency_parameter.fit_limits
+        assert 0 <= limits[0], 'Efficiency parameters including {:s} must be' \
+                               ' constrained to be nonnegative'.format(efficiency_name)
+        assert np.isfinite(limits[1]), 'Efficiency parameters including {:s} must be' \


[pep8] _{reported by reviewdog 🐶}
S101 Use of assert detected. The enclosed code will be removed when compiling to optimised byte code.

github-actions · 2023-07-17T07:14:40Z

alea/blueice_extended_model.py

+        limits = efficiency_parameter.fit_limits
+        assert 0 <= limits[0], 'Efficiency parameters including {:s} must be' \
+                               ' constrained to be nonnegative'.format(efficiency_name)
+        assert np.isfinite(limits[1]), 'Efficiency parameters including {:s} must be' \


[pep8] _{reported by reviewdog 🐶}
N400: Found backslash that is used for line breaking
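
Both S101 and N400 could be addressed by replacing the asserts with explicit exceptions and by wrapping long messages in parentheses instead of backslashes. The following is a sketch under assumptions, not the code in this PR: `check_efficiency_limits` is a hypothetical helper, the `None` checks are an addition, `math.isfinite` stands in for `np.isfinite` to keep the example dependency-free, and the last error message is a placeholder because the original is truncated in the excerpt.

```python
import math
from typing import Sequence


def check_efficiency_limits(name: str, ptype: str, fit_limits: Sequence[float]) -> None:
    """Validate an efficiency parameter without assert (hypothetical helper)."""
    if ptype != "efficiency":
        raise ValueError(f"The parameter {name} must be an efficiency")
    lower, upper = fit_limits
    if lower is None or lower < 0:
        # Adjacent string literals inside parentheses replace the backslash.
        raise ValueError(
            f"Efficiency parameters including {name} must be"
            " constrained to be nonnegative"
        )
    if upper is None or not math.isfinite(upper):
        raise ValueError(f"Efficiency parameter {name} needs a finite upper fit limit")
```

Unlike `assert`, these checks still run when Python is started with `-O`.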

github-actions · 2023-07-17T07:14:40Z

alea/statistical_model.py

@@ -150,39 +163,33 @@ def store_data(
        kw = {'metadata': metadata} if metadata is not None else dict()
        toydata_to_file(file_name, data_list, data_name_list, **kw)

-    def get_expectations(self):
-        return NotImplementedError("get_expectation is optional to implement")
+    def get_expectation_values(self, **parameter_values):


[pep8] _{reported by reviewdog 🐶}
D102 Missing docstring in public method
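
Note that the removed `get_expectations` returned the `NotImplementedError` object rather than raising it. A base-class placeholder for the renamed method, with the docstring that D102 asks for, could look like the sketch below. The class name is a stand-in, and the description of the return value is an assumption rather than the documented alea behaviour.

```python
class StatisticalModelSketch:
    """Toy base class; not the alea StatisticalModel."""

    def get_expectation_values(self, **parameter_values):
        """Return the expected number of events per source.

        Optional to implement. Subclasses that support it are assumed here
        to return a dict mapping source names to expectation values.
        """
        # `raise`, not `return`: returning the exception object would hand the
        # caller a truthy value instead of signalling the missing method.
        raise NotImplementedError("get_expectation_values is optional to implement")
```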

hammannr and others added 19 commits June 22, 2023 10:32

draft example configs to think about structure

ccfa0f3

draft structure of BlueiceExtendedModel

afa71a5

improve structure

a94c1bc

start implementing data generation

7905879

try to organize likelihood parameters

2a71187

take notes

6d3415a

start implementing ancillary ll term

025a21c

continue working on anc. likelihood

4d05a81

start _get_constraint_terms

2e9eb09

improve ancillary likelihood

e759366

finish ancillary likelihood implementation

35e065d

restructure generation of ancillary measurements

26e49b0

start likelihood construction

898e1b6

add aux ll

20ec3a4

move blueice extended model to top layer

d630458

adapt config for blueice

a0ea2c6

make it run for the first time

aecc75e

fix likelihood init bug

65aa50d

Merge branch 'master' into nt_likelihood

e9570d4

github-actions bot reviewed Jul 12, 2023

undo indenting docstring

8e5b6d8

github-actions bot reviewed Jul 12, 2023

hammannr added 3 commits July 12, 2023 19:38

update template path

cdf6ecd

cleanup some outdated things in statistical_model

ff936ec

minor compatibility fixes

0faf6a3

github-actions bot reviewed Jul 12, 2023

Expectation value implementation

031ff42

enable template folder list

329980d

github-actions bot reviewed Jul 13, 2023

hammannr added 2 commits July 13, 2023 15:54

template_folder fix

2814612

change livetime names in config

6b16f4f

github-actions bot reviewed Jul 13, 2023

kdund added 2 commits July 13, 2023 10:26

Utils fcn to find file in one of several paths.

1953474

Merge remote-tracking branch 'origin/nt_likelihood' into nt_likelihood

b3614da

hammannr marked this pull request as ready for review July 13, 2023 14:27

hammannr requested review from kdund and dachengx July 13, 2023 14:28

github-actions bot reviewed Jul 13, 2023

hammannr added the enhancement New feature or request label Jul 13, 2023

This was linked to issues Jul 13, 2023

Split Configs #15

Closed

Implement blueice-extended likelihood #23

Closed

kdund approved these changes Jul 13, 2023

kdund and others added 11 commits July 13, 2023 15:11

Removes all hash for parameters not used for each source, and for all efficiency parameters. This way, blueice will not cache pdfs for different values of these parameters

64ceaa2

Accepted some of the linting comments

0eccaf0

fix ptype - type missmatch

28c5dd6

The dog convinced me: Let's go with ptype instead.

97d3dfe

remove my TODOs

2752b0f

Separate out efficiency set fcn

5fadf96

Remove errant code in get_expectation_values in base class.

ac602d3

Remove errant code in get_expectation_values in base class.

214fd18

linter mess

0c41d99

.type -> .ptype

90c2ca1

Merge pull request #37 from XENONnT/nt_likelihood_efficiency

7fd0d13

Removes all hash for parameters not used for each source, and for all…

github-actions bot reviewed Jul 17, 2023

hammannr merged commit 257ab91 into master Jul 17, 2023

hammannr deleted the nt_likelihood branch July 17, 2023 07:15
