Adding a tutorial on BO constrained by probability of classification model #2700
Conversation
Hi @FrankWanger! Thank you for your pull request and welcome to our community.

Action Required: In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process: In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged. If you have received this in error or have any questions, please contact us at [email protected]. Thanks!
Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!
Thanks a lot for putting this up, this is great.
My main comment (see inline) is on how to leverage the probability of feasibility produced by the classification model directly, rather than converting it twice, but that would require some changes to botorch itself.
Other than that mostly cosmetic comments.
I see that the method finds what appears to be the optimum very quickly - is this consistent across runs? If so it may make sense to reduce the number of iterations somewhat to cut down the runtime of the tutorial.
"$$ \n", | ||
"where $t = \\arctan\\left(\\frac{x_1}{x_2}\\right)$\n", | ||
"\n", | ||
"Here, we follow a natural representation where $y_{\\text{con}}=1$ indicates a feasible condition. We will train a classification model to predict the feasibility of the point. Note that in BoTorch's implementation, **negative values** indicate feasibility, thus we need to do conversion later when feeding feasibility into the pipeline.\n", |
"Here, we follow a natural representation where $y_{\\text{con}}=1$ indicates a feasible condition. We will train a classification model to predict the feasibility of the point. Note that in BoTorch's implementation, **negative values** indicate feasibility, thus we need to do conversion later when feeding feasibility into the pipeline.\n", | |
"Here, we follow a natural representation where $y_{\\text{con}}=1$ indicates a feasible condition. We will train a classification model to predict the feasibility of the point. Note that in BoTorch's implementation, **negative values** indicate feasibility, thus we need to do conversion later when feeding feasibility into the pipeline.\n", | |
"Note that we essentially 'throw away' information contained in the value of $y_{\\text{con}}$ by applying a binary mask - this is for illustration purposes as part of this tutorial, in a real-world application we would model the numerical value of $y_{\\text{con}}$ direction and apply the constraint $y_{\\text{con}}>01$ as part of the optimization.\n", |
It's a bit confusing here that $y_{\text{con}}$ is reduced to a binary label even though it was defined as a numerical quantity above.
Indeed, I have realised the notation issue. I wanted to add that in many experimental situations the numerical value of the constraint is not directly observable, so all we have as data are binary outcomes of success or failure. And yes, here we applied this binary mask to our synthetic problem to throw away information, so that we can simulate what we would obtain in the lab.
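For readers of this thread, a minimal sketch of the binarization step being discussed (illustrative only, not the notebook's exact code): `townsend_constraint` is a hypothetical helper returning the numerical constraint slack $g(x)$, with $g(x) > 0$ meaning feasible.

```python
import torch

def feasibility_label(X: torch.Tensor) -> torch.Tensor:
    # Binary mask: 1.0 -> feasible, 0.0 -> infeasible.
    # The numerical margin g(x) is deliberately discarded to mimic a lab setting
    # where only a success/failure outcome is observed.
    return (townsend_constraint(X) > 0).to(X.dtype)
```

In the real-world setting mentioned in the suggestion above, one would instead model $g(x)$ directly and impose $g(x) > 0$ as an outcome constraint.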
"def pass_con_unsigmoid(Z, model_con, X=None):\n", | ||
" '''\n", | ||
" pass the constraint to the acquisition function\n", | ||
"\n", | ||
" Note: Botorch does sigmoid transformation for the constraint by default, \n", | ||
" therefore we need to unsigmoid our probability (0-1) to (-inf,inf)\n", | ||
" also we need to invert the probability, where -inf means the constraint is satisfied. Finally,we add 1e-8 to avoid log(0).\n", | ||
" '''\n", | ||
" y_con = Z[...,1] #get the constraint\n", | ||
"\n", | ||
" prob = model_con.likelihood(y_con).probs #obtain the probability of y_con(when constraint satisfied)\n", | ||
" prob_unsigmoid_neg = torch.log(1-prob+1e-8)-torch.log(prob+1e-8) #unsigmoid the probability and invert it to adapt to BoTorch's constraint API\n", | ||
" \n", | ||
" return prob_unsigmoid_neg\n" | ||
] |
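For context, a minimal sketch of how such a callable could be wired into the acquisition function via its `constraints` argument (an illustration under assumptions, not the tutorial's exact code: `model_list`, `model_con`, and `train_obj` are placeholders for the fitted two-output model, the classification model, and the observed objective values):

```python
from functools import partial

from botorch.acquisition import qLogExpectedImprovement
from botorch.acquisition.objective import GenericMCObjective

# The first model output is the objective; the second carries the constraint samples.
objective = GenericMCObjective(lambda Z, X=None: Z[..., 0])

acqf = qLogExpectedImprovement(
    model=model_list,               # e.g. a ModelListGP of [objective GP, classification GP]
    best_f=train_obj.max(),         # best observed (feasible) objective value
    objective=objective,
    constraints=[partial(pass_con_unsigmoid, model_con=model_con)],
)
```

The constraint callable is evaluated on the posterior samples `Z`, so the per-sample output of `pass_con_unsigmoid` is what gets pushed through BoTorch's internal feasibility smoothing.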
If the classification model already produces the probabilities of feasibility, it would be great if we could directly use that in the acquisition function, rather than converting it back first. @SebastianAment do you see any major challenges to just accepting an additional "probability_of_feasibility" argument to SampleReducingMCAcquisitionFunction (and possibly in other places) and then just using that in the probability weighting?
Even if there are no issues, getting such a change into botorch would require some eng work, so I wouldn't want to block this PR on that. That said, the probability of feasibility conversion is not a standard sigmoid internally, see https://github.com/pytorch/botorch/blob/main/botorch/utils/objective.py#L178 - ideally, for the time being (until we can accept the probability directly), we could apply the actual inverse of what is being applied in botorch.
do you see any major challenges to just accept an additional "probability_of_feasibility" argument to SampleReducingMCAcquisitionFunction (and possibly in other places) and then just use that in the probability weighting?
That should be pretty straightforward, mainly taking care of appropriate reshaping, since we are usually applying the feasibility weighting on a per-sample basis, and probability_of_feasibility won't share the MC dimension.
Regarding the inversion of the sigmoid, we are currently using a sigmoid with inverse quadratic asymptotic behavior, which could likely be inverted analytically as well, but that will not be necessary once we support this in the acquisition function directly.
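For what it's worth, a conceptual sketch of the broadcasting point above (not existing BoTorch code; shapes are made up for illustration):

```python
import torch

mc_samples, batch, q = 128, 32, 1

# Sample-level acquisition utilities carry an MC sample dimension...
util_per_sample = torch.rand(mc_samples, batch, q)
# ...while a probability-of-feasibility tensor would not.
prob_feasible = torch.rand(batch, q)

# Broadcast over the MC dimension before the usual reductions.
weighted = util_per_sample * prob_feasible.unsqueeze(0)
acq_value = weighted.max(dim=-1).values.mean(dim=0)  # reduce over q, then over MC samples
```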
Co-authored-by: Max Balandat <[email protected]>
Thank you so much! I've addressed most of the formatting issues; there is only one that I am not sure how to remove - the KeOps warnings. I've switched to macOS and it did not help. In terms of the results, yes, I can see that it is quite consistent, so I have halved the iterations to 25 and slightly increased the frequency of the plots.
Great. I may just manually strip the output from the notebook source to keep it clean. I'll get this merged in since it's in great shape already, but still curious to hear @SebastianAment's thoughts on supporting this better in the acquisition functions themselves (which would be a separate PR anyway).
@Balandat has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

@@           Coverage Diff           @@
##             main    #2700   +/-   ##
=======================================
  Coverage   99.98%   99.98%
=======================================
  Files         202      202
  Lines       18588    18588
=======================================
  Hits        18586    18586
  Misses          2        2

☔ View full report in Codecov by Sentry.
Motivation
There is currently no tutorial on using the probability obtained from a classification model as a constraint in acquisition functions, and such an application is of strong interest for BO-guided laboratory experimentation. A prior discussion (#725) raised this use case.
Have you read the Contributing Guidelines on pull requests?
Yes
Test Plan
In the present tutorial we show how to deal with feasibility constraints that are observed alongside the optimization process (referred to as 'outcome constraints' in the BoTorch documentation, or sometimes as 'black-box constraints'). More specifically, the feasibility is modelled by a classification model, and the learned probability is fed to the acquisition function through the `constraint` argument in `SampleReducingMCAcquisitionFunction`. Namely, this is achieved through re-weighting the acquisition function by $\alpha_{\text{acqf-con}}=\mathbb{P}(\text{Constraint satisfied})*\alpha_{\text{acqf}}$. To achieve this, the probability pulled from the classification model underwent an un-sigmoid transformation and was inverted to fit into the API (as negative values are treated as feasible).

A 2D synthetic problem, the Townsend function, was used. For the classification model, we implemented an approximate GP with a Bernoulli likelihood. `qLogExpectedImprovement` was selected as the acquisition function.

Below are the plots of the problem landscape, acquisition function value, constraint probability, and the EI value (before weighting) at different iterations:
At iter=1:
At iter=10:
At iter=50:
The log regret after 50 iterations is plotted against a random (Sobol) baseline.
All images can be reproduced by the notebook.
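For reference, a rough sketch of what an approximate GP classifier with a Bernoulli likelihood can look like (assuming GPyTorch's variational API; `train_X` and `train_feas` stand for the evaluated points and their binary feasibility labels, and this is not necessarily the notebook's exact model):

```python
import torch
import gpytorch


class FeasibilityClassifier(gpytorch.models.ApproximateGP):
    def __init__(self, inducing_points: torch.Tensor):
        variational_distribution = gpytorch.variational.CholeskyVariationalDistribution(
            inducing_points.size(0)
        )
        variational_strategy = gpytorch.variational.VariationalStrategy(
            self, inducing_points, variational_distribution, learn_inducing_locations=True
        )
        super().__init__(variational_strategy)
        self.mean_module = gpytorch.means.ConstantMean()
        self.covar_module = gpytorch.kernels.ScaleKernel(gpytorch.kernels.MaternKernel(nu=2.5))

    def forward(self, x):
        return gpytorch.distributions.MultivariateNormal(self.mean_module(x), self.covar_module(x))


# Fit the classifier on binary feasibility labels via the variational ELBO.
model_con = FeasibilityClassifier(inducing_points=train_X)
likelihood = gpytorch.likelihoods.BernoulliLikelihood()
mll = gpytorch.mlls.VariationalELBO(likelihood, model_con, num_data=train_feas.numel())

optimizer = torch.optim.Adam(list(model_con.parameters()) + list(likelihood.parameters()), lr=0.1)
for _ in range(200):
    optimizer.zero_grad()
    loss = -mll(model_con(train_X), train_feas)
    loss.backward()
    optimizer.step()
```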
Related PRs
Not related to any change of functionality.