Task-Agnostic Operation Toolbox for Gradient-based Bilevel Optimization
Installation | Docs | Tutorials | Examples
BOAT is a task-agnostic, gradient-based Bi-Level Optimization (BLO) Python library that abstracts the key BLO processes into modular, flexible components. It enables researchers and developers to tackle learning tasks with a hierarchically nested structure by providing customizable and diverse operator decomposition, encapsulation, and combination. BOAT supports specialized optimization strategies, including second-order or first-order, nested or non-nested, and with or without theoretical guarantees, catering to various levels of complexity.
To enhance flexibility and efficiency, BOAT incorporates the Dynamic Operation Library (D-OL) and the Hyper Operation Library (H-OL), alongside a collection of state-of-the-art first-order optimization strategies. BOAT also provides multiple implementation versions:
- PyTorch-based: An efficient and widely-used version.
- Jittor-based: An accelerated version for high-performance tasks.
- MindSpore-based: Incorporating the latest first-order optimization strategies to support emerging application scenarios.
BOAT is designed to offer robust computational support for a broad spectrum of BLO research and applications, enabling innovation and efficiency in machine learning and computer vision.
- Dynamic Operation Library (D-OL): Incorporates 4 advanced dynamic system construction operations, enabling users to flexibly tailor optimization trajectories for BLO tasks.
- Hyper-Gradient Operation Library (H-OL): Provides 9 refined operations for hyper-gradient computation, significantly enhancing the precision and efficiency of gradient-based BLO methods.
- First-Order Gradient Methods (FOGMs): Integrates 4 state-of-the-art first-order methods, enabling fast prototyping and validation of new BLO algorithms. With its modularized design, BOAT allows flexible combinations of multiple upper-level and lower-level operators, resulting in nearly 85 algorithmic combinations and offering unparalleled adaptability.
- Modularized Design for Customization: Empowers users to flexibly combine dynamic and hyper-gradient operations while customizing the specific forms of problems, parameters, and optimizer choices, enabling seamless integration into diverse task-specific codes.
- Comprehensive Testing & Continuous Integration: Achieves 99% code coverage through rigorous testing with pytest and Codecov, coupled with continuous integration via GitHub Actions, ensuring software robustness and reliability.
- Fast Prototyping & Algorithm Validation: Streamlined support for defining, testing, and benchmarking new BLO algorithms.
- Unified Computational Analysis: Offers a comprehensive complexity analysis of gradient-based BLO techniques to guide users in selecting optimal configurations for efficiency and accuracy.
- Detailed Documentation & Community Support: Offers thorough documentation with practical examples and API references via MkDocs, ensuring accessibility and ease of use for both novice and advanced users.
Existing automatic differentiation (AD) tools primarily focus on specific optimization strategies, such as explicit or implicit methods, and are often targeted at meta-learning or specific application scenarios, lacking support for algorithm customization.
In contrast, BOAT expands the landscape of Bi-Level Optimization (BLO) applications by supporting a broader range of problem-adaptive operations. It bridges the gap between theoretical research and practical deployment, offering unparalleled flexibility to design, customize, and accelerate BLO techniques.
BOAT enables efficient implementation and adaptation of advanced BLO techniques for key applications, including but not limited to:
- Hyperparameter Optimization (HO)
- Neural Architecture Search (NAS)
- Adversarial Training (AT)
- Few-Shot Learning (FSL)
- Generative Adversarial Learning
- Transfer Attack
- ...
To install BOAT, use the following command:
pip install boat-torch
or install the latest version from source:
git clone https://github.com/callous-youth/BOAT.git
cd BOAT
pip install -e .
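After installation, a quick import check (assuming only the boat_torch package name used in the pip command above) confirms that the library is available:

# Sanity check: the package name comes from the pip command above;
# nothing beyond importability is assumed here.
import boat_torch
print("boat_torch imported successfully")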
BOAT relies on two key configuration files:
- boat_config.json: Specifies optimization strategies and dynamic/hyper-gradient operations.
- loss_config.json: Defines the loss functions for both levels of the BLO process.
import json
import boat_torch as boat

# Load configuration files
with open("path_to_configs/boat_config.json", "r") as f:
    boat_config = json.load(f)
with open("path_to_configs/loss_config.json", "r") as f:
    loss_config = json.load(f)
You need to specify both the upper-level and lower-level models along with their respective optimizers.
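UpperModel and LowerModel in the snippet below are placeholders for your own task-specific networks. Purely as a hypothetical stand-in (not part of BOAT), a minimal toy pair could look like the following sketch:

import torch
import torch.nn as nn

# Hypothetical toy modules used only to make the demo self-contained;
# replace them with your task-specific upper-/lower-level models.
class UpperModel(nn.Module):
    def __init__(self, num_samples=32):
        super().__init__()
        # e.g. per-sample weights acting as upper-level (hyper)parameters
        self.weights = nn.Parameter(torch.zeros(num_samples))

    def forward(self):
        return torch.sigmoid(self.weights)

class LowerModel(nn.Module):
    def __init__(self, in_dim=10, num_classes=2):
        super().__init__()
        # simple linear classifier as the lower-level model
        self.linear = nn.Linear(in_dim, num_classes)

    def forward(self, x):
        return self.linear(x)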
import torch
# Define models
upper_model = UpperModel(*args, **kwargs) # Replace with your upper-level model
lower_model = LowerModel(*args, **kwargs) # Replace with your lower-level model
# Define optimizers
upper_opt = torch.optim.Adam(upper_model.parameters(), lr=0.01)
lower_opt = torch.optim.SGD(lower_model.parameters(), lr=0.01)
Modify the boat_config to include your dynamic and hyper-gradient methods, as well as model and variable details.
# Example combination of dynamic and hyper-gradient methods (Demo Only)
dynamic_method = ["NGD", "DI", "GDA"]  # Dynamic methods
hyper_method = ["RGT", "RAD"]  # Hyper-gradient methods
# Add methods and model details to the configuration
boat_config["dynamic_op"] = dynamic_method
boat_config["hyper_op"] = hyper_method
boat_config["lower_level_model"] = lower_model
boat_config["upper_level_model"] = upper_model
boat_config["lower_level_opt"] = lower_opt
boat_config["upper_level_opt"] = upper_opt
boat_config["lower_level_var"] = list(lower_model.parameters())
boat_config["upper_level_var"] = list(upper_model.parameters())
Next, initialize the BLO problem with the configurations above and build the solvers for the lower and upper levels.
# Initialize the problem
b_optimizer = boat.Problem(boat_config, loss_config)
# Build solvers for lower and upper levels
b_optimizer.build_ll_solver() # Lower-level solver
b_optimizer.build_ul_solver() # Upper-level solver
Prepare the data feeds for both levels of the BLO process; they are then fed into the upper-level and lower-level objective functions.
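For a quick, self-contained test, the tensors referenced below can be stubbed with random data shaped to match the hypothetical toy models above (illustrative only; in practice the lower level typically consumes training data and the upper level validation data):

import torch

# Illustrative random data only (shapes match the hypothetical toy models above).
lower_level_data = torch.randn(32, 10)
lower_level_target = torch.randint(0, 2, (32,))
upper_level_data = torch.randn(32, 10)
upper_level_target = torch.randint(0, 2, (32,))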
# Define data feeds (Demo Only)
ul_feed_dict = {"data": upper_level_data, "target": upper_level_target}
ll_feed_dict = {"data": lower_level_data, "target": lower_level_target}
Execute the optimization loop, optionally customizing the solver strategy for dynamic methods.
# Set number of iterations
iterations = 1000
# Optimization loop (Demo Only)
for x_itr in range(iterations):
    # Run a single optimization iteration
    loss, run_time = b_optimizer.run_iter(ll_feed_dict, ul_feed_dict, current_iter=x_itr)
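    # Illustrative logging only (not part of the BOAT API): run_iter returns the
    # loss value and the wall-clock time of this iteration.
    if x_itr % 100 == 0:
        print(f"iter {x_itr}: loss = {loss}, time = {run_time}")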
- Hyperparameter optimization with approximate gradient (CG)
- Optimizing millions of hyperparameters by implicit differentiation (NS)
- Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks (IAD)
- On First-Order Meta-Learning Algorithms (FOA)
- Bilevel Programming for Hyperparameter Optimization and Meta-Learning (RAD)
- Truncated Back-propagation for Bilevel Optimization (RGT)
- DARTS: Differentiable Architecture Search (FD)
- A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton (GDA)
- Towards gradient-based bilevel optimization with non-convex followers and beyond (PTT, DI)
- Averaged Method of Multipliers for Bi-Level Optimization without Lower-Level Strong Convexity (DM)
- Learning With Constraint Learning: New Perspective, Solution Strategy and Various Applications (IGA)
- BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach (VFM)
- A Value-Function-based Interior-point Method for Non-convex Bi-level Optimization (VSM)
- On Penalty-based Bilevel Gradient Descent Method (PGDM)
- Moreau Envelope for Nonconvex Bi-Level Optimization: A Single-loop and Hessian-free Solution Strategy (MESM)
MIT License
Copyright (c) 2024 Yaohua Liu
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.