Skip to content

pagand/ORL_optimizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

64 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Model-based Confidence-aware Offline RL

coteaching

Contents

  1. Setup
  2. Usage
  3. Disclaimer

Setup

Clone the repository and build a conda environment from requirement.txt:

git clone https://github.com/pagand/ORL_optimizer
cd ORL_OPTIMIZER
conda create -n orl python=3.10
conda activate orl
pip install --upgrade pip
pip install -r requirements.txt

Now install the following extra packages:

pip install 'cython<3'
pip install scipy==1.12

Install the correct version of pytorch given your CUDA from start locally or previous versions. For CUDA 11.8

pip3 install torch  --index-url https://download.pytorch.org/whl/cu118

Install Jax (only required for the CORL):

pip install "jax[cuda11_cudnn86]"==0.4.7 -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html

Usage

The code of combustion simulator is in Simulator/simrnn_model.py, the code of model-based offline RL framework, MORE, is in RL/primal_dual_ddpg.py.

  • For Model-based offline RL:
cd MBORL
  • For different offline RL models
cd CORL
  • For vessel training model and simulator
cd VesselModel
  • For MORE paper implementation
cd MORE

Disclaimer

This code is is heavily based on

About

offline RL optimizer

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •