Code for NeurIPS 2020 paper: Self-imitation Learning via Generalized Lower Bound Q-learning

Dependencies

This implementation depends on the following libraries as well as dependencies that support these libraries.

To run experiments with simulated environments, you will also need to install

Run the code

Hyper-parameters are specified in the Python code. After an experiment runs, its performance curves are saved to a sub-directory of the current working directory for further processing.

For example, to run the n-step SIL algorithm with delayed environments, run the following:

python td3_nstep_sil.py --env HalfCheetah-v3 --seed 100 --delay 3 --nstep 5 --sil-weights 0.1
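As a rough illustration of what the n-step variant optimizes, the sketch below computes an n-step lower bound (a discounted sum of n rewards plus a discounted bootstrap value) and the squared positive-part loss used in lower bound Q-learning. It is not taken from td3_nstep_sil.py: the function names, the default discount, and the way a SIL weight is combined with the TD loss are all assumptions made for this example.

```python
def n_step_lower_bound(rewards, bootstrap_q, gamma=0.99):
    """Discounted sum of the n observed rewards plus a discounted bootstrap value."""
    target = 0.0
    for t, reward in enumerate(rewards):
        target += (gamma ** t) * reward
    # Bootstrap from the Q-value estimate at the state reached after n steps.
    return target + (gamma ** len(rewards)) * bootstrap_q


def sil_loss(lower_bound, q_value):
    """Squared positive part: Q is penalized only when it falls below the bound."""
    gap = max(lower_bound - q_value, 0.0)
    return gap ** 2


def total_critic_loss(td_loss, lower_bound, q_value, sil_weight=0.1):
    """Hypothetical combination: TD loss plus a weighted SIL term (an assumption)."""
    return td_loss + sil_weight * sil_loss(lower_bound, q_value)
```

Under this reading, a weight of 0.0 leaves only the TD loss, which would match running without SIL.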

To run without SIL, set --sil-weights to 0.0:

python td3_nstep_sil.py --env HalfCheetah-v3 --seed 100 --delay 3 --nstep 5 --sil-weights 0.0
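The two commands above differ only in --sil-weights, so a comparison across several seeds could be scripted. The helper below is a minimal sketch that only builds the command lines from the flags shown above. The name build_commands and the extra seeds 101 and 102 are illustrative and not part of this repository.

```python
import itertools
import shlex


def build_commands(seeds, sil_weights, env="HalfCheetah-v3", delay=3, nstep=5,
                   script="td3_nstep_sil.py"):
    """Return one argument list per (seed, SIL weight) combination."""
    commands = []
    for seed, weight in itertools.product(seeds, sil_weights):
        commands.append([
            "python", script,
            "--env", env,
            "--seed", str(seed),
            "--delay", str(delay),
            "--nstep", str(nstep),
            "--sil-weights", str(weight),
        ])
    return commands


if __name__ == "__main__":
    # Seeds 101 and 102 are illustrative; only seed 100 appears in the examples above.
    for cmd in build_commands(seeds=[100, 101, 102], sil_weights=[0.1, 0.0]):
        # Print each command; to launch it instead, pass cmd to subprocess.run(cmd, check=True).
        print(shlex.join(cmd))
```

Building argument lists rather than shell strings avoids quoting problems if an environment name ever contains special characters.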

To run the return-based SIL algorithm with delayed environments, run the following:

python td3_return_sil.py --env HalfCheetah-v3 --seed 100 --delay 3 --nstep 5 --sil-weights 0.1
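In the usual formulation of return-based self-imitation learning, the lower bound for each state-action pair is the discounted return actually observed from that step onward. The sketch below shows one way to compute those returns for a finished episode. It is an illustration under that assumption, not code from td3_return_sil.py, and the function name and default discount are hypothetical.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return the discounted return-to-go for every step of one episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns
```

Each returned value would then stand in for the lower bound that the Q-function is pushed toward whenever its estimate falls below it.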

Citations

If you find this code base useful, please consider citing the following:

Yunhao Tang, "Self-imitation Learning via Generalized Lower Bound Q-learning". arXiv:2006.07442 [cs.LG], 2020.
