Codebase for 'Explaining RL Decisions with Trajectories', published at ICLR 2023.

This repository provides the code for the gridworld experiments in the gridworld_expts.ipynb notebook. We hope this example makes the implementation for other environments clear. For additional queries, feel free to reach out at: [email protected]

Instructions for usage:

Before running the code, create a conda environment, install the dependencies, and register the xrl Jupyter kernel:

    conda create -n xrl python=3.8 -y
    conda activate xrl
    pip install -r requirements.txt
    python -m ipykernel install --user --name xrl

Launch gridworld_expts.ipynb from a Jupyter server, select the xrl kernel, and run the notebook to generate the results from the paper.
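For a non-interactive run, the launch step can also be sketched in Python. This is a minimal sketch and not part of the repository: it assumes the `jupyter nbconvert` command is available in the xrl environment, and the helper names `build_execute_command` and `run_notebook`, as well as the output filename, are hypothetical.

```python
import subprocess


def build_execute_command(
    notebook="gridworld_expts.ipynb",            # notebook named in this README
    kernel="xrl",                                # kernel registered during installation
    output="gridworld_expts_executed.ipynb",     # hypothetical output filename
):
    """Build a `jupyter nbconvert` command that executes the notebook headlessly."""
    return [
        "jupyter", "nbconvert",
        "--to", "notebook",                           # keep the result as a notebook
        "--execute",                                  # run all cells before saving
        f"--ExecutePreprocessor.kernel_name={kernel}",  # force the chosen kernel
        "--output", output,                           # do not overwrite the original
        notebook,
    ]


def run_notebook(**kwargs):
    """Run the command; raises CalledProcessError if execution fails."""
    subprocess.run(build_execute_command(**kwargs), check=True)
```

Calling `run_notebook()` from the repository root, with the xrl environment active, should execute every cell and save an executed copy under the chosen output name, leaving the original notebook unchanged.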

Acknowledgements: We use the dynamic programming implementation from andrecianflone/dynaq/ and thank its authors for making it publicly available.

Citation

If you use this code for your research, please cite our paper:

@misc{deshmukh2023explaining,
      title={Explaining RL Decisions with Trajectories}, 
      author={Shripad Vilasrao Deshmukh and Arpan Dasgupta and Balaji Krishnamurthy and Nan Jiang and Chirag Agarwal and Georgios Theocharous and Jayakumar Subramanian},
      year={2023},
      eprint={2305.04073},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

Folders and files:

    ckpt_save_dir
    gridworld_results/grid_7by7
    .DS_Store
    .gitignore
    LICENSE
    README.md
    env.py
    gridworld_expts.ipynb
    requirements.txt
    traj_clustering_grid.pdf
    utils.py

Repository: shripaddeshmukh/xrl_with_trajectories
