A preliminary version of the Python implementation. The code is not well organized at the moment; a cleaner version will be released later. ( _(:з」∠)_ painful final exams...)
The idea of this project is inspired by the following papers by Michelle Lee, Yuke Zhu, et al.:
- Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks: https://arxiv.org/abs/1810.10191
- Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks: https://arxiv.org/abs/1907.13098
Some of the code is taken from their implementation: https://github.com/stanford-iprl-lab/multimodal_representation
The PPO trainer is borrowed from Assignment 5 of IERG5350 - Reinforcement Learning: https://github.com/cuhkrlcourse/ierg5350-assignment
The borrowed code has been modified to fit this application.
The simulation environment is built with PyBullet. Basically, it contains a KUKA robot arm, a cover box, and a button inside the box. There is a hole on the upper side of the box, so the KUKA's end-effector (the peg) can only press the button by inserting the peg through the hole. The agent receives a reward of 10 for touching the cover box and 50 for pressing the button. A detailed explanation will be released later (maybe not).
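Below is a minimal, self-contained sketch of the reward structure described above. The class name `PegInsertionEnv` and the contact-check helpers are illustrative placeholders, not the actual code in `environments/kuka_peg_env.py` (which uses PyBullet contact queries).

```python
# Illustrative sketch only: names and shapes are hypothetical, not the repo's API.
import gym
import numpy as np


class PegInsertionEnv(gym.Env):
    """Toy stand-in for the KUKA peg-insertion task: +10 for touching the
    cover box, +50 for pressing the button inside it."""

    def __init__(self):
        self.action_space = gym.spaces.Box(low=-1.0, high=1.0, shape=(3,))
        self.observation_space = gym.spaces.Box(low=-np.inf, high=np.inf, shape=(6,))
        self._state = np.zeros(6)

    def reset(self):
        self._state = np.zeros(6)
        return self._state.copy()

    def step(self, action):
        touched_cover = self._touching_cover()
        pressed_button = self._button_pressed()
        reward = 10.0 * touched_cover + 50.0 * pressed_button
        done = pressed_button
        return self._state.copy(), reward, done, {}

    def _touching_cover(self) -> bool:
        return False  # placeholder for a PyBullet contact query

    def _button_pressed(self) -> bool:
        return False  # placeholder for a PyBullet contact query
```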
- A nicer implementation;
- Enable simulation parallelism (run multiple simulations at a time);
- Variational training for the sensor fusion encoder (see the sketch after this list);
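As a rough illustration of the planned variational training (not the implementation), each modality encoder could predict a Gaussian over the shared latent and the modalities could be fused with a product of experts, following the variational formulation in the second paper linked above. All module names and dimensions below are made up:

```python
# Hypothetical sketch of variational sensor fusion via a product of experts.
import torch
import torch.nn as nn


def product_of_experts(mus, logvars):
    """Fuse per-modality Gaussians N(mu_i, var_i) into a single Gaussian."""
    precisions = torch.exp(-torch.stack(logvars))  # 1 / var_i
    mu_joint = (torch.stack(mus) * precisions).sum(0) / precisions.sum(0)
    var_joint = 1.0 / precisions.sum(0)
    return mu_joint, torch.log(var_joint)


class VariationalFusion(nn.Module):
    def __init__(self, image_dim=128, force_dim=32, latent_dim=64):
        super().__init__()
        self.image_head = nn.Linear(image_dim, 2 * latent_dim)  # -> (mu, logvar)
        self.force_head = nn.Linear(force_dim, 2 * latent_dim)

    def forward(self, image_feat, force_feat):
        mu_i, logvar_i = self.image_head(image_feat).chunk(2, dim=-1)
        mu_f, logvar_f = self.force_head(force_feat).chunk(2, dim=-1)
        mu, logvar = product_of_experts([mu_i, mu_f], [logvar_i, logvar_f])
        # Reparameterization trick: sample z for downstream self-supervised losses.
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)
        return z, mu, logvar
```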
pip install -r requirements.txt
python train_peg_insertion.py
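For reference, the clipped surrogate objective that a PPO trainer minimizes looks like the sketch below; the actual trainer adapted from the IERG5350 assignment additionally handles advantage estimation, the value loss, and an entropy bonus.

```python
# Minimal sketch of the PPO clipped surrogate loss (Schulman et al., 2017).
import torch


def ppo_clip_loss(log_probs_new, log_probs_old, advantages, clip_eps=0.2):
    """Clipped surrogate objective, returned as a loss to be minimized."""
    ratio = torch.exp(log_probs_new - log_probs_old)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    return -torch.min(unclipped, clipped).mean()
```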
python environments/kuka_peg_env.py
[Note] You can collect more data by changing the random seed.
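As an illustration of seed-based data collection (the `collect_rollouts` helper and its arguments are hypothetical, not this repo's API):

```python
# Hypothetical illustration of collecting extra data under different seeds.
import random

import numpy as np


def collect_rollouts(seed, num_episodes=10):
    """Placeholder: run `num_episodes` episodes under `seed` and save the
    (image, force, action) tuples for the multimodal dataset."""
    random.seed(seed)
    np.random.seed(seed)
    # ... create the environment, step it, and dump the recorded samples ...
    return []


if __name__ == "__main__":
    # Each seed yields a different set of trajectories, enlarging the dataset.
    for seed in range(5):
        collect_rollouts(seed)
```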
python multimodal/train_my_fusion_model.py
[Note] Specify the path to the root directory of the multimodal dataset.
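For reference, loading the collected multimodal data from a dataset root might look like the sketch below; the directory layout, file format, and array keys are assumptions, not necessarily what `train_my_fusion_model.py` expects.

```python
# Hypothetical dataset loader; the real multimodal dataset layout may differ.
from pathlib import Path

import numpy as np
from torch.utils.data import Dataset


class MultimodalDataset(Dataset):
    """Assumes each sample was saved as a .npz file with image/force/action arrays."""

    def __init__(self, root_dir):
        self.files = sorted(Path(root_dir).glob("**/*.npz"))

    def __len__(self):
        return len(self.files)

    def __getitem__(self, idx):
        sample = np.load(self.files[idx])
        return sample["image"], sample["force"], sample["action"]
```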