3/29 |
Dueling Network Architectures for Deep Reinforcement Learning, Wang et al, 2015. |
Jiyun Kim |
[paper] [review] |
3/29 |
Sim-to-Real Leaning of All Common Bipedal Gaits via Periodic Reward Composition, J. Siekmann et al, 2020. |
Hansol Kang |
[paper] [review] |
4/5 |
Asynchronous Methods for Deep Reinforcement Learning, Mnih et al, 2016. |
Jiwon Chong |
[paper] [review] |
4/12 |
Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021. |
Chris Ohk |
[paper] [review] |
4/12 |
Hindsight Experience Replay, M. Andrychowicz et al, 2017. |
Donggu Kang |
[paper] [review] |
4/19 |
Addressing Function Approximation Error in Actor-Critic Methods, S. Fujimoto et al, 2018. |
Junhyun Park |
[paper] [review] |
4/19 |
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments, R. Lowe et al, 2017. |
Seungeon Baek |
[paper] [review] |
4/26 |
Generating Text with Deep Reinforcement Learning, H. Guo et al, 2015. |
Wonho Lee |
[paper] [review] |
5/3 |
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor, T. Haarnoja et al, 2018. |
Soohan Kang |
[paper] [review] |
5/10 |
Randomized Ensembled Double Q-Learning: Learning Fast Without a Model, X. Chen et al, 2021. |
Jungyeon Lee |
[paper] [review] |
5/17 |
Efficient Hyperparameters Optimization Through Model-based Reinforcement Learning and Meta-Learning, J. Wu et al, 2020. |
Wonwoo Choi |
[paper] [review] |
5/17 |
Continuous Control with Deep Reinforcement Learning, TP. Lillicrap et al, 2015. |
Bongseok Kim |
[paper] [review] |
5/24 |
Demand-Aware Career Path Recommendations: A Reinforcement Learning Approach, M. Kokkodis et al, 2020. |
Minkyu Shin |
[paper] [review] |
5/24 |
TBA |
Daejin Jo |
[paper] [review] |