Date | Paper | Presenter | Links |
---|---|---|---|
3/29 | Dueling Network Architectures for Deep Reinforcement Learning, Wang et al, 2015. | Jiyun Kim | [paper] [review] |
3/29 | Sim-to-Real Leaning of All Common Bipedal Gaits via Periodic Reward Composition, J. Siekmann et al, 2020. | Hansol Kang | [paper] [review] |
4/5 | Asynchronous Methods for Deep Reinforcement Learning, Mnih et al, 2016. | Jiwon Chong | [paper] [review] |
4/12 | Adversarially Guided Actor-Critic, Y. Flet-Berliac et al, 2021. | Chris Ohk | [paper] [review] |
4/12 | Hindsight Experience Replay, M. Andrychowicz et al, 2017. | Donggu Kang | [paper] [review] |
4/19 | Addressing Function Approximation Error in Actor-Critic Methods, S. Fujimoto et al, 2018. | Junhyun Park | [paper] [review] |
4/19 | Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments, R. Lowe et al, 2017. | Seungeon Baek | [paper] [review] |
4/26 | Generating Text with Deep Reinforcement Learning, H. Guo et al, 2015. | Wonho Lee | [paper] [review] |
5/3 | Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor, T. Haarnoja et al, 2018. | Soohan Kang | [paper] [review] |
5/10 | Randomized Ensembled Double Q-Learning: Learning Fast Without a Model, X. Chen et al, 2021. | Jungyeon Lee | [paper] [review] |
5/17 | Efficient Hyperparameters Optimization Through Model-based Reinforcement Learning and Meta-Learning, J. Wu et al, 2020. | Wonwoo Choi | [paper] [review] |
5/17 | Continuous Control with Deep Reinforcement Learning, TP. Lillicrap et al, 2015. | Bongseok Kim | [paper] [review] |
5/24 | Demand-Aware Career Path Recommendations: A Reinforcement Learning Approach, M. Kokkodis et al, 2020. | Minkyu Shin | [paper] [review] |
5/24 | TBA | Daejin Jo | [paper] [review] |
- Chris Ohk
- Jiyun Kim
- Hansol Kang
- Jiwon Chong
- Donggu Kang
- Junhyun Park
- Seungeon Baek
- Wonho Lee
- Soohan Kang
- Jungyeon Lee
- Wonwoo Choi
- Bongseok Kim
- Minkyu Shin
- Daejin Jo