강화학습 수업정리

1.1. Supervised Learning of Behaviors

post-thumbnail

2.2. Introduction of Reinforcement Learning (Key Concepts of RL)

post-thumbnail

3.3. RL algorithms: Policy-based RL

post-thumbnail

4.4. Off-policy Policy Gradient

post-thumbnail

5.5. Actor-Critic Algorithm

post-thumbnail

6.6. Actor-Critic Design Decisions

post-thumbnail

7.7. Value Function Methods

post-thumbnail