RL

1.What Is Reinforcement Learning?

2.Markov Decision Process(MDP)

3.Value Function

4.Q-function

5.Bellman Equation

6.Policy Iteration

7.Value Iteration

8.Monte Carlo Prediction and Temporal Difference Error

9.SARSA

10.Q-Learning

11.Policy Iteration (code)

12.Value Iteration (code)

13.SARSA (code)

14.Q-Learning (code)

15.Deep SARSA
