David Silver 교수님의 Introduction to Reinforcement Learning (Website)
Lecture 1: Introduction to Reinforcement Learning (Youtube) 강의 내용을 정리했습니다.
RL is all about Decision Making
Reward Hypothesis
All goals can be described by the maximization of expected cumulative reward
is an information(markov) state if and only if:
markov property
혹시 오타나 잘못된 부분이 있다면 댓글로 알려주시면 감사하겠습니다!