02. Bellman Equation

d4r6j·2024년 1월 4일
0

reinforcement-ai

목록 보기
2/7
post-thumbnail

About value function

In MPR (Exercise 1)

In MRP (Markov Reward Process)

Example

In MDP (Exercise 2)

In MDP (Markov Decision Process)

Example

About action-value function

In MDP (Markov Decision Process)

Example

0개의 댓글