Reinforcement Learning

1. Reinforcement Learning #1 MDP: Markov Decision Process
