강화학습

1.Multi-armed bandit problem - (1) (feat. 호구 형님)

post-thumbnail

2.Multi-armed bandit problem - (2) coding

post-thumbnail

3.Finite Markov Decision Process - (1) (feat. kurly)

post-thumbnail

4.Finite Markov Decision Process - (2) notation

post-thumbnail