Our Agent receives state $S_0$ from the Environment (first frame of the game)Based on that state $S_0$, the Agent takes action $A_0$ (agent goes right