@elduvelle_neuro Second, evaluate possible actions at that state, select and see how state change according to your model of the world. It is possible to also evaluate value of different policies (counterfactual thinking for example) at this step. Repeat these two steps, keep the state transition going in your mind.