In a reinforcement learning framework, the Markov property is required for agents to make decisions as a function of the environment’s state signal (i.e., information available to the agent)—it ensures that the state is a sufficient statistic for predicting future states and rewards.
Markov property
A state signal at time has the Markov property, and is a Markov state, if and only if the environment’s response at time can be specified by the distribution
for all and all possible values of past events .