Donneau-Golencer, Thierry D. (2005-10-17)
A large number of sequential decision-making problems in uncertain environments
can be modeled as Markov Decision Processes (MDPs). In such settings, an agent
can observe at each time step the state of the environmen ...