Technical Report
 

A POMDP approximation algorithm that anticipates the need to observe

Público Deposited

Contenido Descargable

Descargar PDF
https://ir.library.oregonstate.edu/concern/technical_reports/gm80hw69f

Descriptions

Attribute NameValues
Creator
Abstract
  • This paper introduces the even-odd POMDP an approximation to POMDPs Partially Observable Markov Decision Problems in which the world is assumed to be fully observable every other time step. This approximation works well for problems with a delayed need to observe. The even-odd POMDP can be converted into an equivalent MDP the 2MDP whose value function, V*[subscript 2MDP], can be combined online with a 2-step lookahead search to provide a good POMDP policy. We prove that this gives an approximation to the POMDPs optimal value function that is at least as good as methods based on the optimal value function of the underlying MDP. We present experimental evidence that the method finds a good policy for a POMDP with states and observations.
  • Keywords: Partially Observable Markov Decision Problems, Even-odd POMDP, POMDP
Resource Type
Fecha Disponible
Fecha de Emisión
Series
Subject
Declaración de derechos
Funding Statement (additional comments about funding)
  • This research was supported by AFOSR F49620-9810375.
Publisher
Peer Reviewed
Language
Replaces

Relaciones

Parents:

This work has no parents.

Elementos