Learning MDP action models via discrete mixture trees

Wynkoop, Michael S.

Graduate Thesis Or Dissertation

Learning MDP action models via discrete mixture trees

Öffentlich Deposited

PDF Herunterladen

Citeable URL: https://ir.library.oregonstate.edu/concern/graduate_thesis_or_dissertations/hh63t030f

Descriptions

Attribute Name	Values
Creator	Wynkoop, Michael S.
Abstract	This thesis addresses the problem of learning dynamic Bayesian network (DBN) models to support reinforcement learning. It focuses on learning regression tree models of the conditional probability distributions of the DBNs. Existing algorithms presume that the stochasticity in the domain can be modeled as a deterministic function with additive noise. This is inappropriate for many RL domains, where the stochasticity takes the form of a random choice over deterministic functions. This paper introduces a regression tree algorithm in which each leaf node is modeled as a finite mixture of deterministic functions. This mixture is approximated via a greedy set cover. To combat overfitting, pruning techniques incorporating log likelihood and KL-Divergence are employed. Experiments on three challenging RL domains, two with stochastic variants, show that this approach finds trees that are more accurate and that are more likely to correctly identify the conditional dependencies in the DBNs based on small samples. Keywords: Regression Tree, Function Mixtures, Dynamic Bayesian Network, Machine Learning
License	All rights reserved
Resource Type	Masters Thesis
Date Available	2008-07-28T15:11:23+00:00
Date Issued	2008-06-09
Degree Level	Master's
Degree Name	Master of Science (M.S.)
Degree Field	Computer Science
Degree Grantor	Oregon State University
Commencement Year	2009
Advisor	Dietterich, Thomas G.
Committee Member	Fern, Alan Tadepalli, Prasad
Academic Affiliation	Electrical Engineering and Computer Science
Non-Academic Affiliation	Oregon State University. Graduate School
Subject	Reinforcement learning (Machine learning) -- Mathematical models
Urheberrechts-Erklärung	In Copyright
Publisher	Oregon State University
Peer Reviewed	No
Language	English [eng]
Replaces	http://hdl.handle.net/1957/9096

Beziehungen

Parents:

This work has no parents.

In Collection:

Graduate Theses and Dissertations (GTD)

Artikel

Miniaturansicht	Titel	Hochladedatum	Sichtbarkeit	Aktionen
	Wynkoop.pdf	2017-08-19	Öffentlich	Herunterladen

Hyrax

Learning MDP action models via discrete mixture trees

Herunterladbarer Inhalt

Descriptions

Beziehungen

Artikel