Index Catalog // ScholarsArchive@OSU

Learning cost-sensitive diagnostic policies from data

Creator:: Zubek, Valentina Bayer
Abstract:: In its simplest form, the process of diagnosis is a decision-making process in which the diagnostician performs a sequence of tests culminating in a diagnostic decision. For example, a physician might perform a series of simple measurements (body tem- perature, weight, etc.) and laboratory measurements (white blood count, CT scan,...
Resource Type:: Dissertation
Full Text:: by Valentina Bayer Zubek A DISSERTATION submitted to Oregon State University in partial ful

AO* revisited

Creator:: Bayer-Zubek, Valentina and Dietterich, Thomas Glen
Abstract:: Preprint submitted to Elsevier Science 30 June 2004.
Resource Type:: Article
Full Text:: paper.dvi AO � Revisited Valentina Bayer Zubek �� Thomas G

Integrating learning from examples into the search for diagnostic policies

Creator:: Oregon State University. Dept. of Computer Science, Bayer-Zubek, Valentina, and Dietterich, Thomas Glen
Abstract:: This paper studies the problem of learning diagnostic policies from training examples. A diagnostic policy is a complete description of the decision-making actions of a diagnostician (i.e., tests followed by a diagnostic decision) for all possible combinations of test results. An optimal diagnostic policy is one that minimizes the expected...
Resource Type:: Research Paper
Full Text:: ∗ Valentina Bayer-Zubek bayer@cs.orst.edu Thomas G. Dietterich tgd@cs.orst.edu School of Electrical

Learning diagnostic policies from examples by systematic search

Creator:: Oregon State University. Dept. of Computer Science and Zubek, Valentina Bayer
Abstract:: A diagnostic policy species what test to perform next based on the results of previous tests and when to stop and make a diagnosis. Cost-sensitive diagnostic policies perform tradeoffs between (a) the costs of tests and (b) the costs of misdiagnoses. An optimal diagnostic policy minimizes the expected total cost....
Resource Type:: Research Paper
Full Text:: Systematic Search Valentina Bayer�Zubek School of Electrical Engineering and Computer Science Oregon State

A POMDP approximation algorithm that anticipates the need to observe

Creator:: Oregon State University. Dept. of Computer Science, Zubek, Valentina Bayer, and Dietterich, Thomas Glen
Abstract:: This paper introduces the even-odd POMDP an approximation to POMDPs Partially Observable Markov Decision Problems in which the world is assumed to be fully observable every other time step. This approximation works well for problems with a delayed need to observe. The even-odd POMDP can be converted into an equivalent...
Resource Type:: Research Paper
Full Text:: Need to Observe Valentina Bayer Zubek and Thomas Dietterich Department of Computer Science� Oregon

Two heuristics for solving POMDPs having a delayed need to observe

Creator:: Oregon State University. Dept. of Computer Science, Zubek, Valentina Bayer, and Dietterich, Thomas Glen
Abstract:: A common heuristic for solving Partially Observable Markov Decision Problems POMDPs is to first solve the underlying Markov Decision Process MDP and then construct a POMDP policy by performing a fixed depth lookahead search in the POMDP and evaluating the leaf nodes using the MDP value function. A problem with...
Resource Type:: Research Paper
Full Text:: Need to Observe Valentina Bayer Zubek and Thomas Dietterich bayer�cs�orst�edu tgd�cs�orst�edu

Pruning improves heuristic search for cost-sensitive learning

Creator:: Oregon State University. Dept. of Computer Science, Zubek, Valentina Bayer, and Dietterich, Thomas Glen
Abstract:: This paper addresses cost-sensitive classification in the setting where there are costs for measuring each attribute as well as costs for misclassification errors. We show how to formulate this as a Markov Decision Process in which the transition model is learned from the training data. Specifically we assume a set...
Resource Type:: Research Paper
Full Text:: Learning Valentina Bayer Zubek bayer�cs�orst�edu Thomas G� Dietterich tgd�cs�orst�edu Department of

A POMDP approximation algorithm that anticipates the need to observe

Creator:: Oregon State University. Dept. of Computer Science, Bayer, Valentina, and Dietterich, Thomas Glen
Abstract:: This paper introduces the even-odd POMDP, an approximation to POMDPs in which the world is assumed to be fully observable every other time step. The even-odd POMDP can be converted into an equivalent MDP, the 2MDP, whose value function, V*[subscript 2MDP], can be combined online with a 2-step lookahead search...
Resource Type:: Research Paper
Full Text:: Observe Valentina Bayer bayer@cs.orst.edu Thomas Dietterich tgd@cs.orst.edu Department of Computer

Approximation algorithms for solving cost observable Markov decision processes

Creator:: Oregon State University. Dept. of Computer Science and Bayer, Valentina
Abstract:: "The specifi c problem addressed in this proposal is the development of good approximation algorithms for solving problems that have partial observability. The model we propose associates costs with obtaining information about the current state. We want to predict when and how much it is necessary to observe. We want...
Resource Type:: Research Paper
Full Text:: Decision Processes Ph.D. Proposal Department of Computer Science Oregon State University Valentina

Model-based approximation methods for reinforcement learning

Creator:: Wang, Xin
Abstract:: The thesis focuses on model-based approximation methods for reinforcement learning with large scale applications such as combinatorial optimization problems. First, the thesis proposes two new model-based methods to stablize the value–function approximation for reinforcement learning. The first one is the BFBP algorithm, a batch-like reinforcement learning process which iterates between...
Resource Type:: Dissertation
Full Text:: , encouragement, and help during the first couple of years, and Bill Langford, Valentina Bayer Zubek, and Dragos

ScholarsArchive@OSU

Learning cost-sensitive diagnostic policies from data

AO* revisited

Integrating learning from examples into the search for diagnostic policies

Learning diagnostic policies from examples by systematic search

A POMDP approximation algorithm that anticipates the need to observe

Two heuristics for solving POMDPs having a delayed need to observe

Pruning improves heuristic search for cost-sensitive learning

A POMDP approximation algorithm that anticipates the need to observe

Approximation algorithms for solving cost observable Markov decision processes

Model-based approximation methods for reinforcement learning

Limit your search

Academic Affiliation

Advisor

Commencement Year

Committee Member

Creator

Date

Decade

Degree Field

Degree Level

Degree Name

File Format

Language

License

Non-Academic Affiliation

Peer Reviewed

Resource Type

Rights Statement

Subject

Search Constraints

Search Results

Limit your search