Index Catalog // ScholarsArchive@OSU

1. Model-based approximation methods for reinforcement learning

Creator:: Wang, Xin
Abstract:: The thesis focuses on model-based approximation methods for reinforcement learning with large scale applications such as combinatorial optimization problems. First, the thesis proposes two new model-based methods to stablize the value–function approximation for reinforcement learning. The first one is the BFBP algorithm, a batch-like reinforcement learning process which iterates between...
Resource Type:: Dissertation
Full Text:: - mark problem. 2.1.2 GrowSupport, Rout, and BFBP Consider a deterministic, episodic MDP. Let s′ = a(s

2. Activity recognition in desktop environments

Creator:: Shen, Jianqiang
Abstract:: Knowledge workers are struggling in the information flood. There is a growing interest in intelligent desktop environments that help knowledge workers organize their daily life. Intelligent desktop environments allow the desktop user to define a set of “activities” that characterize the user’s desktop work. These environments then attempt to identify...
Resource Type:: Dissertation
Full Text:: Kirschner, Stephen Kolibaba, Twinkle Lettkeman, Ben Porter, Anh Tran, Mark Vulfson. I thank Dr. Hung Bui of

3. Interactive fault localization techniques to empower the debugging efforts of end-user programmers

Creator:: Ruthruff, Joseph Ronald
Abstract:: End users develop more software than any other group of programmers, using software authoring devices such as e-mail filtering editors, by-demonstration macro builders, and spreadsheet environments. Despite this, there has been only a little research on finding ways to help these programmers with the dependability of the software they create....
Resource Type:: Masters Thesis
Full Text:: places an X-mark in each cell’s decision box. . .. . . . . . . 13 3.2 Algorithm #1 — The MarkPlaced

4. Simulator-Defined MDP Planning with Applications in Natural Resource Management

Creator:: Alkaee Taleghan, Majid
Abstract:: This work is inspired by problems in natural resource management centered on the challenge of invasive species. Computing optimal management policies for maintaining ecosystem sustainable is challenging. Many ecosystem management problems can be formulated as MDP (Markov Decision Process) planning problems. In a simulator-defined MDP, the Markovian dynamics and rewards...
Resource Type:: Dissertation
Full Text:: would also like to thank Kim Hall, Professor H. Jo Albers, and Mark Crowley for their collaboration on

5. Learning and Improving Policies for Probabilistic Planning Problems

Creator:: Issakkimuthu, Murugeswari
Abstract:: In this work, we study the problem of learning and improving policies for probabilistic planning problems. In the first part, we train neural network policies for probabilistic planning problems modeled as factored Markov decision problems. The objective is to train problem-specific neural networks via supervised learning to imitate the action...
Resource Type:: Dissertation
Full Text:: , Prof. N. S. Narayanaswamy and Prof. Deepak Khemani at IIT Madras, and Prof. Anselm Blumer at Tufts

6. Improving automated email tagging with implicit feedback

Creator:: Sorower, Mohammad Shahed
Abstract:: Machine learning systems are generally trained offline using ground truth data that has been labeled by experts. However, these batch training methods are not a good fit for many applications, especially in the cases where complete ground truth data is not available for offline training. In addition, batch methods do...
Resource Type:: Dissertation
Full Text:: AN ABSTRACT OF THE DISSERTATION OF Mohammad S. Sorower

7. Machine learning methods for public policy : simulation, optimization, and visualization

Creator:: McGregor, Sean
Abstract:: Society faces many complex management problems, particularly in the area of shared public resources such as ecosystems. Existing decision making processes are often guided by personal experience and political ideology rather than state-of-the-art scientific understanding. This dissertation envisions a future in which multiple stakeholders are provided with computational tools for...
Resource Type:: Dissertation
Full Text:: designated start state distribution [7, 48] M = 〈S,A, P,R, γ, P0〉. S is a finite set of states of the world

8. Structured learning with latent variables : theory and algorithms

Creator:: Zhao, Kai
Abstract:: Most tasks in natural language processing (NLP) try to map structured input (e.g., sentence or word sequence) to some form of structured output (tag sequence, parse tree, semantic graph, translated/paraphrased/compressed sentence), a problem known as “structured prediction”. While various learning algorithms such as the perceptron, maximum entropy, and expectation-maximization have...
Resource Type:: Dissertation
Full Text:: nodes to the premise tree nodes. The blue dashed lines mark the entailment relations, and the red

9. Improving and Understanding Deep Models for Natural Language Comprehension

Creator:: Ghaeini, Reza
Abstract:: Natural Language Comprehension is a challenging domain of Natural Language Processing. To improve a model’s language comprehension/understanding, one approach would be to enrich the structure of the model to enhance its capability in learning the latent rules of the language. In this dissertation, we will ﬁrst introduce several deep models...
Resource Type:: Dissertation
Full Text:: Categorical performance analyses (accuracy) of ESIM [11], DR-BiLSTM (DR(S)) and Ensemble DR-BiLSTM (DR(E)) on

10. UCT for tactical assault battles in real-time strategy games

Creator:: Balla, Radha-Krishna
Abstract:: We consider the problem of tactical assault planning in real-time strategy games where a team of friendly agents must launch an assault on an enemy. This problem offers many challenges including a highly dynamic and uncertain environment, multiple agents, durative actions, numeric attributes, and different optimization objectives. While the dynamics...
Resource Type:: Masters Thesis
Full Text:: , most notably the game of Go (see [5] and [4]). UCT‟s ability to deal with the large state-space of Go

ScholarsArchive@OSU

1. Model-based approximation methods for reinforcement learning

2. Activity recognition in desktop environments

3. Interactive fault localization techniques to empower the debugging efforts of end-user programmers

4. Simulator-Defined MDP Planning with Applications in Natural Resource Management

5. Learning and Improving Policies for Probabilistic Planning Problems

6. Improving automated email tagging with implicit feedback

7. Machine learning methods for public policy : simulation, optimization, and visualization

8. Structured learning with latent variables : theory and algorithms

9. Improving and Understanding Deep Models for Natural Language Comprehension

10. UCT for tactical assault battles in real-time strategy games

Limit your search

Academic Affiliation

Advisor

Commencement Year

Committee Member

Creator

Contributor

Date

Decade

Degree Field

Degree Level

Degree Name

File Format

Language

License

Non-Academic Affiliation

Peer Reviewed

Resource Type

Rights Statement

Subject

Search Constraints

Search Results

1. Model-based approximation methods for reinforcement learning

2. Activity recognition in desktop environments

3. Interactive fault localization techniques to empower the debugging efforts of end-user programmers

4. Simulator-Defined MDP Planning with Applications in Natural Resource Management

5. Learning and Improving Policies for Probabilistic Planning Problems

6. Improving automated email tagging with implicit feedback

7. Machine learning methods for public policy : simulation, optimization, and visualization

8. Structured learning with latent variables : theory and algorithms

9. Improving and Understanding Deep Models for Natural Language Comprehension

10. UCT for tactical assault battles in real-time strategy games

Limit your search