Index Catalog // ScholarsArchive@OSU

1. Model-based approximation methods for reinforcement learning

Creator:: Wang, Xin
Abstract:: The thesis focuses on model-based approximation methods for reinforcement learning with large scale applications such as combinatorial optimization problems. First, the thesis proposes two new model-based methods to stablize the value–function approximation for reinforcement learning. The first one is the BFBP algorithm, a batch-like reinforcement learning process which iterates between...
Resource Type:: Dissertation
Full Text:: . . . . . . . . . . 126 4.6.3 On Discounted Finite Horizon MDPs . . . . . . . . . . . 131 4.6.4 On ∣ ∣ ∣ V π(s; θk)− V̂

2. Learning ranking functions for efficient search

Creator:: Xu, Yuehua
Abstract:: This dissertation explores algorithms for learning ranking functions to efficiently solve search problems, with application to automated planning. Specifically, we consider the frameworks of beam search, greedy search, and randomized search, which all aim to maintain tractability at the cost of not guaranteeing completeness nor optimality. Our learning objective for...
Resource Type:: Dissertation
Full Text:: consider this problem for the case of linear ranking functions, where each search node v is associated

3. Shrunken learning rates do not improve AdaBoost on benchmark datasets

Creator:: Forrest, Daniel L. K.
Abstract:: Recent work has shown that AdaBoost can be viewed as an algorithm that maximizes the margin on the training data via functional gradient descent. Under this interpretation, the weight computed by AdaBoost, for each hypothesis generated, can be viewed as a step size parameter in a gradient descent search. Friedman...
Resource Type:: Masters Thesis
Full Text:: 5.1 The relationship of the chosen stopping point to the value of the shrinkage parameter v

4. Activity recognition in desktop environments

Creator:: Shen, Jianqiang
Abstract:: Knowledge workers are struggling in the information flood. There is a growing interest in intelligent desktop environments that help knowledge workers organize their daily life. Intelligent desktop environments allow the desktop user to define a set of “activities” that characterize the user’s desktop work. These environments then attempt to identify...
Resource Type:: Dissertation
Full Text:: ., keeping track of fantasy football results). We are also interested in design- 11 ac t i v i t y a d v

5. Data Collection in Sensor Networks via the Novel Fast Markov Decision Process Framework

Creator:: Duong, Thai
Abstract:: We investigate the data collection problem in sensor networks. The network consists of a number of stationary sensors deployed at different sites for sensing and storing data locally. A mobile element moves from sites to sites to collect data from the sensors periodically. There are different costs associated with the...
Resource Type:: Masters Thesis
Full Text:: selects one action at each decision epoch. Every policy Π is associated with a value function V Π such

6. Machine learning methods for public policy : simulation, optimization, and visualization

Creator:: McGregor, Sean
Abstract:: Society faces many complex management problems, particularly in the area of shared public resources such as ecosystems. Existing decision making processes are often guided by personal experience and political ideology rather than state-of-the-art scientific understanding. This dissertation envisions a future in which multiple stakeholders are provided with computational tools for...
Resource Type:: Dissertation
Full Text:: independencies (MFMCi) . . . . . . . . . . . . 57 3.4.2 Bias and Variance Bound on V πMFMCi(s0

7. Effectiveness of integration of internet technologies in support of teaching and learning computer science programming

Creator:: Hajebi, Mojgan
Abstract:: In spite of wide spread adoption of the internet technology in teaching and learning, little research exists on the use and the effect of web incorporation in teaching and learning. Immediate clarification of how, when, and for whom the web integration in education is beneficial is needed. This qualitative study...
Resource Type:: Dissertation
Full Text:: .................................................................................................. 140 CHAPTER V: DISCUSSION AND IMPLICATIONS .......................... 146 Introduction

8. New learning modes for sequential decision making

Creator:: Judah, Kshitij
Abstract:: This thesis considers the problem in which a teacher is interested in teaching action policies to computer agents for sequential decision making. The vast majority of policy learning algorithms o er teachers little flexibility in how policies are taught. In particular, one of two learning modes is typically considered: 1)...
Resource Type:: Dissertation

9. Incorporating and Learning Behavior Constraints for Sequential Decision Making

Creator:: Pinto, Jervis
Abstract:: Writing a program that performs well in a complex environment is a challenging task. In such problems, a method of deterministic programming combined with reinforcement learning (RL) can be helpful. However, current systems either force developers to encode knowledge in very specific forms (e.g., state-action features), or assume advanced RL...
Resource Type:: Dissertation
Full Text:: the ability to compute the following two properties of an adaptive program during its execution. v

10. Adaptive Multiagent Traffic Management for Autonomous Robotic Systems

Creator:: Rebhuhn, Carrie
Abstract:: There is growing commercial interest in the use of unmanned aerial vehicles (UAVs) in urban environments, specifically for package delivery applications. However, the size, complexity and sheer numbers of expected UAVs makes conventional air traffic management that relies on human air traffic controllers infeasible. To enable UAVs to safely and...
Resource Type:: Dissertation
Full Text:: V do 2: d[u] := f [u] := infinity initialize vertex u 3: color[u] = WHITE 4: p[u] := u 5: end for 6

ScholarsArchive@OSU

1. Model-based approximation methods for reinforcement learning

2. Learning ranking functions for efficient search

3. Shrunken learning rates do not improve AdaBoost on benchmark datasets

4. Activity recognition in desktop environments

5. Data Collection in Sensor Networks via the Novel Fast Markov Decision Process Framework

6. Machine learning methods for public policy : simulation, optimization, and visualization

7. Effectiveness of integration of internet technologies in support of teaching and learning computer science programming

8. New learning modes for sequential decision making

9. Incorporating and Learning Behavior Constraints for Sequential Decision Making

10. Adaptive Multiagent Traffic Management for Autonomous Robotic Systems

Limit your search

Academic Affiliation

Advisor

Commencement Year

Committee Member

Creator

Date

Decade

Degree Field

Degree Level

Degree Name

File Format

Language

License

Non-Academic Affiliation

Peer Reviewed

Resource Type

Rights Statement

Subject

Search Constraints

Search Results

1. Model-based approximation methods for reinforcement learning

2. Learning ranking functions for efficient search

3. Shrunken learning rates do not improve AdaBoost on benchmark datasets

4. Activity recognition in desktop environments

5. Data Collection in Sensor Networks via the Novel Fast Markov Decision Process Framework

6. Machine learning methods for public policy : simulation, optimization, and visualization

7. Effectiveness of integration of internet technologies in support of teaching and learning computer science programming

8. New learning modes for sequential decision making

9. Incorporating and Learning Behavior Constraints for Sequential Decision Making

10. Adaptive Multiagent Traffic Management for Autonomous Robotic Systems

Limit your search