Index Catalog // ScholarsArchive@OSU

1. Adversarial planning by strategy switching in a real-time strategy game

Creator:: King, Brian D.
Abstract:: We consider the problem of strategic adversarial planning in a Real-Time Strategy (RTS) game. Strategic adversarial planning is the generation of a network of high-level tasks to satisfy goals while anticipating an adversary's actions. In this thesis we describe an abstract state and action space used for planning in an...
Resource Type:: Masters Thesis
Full Text:: . . . . . . . . . . . . . . . . . . . . . . 41 6 Results 43 6.1 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

2. Learning and Improving Policies for Probabilistic Planning Problems

Creator:: Issakkimuthu, Murugeswari
Abstract:: In this work, we study the problem of learning and improving policies for probabilistic planning problems. In the first part, we train neural network policies for probabilistic planning problems modeled as factored Markov decision problems. The objective is to train problem-specific neural networks via supervised learning to imitate the action...
Resource Type:: Dissertation
Full Text:: Feature-based Aggregation Framework . . . . . . . . . . . . . . . . . . 76 6 Conclusion 80 7 Appendix 82

3. Re-understanding Finite-State Representations of Recurrent Policy Networks

Creator:: Danesh, Mohamad H.
Abstract:: We propose an approach for understanding control policies represented as recurrent neural networks. Recent work has approached this problem by transforming such recurrent policy networks into finite-state machines (FSM) and then analyzing the equivalent minimized FSM. While this led to interesting insights, the minimization process can obscure a deeper understanding...
Resource Type:: Masters Thesis
Full Text:: ) removes unnecessary branches, leaving an open-loop policy. . . . . . . . . 6 3.2 Differential Attention

4. Monte Carlo Tree Search with Fixed and Adaptive Abstractions

Creator:: Hostetler, Jesse A.
Abstract:: Monte Carlo tree search (MCTS) is a class of online planning algorithms for Markov decision processes (MDPs) and related models that has found success in challenging applications. In the online planning approach, the agent makes a decision in the current state by performing a limited forward search over possible futures...
Resource Type:: Dissertation
Full Text:: . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.2 Solving MDPs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2.2.1

5. New learning modes for sequential decision making

Creator:: Judah, Kshitij
Abstract:: This thesis considers the problem in which a teacher is interested in teaching action policies to computer agents for sequential decision making. The vast majority of policy learning algorithms o er teachers little flexibility in how policies are taught. In particular, one of two learning modes is typically considered: 1)...
Resource Type:: Dissertation
Full Text:: Ellwood and Dr. Bill Jarrold were involved in the research presented in chapter 6. TABLE OF CONTENTS

6. Toward computer vision for understanding American football in video

Creator:: Hess, Robin W.
Abstract:: In this work, I examine the problem of understanding American football in video. In particular, I present several mid-level computer vision algorithms that each accomplish a different sub-task within a larger system for annotating, interpreting, and analyzing collections of American football video. The analysis of football video is useful in...
Resource Type:: Dissertation
Full Text:: . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78 5.6 Summary and Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 6

7. Scheduling and Online Planning in Stochastic Diffusion Networks

Creator:: Xue, Shan
Abstract:: Diﬀusion processes in networks are common models for many domains, including species colonization, information/idea cascade, disease propagation and ﬁre spreading. In diﬀusion networks, a diﬀusion event occurs when a behavior spreads from one node to the other following a probabilistic model, where the behavior could be species, an idea, a...
Resource Type:: Dissertation
Full Text:: a HOP action for immediate execution. Compared to standard 6 implementations of HOP that

8. Incorporating and Learning Behavior Constraints for Sequential Decision Making

Creator:: Pinto, Jervis
Abstract:: Writing a program that performs well in a complex environment is a challenging task. In such problems, a method of deterministic programming combined with reinforcement learning (RL) can be helpful. However, current systems either force developers to encode knowledge in very specific forms (e.g., state-action features), or assume advanced RL...
Resource Type:: Dissertation
Full Text:: Go [26]. 6 In a typical LTS application, performance usually improves with increasing search

ScholarsArchive@OSU

1. Adversarial planning by strategy switching in a real-time strategy game

2. Learning and Improving Policies for Probabilistic Planning Problems

3. Re-understanding Finite-State Representations of Recurrent Policy Networks

4. Monte Carlo Tree Search with Fixed and Adaptive Abstractions

5. New learning modes for sequential decision making

6. Toward computer vision for understanding American football in video

7. Scheduling and Online Planning in Stochastic Diffusion Networks

8. Incorporating and Learning Behavior Constraints for Sequential Decision Making

Limit your search

Academic Affiliation

Advisor

Commencement Year

Committee Member

Creator

Contributor

Date

Decade

Degree Field

Degree Level

Degree Name

Language

License

Non-Academic Affiliation

Peer Reviewed

Resource Type

Rights Statement

Subject

Search Constraints

Search Results

1. Adversarial planning by strategy switching in a real-time strategy game

2. Learning and Improving Policies for Probabilistic Planning Problems

3. Re-understanding Finite-State Representations of Recurrent Policy Networks

4. Monte Carlo Tree Search with Fixed and Adaptive Abstractions

5. New learning modes for sequential decision making

6. Toward computer vision for understanding American football in video

7. Scheduling and Online Planning in Stochastic Diffusion Networks

8. Incorporating and Learning Behavior Constraints for Sequential Decision Making

Limit your search