Q-Network Policy Selection For Multiagent Learning

Cook, Joshua

Honors College Thesis

Citeable URL: https://ir.library.oregonstate.edu/concern/honors_college_theses/gq67jx66z

Descriptions

Attribute Name	Values
Creator	Cook, Joshua
Abstract	Controllers for robotic systems can be complex and difficult to write by hand. Learning offers a way to improve a controller through direct feedback. Learning is not trivial, however, because the feedback does not tell the agent how to improve, only how well its current actions solve the given task. Sparsely rewarded domains make learning harder still, as the agent receives less feedback to learn from. This difficulty is compounded in multiagent domains, which require complex coordination to complete the global objective. Although dense rewards are typically easier to learn from, they are not always easy to define; many problems are inherently sparsely rewarded. This work presents an algorithm for learning complex coordination in sparsely rewarded multiagent domains. The algorithm is split into two steps: the first learns a set of skills, and the second learns when to use each skill. Experimental evidence is presented showing the effectiveness of the algorithm in a modified version of the rover domain. The algorithm also seeks to provide more explainable learned policies than traditional black-box learners. Key Words: Reinforcement Learning, Multiagent, Multi-Reward, Sparse Reward
License	All rights reserved
Resource Type	Honors College Thesis
Date Issued	2019-06-05
Degree Level	Bachelor's
Degree Name	Honors Bachelor of Science (H.B.S.)
Degree Field	Mechanical Engineering
Degree Grantor	Oregon State University
Commencement Year	2019
Advisor	Tumer, Kagan
Committee Member	Hollinger, Geoff; Yates, Connor
Non-Academic Affiliation	Oregon State University. Honors College
Subject	Reinforcement learning
Rights Statement	In Copyright
Publisher	Oregon State University
Peer Reviewed	No
Language	English [eng]

Relationships

Parents:

This work has no parents.

In Collection:

Honors College Theses (HCT)

Items

Title	Date Uploaded	Visibility
CookJoshuaM2019.pdf	2019-06-05	Public

ScholarsArchive@OSU
