Learning from action not taken in multiagent systems

Khani, Newsha

Graduate Thesis Or Dissertation

Public Deposited

Citeable URL: https://ir.library.oregonstate.edu/concern/graduate_thesis_or_dissertations/td96k517j

Descriptions

Attribute Name	Values
Creator	Khani, Newsha
Abstract	Coordinating large multiagent systems to achieve a system-level goal is a critical challenge. Even when the agents intend to cooperate, there is no guarantee that their actions will lead to good system-level performance, especially as the system grows large. One of the primary difficulties in such multiagent systems is the slow learning process. Agents need to learn how to interact with other agents in a complex and dynamic system while adapting in the presence of other agents that are simultaneously learning. This thesis presents a unique multiagent learning approach that significantly improves both learning speed and system-level performance by having an agent update its estimate of the reward (e.g., the value function in reinforcement learning) for all of its available actions, not just the action that was taken. The method is based on the agent receiving a reward for the actions it does not take, by estimating the counterfactual reward it would have received had it taken those actions. The experimental results illustrate that rewards on such "actions not taken" (ANT) are helpful early in the learning process. The agents then use their team members to estimate these rewards, so that learning takes place principally as a team. Finally, it is shown that fast learning is essential in a dynamic environment: the ANT reward with teams improves learning speed, which makes the agents more stable in tracking changes in such an environment.
License	All rights reserved
Resource Type	Masters Thesis
Date Available	2009-06-26T22:12:29+00:00
Date Issued	2009-05-29
Degree Level	Master's
Degree Name	Master of Science (M.S.)
Degree Field	Mechanical Engineering
Degree Grantor	Oregon State University
Commencement Year	2009
Advisor	Tumer, Kagan
Committee Member	Schmitt, John; Baten, Blenda; Prasad, Tadepalli
Academic Affiliation	Mechanical, Industrial, and Manufacturing Engineering
Non-Academic Affiliation	Oregon State University. Graduate School
Subject	Intelligent agents (Computer software); Distributed artificial intelligence
Rights Statement	In Copyright
Publisher	Oregon State University
Peer Reviewed	No
Language	English [eng]
Replaces	http://hdl.handle.net/1957/11949

Relationships

Parents:

This work has no parents.

In Collection:

Graduate Theses and Dissertations (GTD)

Items

Title	Date Uploaded	Visibility
Newsha_thesis.pdf	2017-08-08	Public
