H-learning : a reinforcement learning method to optimize undiscounted average reward Public Deposited

http://ir.library.oregonstate.edu/concern/technical_reports/2j62s6254

Descriptions

Attribute NameValues
Creator
Abstract or Summary
  • In this paper, we introduce a model-based reinforcement learning method called H-learning, which optimizes undiscounted average reward. We compare it with three other reinforcement learning methods in the domain of scheduling Automatic Guided Vehicles, transportation robots used in modern manufacturing plants and facilities. The four methods differ along two dimensions. They are either model-based or model-free, and optimize discounted total reward or undiscounted average reward. Our experimental results indicate that H-learning is more robust with respect to changes in the domain parameters, and in many cases, converges in fewer steps to better average reward per time step than all the other methods. An added advantage is that unlike the other methods it does not have any parameters to tune.
Resource Type
Date Available
Date Issued
Series
Subject
Rights Statement
Funding Statement (additional comments about funding)
Publisher
Peer Reviewed
Language
Replaces
Additional Information
  • description.provenance : Approved for entry into archive by Laura Wilson(laura.wilson@oregonstate.edu) on 2012-04-17T23:01:34Z (GMT) No. of bitstreams: 1 H learning a reinforcement learning method for optimizing undiscounted average reward.pdf: 365285 bytes, checksum: 0a96d861647cfb2606bfc4fb5b53637f (MD5)
  • description.provenance : Made available in DSpace on 2012-04-17T23:01:34Z (GMT). No. of bitstreams: 1 H learning a reinforcement learning method for optimizing undiscounted average reward.pdf: 365285 bytes, checksum: 0a96d861647cfb2606bfc4fb5b53637f (MD5) Previous issue date: 1994-05-12
  • description.provenance : Submitted by Laura Wilson (laura.wilson@oregonstate.edu) on 2012-04-17T23:00:44Z No. of bitstreams: 1 H learning a reinforcement learning method for optimizing undiscounted average reward.pdf: 365285 bytes, checksum: 0a96d861647cfb2606bfc4fb5b53637f (MD5)

Relationships

In Administrative Set:
Last modified: 07/18/2017

Downloadable Content

Download PDF
Citations:

EndNote | Zotero | Mendeley

Items