Oregon State University. Dept. of Computer Science; Tadepalli, Prasad; Ok, DoKyeong (Corvallis, OR : Oregon State University, Dept. of Computer Science, 1994-05-12)
In this paper, we introduce a model-based reinforcement learning method called H-learning, which optimizes undiscounted average reward. We compare it with three other reinforcement learning methods in the domain of sched ...