MLA

State University. Dept. of Computer Science, Oregon, Prasad Tadepalli, and DoKyeong Ok. H-learning : a Reinforcement Learning Method to Optimize Undiscounted Average Reward. : Corvallis, OR : Oregon State University, Dept. of Computer Science, 1994.