×
Log in
Upload File
Most Popular
Study
Business
Design
Technology
Travel
Explore all categories
The top documents tagged [s expected reward]
Reinforcement Learning : A Beginners Tutorial
5.264 views
Between MDPs and Semi-MDPs: Learning, Planning and Representing Knowledge at Multiple Temporal Scales Richard S. Sutton Doina Precup University of Massachusetts
219 views
Richard S. Sutton Doina Precup University of Massachusetts Satinder Singh University of Colorado
17 views