×
Log in
Upload File
Most Popular
Study
Business
Design
Technology
Travel
Explore all categories
The top documents tagged [greedy policy]
1 Reinforcement Learning: Learning algorithms Function Approximation Yishay Mansour Tel-Aviv University
219 views
CoMotion Computational Methods for Collaborative Motion Pursuit Evasion Games for Networks of UUVs November 2004 Mike Eklund, Jonathan Sprinkle, Shankar
219 views
Reinforcement Learning CSE 446 – Winter 2012
38 views
Chapter 4: Dynamic Programming
81 views
Reinforcement Learning: Learning algorithms
62 views
Summary of MDPs (until Now) Finite-horizon MDPs – Non-stationary policy – Value iteration Compute V 0..V k.. V T the value functions for k stages to go
220 views
Off-Policy Temporal-Difference Learning with Function Approximation Doina Precup McGill University Rich Sutton Sanjoy Dasgupta AT&T Labs
219 views
RL for Large State Spaces: Value Function Approximation
40 views
Concurrent Probabilistic Temporal Planning (CPTP)
37 views
Reinforcement Learning: Learning algorithms
64 views
RL for Large State Spaces: Value Function Approximation
24 views
Summary of MDPs (until Now)
43 views
< Prev
Next >