getCITED   
  Home     Search     Add Content     Reports     Help  
Edit Publication | Edit Contributors | Delete Publication | Edit References | Edit Citations
Add to Bookstack | Show Bookstack | Change Bookstack

On the Convergence of Off-line Temporal Difference Learning with Function Approximation

Post a Comment
CONTRIBUTORS:
  Author Li, Lihong (Rutgers University New Brunswick)
INSTITUTION ID:
  University of Alberta  (Edmonton)
SERIES TITLE:
 
YEAR: 2003
PUB TYPE: Working Paper/Manuscript
WORKING PAPER NUMBER: None
PAGES: 10 p.
SUBJECT(S): off-line reinforcement learning, function approximation, linear function approximation, temporal-difference learning.
DISCIPLINE: Computer Science
HTTP:
LANGUAGE: English
PUB ID: 103-397-080 (Last edited on 2003/11/21 20:51:22 US/Mountain)
SPONSOR(S):
 
ABSTRACT:
A standard reinforcement learning agent learns the optimal policy to achieve its goal, or the value function of a given policy, while interacting with the environment. This paper considers a variation of this standard framework, where the agent learns the value function of a control policy off-line, based on a fixed set of training data. Parameterized function approximation is used and a general update rule is derived. But it has been known that online reinforcement learning algorithms with function approximation can diverge in some cases. This paper considers the convergence property for off-line reinforcement learning when function approximation is used. More extensive analytical convergence analysis is done for the linear function approximation case.
STATISTICS
Click on # to view
 Citations  
 References  
 Comments  
 Quality      0/0.00 
 Interest      0/0.00 
 View(er)s   3/219 
Quality
  N/A
High
  7
  6
  5
  4
  3
  2
  1
Low
Interest
  N/A
High
  7
  6
  5
  4
  3
  2
  1
Low
Prev | Next

    ABOUT getCITED   |    CONTACT US   |    USER INFO   |    PREFERENCES   |    PRIVACY   |    LOG IN   
Comments? Suggestions? Send them to feedback@getCITED.org.

Copyright © 2000-2006 getCITED Inc. All Rights Reserved.