【PPT】 Least squares temporal difference learning

Date: 2023-03-08 18:10:18

Least-squares temporal difference learning

Original source:

https://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=9&cad=rja&uact=8&ved=2ahUKEwjD6qn5x8zhAhVSuZ4KHfJTCyUQFjAIegQIBBAC&url=https%3A%2F%2Fiu.instructure.com%2Ffiles%2F69696547%2Fdownload%3Fdownload_frd%3D1&usg=AOvVaw1uyAuK3zMTxZ7COM1SrJE7

------------------------------------------------------------------------------------------------------

LSTD

Bradtke and Barto (1996). Linear least-squares algorithms for temporal difference learning.

Geramifard et al. (2006). Incremental Least-Squares Temporal Difference Learning.

Szepesvári (2009). Algorithms for Reinforcement Learning.

LSTD(λ)

Boyan (2002). Technical Update: Least-Squares Temporal Difference Learning.

Gehring et al. (2016). Incremental Truncated LSTD.

Off-policy LSTD(λ)

Yu (2010). Convergence of Least Squares Temporal Difference Methods Under General Conditions.