Football Table RL: Difference between revisions

Revision as of 16:43, 11 September 2013

Reinforcement Learning

The football table employs on-line value iteration, namely Greedy-GQ $[math]\displaystyle{ (\lambda) }[/math]$ and Approximate-Q $[math]\displaystyle{ (\lambda) }[/math]$ . This page does not explain Reinforcement learning theory, it just touches on the usage and implementation of the provided library (libvfa located on the SVN). Too get a basic understanding of Reinforcement Learning i suggest reading the book by Sutton & Barto ^[1]. For a slightly more in-depth, hands-on, book on using RL with function approximation, i suggest the book by Lucian Busoniu (TU Delft) et al. ^[2], which is freely available as e-book from within the TU/e network. Unfortunately both books very different notations, here we use the notation from the former.

↑ Reinforcement Learning: an introduction, Sutton & Barto, http://webdocs.cs.ualberta.ca/~sutton/book/the-book.html
↑ Reinforcement Learning and Dynamic Programming using Function Approximation, Lucian Busoniu , Robert Babuska , Bart De Schutter & Damien Ernst, 2010, http://www.crcnetbase.com/isbn/9781439821091

Value Function Approximation

[1] Reinforcement Learning: an introduction, Sutton & Barto, http://webdocs.cs.ualberta.ca/~sutton/book/the-book.html

[2] Reinforcement Learning and Dynamic Programming using Function Approximation, Lucian Busoniu , Robert Babuska , Bart De Schutter & Damien Ernst, 2010, http://www.crcnetbase.com/isbn/9781439821091

[1]

[2]

@@ Line 1: / Line 1: @@
 ==Reinforcement Learning==
-<p>The football table employs on-line value iteration, namely Greedy-GQ<math>(\lambda)</math> and Approximate-Q<math>(\lambda)</math>. This page does not explain Reinforcement learning theory, it just touches on the usage and implementation of the provided library (libvfa located on the SVN). Too get a basic understanding of Reinforcement Learning i suggest reading the book by Sutton & Barto <ref>''Reinforcement Learning: an introduction'', Sutton & Barto, http://webdocs.cs.ualberta.ca/~sutton/book/the-book.html </ref>. For a slightly more in-depth, hands-on, book on using RL with function approximation, i suggest the book by Lucian Busoniu (TU Delft) et al. <ref>''Reinforcement Learning and Dynamic Programming using Function Approximation'', Lucian Busoniu , Robert Babuska , Bart De Schutter & Damien Ernst, 2010, http://www.crcnetbase.com/isbn/9781439821091</ref>, which is freely available as e-book from within the TU/e network.</p>
+<p>The football table employs on-line value iteration, namely Greedy-GQ<math>(\lambda)</math> and Approximate-Q<math>(\lambda)</math>. This page does not explain Reinforcement learning theory, it just touches on the usage and implementation of the provided library (libvfa located on the SVN). Too get a basic understanding of Reinforcement Learning i suggest reading the book by Sutton & Barto <ref>''Reinforcement Learning: an introduction'', Sutton & Barto, http://webdocs.cs.ualberta.ca/~sutton/book/the-book.html </ref>. For a slightly more in-depth, hands-on, book on using RL with function approximation, i suggest the book by Lucian Busoniu (TU Delft) et al. <ref>''Reinforcement Learning and Dynamic Programming using Function Approximation'', Lucian Busoniu , Robert Babuska , Bart De Schutter & Damien Ernst, 2010, http://www.crcnetbase.com/isbn/9781439821091</ref>, which is freely available as e-book from within the TU/e network. Unfortunately both books very different notations, here we use the notation from the former.</p>
 <references/>
 ==Value Function Approximation==

Football Table RL: Difference between revisions

Revision as of 16:43, 11 September 2013

Reinforcement Learning

Value Function Approximation

Navigation menu

Search