Football Table RL

From Control Systems Technology Group

Revision as of 16:21, 11 September 2013 by S041723

Reinforcement Learning

The football table employs on-line value iteration, namely Greedy-GQ$(\lambda)$ and Approximate-Q$(\lambda)$.
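To give a feel for what such an on-line update can look like, the following is a minimal sketch of one Q$(\lambda)$-style step with linear function approximation and accumulating eligibility traces. It is not taken from the football table code. The page does not define Approximate-Q$(\lambda)$, so reading it as Watkins-style Q-learning with traces is an assumption, as are the function name `q_lambda_step`, the feature vectors, and the values of `alpha`, `gamma` and `lam`.

```python
import numpy as np


def q_lambda_step(theta, trace, phi_sa, reward, phi_next_all, done,
                  next_action_greedy=True, alpha=0.1, gamma=0.9, lam=0.8):
    """One hypothetical Q(lambda) update with linear features.

    theta         -- weight vector, so that Q(s, a) = theta @ phi(s, a)
    trace         -- eligibility trace, same shape as theta
    phi_sa        -- feature vector of the state-action pair just taken
    phi_next_all  -- array (n_actions, n_features): features of the next
                     state paired with every available action
    """
    # Accumulate the trace for the state-action pair that was just visited.
    trace = trace + phi_sa

    # Temporal-difference error against the greedy value of the next state.
    q_sa = theta @ phi_sa
    q_next = 0.0 if done else float(np.max(phi_next_all @ theta))
    delta = reward + gamma * q_next - q_sa

    # Move all recently visited features in the direction of the TD error.
    theta = theta + alpha * delta * trace

    # Decay the trace; cut it to zero when the next action is exploratory
    # (the Watkins-style variant assumed here).
    if next_action_greedy:
        trace = gamma * lam * trace
    else:
        trace = np.zeros_like(trace)

    return theta, trace, delta


# Example: two features, two actions, a single rewarded transition.
theta0 = np.zeros(2)
trace0 = np.zeros(2)
theta1, trace1, delta1 = q_lambda_step(
    theta0, trace0,
    phi_sa=np.array([1.0, 0.0]),
    reward=1.0,
    phi_next_all=np.array([[0.0, 1.0], [1.0, 0.0]]),
    done=False,
)
```

Greedy-GQ$(\lambda)$ additionally maintains a second weight vector as a gradient correction, which this sketch leaves out.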

Value Function Approximation
