Introduction - Design of Experiments for Reinforcement Learning

Civil Engineering Reference

In-Depth Information

Whiteson, S., Tanner, B., Taylor, M. E., & Stone, P. (2009). Generalized domains for empirical

evaluations in reinforcement learning. In Proceedings of the 26th International Conference on

Machine Learning: Workshop on Evaluation Methods for Machine Learning, Montreal, Canada,

14-18 June . Retrieved from http://www.site.uottawa.ca/ICML09WS/papers/w8.pdf

Whiteson, S., Tanner, B., Taylor, M. E., & Stone, P. (2011). Protecting against evaluation overfit-

ting in empirical reinforcement learning. In Proceedings of the IEEE Symposium on Adaptive

Dynamic Programming and Reinforcement Learning (ADPRL), Paris, France, 11-15 April

(pp. 120-127). doi: 10.1109/ADPRL.2011.5967363

Wiering, M. A. (1995). TD learning of game evaluation functions with hierarchical neural architec-

tures . Unpublished masters thesis, Department of Computer Science, University of Amsterdam,

Amsterdam, Netherlands.

Wiering, M. A., Patist, J. P., & Mannen, H. (2007). Learning to play board games using

temporal difference methods (Technical Report UU-CS-2005-048, Institute of Informa-

tion and Computing Sciences, Utrecht University). Retrieved from http://www.ai.rug.nl/ ~

Search WWH ::

Custom Search

Home