Civil Engineering Reference
In-Depth Information
Whiteson, S., Tanner, B., Taylor, M. E., & Stone, P. (2009). Generalized domains for empirical
evaluations in reinforcement learning. In Proceedings of the 26th International Conference on
Machine Learning: Workshop on Evaluation Methods for Machine Learning, Montreal, Canada,
14-18 June . Retrieved from http://www.site.uottawa.ca/ICML09WS/papers/w8.pdf
Whiteson, S., Tanner, B., Taylor, M. E., & Stone, P. (2011). Protecting against evaluation overfit-
ting in empirical reinforcement learning. In Proceedings of the IEEE Symposium on Adaptive
Dynamic Programming and Reinforcement Learning (ADPRL), Paris, France, 11-15 April
(pp. 120-127). doi: 10.1109/ADPRL.2011.5967363
Wiering, M. A. (1995). TD learning of game evaluation functions with hierarchical neural architec-
tures . Unpublished masters thesis, Department of Computer Science, University of Amsterdam,
Amsterdam, Netherlands.
Wiering, M. A., Patist, J. P., & Mannen, H. (2007). Learning to play board games using
temporal difference methods (Technical Report UU-CS-2005-048, Institute of Informa-
tion and Computing Sciences, Utrecht University). Retrieved from http://www.ai.rug.nl/ ~
mwiering/GROUP/ARTICLES/learning_games_TR.pdf.
Search WWH ::




Custom Search