Civil Engineering Reference
In-Depth Information
Whiteson, S., Tanner, B., Taylor, M. E., & Stone, P. (2009). Generalized domains for empirical
evaluations in reinforcement learning. In
Proceedings of the 26th International Conference on
Machine Learning: Workshop on Evaluation Methods for Machine Learning, Montreal, Canada,
14-18 June
. Retrieved from http://www.site.uottawa.ca/ICML09WS/papers/w8.pdf
Whiteson, S., Tanner, B., Taylor, M. E., & Stone, P. (2011). Protecting against evaluation overfit-
ting in empirical reinforcement learning. In
Proceedings of the IEEE Symposium on Adaptive
Dynamic Programming and Reinforcement Learning (ADPRL), Paris, France, 11-15 April
(pp. 120-127). doi: 10.1109/ADPRL.2011.5967363
Wiering, M. A. (1995).
TD learning of game evaluation functions with hierarchical neural architec-
tures
. Unpublished masters thesis, Department of Computer Science, University of Amsterdam,
Amsterdam, Netherlands.
Wiering, M. A., Patist, J. P., & Mannen, H. (2007).
Learning to play board games using
temporal difference methods
(Technical Report UU-CS-2005-048, Institute of Informa-
tion and Computing Sciences, Utrecht University). Retrieved from
http://www.ai.rug.nl/
~
Search WWH ::
Custom Search