Civil Engineering Reference
In-Depth Information
Wierstra, D., Förster, A., Peters, J., & Schmidhuber, J. (2010). Recurrent policy gradients.
Logic
Journal of the IGPL
, 18(5), 620-634.
Yamada, K. (2011). Network parameter setting for reinforcement learning approaches using neural
networks.
Journal of Advanced Computational Intelligence and Intelligent Informatics
, 15(7),
822-830.
Yan, X., Diaconis, P., Rusmevichientong, P., & Roy, B. V. (2004). Solitaire: Man versus machine.
In
Advances in Neural Information Processing Systems 17
(pp. 1553-1560). Cambridge, MA:
MIT Press.
Yoshioka, T., Ishii, S., and Ito, M. (1999). Strategy acquisition for the game 'Othello' based on
reinforcement learning.
IEICE Transactions on Information and Systems
, E82-D(12), 1618-
1626.
Search WWH ::
Custom Search