Civil Engineering Reference
In-Depth Information
Wierstra, D., Förster, A., Peters, J., & Schmidhuber, J. (2010). Recurrent policy gradients. Logic
Journal of the IGPL , 18(5), 620-634.
Yamada, K. (2011). Network parameter setting for reinforcement learning approaches using neural
networks. Journal of Advanced Computational Intelligence and Intelligent Informatics , 15(7),
822-830.
Yan, X., Diaconis, P., Rusmevichientong, P., & Roy, B. V. (2004). Solitaire: Man versus machine.
In Advances in Neural Information Processing Systems 17 (pp. 1553-1560). Cambridge, MA:
MIT Press.
Yoshioka, T., Ishii, S., and Ito, M. (1999). Strategy acquisition for the game 'Othello' based on
reinforcement learning. IEICE Transactions on Information and Systems , E82-D(12), 1618-
1626.
Search WWH ::




Custom Search