Browse by Person
![]() | Up a level |
Number of items: 1.
Morimura, Tetsuro and Uchibe, Eiji and Yoshimoto, Junichiro and Peters, Jan and Doya, Kenji (2010):
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
22, In: Neural Computation, (2), MIT Press, pp. 442-376, [Online-Edition: http://www-clmc.usc.edu/publications/M/Morimura_NC_2010.pdf],
[Article]