TU Darmstadt / ULB / TUbiblio

Browse by Person

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: No Grouping | Item Type | Date | Language
Number of items: 1.

Morimura, Tetsuro and Uchibe, Eiji and Yoshimoto, Junichiro and Peters, Jan and Doya, Kenji (2010):
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
In: Neural Computation, MIT Press, pp. 442-376, 22, (2), [Online-Edition: http://www-clmc.usc.edu/publications/M/Morimura_NC_2010.pdf],
[Article]

This list was generated on Tue Aug 20 01:51:27 2019 CEST.