TU Darmstadt / ULB / TUbiblio

Browse by Person

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: No Grouping | Item Type | Date | Language
Number of items: 1.

Morimura, T. and Uchibe, E. and Yoshimoto, J. and Peters, J. and Doya, K. (2010):
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
In: Neural Computation, 22(2), pp.342-376, [Article]

This list was generated on Sat Jun 6 00:50:00 2020 CEST.