TU Darmstadt / ULB / TUbiblio

Browse by Person

Number of items: 1.

Morimura, T. ; Uchibe, E. ; Yoshimoto, J. ; Peters, J. ; Doya, K. (2010):
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
In: Neural Computation, 22(2), pp. 342-376, doi: 10.1162/neco.2009.12-08-922, [Article]

This list was generated on Sat Sep 24 02:31:21 2022 CEST.