TU Darmstadt / ULB / TUbiblio

Browse by Person

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: No Grouping | Item Type | Date | Language
Number of items: 1.

Morimura, Tetsuro ; Uchibe, Eiji ; Yoshimoto, Junichiro ; Peters, Jan ; Doya, Kenji (2010):
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
In: Neural Computation, 22 (2), pp. 442-376. MIT Press, [Article]

This list was generated on Sat Sep 24 02:00:14 2022 CEST.