Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

Morimura, T. and Uchibe, E. and Yoshimoto, J. and Peters, J. and Doya, K. (2010):
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
In: Neural Computation, 22(2), pp. 342-376. [Article]

Item Type: Article
Published: 2010
Creators: Morimura, T. and Uchibe, E. and Yoshimoto, J. and Peters, J. and Doya, K.
Title: Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Language: English
Journal or Publication Title: Neural Computation, 22(2), pp. 342-376
DOI: 10.1162/neco.2009.12-08-922
Divisions: 20 Department of Computer Science
20 Department of Computer Science > Intelligent Autonomous Systems
Date Deposited: 29 Nov 2011 13:57
Additional Information:

Intelligent Autonomous Systems
