Browse by Person

Number of items: 2.

Parmas, P. ; Doya, K. ; Rasmussen, C. ; Peters, J. (2018):
PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos.
In: International Conference on Machine Learning, [Article]

Morimura, T. ; Uchibe, E. ; Yoshimoto, J. ; Peters, J. ; Doya, K. (2010):
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning.
In: Neural Computation, 22(2), pp. 342-376, doi: 10.1162/neco.2009.12-08-922, [Article]

This list was generated on Sat Sep 25 04:07:09 2021 CEST.