TU Darmstadt / ULB / TUbiblio

Browse by Person

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: No Grouping | Item Type | Date | Language
Number of items: 1.

Hoffman, Matthew and de Freitas, Nando and Doucet, Arnaud and Peters, Jan (2009):
An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Reward.
5, In: Proceedings of Machine Learning Research, In: Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics (AIStats), pp. 232-239, [Online-Edition: http://robot-learning.de/uploads/Publications/AIStats2009-Ho...],
[Conference or Workshop Item]

This list was generated on Tue Dec 10 01:32:41 2019 CET.