TU Darmstadt
ULB
TUbiblio
Browse by Person
Number of items: 1.
Hoffman, Matthew ; Freitas, Nando de ; Doucet, Arnaud ; Peters, Jan (2009)
An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Reward.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS).
Conference or Workshop Item, Bibliography