Mean squared advantage minimization as a consequence of entropic policy improvement regularization

Belousov, B. and Peters, J. (2018):
Mean squared advantage minimization as a consequence of entropic policy improvement regularization.
In: The 14th European Workshop on Reinforcement Learning (EWRL 2018), Lille, France, October 1-3, 2018, [Online-Edition: https://www.ias.informatik.tu-darmstadt.de/uploads/Team/Bori...],
[Conference or Workshop Item]

Item Type: Conference or Workshop Item
Published: 2018
Creators: Belousov, B. and Peters, J.
Title: Mean squared advantage minimization as a consequence of entropic policy improvement regularization
Language: English
Divisions: 20 Department of Computer Science
20 Department of Computer Science > Intelligent Autonomous Systems
Event Title: The 14th European Workshop on Reinforcement Learning (EWRL 2018)
Event Location: Lille, France
Event Dates: October 1-3, 2018
Date Deposited: 31 Oct 2018 09:34
Official URL: https://www.ias.informatik.tu-darmstadt.de/uploads/Team/Bori...