TU Darmstadt / ULB / TUbiblio

Correlation Priors for Reinforcement Learning

Alt, B. ; Šošić, A. ; Koeppl, H. (2019)
Correlation Priors for Reinforcement Learning.
33rd Conference on Neural Information Processing Systems (NeurIPS 2019). Vancouver, Kanada (09.12.-13.12.2019)
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

Many decision-making problems naturally exhibit pronounced structures inherited from the characteristics of the underlying environment. In a Markov decision pro- cess model, for example, two distinct states can have inherently related semantics or encode resembling physical state configurations. This often implies locally cor- related transition dynamics among the states. In order to complete a certain task in such environments, the operating agent usually needs to execute a series of tempo- rally and spatially correlated actions. Though there exists a variety of approaches to capture these correlations in continuous state-action domains, a principled solution for discrete environments is missing. In this work, we present a Bayesian learning framework based on Pólya-Gamma augmentation that enables an analogous reasoning in such cases. We demonstrate the framework on a number of common decision-making related problems, such as imitation learning, subgoal extraction, system identification and Bayesian reinforcement learning. By explicitly modeling the underlying correlation structures of these problems, the proposed approach yields superior predictive performance compared to correlation-agnostic models, even when trained on data sets that are an order of magnitude smaller in size.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2019
Autor(en): Alt, B. ; Šošić, A. ; Koeppl, H.
Art des Eintrags: Bibliographie
Titel: Correlation Priors for Reinforcement Learning
Sprache: Englisch
Publikationsjahr: 3 September 2019
Veranstaltungstitel: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)
Veranstaltungsort: Vancouver, Kanada
Veranstaltungsdatum: 09.12.-13.12.2019
URL / URN: https://papers.nips.cc/paper/9564-correlation-priors-for-rei...
Kurzbeschreibung (Abstract):

Many decision-making problems naturally exhibit pronounced structures inherited from the characteristics of the underlying environment. In a Markov decision pro- cess model, for example, two distinct states can have inherently related semantics or encode resembling physical state configurations. This often implies locally cor- related transition dynamics among the states. In order to complete a certain task in such environments, the operating agent usually needs to execute a series of tempo- rally and spatially correlated actions. Though there exists a variety of approaches to capture these correlations in continuous state-action domains, a principled solution for discrete environments is missing. In this work, we present a Bayesian learning framework based on Pólya-Gamma augmentation that enables an analogous reasoning in such cases. We demonstrate the framework on a number of common decision-making related problems, such as imitation learning, subgoal extraction, system identification and Bayesian reinforcement learning. By explicitly modeling the underlying correlation structures of these problems, the proposed approach yields superior predictive performance compared to correlation-agnostic models, even when trained on data sets that are an order of magnitude smaller in size.

Fachbereich(e)/-gebiet(e): 18 Fachbereich Elektrotechnik und Informationstechnik
18 Fachbereich Elektrotechnik und Informationstechnik > Institut für Nachrichtentechnik > Bioinspirierte Kommunikationssysteme
18 Fachbereich Elektrotechnik und Informationstechnik > Institut für Nachrichtentechnik
DFG-Sonderforschungsbereiche (inkl. Transregio)
DFG-Sonderforschungsbereiche (inkl. Transregio) > Sonderforschungsbereiche
Zentrale Einrichtungen
Zentrale Einrichtungen > Centre for Cognitive Science (CCS)
DFG-Sonderforschungsbereiche (inkl. Transregio) > Sonderforschungsbereiche > SFB 1053: MAKI – Multi-Mechanismen-Adaption für das künftige Internet
DFG-Sonderforschungsbereiche (inkl. Transregio) > Sonderforschungsbereiche > SFB 1053: MAKI – Multi-Mechanismen-Adaption für das künftige Internet > B: Adaptionsmechanismen
DFG-Sonderforschungsbereiche (inkl. Transregio) > Sonderforschungsbereiche > SFB 1053: MAKI – Multi-Mechanismen-Adaption für das künftige Internet > B: Adaptionsmechanismen > Teilprojekt B4: Planung
Hinterlegungsdatum: 15 Okt 2019 05:48
Letzte Änderung: 23 Sep 2021 14:30
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen