TU Darmstadt / ULB / TUbiblio

Start State Selection for Control Policy Learning from Optimal Trajectories

Zelch, Christoph ; Peters, Jan ; Stryk, Oskar von (2023)
Start State Selection for Control Policy Learning from Optimal Trajectories.
2023 IEEE International Conference on Robotics and Automation. London, United Kingdom (29.05.2023-02.06.2023)
doi: 10.1109/ICRA48891.2023.10160978
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

Combination of optimal control methods and machine learning approaches allows to profit from complementary benefits of each field in control of robotic systems. Data from optimal trajectories provides valuable information that can be used to learn a near-optimal state-dependent feedback control policy. To obtain high-quality learning data, careful selection of optimal trajectories, determined by a set of start states, is essential to achieve a good learning performance. In this paper, we extend previous work with new comple-menting strategies to generate start points. These methods complement the existing approach, as they introduce new criteria to identify relevant regions in joint state space that need coverage by new trajectories. It is demonstrated that the extensions significantly improve the overall performance of the previous method in simulation on full nonlinear dynamics model of the industrial Manutec r3 robot arm. Further, it is demonstrated that it suffices to learn a policy that reaches the proximity of the goal state, from where a PI controller can be used for stable control reaching the final system state.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2023
Autor(en): Zelch, Christoph ; Peters, Jan ; Stryk, Oskar von
Art des Eintrags: Bibliographie
Titel: Start State Selection for Control Policy Learning from Optimal Trajectories
Sprache: Englisch
Publikationsjahr: 4 Juli 2023
Verlag: IEEE
Buchtitel: Conference Proceedings - ICRA 2023
Veranstaltungstitel: 2023 IEEE International Conference on Robotics and Automation
Veranstaltungsort: London, United Kingdom
Veranstaltungsdatum: 29.05.2023-02.06.2023
DOI: 10.1109/ICRA48891.2023.10160978
URL / URN: https://ieeexplore.ieee.org/document/10160978
Zugehörige Links:
Kurzbeschreibung (Abstract):

Combination of optimal control methods and machine learning approaches allows to profit from complementary benefits of each field in control of robotic systems. Data from optimal trajectories provides valuable information that can be used to learn a near-optimal state-dependent feedback control policy. To obtain high-quality learning data, careful selection of optimal trajectories, determined by a set of start states, is essential to achieve a good learning performance. In this paper, we extend previous work with new comple-menting strategies to generate start points. These methods complement the existing approach, as they introduce new criteria to identify relevant regions in joint state space that need coverage by new trajectories. It is demonstrated that the extensions significantly improve the overall performance of the previous method in simulation on full nonlinear dynamics model of the industrial Manutec r3 robot arm. Further, it is demonstrated that it suffices to learn a policy that reaches the proximity of the goal state, from where a PI controller can be used for stable control reaching the final system state.

Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Simulation, Systemoptimierung und Robotik
Hinterlegungsdatum: 05 Sep 2023 11:34
Letzte Änderung: 08 Dez 2023 10:13
PPN: 512665192
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen