Zelch, Christoph ; Peters, Jan ; Stryk, Oskar von (2023)
Start State Selection for Control Policy Learning from Optimal Trajectories.
2023 IEEE International Conference on Robotics and Automation. London, United Kingdom (29.05.2023-02.06.2023)
doi: 10.1109/ICRA48891.2023.10160978
Konferenzveröffentlichung, Bibliographie
Kurzbeschreibung (Abstract)
Combination of optimal control methods and machine learning approaches allows to profit from complementary benefits of each field in control of robotic systems. Data from optimal trajectories provides valuable information that can be used to learn a near-optimal state-dependent feedback control policy. To obtain high-quality learning data, careful selection of optimal trajectories, determined by a set of start states, is essential to achieve a good learning performance. In this paper, we extend previous work with new comple-menting strategies to generate start points. These methods complement the existing approach, as they introduce new criteria to identify relevant regions in joint state space that need coverage by new trajectories. It is demonstrated that the extensions significantly improve the overall performance of the previous method in simulation on full nonlinear dynamics model of the industrial Manutec r3 robot arm. Further, it is demonstrated that it suffices to learn a policy that reaches the proximity of the goal state, from where a PI controller can be used for stable control reaching the final system state.
Typ des Eintrags: | Konferenzveröffentlichung |
---|---|
Erschienen: | 2023 |
Autor(en): | Zelch, Christoph ; Peters, Jan ; Stryk, Oskar von |
Art des Eintrags: | Bibliographie |
Titel: | Start State Selection for Control Policy Learning from Optimal Trajectories |
Sprache: | Englisch |
Publikationsjahr: | 4 Juli 2023 |
Verlag: | IEEE |
Buchtitel: | Conference Proceedings - ICRA 2023 |
Veranstaltungstitel: | 2023 IEEE International Conference on Robotics and Automation |
Veranstaltungsort: | London, United Kingdom |
Veranstaltungsdatum: | 29.05.2023-02.06.2023 |
DOI: | 10.1109/ICRA48891.2023.10160978 |
URL / URN: | https://ieeexplore.ieee.org/document/10160978 |
Zugehörige Links: | |
Kurzbeschreibung (Abstract): | Combination of optimal control methods and machine learning approaches allows to profit from complementary benefits of each field in control of robotic systems. Data from optimal trajectories provides valuable information that can be used to learn a near-optimal state-dependent feedback control policy. To obtain high-quality learning data, careful selection of optimal trajectories, determined by a set of start states, is essential to achieve a good learning performance. In this paper, we extend previous work with new comple-menting strategies to generate start points. These methods complement the existing approach, as they introduce new criteria to identify relevant regions in joint state space that need coverage by new trajectories. It is demonstrated that the extensions significantly improve the overall performance of the previous method in simulation on full nonlinear dynamics model of the industrial Manutec r3 robot arm. Further, it is demonstrated that it suffices to learn a policy that reaches the proximity of the goal state, from where a PI controller can be used for stable control reaching the final system state. |
Fachbereich(e)/-gebiet(e): | 20 Fachbereich Informatik 20 Fachbereich Informatik > Simulation, Systemoptimierung und Robotik |
Hinterlegungsdatum: | 05 Sep 2023 11:34 |
Letzte Änderung: | 08 Dez 2023 10:13 |
PPN: | 512665192 |
Export: | |
Suche nach Titel in: | TUfind oder in Google |
Frage zum Eintrag |
Optionen (nur für Redakteure)
Redaktionelle Details anzeigen |