TU Darmstadt / ULB / TUbiblio

Temporally consistent sequence-to-sequence translation of cataract surgeries

Frisch, Yannik ; Fuchs, Moritz ; Mukhopadhyay, Anirban (2023)
Temporally consistent sequence-to-sequence translation of cataract surgeries.
In: International Journal of Computer Assisted Radiology and Surgery, 18 (7)
doi: 10.1007/s11548-023-02925-y
Artikel, Bibliographie

Kurzbeschreibung (Abstract)

Purpose: Image-to-image translation methods can address the lack of diversity in publicly available cataract surgery data. However, applying image-to-image translation to videos—which are frequently used in medical downstream applications— induces artifacts. Additional spatio-temporal constraints are needed to produce realistic translations and improve the temporal consistency of translated image sequences. Methods We introduce amotion-translationmodule that translates optical flows between domains to impose such constraints. We combine it with a shared latent space translation model to improve image quality. Evaluations are conducted regarding translated sequences’ image quality and temporal consistency, where we propose novel quantitative metrics for the latter. Finally, the downstream task of surgical phase classification is evaluated when retraining it with additional synthetic translated data. Results Our proposed method produces more consistent translations than state-of-the-art baselines. Moreover, it stays competitive in terms of the per-image translation quality. We further show the benefit of consistently translated cataract surgery sequences for improving the downstream task of surgical phase prediction. Conclusion The proposed module increases the temporal consistency of translated sequences. Furthermore, imposed temporal constraints increase the usability of translated data in downstream tasks. This allows overcoming some of the hurdles of surgical data acquisition and annotation and enables improving models’ performance by translating between existing datasets of sequential frames.

Typ des Eintrags: Artikel
Erschienen: 2023
Autor(en): Frisch, Yannik ; Fuchs, Moritz ; Mukhopadhyay, Anirban
Art des Eintrags: Bibliographie
Titel: Temporally consistent sequence-to-sequence translation of cataract surgeries
Sprache: Englisch
Publikationsjahr: 1 Juli 2023
Verlag: Springer
Titel der Zeitschrift, Zeitung oder Schriftenreihe: International Journal of Computer Assisted Radiology and Surgery
Jahrgang/Volume einer Zeitschrift: 18
(Heft-)Nummer: 7
DOI: 10.1007/s11548-023-02925-y
Kurzbeschreibung (Abstract):

Purpose: Image-to-image translation methods can address the lack of diversity in publicly available cataract surgery data. However, applying image-to-image translation to videos—which are frequently used in medical downstream applications— induces artifacts. Additional spatio-temporal constraints are needed to produce realistic translations and improve the temporal consistency of translated image sequences. Methods We introduce amotion-translationmodule that translates optical flows between domains to impose such constraints. We combine it with a shared latent space translation model to improve image quality. Evaluations are conducted regarding translated sequences’ image quality and temporal consistency, where we propose novel quantitative metrics for the latter. Finally, the downstream task of surgical phase classification is evaluated when retraining it with additional synthetic translated data. Results Our proposed method produces more consistent translations than state-of-the-art baselines. Moreover, it stays competitive in terms of the per-image translation quality. We further show the benefit of consistently translated cataract surgery sequences for improving the downstream task of surgical phase prediction. Conclusion The proposed module increases the temporal consistency of translated sequences. Furthermore, imposed temporal constraints increase the usability of translated data in downstream tasks. This allows overcoming some of the hurdles of surgical data acquisition and annotation and enables improving models’ performance by translating between existing datasets of sequential frames.

Freie Schlagworte: Cataract surgery, Unsupervised image translation, Sequence translation, Temporal consistency, Generative adversarial networks, Generative models
Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Graphisch-Interaktive Systeme
Hinterlegungsdatum: 27 Nov 2023 14:34
Letzte Änderung: 29 Jan 2024 09:45
PPN: 515105511
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen