Yimam, Seid Muhie ; Biemann, Chris ; Majnaric, Ljiljana ; Šabanović, Šefket (2016)
An adaptive annotation approach for biomedical entity and relation recognition. Brain Informatics.
In: Brain Informatics
doi: 10.1007/s40708-016-0036-4
Artikel, Bibliographie
Kurzbeschreibung (Abstract)
In this article, we demonstrate the impact of interactive machine learning: we develop biomedical entity recognition dataset using a human-into-the-loop approach. In contrary to classical machine learning, human-in-the-loop approaches do not operate on predefined training or test sets, but assume that human input regarding system improvement is supplied iteratively. Here, during annotation, a machine learning model is built on previous annotations and used to propose labels for subsequent annotation. To demonstrate that such interactive and iterative annotation speeds up the development of quality dataset annotation, we conduct three experiments. In the first experiment, we carry out an iterative annotation experimental simulation and show that only a handful of medical abstracts need to be annotated to produce suggestions that increase annotation speed. In the second experiment, clinical doctors have conducted a case study in annotating medical terms documents relevant for their research. The third experiment explores the annotation of semantic relations with relation instance learning across documents. The experiments validate our method qualitatively and quantitatively, and give rise to a more personalized, responsive information extraction technology.
Typ des Eintrags: | Artikel |
---|---|
Erschienen: | 2016 |
Autor(en): | Yimam, Seid Muhie ; Biemann, Chris ; Majnaric, Ljiljana ; Šabanović, Šefket |
Art des Eintrags: | Bibliographie |
Titel: | An adaptive annotation approach for biomedical entity and relation recognition. Brain Informatics |
Sprache: | Deutsch |
Publikationsjahr: | Februar 2016 |
Titel der Zeitschrift, Zeitung oder Schriftenreihe: | Brain Informatics |
DOI: | 10.1007/s40708-016-0036-4 |
Zugehörige Links: | |
Kurzbeschreibung (Abstract): | In this article, we demonstrate the impact of interactive machine learning: we develop biomedical entity recognition dataset using a human-into-the-loop approach. In contrary to classical machine learning, human-in-the-loop approaches do not operate on predefined training or test sets, but assume that human input regarding system improvement is supplied iteratively. Here, during annotation, a machine learning model is built on previous annotations and used to propose labels for subsequent annotation. To demonstrate that such interactive and iterative annotation speeds up the development of quality dataset annotation, we conduct three experiments. In the first experiment, we carry out an iterative annotation experimental simulation and show that only a handful of medical abstracts need to be annotated to produce suggestions that increase annotation speed. In the second experiment, clinical doctors have conducted a case study in annotating medical terms documents relevant for their research. The third experiment explores the annotation of semantic relations with relation instance learning across documents. The experiments validate our method qualitatively and quantitatively, and give rise to a more personalized, responsive information extraction technology. |
Freie Schlagworte: | Interactive annotation Machine learning Knowledge discovery Data mining Human in the loop Biomedical entity recognition Relation learning |
ID-Nummer: | TUD-CS-2016-0059 |
Fachbereich(e)/-gebiet(e): | 20 Fachbereich Informatik > Sprachtechnologie 20 Fachbereich Informatik |
Hinterlegungsdatum: | 31 Dez 2016 09:42 |
Letzte Änderung: | 30 Mai 2018 12:51 |
PPN: | |
Export: | |
Suche nach Titel in: | TUfind oder in Google |
Frage zum Eintrag |
Optionen (nur für Redakteure)
Redaktionelle Details anzeigen |