TU Darmstadt / ULB / TUbiblio

Bridging the gap between extractive and abstractive summaries: Creation and evaluation of coherent extracts from heterogeneous sources

Benikova, Darina ; Mieskes, Margot ; Meyer, Christian M. ; Gurevych, Iryna (2016)
Bridging the gap between extractive and abstractive summaries: Creation and evaluation of coherent extracts from heterogeneous sources.
Osaka, Japan
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

In our work, we present a corpus of heterogeneous documents for summarization to address the issue that information seekers usually face a range of different types of information sources. A second issue we address, is the summary type, as most manual summaries are abstractive, whereas automatic methods mainly create extractive summaries, which are hard to compare to each other using standard evaluation methods. Therefore, we suggest a multi-step process for creating \emph{coherent extracts}, which are based on information taken directly from the source documents, but minimally redacted and meaningfully ordered to form a coherent text. Our qualitative and quantitative evaluation show that quantitative results are not sufficient to judge the quality of a summary and that other quality criteria, such as coherence, should also be taken into account. We find that our corpus is of high quality and that it has the potential to bridge the gap between reference corpora of abstracts and automatic methods producing extracts. Our corpus is available to the research community for further development.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2016
Autor(en): Benikova, Darina ; Mieskes, Margot ; Meyer, Christian M. ; Gurevych, Iryna
Art des Eintrags: Bibliographie
Titel: Bridging the gap between extractive and abstractive summaries: Creation and evaluation of coherent extracts from heterogeneous sources
Sprache: Englisch
Publikationsjahr: Dezember 2016
Buchtitel: Proceedings of the 26th International Conference on Computational Linguistics (COLING)
Veranstaltungsort: Osaka, Japan
URL / URN: http://aclweb.org/anthology/C16-1099
Zugehörige Links:
Kurzbeschreibung (Abstract):

In our work, we present a corpus of heterogeneous documents for summarization to address the issue that information seekers usually face a range of different types of information sources. A second issue we address, is the summary type, as most manual summaries are abstractive, whereas automatic methods mainly create extractive summaries, which are hard to compare to each other using standard evaluation methods. Therefore, we suggest a multi-step process for creating \emph{coherent extracts}, which are based on information taken directly from the source documents, but minimally redacted and meaningfully ordered to form a coherent text. Our qualitative and quantitative evaluation show that quantitative results are not sufficient to judge the quality of a summary and that other quality criteria, such as coherence, should also be taken into account. We find that our corpus is of high quality and that it has the potential to bridge the gap between reference corpora of abstracts and automatic methods producing extracts. Our corpus is available to the research community for further development.

Freie Schlagworte: UKP_reviewed;AIPHES_corpus
ID-Nummer: TUD-CS-2016-1445
Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Ubiquitäre Wissensverarbeitung
DFG-Graduiertenkollegs
DFG-Graduiertenkollegs > Graduiertenkolleg 1994 Adaptive Informationsaufbereitung aus heterogenen Quellen
Hinterlegungsdatum: 31 Dez 2016 14:29
Letzte Änderung: 24 Jan 2020 12:03
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen