TU Darmstadt / ULB / TUbiblio

Automatic Analysis of Arguments about Controversial Educational Topics in Web Documents

Kluge, Roland (2014)
Automatic Analysis of Arguments about Controversial Educational Topics in Web Documents.
Technische Universität Darmstadt
Masterarbeit, Bibliographie

Kurzbeschreibung (Abstract)

Decision making in social communities, such as families, companies, or parties, builds on debates and discussions, where arguments on particular topics are exchanged. With this work, we contribute to the efforts in automatically processing arguments for decision making, which is embedded in the field of Argumentation Mining. Since only few corpora for Argumentation Mining exist, we first built a corpus of argumentative German Web documents, containing 79 documents from 7 educational topics, which were annotated by 3 annotators according to the claim-premise argumentation model. The corpus comprises 70,000 tokens, annotated with 5,000 argument units, i.e., claims and premises. We found that the annotators performed similarly with regard to surface statistics such as the distribution of argument unit types or lengths. Each annotator’s annotations cover on average ca. 74 of the tokens, which indicates the argumentative nature of the dataset. The inter-annotator agreement evaluates to ca. 44% for Fleiss’ and to ca. 40% for Krippendorff’s unitized alpha. We found that agreement correlates slightly negatively with annotation time demand per document. Finally, we present a number of experiments on the role of 360 discourse markers for discriminating claims from premises. Our results show that several intensifying discourse particles are distinctive for claims and premises. Furthermore, we confirmed expectations from the literature that the discourse relation concession introduce counter-arguments. The discourse relations comparison/contrast and result frequently indicate claims, while the discourse relations alternative, reason, and sequence tend to indicate premises. Another experiment investigated the role of discourse markers as features for Machine Learning. Using a Naïve Bayes classifier, we found that discourse markers as sole features for discriminating claims and premises yielded an improvement of 13 percentage points over the majority class baseline.

Typ des Eintrags: Masterarbeit
Erschienen: 2014
Autor(en): Kluge, Roland
Art des Eintrags: Bibliographie
Titel: Automatic Analysis of Arguments about Controversial Educational Topics in Web Documents
Sprache: Englisch
Referenten: Eckle-Kohler, Judith ; Gurevych, Iryna
Publikationsjahr: April 2014
URL / URN: https://download.hrz.tu-darmstadt.de/media/FB20/Dekanat/Publ...
Kurzbeschreibung (Abstract):

Decision making in social communities, such as families, companies, or parties, builds on debates and discussions, where arguments on particular topics are exchanged. With this work, we contribute to the efforts in automatically processing arguments for decision making, which is embedded in the field of Argumentation Mining. Since only few corpora for Argumentation Mining exist, we first built a corpus of argumentative German Web documents, containing 79 documents from 7 educational topics, which were annotated by 3 annotators according to the claim-premise argumentation model. The corpus comprises 70,000 tokens, annotated with 5,000 argument units, i.e., claims and premises. We found that the annotators performed similarly with regard to surface statistics such as the distribution of argument unit types or lengths. Each annotator’s annotations cover on average ca. 74 of the tokens, which indicates the argumentative nature of the dataset. The inter-annotator agreement evaluates to ca. 44% for Fleiss’ and to ca. 40% for Krippendorff’s unitized alpha. We found that agreement correlates slightly negatively with annotation time demand per document. Finally, we present a number of experiments on the role of 360 discourse markers for discriminating claims from premises. Our results show that several intensifying discourse particles are distinctive for claims and premises. Furthermore, we confirmed expectations from the literature that the discourse relation concession introduce counter-arguments. The discourse relations comparison/contrast and result frequently indicate claims, while the discourse relations alternative, reason, and sequence tend to indicate premises. Another experiment investigated the role of discourse markers as features for Machine Learning. Using a Naïve Bayes classifier, we found that discourse markers as sole features for discriminating claims and premises yielded an improvement of 13 percentage points over the majority class baseline.

Freie Schlagworte: Knowledge Discovery in Scientific Literature;UKP_a_ENLP;UKP_a_WALL
ID-Nummer: TUD-CS-2014-0080
Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Ubiquitäre Wissensverarbeitung
Hinterlegungsdatum: 31 Dez 2016 14:29
Letzte Änderung: 21 Sep 2018 06:56
PPN:
Referenten: Eckle-Kohler, Judith ; Gurevych, Iryna
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen