The InsightsNet Climate Change Corpus (ICCC)

Volkanovska, Elena ; Tan, Sherry ; Duan, Changxu ; Bartsch, Sabine ; Stille, Wolfgang (2023)
The InsightsNet Climate Change Corpus (ICCC).
In: Datenbank-Spektrum, 23 (3)
doi: 10.1007/s13222-023-00454-1
Article, Bibliography

Abstract

The discourse on climate change has become a centerpiece of public debate, thereby creating a pressing need to analyze the multitude of messages created by the participants in this communication process. In addition to text, information on this topic is conveyed multimodally, through images, videos, tables and other data objects that are embedded within documents and accompany the text. This paper presents the process of building a multimodal pilot corpus to the InsightsNet Climate Change Corpus (ICCC) and using natural language processing (NLP) tools to enrich corpus (meta)data, thus creating a dataset that lends itself to the exploration of the interplay between the various modalities that constitute the discourse on climate change. We demonstrate how the pilot corpus can be queried for relevant information in two types of databases, and how the proposed data model promotes a more comprehensive sentiment analysis approach.

Item type: Article
Published: 2023
Author(s): Volkanovska, Elena ; Tan, Sherry ; Duan, Changxu ; Bartsch, Sabine ; Stille, Wolfgang
Entry type: Bibliography
Title: The InsightsNet Climate Change Corpus (ICCC)
Language: German
Date of publication: 11 September 2023
Publisher: Springer Nature
Journal, newspaper, or series title: Datenbank-Spektrum
Journal volume: 23
Issue number: 3
DOI: 10.1007/s13222-023-00454-1

Department(s)/field(s): Central Facilities
Central Facilities > hessian.AI - Hessian Center for Artificial Intelligence
Date deposited: 14 Sep 2023 13:56
Last modified: 14 Sep 2023 13:56