TU Darmstadt / ULB / TUbiblio

An Algorithm for the Detection of Hidden Propaganda in Mixed-Code Text over the Internet

Tundis, Andrea ; Mukherjee, Gaurav ; Mühlhäuser, Max (2021)
An Algorithm for the Detection of Hidden Propaganda in Mixed-Code Text over the Internet.
In: Applied Sciences, 11 (5)
doi: 10.3390/app11052196
Artikel, Bibliographie

Kurzbeschreibung (Abstract)

Internet-based communication systems have become an increasing tool for spreading misinformation and propaganda. Even though there exist mechanisms that are able to track unwarranted information and messages, users made up different ways to avoid their scrutiny and detection. An example is represented by the mixed-code language, that is text written in an unconventional form by combining different languages, symbols, scripts and shapes. It aims to make more difficult the detection of specific content, due to its custom and ever changing appearance, by using special characters to substitute for alphabet letters. Indeed, such substitute combinations of symbols, which tries to resemble the shape of the intended alphabet’s letter, makes it still intuitively readable to humans, however nonsensical to machines. In this context, the paper explores the possibility of identifying propaganda in such mixed-code texts over the Internet, centred on a machine learning based approach. In particular, an algorithm in combination with a deep learning models for character identification is proposed in order to detect and analyse whether an element contains propaganda related content. The overall approach is presented, the results gathered from its experimentation are discussed and the achieved performances are compared with the related works.

Typ des Eintrags: Artikel
Erschienen: 2021
Autor(en): Tundis, Andrea ; Mukherjee, Gaurav ; Mühlhäuser, Max
Art des Eintrags: Bibliographie
Titel: An Algorithm for the Detection of Hidden Propaganda in Mixed-Code Text over the Internet
Sprache: Englisch
Publikationsjahr: 3 März 2021
Verlag: MDPI
Titel der Zeitschrift, Zeitung oder Schriftenreihe: Applied Sciences
Jahrgang/Volume einer Zeitschrift: 11
(Heft-)Nummer: 5
DOI: 10.3390/app11052196
URL / URN: https://www.mdpi.com/2076-3417/11/5/2196
Kurzbeschreibung (Abstract):

Internet-based communication systems have become an increasing tool for spreading misinformation and propaganda. Even though there exist mechanisms that are able to track unwarranted information and messages, users made up different ways to avoid their scrutiny and detection. An example is represented by the mixed-code language, that is text written in an unconventional form by combining different languages, symbols, scripts and shapes. It aims to make more difficult the detection of specific content, due to its custom and ever changing appearance, by using special characters to substitute for alphabet letters. Indeed, such substitute combinations of symbols, which tries to resemble the shape of the intended alphabet’s letter, makes it still intuitively readable to humans, however nonsensical to machines. In this context, the paper explores the possibility of identifying propaganda in such mixed-code texts over the Internet, centred on a machine learning based approach. In particular, an algorithm in combination with a deep learning models for character identification is proposed in order to detect and analyse whether an element contains propaganda related content. The overall approach is presented, the results gathered from its experimentation are discussed and the achieved performances are compared with the related works.

Zusätzliche Informationen:

Art.No.: 2196; This paper is an extended version of the paper published in the 15th International Conference on Availability, Reliability and Security (ARES 2020), Virtual Event, Dublin, Ireland, 25–28 August 2020; Article 76

Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Telekooperation
Hinterlegungsdatum: 26 Mär 2021 08:14
Letzte Änderung: 14 Jun 2021 06:14
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen