TU Darmstadt / ULB / TUbiblio

Tag Similarity in Folksonomies

Sergieh, Hatem Mousselly and Egyed-Zsigmond, Elöd and Gianini, Gabriele and Döller, Mario and Kosch, Harald and Pinon, Jean-Marie (2013):
Tag Similarity in Folksonomies.
In: Actes du XXXIème Congrès INFORSID, Paris, France, 29-31 Mai 2013., Paris, France, [Online-Edition: https://liris.cnrs.fr/Documents/Liris-6007.pdf],
[Conference or Workshop Item]

Abstract

Folksonomies - collections of user-contributed tags, proved to be efficient in reducing the inherent semantic gap. However, user tags are noisy; thus, they need to be processed before they can be used by further applications. In this paper, we propose an approach for bootstrapping semantics from folksonomy tags. Our goal is to automatically identify semantically related tags. The approach is based on creating probability distribution for each tag based on co-occurrence statistics. Subsequently, the similarity between two tags is determined by the distance between their corresponding probability distributions. For this purpose, we propose an extension for the well-known Jensen-Shannon Divergence. We compared our approach to a widely used method for identifying similar tags based on the cosine measure. The evaluation shows promising results and emphasizes the advantage of our approach.

Item Type: Conference or Workshop Item
Erschienen: 2013
Creators: Sergieh, Hatem Mousselly and Egyed-Zsigmond, Elöd and Gianini, Gabriele and Döller, Mario and Kosch, Harald and Pinon, Jean-Marie
Title: Tag Similarity in Folksonomies
Language: English
Abstract:

Folksonomies - collections of user-contributed tags, proved to be efficient in reducing the inherent semantic gap. However, user tags are noisy; thus, they need to be processed before they can be used by further applications. In this paper, we propose an approach for bootstrapping semantics from folksonomy tags. Our goal is to automatically identify semantically related tags. The approach is based on creating probability distribution for each tag based on co-occurrence statistics. Subsequently, the similarity between two tags is determined by the distance between their corresponding probability distributions. For this purpose, we propose an extension for the well-known Jensen-Shannon Divergence. We compared our approach to a widely used method for identifying similar tags based on the cosine measure. The evaluation shows promising results and emphasizes the advantage of our approach.

Title of Book: Actes du XXXIème Congrès INFORSID, Paris, France, 29-31 Mai 2013.
Divisions: 20 Department of Computer Science
20 Department of Computer Science > Ubiquitous Knowledge Processing
Event Location: Paris, France
Date Deposited: 31 Dec 2016 14:29
Official URL: https://liris.cnrs.fr/Documents/Liris-6007.pdf
Identification Number: TUD-CS-2013-0453
Export:
Suche nach Titel in: TUfind oder in Google

Optionen (nur für Redakteure)

View Item View Item