Sergieh, Hatem Mousselly ; Egyed-Zsigmond, Elöd ; Gianini, Gabriele ; Döller, Mario ; Kosch, Harald ; Pinon, Jean-Marie (2013):
Tag Similarity in Folksonomies.
In: Actes du XXXIème Congrès INFORSID, Paris, France, 29-31 Mai 2013., pp. 319-334,
Paris, France, [Conference or Workshop Item]
Abstract
Folksonomies - collections of user-contributed tags, proved to be efficient in reducing the inherent semantic gap. However, user tags are noisy; thus, they need to be processed before they can be used by further applications. In this paper, we propose an approach for bootstrapping semantics from folksonomy tags. Our goal is to automatically identify semantically related tags. The approach is based on creating probability distribution for each tag based on co-occurrence statistics. Subsequently, the similarity between two tags is determined by the distance between their corresponding probability distributions. For this purpose, we propose an extension for the well-known Jensen-Shannon Divergence. We compared our approach to a widely used method for identifying similar tags based on the cosine measure. The evaluation shows promising results and emphasizes the advantage of our approach.
Item Type: | Conference or Workshop Item |
---|---|
Erschienen: | 2013 |
Creators: | Sergieh, Hatem Mousselly ; Egyed-Zsigmond, Elöd ; Gianini, Gabriele ; Döller, Mario ; Kosch, Harald ; Pinon, Jean-Marie |
Title: | Tag Similarity in Folksonomies |
Language: | English |
Abstract: | Folksonomies - collections of user-contributed tags, proved to be efficient in reducing the inherent semantic gap. However, user tags are noisy; thus, they need to be processed before they can be used by further applications. In this paper, we propose an approach for bootstrapping semantics from folksonomy tags. Our goal is to automatically identify semantically related tags. The approach is based on creating probability distribution for each tag based on co-occurrence statistics. Subsequently, the similarity between two tags is determined by the distance between their corresponding probability distributions. For this purpose, we propose an extension for the well-known Jensen-Shannon Divergence. We compared our approach to a widely used method for identifying similar tags based on the cosine measure. The evaluation shows promising results and emphasizes the advantage of our approach. |
Book Title: | Actes du XXXIème Congrès INFORSID, Paris, France, 29-31 Mai 2013. |
Divisions: | 20 Department of Computer Science 20 Department of Computer Science > Ubiquitous Knowledge Processing |
Event Location: | Paris, France |
Date Deposited: | 31 Dec 2016 14:29 |
URL / URN: | https://liris.cnrs.fr/Documents/Liris-6007.pdf |
Identification Number: | TUD-CS-2013-0453 |
PPN: | |
Export: | |
Suche nach Titel in: | TUfind oder in Google |
![]() |
Send an inquiry |
Options (only for editors)
![]() |
Show editorial Details |