Simpson, Edwin ; Reece, Steven ; Roberts, Stephen J. (2017)
Bayesian Heatmaps: Probabilistic Classification with Multiple Unreliable Information Sources.
Skopje, Macedonia
Konferenzveröffentlichung, Bibliographie
Kurzbeschreibung (Abstract)
Unstructured data from diverse sources, such as social media and aerial imagery, can provide valuable up-to-date information for intelligent situation assessment. Mining these different information sources could bring major benefits to applications such as situation awareness in disaster zones and mapping the spread of diseases. Such applications depend on classifying the situation across a region of interest, which can be depicted as a spatial 'heatmap'. Annotating unstructured data using crowdsourcing or automated classifiers produces individual classifications at sparse locations that typically contain many errors. We propose a novel Bayesian approach that models the relevance, error rates and bias of each information source, enabling us to learn a spatial Gaussian Process classifier by aggregating data from multiple sources with varying reliability and relevance. Our method does not require gold-labelled data and can make predictions at any location in an area of interest given only sparse observations. We show empirically that our approach can handle noisy and biased data sources, and that simultaneously inferring reliability and transferring information between neighbouring reports leads to more accurate predictions. We demonstrate our method on two real-world problems from disaster response, showing how our approach reduces the amount of crowdsourced data required and can be used to generate valuable heatmap visualisations from SMS messages and satellite images.
Typ des Eintrags: | Konferenzveröffentlichung |
---|---|
Erschienen: | 2017 |
Autor(en): | Simpson, Edwin ; Reece, Steven ; Roberts, Stephen J. |
Art des Eintrags: | Bibliographie |
Titel: | Bayesian Heatmaps: Probabilistic Classification with Multiple Unreliable Information Sources |
Sprache: | Englisch |
Publikationsjahr: | September 2017 |
Verlag: | Springer |
(Heft-)Nummer: | 10535 |
Buchtitel: | Machine Learning and Knowledge Discovery in Databases (ECML PKDD 2017) |
Reihe: | Lecture Notes in Computer Science |
Band einer Reihe: | Lecture Notes in Computer |
Veranstaltungsort: | Skopje, Macedonia |
URL / URN: | https://link.springer.com/chapter/10.1007/978-3-319-71246-8_... |
Zugehörige Links: | |
Kurzbeschreibung (Abstract): | Unstructured data from diverse sources, such as social media and aerial imagery, can provide valuable up-to-date information for intelligent situation assessment. Mining these different information sources could bring major benefits to applications such as situation awareness in disaster zones and mapping the spread of diseases. Such applications depend on classifying the situation across a region of interest, which can be depicted as a spatial 'heatmap'. Annotating unstructured data using crowdsourcing or automated classifiers produces individual classifications at sparse locations that typically contain many errors. We propose a novel Bayesian approach that models the relevance, error rates and bias of each information source, enabling us to learn a spatial Gaussian Process classifier by aggregating data from multiple sources with varying reliability and relevance. Our method does not require gold-labelled data and can make predictions at any location in an area of interest given only sparse observations. We show empirically that our approach can handle noisy and biased data sources, and that simultaneously inferring reliability and transferring information between neighbouring reports leads to more accurate predictions. We demonstrate our method on two real-world problems from disaster response, showing how our approach reduces the amount of crowdsourced data required and can be used to generate valuable heatmap visualisations from SMS messages and satellite images. |
Freie Schlagworte: | Bayesian machine learning, gaussian process, crowdsourcing, variational |
ID-Nummer: | TUD-CS-2017-0245 |
Fachbereich(e)/-gebiet(e): | 20 Fachbereich Informatik 20 Fachbereich Informatik > Ubiquitäre Wissensverarbeitung |
Hinterlegungsdatum: | 12 Sep 2017 19:22 |
Letzte Änderung: | 14 Sep 2018 15:23 |
PPN: | |
Export: | |
Suche nach Titel in: | TUfind oder in Google |
Frage zum Eintrag |
Optionen (nur für Redakteure)
Redaktionelle Details anzeigen |