Bormann, Pascal ; Krämer, Michel ; Würz, Hendrik M. ; Göhringer, Patrick (2024)
Executing Ad-Hoc Queries on Large Geospatial Data Sets Without Acceleration Structures.
In: SN Computer Science, 5
doi: 10.1007/s42979-024-02986-z
Article, Bibliography
Abstract
In this case study, we investigate if it is possible to harness the capabilities of modern commodity hardware to perform ad-hoc queries on large raw geospatial data sets. Normally, this requires building an index structure, which is a time-consuming process. We aim to provide means to individual users who receive a new or updated geospatial data set and want to directly start working with it without having to build such an index structure first. To this end, we conduct various experiments on two distinct types of data: 3D building models and point clouds. For the former, we demonstrate that well-known algorithms such as fast string search allow a wide range of queries to be answered in at most a few seconds on data sets with over a million buildings. The usage of progressive indexing additionally improves query run time by more than a factor of two. Regarding point clouds, we achieve similar run times using the popular LAS file format and a query throughput of up to a billion points per second when using a columnar memory layout. The run time of ad-hoc queries is often on par with that of database-driven solutions, sometimes even outperforming them. Considering that ad-hoc queries require no preprocessing, our results show that they are a viable alternative to acceleration structures when working with geospatial data.
Item type: | Article |
---|---|
Published: | 2024 |
Author(s): | Bormann, Pascal ; Krämer, Michel ; Würz, Hendrik M. ; Göhringer, Patrick |
Entry type: | Bibliography |
Title: | Executing Ad-Hoc Queries on Large Geospatial Data Sets Without Acceleration Structures |
Language: | English |
Publication date: | 13 June 2024 |
Publisher: | Springer |
Journal, newspaper, or series title: | SN Computer Science |
Journal volume: | 5 |
DOI: | 10.1007/s42979-024-02986-z |
Uncontrolled keywords: | Geospatial data, Point clouds, 3D City models, Information retrieval |
ID number: | Article ID: 647 |
Additional information: | Article 647 |
Department(s)/field(s): | 20 Department of Computer Science; 20 Department of Computer Science > Interactive Graphics Systems (Graphisch-Interaktive Systeme) |
Date deposited: | 21 Jun 2024 06:51 |
Last modified: | 21 Jun 2024 09:53 |
PPN: | 519314611 |