TU Darmstadt / ULB / TUbiblio

IDEBench: A Benchmark for Interactive Data Exploration

Eichmann, Philipp ; Zgraggen, Emanuel ; Binnig, Carsten ; Kraska, Tim (2020)
IDEBench: A Benchmark for Interactive Data Exploration.
SIGMOD/PODS '20: International Conference on Management of Data. virtual Conference (14.06.2020-19.06.2020)
doi: 10.1145/3318464.3380574
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

In recent years, many query processing techniques have been developed to better support interactive data exploration (IDE) of large structured datasets. To evaluate and compare database engines in terms of how well they support such workloads, experimenters have mostly used self-designed evaluation procedures rather than established benchmarks. In this paper we argue that this is due to the fact that the workloads and metrics of popular analytical benchmarks such as TPC-H or TPC-DS were designed for traditional performance reporting scenarios, and do not capture distinctive IDE characteristics. Guided by the findings of several user studies we present a new benchmark called IDEBench, designed to evaluate database engines based on common IDE workflows and metrics that matter to the end-user. We demonstrate the applicability of IDEBench through a number of experiments with five different database engines, and present and discuss our findings.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2020
Autor(en): Eichmann, Philipp ; Zgraggen, Emanuel ; Binnig, Carsten ; Kraska, Tim
Art des Eintrags: Bibliographie
Titel: IDEBench: A Benchmark for Interactive Data Exploration
Sprache: Englisch
Publikationsjahr: 2020
Verlag: ACM
Buchtitel: Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
Veranstaltungstitel: SIGMOD/PODS '20: International Conference on Management of Data
Veranstaltungsort: virtual Conference
Veranstaltungsdatum: 14.06.2020-19.06.2020
DOI: 10.1145/3318464.3380574
Kurzbeschreibung (Abstract):

In recent years, many query processing techniques have been developed to better support interactive data exploration (IDE) of large structured datasets. To evaluate and compare database engines in terms of how well they support such workloads, experimenters have mostly used self-designed evaluation procedures rather than established benchmarks. In this paper we argue that this is due to the fact that the workloads and metrics of popular analytical benchmarks such as TPC-H or TPC-DS were designed for traditional performance reporting scenarios, and do not capture distinctive IDE characteristics. Guided by the findings of several user studies we present a new benchmark called IDEBench, designed to evaluate database engines based on common IDE workflows and metrics that matter to the end-user. We demonstrate the applicability of IDEBench through a number of experiments with five different database engines, and present and discuss our findings.

Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Data Management (2022 umbenannt in Data and AI Systems)
Hinterlegungsdatum: 14 Dez 2020 09:23
Letzte Änderung: 21 Apr 2022 09:03
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen