TU Darmstadt / ULB / TUbiblio

IDEBench: A Benchmark for Interactive Data Exploration

Eichmann, Philipp ; Zgraggen, Emanuel ; Binnig, Carsten ; Kraska, Tim (2020):
IDEBench: A Benchmark for Interactive Data Exploration.
In: Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, pp. 1555-1569,
ACM, SIGMOD/PODS '20: International Conference on Management of Data, virtual Conference, 14.-19.06.2020, ISBN 978-1-4503-6735-6,
DOI: 10.1145/3318464.3380574,
[Conference or Workshop Item]

Abstract

In recent years, many query processing techniques have been developed to better support interactive data exploration (IDE) of large structured datasets. To evaluate and compare database engines in terms of how well they support such workloads, experimenters have mostly used self-designed evaluation procedures rather than established benchmarks. In this paper we argue that this is due to the fact that the workloads and metrics of popular analytical benchmarks such as TPC-H or TPC-DS were designed for traditional performance reporting scenarios, and do not capture distinctive IDE characteristics. Guided by the findings of several user studies we present a new benchmark called IDEBench, designed to evaluate database engines based on common IDE workflows and metrics that matter to the end-user. We demonstrate the applicability of IDEBench through a number of experiments with five different database engines, and present and discuss our findings.

Item Type: Conference or Workshop Item
Erschienen: 2020
Creators: Eichmann, Philipp ; Zgraggen, Emanuel ; Binnig, Carsten ; Kraska, Tim
Title: IDEBench: A Benchmark for Interactive Data Exploration
Language: English
Abstract:

In recent years, many query processing techniques have been developed to better support interactive data exploration (IDE) of large structured datasets. To evaluate and compare database engines in terms of how well they support such workloads, experimenters have mostly used self-designed evaluation procedures rather than established benchmarks. In this paper we argue that this is due to the fact that the workloads and metrics of popular analytical benchmarks such as TPC-H or TPC-DS were designed for traditional performance reporting scenarios, and do not capture distinctive IDE characteristics. Guided by the findings of several user studies we present a new benchmark called IDEBench, designed to evaluate database engines based on common IDE workflows and metrics that matter to the end-user. We demonstrate the applicability of IDEBench through a number of experiments with five different database engines, and present and discuss our findings.

Book Title: Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
Publisher: ACM
ISBN: 978-1-4503-6735-6
Divisions: 20 Department of Computer Science
20 Department of Computer Science > Data Management (2022 umbenannt in Data and AI Systems)
Event Title: SIGMOD/PODS '20: International Conference on Management of Data
Event Location: virtual Conference
Event Dates: 14.-19.06.2020
Date Deposited: 14 Dec 2020 09:23
DOI: 10.1145/3318464.3380574
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Send an inquiry Send an inquiry

Options (only for editors)
Show editorial Details Show editorial Details