Conquering Noise With Hardware Counters on HPC Systems

Ritter, Marcus ; Tarraf, Ahmad ; Geiß, Alexander ; Daoud, Nour ; Mohr, Bernd ; Wolf, Felix (2023)
Conquering Noise With Hardware Counters on HPC Systems.
Workshop on Programming and Performance Visualization Tools (ProTools), held in conjunction with the Supercomputing Conference (SC22). Dallas, USA (13.-18.11.2022)
doi: 10.1109/ProTools56701.2022.00007
Conference or Workshop Item, Bibliography

Abstract

With increasing system performance and complexity, it is becoming increasingly crucial to examine the scaling behavior of an application and thus determine performance bottlenecks at early stages. Unfortunately, modeling this trend is a challenging task in the presence of noise, as the measurements can become irreproducible and misleading, thus resulting in strong deviations from the actual behavior. While noise impacts the application runtime, it has little to no effect on some hardware counters like floating-point operations. However, selecting the appropriate counters for performance modeling demands some investigation. In this paper, we perform a noise analysis on various hardware counters. Using our noise generator, we add additional noise on top of the system noise to inspect the counters' variability. We perform the analysis on five systems with three applications in the presence of various noise patterns and categorize the counters across the systems according to their noise resilience.

Item Type: Conference or Workshop Item
Published: 2023
Creators: Ritter, Marcus ; Tarraf, Ahmad ; Geiß, Alexander ; Daoud, Nour ; Mohr, Bernd ; Wolf, Felix
Type of entry: Bibliography
Title: Conquering Noise With Hardware Counters on HPC Systems
Language: English
Date: 19 November 2023
Publisher: IEEE
Book Title: Proceedings of ProTools 2022: Workshop on Programming and Performance Visualization Tools
Event Title: Workshop on Programming and Performance Visualization Tools (ProTools), held in conjunction with the Supercomputing Conference (SC22)
Event Location: Dallas, USA
Event Dates: 13.-18.11.2022
DOI: 10.1109/ProTools56701.2022.00007
Additional Information:

Held in conjunction with SC22: The International Conference for High Performance Computing, Networking, Storage and Analysis

Divisions: 20 Department of Computer Science
20 Department of Computer Science > Parallel Programming
Central Institutions
Central Institutions > University IT-Service and Computing Centre (HRZ)
Central Institutions > University IT-Service and Computing Centre (HRZ) > High-Performance Computer
Date Deposited: 13 Feb 2024 15:34
Last Modified: 11 Apr 2024 08:35
PPN: 517095491