TU Darmstadt / ULB / TUbiblio

Conquering Noise With Hardware Counters on HPC Systems

Ritter, Marcus ; Tarraf, Ahmad ; Geiß, Alexander ; Daoud, Nour ; Mohr, Bernd ; Wolf, Felix (2023)
Conquering Noise With Hardware Counters on HPC Systems.
Workshop on Programming and Performance Visualization Tools (ProTools), held in conjunction with the Supercomputing Conference (SC22). Dallas, USA (13.-18.11.2022)
doi: 10.1109/ProTools56701.2022.00007
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

With increasing system performance and complexity, it is becoming increasingly crucial to examine the scaling behavior of an application and thus determine performance bottlenecks at early stages. Unfortunately, modeling this trend is a challenging task in the presence of noise, as the measurements can become irreproducible and misleading, thus resulting in strong deviations from the actual behavior. While noise impacts the application runtime, it has little to no effect on some hardware counters like floating-point operations. However, selecting the appropriate counters for performance modeling demands some investigation. In this paper, we perform a noise analysis on various hardware counters. Using our noise generator, we add additional noise on top of the system noise to inspect the counters' variability. We perform the analysis on five systems with three applications in the presence of various noise patterns and categorize the counters across the systems according to their noise resilience.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2023
Autor(en): Ritter, Marcus ; Tarraf, Ahmad ; Geiß, Alexander ; Daoud, Nour ; Mohr, Bernd ; Wolf, Felix
Art des Eintrags: Bibliographie
Titel: Conquering Noise With Hardware Counters on HPC Systems
Sprache: Englisch
Publikationsjahr: 19 November 2023
Verlag: IEEE
Buchtitel: Proceedings of ProTools 2022: Workshop on Programming and Performance Visualization Tools
Veranstaltungstitel: Workshop on Programming and Performance Visualization Tools (ProTools), held in conjunction with the Supercomputing Conference (SC22)
Veranstaltungsort: Dallas, USA
Veranstaltungsdatum: 13.-18.11.2022
DOI: 10.1109/ProTools56701.2022.00007
Kurzbeschreibung (Abstract):

With increasing system performance and complexity, it is becoming increasingly crucial to examine the scaling behavior of an application and thus determine performance bottlenecks at early stages. Unfortunately, modeling this trend is a challenging task in the presence of noise, as the measurements can become irreproducible and misleading, thus resulting in strong deviations from the actual behavior. While noise impacts the application runtime, it has little to no effect on some hardware counters like floating-point operations. However, selecting the appropriate counters for performance modeling demands some investigation. In this paper, we perform a noise analysis on various hardware counters. Using our noise generator, we add additional noise on top of the system noise to inspect the counters' variability. We perform the analysis on five systems with three applications in the presence of various noise patterns and categorize the counters across the systems according to their noise resilience.

Zusätzliche Informationen:

Held in conjunction with SC22: The International Conference for High Performance Computing, Networking, Storage and Analysis

Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Parallele Programmierung
Zentrale Einrichtungen
Zentrale Einrichtungen > Hochschulrechenzentrum (HRZ)
Zentrale Einrichtungen > Hochschulrechenzentrum (HRZ) > Hochleistungsrechner
Hinterlegungsdatum: 13 Feb 2024 15:34
Letzte Änderung: 11 Apr 2024 08:35
PPN: 517095491
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen