TU Darmstadt / ULB / TUbiblio

MPI-CorrBench: Towards an MPI Correctness Benchmark Suite

Lehr, Jan-Patrick ; Jammer, Tim ; Bischof, Christian (2021)
MPI-CorrBench: Towards an MPI Correctness Benchmark Suite.
30th International Symposium on High-Performance Parallel and Distributed Computing. virtual Conference (21.-25.06.2021)
doi: 10.1145/3431379.3460652
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

The Message Passing Interface (MPI) is the de-facto standard for distributed memory computing in high-performance computing (HPC). To aid developers write correct MPI programs, different tools have been proposed, e.g., Intel Trace Analyzer and Collector (ITAC), MUST, Parcoach and MPI-Checker. Unfortunately, the effectiveness of these tools is hard to compare, as they have not been evaluated on a common set of applications. More importantly, well-known and widespread benchmarks, which tend to be well-tested and error free, were used for their evaluation. To enable a structured comparison and improve the coverage and reliability of available MPI correctness tools, we propose MPI-CorrBench as a common test harness. MPI-CorrBench enables a structured comparison of the different tools available w.r.t. various types of errors. In our evaluation, we use MPI-CorrBench to provide a well-defined set of error-cases to MUST, ITAC, Parcoach and MPI-Checker. In particular, we find that ITAC and MUST complement each other in many cases. In general, MUST works better for detecting type errors while ITAC is better in detecting errors in non-blocking operations. Although the most-used functions of MPI are well supported, MPI-CorrBench shows that for one sided communication, the error detection capability of all evaluated tools needs improvement. Moreover, our experiments reveal a MPI standard violation in the MPICH test suite as well as several cases of discouraged use of MPI functionality.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2021
Autor(en): Lehr, Jan-Patrick ; Jammer, Tim ; Bischof, Christian
Art des Eintrags: Bibliographie
Titel: MPI-CorrBench: Towards an MPI Correctness Benchmark Suite
Sprache: Englisch
Publikationsjahr: 2021
Verlag: ACM
Buchtitel: HPDC '21: Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing
Veranstaltungstitel: 30th International Symposium on High-Performance Parallel and Distributed Computing
Veranstaltungsort: virtual Conference
Veranstaltungsdatum: 21.-25.06.2021
DOI: 10.1145/3431379.3460652
Kurzbeschreibung (Abstract):

The Message Passing Interface (MPI) is the de-facto standard for distributed memory computing in high-performance computing (HPC). To aid developers write correct MPI programs, different tools have been proposed, e.g., Intel Trace Analyzer and Collector (ITAC), MUST, Parcoach and MPI-Checker. Unfortunately, the effectiveness of these tools is hard to compare, as they have not been evaluated on a common set of applications. More importantly, well-known and widespread benchmarks, which tend to be well-tested and error free, were used for their evaluation. To enable a structured comparison and improve the coverage and reliability of available MPI correctness tools, we propose MPI-CorrBench as a common test harness. MPI-CorrBench enables a structured comparison of the different tools available w.r.t. various types of errors. In our evaluation, we use MPI-CorrBench to provide a well-defined set of error-cases to MUST, ITAC, Parcoach and MPI-Checker. In particular, we find that ITAC and MUST complement each other in many cases. In general, MUST works better for detecting type errors while ITAC is better in detecting errors in non-blocking operations. Although the most-used functions of MPI are well supported, MPI-CorrBench shows that for one sided communication, the error detection capability of all evaluated tools needs improvement. Moreover, our experiments reveal a MPI standard violation in the MPICH test suite as well as several cases of discouraged use of MPI functionality.

Freie Schlagworte: mpi, hpc, message passing interface, correctness
Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Scientific Computing
Zentrale Einrichtungen
Zentrale Einrichtungen > Hochschulrechenzentrum (HRZ)
Zentrale Einrichtungen > Hochschulrechenzentrum (HRZ) > Hochleistungsrechner
Hinterlegungsdatum: 28 Jun 2021 06:58
Letzte Änderung: 30 Jun 2021 10:15
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen