An Approach to Visualize Remote Socket Traffic on the Intel Nehalem-EX

Iwainsky, Christian ; Reichstein, Thomas ; Dahnken, Christopher ; Mey, Dieter an ; Terboven, Christian ; Semin, Andrey ; Bischof, Christian
Eds.: Guarracino, M. ; Vivien, F. ; Träff, J. ; Cannataro, M. ; Danelutto, M. ; Hast, A. ; Perla, F. ; Knüpfer, A. ; Di Martino, B. ; Alexander, M. (2011)
An Approach to Visualize Remote Socket Traffic on the Intel Nehalem-EX.
In: Euro-Par 2010 Parallel Processing Workshops, 1st edition
doi: 10.1007/978-3-642-21878-1_64
Book chapter, Bibliography

Abstract

The integration of the memory controller on the processor die enables ever larger core counts in commodity hardware shared memory systems with Non-Uniform Memory Architecture properties. Shared memory parallelization with OpenMP is an elegant and widely used approach to leverage the power of such systems. The binding of the OpenMP threads to compute cores and the corresponding memory association are becoming even more critical in order to obtain optimal performance. In this work we provide a method to measure the amount of remote socket memory accesses a thread generates. We use available performance monitoring CPU counters in combination with thread binding on a quad socket Nehalem EX system. For visualization of the collected data we use Vampir.

Item type: Book chapter
Published: 2011
Editors: Guarracino, M. ; Vivien, F. ; Träff, J. ; Cannataro, M. ; Danelutto, M. ; Hast, A. ; Perla, F. ; Knüpfer, A. ; Di Martino, B. ; Alexander, M.
Author(s): Iwainsky, Christian ; Reichstein, Thomas ; Dahnken, Christopher ; Mey, Dieter an ; Terboven, Christian ; Semin, Andrey ; Bischof, Christian
Entry type: Bibliography
Title: An Approach to Visualize Remote Socket Traffic on the Intel Nehalem-EX
Language: English
Year of publication: 2011
Place of publication: Berlin / Heidelberg
Publisher: Springer
(Issue) number: 6586
Book title: Euro-Par 2010 Parallel Processing Workshops
Series: Lecture Notes in Computer Science
Series volume: 6586
Edition: 1st edition
DOI: 10.1007/978-3-642-21878-1_64

Department(s)/field(s): 20 Department of Computer Science
20 Department of Computer Science > Scientific Computing
Central Facilities
Date deposited: 22 Mar 2013 10:34
Last modified: 04 Aug 2021 16:00