TU Darmstadt / ULB / TUbiblio

Items in division

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: Creators | Date | Item Type | Language | No Grouping
Jump to: A | B | C | D | E | F | G | H | I | J | K | L | M | N | P | R | S | T | U | V | W | X | Z
Number of items at this level: 214.

A

Atre, Rohit and Huda, Zia Ul and Jannesari, Ali and Wolf, Felix :
Dissecting sequential programs for parallelization - an approach based on computational units.
In: 10th International Symposium on High-Level Parallel Programming and Applications, Valladolid, Spain.
[Conference or Workshop Item] , (2017)

Atre, Rohit and Jannesari, Ali and Wolf, Felix :
The Basic Building Blocks of Parallel Tasks.
[Online-Edition: http://doi.acm.org/10.1145/2723772.2723778]
In: Proc. of the International Workshop on Code Optimisation for Multi and Many Cores, San Francisco, CA, USA. ACM
[Conference or Workshop Item] , (2015)

an Mey, Dieter and Biersdorff, Scott and Bischof, Christian and Diethelm, Kai and Eschweiler, Dominic and Gerndt, Michael and Knüpfer, Andreas and Saviankou, Pavel and Schmidl, Dirk and Shende, Sameer S. and Wagner, Michael and Wesarg, Bert and Wolf, Felix and Lorenz, Daniel and Mallony, Allen D. and Nagel, Wolfgang E. and Oleynik, Yury and Rössel, Christian :
Score-P: A Unified Performance Measurement System for Petascale Applications.
[Online-Edition: http://www.springerlink.com/content/t041605372024474/?MUD=MP]
In: Proc. of the CiHPC: Competence in High Performance Computing, HPC Status Konferenz der Gauß-Allianz e.V., Schwetzingen, Germany, June 2010. Springer
[Conference or Workshop Item] , (2012)

Attig, Norbert and Berberich, Florian and Detert, Ulrich and Eicker, Nobert and Eickermann, Thomas and Gibbon, Paul and Gürich, Wolfgang and Homberg, Willi and Illich, Antonia and Rinke, Sebastian and Stephan, Michael and Wolkersdorfer, Klaus and Lippert, Thomas
Münster, Gernot and Wolf, Dietrich and Kremer, Manfred (eds.) :

Entering the Petaflop-Era - New Developments in Supercomputing.
In: NIC Symposium 2010, Jülich. In: IAS Series , 3 . John von Neumann Institute for Supercomputing
[Conference or Workshop Item] , (2010)

Aguilera, Gaby and Teller, Patricia J. and Taufer, Michaela and Wolf, Felix :
A Systematic Multi-step Methodology for Performance Analysis of Communication Traces of Distributed Applications based on Hierarchical Clustering.
In: Proc. of the 5th International Workshop on Performance Modeling, Evaluation, and Organization of Parallel and Distributed Systems (PMEO-PDS, in conjunction with IPDPS 2006), Rhodes Island, Greece. IEEE Computer Society
[Conference or Workshop Item] , (2006)

B

Berens, Yannick :
Scalability Validation of Parallel Sorting Algorithms.
[Online-Edition: http://tuprints.ulb.tu-darmstadt.de/6825]
Technische Universität , Darmstadt
[Bachelor Thesis] , (2017)

Böhme, David and Geimer, Markus and Arnold, Lukas and Voigtländer, Felix and Wolf, Felix :
Identifying the root causes of wait states in large-scale parallel applications.
In: ACM Transactions on Parallel Computing, 3 (2) Article No. 11, 24 pages. ISSN 2329-4949
[Article] , (2016)

Becker, Daniel and Geimer, Markus and Rabenseifner, Rolf and Wolf, Felix :
Extending the scope of the controlled logical clock.
In: Cluster Computing, 16 (1) pp. 171-189. ISSN 1386-7857
[Article] , (2013)

Böhme, David and Geimer, Markus and Wolf, Felix :
Characterizing Load and Communication Imbalance in Large-Scale Parallel Applications.
In: Proc. of the 26th IEEE International Parallel & Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), Shanghai, China. IEEE Computer Society
[Conference or Workshop Item] , (2012)

Böhme, David and Hermanns, Marc-André and Wolf, Felix :
Scalasca.
In: Entwicklung und Evolution von Forschungssoftware, Rolduc, November 2011. In: Aachener Informatik-Berichte, Software Engineering , 14 . Shaker
[Conference or Workshop Item] , (2012)

Böhme, David and Geimer, Markus and Wolf, Felix and Arnold, Lukas :
Identifying the root causes of wait states in large-scale parallel applications.
In: Proc. of the 39th International Conference on Parallel Processing (ICPP), San Diego, CA, USA. IEEE Computer Society
[Conference or Workshop Item] , (2010)

Becker, Daniel and Geimer, Markus and Rabenseifner, Rolf and Wolf, Felix :
Synchronizing the Timestamps of Concurrent Events in Traces of Hybrid MPI/OpenMP Applications.
In: Proc. of IEEE International Conference on Cluster Computing (CLUSTER), Heraklion, Greece. IEEE Computer Society
[Conference or Workshop Item] , (2010)

Böhme, David and Hermanns, Marc-André and Geimer, Markus and Wolf, Felix :
Performance Simulation of Non-blocking Communication in Message-Passing Applications.
In: Proc. of the 2nd Workshop on Productivity and Performance (PROPER) in conjunction with Euro-Par 2009, Delft, The Netherlands. In: Lecture Notes in Computer Science (ISSN 0302-9743) , 6043 . Springer
[Conference or Workshop Item] , (2010)

Becker, Daniel and Rabenseifner, Rolf and Wolf, Felix and Linford, John :
Scalable timestamp synchronization for event traces of message-passing applications.
In: Parallel Computing, 35 (12) pp. 595-607.
[Article] , (2009)

Becker, Daniel and Rabenseifner, Rolf and Wolf, Felix and Linford, John :
Replay-based synchronization of timestamps in event traces of massively parallel applications.
In: Scalable Computing: Practice and Experience, 10 (1) pp. 49-60. ISSN 1895-1767
[Article] , (2009)

Becker, Daniel and Linford, John and Rabenseifner, Rolf and Wolf, Felix :
Replay-based synchronization of timestamps in event traces of massively parallel applications.
In: Proc. of the International Conference on Parallel Processing Workshops (ICPPW), 1st International Workshop on Simulation and Modelling in Emergent Computational Systems (SMECS), Portland, OR, USA. IEEE Computer Society
[Conference or Workshop Item] , (2008)

Becker, Daniel and Rabenseifner, Rolf and Wolf, Felix :
Implications of non-constant clock drifts for the timestamps of concurrent events.
In: Proc. of the IEEE International Conference on Cluster Computing (CLUSTER), Tsukuba, Japan. IEEE Computer Society
[Conference or Workshop Item] , (2008)

Becker, Daniel and Riedel, Morris and Streit, Achim and Wolf, Felix :
Grid-Based Workflow Management for Automatic Performance Analysis of Massively Parallel Applications.
In: Proc. of the 3rd CoreGRID Workshop on Grid Middleware, Barcelona, Spain. In: CoreGRID Series . Springer
[Conference or Workshop Item] , (2008)

Becker, Daniel and Frings, Wolfgang and Wolf, Felix :
Performance Evaluation and Optimization of Parallel Grid Computing Applications.
In: Proc. of the 16th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Toulouse, France. IEEE Computer Society
[Conference or Workshop Item] , (2008)

Becker, Daniel and Frings, Wolfgang and Wolf, Felix :
Performance Evaluation and Optimization of Metacomputing Applications.
[Online-Edition: http://darwin.bth.rwth-aachen.de/opus3/volltexte/2007/2113/p...]
In: Proc. of the 3rd Workshop on Communication in Cluster- and Grid-Systems (KiCC, Kommunikation in Clusterrechnern und Clusterverbundsystemen), Aachen, Germany.
[Conference or Workshop Item] , (2007)

Becker, Daniel and Rabenseifner, Rolf and Wolf, Felix :
Timestamp Synchronization for Event Traces of Large-Scale Message-Passing Applications.
In: Proc. of the 14th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Paris, France. In: Lecture Notes in Computer Science , 4757 . Springer
[Conference or Workshop Item] , (2007)

Becker, Daniel and Wolf, Felix and Frings, Wolfgang and Geimer, Markus and Wylie, Brian J. N. and Mohr, Bernd :
Automatic Trace-Based Performance Analysis of Metacomputing Applications.
In: Proc. of the International Parallel and Distributed Processing Symposium (IPDPS), Long Beach, CA, USA. IEEE Computer Society
[Conference or Workshop Item] , (2007)

Behbahani, Mehdi and Behr, Marek and Bischof, Christian and Wolf, Felix :
Kranken Herzen helfen.
In: RWTH Themen, 1 pp. 44-46.
[Article] , (2007)

Bischof, Christian and Wolf, Felix :
Produktivität versus Performanz in der Simulation.
In: RWTH Themen, 2 pp. 38-39.
[Article] , (2007)

Bhatia, Nikhil and Song, Fengguang and Wolf, Felix and Mohr, Bernd and Dongarra, Jack and Moore, Shirley :
Automatic Experimental Analysis of Communication Patterns in Virtual Topologies.
In: Proc. of the International Conference on Parallel Processing (ICPP), Oslo, Norway. IEEE Society
[Conference or Workshop Item] , (2005)

Bhatia, Nikhil and Moore, Shirley and Wolf, Felix and Dongarra, Jack and Mohr, Bernd :
A Pattern-Based Approach to Automated Application Performance Analysis.
[Online-Edition: http://charm.cs.uiuc.edu/patHPC/papers/moore.pdf]
In: Workshop on Patterns in High Performance Computing (patHPC 2005), Urbana-Champaign, IL, USA.
[Conference or Workshop Item] , (2005)

C

Calotoiu, Alexandru and Graf, Alexander and Hoefler, Torsten and Lorenz, Daniel and Wolf, Felix :
Lightweight Requirements Engineering for Exascale Co-design.
In: Proc. of the 2018 IEEE International Conference on Cluster Computing (CLUSTER), Belfast, UK. IEEE Computer Society
[Conference or Workshop Item] , (2018) (Submitted)

Calotoiu, Alexandru :
Automatic Empirical Performance Modeling of Parallel Programs.
[Online-Edition: http://tuprints.ulb.tu-darmstadt.de/7234]
Technische Universität , Darmstadt
[Ph.D. Thesis], (2017)

Calotoiu, Alexandru and Beckingsale, David and Earl, Christopher W. and Hoefler, Torsten and Karlin, Ian and Schulz, Martin and Wolf, Felix :
Fast Multi-Parameter Performance Modeling.
In: Proc. of the 2016 IEEE International Conference on Cluster Computing (CLUSTER), Taipei, Taiwan. IEEE Computer Society
[Conference or Workshop Item] , (2016)

Calotoiu, Alexandru and Hoefler, Torsten and Wolf, Felix :
Mass-producing Insightful Performance Models.
[Online-Edition: http://hpc.pnl.gov/modsim/2014/index.shtml]
In: Workshop on Modeling & Simulation of Systems and Applications, University of Washington, Seattle, Washington. Seattle, Washington
[Conference or Workshop Item] , (2014)

Calotoiu, Alexandru and Hoefler, Torsten and Poke, Marius and Wolf, Felix :
Using Automated Performance Modeling to Find Scalability Bugs in Complex Codes.
In: Proc. of the ACM/IEEE Conference on Supercomputing (SC13), Denver, CO, USA. ACM
[Conference or Workshop Item] , (2013)

Calotoiu, Alexandru and Siebert, Christian and Wolf, Felix :
Pattern-Independent Detection of Manual Collectives in MPI Programs.
In: Proc. of the 18th Euro-Par Conference, Rhodes Island, Greece. In: Lecture Notes in Computer Science (ISSN 0302-9743) , 7484 . Springer
[Conference or Workshop Item] , (2012)

D

DeRose, Luiz A. and Wolf, Felix :
CATCH — A Call-Graph Based Automatic Tool for Capture of Hardware Performance Metrics for MPI and OpenMP Applications.
In: Proc. of the 8th Euro-Par Conference, Paderborn, Germany. In: Lecture Notes in Computer Science , 2400 . Springer
[Conference or Workshop Item] , (2002)

E

Eschweiler, Dominic and Wagner, Michael and Geimer, Markus and Knüpfer, Andreas and Nagel, Wolfgang E. and Wolf, Felix :
Open Trace Format 2 - The Next Generation of Scalable Trace Formats and Support Libraries.
In: Proc. of the Intl. Conference on Parallel Computing (ParCo), Ghent, Belgium, August 30 — September 2 2011. In: Advances in Parallel Computing , 22 . IOS Press
[Conference or Workshop Item] , (2012)

Eschweiler, Dominic and Becker, Daniel and Wolf, Felix :
Patterns of inefficient performance behavior in GPU applications.
In: Proc. of the 19th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Ayia Napa, Cyprus. IEEE Computer Society
[Conference or Workshop Item] , (2011)

F

Friedrich, Daniel and Li, Zhen and Jannesari, Ali and Wolf, Felix :
Predicting Parallelization of Sequential Programs Using Supervised Learning.
[Online-Edition: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumb...]
In: Proc. of the 12th IEEE International Conference on Machine Learning and Applications (ICMLA), Miami, FL, USA. IEEE Computer Society
[Conference or Workshop Item] , (2013)

Frings, Wolfgang and Ahn, Dong H. and LeGendre, Matthew and Gamblin, Todd and de Supinski, Bronis R. and Wolf, Felix :
Massively Parallel Loading.
In: Proc. of the 27th International Conference on Supercomputing (ICS), Eugene, OR, USA. ACM
[Conference or Workshop Item] , (2013)

Frings, Wolfgang and Wolf, Felix and Petkov, Ventsislav :
Scalable Massively Parallel I/O to Task-Local Files.
In: Proc. of the ACM/IEEE Conference on Supercomputing (SC09), Portland, OR, USA. ACM
[Conference or Workshop Item] , (2009)

Fahringer, Thomas and Gerndt, Michael and Mohr, Bernd and Riley, Graham and Träff, Jesper Larsson and Wolf, Felix :
Knowledge Specification for Automatic Performance Analysis.
Forschungszentrum Jülich
[Report] , (2001)

G

Graf, Alexander :
Modeling Cache Locality with Extra-P.
[Online-Edition: http://tuprints.ulb.tu-darmstadt.de/6562]
Technische Universität , Darmstadt
[Bachelor Thesis] , (2017)

Galonska, Andreas and Gibbon, Paul and Imbeaux, Frederic and Frauel, Yann and Guillerminet, Bernard and Manduchi, Gabriele and Wolf, Felix :
Parallel Universal Access Layer: A Scalable I/O Library for Integrated Tokamak Modelling.
In: Computer Physics Communications, 184 (3) 638–646.
[Article] , (2013)

Geimer, Markus and Saviankou, Pavel and Strube, Alexandre and Szebenyi, Zoltán and Wolf, Felix and Wylie, Brian J. N. :
Further improving the scalability of the Scalasca toolset.
In: Proc. of PARA 2010: State of the Art in Scientific and Parallel Computing, Part II: Minisymposium Scalable tools for High Performance Computing, Reykjavik, Iceland, June 6--9 2010. In: Lecture Notes in Computer Science , 7134 . Springer
[Conference or Workshop Item] , (2012)

Geimer, Markus and Hermanns, Marc-André and Siebert, Christian and Wolf, Felix and Wylie, Brian J. N. :
Scaling Performance Tool MPI Communicator Management.
In: Proc. of the 18th European MPI Users' Group Meeting (EuroMPI), Santorini, Greece. In: Lecture Notes in Computer Science , 6960 . Springer
[Conference or Workshop Item] , (2011)

Geimer, Markus and Wolf, Felix and Wylie, Brian J. N. and Ábrahám, Erika and Becker, Daniel and Mohr, Bernd :
The Scalasca performance toolset architecture.
In: Concurrency and Computation: Practice and Experience, 22 (6) pp. 702-719.
[Article] , (2010)

Geimer, Markus and Wolf, Felix and Wylie, Brian J. N. and Becker, Daniel and Böhme, David and Frings, Wolfgang and Hermanns, Marc-André and Mohr, Bernd and Szebenyi, Zoltán
Müller, Matthias S. and Resch, Michael M. and Nagel, Wolfgang E. and Schulz, Alexander (eds.) :

Recent Developments in the Scalasca Toolset.
In: Tools for High Performance Computing 2009, Proc. of the 3rd Parallel Tools Workshop, Dresden, Germany, September 2009. Springer , pp. 39-51. ISBN 978-3-642-11260-7
[Book Section] , (2010)

Geimer, Markus and Wolf, Felix and Wylie, Brian J. N. and Mohr, Bernd :
A scalable tool architecture for diagnosing wait states in massively parallel applications.
In: Parallel Computing, 35 (7) pp. 375-388. ISSN 0167-8191
[Article] , (2009)

Geimer, Markus and Shende, Sameer S. and Malony, Allen D. and Wolf, Felix
Allen, Gabrielle and Nabrzyski, Jarek and Seidel, Ed and van Albada, Geert Dick and Dongarra, Jack and Sloot, Peter M. A. (eds.) :

A Generic and Configurable Source-Code Instrumentation Component.
In: Proc. of the International Conference on Computational Science (ICCS), Baton Rouge, LA, USA. In: Lecture Notes in Computer Science , 5545 . Springer
[Conference or Workshop Item] , (2009)

Geimer, Markus and Wolf, Felix and Wylie, Brian J. N. and Abraham, Erika and Becker, Daniel and Mohr, Bernd :
The SCALASCA Performance Toolset Architecture.
In: International Workshop on Scalable Tools for High-End Computing (STHEC), Kos, Greece.
[Conference or Workshop Item] , (2008)

Geimer, Markus and Kuhlmann, Björn and Pulatova, Farzona and Wolf, Felix and Wylie, Brian J. N. :
Scalable Collation and Presentation of Call-Path Profile Data with CUBE.
In: Proc. of the Conference on Parallel Computing (ParCo), Aachen/Jülich, Germany.
[Conference or Workshop Item] , (2007)

Geimer, Markus and Wolf, Felix and Knüpfer, Andreas and Mohr, Bernd and Wylie, Brian J. N. :
A Parallel Trace-Data Interface for Scalable Performance Analysis.
In: Proc. of the 8th International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), Umeå, Sweden, June 2006. In: Lecture Notes in Computer Science , 4699 . Springer
[Conference or Workshop Item] , (2007)

Geimer, Markus and Wolf, Felix and Wylie, Brian J. N. and Mohr, Bernd :
Scalable Parallel Trace-Based Performance Analysis.
In: Proc. of the 13th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Bonn, Germany. In: Lecture Notes in Computer Science , 4192 . Springer
[Conference or Workshop Item] , (2006)

Geimer, Markus and Wolf, Felix and Wylie, Brian J. N. and Mohr, Bernd :
Scalable Parallel Trace-Based Performance Analysis.
In: Innovatives Supercomputing in Deutschland (inSiDE), 4 (2) pp. 16-19.
[Article] , (2006)

Gerndt, Michael and Mohr, Bernd and Wolf, Felix and Pantano, Mario :
Performance Analysis for Cray T3E.
In: Proc. of the 7th Euromicro Workshop on Parallel and Disributed Pocessing (PDP), Funchal, Madeira, Portugal. IEEE Computer Society
[Conference or Workshop Item] , (1999)

Gerndt, Michael and Mohr, Bernd and Pantano, Mario and Wolf, Felix :
Automatic Performance Analysis for Cray T3E.
In: Proc. of the 7th Workshop on Compilers for Parallel Computers (CPC), University of Linköping, Sweden.
[Conference or Workshop Item] , (1998)

H

Hermanns, Marc-André and Geimer, Markus and Mohr, Bernd and Wolf, Felix
Niethammer, Christoph and Gracia, José and Hilbrich, Tobias and Knüpfer, Andreas and Resch, Michael and Nagel, Wolfgang E. (eds.) :

Trace-based Detection of Lock Contention in MPI One-Sided Communication.
[Online-Edition: http://juser.fz-juelich.de/record/830159]
In: Tools for High Performance Computing 2016, Proceedings of the 10th Parallel Tools Workshop, Stuttgart, Germany, October 2016. Springer, Cham , pp. 97-114. ISBN 978-3-319-56701-3
[Book Section] , (2017)

Huda, Zia Ul and Atre, Rohit and Jannesari, Ali and Wolf, Felix :
Automatic Parallel Pattern Detection in the Algorithm Structure Design Space.
[Online-Edition: http://dx.doi.org/10.1109/IPDPS.2016.60]
In: Proc. of the 30th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Chicago, USA. IEEE Computer Society
[Conference or Workshop Item] , (2016)

Harlacher, Monika and Calotoiu, Alexandru and Dennis, John and Wolf, Felix
Binder, Kurt and Müller, Marcus and Kremer, Manfred and Schnurpfeil, Alexander (eds.) :

Analysing the Scalability of Climate Codes Using New Features of Scalasca.
In: Proc. of the John von Neumann Institute for Computing (NIC) Symposium 2016, Juelich, Germany. In: NIC Series , 48 . John von Neumann-Institut for Computing
[Conference or Workshop Item] , (2016)

Hermanns, Marc-André and Miklosch, Manfred and Böhme, David and Wolf, Felix :
Understanding the formation of wait states in applications with one-sided communication.
[Online-Edition: http://doi.acm.org/10.1145/2488551.2488569]
In: EuroMPI '13: Proc. of the 20th European MPI Users' Group Meeting, Madrid, Spain, September 15--18, 2013, New York, NY, USA. ACM , New York, NY, USA
[Conference or Workshop Item] , (2013)

Hermanns, Marc-André and Krishnamoorthy, Sriram and Wolf, Felix :
A scalable infrastructure for the performance analysis of passive target synchronization.
[Online-Edition: http://www.sciencedirect.com/science/article/pii/S0167819112...]
In: Parallel Computing, 39 (3) pp. 132-145. ISSN 0167-8191
[Article] , (2013)

Hermanns, Marc-André and Geimer, Markus and Mohr, Bernd and Wolf, Felix :
Scalable detection of MPI-2 remote memory access inefficiency patterns.
In: Intl. Journal of High Performance Computing Applications (IJHPCA), 26 (3) pp. 227-236.
[Article] , (2012)

Harlacher, Daniel and Klimach, Harald and Roller, Sabine and Siebert, Christian and Wolf, Felix :
Dynamic Load Balancing for Unstructured Meshes on Space-Filling Curves.
In: Proc. of the IEEE 26th International Parallel and Distributed Processing Symposium (IPDPS) Workshops & PhD Forum, Workshop on Large-Scale Parallel Processing, Shanghai, China. IEEE Computer Society
[Conference or Workshop Item] , (2012)

Hermanns, Marc-André and Krishnamoorthy, Sriram and Wolf, Felix :
A Scalable Replay-based Infrastructure for the Performance Analysis of One-sided Communication.
In: Proc. of the 1st Intl. Workshop on High-performance Infrastructure for Scalable Tools (WHIST), held in conjunction with the International Conference on Supercomputing (ICS), Tucson, AZ, USA.
[Conference or Workshop Item] , (2011)

Hermanns, Marc-André and Geimer, Markus and Mohr, Bernd and Wolf, Felix :
Scalable Detection of MPI-2 Remote Memory Access Inefficiency Patterns.
In: Proc. of the 16th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Espoo, Finland. In: Lecture Notes in Computer Science , 5759 . Springer
[Conference or Workshop Item] , (2009)

Hermanns, Marc-André and Geimer, Markus and Wolf, Felix and Wylie, Brian J. N. :
Verifying Causality Between Distant Performance Phenomena in Large-Scale MPI Applications.
In: Proc. of the 17th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP), Weimar, Germany. IEEE Computer Society
[Conference or Workshop Item] , (2009)

Hernandez, Oscar and Song, Fengguang and Chapman, Barbara and Dongarra, Jack and Mohr, Bernd and Moore, Shirley and Wolf, Felix :
Performance Instrumentation and Compiler Optimizations for MPI/OpenMP Applications.
In: Proc. of the 2nd International Workshop on OpenMP (IWOMP 2006), Reims, France. In: Lecture Notes in Computer Science , 4315 . Springer
[Conference or Workshop Item] , (2008)

Hermanns, Marc-André and Mohr, Bernd and Wolf, Felix :
Event-based Measurement and Analysis of One-sided Communication.
In: Proc. of the 11th Euro-Par Conference, Lisboa, Portugal. In: Lecture Notes in Computer Science , 3648 . Springer
[Conference or Workshop Item] , (2005)

I

Ilyas, Kashif and Calotoiu, Alexandru and Wolf, Felix :
Off-Road Performance Modeling — How to Deal with Segmented Data.
In: Proc. of the 23rd Euro-Par Conference, Santiago de Compostela, Spain. In: Lecture Notes in Computer Science . Springer
[Conference or Workshop Item] , (2017)

Iwainsky, Christian and Shudler, Sergei and Calotoiu, Alexandru and Strube, Alexandre and Knobloch, Michael and Bischof, Christian and Wolf, Felix
Träff, Jesper Larsson and Hunold, Sascha and Versaci, Francesco (eds.) :

How many threads will be too many? On the scalability of OpenMP implementations.
[Online-Edition: http://dx.doi.org/10.1007/978-3-662-48096-0_35]
In: Euro-Par 2015: Parallel Processing. Lecture Notes in Computer Science, 9233. Springer, Heidelberg, Germany Heidelberg, Germany , pp. 451-463. ISBN 978-3-662-48095-3
[Book Section] , (2015)

J

Jannesari, Ali :
A Software Development Methodology for Multicore Systems.
[Online-Edition: https://hds.hebis.de/ulbda/Record/HEB416193544]
Habilitation, Technische Universität Darmstadt , Darmstadt, Germany
[Habilitation] , (2017)

Jannesari, Ali and Huda, Zia Ul and Atre, Rohit and Li, Zhen and Wolf, Felix :
Parallelizing Audio Analysis Applications - A Case Study.
[Online-Edition: https://doi.org/10.1109/ICSE-SEET.2017.9]
In: Proc. of the 39th International Conference on Software Engineering, Software Engineering Education and Training Track (ICSE-SEET).
[Conference or Workshop Item] , (2017)

Jannesari, Ali and Wolf, Felix and Tichy, Walter F. :
Special Issue on Software Engineering for Parallel Systems.
[Online-Edition: https://www.sciencedirect.com/science/article/pii/S016412121...]
In: Journal of Systems and Software, 125 pp. 380-448. ISSN 0164-1212
[Article] , (2017)

Jannesari, Ali :
A software development methodology for multicore systems.
Darmstadt
[Habilitation] , (2017)

Jannesari, Ali and Wolf, Felix :
Automatic Generation of Unit Tests for Correlated Variables in Parallel Programs.
[Online-Edition: http://dx.doi.org/10.1007/s10766-015-0363-8]
In: International Journal of Parallel Programming (IJPP), 44 (3) pp. 644-662. ISSN 1573-7640
[Article] , (2016)

Jeyakumaran, Thireshan and Atoofian, Ehsan and Xiao, Yang and Li, Zhen and Jannesari, Ali :
Improving Performance of Transactional Applications through Adaptive Transactional Memory.
In: Proc. of the 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP), Heraklion, Crete, Greece.
[Conference or Workshop Item] , (2016)

Jannesari, Ali and Benkner, Siegfried and Zhao, Xinghui and Atoofian, Ehsan and Sato, Yukionri :
Workshop Preview of the 2nd International Workshop on Software for Parallel Systems (SEPS 2015).
In: Companion Proceedings of the 2015 ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity, Pittsburgh, PA, USA. In: SPLASH Companion 2015 . ACM , New York, NY, USA
[Conference or Workshop Item] , (2015)

Jannesari, Ali :
Detection of High-Level Synchronization Anomalies in Parallel Programs.
In: International Journal of Parallel Programming (IJPP), 43 (4) pp. 656-678. ISSN 0885-7458
[Article] , (2015)

Jannesari, Ali and Tichy, Walter F. :
Library-Independent Data Race Detection.
In: IEEE Transactions on Parallel and Distributed Systems (TPDS), 25 (10) pp. 2606-2616. ISSN 1045-9219
[Article] , (2014)

Jannesari, Ali and Koprowski, Nico and Schimmel, Jochen and Wolf, Felix :
Generating Classified Parallel Unit Tests.
[Online-Edition: http://dx.doi.org/10.1007/978-3-319-09099-3_9]
In: Lecture Notes in Computer Science , 8570 .
[Conference or Workshop Item] , (2014)

Jannesari, Ali and Wolf, Felix :
Unit Tests for Correlated Variables in Multi-threaded Code.
In: 7th International Symposium on High-level Parallel Programming and Applications (HLPP), Amsterdam, Netherlands.
[Conference or Workshop Item] , (2014)

Jannesari, Ali and Wolf, Felix and Tichy, Walter F. :
A Summary of the First International Workshop on Software Engineering for Parallel Systems.
In: Proc. of the Companion Publication of the 2014 ACM SIGPLAN Conference on Systems, Programming, and Applications: Software for Humanity (SPLASH), Portland, OR, USA. ACM , New York, NY, USA
[Conference or Workshop Item] , (2014)

Jannesari, Ali and Koprowski, Nico and Schimmel, Jochen and Wolf, Felix and Tichy, Walter F. :
Detecting Correlation Violations and Data Races by Inferring Non-deterministic Reads.
In: Proc. of the 19th IEEE International Conference on Parallel and Distributed Systems (ICPADS), Seoul, Korea. IEEE Computer Society
[Conference or Workshop Item] , (2013)

Jannesari, Ali and Westphal-Furuya, Markus and Tichy, Walter F. :
Dynamic data race detection for correlated variables.
[Online-Edition: http://dl.acm.org/citation.cfm?id=2075416.2075421]
In: ICA3PP'11, Melbourne, Australia. In: ICA3PP'11 . Springer-Verlag , Berlin, Heidelberg
[Conference or Workshop Item] , (2011)

Jannesari, Ali and Tichy, Walter F. :
Identifying ad-hoc synchronization for enhanced race detection.
In: IPDPS, IEEE 2010, Atlanta, GA, USA. Parallel Distributed Processing (IPDPS), 2010 IEEE International Symposium on
[Conference or Workshop Item] , (2010)

Jannesari, Ali and Bao, Kaibin and Pankratius, Victor and Tichy, Walter F. :
Helgrind+: An efficient dynamic race detector.
In: IPDPS 2009. IEEE International Symposium. Parallel Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on Rom, Italien
[Conference or Workshop Item] , (2009)

Jannesari, Ali and Tichy, Walter F. :
On-the-fly race detection in multi-threaded programs.
In: PADTAD '08, 6th workshop on Parallel and distributed systems, Seattle, WA, USA. In: PADTAD '08 . ACM , New York, NY, USA
[Conference or Workshop Item] , (2008)

Jannesari, Ali :
Analysis of Agile Software Development Methods for Embedded Systems.
[Online-Edition: http://www.ias.uni-stuttgart.de]
University of Stuttgart , Stuttgart, Germany
[Master Thesis] , (2005)

K

Kuo, Chihsong and Shah, Aamer and Nomura, Akihiro and Matsuoka, Satoshi and Wolf, Felix :
How File Access Patterns Influence Interference Among Cluster Applications.
In: Proc. of the IEEE International Conference on Cluster Computing (CLUSTER), Madrid, Spain. IEEE Computer Society
[Conference or Workshop Item] , (2014)

Knüpfer, Andreas and Dietrich, Robert and Doleschal, Jens and Geimer, Markus and Hermanns, Marc-André and Rössel, Christian and Tschülter, Ronny and Wesarg, Bert and Wolf, Felix
Cheptsov, Alexey and Brinkmann, Steffen and Gracia, José and Resch, Michael M. and Nagel, Wolfgang E. (eds.) :

Generic Support for Remote Memory Access Operations in Score-P and OTF2.
In: Tools for High Performance Computing 2012, Proc. of the 6th Parallel Tools Workshop, Stuttgart, Germany, September 2012. Springer , pp. 57-74. ISBN 978-3-642-37348-0
[Book Section] , (2013)

Knüpfer, Andreas and Rössel, Christian and an Mey, Dieter and Biersdorff, Scott and Diethelm, Kai and Eschweiler, Dominic and Geimer, Markus and Gerndt, Michael and Lorenz, Daniel and Malony, Allen D. and Nagel, Wolfgang E. and Oleynik, Yury and Phillipen, Peter and Saviankou, Pavel and Schmidl, Dirk and Shende, Sameer S. and Tschülter, Ronny and Wagner, Michael and Wesarg, Bert and Wolf, Felix :
Score-P — A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir.
[Online-Edition: http://dx.doi.org/10.1007/978-3-642-31476-6_7]
In: Tools for High Performance Computing 2011, Proc. of 5th Parallel Tools Workshop, Dresden, Germany, September 2011. Springer , pp. 79-91. ISBN 978-3-642-31476-6
[Book Section] , (2012)

Kühnal, Andrej and Hermanns, Marc-André and Mohr, Bernd and Wolf, Felix :
Specification of Inefficiency Patterns for MPI-2 One-sided Communication.
In: Proc. of the 12th Euro-Par Conference, Dresden, Germany. In: Lecture Notes in Computer Science , 4128 . Springer
[Conference or Workshop Item] , (2006)

L

Lorenz, Daniel and Feld, Christian :
Scaling Score-P to the next level.
In: Proc. of the International Conference of Computational Science Workshops. Elsevier , Zürich, Switzerland
[Conference or Workshop Item] , (2017)

Li, Zhen :
Discovery of Potential Parallelism in Sequential Programs.
[Online-Edition: http://tuprints.ulb.tu-darmstadt.de/5741/]
Technische Universität Darmstadt , Darmstadt, Germany
[Ph.D. Thesis]

Li, Zhen :
Discovery of Potential Parallelism in Sequential Programs.
[Online-Edition: http://tuprints.ulb.tu-darmstadt.de/5741]
Technische Universität Darmstadt , Darmstadt
[Ph.D. Thesis], (2016)

Li, Zhen and Atre, Rohit and Huda, Zia Ul and Jannesari, Ali and Wolf, Felix :
Unveiling Parallelization Opportunities in Sequential Programs.
In: Journal of Systems and Software, 117 282–295.
[Article] , (2016)

Li, Zhen and Zhao, Bo and Jannesari, Ali and Wolf, Felix :
Beyond Data Parallelism: Identifying Parallel Tasks in Sequential Programs.
In: Proc. of 15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), Zhangjiajie, China. In: Lecture Notes in Computer Science , 9531 . Springer International Publishing
[Conference or Workshop Item] , (2015)

Li, Zhen and Beaumont, Michael and Jannesari, Ali and Wolf, Felix :
Fast Data-Dependence Profiling by Skipping Repeatedly Executed Memory Operations.
In: Proc. of 15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), Zhangjiajie, China. In: Lecture Notes in Computer Science , 9531 . Springer International Publishing
[Conference or Workshop Item] , (2015)

Lorenz, Daniel and Shudler, Sergei and Wolf, Felix :
Preventing the explosion of exascale profile data with smart thread-level aggregation.
In: Proc. of ESPT2015: Workshop on Extreme Scale Programming Tools, held in conjunction with the Supercomputing Conference (SC15), Austin, TX, USA. ACM
[Conference or Workshop Item] , (2015)

Li, Zhen and Jannesari, Ali and Wolf, Felix :
An Efficient Data-Dependence Profiler for Sequential and Parallel Programs.
[Online-Edition: http://dx.doi.org/10.1109/IPDPS.2015.41]
In: Proc. of the 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Hyderabad, India. IEEE Computer Society
[Conference or Workshop Item] , (2015)

Li, Zhen and Atre, Rohit and Ul-Huda, Zia and Jannesari, Ali and Wolf, Felix :
DiscoPoP: A Profiling Tool to Identify Parallelization Opportunities.
[Online-Edition: http://www.springer.com/us/book/9783319160115]
In: Tools for High Performance Computing 2014, Proc. of the 8th Parallel Tools Workshop,Stuttgart, Germany, October 2014. Springer International Publishing , pp. 37-54. ISBN 978-3-319-16011-5
[Book Section] , (2015)

Li, Zhen and Jannesari, Ali and Wolf, Felix :
Discovering Parallelization Opportunities in Sequential Programs — A Closer-to-Complete Solution.
In: First International Workshop on Software Engineering for Parallel Systems.
[Conference or Workshop Item] , (2014)

Lorenz, Daniel and Dietrichü, Robert and Tschüter, Ronny and Wolf, Felix :
A comparison between OPARI2 and the OpenMP tools interface in the context of Score-P.
[Online-Edition: http://dx.doi.org/10.1007/978-3-319-11454-5_12]
In: Proc. of the 10th International Workshop on OpenMP (IWOMP), Salvador, Brazil, September 2014. In: LNCS , 8766 . Springer International Publishing
[Conference or Workshop Item] , (2014)

Lengauer, Christian and Bougë, Luc and Wolf, Felix :
Special issue: Euro-Par 2013.
In: Concurrency and Computation: Practice and Experience, 26 (14) pp. 2345-2346. ISSN 1532-0634
[Article] , (2014)

Li, Zhen and Jannesari, Ali and Wolf, Felix :
Discovery of Potential Parallelism in Sequential Programs.
[Online-Edition: http://dx.doi.org/10.1109/ICPP.2013.119]
In: Proc. of the 42nd International Conference on Parallel Processing Workshops (ICPPW), Workshop on Parallel Software Tools and Tool Infrastructures (PSTI), Lyon, France.
[Conference or Workshop Item] , (2013)

Lorenz, Daniel and Philippen, Peter and Schmidl, Dirk and Wolf, Felix :
Profiling of OpenMP tasks with Score-P.
In: Proc. of the 41st International Conference on Parallel Processing Workshops (ICPPW), Workshop on Parallel Software Tools and Tool Infrastructures (PSTI).
[Conference or Workshop Item] , (2012)

Lorenz, Daniel and Mohr, Bernd and Rössel, Christian and Schmidl, Dirk and Wolf, Felix :
How to reconcile event-based performance analysis with tasking in OpenMP.
In: Proc. of 6th Int. Workshop of OpenMP (IWOMP), Tsukuba, Japan. In: Lecture Notes in Computer Science , 6132 . Springer
[Conference or Workshop Item] , (2010)

M

Mazaheri, Arya and Wolf, Felix and Jannesari, Ali :
Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics.
In: Proc. of the 47th International Conference on Parallel Processing (ICPP), August 13 - 16, 2018, Eugene, OR, USA.
[Conference or Workshop Item] , (2018)

Moskewicz, Matthew W. and Jannesari, Ali and Keutzer, Kurt :
Boda: A Holistic Approach for Implementing Neural Network Computations.
[Online-Edition: http://doi.acm.org/10.1145/3075564.3077382]
In: Proc. of the ACM International Conference on Computing Frontiers, Siena, Italy. In: CF '17 . ACM , New York, NY, USA
[Conference or Workshop Item] , (2017)

Mohr, Bernd and Wolf, Felix :
The Virtual Institute – High-Productivity Supercomputing Celebrates its 10th Anniversary.
[Online-Edition: http://inside.hlrs.de/editions/17autumn.html#the-virtual-ins...]
In: Innovatives Supercomputing in Deutschland (inSiDE), 15 (2) pp. 40-41.
[Article] , (2017)

Moskewicz, Matthew W. and Jannesari, Ali and Keutzer, Kurt :
A Metaprogramming and Autotuning Framework for Deploying Deep Learning Applications.
In: arXiv preprint arXiv:1611.06945
[Article] , (2016)

Mazaheri, Arya and Jannesari, Ali and Mirzaei, Abdolreza and Wolf, Felix :
Characterizing Loop-Level Communication Patterns in Shared Memory Applications.
In: Proc. of the 44th International Conference on Parallel Processing (ICPP), Beijing, China.
[Conference or Workshop Item] , (2015)

Mao, Gouyong and Böhme, David and Hermanns, Marc-André and Geimer, Markus and Lorenz, Daniel and Wolf, Felix :
Catching Idlers with Ease: A Lightweight Wait-State Profiler for MPI Programs.
[Online-Edition: http://doi.acm.org/10.1145/2642769.2642783]
In: EuroMPI '14: Proc. of the 21th European MPI Users' Group Meeting, New York, NY, USA. ACM , New York, NY, USA
[Conference or Workshop Item] , (2014)

Mohr, Bernd and Voevodin, Vladimir and Giméz, Judit and Hagersten, Erik and Knüpfer, Andreas and Nilsson, Mats and Nikitenko, Dmitry A. and Servat, Harald and Shah, Aamer and Winkler, Frank and Wolf, Felix and Zhukov, Ilya
Cheptsov, Alexey and Brinkmann, Steffen and Gracia, José and Resch, Michael M. and Nagel, Wolfgang E. (eds.) :

The HOPSA Workflow and Tools.
In: Tools for High Performance Computing 2012, Proc. of the 6th Parallel Tools Workshop, Stuttgart, Germany, September 2012. Springer , pp. 127-146. ISBN 978-3-642-37348-0
[Book Section] , (2013)

Mohr, Bernd and Wolf, Felix and Calotoiu, Alexandru and Hoefler, Torsten :
The Catwalk Project – A Quick Development Path for Performance Models.
[Online-Edition: http://inside.hlrs.de/htm/Edition_02_13/article_17.html]
In: Innovatives Supercomputing in Deutschland (inSiDE), 11 (2) pp. 68-71.
[Article] , (2013)

Mußler, Jan and Lorenz, Daniel and Wolf, Felix :
Reducing the overhead of direct application instrumentation using prior static analysis.
In: Proc. of the 17th Euro-Par Conference, Bordeaux, France. In: Lecture Notes in Computer Science , 6852 . Springer
[Conference or Workshop Item] , (2011)

Memon, Mohammad Shahbaz and Riedel, Morris and Memon, Ahmed Shiraz and Wolf, Felix and Streit, Achim and Lippert, Thomas and Plociennik, M. and Owsiak, M. and Tskhakaya, D. and Konz, Ch. :
Lessons Learned From Jointly Using HTC- and HPC-driven e-Science Infrastructures in Fusion Science.
In: Proc. of the International Conference on Information and Emerging Technologies (ICIET), Karachi, Pakistan. IEEE
[Conference or Workshop Item] , (2010)

Mohr, Bernd and Wylie, Brian J. N. and Wolf, Felix :
Performance measurement and analysis tools for extremely scalable systems.
In: Concurrency and Computation: Practice and Experience, 22 (16) pp. 2212-2229.
[Article] , (2010)

Memon, Mohammad Shahbaz and Memon, Ahmed Shiraz and Riedel, Morris and Streit, Achim and Wolf, Felix :
Enabling Grid Interoperability by Extending HPC-driven Job Management with an Open Standard Information Model.
In: Proc. of the 8th IEEE/ACIS International Conference on Computer and Information Science (ICIS), Shanghai, China. IEEE Computer Society
[Conference or Workshop Item] , (2009)

Malony, Allen D. and Shende, Sameer S. and Morris, Alan and Wolf, Felix :
Compensation of Measurement Overhead in Parallel Performance Profiling.
In: International Journal of High Performance Computing Applications, 21 (2) pp. 174-194. ISSN 1094-3420
[Article] , (2007)

Moore, Shirley and Wolf, Felix and Dongarra, Jack and Shende, Sameer S. and Malony, Allen D. and Mohr, Bernd :
A Scalable Approach to MPI Application Performance Analysis.
In: Proc. of the 12th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Sorrento, Italy. In: Lecture Notes in Computer Science , 3666 . Springer
[Conference or Workshop Item] , (2005)

Mohr, Bernd and Kühnal, Andrej and Hermanns, Marc-André and Wolf, Felix :
Performance Analysis of One-sided Communication Mechanisms.
In: Proc. of the Conference on Parallel Computing (ParCo), Malaga, Spain.
[Conference or Workshop Item] , (2005)

Moore, Shirley and Wolf, Felix and Dongarra, Jack and Mohr, Bernd :
Improving Time to Solution with Automated Performance Analysis.
[Online-Edition: http://www.cs.utk.edu/~shirley/papers/pphec05.pdf]
In: 2nd Workshop on Productivity and Performance in High-End Computing (P-PHEC), San Francisco, CA, USA.
[Conference or Workshop Item] , (2005)

Mucci, Philip and Dongarra, Jack and Kufrin, Rick and Moore, Shirley and Song, Fengguang and Wolf, Felix :
Automating the Large-Scale Collection and Analysis of Performance Data on Linux Clusters.
[Online-Edition: http://www.linuxclustersinstitute.org/conferences/archive/20...]
In: 5th LCI International Conference on Linux Clusters: The HPC Revolution, Austin, TX, USA.
[Conference or Workshop Item] , (2004)

Mohr, Bernd and Malony, Allen D. and Shende, Sameer S. and Wolf, Felix :
Design and Prototype of a Performance Tool Interface for OpenMP.
In: The Journal of Supercomputing, 23 (1) pp. 105-128.
[Article] , (2002)

Mohr, Bernd and Malony, Allen D. and Shende, Sameer S. and Wolf, Felix :
Design and Prototype of a Performance Tool Interface for OpenMP.
In: 2nd Annual Los Alamos Computer Science Institute Symposium (LACSI), Santa Fe, NM, USA.
[Conference or Workshop Item] , (2001)

Mohr, Bernd and Malony, Allen D. and Shende, Sameer S. and Wolf, Felix :
Towards a Performance Tool Interface for OpenMP: An Approach based on Directive Rewriting.
In: 3rd European Workshop on OpenMP (EWOMP), Barcelona, Spain.
[Conference or Workshop Item] , (2001)

N

Norouzi, Mohammad and Jannesari, Ali :
Resource and application-aware resource discovery in computing environments.
[Online-Edition: http://dx.doi.org/10.1007/s11227-014-1327-2]
In: The Journal of Supercomputing, 71 (3) pp. 824-839. ISSN 0920-8542
[Article] , (2015)

P

Prabhakaran, Suraj and Neumann, Marcel and Wolf, Felix :
Efficient Fault Tolerance through Dynamic Node Replacement.
In: Proc. of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Washington, DC, USA.
[Conference or Workshop Item] , (2018)

Prabhakaran, Suraj :
Dynamic Resource Management and Job Scheduling for High Performance Computing.
[Online-Edition: http://tuprints.ulb.tu-darmstadt.de/5720/]
Technische Universität Darmstadt , Darmstadt, Germany
[Ph.D. Thesis]

Prabhakaran, Suraj :
Dynamic Resource Management and Job Scheduling for High Performance Computing.
[Online-Edition: http://tuprints.ulb.tu-darmstadt.de/5720]
Technische Universität Darmstadt , Darmstadt
[Ph.D. Thesis], (2016)

Prabhakaran, Suraj and Neumann, Marcel and Rinke, Sebastian and Wolf, Felix and Gupta, Abhishek and Kalë, Laxmikant V. :
A Batch System with Efficient Scheduling for Malleable and Evolving Applications.
[Online-Edition: http://dx.doi.org/10.1109/IPDPS.2015.34]
In: Proc. of the 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Hyderabad, India. IEEE Computer Society
[Conference or Workshop Item] , (2015)

Prabhakaran, Suraj and Iqbal, Mohsin and Rinke, Sebastian and Windisch, Christian and Wolf, Felix :
A Batch System with Fair Scheduling for Evolving Applications.
In: Proc. of the 43rd International Conference on Parallel Processing (ICPP), Minneapolis, MN, USA.
[Conference or Workshop Item] , (2014)

Prabhakaran, Suraj and Iqbal, Mohsin and Rinke, Sebastian and Wolf, Felix :
A Dynamic Resource Management System for Network-Attached Accelerator Clusters.
In: Proc. of the 42nd International Conference on Parallel Processing Workshops (ICPPW), Workshop on Scheduling and Resource Management for Parallel and Distributed Systems (SRMPDS), Lyon, France.
[Conference or Workshop Item] , (2013)

Pankratius, Victor and Jannesari, Ali and Tichy, Walter F. :
Parallelizing Bzip2: A Case Study in Multicore Software Engineering.
In: IEEE Software, 26 (6) 70 -77. ISSN 0740-7459
[Article] , (2009)

Pankratius, Victor and Schaefer, Christoph and Jannesari, Ali and Tichy, Walter F. :
Software engineering for multicore systems: an experience report.
In: 1st international workshop on Multicore software engineering, ICSE'08, Leipzig, Germany. In: IWMSE '08 . ACM , New York, NY, USA
[Conference or Workshop Item] , (2008)

Pankratius, Victor and Schaefer, Christoph and Jannesari, Ali and Tichy, Walter F. :
Software Engineering for Multicore Systems - An Experience Report.
[Online-Edition: http://www.ipd.uni-karlsruhe.de/multicore/research/download/...]
University of Karlsruhe
[Report] , (2007)

R

Roth, Philip C. and Huck, Kevin and Gopalakrishnan, Ganesh and Wolf, Felix :
Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges.
In: Proc. of the 5th Workshop on Visual Performance Analysis (VPA), held in conjunction with the Supercomputing Conference (SC18), Dallas, TX, USA.
[Conference or Workshop Item] , (2018)

Rinke, Sebastian and Butz-Ostendorf, Markus and Hermanns, Mikaël and Wolf, Felix :
A Scalable Algorithm for Simulating the Structural Plasticity of the Brain.
[Online-Edition: http://dx.doi.org/10.1016/j.jpdc.2017.11.019]
In: Journal of Parallel and Distributed Computing
[Article] , (2018)

Rinke, Sebastian :
A Scalable Parallel Algorithm for the Simulation of Structural Plasticity in the Brain.
[Online-Edition: https://tuprints.ulb.tu-darmstadt.de/7756]
Technische Universität , Darmstadt
[Ph.D. Thesis], (2018)

Reisert, Patrick and Calotoiu, Alexandru and Shudler, Sergei and Wolf, Felix :
Following the Blind Seer — Creating Better Performance Models Using Less Information.
In: Proc. of the 23rd Euro-Par Conference, Santiago de Compostela, Spain. In: Lecture Notes in Computer Science . Springer
[Conference or Workshop Item] , (2017)

Rinke, Sebastian and Naveau, Mikaël and Wolf, Felix and Butz-Ostendorf, Markus
van Ooyen, Arjen and Butz-Ostendorf, Markus (eds.) :

The Rewiring Brain: A Computational Approach to Structural Plasticity in the Adult Brain.
In: Homeostatic Structural Plasticity in a Full-Scale Model of the Developing Cortical Column. Academic Press, San Diego San Diego , pp. 177-202. ISBN 9780128037843
[Book Section] , (2017)

Rinke, Sebastian and Butz-Ostendorf, Markus and Hermanns, Marc-André and Naveau, Mikaël and Wolf, Felix :
A Scalable Algorithm for Simulating the Structural Plasticity of the Brain.
In: Proc. of the 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Los Angeles, CA, USA.
[Conference or Workshop Item] , (2016)

Rinke, Sebastian and Prabhakaran, Suraj and Wolf, Felix :
Efficient Offloading of Parallel Kernels Using MPICommspawn.
In: Proc. of the 42nd International Conference on Parallel Processing Workshops (ICPPW), Workshop on Heterogeneous and Unconventional Cluster Architectures and Applications (HUCAA), Lyon, France.
[Conference or Workshop Item] , (2013)

Rinke, Sebastian and Becker, Daniel and Lippert, Thomas and Prabhakaran, Suraj and Westphal, Lidia and Wolf, Felix :
A Dynamic Accelerator-Cluster Architecture.
In: Proc. of the 41st International Conference on Parallel Processing Workshops (ICPPW), Workshop on Scheduling and Resource Management for Parallel and Distributed Systems (SRMPDS), Pittsburgh, PA, USA.
[Conference or Workshop Item] , (2012)

Rössel, Christian and Mohr, Bernd and Gerndt, Michael and Wolf, Felix :
Performance Dynamics of Massively Parallel Codes.
[Online-Edition: http://inside.hlrs.de/pdfs/inSiDE_autumn2012.pdf]
In: Innovatives Supercomputing in Deutschland (inSiDE), 10 (2) pp. 72-73.
[Article] , (2012)

Rössel, Christian and Mohr, Bernd and Wolf, Felix :
Score-P.
In: Entwicklung und Evolution von Forschungssoftware, Rolduc, November 2011. In: Aachener Informatik-Berichte, Software Engineering , 14 . Shaker
[Conference or Workshop Item] , (2012)

Riedel, Morris and Schuller, Bernd and Rambadt, Michael and Memon, Mohammad Shahbaz and Memon, Ahmed Shiraz and Streit, Achim and Lippert, Thomas and Zasada, Stefan J. and Manos, Steven and Coveney, Peter V. and Wolf, Felix and Kranzlmüller, Dieter :
Exploring the Potential of Using Multiple E-science Infrastructures with Emerging Open Standards-Based E-health Research Tools.
In: Proc. of the 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Melbourne, Victoria, Australia. IEEE Computer Society
[Conference or Workshop Item] , (2010)

Rinke, Sebastian and Böttiger, Hans and Krill, Benjamin :
QPACE: Energy-Efficient High Performance Computing.
In: Proc. of 23th International Conference on Architecture of Computing Systems (ARCS), PARS Workshop, Hannover, Germany.
[Conference or Workshop Item] , (2010)

Riedel, Morris and Frings, Wolfgang and Eickermann, Thomas and Habbinga, Sonja and Gibbon, Paul and Mallmann, Daniel and Streit, Achim and Wolf, Felix and Lippert, Thomas :
Collaborative Interactivity in Parallel HPC Applications.
In: Proc. of the Instrumenting the Grid (InGrid) 2008 Workshop, Lacco Ameno, Island of Ischia, Italy. Springer
[Conference or Workshop Item] , (2010)

Riedel, Morris and Streit, Achim and Mallmann, Daniel and Wolf, Felix and Lippert, Thomas :
Experiences and Requirements for Interoperability Between HTC and HPC-driven e-Science Infrastructure.
In: Future Application and Middleware Technology on e-Science. Springer US , pp. 113-123. ISBN 978-1-4419-1724-9
[Book Section] , (2010)

Rinke, Sebastian and Mehlan, Torsten and Rehm, Wolfgang :
Evaluation of Task Mapping Strategies for Regular Network Topologies.
In: ParCo 2009, Lyon, France. In: Advances in Parallel Computing , 19 .
[Conference or Workshop Item] , (2010)

Riedel, Morris and Laure, E. and Soddemann, Th. and Field, L. and Navarro, J. P. and Casey, J. and Lithmaath, M. and Baud, J. Ph. and Koblitz, B. and Catlett, C. and Skow, D. and Zheng, C. and Papadopoulos, P. M. and Katz, M. and Sharma, N. and Smirnova, O. and Kónya, B. and Arzberger, P. and Würthwein, F. and Rana, A. S. and Martin, T. and Wan, M. and Welch, V. and Rimovsky, T. and Newhouse, S. and Vanni, A. and Tanaka, Y. and Tanimura, Y. and Ikegami, T. and Abramson, D. and Enticott, C. and Jenkins, G. and Pordes, R. and Timm, S. and Moont, G. and Aggarwal, M. and Colling, D. and van der Aa, O. and Sim, A. and Natarajan, V. and Shoshani, A. and Gu, J. and Galang, G. and Zappi, R. and Magnoni, L. and Ciaschini, V. and Pace, M. and Venturi, Valerio and Marzolla, Moreno and Andreetto, Paolo and Cowles, B. and Wang, S. and Saeki, Y. and Sato, H. and Matsuoka, S and Uthayopas, P. and Sriprayoonsakul, S. and Koeroo, O. and Viljoen, M. and Pearlman, L. and Pickles, S. and Wallom, D. and Moloney, G. and Lauret, J. and Marsteller, J. and Sheldon, P. and Pathak, S. and De Witt, S. and Mencák, J. and Jensen, J. and Hodges, M. and Ross, D. and Phatanapherom, S. and Netzer, G. and Gregersen, A. R. and Jones, M. and Chen, S. and Kacsuk, P. and Streit, Achim and Mallmann, Daniel and Wolf, Felix and Lippert, Thomas and Delaitre, Th. and Huedo, E. and Geddes, N. :
Interoperation of World-Wide Production e-Science Infrastructures.
In: Concurrency and Computation: Practice and Experience, 21 (8) pp. 961-990. ISSN 1532-0626
[Article] , (2009)

Riedel, Morris and Wolf, Felix and Kranzlmüller, Dieter and Streit, Achim and Lippert, Thomas :
Research Advances by Using Interoperable e-Science Infrastructures - The Infrastructure Interoperability Reference Model Applied in e-Science.
In: Cluster Computing, 12 (4) pp. 357-372. ISSN 1386-7857
[Article] , (2009)

Riedel, Morris and Streit, Achim and Lippert, Thomas and Wolf, Felix and Kranzlmüller, Dieter :
Concepts and Design of an Interoperability Reference Model for Scientific- and Grid Computing Infrastructures.
In: Proc. of the Applied Computing Conference, in Mathematical Methods and Applied Computing, Volume II. WSEAS Press
[Conference or Workshop Item] , (2009)

Riedel, Morris and Streit, Achim and Wolf, Felix and Lippert, Thomas and Kranzlmüller, Dieter :
Classification of Different Approaches for e-Science Applications in Next Generation Computing Infrastructures.
In: Proc. of the 4th IEEE Conference on e-Science (e-Science), Indianapolis, USA.
[Conference or Workshop Item] , (2008)

Riedel, Morris and Frings, Wolfgang and Habbinga, Sonja and Eickermann, Thomas and Mallmann, Daniel and Streit, Achim and Wolf, Felix and Lippert, Thomas :
Extending the Collaborative Online Visualization and Steering Framework for Computational Grids with Attribute-based Authorization.
In: Proc. of the 9th IEEE/ACM International Conference on Grid Computing (Grid 2008), Tsukuba, Japan. IEEE Computer Society
[Conference or Workshop Item] , (2008)

Riedel, Morris and Memon, Ahmed Shiraz and Memon, Mohammad Shahbaz and Mallmann, Daniel and Streit, Achim and Wolf, Felix and Lippert, Thomas and Venturi, Valerio and Andreetto, Paolo and Marzolla, Moreno and Ferraro, Andrea and Ghiselli, Antonia and Hedman, Fredrik and Shah, Zeeshan Ali and Salzemann, Jean and Da Costa, Ana and Breton, Vincent and Kasam, Vinod and Hofmann-Apitius, Martin and Snelling, David and van den Berghe, Sven and Li, Vivian and Brewer, Steve and Dunlop, Alistair and De Silva, Nishadi :
Improving e-Science with Interoperability of the e-Infrastructures EGEE and DEISA.
In: Proc. of the 31st International Convention MIPRO, Conference on Grid and Visualization Systems (GVS), Opatija, Croatia. Croatian Society for Information and Communication Technology, Electronics and Microelectronics
[Conference or Workshop Item] , (2008)

Riedel, Morris and Eickermann, Thomas and Habbinga, S. and Frings, Wolfgang and Gibbon, Paul and Mallmann, Daniel and Streit, Achim and Lippert, Thomas and Wolf, Felix and Schiffmann, W. and Ernst, A. and Spurzem, R. and Nagel, W. E. :
Computational Steering and Online Visualization of Scientific Applications on Large-Scale HPC Systems within e-Science Infrastructures.
[Online-Edition: http://portal.acm.org/citation.cfm?id=1332478.1333527&coll=p...]
In: Proc. of 3rd IEEE International Conference on e-Science and Grid Computing, Bangalore, India. IEEE Computer Society
[Conference or Workshop Item] , (2007)

Riedel, Morris and Eickermann, Thomas and Frings, Wolfgang and Dominiczak, Sonja and Mallmann, Daniel and Düssel, Thomas and Streit, Achim and Gibbon, Paul and Wolf, Felix and Schiffmann, Wolfram and Lippert, Thomas :
Design and Evaluation of a Collaborative Online Visualization and Steering Framework Implementation for Computational Grids.
In: Proc. of the 8th IEEE/ACM International Conference on Grid Computing (Grid 2007), Austin, Texas, USA.
[Conference or Workshop Item] , (2007)

Riedel, Morris and Frings, Wolfgang and Dominiczak, Sonja and Eickermann, Thomas and Düssel, Thomas and Gibbon, Paul and Mallmann, Daniel and Wolf, Felix and Schiffmann, Wolfram :
Requirements and Design of a Collaborative Online Visualization and Steering Framework for Grid and e-Science Infrastructures.
In: Proc. of the German e-Science Conference, Baden-Baden, Germany. Max Planck Digital Library - ID 316630.0
[Conference or Workshop Item] , (2007)

S

Shudler, Sergei and Vrabec, Jadran and Wolf, Felix :
Understanding the Scalability of Molecular Simulation using Empirical Performance Modeling.
In: Proc. of the 7th Workshop on Extreme Scale Programming Tools (ESPT), held in conjunction with the Supercomputing Conference (SC18), Dallas, TX, USA,.
[Conference or Workshop Item] , (2018)

Shudler, Sergei :
Scalability Engineering for Parallel Programs Using Empirical Performance Models.
[Online-Edition: http://tuprints.ulb.tu-darmstadt.de/7471]
Technische Universität , Darmstadt
[Ph.D. Thesis], (2018)

Shudler, Sergei and Calotoiu, Alexandru and Hoefler, Torsten and Wolf, Felix :
Isoefficiency in Practice: Configuring and Understanding the Performance of Task-based Applications.
In: Proc. of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), Austin, TX, USA. ACM
[Conference or Workshop Item] , (2017)

Shudler, Sergei and Calotoiu, Alexandru and Hoefler, Torsten and Strube, Alexandre and Wolf, Felix :
Exascaling Your Library: Will Your Implementation Meet Your Expectations?
In: Proc. of the International Conference on Supercomputing (ICS), Newport Beach, CA, USA. ACM
[Conference or Workshop Item] , (2015)

Schimmel, Jochen and Molitorisz, Korbinian and Jannesari, Ali and Tichy, Walter F. :
Combining Unit Tests for Data Race Detection.
[Online-Edition: http://dl.acm.org/citation.cfm?id=2819261.2819275]
In: Proc. of 10th IEEE/ACM International Workshop on Automation of Software Test (AST 2015), Florence, Italy. IEEE
[Conference or Workshop Item] , (2015)

Shah, Aamer and Wolf, Felix and Zhumatiy, Sergey and Voevodin, Vladimir :
Capturing inter-application interference on clusters.
In: Proc. of the IEEE International Conference on Cluster Computing (CLUSTER), Indianapolis, IN, USA. IEEE Computer Society
[Conference or Workshop Item] , (2013)

Schimmel, Jochen and Molitorisz, Korbinian and Jannesari, Ali and Tichy, Walter F. :
Automatic Generation of Parallel Unit Tests.
In: Proc. of the 8th International Workshop on Automation of Software Test (AST), San Francisco, CA, USA. ACM
[Conference or Workshop Item] , (2013)

Schmidl, Dirk and Philippen, Peter and Lorenz, Daniel and Rössel, Christian and Geimer, Markus and an Me, Dieter and Mohr, Bernd and Wolf, Felix :
Performance Analysis Techniques for Task-Based OpenMP Applications.
[Online-Edition: http://dx.doi.org/10.1007/978-3-642-30961-8_15]
In: Proc. of the 8th International Workshop on OpenMP (IWOMP), Rome, Italy, Berlin / Heidelberg. In: Lecture Notes in Computer Science , 7312 . Springer , Berlin / Heidelberg
[Conference or Workshop Item] , (2012)

Siebert, Christian and Wolf, Felix
Cotronis, Yiannis and Danalis, Anthony and Nikolopoulos, Dimitrios and Dongarra, Jack (eds.) :

Parallel Sorting with Minimal Data.
In: Proc. of the 18th European MPI Users' Group Meeting (EuroMPI), Santorini, Greece. In: Lecture Notes in Computer Science , 6960 . Springer
[Conference or Workshop Item] , (2011)

Szebenyi, Zoltán and Gamblin, Todd and Schulz, Martin and de Supinski, Bronis R. and Wolf, Felix and Wylie, Brian J. N. :
Reconciling Sampling and Direct Instrumentation for Unintrusive Call-Path Profiling of MPI Programs.
In: Proc. of the 25th IEEE International Parallel & Distributed Processing Symposium (IPDPS), Anchorage, AK, USA. IEEE Computer Society
[Conference or Workshop Item] , (2011)

Szebenyi, Zoltán and Wolf, Felix and Wylie, Brian J. N. :
Performance Analysis of Long-running Applications.
In: Proc. of the 25th IEEE Int'l Parallel & Distributed Processing Symposium (IPDPS) PhD Forum, Anchorage, AK, USA. IEEE Computer Society
[Conference or Workshop Item] , (2011)

Szebenyi, Zoltán and Wolf, Felix and Wylie, Brian J. N. :
Space-Efficient Time-Series Call-Path Profiling of Parallel Applications.
[Online-Edition: http://dl.acm.org/ft_gateway.cfm?id=1654097&type=pdf&coll=DL...]
In: Proc. of the ACM/IEEE Conference on Supercomputing (SC09), Portland, OR, USA. ACM
[Conference or Workshop Item] , (2009)

Szebenyi, Zoltán and Wylie, Brian J. N. and Wolf, Felix :
Scalasca Parallel Performance Analyses of PEPC.
In: Proc. of the 1st Workshop on Productivity and Performance (PROPER) in conjunction with Euro-Par 2008, Las Palmas de Gran Canaria, Spain. In: Lecture Notes in Computer Science (ISSN 0302-9743) , 5415 . Springer
[Conference or Workshop Item] , (2009)

Szebenyi, Zoltán and Wylie, Brian J. N. and Wolf, Felix :
SCALASCA Parallel Performance Analyses of SPEC MPI2007 Applications.
In: Proc. of the 1st SPEC International Performance Evaluation Workshop (SIPEW), Darmstadt, Germany. In: Lecture Notes in Computer Science , 5119 . Springer
[Conference or Workshop Item] , (2008)

Shende, Sameer S. and Malony, Allen D. and Morrison, Alan and Wolf, Felix :
Performance Profiling Overhead Compensation for MPI Programs.
In: Proc. of the 12th European PVM/MPI Users' Group Meeting (EuroPVM/MPI), Sorrento, Italy. In: Lecture Notes in Computer Science , 3666 . Springer
[Conference or Workshop Item] , (2005)

Song, Fengguang and Wolf, Felix and Bhatia, Nikhil and Dongarra, Jack and Moore, Shirley :
An Algebra for Cross-Experiment Performance Analysis.
In: Proc. of the International Conference on Parallel Processing (ICPP), Montreal, Canada. IEEE Society
[Conference or Workshop Item] , (2004)

Song, Fengguang and Wolf, Felix :
CUBE User Manual.

[Report] , (2004)

T

Theisen, Lucas and Shah, Aamer and Wolf, Felix :
Down to Earth — How to Visualize Traffic on High-dimensional Torus Networks.
In: Proc. of VPA: First workshop on Visual Performance Analysis, held in conjunction with Supercomputer 2014, New Orleans, LA.
[Conference or Workshop Item] , (2014)

U

Ul-Huda, Zia and Jannesari, Ali and Wolf, Felix :
Using Template Matching to Infer Parallel Design Patterns.
In: ACM Transactions on Architecture and Code Optimization, 11 (4) 64:1-64:21. ISSN 1544-3566
[Article] , (2015)

V

Vogel, Andreas and Calotoiu, Alexandru and Nagel, Arne and Reiter, Sebastian and Strube, Alexandre and Wittum, Gabriel and Wolf, Felix :
Software for Exascale Computing - SPPEXA 2013-2015.
In: Software for Exascale Computing - SPPEXA 2013-2015. Lecture Notes in Computational Science and Engineering, 113. Springer International Publishing , pp. 467-481. ISBN 978-3-319-40528-5
[Book Section] , (2016)

Vogel, Andreas and Calotoiu, Alexandru and Strube, Alexandre and Reiter, Sebastian and Nägel, Arne and Wolf, Felix and Wittum, Gabriel :
10,000 Performance Models per Minute - Scalability of the UG4 Simulation Framework.
In: Proc. of the 21st Euro-Par Conference, Vienna, Austria. In: Lecture Notes in Computer Science , 9233 . Springer
[Conference or Workshop Item] , (2015)

von Rüden, Laura and Hermanns, Marc-André and Behrisch, Michael and Keim, Daniel and Mohr, Bernd and Wolf, Felix :
Separating the Wheat from the Chaff: Identifying Relevant and Similar Performance Data with Visual Analytics.
In: Proceedings of the 2nd Workshop on Visual Performance Analysis. ACM Austin, TX, USA , 4:1-4:8. ISBN 978-1-4503-4013-7
[Book Section] , (2015)

W

Wolf, Felix and Bischof, Christian and Calotoiu, Alexandru and Hoefler, Torsten and Iwainsky, Christian and Kwasniewski, Grzegorz and Mohr, Bernd and Shudler, Sergei and Strube, Alexandre and Vogel, Andreas and Wittum, Gabriel :
Software for Exascale Computing - SPPEXA 2013-2015.
In: Software for Exascale Computing - SPPEXA 2013-2015. Lecture Notes in Computational Science and Engineering, 113. Springer International Publishing , pp. 445-465. ISBN 978-3-319-40528-5
[Book Section] , (2016)

Wolf, Felix and Bischof, Christian and Hoefler, Torsten and Mohr, Bernd and Wittum, Gabriel and Calotoiu, Alexandru and Iwainsky, Christian and Strube, Alexandre and Vogel, Andreas
Lopez, Luis (ed.) :

Catwalk: A Quick Development Path for Performance Models.
In: Euro-Par 2014: Parallel Processing Workshops. Lecture Notes in Computer Science (8806). Springer International Publishing, Cham , pp. 589-600. ISBN 978-3-319-14312-5
[Book Section] , (2014)

Wolf, Felix and Mohr, Bernd:
Euro-Par 2013: Parallel Processing.
Lecture Notes in Computer Science, Advanced Research in Computing and Software Science, 8097. Springer Berlin Heidelberg
[Book] , (2013)

Wolf, Felix :
Scalasca.
In: Encyclopedia of Parallel Computing. Springer , pp. 1775-1785.
[Book Section] , (2011)

Wolf, Felix :
Understanding the Formation of Wait States in Parallel Programs.
[Online-Edition: http://inside.hlrs.de/htm/Edition_01_11/article_23.html]
In: Innovatives Supercomputing in Deutschland (inSiDE), 1 (9) pp. 94-95.
[Article] , (2011)

Wylie, Brian J. N. and Geimer, Markus and Mohr, Bernd and Böhme, David and Wolf, Felix and Szebenyi, Zoltán :
Large-scale performance analysis of Sweep3D with the Scalasca toolset.
In: Parallel Processing Letters, 20 (4) pp. 397-414.
[Article] , (2010)

Wylie, Brian J. N. and Böhme, David and Wolf, Felix and Frings, Wolfgang and Geimer, Markus and Mohr, Bernd and Szebenyi, Zoltán and Becker, Daniel and Hermanns, Marc-André :
Scalable performance analysis of large-scale parallel applications on Cray XT systems with Scalasca.
[Online-Edition: https://cug.org/5-publications/proceedings_attendee_lists/CU...]
In: Proc. 52nd Cray User Group Meeting, Edinburgh, Scotland. Cray User Group Incorporated
[Conference or Workshop Item] , (2010)

Wylie, Brian J. N. and Böhme, David and Wolf, Felix and Szebenyi, Zoltán and Mohr, Bernd :
Performance analysis of Sweep3D on Blue Gene/P with the Scalasca toolset.
In: Proc. 24th Int'l Parallel & Distributed Processing Symposium and Workshops (IPDPS), Atlanta, GA, USA. IEEE Computer Society
[Conference or Workshop Item] , (2010)

Wolf, Felix and Böhme, David and Geimer, Markus and Hermanns, Marc-André and Mohr, Bernd and Szebenyi, Zoltán and Wylie, Brian J. N.
Münster, Gernot and Wolf, Dietrich and Kremer, Manfred (eds.) :

Performance Tuning in the Petascale Era.
In: Proc. of the John von Neumann Institute for Computing (NIC) Symposium 2010, Jülich, Germany. In: IAS Series , 3 . John von Neumann-Institut for Computing
[Conference or Workshop Item] , (2010)

Wolf, Felix :
Performance Tools for Petascale Systems.
In: Innovatives Supercomputing in Deutschland (inSiDE), 7 (2) pp. 38-39.
[Article] , (2009)

Wolf, Felix and Becker, Daniel and Geimer, Markus and Wylie, Brian J. N. :
Scalable Performance Analysis Methods for the Next Generation of Supercomputers.
In: Proc. of the John von Neumann Institute for Computing (NIC) Symposium, Jülich, Germany. In: NIC-Series , 39 .
[Conference or Workshop Item] , (2008)

Wolf, Felix and Wylie, Brian J. N. and Ábrahám, Erika and Becker, Daniel and Frings, Wolfgang and Fürlinger, Karl and Geimer, Markus and Hermanns, Marc-André and Mohr, Bernd and Moore, Shirley and Pfeifer, Matthias and Szebenyi, Zoltán :
Usage of the SCALASCA Toolset for Scalable Performance Analysis of Large-Scale Parallel Applications.
In: Tools for High Performance Computing, Proc. of the 2nd Parallel Tools Workshop, Stuttgart, Germany, July 2008. Springer , pp. 157-167. ISBN ISBN 978-3-540-68561-6
[Book Section] , (2008)

Wylie, Brian J. N. and Geimer, Markus and Wolf, Felix :
Performance measurement and analysis of large-scale parallel applications on leadership computing systems.
In: Scientific Programming, 16 (2-3) pp. 167-181. ISSN 1058-9244
[Article] , (2008)

Wolf, Felix and Mohr, Bernd and Dongarra, Jack and Moore, Shirley :
Automatic analysis of inefficiency patterns in parallel applications.
In: Concurrency and Computation: Practice and Experience, 19 (11) pp. 1481-1496.
[Article] , (2007)

Wylie, Brian J. N. and Wolf, Felix and Mohr, Bernd and Geimer, Markus :
Integrated Runtime Measurement Summarisation and Selective Event Tracing for Scalable Parallel Execution Performance Diagnosis.
In: Proc. of the 8th International Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA), Umeå, Sweden, June 2006. In: Lecture Notes in Computer Science , 4699 . Springer
[Conference or Workshop Item] , (2007)

Wolf, Felix and Freitag, Felix and Mohr, Bernd and Moore, Shirley and Wylie, Brian J. N. :
Large Event Traces in Parallel Performance Analysis.
In: Proc. of the 8th Workshop on Parallel Systems and Algorithms (PASA), Frankfurt, Germany. In: Lecture Notes in Informatics , P-81 . Gesellschaft f\"r Informatik
[Conference or Workshop Item] , (2006)

Wylie, Brian J. N. and Mohr, Bernd and Wolf, Felix :
Holistic Hardware Counter Performance Analysis of Parallel Programs.
In: Proc. of the Conference on Parallel Computing (ParCo), Malaga, Spain.
[Conference or Workshop Item] , (2005)

Wolf, Felix and Malony, Allen D. and Shende, Sameer S. and Morris, Alan :
Trace-Based Parallel Performance Overhead Compensation.
In: Proc. of the International Conference on High Performance Computing and Communications (HPCC), Sorrento, Italy. In: Lecture Notes in Computer Science , 3726 . Springer
[Conference or Workshop Item] , (2005)

Worley, P. and Candy, J. and Carrington, L. and Huck, K. and Kaiser, T. and Mahinthakumar, G. and Malony, Allen D. and Moore, Shirley and Reed, D. and Roth, P. and Shan, H. and Shende, Sameer S. and Snavely, A. and Sreepathi, S. and Wolf, Felix and Zhang, Y. :
Performance Analysis of GYRO: A Tool Evaluation.
In: Proc. of the 2005 SciDAC Conference, San Francisco, CA, USA.
[Conference or Workshop Item] , (2005)

Wolf, Felix :
EARL - API Documentation.

[Report] , (2004)

Wolf, Felix and Mohr, Bernd and Dongarra, Jack and Moore, Shirley :
Efficient Pattern Search in Large Traces through Successive Refinement.
In: Proc. of the 10th Euro-Par Conference, Pisa, Italy. In: Lecture Notes in Computer Science , 3149 . Springer
[Conference or Workshop Item] , (2004)

Wolf, Felix and Mohr, Bernd :
Hardware-Counter Based Automatic Performance Analysis of Parallel Programs.
In: Proc. of the Conference on Parallel Computing (ParCo), Dresden, Germany.
[Conference or Workshop Item] , (2003)

Wolf, Felix and Mohr, Bernd :
KOJAK - A Tool Set for Automatic Performance Analysis of Parallel Applications.
In: Proc. of the 9th Euro-Par Conference, Klagenfurt, Austria. In: Lecture Notes in Computer Science , 2790 . Springer
[Conference or Workshop Item] , (2003)

Wolf, Felix :
Automatic Performance Analysis on Parallel Computers with SMP Nodes.
[Online-Edition: http://hdl.handle.net/2128/2928]
RWTH Aachen , Forschungszentrum Jülich
[Ph.D. Thesis]

Wolf, Felix and Mohr, Bernd :
Automatic Performance Analysis of Hybrid MPI/OpenMP Applications.
In: Proc. of 11th Euromicro Workshop on Parallel Distributed and Network-Based Processing (PDP), Genua, Italy. IEEE Computer Society
[Conference or Workshop Item] , (2003)

Wolf, Felix and Mohr, Bernd :
Automatic performance analysis of hybrid MPI/OpenMP applications.
In: Journal of Systems Architecture, 49 (10-11) pp. 421-439.
[Article] , (2003)

Wolf, Felix and Mohr, Bernd :
Specifying Performance Properties of Parallel Applications Using Compound Events.
In: Parallel and Distributed Computing Practices, 4 (3) pp. 301-317.
[Article] , (2001)

Wolf, Felix and Mohr, Bernd :
Automatic Performance Analysis of MPI Applications Based on Event Traces.
In: Proc. of the 6th Euro-Par Conference, Munich, Germany. In: Lecture Notes in Computer Science , 1900 . Springer
[Conference or Workshop Item] , (2000)

Wolf, Felix and Mohr, Bernd :
EARL - A Programmable and Extensible Toolkit for Analyzing Event Traces of Message Passing Programs.
In: Proc. of the 7th International Conference on High Performance Computing and Networking Europe (HPCN), Amsterdam, The Netherlands. In: Lecture Notes in Computer Science , 1593 . Springer
[Conference or Workshop Item] , (1999)

Wolf, Felix :
EARL - Eine programmierbare Umgebung zur Bewertung paralleler Prozesse auf Message-Passing-Systemen.
RWTH Aachen , Forschungszentrum Jülich, Jül-Bericht 3551
[Master Thesis] , (1998)

X

Xiao, Yang and Jeyakumaran, Thireshan and Atoofian, Ehsan and Jannesari, Ali :
Improving Performance of Transactional Memory through Machine Learning.
In: Concurrency and Computation: Practice and Experience pp. 1-24.
[Article] , (2017)

Xiao, Yang and Li, Zhen and Atoofian, Ehsan and Jannesari, Ali :
Automatic Optimization of Software Transactional Memory through Linear Regression and Decision Tree.
In: Proc. of 15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), Zhangjiajie, China. In: Lecture Notes in Computer Science , 9531 . Springer International Publishing
[Conference or Workshop Item] , (2015)

Z

Zhao, Bo and Li, Zhen and Jannesari, Ali and Wolf, Felix and Wu, Weiguo :
Dependence-Based Code Transformation for Coarse-Grained Parallelism.
In: Proc. of the International Workshop on Code Optimisation for Multi and Many Cores, San Francisco, CA, USA. ACM
[Conference or Workshop Item] , (2015)

This list was generated on Sun May 26 01:53:37 2019 CEST.