Abstract
This paper presents the design, implementation, and application of ParaProf, a portable, extensible, and scalable tool for parallel performance profile analysis. ParaProf attempts to offer “best of breed” capabilities to performance analysts: those inherited from a rich history of single-processor profilers and those being pioneered in parallel tools research. We present ParaProf as a parallel profile analysis framework that can be retargeted and extended as required. ParaProf’s design and operation are discussed, and its novel support for large-scale parallel analysis is demonstrated with a 512-processor application profile generated using the TAU performance system.
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bell, R., Malony, A.D., Shende, S. (2003). ParaProf: A Portable, Extensible, and Scalable Tool for Parallel Performance Profile Analysis. In: Kosch, H., Böszörményi, L., Hellwagner, H. (eds) Euro-Par 2003 Parallel Processing. Euro-Par 2003. Lecture Notes in Computer Science, vol 2790. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45209-6_7
DOI: https://doi.org/10.1007/978-3-540-45209-6_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40788-1
Online ISBN: 978-3-540-45209-6
eBook Packages: Springer Book Archive