Abstract
This paper addresses the problem of partitioning the nonzeros of sparse nonsymmetric and nonsquare matrices in order to efficiently compute parallel matrix-vector and matrix-transpose-vector multiplies. Our goal is to balance the work per processor while keeping communications costs low. Although the symmetric partitioning problem has been well-studied, the nonsymmetric and rectangular cases have received scant attention. We show that this problem can be described as a partitioning problem on a bipartite graph. We then describe how to use (modified) multilevel methods to partition these graphs and how to implement the matrix multiplies in parallel to take advantage of the partitioning. Finally, we compare various multilevel and other partitioning strategies on matrices from different applications. The multilevel methods are shown to be best.
This work was supported by the Applied Mathematical Sciences Research Program, Office of Energy Research, U.S. Department of Energy, under contracts DE-AC05-96OR22464 and DE-AL04-94AL85000 with Lockheed Martin Energy Research Corporation.
Preview
Unable to display preview. Download preview PDF.
References
Stephen T. Barnard and Horst D. Simon. A fast multilevel implementation of recursive spectral bisection for partitioning unstructured problems. Concurrency: Practice and Experience, 6:101–117, 1994.
Michael W. Berry, Bruce Hendrickson, and Padma Raghavan. Sparse matrix reordering schemes for browsing hypertext. In James Renegar, Michael Shub, and Steve Smale, editors, The Mathematics of Numerical Analysis, volume 32 of Lectures in Applied Mathematics, pages 99–122. American Mathematical Society, 1996.
Roland W. Freund and Noël M. Nachtigal. QMR: A quasi-minimal residual method for non-Hermitian linear systems. Numer. Math., 60:315–339, 1991.
Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman and Company, New York, 1979.
Markus Hegland. Description and use of animal breeding data for large least squares problems. Technical Report TR-PA-93-50, CERFACS, Toulouse, France, 1993.
Bruce Hendrickson and Tamara G. Kolda. Partitioning nonsquare and nonsymmetric matrices for parallel processing. Technical Memorandum TM-13657, Oak Ridge National Laboratory, Oak Ridge, TN 37831, 1998. Submitted to SIAM J. Scientific Computing.
Bruce Hendrickson and Robert Leland. The Chaco user’s guide, version 2.0. Technical Report SAND95-2344, Sandia Natl. Lab., Albuquerque, NM, 87185, 1995.
Bruce Hendrickson and Robert Leland. A multilevel algorithm for partitioning graphs. In Proc. Supercomputing ’95. ACM, 1995.
A. Hofer. Schätzung von Zuchtwerten feldgeprüfter Schweine mit einem Mehrmerkmals-Tiermodell. PhD thesis, ETH-Zurich, 1990. Cited in Markus Hegland. Description and use of animal breeding data for large least squares problems. Technical Report TR-PA-93-50, CERFACS, Toulouse, France, 1993.
George Karypis and Vipin Kumar. A fast and high quality multilevel scheme for partitioning irregular graphs. Technical Report 95-035, Dept. Computer Science, Univ. Minnesota, Minneapolis, MN 55455, 1995.
George Karypis and Vipin Kumar. Parallel multilevel graph partitioning. Technical Report 95-036, Dept. Computer Science, Univ. Minnesota, Minneapolis, MN 55455, 1995.
B. W. Kernighan and S. Lin. An efficient heuristic procedure for partitioning graphs. Bell System Technical J., 1970.
Tamara G. Kolda. Partitioning sparse rectangular matrices for parallel processing. In Proc. 5th Intl. Symposium on Solving Irregularly Structured Problems in Parallel (Irregular ’98), to appear.
Christopher C. Paige and Michael A. Saunders. LSQR: An algorithm for sparse linear equations and sparse least squares. ACM Trans. Mathematical Software, 8:43–71, 1982.
Weichung Wang and Dianne P. O’Leary. Adaptive use of iterative methods in interior point methods for linear programming. Technical Report CS-TR-3560, Dept. Computer Science, Univ. Maryland, College Park, MD 20742, 1995.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hendrickson, B., Kolda, T.G. (1998). Partitioning sparse rectangular matrices for parallel computations of Ax and A T v . In: Kågström, B., Dongarra, J., Elmroth, E., Waśniewski, J. (eds) Applied Parallel Computing Large Scale Scientific and Industrial Problems. PARA 1998. Lecture Notes in Computer Science, vol 1541. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0095342
Download citation
DOI: https://doi.org/10.1007/BFb0095342
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65414-8
Online ISBN: 978-3-540-49261-0
eBook Packages: Springer Book Archive