{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,10,22]],"date-time":"2024-10-22T19:03:42Z","timestamp":1729623822769,"version":"3.28.0"},"publisher-location":"New York, NY, USA","reference-count":36,"publisher":"ACM","funder":[{"name":"Spanish Ministry of Economy Industry and Competitiveness","award":["RYC-2017-23269"]},{"name":"Spanish Ministry of Science and Technology","award":["ID2019- 107255GB"]},{"name":"EXCELLERAT project funded by the European Commission's ICT activity of the H2020 Programme","award":["823691"]},{"name":"Generalitat de Catalunya","award":["2017-SGR-1414"]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,6,21]]},"DOI":"10.1145\/3431379.3460642","type":"proceedings-article","created":{"date-parts":[[2021,6,17]],"date-time":"2021-06-17T04:09:26Z","timestamp":1623902966000},"page":"81-93","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Cache-aware Sparse Patterns for the Factorized Sparse Approximate Inverse Preconditioner"],"prefix":"10.1145","author":[{"given":"Sergi","family":"Laut","sequence":"first","affiliation":[{"name":"Barcelona Supercomputing Center & Universitat Polit\u00e8cnica de Catalunya, Barcelona, Spain"}]},{"given":"Ricard","family":"Borrell","sequence":"additional","affiliation":[{"name":"Barcelona Supercomputing Center, Barcelona, Spain"}]},{"given":"Marc","family":"Casas","sequence":"additional","affiliation":[{"name":"Barcelona Supercomputing Center & Universitat Polit\u00e8cnica de Catalunya, Barcelona, Spain"}]}],"member":"320","published-online":{"date-parts":[[2021,6,21]]},"reference":[{"volume-title":"Understanding HPC Benchmark Performance on Intel Broadwell and Cascade Lake Processors","author":"Alappat Christie L.","key":"e_1_3_2_1_1_1","unstructured":"Christie L. 
Alappat, Johannes Hofmann, Georg Hager, Holger Fehske, Alan R. Bishop, and Gerhard Wellein. 2020. Understanding HPC Benchmark Performance on Intel Broadwell and Cascade Lake Processors. In High Performance Computing, Ponnuswamy Sadayappan, Bradford L. Chamberlain, Guido Juckeland, and Hatem Ltaief (Eds.). Springer International Publishing, Cham, 412--433."},{"volume-title":"Sparse approximate inverse preconditioning for dense linear systems arising in computational electro-magnetics. Numerical Algorithms 16 (02","year":"1997","author":"All\u00e9on Guillaume","key":"e_1_3_2_1_2_1","unstructured":"Guillaume All\u00e9on, Michele Benzi, and Luc Giraud. 1997. Sparse approximate inverse preconditioning for dense linear systems arising in computational electro-magnetics. Numerical Algorithms 16 (02 1997), 1--15. https:\/\/doi.org\/10.1023\/A:1019170609950"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Hartwig Anzt, Edmond Chow, Thomas Huckle, and Jack Dongarra. 2016. Batched Generation of Incomplete Sparse Approximate Inverses on GPUs. 49--56. 
https:\/\/doi.org\/10.1109\/ScalA.2016.011","DOI":"10.1109\/ScalA.2016.011"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1137\/S1064827594271421"},{"volume-title":"A numerical experimental study of inverse preconditioning for the parallel iterative solution to 3D finite element flow equations. J. Comput. Appl. Math. 210 (12","year":"2007","author":"Bergamaschi Luca","key":"e_1_3_2_1_5_1","unstructured":"Luca Bergamaschi, Giuseppe Gambolati, and Giorgio Pini. 2007. A numerical experimental study of inverse preconditioning for the parallel iterative solution to 3D finite element flow equations. J. Comput. Appl. Math. 210 (12 2007), 64--70. https:\/\/doi.org\/10.1016\/j.cam.2006.10.056"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","first-page":"1694","DOI":"10.1016\/j.amc.2005.09.015","article-title":"Parallel preconditioned conjugate gradient optimization of the Rayleigh quotient for the solution of sparse eigenproblems","volume":"175","author":"Bergamaschi Luca","year":"2006","unstructured":"Luca Bergamaschi, \u00c1ngeles Mart\u00ednez, and Giorgio Pini. 2006. Parallel preconditioned conjugate gradient optimization of the Rayleigh quotient for the solution of sparse eigenproblems. Appl. Math. Comput. 175, 2 (2006), 1694--1715. https:\/\/doi.org\/10.1016\/j.amc.2005.09.015","journal-title":"Appl. Math. 
Comput."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1137\/15M1027826"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1137\/18M1197461"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.camwa.2015.12.008"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1137\/S106482759833913X"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1177\/109434200101500106"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1137\/S1064827594270415"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2049662.2049663"},{"volume-title":"Current Developments, and Future Perspectives. ISRN Applied Mathematics 2012 (12","year":"2012","author":"Ferronato Massimiliano","key":"e_1_3_2_1_14_1","unstructured":"Massimiliano Ferronato. 2012. Preconditioning for Sparse Linear Systems at the Dawn of the 21st Century: History, Current Developments, and Future Perspectives. ISRN Applied Mathematics 2012 (12 2012). https:\/\/doi.org\/10.5402\/2012\/127647"},{"volume-title":"Carlo Janna, and Giorgio Pini","year":"2012","author":"Ferronato Massimiliano","key":"e_1_3_2_1_15_1","unstructured":"Massimiliano Ferronato, Carlo Janna, and Giorgio Pini. 2012. Shifted FSAI preconditioners for the efficient parallel solution of non-linear groundwater flow models. Internat. J. Numer. Methods Engrg. 89 (03 2012), 1707--1719. 
https:\/\/doi.org\/10.1002\/nme.3309"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1137\/S0895479887139455"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1137\/S1064827594276552"},{"volume-title":"Multi-Grid Methods and Applications","author":"Hackbusch Wolfgang","key":"e_1_3_2_1_18_1","unstructured":"Wolfgang Hackbusch. 1985. Multi-Grid Methods and Applications. Vol. 4. https:\/\/doi.org\/10.1007\/978-3-662-02427-0"},{"volume-title":"An efficient sparse approximate inverse preconditioning algorithm on GPU. Concurrency and Computation: Practice and Experience 32 (12","year":"2019","author":"He Guixia","key":"e_1_3_2_1_19_1","unstructured":"Guixia He, Renjie Yin, and Jiaquan Gao. 2019. An efficient sparse approximate inverse preconditioning algorithm on GPU. Concurrency and Computation: Practice and Experience 32 (12 2019). https:\/\/doi.org\/10.1002\/cpe.5598"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0168-9274(98)00117-2"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1023988426844"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1137\/100810368"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1137\/090779760"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/2629475"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1002\/nla.2080"},{"volume-title":"A Power Sparse Approximate Inverse Preconditioning Procedure for Large Sparse Linear Systems. 
Numerical Linear Algebra with Applications 16 (04","year":"2009","author":"Jia Zhongxiao","key":"e_1_3_2_1_26_1","unstructured":"Zhongxiao Jia and Baochen Zhu. 2009. A Power Sparse Approximate Inverse Preconditioning Procedure for Large Sparse Linear Systems. Numerical Linear Algebra with Applications 16 (04 2009), 259--299. https:\/\/doi.org\/10.1002\/nla.614"},{"volume-title":"Factorized Sparse Approximate Inverse Preconditionings. IV: Simple Approaches to Rising Efficiency. Numerical Linear Algebra With Applications - NUMER LINEAR ALGEBR APPL 6 (10","year":"1999","author":"Kolotilina Liliya Yu.","key":"e_1_3_2_1_27_1","unstructured":"Liliya Yu. Kolotilina, Andy A. Nikishin, and Alex Yu. Yeremin. 1999. Factorized Sparse Approximate Inverse Preconditionings. IV: Simple Approaches to Rising Efficiency. Numerical Linear Algebra With Applications - NUMER LINEAR ALGEBR APPL 6 (10 1999), 515--531. https:\/\/doi.org\/10.1002\/(SICI)1099-1506(199910\/11)6: 73.0.CO;2-0"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1137\/0614004"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.advengsoft.2015.01.006"},{"volume-title":"Parallel Computational Fluid Dynamics","year":"1993","author":"Mark","key":"e_1_3_2_1_30_1","unstructured":"Mark [D. Kremenetsky], John Richardson, and Horst [D. Simon]. 1995. - Parallel preconditioning for CFD problems on the CM-5. 
In Parallel Computational Fluid Dynamics 1993, A. Ecer, J. Hauser, P. Leca, and J. Periaux (Eds.). North-Holland, Amsterdam, 401--410. https:\/\/doi.org\/10.1016\/B978-044481999--4\/50173-0"},{"key":"e_1_3_2_1_31_1","unstructured":"I.B. Labutin and I.V. Surodina. 2013. Algorithm for sparse approximate inverse preconditioners in the conjugate gradient method. 19 (01 2013) 120--126."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.compfluid.2013.10.035"},{"volume-title":"Proceedings of the international workshop on solution techniques for large-scale CFD problems (02","year":"2002","author":"Saad Yousef","key":"e_1_3_2_1_33_1","unstructured":"Yousef Saad. 2002. Preconditioned Krylov Subspace Methods for CFD Applications. Proceedings of the international workshop on solution techniques for large-scale CFD problems (02 2002)."},{"edition":"2","volume-title":"Iterative Methods for Sparse Linear Systems","author":"Saad Yousef","key":"e_1_3_2_1_34_1","unstructured":"Yousef Saad. 2003. Iterative Methods for Sparse Linear Systems (2nd ed.). 
Society for Industrial and Applied Mathematics, USA."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.finel.2010.11.005"},{"volume-title":"Smith","year":"2018","author":"Zhang Hong","key":"e_1_3_2_1_36_1","unstructured":"Hong Zhang, Richard T. Mills, Karl Rupp, and Barry F. Smith. 2018. Vectorized Parallel Sparse Matrix-Vector Multiplication in PETSc Using AVX-512 (ICPP 2018). Association for Computing Machinery, New York, NY, USA, Article 55, 10 pages. https:\/\/doi.org\/10.1145\/3225058.3225100"}],"event":{"name":"HPDC '21: The 30th International Symposium on High-Performance Parallel and Distributed Computing","sponsor":["University of Arizona University of Arizona","SIGHPC ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing","SIGARCH ACM Special Interest Group on Computer Architecture"],"location":"Virtual Event Sweden","acronym":"HPDC '21"},"container-title":["Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed 
Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3431379.3460642","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,5]],"date-time":"2023-11-05T00:04:27Z","timestamp":1699142667000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3431379.3460642"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,21]]},"references-count":36,"alternative-id":["10.1145\/3431379.3460642","10.1145\/3431379"],"URL":"https:\/\/doi.org\/10.1145\/3431379.3460642","relation":{},"subject":[],"published":{"date-parts":[[2021,6,21]]},"assertion":[{"value":"2021-06-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}