{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,8,31]],"date-time":"2023-08-31T04:43:57Z","timestamp":1693457037795},"reference-count":31,"publisher":"Wiley","issue":"4","license":[{"start":{"date-parts":[[2021,3,10]],"date-time":"2021-03-10T00:00:00Z","timestamp":1615334400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100002347","name":"Bundesministerium f\u00fcr Bildung und Forschung","doi-asserted-by":"publisher","award":["01EC1408B","01IH16005"],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100002418","name":"Intel Corporation","doi-asserted-by":"publisher","award":["IPCC"],"id":[{"id":"10.13039\/100002418","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Numerical Linear Algebra App"],"published-print":{"date-parts":[[2021,8]]},"abstract":"Abstract<\/jats:title>The growing discrepancy between CPU computing power and memory bandwidth drives more and more numerical algorithms into a bandwidth\u2010bound regime. One example is the overlapping Schwarz smoother, a highly effective building block for iterative multigrid solution of elliptic equations with higher order finite elements. Two options of reducing the required memory bandwidth are sparsity exploiting storage layouts and representing matrix entries with reduced precision in floating point or fixed point format. We investigate the impact of several options on storage demand and contraction rate, both analytically in the context of subspace correction methods and numerically at an example of solid mechanics. Both perspectives agree on the favourite scheme: fixed point representation of Cholesky factors in nested dissection storage.<\/jats:p>","DOI":"10.1002\/nla.2366","type":"journal-article","created":{"date-parts":[[2021,3,10]],"date-time":"2021-03-10T11:08:51Z","timestamp":1615374531000},"update-policy":"http:\/\/dx.doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Impact of mixed precision and storage layout on additive Schwarz smoothers"],"prefix":"10.1002","volume":"28","author":[{"given":"Jakob","family":"Schneck","sequence":"first","affiliation":[{"name":"Numerical Mathematics Zuse Institute Berlin Berlin Germany"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-1071-0044","authenticated-orcid":false,"given":"Martin","family":"Weiser","sequence":"additional","affiliation":[{"name":"Numerical Mathematics Zuse Institute Berlin Berlin Germany"}]},{"given":"Florian","family":"Wende","sequence":"additional","affiliation":[{"name":"Supercomputing Zuse Institute Berlin Berlin Germany"}]}],"member":"311","published-online":{"date-parts":[[2021,3,10]]},"reference":[{"key":"e_1_2_8_2_1","doi-asserted-by":"publisher","DOI":"10.1515\/9783110283112"},{"key":"e_1_2_8_3_1","volume-title":"The finite element method","author":"Zienkiewicz OC","year":"2005"},{"key":"e_1_2_8_4_1","doi-asserted-by":"publisher","DOI":"10.1002\/nla.1979"},{"key":"e_1_2_8_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-39929-4_26"},{"key":"e_1_2_8_6_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF01385709"},{"key":"e_1_2_8_7_1","doi-asserted-by":"publisher","DOI":"10.1093\/imanum\/drl046"},{"key":"e_1_2_8_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-34469-8_89"},{"issue":"12","key":"e_1_2_8_9_1","first-page":"19","article-title":"Memory bandwidth and machine balance in current high performance computers","author":"McCalpin JD","year":"1995","journal-title":"IEEE TCCA Newslett"},{"key":"e_1_2_8_10_1","doi-asserted-by":"publisher","DOI":"10.1017\/S0962492916000076"},{"key":"e_1_2_8_11_1","doi-asserted-by":"publisher","DOI":"10.1007\/s006070050015"},{"key":"e_1_2_8_12_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cpc.2010.05.002"},{"key":"e_1_2_8_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-32820-6_89"},{"key":"e_1_2_8_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cpc.2008.11.005"},{"key":"e_1_2_8_15_1","first-page":"297","volume-title":"Parallel computing is everywhere. vol. 32 of Advances in Parallel Computing","author":"Cherubin S","year":"2018"},{"key":"e_1_2_8_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-75199-1_44"},{"key":"e_1_2_8_17_1","volume-title":"Fast and accurate finite\u2010element multigrid solvers for PDE simulations on GPU clusters","author":"G\u00f6ddeke D","year":"2010"},{"key":"e_1_2_8_18_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.4460"},{"key":"e_1_2_8_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-23321-5_10"},{"key":"e_1_2_8_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2014.2346458"},{"key":"e_1_2_8_21_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9780898718881"},{"key":"e_1_2_8_22_1","doi-asserted-by":"publisher","DOI":"10.1137\/1034116"},{"key":"e_1_2_8_23_1","doi-asserted-by":"publisher","DOI":"10.1090\/S0025-5718-1990-1023042-6"},{"key":"e_1_2_8_24_1","doi-asserted-by":"publisher","DOI":"10.1137\/070706148"},{"key":"e_1_2_8_25_1","volume-title":"Iterative substructuring algorithms for the p\u2010version finite element method for elliptic problems","author":"Bic\u0103 I","year":"1997"},{"key":"e_1_2_8_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/PL00005386"},{"key":"e_1_2_8_27_1","doi-asserted-by":"publisher","DOI":"10.1137\/18M1219370"},{"key":"e_1_2_8_28_1","first-page":"e2306","article-title":"A local Fourier analysis for additive Vanka relaxation for the Stokes equations","author":"Farrell PE","year":"2019","journal-title":"Numer Linear Alg Appl"},{"key":"e_1_2_8_29_1","volume-title":"Computer solution of large sparse positive definite systems","author":"George A","year":"1981"},{"key":"e_1_2_8_30_1","doi-asserted-by":"publisher","DOI":"10.1002\/nme.1620080115"},{"key":"e_1_2_8_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-28589-9_8"},{"key":"e_1_2_8_32_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.camwa.2020.02.011"}],"container-title":["Numerical Linear Algebra with Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/nla.2366","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/full-xml\/10.1002\/nla.2366","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/nla.2366","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,30]],"date-time":"2023-08-30T15:48:29Z","timestamp":1693410509000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/nla.2366"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,10]]},"references-count":31,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2021,8]]}},"alternative-id":["10.1002\/nla.2366"],"URL":"https:\/\/doi.org\/10.1002\/nla.2366","archive":["Portico"],"relation":{},"ISSN":["1070-5325","1099-1506"],"issn-type":[{"value":"1070-5325","type":"print"},{"value":"1099-1506","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,3,10]]},"assertion":[{"value":"2019-02-12","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-01-03","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-03-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}