{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T02:30:17Z","timestamp":1740105017278,"version":"3.37.3"},"reference-count":57,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2013,12,27]],"date-time":"2013-12-27T00:00:00Z","timestamp":1388102400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"funder":[{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy","doi-asserted-by":"publisher","award":["ER26057"],"id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy","doi-asserted-by":"publisher","award":["ER26167"],"id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy","doi-asserted-by":"publisher","award":["ER26098"],"id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy","doi-asserted-by":"publisher","award":["ER26005"],"id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000015","name":"U.S. Department of Energy","doi-asserted-by":"publisher","award":["DE-AC02-06CH11357"],"id":[{"id":"10.13039\/100000015","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000781","name":"European Research Council","doi-asserted-by":"publisher","award":["ERC-2010-AdG Proposal No. 
267374"],"id":[{"id":"10.13039\/501100000781","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002341","name":"Academy of Finland","doi-asserted-by":"crossref","award":["Project 110013 and the Center of Excellence program"],"id":[{"id":"10.13039\/501100002341","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Grupo Consolidado UPV\/EHU del Gobierno Vasco","award":["IT578-13"]},{"name":"Tekes MASI-program"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Concurrency and Computation"],"published-print":{"date-parts":[[2015,1]]},"abstract":"SUMMARY: Density functional theory (DFT) is the most widely employed electronic structure method because of its favorable scaling with system size and accuracy for a broad range of molecular and condensed\u2010phase systems. The advent of massively parallel supercomputers has enhanced the scientific community's ability to study larger system sizes. Ground\u2010state DFT calculations on\u2009\u223c\u200910^3 valence electrons using traditional algorithms can be routinely performed on present\u2010day supercomputers. The performance characteristics of these massively parallel DFT codes on\u2009>\u200910^4 computer cores are not well understood. The GPAW code was ported and optimized for the Blue Gene\/P architecture. 
We present our algorithmic parallelization strategy and interpret the results for a number of benchmark test cases. Copyright \u00a9 2013 John Wiley & Sons, Ltd.","DOI":"10.1002\/cpe.3199","type":"journal-article","created":{"date-parts":[[2013,12,27]],"date-time":"2013-12-27T06:22:37Z","timestamp":1388125357000},"page":"69-93","source":"Crossref","is-referenced-by-count":2,"title":["Design and performance characterization of electronic structure calculations on massively parallel supercomputers: a case study of GPAW on the Blue Gene\/P architecture"],"prefix":"10.1002","volume":"27","author":[{"given":"N.A.","family":"Romero","sequence":"first","affiliation":[{"name":"Leadership Computing Facility Argonne National Laboratory Argonne, IL 60439 USA"}]},{"given":"C.","family":"Glinsvad","sequence":"additional","affiliation":[{"name":"Center for Individual Nanoparticle Functionality, Department of Physics Technical University of Denmark DK\u20102800 Kgs. Lyngby Denmark"}]},{"given":"A.H.","family":"Larsen","sequence":"additional","affiliation":[{"name":"Center for Atomic\u2010scale Materials Design, Department of Physics Technical University of Denmark DK\u20102800 Kgs. Lyngby Denmark"},{"name":"Nano\u2010bio Spectroscopy Group and ETSF Scientific Development Center, Departamento de F\u00edsica de Materiales Universidad del Pa\u00eds Vasco, CSIC\u2013UPV\/EHU\u2013MPC and DIPC Avenida de Tolosa 72 E\u201020018 San Sebasti\u00e1n Spain"}]},{"given":"J.","family":"Enkovaara","sequence":"additional","affiliation":[{"name":"CSC \u2010 IT Center for Science Ltd. P.O. Box 405 FI\u201002101 Espoo Finland"},{"name":"Department of Applied Physics, School of Science Aalto University P.O. 
Box 11100 FI\u201000076 AALTO Finland"}]},{"given":"S.","family":"Shende","sequence":"additional","affiliation":[{"name":"Performance Research Laboratory University of Oregon Eugene, OR, 97403 USA"}]},{"given":"V.A.","family":"Morozov","sequence":"additional","affiliation":[{"name":"Leadership Computing Facility Argonne National Laboratory Argonne, IL 60439 USA"}]},{"given":"J.J.","family":"Mortensen","sequence":"additional","affiliation":[{"name":"Center for Atomic\u2010scale Materials Design, Department of Physics Technical University of Denmark DK\u20102800 Kgs. Lyngby Denmark"}]}],"member":"311","published-online":{"date-parts":[[2013,12,27]]},"reference":[{"key":"e_1_2_10_2_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRev.136.B864"},{"key":"e_1_2_10_3_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRev.140.A1133"},{"key":"e_1_2_10_4_1","first-page":"225","article-title":"Validity of the single processor approach to achieving large\u2010scale computing capabilities","volume":"30","author":"Amdahl G","year":"1967","journal-title":"American Federation of Information Processing Societies Conference 
Proceedings"},{"key":"e_1_2_10_5_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.71.035109"},{"key":"e_1_2_10_6_1","doi-asserted-by":"publisher","DOI":"10.1088\/0953-8984\/22\/25\/253202"},{"key":"e_1_2_10_7_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.72.1240"},{"key":"e_1_2_10_8_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.52.R5471"},{"key":"e_1_2_10_9_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.50.17953"},{"key":"e_1_2_10_10_1","doi-asserted-by":"publisher","DOI":"10.1063\/1.2943138"},{"key":"e_1_2_10_11_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.83.245122"},{"key":"e_1_2_10_12_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.81.085103"},{"key":"e_1_2_10_13_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.86.045208"},{"key":"e_1_2_10_14_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.80.195112"},{"key":"e_1_2_10_15_1","unstructured":"LarsenAH.Efficient electronic structure methods applied to metal nanoparticles.Ph.D. 
Thesis Technical University of Denmark Lyngby Denmark 2011."},{"key":"e_1_2_10_16_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.procs.2011.04.003"},{"volume-title":"Using MPI: Portable Parallel Programming with the Message Passing Interface","year":"1999","author":"Gropp W","key":"e_1_2_10_17_1"},{"key":"e_1_2_10_18_1","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/7055.001.0001","volume-title":"Using MPI\u20102: Advanced Features of the Message\u2010Passing Interface","author":"Gropp W","year":"1999"},{"key":"e_1_2_10_19_1","doi-asserted-by":"publisher","DOI":"10.1147\/rd.521.0199"},{"volume-title":"Real\u2010Space Finite\u2010Difference Paw Method for Large\u2010Scale Applications on Massively Parallel Computers","year":"2012","author":"Baumeister P","key":"e_1_2_10_20_1"},{"key":"e_1_2_10_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.commatsci.2007.07.019"},{"key":"e_1_2_10_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2004.12.004"},{"key":"e_1_2_10_23_1","doi-asserted-by":"publisher","DOI":"10.1002\/cphc.200500059"},{"issue":"1","key":"e_1_2_10_24_1","first-page":"1","article-title":"Architecture of Qbox: a scalable first\u2010principles molecular dynamics code","volume":"52","author":"Gygi F","year":"2008","journal-title":"IBM Journal of Research and Development"},{"key":"e_1_2_10_25_1","first-page":"49","article-title":"Large\u2010scale computing with Quantum ESPRESSO","volume":"32","author":"Giannozzi P","year":"2009","journal-title":"Nuovo Cimento C"},{"key":"e_1_2_10_26_1","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/180\/1\/012028"},{"key":"e_1_2_10_27_1","unstructured":"WangLW. 
Available from:https:\/\/hpcrd.lbl.gov\/~linwang\/PEtot\/PEtot.html."},{"key":"e_1_2_10_28_1","doi-asserted-by":"publisher","DOI":"10.1088\/1742\u20106596\/125\/1\/012058"},{"key":"e_1_2_10_29_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevB.54.11169"},{"key":"e_1_2_10_30_1","doi-asserted-by":"publisher","DOI":"10.1088\/0305-4470\/18\/9\/018"},{"key":"e_1_2_10_31_1","doi-asserted-by":"publisher","DOI":"10.1103\/RevModPhys.64.1045"},{"key":"e_1_2_10_32_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-0207(19960715)39:13<2209::AID-NME951>3.0.CO;2-R"},{"key":"e_1_2_10_33_1","doi-asserted-by":"publisher","DOI":"10.1016\/0021-9991(75)90065-0"},{"key":"e_1_2_10_34_1","doi-asserted-by":"publisher","DOI":"10.1016\/0167-8191(90)90156-4"},{"key":"e_1_2_10_35_1","doi-asserted-by":"publisher","DOI":"10.1006\/jpdc.1994.1108"},{"key":"e_1_2_10_36_1","doi-asserted-by":"publisher","DOI":"10.1137\/0915074"},{"key":"e_1_2_10_37_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9780898719604"},{"key":"e_1_2_10_38_1","doi-asserted-by":"publisher","DOI":"10.1137\/1.9780898719642"},{"key":"e_1_2_10_39_1","article-title":"Introducing the MADNESS numerical framework for petascale computing","author":"Thorton WS","year":"2009","journal-title":"Proceedings of the Cray Users Group"},{"key":"e_1_2_10_40_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342006064503"},{"key":"e_1_2_10_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/567806.567807"},{"key":"e_1_2_10_42_1","unstructured":"Engineering and Scientific Subroutine Library (ESSL)."},{"key":"e_1_2_10_43_1","doi-asserted-by":"crossref","unstructured":"SolomonikE DemmelJ.Communication\u2010optimal parallel 2.5d matrix multiplication and LU factorization algorithms.Technical Report UCB\/EECS\u20102011\u201010 University of California Berkeley 2011February.","DOI":"10.1007\/978-3-642-23397-5_10"},{"key":"e_1_2_10_44_1","unstructured":"SchatzMD PoulsonJP Van De GeijnRA.Scalable universal matrix multiplication 
algorithms: 2d and 3d variations on a theme.Technical Report University of Texas at Austin 2012."},{"key":"e_1_2_10_45_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4615-4489-0_5"},{"key":"e_1_2_10_46_1","doi-asserted-by":"crossref","unstructured":"ShendeS MalonyA CunyJ LindlanK BeckmanP KarmesinS.Portable profiling and tracing for parallel scientific applications using C++.Proceedings 2nd SIGMETRICS Symposium on Parallel and Distributed Tools (SPDT'98) Welches OR 1998;134\u2013145.","DOI":"10.1145\/281035.281049"},{"key":"e_1_2_10_47_1","unstructured":"ShendeS MalonyA Ansell\u2010BellR.Instrumentation and measurement strategies for flexible and portable empirical performance evaluation.Proceedings Tools and Techniques for Performance Evaluation Workshop PDPTA Vol. 3 CSREA Las Vegas NV 2001;1150\u20131156."},{"key":"e_1_2_10_48_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342006064482"},{"volume-title":"IBM System Blue Gene Solution: Blue Gene\/P Application Development","year":"2009","author":"Sosa C","key":"e_1_2_10_49_1"},{"volume-title":"Designing and Building Parallel Programs","year":"1995","author":"Foster I","key":"e_1_2_10_50_1"},{"volume-title":"Graphical Methods for Data Analysis","year":"1995","author":"Chambers JM","key":"e_1_2_10_51_1"},{"key":"e_1_2_10_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/1375527.1375544"},{"key":"e_1_2_10_53_1","first-page":"223","article-title":"A parallel divide and conquer algorithm for symmetric eigenvalue problem on distributed architectures","volume":"6","author":"Tisseur F","year":"1999","journal-title":"SIAM Journal of Scientific 
Computing"},{"key":"e_1_2_10_54_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-09707-7_5"},{"key":"e_1_2_10_55_1","doi-asserted-by":"publisher","DOI":"10.1137\/110848803"},{"key":"e_1_2_10_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/1188455.1188502"},{"key":"e_1_2_10_57_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10562-011-0632-0"},{"key":"e_1_2_10_58_1","doi-asserted-by":"publisher","DOI":"10.1021\/jz3018286"}],"container-title":["Concurrency and Computation: Practice and Experience"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fcpe.3199","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fcpe.3199","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/cpe.3199","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,6]],"date-time":"2023-10-06T10:21:59Z","timestamp":1696587719000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/cpe.3199"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,12,27]]},"references-count":57,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2015,1]]}},"alternative-id":["10.1002\/cpe.3199"],"URL":"https:\/\/doi.org\/10.1002\/cpe.3199","archive":["Portico"],"relation":{},"ISSN":["1532-0626","1532-0634"],"issn-type":[{"type":"print","value":"1532-0626"},{"type":"electronic","value":"1532-0634"}],"subject":[],"published":{"date-parts":[[2013,12,27]]}}}