{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,10,30]],"date-time":"2024-10-30T20:44:40Z","timestamp":1730321080574,"version":"3.28.0"},"publisher-location":"New York, NY, USA","reference-count":48,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,6,26]],"date-time":"2019-06-26T00:00:00Z","timestamp":1561507200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"NSF CAREER Award","award":["CNS-1750760"]},{"name":"CCF","award":["CCF-1823005"]},{"name":"National R&D Program of China","award":["No.2018YFB1004800"]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61602301,61632017,61702328"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,6,26]]},"DOI":"10.1145\/3330345.3330351","type":"proceedings-article","created":{"date-parts":[[2019,6,18]],"date-time":"2019-06-18T12:14:30Z","timestamp":1560860070000},"page":"58-68","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["Laius"],"prefix":"10.1145","author":[{"given":"Wei","family":"Zhang","sequence":"first","affiliation":[{"name":"Shanghai Jiao Tong University"}]},{"given":"Weihao","family":"Cui","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University"}]},{"given":"Kaihua","family":"Fu","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University"}]},{"given":"Quan","family":"Chen","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University"}]},{"given":"Daniel Edward","family":"Mawhirter","sequence":"additional","affiliation":[{"name":"Colorado School of Mines"}]},{"given":"Bo","family":"Wu","sequence":"additional","affiliation":[{"name":"Colorado School of Mines"}]},{"given":"Chao","family":"Li","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University"}]},{"given":"Minyi","family":"Guo","sequence":"additional","affiliation":[{"name":"Shanghai Jiao Tong University"}]}],"member":"320","published-online":{"date-parts":[[2019,6,26]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"{n. d.}. Apple Siri. https:\/\/www.apple.com\/siri\/. {n. d.}. Apple Siri. https:\/\/www.apple.com\/siri\/."},{"key":"e_1_3_2_1_2_1","unstructured":"{n. d.}. Google Translate. https:\/\/translate.google.com\/. {n. d.}. Google Translate. https:\/\/translate.google.com\/."},{"key":"e_1_3_2_1_3_1","unstructured":"{n. d.}. Nvidia Night Compute. https:\/\/docs.nvidia.com\/nsight-compute\/NsightCompute\/index.html. {n. d.}. Nvidia Night Compute. https:\/\/docs.nvidia.com\/nsight-compute\/NsightCompute\/index.html."},{"key":"e_1_3_2_1_4_1","unstructured":"{n. d.}. Prisma. https:\/\/prisma-ai.com\/. {n. d.}. Prisma. https:\/\/prisma-ai.com\/."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2012.6168946"},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-031-01741-4","volume-title":"The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis lectures on computer architecture 8, 3","author":"Barroso Luiz Andr\u00e9","year":"2013","unstructured":"Luiz Andr\u00e9 Barroso , Jimmy Clidaras , and Urs H\u00f6lzle . 2013. The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis lectures on computer architecture 8, 3 ( 2013 ), 1--154. Luiz Andr\u00e9 Barroso, Jimmy Clidaras, and Urs H\u00f6lzle. 2013. The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis lectures on computer architecture 8, 3 (2013), 1--154."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-031-01741-4","volume-title":"The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis lectures on computer architecture 8, 3","author":"Barroso Luiz Andr\u00e9","year":"2013","unstructured":"Luiz Andr\u00e9 Barroso , Jimmy Clidaras , and Urs H\u00f6lzle . 2013. The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis lectures on computer architecture 8, 3 ( 2013 ), 1--154. Luiz Andr\u00e9 Barroso, Jimmy Clidaras, and Urs H\u00f6lzle. 2013. The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis lectures on computer architecture 8, 3 (2013), 1--154."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/792550.792552"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2009.5306797"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037700"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980024.2872368"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2996864"},{"key":"e_1_3_2_1_13_1","volume-title":"cudnn: Efficient primitives for deep learning. arXiv preprint arXiv:1410.0759","author":"Chetlur Sharan","year":"2014","unstructured":"Sharan Chetlur , Cliff Woolley , Philippe Vandermersch , Jonathan Cohen , John Tran , Bryan Catanzaro , and Evan Shelhamer . 2014. cudnn: Efficient primitives for deep learning. arXiv preprint arXiv:1410.0759 ( 2014 ). Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, and Evan Shelhamer. 2014. cudnn: Efficient primitives for deep learning. arXiv preprint arXiv:1410.0759 (2014)."},{"key":"e_1_3_2_1_14_1","volume-title":"Treating constraints as objectives for single-objective evolutionary optimization. Engineering Optimization+ A35 32, 3","author":"Coello Coello Carlos A","year":"2000","unstructured":"Carlos A Coello Coello . 2000. Treating constraints as objectives for single-objective evolutionary optimization. Engineering Optimization+ A35 32, 3 ( 2000 ), 275--308. Carlos A Coello Coello. 2000. Treating constraints as objectives for single-objective evolutionary optimization. Engineering Optimization+ A35 32, 3 (2000), 275--308."},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/2408776.2408794"},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2408776.2408794"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2499368.2451125"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2644865.2541941"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/RTSS.2013.12"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2749472"},{"volume-title":"An introduction to statistical learning","author":"James Gareth","key":"e_1_3_2_1_21_1","unstructured":"Gareth James , Daniela Witten , Trevor Hastie , and Robert Tibshirani . 2013. An introduction to statistical learning . Vol. 112 . Springer . Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2013. An introduction to statistical learning. Vol. 112. Springer."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1038\/505146a"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2011.37"},{"key":"e_1_3_2_1_24_1","volume-title":"Proc. USENIXATC. 17--30","author":"Kato Shinpei","year":"2011","unstructured":"Shinpei Kato , Karthik Lakshmanan , Raj Rajkumar , and Yutaka Ishikawa . 2011 . TimeGraph: GPU scheduling for real-time multi-tasking environments . In Proc. USENIXATC. 17--30 . Shinpei Kato, Karthik Lakshmanan, Raj Rajkumar, and Yutaka Ishikawa. 2011. TimeGraph: GPU scheduling for real-time multi-tasking environments. In Proc. USENIXATC. 17--30."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPEC.2014.7040988"},{"key":"e_1_3_2_1_26_1","volume-title":"Proceedings of the conference on Design, Automation & Test in Europe. European Design and Automation Association, 220","author":"Lee Haeseung","year":"2014","unstructured":"Haeseung Lee , Al Faruque , and Mohammad Abdullah . 2014 . GPU-EvR: Run-time event based real-time scheduling framework on GPGPU platform . In Proceedings of the conference on Design, Automation & Test in Europe. European Design and Automation Association, 220 . Haeseung Lee, Al Faruque, and Mohammad Abdullah. 2014. GPU-EvR: Run-time event based real-time scheduling framework on GPGPU platform. In Proceedings of the conference on Design, Automation & Test in Europe. European Design and Automation Association, 220."},{"key":"e_1_3_2_1_27_1","volume-title":"Reordering GPU kernel launches to enable efficient concurrent execution. arXiv preprint arXiv:1511.07983","author":"Li Teng","year":"2015","unstructured":"Teng Li , Vikram K Narayana , and Tarek El-Ghazawi . 2015. Reordering GPU kernel launches to enable efficient concurrent execution. arXiv preprint arXiv:1511.07983 ( 2015 ). Teng Li, Vikram K Narayana, and Tarek El-Ghazawi. 2015. Reordering GPU kernel launches to enable efficient concurrent execution. arXiv preprint arXiv:1511.07983 (2015)."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2749475"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-04441-0_8"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2155620.2155650"},{"key":"e_1_3_2_1_31_1","unstructured":"NVIDIA. 2012. Sharing a GPU between MPI processes: multi-process service(MPS). NVIDIA. 2012. Sharing a GPU between MPI processes: multi-process service(MPS)."},{"key":"e_1_3_2_1_32_1","unstructured":"NVIDIA. 2015. Multi-Process Service. https:\/\/docs.nvidia.com\/deploy\/mps\/index.htmltopic_6_1_2. NVIDIA. 2015. Multi-Process Service. https:\/\/docs.nvidia.com\/deploy\/mps\/index.htmltopic_6_1_2."},{"key":"e_1_3_2_1_33_1","first-page":"31","article-title":"Cublas library. NVIDIA Corporation, Santa Clara","volume":"15","author":"Nvidia CUDA","year":"2008","unstructured":"CUDA Nvidia . 2008 . Cublas library. NVIDIA Corporation, Santa Clara , California 15 , 27 (2008), 31 . CUDA Nvidia. 2008. Cublas library. NVIDIA Corporation, Santa Clara, California 15, 27 (2008), 31.","journal-title":"California"},{"key":"e_1_3_2_1_34_1","volume-title":"Nvidias next generation cuda compute architecture: Kepler gk110. Whitepaper (2012)","author":"Nvidia C","year":"2012","unstructured":"C Nvidia . 2012. Nvidias next generation cuda compute architecture: Kepler gk110. Whitepaper (2012) ( 2012 ). C Nvidia. 2012. Nvidias next generation cuda compute architecture: Kepler gk110. Whitepaper (2012) (2012)."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/2451116.2451160"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037707"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2015.7056037"},{"key":"e_1_3_2_1_38_1","volume-title":"A survey of decision tree classifier methodology","author":"Rasoul Safavian S","year":"1991","unstructured":"S Rasoul Safavian and David Landgrebe . 1991. A survey of decision tree classifier methodology . IEEE transactions on systems, man, and cybernetics 21, 3 ( 1991 ), 660--674. S Rasoul Safavian and David Landgrebe. 1991. A survey of decision tree classifier methodology. IEEE transactions on systems, man, and cybernetics 21, 3 (1991), 660--674."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/321864.321873"},{"volume-title":"Linear regression analysis","author":"Seber George AF","key":"e_1_3_2_1_40_1","unstructured":"George AF Seber and Alan J Lee . 2012. Linear regression analysis . Vol. 329 . John Wiley & Sons . George AF Seber and Alan J Lee. 2012. Linear regression analysis. Vol. 329. John Wiley & Sons."},{"key":"e_1_3_2_1_41_1","volume-title":"USENIX Annual Technical Conference. 109--120","author":"Suzuki Yusuke","year":"2014","unstructured":"Yusuke Suzuki , Shinpei Kato , Hiroshi Yamada , and Kenji Kono . 2014 . GPUvm: Why not virtualizing GPUs at the hypervisor? . In USENIX Annual Technical Conference. 109--120 . Yusuke Suzuki, Shinpei Kato, Hiroshi Yamada, and Kenji Kono. 2014. GPUvm: Why not virtualizing GPUs at the hypervisor?. In USENIX Annual Technical Conference. 109--120."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/2259016.2259018"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2016.7446078"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/3037697.3037742"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/2508148.2485974"},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/2996913.2997016"},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2014.53"},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1109\/LCA.2018.2851207"}],"event":{"name":"ICS '19: 2019 International Conference on Supercomputing","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture"],"location":"Phoenix Arizona","acronym":"ICS '19"},"container-title":["Proceedings of the ACM International Conference on Supercomputing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3330345.3330351","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,12]],"date-time":"2023-01-12T00:34:18Z","timestamp":1673483658000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3330345.3330351"}},"subtitle":["<u>T<\/u>owards <u>l<\/u>atency <u>a<\/u>wareness and <u>i<\/u>mproved <u>u<\/u>tilization of <u>s<\/u>patial multitasking accelerators in datacenters"],"short-title":[],"issued":{"date-parts":[[2019,6,26]]},"references-count":48,"alternative-id":["10.1145\/3330345.3330351","10.1145\/3330345"],"URL":"https:\/\/doi.org\/10.1145\/3330345.3330351","relation":{},"subject":[],"published":{"date-parts":[[2019,6,26]]},"assertion":[{"value":"2019-06-26","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}