{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,18]],"date-time":"2024-06-18T11:20:44Z","timestamp":1718709644954},"reference-count":25,"publisher":"Association for Computing Machinery (ACM)","issue":"6","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2015,2]]},"abstract":"Modern processor technologies have driven new designs and implementations in main-memory hash joins. Recently, Intel Many Integrated Core (MIC) co-processors (commonly known as Xeon Phi) embrace emerging x86 single-chip many-core techniques. Compared with contemporary multi-core CPUs, Xeon Phi has quite different architectural features: wider SIMD instructions, many cores and hardware contexts, as well as lower-frequency in-order cores. In this paper, we experimentally revisit the state-of-the-art hash join algorithms on Xeon Phi co-processors. In particular, we study two camps of hash join algorithms: hardware-conscious ones that advocate careful tailoring of the join algorithms to underlying hardware architectures and hardware-oblivious ones that omit such careful tailoring. For each camp, we study the impact of architectural features and software optimizations on Xeon Phi in comparison with results on multi-core CPUs. Our experiments show two major findings on Xeon Phi, which are quantitatively different from those on multi-core CPUs. First, the impact of architectural features and software optimizations has quite different behavior on Xeon Phi in comparison with those on the CPU, which calls for new optimization and tuning on Xeon Phi. Second, hardware oblivious algorithms can outperform hardware conscious algorithms on a wide parameter window. These two findings further shed light on the design and implementation of query processing on new-generation single-chip many-core technologies.<\/jats:p>","DOI":"10.14778\/2735703.2735704","type":"journal-article","created":{"date-parts":[[2015,5,12]],"date-time":"2015-05-12T15:37:52Z","timestamp":1431445072000},"page":"642-653","source":"Crossref","is-referenced-by-count":60,"title":["Improving main memory hash joins on Intel Xeon Phi processors"],"prefix":"10.14778","volume":"8","author":[{"given":"Saurabh","family":"Jha","sequence":"first","affiliation":[{"name":"Nanyang Technological University, Singapore"}]},{"given":"Bingsheng","family":"He","sequence":"additional","affiliation":[{"name":"Nanyang Technological University, Singapore"}]},{"given":"Mian","family":"Lu","sequence":"additional","affiliation":[{"name":"A*STAR IHPC, Singapore"}]},{"given":"Xuntao","family":"Cheng","sequence":"additional","affiliation":[{"name":"Nanyang Technological University"}]},{"given":"Huynh Phung","family":"Huynh","sequence":"additional","affiliation":[{"name":"A*STAR IHPC, Singapore"}]}],"member":"320","published-online":{"date-parts":[[2015,2]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Hash functions. http:\/\/www.cse.yorku.ca\/~oz\/hash.html. Hash functions. http:\/\/www.cse.yorku.ca\/~oz\/hash.html."},{"key":"e_1_2_1_2_1","unstructured":"Intel xeon phi coprocessor 5110p: http:\/\/ark.intel.com\/products\/71992\/intel-xeon-phi-coprocessor-5110p-8gb-1_053-ghz-60-core. Intel xeon phi coprocessor 5110p: http:\/\/ark.intel.com\/products\/71992\/intel-xeon-phi-coprocessor-5110p-8gb-1_053-ghz-60-core."},{"key":"e_1_2_1_3_1","unstructured":"Intel xeon processor e5-2687w: http:\/\/ark.intel.com\/products\/64582\/intel-xeon-processor-e5-2687w-20m-cache-3_10-ghz-8_00-gts-intel-qpi. Intel xeon processor e5-2687w: http:\/\/ark.intel.com\/products\/64582\/intel-xeon-processor-e5-2687w-20m-cache-3_10-ghz-8_00-gts-intel-qpi."},{"key":"e_1_2_1_4_1","unstructured":"Mumurhash3. https:\/\/code.google.com\/p\/smhasher\/wiki\/MurmurHash3. Mumurhash3. https:\/\/code.google.com\/p\/smhasher\/wiki\/MurmurHash3."},{"key":"e_1_2_1_5_1","unstructured":"Optimization and performance tuning for intel xeon phi coprocessors. https:\/\/software.intel.com\/en-us\/articles\/optimization-and-performance-tuning-for-intel-xeon-phi-coprocessors-part-2-understanding. Optimization and performance tuning for intel xeon phi coprocessors. https:\/\/software.intel.com\/en-us\/articles\/optimization-and-performance-tuning-for-intel-xeon-phi-coprocessors-part-2-understanding."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732219.2732227"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2013.6544839"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989328"},{"key":"e_1_2_1_9_1","volume-title":"VLDB","author":"Boncz P. A.","year":"1999"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/1272743.1272747"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.14778\/2735508.2735513"},{"key":"e_1_2_1_12_1","volume-title":"ICDE. IEEE","author":"Graefe G.","year":"1994"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1620585.1620588"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1376616.1376670"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536206.2536216"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536360.2536370"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/2236584.2236592"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687553.1687564"},{"key":"e_1_2_1_19_1","volume-title":"TPDS. IEEE","author":"Lu M.","year":"2014"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/1989323.1989326"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2002.1019210"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2013.44"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2013.6544810"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807207"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536274.2536319"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/2735703.2735704","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T10:32:31Z","timestamp":1672223551000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/2735703.2735704"}},"subtitle":["an experimental approach"],"short-title":[],"issued":{"date-parts":[[2015,2]]},"references-count":25,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2015,2]]}},"alternative-id":["10.14778\/2735703.2735704"],"URL":"https:\/\/doi.org\/10.14778\/2735703.2735704","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2015,2]]}}}