{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T02:30:48Z","timestamp":1740105048614,"version":"3.37.3"},"reference-count":26,"publisher":"Wiley","issue":"18","license":[{"start":{"date-parts":[[2015,7,31]],"date-time":"2015-07-31T00:00:00Z","timestamp":1438300800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Concurrency and Computation"],"published-print":{"date-parts":[[2015,12,25]]},"abstract":"<jats:title>Summary<\/jats:title><jats:p>Efficiently scheduling MapReduce tasks is considered as one of the major challenges that face MapReduce frameworks. Many algorithms were introduced to tackle this issue. Most of these algorithms are focusing on the data locality property for tasks scheduling. The data locality may cause less physical resources utilization in non\u2010virtualized clusters and more power consumption. Virtualized clusters provide a viable solution to support both data locality and better cluster resources utilization. In this paper, we evaluate the major MapReduce scheduling algorithms such as FIFO, Matchmaking, Delay, and multithreading locality (MTL) on virtualized infrastructure. Two major factors are used to test the evaluated algorithms: the simulation time and the energy consumption. The evaluated schedulers are compared, and the results show the superiority and the preference of the MTL scheduler over the other existing schedulers. Also, we present a comparison study between virtualized and non\u2010virtualized clusters for MapReduce tasks scheduling. 
Copyright \u00a9 2015 John Wiley & Sons, Ltd.<\/jats:p>","DOI":"10.1002\/cpe.3595","type":"journal-article","created":{"date-parts":[[2015,7,31]],"date-time":"2015-07-31T23:22:54Z","timestamp":1438384974000},"page":"5686-5699","source":"Crossref","is-referenced-by-count":19,"title":["Evaluating map reduce tasks scheduling algorithms over cloud computing infrastructure"],"prefix":"10.1002","volume":"27","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7474-4198","authenticated-orcid":false,"given":"Qutaibah","family":"Althebyan","sequence":"first","affiliation":[{"name":"Software Engineering Department Jordan University of Science and Technology Irbid Jordan"}]},{"given":"Yaser","family":"Jararweh","sequence":"additional","affiliation":[{"name":"Computer Science Department Jordan University of Science and Technology Irbid Jordan"}]},{"given":"Qussai","family":"Yaseen","sequence":"additional","affiliation":[{"name":"Computer Information Systems Department Jordan University of Science and Technology Irbid Jordan"}]},{"given":"Omar","family":"AlQudah","sequence":"additional","affiliation":[{"name":"Computer Science Department Jordan University of Science and Technology Irbid Jordan"}]},{"given":"Mahmoud","family":"Al\u2010Ayyoub","sequence":"additional","affiliation":[{"name":"Computer Science Department Jordan University of Science and Technology Irbid Jordan"}]}],"member":"311","published-online":{"date-parts":[[2015,7,31]]},"reference":[{"volume-title":"Big Data: The Management Revolution","year":"2012","author":"Mcafee A","key":"e_1_2_6_2_1"},{"key":"e_1_2_6_3_1","unstructured":"VillarsR OlofsonC EastwoodM.Big Data: What It Is And Why You Should Care. 
White paper MA USA 2011."},{"key":"e_1_2_6_4_1","unstructured":"Manyika J Chui M Brown B Bughin J Dobbs R Roxburgh C Byers AH.Big Data: The Next Frontier For Innovation Competition And Productivity 2011."},{"volume-title":"Enhancing Enterprise Systems with Big Data","year":"2013","author":"Statchuk C","key":"e_1_2_6_5_1"},{"volume-title":"Flying Yellow Elephant: Predictable and Efficient MapReduce in the Cloud","year":"2010","author":"Schad J","key":"e_1_2_6_6_1"},{"volume-title":"VirtualCloud \u2010 A Cloud Environment Simulator","year":"2010","author":"Krishnadhan D","key":"e_1_2_6_7_1"},{"key":"e_1_2_6_8_1","doi-asserted-by":"crossref","unstructured":"AlthebyanQ AlqudahO JararwehY YaseenQ.Multi\u2010threading based map reduce tasks scheduling. In5th International Conference on Information and Communication Systems (ICICS):Irbid Jordan 2014;1\u20133.","DOI":"10.1109\/IACS.2014.6841943"},{"key":"e_1_2_6_9_1","article-title":"A scalable Map Reduce tasks scheduling: a threading based approach","author":"Althebyan Q","year":"2015","journal-title":"International Journal of Computational Science and Engineering"},{"key":"e_1_2_6_10_1","unstructured":"The Apache software foundation.Hadoop Apache 2012. (Available from:http:\/\/hadoop.apache.org\/) [Accessed on 10 February 2014]."},{"key":"e_1_2_6_11_1","unstructured":"DeanJ GhemawatS.MapReduce: simplified data processing on large clusters. InOSDI 04: In the Proceedings of the 6th Symposium on Operating Systems Design and Implementation:California USA 2004."},{"issue":"9","key":"e_1_2_6_12_1","first-page":"0975","article-title":"Survey on improved scheduling in Hadoop MapReduce in cloud environments","volume":"34","author":"Rao BT","year":"2011","journal-title":"International Journal of Computer Applications"},{"key":"e_1_2_6_13_1","unstructured":"Under the hood: scheduling MapReduce jobs more efficiently with Corona. 
(Available from:https:\/\/github.com\/facebook\/hadoop\u201020\/tree\/master\/src\/contrib\/corona) [Accessed on 10 February 2014]."},{"issue":"4","key":"e_1_2_6_14_1","first-page":"929","article-title":"A task scheduling algorithm for Hadoop platform","volume":"8","author":"Chen J","year":"2013","journal-title":"Journal of Computers"},{"key":"e_1_2_6_15_1","doi-asserted-by":"crossref","unstructured":"VermaA CherkasovaL CampbellR.Two sides of a coin: optimizing the schedule of MapReduce jobs to minimize their makespan and improve cluster performance. InIn the Proceedings of the 2012 IEEE 20th International Symposium on Modeling Analysis and Simulation of Computer and Telecommunication Systems:Washington DC USA 2012;11\u201318.","DOI":"10.1109\/MASCOTS.2012.12"},{"key":"e_1_2_6_16_1","doi-asserted-by":"crossref","unstructured":"KcK AnyanwuK.Scheduling Hadoop jobs to meet deadlines. InIEEE Second International Conference on Cloud Computing Technology and Science (CloudCom):Indianapolis Indiana USA 2010;388\u2013392.","DOI":"10.1109\/CloudCom.2010.97"},{"key":"e_1_2_6_17_1","doi-asserted-by":"crossref","unstructured":"PalanisamyB SinghA LiuL JainB.Purlieus: locality\u2010aware resource allocation for MapReduce in a cloud. InProceedings of International Conference for High Performance Computing Networking Storage and Analysis:Seattle Washington USA 2011;Article Number 58.","DOI":"10.1145\/2063384.2063462"},{"key":"e_1_2_6_18_1","doi-asserted-by":"crossref","unstructured":"GuoZ FoxG ZhouM.Investigation of data locality in MapReduce. InProceedings of the 2012 12th IEEE\/ACM International Symposium on Cluster Cloud and Grid Computing (CCGRID 2012):Ottawa Canada 2012;419\u2013426.","DOI":"10.1109\/CCGrid.2012.42"},{"key":"e_1_2_6_19_1","doi-asserted-by":"crossref","unstructured":"HammoudM SakrM.Locality\u2010aware reduce task scheduling for MapReduce. 
InProceedings of the 2011 IEEE Third International Conference on Cloud Computing Technology and Science (CloudCom):Athens Greece 2011;570\u2013576.","DOI":"10.1109\/CloudCom.2011.87"},{"key":"e_1_2_6_20_1","first-page":"49","article-title":"Research on scheduling scheme for Hadoop clusters","volume":"17","author":"Xie J","year":"2013","journal-title":"International Conference on Computational Science (ICC)"},{"key":"e_1_2_6_21_1","doi-asserted-by":"publisher","DOI":"10.14257\/ijfgcn.2015.8.2.33"},{"key":"e_1_2_6_22_1","doi-asserted-by":"crossref","unstructured":"EhsanM RaduS.LiPS: A cost\u2010efficient data and task co\u2010scheduler for MapReduce. InProceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum.IEEE Computer Society 2013;2230\u20132233.","DOI":"10.1109\/IPDPSW.2013.175"},{"key":"e_1_2_6_23_1","unstructured":"BikashS TimothyW ChitaRD.HybridMR: a hierarchical MapReduce scheduler for hybrid data centers. InProceedings of the 33rd International Conference on Distributed Computing Systems ICDCS'13:Philadelphia USA 2013;102\u2013111."},{"key":"e_1_2_6_24_1","first-page":"40","volume-title":"the Proceedings of the International Conference on Cloud Computing Technology and Science, Athens, Greece","author":"He C","year":"2011"},{"key":"e_1_2_6_25_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.simpat.2014.09.003"},{"key":"e_1_2_6_26_1","unstructured":"JararwehY AlsharaZ JarrahM KharbutliM AlsalehM.TeachCloud: cloud computing educational toolkit. In1st International IBM Cloud Academy Conference ICA CON 2012:North Carolina 2012;1\u201316."},{"key":"e_1_2_6_27_1","doi-asserted-by":"crossref","unstructured":"QuwaiderM JararwehY.Cloudlet\u2010based for big data collection in body area networks. 
InIn 8th International Conference for Internet Technology and Secured Transactions (ICITST) vol.\u00a0137:London UK 2013;137\u2013141.","DOI":"10.1109\/ICITST.2013.6750178"}],"container-title":["Concurrency and Computation: Practice and Experience"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fcpe.3595","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/cpe.3595","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,13]],"date-time":"2023-09-13T11:53:28Z","timestamp":1694606008000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/cpe.3595"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,7,31]]},"references-count":26,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2015,12,25]]}},"alternative-id":["10.1002\/cpe.3595"],"URL":"https:\/\/doi.org\/10.1002\/cpe.3595","archive":["Portico"],"relation":{},"ISSN":["1532-0626","1532-0634"],"issn-type":[{"type":"print","value":"1532-0626"},{"type":"electronic","value":"1532-0634"}],"subject":[],"published":{"date-parts":[[2015,7,31]]}}}