{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,11]],"date-time":"2025-04-11T17:44:45Z","timestamp":1744393485959,"version":"3.37.3"},"reference-count":204,"publisher":"Association for Computing Machinery (ACM)","issue":"6","funder":[{"DOI":"10.13039\/501100007601","name":"Horizon 2020 Framework Programme","doi-asserted-by":"publisher","award":["825258"],"id":[{"id":"10.13039\/501100007601","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2021,11,30]]},"abstract":"<jats:p>\n One of the most critical tasks for improving data quality and increasing the reliability of data analytics is\n <jats:italic>Entity Resolution<\/jats:italic>\n (ER), which aims to identify different descriptions that refer to the same real-world entity. Despite several decades of research, ER remains a challenging problem. In this survey, we highlight the novel aspects of resolving Big Data entities when we should satisfy more than one of the Big Data characteristics simultaneously (i.e., Volume and Velocity with Variety). We present the basic concepts, processing steps, and execution strategies that have been proposed by database, semantic Web, and machine learning communities in order to cope with the loose\n <jats:italic>structuredness<\/jats:italic>\n , extreme\n <jats:italic>diversity<\/jats:italic>\n , high\n <jats:italic>speed,<\/jats:italic>\n and large\n <jats:italic>scale<\/jats:italic>\n of entity descriptions used by real-world applications. 
We provide an end-to-end view of ER workflows\u00a0for\u00a0Big Data, critically review the pros and cons of existing methods, and conclude with the main open research\u00a0directions.\n <\/jats:p>","DOI":"10.1145\/3418896","type":"journal-article","created":{"date-parts":[[2020,12,6]],"date-time":"2020-12-06T22:23:20Z","timestamp":1607293400000},"page":"1-42","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":117,"title":["An Overview of End-to-End Entity Resolution for Big Data"],"prefix":"10.1145","volume":"53","author":[{"given":"Vassilis","family":"Christophides","sequence":"first","affiliation":[{"name":"ENSEA, ETIS Lab, Cergy France"}]},{"given":"Vasilis","family":"Efthymiou","sequence":"additional","affiliation":[{"name":"IBM Research, USA"}]},{"given":"Themis","family":"Palpanas","sequence":"additional","affiliation":[{"name":"Universit\u00e9 de Paris and French University Institute (IUF), Paris, France"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7298-9431","authenticated-orcid":false,"given":"George","family":"Papadakis","sequence":"additional","affiliation":[{"name":"National and Kapodistrian University of Athens, Panepistimioupolis, Ilissia, Athens, Greece"}]},{"given":"Kostas","family":"Stefanidis","sequence":"additional","affiliation":[{"name":"Tampere University, Kalevantie, Tampere, Finland"}]}],"member":"320","published-online":{"date-parts":[[2020,12,6]]},"reference":[{"volume-title":"Aizawa and Keizo Oyama","year":"2005","author":"Akiko","key":"e_1_2_1_1_1"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732967.2732975"},{"key":"e_1_2_1_3_1","doi-asserted-by":"crossref","unstructured":"Yasser Altowim and Sharad Mehrotra. 2017. Parallel progressive approach to entity resolution using MapReduce. In ICDE. 909--920. Yasser Altowim and Sharad Mehrotra. 2017. Parallel progressive approach to entity resolution using MapReduce. In ICDE. 
909--920.","DOI":"10.1109\/ICDE.2017.139"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2016.2623607"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.14778\/2850583.2850587"},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Rohit Ananthakrishna Surajit Chaudhuri and Venkatesh Ganti. 2002. Eliminating fuzzy duplicates in data warehouses. In VLDB. 586--597.","DOI":"10.1016\/B978-155860869-6\/50058-5"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/3236024.3264590"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2014.2365779"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/3341105.3375776"},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.7155\/jgaa.00084"},{"key":"e_1_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Tadas Baltrusaitis Chaitanya Ahuja and Louis-Philippe Morency. 2019. Challenges and applications in multimodal machine learning. In The Handbook of Multimodal-Multisensor Interfaces. 
ACM and Morgan & Claypool 17--48.","DOI":"10.1145\/3107990.3107993"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:MACH.0000033116.57574.95"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944966"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDCS.2007.96"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-008-0098-x"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1217299.1217304"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/1622637.1622653"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2018.02.005"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2005.18"},{"key":"e_1_2_1_20_1","doi-asserted-by":"crossref","unstructured":"M. Bilenko and R. J. Mooney. 2003. Adaptive duplicate detection using learnable string similarity measures. In SIGKDD.","DOI":"10.1145\/956750.956759"},{"volume-title":"LINDA: Distributed web-of-data-scale entity matching. In CIKM.","year":"2012","author":"B\u00f6hm Christoph","key":"e_1_2_1_21_1"},{"volume-title":"Proceedings of the 23rd International Conference on Extending Database Technology (EDBT\u201920)","year":"2020","author":"Brunner Ursin","key":"e_1_2_1_22_1"},{"key":"e_1_2_1_23_1","unstructured":"Chengliang Chai Guoliang Li Jian Li Dong Deng and Jianhua Feng. 2016. Cost-effective crowdsourced entity resolution: A partial-order approach. 
In SIGMOD."},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-018-0509-6"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1137\/S0097539702418498"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.5555\/3304222.3304326"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2017\/209"},{"key":"e_1_2_1_28_1","unstructured":"Xiao Chen. 2015. Crowdsourcing entity resolution: A short overview and open issues. In GvDB. 72--77. Xiao Chen. 2015. Crowdsourcing entity resolution: A short overview and open issues. In GvDB. 72--77."},{"volume-title":"Cloud-scale entity resolution: Current state and open challenges. OJBD 4, 1","year":"2018","author":"Chen Xiao","key":"e_1_2_1_29_1"},{"volume-title":"Naughton","year":"2014","author":"Chiang Yueh-Hsuan","key":"e_1_2_1_30_1"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732279.2732284"},{"key":"e_1_2_1_32_1","unstructured":"Kyunghyun Cho Bart van Merrienboer \u00c7aglar G\u00fcl\u00e7ehre Dzmitry Bahdanau Fethi Bougares Holger Schwenk and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In EMNLP. 1724--1734. Kyunghyun Cho Bart van Merrienboer \u00c7aglar G\u00fcl\u00e7ehre Dzmitry Bahdanau Fethi Bougares Holger Schwenk and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In EMNLP. 1724--1734."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1402020"},{"volume-title":"Data Matching","author":"Christen Peter","key":"e_1_2_1_34_1","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-31164-2"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2011.127"},{"key":"e_1_2_1_36_1","doi-asserted-by":"crossref","unstructured":"Peter Christen Ross W. Gayler and David Hawking. 2009. Similarity-aware indexing for real-time entity resolution. In CIKM. 1565--1568. 
","DOI":"10.1145\/1645953.1646173"},{"key":"e_1_2_1_37_1","doi-asserted-by":"crossref","unstructured":"Vassilis Christophides Vasilis Efthymiou and Kostas Stefanidis. 2015. Entity Resolution in the Web of Data. Morgan & Claypool.","DOI":"10.1007\/978-3-031-79468-1"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.14778\/2983200.2983203"},{"key":"e_1_2_1_39_1","doi-asserted-by":"crossref","unstructured":"Yeounoh Chung Tim Kraska Neoklis Polyzotis K. Tae and Steven Euijong Whang. 2019. Slice finder: Automated data slicing for model validation. In ICDE.","DOI":"10.1109\/ICDE.2019.00139"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.70.066111"},{"volume-title":"Cohen and Jacob Richman","year":"2002","author":"William","key":"e_1_2_1_41_1"},{"volume-title":"Falcon: Scaling up hands-off crowdsourced entity matching to build cloud services. In SIGMOD. 1431--1446.","year":"2017","author":"Das Sanjib","key":"e_1_2_1_42_1"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/1327452.1327492"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.3233\/SW-180306"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-013-0324-z"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0377-2217(00)00108-9"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-019-01347-0"},{"key":"e_1_2_1_48_1","doi-asserted-by":"crossref","unstructured":"Xin Dong Alon Y. Halevy and Jayant Madhavan. 2005. Reference reconciliation in complex information spaces. 
In SIGMOD. 85--96.","DOI":"10.1145\/1066157.1066168"},{"key":"e_1_2_1_49_1","doi-asserted-by":"crossref","unstructured":"Xin Luna Dong and Divesh Srivastava. 2015. Big Data Integration. Morgan & Claypool.","DOI":"10.1007\/978-3-031-01853-4"},{"volume-title":"Approximate data instance matching: A survey. KAIS 27, 1 (01","year":"2011","author":"Dorneles Carina Friedrich","key":"e_1_2_1_50_1"},{"key":"e_1_2_1_51_1","unstructured":"Uwe Draisbach and Felix Naumann. 2010. DuDe: The duplicate detection toolkit. In QDB."},{"key":"e_1_2_1_52_1","first-page":"1454","article-title":"Distributed representations of tuples for entity resolution","volume":"11","author":"Ebraheem Muhammad","year":"2018","journal-title":"PVLDB"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.14778\/3368289.3368303"},{"key":"e_1_2_1_54_1","doi-asserted-by":"crossref","unstructured":"Vasilis Efthymiou Oktie Hassanzadeh Mariano Rodriguez-Muro and Vassilis Christophides. 2017. Matching web tables with knowledge base entities: From entity lookups to entity embeddings. In ISWC. 260--277.","DOI":"10.1007\/978-3-319-68288-4_16"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2016.12.001"},{"key":"e_1_2_1_56_1","unstructured":"
Vasilis Efthymiou George Papadakis Kostas Stefanidis and Vassilis Christophides. 2019. MinoanER: Schema-agnostic non-iterative massively parallel resolution of web entities. In EDBT. 373--384."},{"volume-title":"Big data entity resolution: From highly to somehow similar entity descriptions in the Web","author":"Efthymiou Vasilis","key":"e_1_2_1_57_1"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.5555\/1191547.1191739"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1207\/s15516709cog1402_1"},{"key":"e_1_2_1_60_1","doi-asserted-by":"crossref","unstructured":"Jos\u00e9 Esquivel Dyaa Albakour Miguel Martinez-Alvarez David Corney and Samir Moussa. 2017. On the long-tail entities in news. In ECIR. Jos\u00e9 Esquivel Dyaa Albakour Miguel Martinez-Alvarez David Corney and Samir Moussa. 2017. On the long-tail entities in news. In ECIR.","DOI":"10.1007\/978-3-319-56608-5_67"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-010-0206-6"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687627.1687674"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1969.10501049"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.14778\/2876473.2876474"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/2783258.2783396"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1080\/15427951.2004.10129093"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007662407062"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2019\/689"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.14778\/2733004.2733068"},{"key":"e_1_2_1_70_1","doi-asserted-by":"crossref","unstructured":"Sainyam Galhotra Donatella Firmani Barna Saha and Divesh Srivastava. 2018. Robust entity resolution using random graphs. In SIGMOD. 3--18. Sainyam Galhotra Donatella Firmani Barna Saha and Divesh Srivastava. 2018. 
Robust entity resolution using random graphs. In SIGMOD. 3--18.","DOI":"10.1145\/3183713.3183755"},{"volume-title":"C","year":"2018","author":"Gao Nengneng","key":"e_1_2_1_71_1"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367564"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1145\/1217299.1217303"},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2588576"},{"key":"e_1_2_1_75_1","doi-asserted-by":"crossref","unstructured":"Behzad Golshan Alon Y. Halevy George A. Mihaila and Wang-Chiew Tan. 2017. Data integration: After the teenage years. In PODS. 101--106. Behzad Golshan Alon Y. Halevy George A. Mihaila and Wang-Chiew Tan. 2017. Data integration: After the teenage years. In PODS. 101--106.","DOI":"10.1145\/3034786.3056124"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.14778\/3229863.3236255"},{"key":"e_1_2_1_77_1","first-page":"9","article-title":"Incremental record linkage","volume":"7","author":"Gruenheid Anja","year":"2014","journal-title":"PVLDB"},{"volume-title":"Proceedings of the 38th International Conference on Software Engineering (ICSE\u201916)","author":"Gulzar M. A.","key":"e_1_2_1_78_1"},{"key":"e_1_2_1_79_1","doi-asserted-by":"crossref","unstructured":"Sara Hajian Francesco Bonchi and Carlos Castillo. 2016. Algorithmic bias: From discrimination discovery to fairness-aware data mining. In KDD. Sara Hajian Francesco Bonchi and Carlos Castillo. 2016. Algorithmic bias: From discrimination discovery to fairness-aware data mining. In KDD.","DOI":"10.1145\/2939672.2945386"},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687627.1687771"},{"key":"e_1_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-009-0161-2"},{"key":"e_1_2_1_82_1","unstructured":"Taher H. Haveliwala Aristides Gionis and Piotr Indyk. 2000. Scalable techniques for clustering the Web. In WebDB. 129--134. Taher H. Haveliwala Aristides Gionis and Piotr Indyk. 2000. 
Scalable techniques for clustering the Web. In WebDB. 129--134."},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/2452376.2452440"},{"volume-title":"Stolfo","year":"1995","author":"Hern\u00e0ndez Mauricio A.","key":"e_1_2_1_84_1"},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.artint.2012.06.001"},{"key":"e_1_2_1_86_1","first-page":"1","article-title":"The rise of crowdsourcing","volume":"14","author":"Howe Jeff","year":"2006","journal-title":"Wired Magazine"},{"volume-title":"Ilyas and Xu Chu","year":"2019","author":"Ihab","key":"e_1_2_1_87_1"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920898"},{"key":"e_1_2_1_89_1","doi-asserted-by":"crossref","unstructured":"Ekaterini Ioannou Claudia Nieder\u00e9e and Wolfgang Nejdl. 2008. Probabilistic entity linkage for heterogeneous information spaces. In CAiSE. Ekaterini Ioannou Claudia Nieder\u00e9e and Wolfgang Nejdl. 2008. Probabilistic entity linkage for heterogeneous information spaces. In CAiSE.","DOI":"10.1007\/978-3-540-69534-9_41"},{"key":"e_1_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.1007\/s13740-012-0015-8"},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.14778\/2350229.2350276"},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.1145\/331499.331504"},{"volume-title":"Fine-grained record integration and linkage tool. BDR 82, 11","year":"2008","author":"Jurczyk Pawel","key":"e_1_2_1_93_1"},{"key":"e_1_2_1_94_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2017.06.006"},{"key":"e_1_2_1_95_1","unstructured":"Alexandros Karakasidis and Evaggelia Pitoura. 2019. Identifying bias in name matching tasks. In EDBT. 626--629. Alexandros Karakasidis and Evaggelia Pitoura. 2019. Identifying bias in name matching tasks. In EDBT. 
626--629."},{"volume-title":"Verykios","year":"2018","author":"Karapiperis Dimitrios","key":"e_1_2_1_96_1"},{"key":"e_1_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P19-1586"},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.14778\/3229863.3236225"},{"volume-title":"Miranker","year":"2013","author":"Kejriwal Mayank","key":"e_1_2_1_99_1"},{"volume-title":"Miranker","year":"2014","author":"Kejriwal Mayank","key":"e_1_2_1_100_1"},{"volume-title":"Miranker","year":"2015","author":"Kejriwal Mayank","key":"e_1_2_1_101_1"},{"key":"e_1_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2015.07.002"},{"volume-title":"Khan and Hector Garcia-Molina","year":"2016","author":"Asif","key":"e_1_2_1_103_1"},{"volume-title":"Magellan: Toward building entity matching management systems. PVLDB 9, 12","year":"2016","author":"Konda Pradap","key":"e_1_2_1_104_1"},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2009.10.003"},{"volume-title":"Evaluation of entity resolution approaches on real-world match problems. PVLDB 3, 1","year":"2010","author":"K\u00f6pcke Hanna","key":"e_1_2_1_106_1"},{"key":"e_1_2_1_107_1","doi-asserted-by":"crossref","unstructured":"Nick Koudas Sunita Sarawagi and Divesh Srivastava. 2006. Record linkage: Similarity measures and algorithms. In SIGMOD. 802--803. Nick Koudas Sunita Sarawagi and Divesh Srivastava. 2006. Record linkage: Similarity measures and algorithms. In SIGMOD. 802--803.","DOI":"10.1145\/1142473.1142599"},{"key":"e_1_2_1_108_1","doi-asserted-by":"publisher","DOI":"10.1145\/321138.321140"},{"key":"e_1_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2019.00027"},{"key":"e_1_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.14778\/3311880.3311883"},{"key":"e_1_2_1_111_1","unstructured":"Simon Lacoste-Julien Konstantina Palla Alex Davies Gjergji Kasneci Thore Graepel and Zoubin Ghahramani. 2013. SIGMa: Simple greedy matching for aligning large knowledge bases. 
In SIGKDD. 572--580. Simon Lacoste-Julien Konstantina Palla Alex Davies Gjergji Kasneci Thore Graepel and Zoubin Ghahramani. 2013. SIGMa: Simple greedy matching for aligning large knowledge bases. In SIGKDD. 572--580."},{"volume-title":"Anno Langen, and Yang Li.","year":"2017","author":"Li Furong","key":"e_1_2_1_112_1"},{"key":"e_1_2_1_113_1","doi-asserted-by":"crossref","unstructured":"Guoliang Li Yudian Zheng Ju Fan Jiannan Wang and Reynold Cheng. 2017. Crowdsourced data management: Overview and challenges. In SIGMOD. Guoliang Li Yudian Zheng Ju Fan Jiannan Wang and Reynold Cheng. 2017. Crowdsourced data management: Overview and challenges. In SIGMOD.","DOI":"10.1145\/3035918.3054776"},{"key":"e_1_2_1_114_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2008.202"},{"key":"e_1_2_1_115_1","first-page":"1","article-title":"Scalable lineage capture for debugging DISC analytics","volume":"17","author":"Logothetis Dionysios","year":"2013","journal-title":"SoCC."},{"key":"e_1_2_1_116_1","doi-asserted-by":"publisher","DOI":"10.1145\/2433396.2433439"},{"key":"e_1_2_1_117_1","unstructured":"Claire Mathieu Ocan Sankur and Warren Schudy. 2010. Online correlation clustering. In STACS. 573--584. Claire Mathieu Ocan Sankur and Warren Schudy. 2010. Online correlation clustering. In STACS. 573--584."},{"volume-title":"Proceedings of the 6th ACM International Conference on Knowledge Discovery and Data Mining (KDD","year":"2000","author":"McCallum Andrew","key":"e_1_2_1_118_1"},{"volume-title":"Proceedings of the 10th International Workshop on Quality in Databases (QDB\u201912)","year":"2012","author":"McNeill W. P.","key":"e_1_2_1_119_1"},{"volume-title":"Wilson","year":"1970","author":"McVitie David G.","key":"e_1_2_1_120_1"},{"key":"e_1_2_1_121_1","doi-asserted-by":"crossref","unstructured":"Gr\u00e9goire Mesnil Xiaodong He Li Deng and Yoshua Bengio. 2013. Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding. 
In INTERSPEECH. 3771--3775.","DOI":"10.21437\/Interspeech.2013-596"},{"key":"e_1_2_1_122_1","doi-asserted-by":"crossref","unstructured":"Sidharth Mudgal Han Li Theodoros Rekatsinas AnHai Doan Youngchoon Park Ganesh Krishnan Rohit Deep Esteban Arcaute and Vijay Raghavendra. 2018. Deep learning for entity matching: A design space exploration. In SIGMOD. 19--34.","DOI":"10.1145\/3183713.3196926"},{"key":"e_1_2_1_123_1","doi-asserted-by":"crossref","unstructured":"Charini Nanayakkara Peter Christen and Thilina Ranbaduge. 2019. Robust temporal graph clustering for group record linkage. In PAKDD.","DOI":"10.1007\/978-3-030-16145-3_41"},{"key":"e_1_2_1_124_1","doi-asserted-by":"crossref","unstructured":"Felix Naumann and Melanie Herschel. 2010. An Introduction to Duplicate Detection. Morgan & Claypool.","DOI":"10.1007\/978-3-031-01835-0"},{"key":"e_1_2_1_125_1","unstructured":"E. D. Nelson and J. R. Talburt. 2011. Entity resolution for longitudinal studies in education using OYSTER. 
In IKE."},{"key":"e_1_2_1_126_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2016.0035"},{"key":"e_1_2_1_127_1","doi-asserted-by":"publisher","DOI":"10.3233\/SW-150210"},{"key":"e_1_2_1_128_1","unstructured":"Axel-Cyrille Ngonga Ngomo and S\u00f6ren Auer. 2011. LIMES\u2014A time-efficient approach for large-scale link discovery on the web of data. In IJCAI. Axel-Cyrille Ngonga Ngomo and S\u00f6ren Auer. 2011. LIMES\u2014A time-efficient approach for large-scale link discovery on the web of data. In IJCAI."},{"key":"e_1_2_1_129_1","unstructured":"Maximilian Nickel and Douwe Kiela. 2017. Poincar\u00e9 embeddings for learning hierarchical representations. In NIPS. 6338--6347. Maximilian Nickel and Douwe Kiela. 2017. Poincar\u00e9 embeddings for learning hierarchical representations. In NIPS. 6338--6347."},{"volume-title":"Proceedings of the 6th International Conference on Knowledge Engineering: Practice and Patterns (EKAW\u201908)","author":"Nikolov Andriy","key":"e_1_2_1_130_1"},{"key":"e_1_2_1_131_1","doi-asserted-by":"publisher","DOI":"10.5555\/1304611.1306578"},{"volume-title":"Linking and Mining Heterogeneous and Multi-view Data","author":"O\u2019Hare Kevin","key":"e_1_2_1_132_1"},{"key":"e_1_2_1_133_1","doi-asserted-by":"publisher","DOI":"10.14778\/2856318.2856326"},{"key":"e_1_2_1_134_1","doi-asserted-by":"crossref","unstructured":"George Papadakis Konstantina Bereta Themis Palpanas and Manolis Koubarakis. 2017. Multi-core meta-blocking for big linked data. In SEMANTICS. George Papadakis Konstantina Bereta Themis Palpanas and Manolis Koubarakis. 2017. Multi-core meta-blocking for big linked data. 
In SEMANTICS.","DOI":"10.1145\/3132218.3132230"},{"key":"e_1_2_1_135_1","doi-asserted-by":"publisher","DOI":"10.1145\/2124295.2124305"},{"key":"e_1_2_1_136_1","first-page":"2665","article-title":"A blocking framework for entity resolution in highly heterogeneous information spaces","volume":"25","author":"Papadakis George","year":"2013","journal-title":"IEEE TKDE"},{"volume-title":"Meta-blocking: Taking entity resolution to the next level. TKDE 26, 8","year":"2014","author":"Papadakis George","key":"e_1_2_1_137_1"},{"key":"e_1_2_1_138_1","doi-asserted-by":"publisher","DOI":"10.14778\/2733085.2733098"},{"volume-title":"Proceedings of the 19th International Conference on Extending Database Technology (EDBT\u201916)","year":"2016","author":"Papadakis George","key":"e_1_2_1_139_1"},{"volume-title":"A survey of blocking and filtering techniques for entity resolution. ACM Comput. Surv. 53, 2","year":"2020","author":"Papadakis George","key":"e_1_2_1_140_1"},{"key":"e_1_2_1_141_1","doi-asserted-by":"publisher","DOI":"10.14778\/2947618.2947624"},{"key":"e_1_2_1_142_1","unstructured":"George Papadakis Leonidas Tsekouras Emmanouil Thanos Nikiforos Pittaras Giovanni Simonini Dimitrios Skoutas Paul Isaris George Giannakopoulos Themis Palpanas and Manolis Koubarakis. 2020. JedAI3: Beyond batch blocking-based entity resolution. In EDBT. 603--606. George Papadakis Leonidas Tsekouras Emmanouil Thanos Nikiforos Pittaras Giovanni Simonini Dimitrios Skoutas Paul Isaris George Giannakopoulos Themis Palpanas and Manolis Koubarakis. 2020. JedAI 3 : Beyond batch blocking-based entity resolution. In EDBT. 603--606."},{"key":"e_1_2_1_143_1","first-page":"1316","article-title":"Progressive duplicate detection","volume":"27","author":"Papenbrock Thorsten","year":"2015","journal-title":"IEEE TKDE"},{"volume-title":"Manning","year":"2014","author":"Pennington Jeffrey","key":"e_1_2_1_144_1"},{"key":"e_1_2_1_145_1","doi-asserted-by":"crossref","unstructured":"Banda Ramadan and Peter Christen. 
2014. Forest-based dynamic sorted neighborhood indexing for real-time entity resolution. In CIKM. Banda Ramadan and Peter Christen. 2014. Forest-based dynamic sorted neighborhood indexing for real-time entity resolution. In CIKM.","DOI":"10.1145\/2661829.2661869"},{"key":"e_1_2_1_146_1","article-title":"Dynamic sorted neighborhood indexing for real-time entity resolution","volume":"6","author":"Ramadan Banda","year":"2015","journal-title":"J. Data Inf. Quality"},{"key":"e_1_2_1_147_1","doi-asserted-by":"crossref","unstructured":"Banda Ramadan Peter Christen Huizhi Liang Ross W. Gayler and David Hawking. 2013. Dynamic similarity-aware inverted indexing for real-time entity resolution. In Trends and Applications in Knowledge Discovery and Data Mining\u2014PAKDD International Workshops. 47--58. Banda Ramadan Peter Christen Huizhi Liang Ross W. Gayler and David Hawking. 2013. Dynamic similarity-aware inverted indexing for real-time entity resolution. In Trends and Applications in Knowledge Discovery and Data Mining\u2014PAKDD International Workshops. 47--58.","DOI":"10.1007\/978-3-642-40319-4_5"},{"key":"e_1_2_1_148_1","doi-asserted-by":"publisher","DOI":"10.14778\/1938545.1938546"},{"key":"e_1_2_1_149_1","doi-asserted-by":"publisher","DOI":"10.14778\/3157794.3157797"},{"key":"e_1_2_1_150_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.datak.2017.10.004"},{"volume-title":"Proceedings of the MultiConference on Computer Simulation. 
150--155","year":"2007","author":"Rice Stephen V","key":"e_1_2_1_151_1"},{"key":"e_1_2_1_152_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-35176-1_29"},{"key":"e_1_2_1_153_1","doi-asserted-by":"publisher","DOI":"10.7250\/csimq.2018-16.04"},{"key":"e_1_2_1_154_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-66917-5_19"},{"key":"e_1_2_1_155_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-93417-4_37"},{"key":"e_1_2_1_156_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jbi.2011.02.008"},{"volume-title":"Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM\u201912)","year":"2012","author":"Sarma Anish Das","key":"e_1_2_1_157_1"},{"volume-title":"Proceedings of the 2018 World Wide Web Conference on World Wide Web (WWW\u201918)","author":"Schneider Andrew T.","key":"e_1_2_1_158_1"},{"key":"e_1_2_1_159_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11390-016-1620-z"},{"key":"e_1_2_1_160_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2011.5767835"},{"key":"e_1_2_1_161_1","doi-asserted-by":"publisher","DOI":"10.14778\/2994509.2994533"},{"key":"e_1_2_1_162_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2019.03.006"},{"key":"e_1_2_1_163_1","first-page":"1208","article-title":"Schema-agnostic progressive entity resolution","volume":"31","author":"Simonini Giovanni","year":"2019","journal-title":"IEEE TKDE"},{"volume-title":"Proceedings of the 25th International Conference on Data Engineering (ICDE\u201909)","author":"Sismanis Y.","key":"e_1_2_1_164_1"},{"key":"e_1_2_1_165_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-25073-6_41"},{"key":"e_1_2_1_166_1","doi-asserted-by":"publisher","DOI":"10.1145\/2567948.2577263"},{"volume-title":"Proceedings of the 2014 International Conference on Privacy in Statistical Databases (PSD\u201914)","author":"Steorts Rebecca C.","key":"e_1_2_1_167_1"},{"key":"e_1_2_1_168_1","first-page":"578","article-title":"Record matching over query 
results from multiple web databases","volume":"22","author":"Su Weifeng","year":"2010","journal-title":"IEEE TKDE"},{"key":"e_1_2_1_169_1","doi-asserted-by":"publisher","DOI":"10.14778\/2078331.2078332"},{"key":"e_1_2_1_170_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-68288-4_37"},{"key":"e_1_2_1_171_1","doi-asserted-by":"publisher","DOI":"10.5555\/3304222.3304381"},{"key":"e_1_2_1_172_1","unstructured":"Zequn Sun Qingheng Zhang Wei Hu Chengming Wang Muhao Chen Farahnaz Akrami and Chengkai Li. 2020. A benchmarking study of embedding-based entity alignment for knowledge graphs. CoRR abs\/2003.07743. Zequn Sun Qingheng Zhang Wei Hu Chengming Wang Muhao Chen Farahnaz Akrami and Chengkai Li. 2020. A benchmarking study of embedding-based entity alignment for knowledge graphs. CoRR abs\/2003.07743."},{"key":"e_1_2_1_173_1","unstructured":"Saravanan Thirumuruganathan Shameem A. Puthiya Parambath Mourad Ouzzani Nan Tang and Shafiq Joty. 2018. Reuse and adaptation for entity resolution through transfer learning. CoRR abs\/1809.11084. Saravanan Thirumuruganathan Shameem A. Puthiya Parambath Mourad Ouzzani Nan Tang and Shafiq Joty. 2018. Reuse and adaptation for entity resolution through transfer learning. 
CoRR abs\/1809.11084."},{"key":"e_1_2_1_174_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.3301297"},{"volume-title":"Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC\u201916)","year":"2016","author":"van Erp Marieke","key":"e_1_2_1_176_1"},{"key":"e_1_2_1_177_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2015.7113286"},{"key":"e_1_2_1_178_1","doi-asserted-by":"publisher","DOI":"10.1145\/3035918.3035931"},{"key":"e_1_2_1_179_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732977.2732982"},{"volume-title":"Proceedings of the WWW2009 Workshop on Linked Data on the Web (LDOW\u201909)","year":"2009","author":"Volz Julius","key":"e_1_2_1_180_1"},{"key":"e_1_2_1_181_1","doi-asserted-by":"publisher","DOI":"10.14778\/2350229.2350263"},{"key":"e_1_2_1_182_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2610505"},{"key":"e_1_2_1_183_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463676.2465280"},{"key":"e_1_2_1_184_1","doi-asserted-by":"publisher","DOI":"10.14778\/2021017.2021020"},{"key":"e_1_2_1_185_1","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2723739"},{"key":"e_1_2_1_186_1","first-page":"47","article-title":"Explaining data integration","volume":"41","author":"Wang Xiaolan","year":"2018","journal-title":"IEEE Data Eng. 
Bull."},{"volume-title":"Jeffrey Xu Yu, and Hong Cheng","year":"2017","author":"Wang Yihan","key":"e_1_2_1_187_1"},{"key":"e_1_2_1_188_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D18-1032"},{"volume-title":"Proceedings of the International Workshop on Information Quality in Information Systems (IQIS\u201904)","year":"2004","author":"Weis Melanie","key":"e_1_2_1_189_1"},{"key":"e_1_2_1_190_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2006.49"},{"key":"e_1_2_1_191_1","doi-asserted-by":"publisher","DOI":"10.1145\/2396761.2398719"},{"key":"e_1_2_1_192_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536336.2536337"},{"key":"e_1_2_1_193_1","first-page":"1111","article-title":"Pay-as-you-go entity resolution","volume":"25","author":"Whang Steven Euijong","year":"2013","journal-title":"IEEE TKDE"},{"volume-title":"Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data (SIGMOD\u201909)","author":"Whang S. E.","key":"e_1_2_1_194_1"},{"key":"e_1_2_1_195_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-00887-0_13"},{"key":"e_1_2_1_196_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1989.1.2.270"},{"key":"e_1_2_1_197_1","doi-asserted-by":"publisher","DOI":"10.1145\/3132847.3132876"},{"volume-title":"Similarity Search\u2014The Metric Space Approach","author":"Zezula Pavel","key":"e_1_2_1_198_1","doi-asserted-by":"crossref","DOI":"10.1007\/0-387-29151-2"},{"key":"e_1_2_1_199_1","doi-asserted-by":"publisher","DOI":"10.1145\/2795218.2795222"},{"key":"e_1_2_1_200_1","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/887\/1\/012058"},{"key":"e_1_2_1_201_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2019\/754"},{"key":"e_1_2_1_202_1","doi-asserted-by":"publisher","DOI":"10.1145\/3336191.3371813"},{"key":"e_1_2_1_203_1","doi-asserted-by":"publisher","DOI":"10.1145\/3308558.3313578"},{"key":"e_1_2_1_204_1","unstructured":"Qibin Zheng Xingchun Diao Jianjun Cao Xiaolei Zhou Yi Liu and Hongmei Li. 2018. 
Multi-modal space structure: A new kind of latent correlation for multi-modal entity resolution. CoRR abs\/1804.08010."},{"key":"e_1_2_1_205_1","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2017\/595"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3418896","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,1]],"date-time":"2023-01-01T23:45:15Z","timestamp":1672616715000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3418896"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,6]]},"references-count":204,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2021,11,30]]}},"alternative-id":["10.1145\/3418896"],"URL":"https:\/\/doi.org\/10.1145\/3418896","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"type":"print","value":"0360-0300"},{"type":"electronic","value":"1557-7341"}],"subject":[],"published":{"date-parts":[[2020,12,6]]},"assertion":[{"value":"2020-04-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-12-06","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}