{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,10,30]],"date-time":"2024-10-30T21:26:43Z","timestamp":1730323603699,"version":"3.28.0"},"publisher-location":"New York, NY, USA","reference-count":45,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,1,2]]},"DOI":"10.1145\/3430984.3431031","type":"proceedings-article","created":{"date-parts":[[2020,12,28]],"date-time":"2020-12-28T05:34:44Z","timestamp":1609133684000},"page":"208-212","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":4,"title":["Detecting Data Accuracy Issues in Textual Geographical Data by a Clustering-based Approach"],"prefix":"10.1145","author":[{"given":"Maria Angela","family":"Pellegrino","sequence":"first","affiliation":[{"name":"Universit\u00e0 degli Studi di Salerno, Italy"}]},{"given":"Luca","family":"Postiglione","sequence":"additional","affiliation":[{"name":"Universit\u00e0 degli Studi di Salerno, Italy"}]},{"given":"Vittorio","family":"Scarano","sequence":"additional","affiliation":[{"name":"Universit\u00e0 degli Studi di Salerno, Italy"}]}],"member":"320","published-online":{"date-parts":[[2021,1,2]]},"reference":[{"key":"e_1_3_2_1_1_1","first-page":"313","article-title":"Dimensions and assessment methods of data quality in health information systems","volume":"33","author":"Alipour Jahanpour","year":"2017","unstructured":"Jahanpour Alipour and Maryam Ahmadi . 2017 . Dimensions and assessment methods of data quality in health information systems . Acta Medica Mediterranea 33 , 2 (2017), 313 \u2013 320 . Jahanpour Alipour and Maryam Ahmadi. 2017. Dimensions and assessment methods of data quality in health information systems. Acta Medica Mediterranea 33, 2 (2017), 313\u2013320.","journal-title":"Acta Medica Mediterranea"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1287\/mnsc.31.2.150"},{"key":"e_1_3_2_1_3_1","volume-title":"Methodologies for data quality assessment and improvement. ACM computing surveys 41, 3","author":"Batini Carlo","year":"2009","unstructured":"Carlo Batini , Cinzia Cappiello , Chiara Francalanci , and Andrea Maurino . 2009. Methodologies for data quality assessment and improvement. ACM computing surveys 41, 3 ( 2009 ), 1\u201352. Carlo Batini, Cinzia Cappiello, Chiara Francalanci, and Andrea Maurino. 2009. Methodologies for data quality assessment and improvement. ACM computing surveys 41, 3 (2009), 1\u201352."},{"key":"e_1_3_2_1_4_1","volume-title":"Open Data Hopes and Fears: Determining the Barriers of Open Data. In Conference for E-Democracy and Open Government (CeDEM). 69\u201381","author":"Beno Martin","year":"2017","unstructured":"Martin Beno , Kathrin Figl , Jurgen Umbrich , and Axel Polleres . 2017 . Open Data Hopes and Fears: Determining the Barriers of Open Data. In Conference for E-Democracy and Open Government (CeDEM). 69\u201381 . Martin Beno, Kathrin Figl, Jurgen Umbrich, and Axel Polleres. 2017. Open Data Hopes and Fears: Determining the Barriers of Open Data. In Conference for E-Democracy and Open Government (CeDEM). 69\u201381."},{"key":"e_1_3_2_1_5_1","volume-title":"Linked Data - Design Issues. https:\/\/www.w3.org\/DesignIssues\/LinkedData.html [Online] Last access","author":"Berners-Lee Tim","year":"2019","unstructured":"Tim Berners-Lee . 2006. Linked Data - Design Issues. https:\/\/www.w3.org\/DesignIssues\/LinkedData.html [Online] Last access October 2019 . Tim Berners-Lee. 2006. Linked Data - Design Issues. https:\/\/www.w3.org\/DesignIssues\/LinkedData.html [Online] Last access October 2019."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/3308558.3313602"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2011.5767864"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.3390\/ijerph110505170"},{"key":"e_1_3_2_1_9_1","first-page":"236","article-title":"Using Machine Learning techniques for Data Quality Monitoring in CMS and ALICE experiments","volume":"350","author":"Deja Kamil","year":"2019","unstructured":"Kamil Deja . 2019 . Using Machine Learning techniques for Data Quality Monitoring in CMS and ALICE experiments . Proceedings of Science 350 , 236 . Kamil Deja. 2019. Using Machine Learning techniques for Data Quality Monitoring in CMS and ALICE experiments. Proceedings of Science 350, 236.","journal-title":"Proceedings of Science"},{"key":"e_1_3_2_1_10_1","volume-title":"A clustering approach for detecting implausible observation values in electronic health records data. BMC medical informatics and decision making 19, 1","author":"Estiri Hossein","year":"2019","unstructured":"Hossein Estiri , Jeffrey\u00a0 G Klann , and Shawn\u00a0 N Murphy . 2019. A clustering approach for detecting implausible observation values in electronic health records data. BMC medical informatics and decision making 19, 1 ( 2019 ), 142. Hossein Estiri, Jeffrey\u00a0G Klann, and Shawn\u00a0N Murphy. 2019. A clustering approach for detecting implausible observation values in electronic health records data. BMC medical informatics and decision making 19, 1 (2019), 142."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/CIST.2018.8596389"},{"volume-title":"Electronic Government - 18th IFIP WG 8.5 International Conference. 168\u2013179.","author":"Ferretti Giuseppe","key":"e_1_3_2_1_12_1","unstructured":"Giuseppe Ferretti , Delfina Malandrino , Maria\u00a0Angela Pellegrino , Andrea Petta , Gianluigi Renzi , Vittorio Scarano , and Luigi Serra . 2019. Orchestrated Co-creation of High-Quality Open Data Within Large Groups . In Electronic Government - 18th IFIP WG 8.5 International Conference. 168\u2013179. Giuseppe Ferretti, Delfina Malandrino, Maria\u00a0Angela Pellegrino, Andrea Petta, Gianluigi Renzi, Vittorio Scarano, and Luigi Serra. 2019. Orchestrated Co-creation of High-Quality Open Data Within Large Groups. In Electronic Government - 18th IFIP WG 8.5 International Conference. 168\u2013179."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/3325112.3325230"},{"key":"e_1_3_2_1_14_1","unstructured":"Google. 2010. OpenRefine. Open Source Tool. Google. 2010. OpenRefine. Open Source Tool."},{"key":"e_1_3_2_1_15_1","volume-title":"ADQuaTe: An Automated Data Quality Test Approach for Constraint Discovery and Fault Detection. In 20th International Conference on Information Reuse and Integration for Data Science (IRI). IEEE, 61\u201368","author":"Homayouni Hajar","year":"2019","unstructured":"Hajar Homayouni , Sudipto Ghosh , and Indrakshi Ray . 2019 . ADQuaTe: An Automated Data Quality Test Approach for Constraint Discovery and Fault Detection. In 20th International Conference on Information Reuse and Integration for Data Science (IRI). IEEE, 61\u201368 . Hajar Homayouni, Sudipto Ghosh, and Indrakshi Ray. 2019. ADQuaTe: An Automated Data Quality Test Approach for Constraint Discovery and Fault Detection. In 20th International Conference on Information Reuse and Integration for Data Science (IRI). IEEE, 61\u201368."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.3390\/s20071992"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1989.10478785"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.5958\/0976-5506.2019.00682.X"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICITSI.2016.7858197"},{"key":"e_1_3_2_1_20_1","unstructured":"Vladimir\u00a0I Levenshtein. 1966. Binary codes capable of correcting deletions insertions and reversals. In Soviet physics doklady Vol.\u00a010. 707\u2013710. Vladimir\u00a0I Levenshtein. 1966. Binary codes capable of correcting deletions insertions and reversals. In Soviet physics doklady Vol.\u00a010. 707\u2013710."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.3233\/IDA-184024"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Ling Lin and Jinshan Su. 2019. Anomaly detection method for sensor network data streams based on sliding window sampling and optimized clustering. Safety science 118(2019) 70\u201375. Ling Lin and Jinshan Su. 2019. Anomaly detection method for sensor network data streams based on sliding window sampling and optimized clustering. Safety science 118(2019) 70\u201375.","DOI":"10.1016\/j.ssci.2019.04.047"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3371425.3371435"},{"key":"e_1_3_2_1_24_1","volume-title":"An Automated Big Data Accuracy Assessment Tool. IEEE 4th International Conference on Big Data Analytics (ICBDA)","author":"Mylavarapu Goutam","year":"2019","unstructured":"Goutam Mylavarapu , J. Thomas , and K. Viswanathan . 2019 . An Automated Big Data Accuracy Assessment Tool. IEEE 4th International Conference on Big Data Analytics (ICBDA) ( 2019 ), 193\u2013197. Goutam Mylavarapu, J. Thomas, and K. Viswanathan. 2019. An Automated Big Data Accuracy Assessment Tool. IEEE 4th International Conference on Big Data Analytics (ICBDA) (2019), 193\u2013197."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.35940\/ijrte.C6435.098319"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"crossref","unstructured":"Zahra Nematzadeh Roliana Ibrahim Ali Selamat and Vahdat Nazerian. 2020. The synergistic combination of fuzzy C-means and ensemble filtering for class noise detection. Engineering Computations(2020). Zahra Nematzadeh Roliana Ibrahim Ali Selamat and Vahdat Nazerian. 2020. The synergistic combination of fuzzy C-means and ensemble filtering for class noise detection. Engineering Computations(2020).","DOI":"10.1108\/EC-05-2019-0242"},{"key":"e_1_3_2_1_27_1","unstructured":"Organisation for Economic Co-operation and Development (OECD). 2019. Open Government Data. https:\/\/www.oecd.org\/gov\/digital-government\/open-government-data.htm Organisation for Economic Co-operation and Development (OECD). 2019. Open Government Data. https:\/\/www.oecd.org\/gov\/digital-government\/open-government-data.htm"},{"key":"e_1_3_2_1_28_1","volume-title":"Data Cleansing Techniques for Large Enterprise Datasets. SRII Global Conference, 135 \u2013 144","author":"Prasad K.","year":"2011","unstructured":"K. Prasad , Tanveer Faruquie , Sachindra Joshi , Snigdha Chaturvedi , L.V. Subramaniam , and Mukesh Mohania . 2011 . Data Cleansing Techniques for Large Enterprise Datasets. SRII Global Conference, 135 \u2013 144 . K. Prasad, Tanveer Faruquie, Sachindra Joshi, Snigdha Chaturvedi, L.V. Subramaniam, and Mukesh Mohania. 2011. Data Cleansing Techniques for Large Enterprise Datasets. SRII Global Conference, 135 \u2013 144."},{"key":"e_1_3_2_1_29_1","unstructured":"T. Redman. 2016. Harvard business review. https:\/\/hbr.org\/2016\/09\/bad-data-costs-the-u-s-3-trillion-per-year [Online]. T. Redman. 2016. Harvard business review. https:\/\/hbr.org\/2016\/09\/bad-data-costs-the-u-s-3-trillion-per-year [Online]."},{"key":"e_1_3_2_1_30_1","volume-title":"Data Quality for the Information Age","author":"Redman C.","unstructured":"Thomas\u00a0 C. Redman . 1997. Data Quality for the Information Age ( 1 st ed.). Artech House, Inc. Thomas\u00a0C. Redman. 1997. Data Quality for the Information Age(1st ed.). Artech House, Inc.","edition":"1"},{"volume-title":"Data mining and knowledge discovery handbook","author":"Rokach Lior","key":"e_1_3_2_1_31_1","unstructured":"Lior Rokach and Oded Maimon . 2005. Clustering methods . In Data mining and knowledge discovery handbook . Springer , 321\u2013352. Lior Rokach and Oded Maimon. 2005. Clustering methods. In Data mining and knowledge discovery handbook. Springer, 321\u2013352."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ijinfomgt.2017.01.003"},{"key":"e_1_3_2_1_33_1","volume-title":"The Detection Algorithms for Similar Duplicate Data. In 6th International Conference on Systems and Informatics (ICSAI). IEEE, 1534\u20131542","author":"Yu Quan","year":"2019","unstructured":"Jin-yu Song, Quan Yu , and Ruo-yu Bao. 2019 . The Detection Algorithms for Similar Duplicate Data. In 6th International Conference on Systems and Informatics (ICSAI). IEEE, 1534\u20131542 . Jin-yu Song, Quan Yu, and Ruo-yu Bao. 2019. The Detection Algorithms for Similar Duplicate Data. In 6th International Conference on Systems and Informatics (ICSAI). IEEE, 1534\u20131542."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Besiki Stvilia Les Gasser Michael\u00a0B Twidale and Linda\u00a0C Smith. 2007. A framework for information quality assessment. J. of the American society for information science and technology 58 12(2007) 1720\u20131733. Besiki Stvilia Les Gasser Michael\u00a0B Twidale and Linda\u00a0C Smith. 2007. A framework for information quality assessment. J. of the American society for information science and technology 58 12(2007) 1720\u20131733.","DOI":"10.1002\/asi.20652"},{"key":"e_1_3_2_1_35_1","first-page":"383","article-title":"Detecting systemic data quality issues in electronic health records","volume":"264","author":"Ta N","year":"2019","unstructured":"Casey\u00a0 N Ta and Chunhua Weng . 2019 . Detecting systemic data quality issues in electronic health records . Studies in Health Technology and Informatics 264 (2019), 383 \u2013 387 . Casey\u00a0N Ta and Chunhua Weng. 2019. Detecting systemic data quality issues in electronic health records. Studies in Health Technology and Informatics 264 (2019), 383\u2013387.","journal-title":"Studies in Health Technology and Informatics"},{"volume-title":"Proc. Series(2019)","author":"Thawanthaleunglit N.","key":"e_1_3_2_1_36_1","unstructured":"N. Thawanthaleunglit and K. Sripanidkulchai . 2019. Sweeper: Automated data quality processing and model generation for data classification. ACM Inter. Conf . Proc. Series(2019) , 17\u201323. N. Thawanthaleunglit and K. Sripanidkulchai. 2019. Sweeper: Automated data quality processing and model generation for data classification. ACM Inter. Conf. Proc. Series(2019), 17\u201323."},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.cmpb.2019.01.012"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.giq.2016.02.001"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1080\/07421222.1996.11518099"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"crossref","unstructured":"Yan Wang Hao Zhang Yaxin Li Deyun Wang Yanlin Ma Tong Zhou and Jianguo Lu. 2016. A Data Cleaning Method for CiteSeer Dataset. In Web Information Systems Engineering. 35\u201349. Yan Wang Hao Zhang Yaxin Li Deyun Wang Yanlin Ma Tong Zhou and Jianguo Lu. 2016. A Data Cleaning Method for CiteSeer Dataset. In Web Information Systems Engineering. 35\u201349.","DOI":"10.1007\/978-3-319-48740-3_3"},{"key":"e_1_3_2_1_41_1","volume-title":"Proceedings of the Section on Survey Research","author":"Winkler E.","year":"1990","unstructured":"William\u00a0 E. Winkler . 1990 . String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage . In Proceedings of the Section on Survey Research ( Wachington, DC). 354\u2013359. William\u00a0E. Winkler. 1990. String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage. In Proceedings of the Section on Survey Research(Wachington, DC). 354\u2013359."},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIE.2019.2903774"},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1007\/s12599-019-00608-0"},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSYST.2016.2576026"},{"key":"e_1_3_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/INFOMAN.2017.7950422"}],"event":{"name":"CODS COMAD 2021: 8th ACM IKDD CODS and 26th COMAD","acronym":"CODS COMAD 2021","location":"Bangalore India"},"container-title":["Proceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3430984.3431031","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,13]],"date-time":"2023-01-13T18:47:25Z","timestamp":1673635645000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3430984.3431031"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,2]]},"references-count":45,"alternative-id":["10.1145\/3430984.3431031","10.1145\/3430984"],"URL":"https:\/\/doi.org\/10.1145\/3430984.3431031","relation":{},"subject":[],"published":{"date-parts":[[2021,1,2]]},"assertion":[{"value":"2021-01-02","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}