LANCE: Piercing to the Heart of Instance Matching Tools

Saveta, Tzanina; Daskalaki, Evangelia; Flouris, Giorgos; Fundulaki, Irini; Herschel, Melanie; Ngomo, Axel-Cyrille Ngonga

doi:10.1007/978-3-319-25007-6_22

Tzanina Saveta²⁵,
Evangelia Daskalaki²⁵,
Giorgos Flouris²⁵,
Irini Fundulaki²⁵,
Melanie Herschel²⁶ &
…
Axel-Cyrille Ngonga Ngomo²⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9366))

Included in the following conference series:

International Semantic Web Conference

2507 Accesses

Abstract

One of the main challenges in the Data Web is the identification of instances that refer to the same real-world entity. Choosing the right framework for this purpose remains tedious, as current instance matching benchmarks fail to provide end users and developers with the necessary insights pertaining to how current frameworks behave when dealing with real data. In this paper, we present lance, a domain-independent instance matching benchmark generator which focuses on benchmarking instance matching systems for Linked Data. lance is the first Linked Data benchmark generator to support complex semantics-aware test cases that take into account expressive OWL constructs, in addition to the standard test cases related to structure and value transformations. lance supports the definition of matching tasks with varying degrees of difficulty and produces a weighted gold standard, which allows a more fine-grained analysis of the performance of instance matching tools. It can accept any linked dataset and its accompanying schema as input to produce a target dataset implementing test cases of varying levels of difficulty. We provide a comparative analysis with lance benchmarks to assess and identify the capabilities of state of the art instance matching systems as well as an evaluation to demonstrate the scalability of lance’s test case generator.

This work was partially supported by the EU FP7 projects LDBC (FP7-ICT-2011-8 #317548) and H2020 PARTHENOS (#654119).

Download to read the full chapter text

Chapter PDF

Putting Instance Matching to the Test: Is Instance Matching Ready for Reliable Data Linking?

DLUBM: A Benchmark for Distributed Linked Data Knowledge Base Systems

RiMOM-IM: A Novel Iterative Framework for Instance Matching

Article 08 January 2016

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Bhattacharya, I., Getoor, L.: Entity resolution in graphs. Mining Graph Data. Wiley and Sons (2006)
Google Scholar
Elmagarmid, A.K., Ipeirotis, P.G., et al.: Duplicate Record Detection: A Survey. TKDE 19(1) (2007)
Google Scholar
Li, C., Jin, L., et al.: Supporting efficient record linkage for large data sets using mapping techniques. In: WWW (2006)
Google Scholar
Noessner, J., Niepert, M., Meilicke, C., Stuckenschmidt, H.: Leveraging terminological structure for object reconciliation. In: Aroyo, L., Antoniou, G., Hyvönen, E., ten Teije, A., Stuckenschmidt, H., Cabral, L., Tudorache, T. (eds.) ESWC 2010. LNCS, vol. 6089, pp. 334–348. Springer, Heidelberg (2010)
Chapter Google Scholar
Isele, R., Jentzsch, A., et al.: Silk server - adding missing links while consuming linked data. In: COLD (2010)
Google Scholar
Ngonga Ngomo, A.-C., Auer, S.: LIMES - a time-efficient approach for large-scale link discovery on the web of data. IJCAI (2011)
Google Scholar
Stefanidis, K., Efthymiou, V., et al.: Entity resolution in the web of data. In: WWW, Companion Volume (2014)
Google Scholar
Weis, M., Naumann, F., et al.: A duplicate detection benchmark for XML and relational data. In: IQIS (2006)
Google Scholar
Ontology Alignment Evaluation Initiative. http://oaei.ontologymatching.org/
Zaiss, K., Conrad, S., et al.: A benchmark for testing instance-based ontology matching methods. In: KMIS (2010)
Google Scholar
Alexe, B., Tan, W.-C., et al.: STBenchmark: towards a benchmark for mapping systems. In: PVLDB (2008)
Google Scholar
Saveta, T., Daskalaki, E., et al.: Pushing the limits of instance matching systems: a semantics-aware benchmark for linked data. In: WWW, Companion Volume (2015)
Google Scholar
Daskalaki, E., Fundulaki, I., et al.: Instance matching benchmarks for linked data. In: ISWC (Tutorial) (2014)
Google Scholar
Euzenat, J., Ferrara, A., et al.: Results of the ontology alignment evaluation initiative 2009. In: ISWC Workshop on Ontology Matching (OM) (2009)
Google Scholar
Euzenat, J.: Results of the ontology alignment evaluation initiative. In: OM (2010)
Google Scholar
OAEI Instance Matching (2010). http://oaei.ontologymatching.org/2010
Euzenat, J., et. al.: Final results of the ontology alignment evaluation initiative 2011. In: OM (2011)
Google Scholar
Aguirre, J.L., et. al.: Results of the ontology alignment evaluation initiative 2012. In: OM (2012)
Google Scholar
Dragisic, Z., Eckert, K., et al.: Results of the ontology alignment evaluation initiative 2013. In: OM (2013)
Google Scholar
Dragisic, Z., Eckert, K., et al.: Results of the ontology alignment evaluation initiative 2014. In: OM (2014)
Google Scholar
Ferrara, A., Montanelli, S., Noessner, J., Stuckenschmidt, H.: Benchmarking matching applications on the semantic web. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., De Leenheer, P., Pan, J. (eds.) ESWC 2011, Part II. LNCS, vol. 6644, pp. 108–122. Springer, Heidelberg (2011)
Chapter Google Scholar
Krompass, D., Nickel, M., et al.: Non-negative tensor factorization with RESCAL. In: TML (2013)
Google Scholar
Nickel, M., Tresp, V., et al.: Factorizing YAGO: scalable machine learning for linked data. In: WWW (2012)
Google Scholar
Goutte, C., Gaussier, É.: A probabilistic interpretation of precision, recall and F-score, with implication for evaluation. In: Losada, D.E., Fernández-Luna, J.M. (eds.) ECIR 2005. LNCS, vol. 3408, pp. 345–359. Springer, Heidelberg (2005)
Chapter Google Scholar
Jiménez-Ruiz, E., Cuenca Grau, B.: LogMap: logic-based and scalable ontology matching. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 273–288. Springer, Heidelberg (2011)
Chapter Google Scholar
Romero, A.A., Grau, B.C., et al.: MORe: a modular OWL reasoner for ontology classification. In: ORE, pp. 61–67 (2013)
Google Scholar
Daskalaki, E., Plexousakis, D.: OtO matching system: a multi-strategy approach to instance matching. In: Ralyté, J., Franch, X., Brinkkemper, S., Wrycza, S. (eds.) CAiSE 2012. LNCS, vol. 7328, pp. 286–300. Springer, Heidelberg (2012)
Chapter Google Scholar
Ngomo, A.-C.N., Lyko, K.: EAGLE: efficient active learning of link specifications using genetic programming. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 149–163. Springer, Heidelberg (2012)
Chapter Google Scholar
Li, J., Tang, J., et al.: Rimom: A dynamic multistrategy ontology alignment framework. TKDE 21(8) (2009)
Google Scholar
Massmann, S., Raunich, S., et al.: Evolution of the COMA match system. Ontology Matching 49 (2011)
Google Scholar
Bizer, C., Schultz, A.: The Berlin SPARQL Benchmark. IJSWIS 5(2) (2009)
Google Scholar
Morsey, M., Lehmann, J., Auer, S., Ngonga Ngomo, A.-C.: DBpedia SPARQL benchmark – performance assessment with real queries on real data. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 454–469. Springer, Heidelberg (2011)
Chapter Google Scholar
Ma, L., Yang, Y., Qiu, Z., Xie, G., Pan, Y., Liu, S.: Towards a complete OWL ontology benchmark. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 125–139. Springer, Heidelberg (2006)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science-FORTH, Heraklion, Greece
Tzanina Saveta, Evangelia Daskalaki, Giorgos Flouris & Irini Fundulaki
IPVS - University of Stuttgart, Stuttgart, Germany
Melanie Herschel
IFI/AKSW, University of Leipzig, Leipzig, Germany
Axel-Cyrille Ngonga Ngomo

Authors

Tzanina Saveta
View author publications
You can also search for this author in PubMed Google Scholar
Evangelia Daskalaki
View author publications
You can also search for this author in PubMed Google Scholar
Giorgos Flouris
View author publications
You can also search for this author in PubMed Google Scholar
Irini Fundulaki
View author publications
You can also search for this author in PubMed Google Scholar
Melanie Herschel
View author publications
You can also search for this author in PubMed Google Scholar
Axel-Cyrille Ngonga Ngomo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tzanina Saveta .

Editor information

Editors and Affiliations

Pontificia Universidad Católica de Chile, Santiago de Chile, Chile
Marcelo Arenas
Universidad Politecnica de Madrid, Boadilla del Monte, Spain
Oscar Corcho
University of Southampton, Southampton, United Kingdom
Elena Simperl
Department of Computational Social Science, GESIS Leibniz-Institut, Köln, Nordrhein-Westfalen, Germany
Markus Strohmaier
The Open University, Milton Keynes, United Kingdom
Mathieu d'Aquin
IBM Research, Yorktown Heights, New York, USA
Kavitha Srinivas
Elsevier Labs., Amsterdam, The Netherlands
Paul Groth
School of Medicine, Stanford University, Stanford, California, USA
Michel Dumontier
Lehigh University, Bethlehem, Pennsylvania, USA
Jeff Heflin
DAYTON, Ohio, USA
Krishnaprasad Thirunarayan
Wright State University, Dayton, Ohio, USA
Krishnaprasad Thirunarayan
University of Koblenz-Landau, Koblenz, Rheinland-Pfalz, Germany
Steffen Staab

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Saveta, T., Daskalaki, E., Flouris, G., Fundulaki, I., Herschel, M., Ngomo, AC.N. (2015). LANCE: Piercing to the Heart of Instance Matching Tools. In: Arenas, M., et al. The Semantic Web - ISWC 2015. ISWC 2015. Lecture Notes in Computer Science(), vol 9366. Springer, Cham. https://doi.org/10.1007/978-3-319-25007-6_22

Download citation

DOI: https://doi.org/10.1007/978-3-319-25007-6_22
Published: 30 October 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25006-9
Online ISBN: 978-3-319-25007-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

LANCE: Piercing to the Heart of Instance Matching Tools

Abstract

Chapter PDF

Similar content being viewed by others

Putting Instance Matching to the Test: Is Instance Matching Ready for Reliable Data Linking?

DLUBM: A Benchmark for Distributed Linked Data Knowledge Base Systems

RiMOM-IM: A Novel Iterative Framework for Instance Matching

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

LANCE: Piercing to the Heart of Instance Matching Tools

Abstract

Chapter PDF

Similar content being viewed by others

Putting Instance Matching to the Test: Is Instance Matching Ready for Reliable Data Linking?

DLUBM: A Benchmark for Distributed Linked Data Knowledge Base Systems

RiMOM-IM: A Novel Iterative Framework for Instance Matching

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation