Abstract
The Semantic Web vision involves the production and use of large amounts of RDF data. There have been recent initiatives amongst the Semantic Web community, in particular the Linking Open Data activity and our own ReSIST project, to publish large amounts of RDF that are both interlinked and dereferenceable. The proliferation of such data gives rise to millions of URIs for non-information resources such as people, places and abstract things. Frequently, different data providers will mint different URIs for the same resource, giving rise to the problem of coreference. This paper describes the phenomenon of coreference, where it occurs in other disciplines and how it is relevant to the Semantic Web. We propose a ‘Consistent Reference Service’ for URI identity management and describe how this is being used in the infrastructure of a scalable Semantic Web system.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Shadbolt, N.R., Gibbins, N., Glaser, H., Harris, S., Schraefel, M.C.: CS AKTive Space or how we stopped worrying and learned to love the Semantic Web. IEEE Intelligent Systems, 41–47 (2004)
Lei, Y., Uren, V.S., Motta, E.: SemSearch: a search engine for the Semantic Web. In: Proceedings of 15th International Conference on Knowledge Engineering and Knowledge Management, Podebrady, Czech Republic, pp. 238–245 (2006)
Shirky, C.: 2001. The Semantic Web, Syllogism, and Worldview [15 February 15, 2007] [online], http://www.shirky.com/writings/semantic_syllogism.html
Suchanek, F.M., Kasneci, G., Weikum, G.: YAGO:A Core of Semantic knowledge. In: Proceedings International WWW Conference 2007, Banff, Alberta, Canada, pp. 697–706. ACM Press, New York (2007)
Linking Open Data Project, http://www.linkeddata.org/
Bizer, C., Cyganiak, R., Heath, T.: How to Publish Linked Data on the Web [July 20, 2007] [online], http://sites.wiwiss.fu-berlin.de/suhl/bizer/pub/LinkedDataTutorial/
Berners-Lee, T., Chen, Y., Chilton, L., Connoly, D., Dhanara, R., Hollenbach, J., Lerer, A., Sheets, D.: Tabulator:Exploring and Analyzing Linked Data on the Web. In: Proceedings 3rd International Semantic Web User Interaction Workshop, Athens, Georgia (2006)
DBpedia [July 1, 2007] [online], http://dbpedia.org/docs
Equivalence Mining and Matching Frameworks [July 1, 2007] [online], http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/EquivalenceMining
Yang, K., Jiang, J., Lee, H., Ho, J.: Extracting Citation Relationships from Web Documents for Author Disambiguation, Technical Report No.TR-IIS-06-017,Institute of Information Science, Academia Sinica, Taipei, Taiwan (December 2006)
Tan, Y.F., Kan, M.-Y., Lee, D.: Search Engine Driven Author Disambiguation. In: Proceedings 6th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 314–315. ACM Press, New York
Hernandez, M., Stolfo, S.: Real-world Data is Dirty: Data Cleansing and the Merge/Purge Problem. Data Mining and Knowledge Discovery 2(1), 9–37
Fellegi, I.P., Sunter, A.B.: A Theory for Record Linkage. Journal of the American Statistical Association 64(328), 1183–1210 (1969)
Alani, H., Dasmahapatra, S., Gibbins, N., Glaser, H., Harris, S., Kalfoglou, Y., O’Hara, K., Shadbolt, N.: Managing Reference: Ensuring Referential Integrity of Ontologies for the Semantic Web. In: Proceedings of 13th International Conference on Knowledge Engineering and Knowledge Management, Sigenza, Spain, pp. 317–334 (2002)
Resilience for Survivability in IST (ReSIST) Network of Excellence, http://resist-noe.eu
Bechofer, S., Van Harmelen, F., Hendler, J., Horrocks, I., Mcguiness, D.L., Schneider, P.F., Stein, L.A.: OWL Web Ontology Language Reference, Technical Report, W3C [online], http://www.w3.org/TR/owl-ref/
Booth, D.: URIs and the Myth of Resource Identity. In: Proceedings of the Workshop on Identity. Meaning and the Web (IMW 2006) at International World Wide Web Conference 2006, Edinburgh, Scotland (2006)
Halpin, H.: Identity, Reference and Meaning on the Web. In: Proceedings of the Workshop on Identity, Meaning and the Web (IMW 2006) at International World Wide Web Conference 2006, Edinburgh, Scotland (2006)
Berners-Lee, T.: Cool URIs Don’t Change [online], http://www.w3.org/Provider/Style/URI
Fielding, R.: W3C Technical Architecture Group mailing list (June 18, 2005) [online] http://lists.w3.org/Archives/Public/www-tag/2005Jun/0039
W3C Mailing List Discussion Thread, Terminology Question Concerning Web Architecture and Linked Data, http://lists.w3.org/Archives/Public/semantic-web/2007Jul/0049.html
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jaffri, A., Glaser, H., Millard, I. (2007). URI Identity Management for Semantic Web Data Integration and Linkage. In: Meersman, R., Tari, Z., Herrero, P. (eds) On the Move to Meaningful Internet Systems 2007: OTM 2007 Workshops. OTM 2007. Lecture Notes in Computer Science, vol 4806. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76890-6_40
Download citation
DOI: https://doi.org/10.1007/978-3-540-76890-6_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76889-0
Online ISBN: 978-3-540-76890-6
eBook Packages: Computer ScienceComputer Science (R0)