Connecting Semantic Situation Descriptions with Data Quality Evaluations—Towards a Framework of Automatic Thematic Map Evaluation
Abstract
:1. Introduction
2. State of the Art
2.1. Related Work on Data Quality
2.2. Definitions of Data Quality for Geospatial Data
2.3. Data Quality Metrics
2.4. Grounding Data Quality
2.5. Thematic Maps
2.6. Ontologies for Modelling (Situation-Specific) Geospatial Data
2.7. Ontologies for Modelling Data Quality
2.8. Related Work on Map Data Quality Assessment
3. Modelling
3.1. Ontological Model for Situations
3.1.1. Related Thematic Map Exploration: Situations
- More than one unique object needs to be linked by the property
- The property needs to be frequently used with the individuals of this class, which is to be determined by a relative threshold
- If the objects described by the property constitute a string value or an owl:Class, the individual string value or class name occurrence needs to be greater than 1
SELECT (COUNT(distinct ?con) AS ?countcon) (COUNT(?rel) AS ?countrel) (COUNT(distinct ?val) AS ?countval) ?rel ?relLabel WHERE { ?con wdt:P31 wd:Q3914 . ?con wdt:P625 ?coord . ?con wdt:P17 wd:Q183 . ?con ?rel ?val . SERVICE wikibase:label {bd:serviceParam wikibase:language “en” . } } GROUP BY ?rel ?relLabel ORDER BY ?relLabel
3.1.2. Modeling Activities and Situations
3.1.3. Activities
3.2. Relating Data Quality Metrics to Situations
3.2.1. Defining Relations to Be Evaluated
SELECT ?prop ?val ?valLabel ?super ?superLabel WHERE { ?prop wikibase:directClaim <PROPERTYURI> . ?prop wdt:P1629 ?val . ?val wdt:P31* ?super . SERVICE wikibase:label bd:serviceParam wikibase:language “[AUTOLANGUAGE],en”. } }
3.2.2. Defining Eligible Ranges
- For Data Properties:
- Analysis of assigned values and clustering of the achieved results.Assumption: The majority of assigned values should be well-formed and represent a tendency to a positive or neutral assignment. This assumption may not be true in every knowledge base, but may be a fair assumption in crowdsourced data.
- If annotated as is the case for example with many Wikidata properties (e.g., P2048-height), the minimum and maximum cardinalities can be used as first indications of a valid range
- For Object Properties:
- If a domain is defined, check if the assigned individual fits the domain
- If no domain is defined check the graph distance of the assigned concept to the original concept (e.g., the hospital)
3.3. Data Quality Metric Eligibility
Eligible Data Quality Metric: A data quality metric is an eligible data quality metric if the dataset is of such a structure that the data quality metric could be executed on it.
- Geometry Data Quality Metrics: Data quality metrics (intrinsic/extrinsic) operating on one or more geometries with or without a reference data set comparison.Requirements: A geometry or raster data set
- Attribute Data Quality Metrics: Data quality metrics (intrinsic/extrinsic) dealing with attribute completeness or the existence of data attributesRequirements: A feature set
- Thematic Data Quality metrics: Data quality metrics dealing with the values of thematic attributes.Requirements: Thematic Mapping and attributes
- Metadata Quality metrics: Operating on one or more metadata of geometriesRequirements: Metadata annotations
- Vicinity Data Quality metrics: Operating on a neighbourhood of geometries. Only relevant if the thematic data relies on a neighbourhood descriptionRequirements: Geometry neighbourhood
3.4. Data Quality Metric Feasibility
Feasible Data Quality Metric: A data quality metric is a feasible metric if all requirements for the calculation of the data quality metric have been met.
3.5. Data Quality Metric Relevance
Relevant Data Quality Metric: A data quality metric is a relevant data quality metric if it either contributes to the thematic map representation or is linked to a property relevant to the thematic map representation.
3.6. Data Quality Metric Priority
- Priority 1: Every metric which negative result renders the map completely unusable in a certain area: Missing or erroneous Dealbreaker attributes and Geometry Validity attributes which might expose an invalid or non-renderable geometry
- Priority 2: Metrics which are directly or indirectly related to the dealbreaker attribute of the thematic map and/or evaluate related relations: The reasoning here is that these values potentially contribute more to the thematic map’s quality than other relations
- Priority 3: Trustfulness Metrics: Metrics which may expose the quality of the whole data set by analyzing metadata about its creation
- Priority 4: General geospatial data quality metrics analyzing the quality of the geometry apart from geometry validity
- Priority 5: All other metrics which are eligible, feasible and relevant for the application
3.7. Combining Eligibility and Relevance: Constructing a Requirement Profile
ex:HospitalAccessibility rdf:type owl:NamedIndividual , semgis:ThematicMap ; semgis:hasRequirementProfile ex:HospitalAccessibilityRequirements . ex:HospitalAccessibilityRequirements rdf:type owl:NamedIndividual , semgis:RequirementProfile ; semgis:hasDealbreakerAttribute <http://www.wikidata.org/prop/direct/P2846> . semgis:hasEligibleMetric dq:GeometryValidity . semgis:hasEligibleMetric dq:AttributeAvailability . ....
- The requirement profile can be downloaded and assessed using data quality web services which results determine the quality assurance of the map
- The requirement profile can be evaluated using reasoning rules in a knowledge base. An external process (e.g., the Data Quality Service) enters data quality results for this analysis.
3.8. Requirement Profile Similarity
Similar Requirement Profile: A requirement profile is similar to another requirement profile if the concepts of the situations which are to be evaluated have a close similarity score and the data quality metrics used in the requirement profile are similar as defined by the metrics grounding, dimension and classification.
- Semantic Similarity [60] of the class/situation to evaluate according to a threshold (percentage score)
- Matching data quality metrics (awarded 1 point per matching metric) and similar data quality metrics according to a given threshold (awarded 0.5 points per metric) divided weighed by the metrics priority in the requirement profile and divided by the total number of data quality metrics being used.
Algorithm 1: Semantic Similarity algorithm: The algorithm consists of the calculation of a semantic similarity using a given metric (semanticSimilarity) which is used to relate single metrics and the class which describes the requirement profile. The aggregated percentage score is used as the similarity score by the system. |
3.9. Ontology Model Overview
4. Towards Constructing an Automated Data Quality Evaluation System
4.1. System Workflow
- Thematic Map Exploration: The system analyzes the knowledge base of geospatial objects for eligible thematic map candidates. It generates a list of thematic map candidates for the end-user
- Data Quality Evaluation Generation: The system generates a list of eligible data quality metrics for each thematic map and performs a data quality analysis of general data quality metrics (cf. Section 2.2)
- Preferential Range assignment: The system tries to determine the preferred ranges for eligible properties without a use case context. This first assessment is likely to produce a general applicability result and not a use case-specific result and is marked as automatically generated in the knowledge base
- Manual Usecase Definition: The user defines data sources needed for the use case at hand which may itself be defined using a Semantic concept, if applicable
- Requirement Profile Suggestion: A set of requirement profiles is suggested to the end-user either by finding appropriate requirement profiles in a semantic database or by generating a suggestion as shown in Section 3.7. Suggested requirement profiles are related according to their similarity scores, as suggested in Section 3.8.
- Initial data quality assessment: An initial data quality assessment based on the generated or loaded requirement profile is conducted. An aggregated quality map layer is created.
- Manual improvements by the user: The user improves parameters of the generated requirement profile until the exact parameters are met. The requirement profile is saved accordingly to be reused by other users.
4.2. System Components
- Data Quality Service: A web service application providing data quality metric calculation services with the option to store and to use user-defined reference data sets and/or online resources (e.g., OpenStreetMap, Here Map) for extrinsic data quality metrics.
- Data Quality Triple Store: A triple store consisting of semantic descriptions of data quality metrics and links to other knowledge bases, including use case specific information.Requirement Profiles are stored in this triple store.
- Geospatial Data Repository: A triple store or set of web services providing access to geospatial data as a basis for thematic map creation. If geospatial data is not provided in RDF, it can be converted to an ontology model such as described in Reference [61] using appropriate methods on-the-fly [62,63]
- Linked Open Data Cloud: Further ontologies and linked data applications which give context to the requirement profile generation service
- Requirement Profile Generation Service: A web application using the Data Quality Triple Store and Linked Open Data Cloud to generate requirement profiles
- Similarity Evaluation Service: A web service suggesting similar requirement profiles for users based on the similarity score calculation. This web service should be co-located with the Requirement Profile generation service.
- Data Quality Evaluation Service: A web application loading a saved requirement profile from the Data Quality Triple store and using the Geospatial Data Repository triggers the execution of metrics in the Data Quality Service. Finally, this service stores the metric results in the Data Quality Triple Store, a third-party triple store or just returns the result as JSON [64]
- WebFrontend: A web frontend which allows to:
- Detect Thematic Maps
- Trigger requirement profile generation
- Trigger the Data Quality Evaluation service
- Visualizes the result as a layer for data quality
4.3. A Repository for Data Quality Metrics
- Intrinsic Metric , for example, Geometry Validity of a geometry.
- Intrinsic History Metric whereas a list of timepoints is given which is used to calculate the intrinsic metric upon. The tolerance value lists the timely tolerance in the history, for example,
- Extrinsic Metric whereas timepoint indicates the time revision of the extrinsic dataset to compare against, , a resource locator of the extrinsic dataset and tolerance a tolerance considering the timepoint, that is, the maximum amount of time between extrinsic dataset creation and theThe parameter might be omitted if an appropriate dataset can be inferred by other means
- Metadata Metric whereas the metadata of the given dataset is compared to the gold standard dataset
- Extrinsic data source suitability description: An ontological description of the suitability of an extrinsic data source to be used for certain situations
- Data Quality metric preferable tendency: Indicates which outcome of the data quality metric value is usually preferable
5. Experimental Setup
5.1. Input Data
5.2. Thematic Map Creation and Basic Evaluation
5.3. Evaluating a Situation
5.4. Requirement Profile Generation
- Thematic Map : Hospital Capacity
- Thematic Map : Road Network (updated for disaster circumstances)
- Thematic Map : Evacuation Points
- : Hospital Capacity (e.g., number of beds)
- : Road Network (the geometries itself)
- : Evacuation Points (classification)
5.4.1. Related Attributes
- Hospital: Amount of doctors
- Road Network: None
- Evacuation Point: None
5.4.2. Eligible/Feasible/Relevant Data Quality Metrics
Completeness, Freshness, Positional Accuracy, Attribute Existence, Geometry Validity, HausdorffDistance
Attribute Existence, Completeness, Freshness, Positional Accuracy, Geometry Validity
Attribute Existence, Geometry Validity, Positional Accuracy
5.4.3. Final Joint Requirement Profile for Situation
5.5. Interpretation
5.6. Reasoning of Suitable Data Quality Metrics
RescueMission(?mission) & hasPart(?mission,?thematicmap) & hasRequirementProfile(?thematicmap,?reqprof) & has DQEvaluation(?reqprof,?measurement) & IsAvailableStudents(?measurement) & Validity(?vmeasurement) & hasValue(?vmeasurement,?vmeasurementvalue) & smallerThan(?vmeasurementvalue, true8sd:boolean) -> Feasible(?mission)
SELECT ?item ?geom WHERE { ?item rdf:type wd:Q16917 . ?item geo:hasGeometry/geo:asWKT ?geom . ?item dq:hasRequirementProfile ?reqprof . ?reqprof semgis:isAppliedOn wd:Q16917 . ?reqprof rdf:type semgis:HospitalAccessibilityRequirements . ?reqprof semgis:hasDQEvaluation ?evaluation . ?evaluation rdf:type semgis:Feasible . }
6. Discussion
- Improve their own quality assurance processes by evaluating how other people evaluated the same kind of data
- Quickly estimate which map in which area is suitable for a use case by reusing requirement profiles for the same or similar situations
- Get suggestions on how to evaluate thematic maps and use cases using the algorithm presented in this publication
Limitations
7. Conclusions and Future Work
Funding
Conflicts of Interest
References
- Homburg, T.; Boochs, F. Situation-Dependent Data Quality Analysis for Geospatial Data Using Semantic Technologies. In Business Information Systems Workshops; Abramowicz, W., Paschke, A., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 566–578. [Google Scholar] [CrossRef]
- Homburg, T.; Steppan, S.; Wuerriehausen, F. Semantic Data integration and quality assurance of thematic maps in the German geographic authority. In Proceedings of the 2nd Workshop on Quality of Open Data (QOD2019), Sevilla, Spain, 28 June 2019. [Google Scholar]
- Antenucci, J.C.; Brown, K.; Croswell, P.L.; Kevany, M.J.; Archer, H. Geographic Information Systems: A Guide to the Technology; Springer: New York, NY, USA, 1991. [Google Scholar]
- Pipino, L.L.; Lee, Y.W.; Wang, R.Y. Data quality assessment. Commun. ACM 2002, 45, 211–218. [Google Scholar] [CrossRef]
- Ballou, D.P.; Tayi, G.K. Enhancing data quality in data warehouse environments. Commun. ACM 1999, 42, 73–78. [Google Scholar] [CrossRef]
- Wang, R.Y.; Strong, D.M. Beyond accuracy: What data quality means to data consumers. J. Manag. Inf. Syst. 1996, 12, 5–33. [Google Scholar] [CrossRef]
- Strong, D.M.; Lee, Y.W.; Wang, R.Y. Data quality in context. Commun. ACM 1997, 40, 103–110. [Google Scholar] [CrossRef]
- Wang, R.Y. A product perspective on total data quality management. Commun. ACM 1998, 41, 58–65. [Google Scholar] [CrossRef]
- Redman, T.C. The impact of poor data quality on the typical enterprise. Commun. ACM 1998, 41, 79–82. [Google Scholar] [CrossRef]
- Wang, R.Y.; Storey, V.C.; Firth, C.P. A framework for analysis of data quality research. IEEE Trans. Knowl. Data Eng. 1995, 7, 623–640. [Google Scholar] [CrossRef] [Green Version]
- Redman, T.C. Data Quality: The Field Guide; Butterworth-Heinemann: Oxford, UK, 2001. [Google Scholar]
- Lee, Y.W.; Pipino, L.; Funk, J.D.; Wang, R.Y. Journey to Data Quality; MIT Press: Cambridge, MA, USA, 2006. [Google Scholar]
- Wang, R.Y.; Ziad, M.; Lee, Y.W. Data Quality; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2006; Volume 23. [Google Scholar]
- Batini, C.; Cappiello, C.; Francalanci, C.; Maurino, A. Methodologies for data quality assessment and improvement. Acm Comput. Surv. CSUR 2009, 41, 1–52. [Google Scholar] [CrossRef] [Green Version]
- Pôças, I.; Gonçalves, J.; Marcos, B.; Alonso, J.; Castro, P.; Honrado, J.P. Evaluating the fitness for use of spatial data sets to promote quality in ecological assessment and monitoring. Int. J. Geogr. Inf. Sci. 2014, 28, 2356–2371. [Google Scholar] [CrossRef]
- Devillers, R.; Jeansoulin, R.; Goodchild, M.F. Fundamentals of Spatial Data Quality; ISTE: London, UK, 2006; ISBN 1905209568. [Google Scholar]
- Jackson, S.; Mullen, W.; Agouris, P.; Crooks, A.; Croitoru, A.; Stefanidis, A. Assessing completeness and spatial error of features in volunteered geographic information. ISPRS Int. J. Geo-Inf. 2013, 2, 507–530. [Google Scholar] [CrossRef]
- Drummond, J. Positional accuracy. Elem. Spat. Data Qual. 1995, 31–58. [Google Scholar]
- Stehman, S.V.; Wickham, J.; Smith, J.; Yang, L. Thematic accuracy of the 1992 National Land-Cover Data for the eastern United States: Statistical methodology and regional results. Remote Sens. Environ. 2003, 86, 500–516. [Google Scholar] [CrossRef]
- Chaudhuri, G.; Clarke, K.C. Temporal accuracy in urban growth forecasting: A study using the SLEUTH model. Trans. GIS 2014, 18, 302–320. [Google Scholar] [CrossRef]
- Salgé, F. Semantic accuracy. In Elements of Spatial Data Quality; Elsevier: Amsterdam, Netherlands, 1995; pp. 139–151. [Google Scholar]
- Gronroos, C. Service quality: The six criteria of good perceived service. Rev. Bus. 1988, 9, 10. [Google Scholar]
- Castelfranchi, C.; Falcone, R.; Pezzulo, G. Integrating trustfulness and decision using fuzzy cognitive maps. In Proceedings of the International Conference on Trust Management, Heraklion, Greece, 28–30 May 2003; pp. 195–210. [Google Scholar]
- Zaveri, A.; Rula, A.; Maurino, A.; Pietrobon, R.; Lehmann, J.; Auer, S. Quality assessment for linked data: A survey. Semant. Web 2016, 7, 63–93. [Google Scholar] [CrossRef]
- Heinrich, B.; Kaiser, M.; Klier, M. How to Measure Data Quality? A Metric-Based Approach; Twenty Eighth International Conference on Information Systems: Montreal, QC, Canada, 2007. [Google Scholar]
- Mooney, P.; Corcoran, P.; Winstanley, A.C. Towards quality metrics for OpenStreetMap. In Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, San Jose, CA, USA, 2 November 2010; pp. 514–517. [Google Scholar]
- Barron, C.; Neis, P.; Zipf, A. Towards intrinsic quality analysis of OpenStreetMap datasets. In Proceedings of the International Workshop on Action and Interaction in Volunteered Geographic Information (ACTIVITY), Leuven, Belgium, 14–17 May 2013. [Google Scholar]
- Barron, C.; Neis, P.; Zipf, A. A comprehensive framework for intrinsic OpenStreetMap quality analysis. Trans. GIS 2014, 18, 877–895. [Google Scholar] [CrossRef]
- Neis, P.; Zipf, A. Analyzing the contributor activity of a volunteered geographic information project—The case of OpenStreetMap. ISPRS Int. J. Geo-Inf. 2012, 1, 146–165. [Google Scholar] [CrossRef]
- Lebo, T.; Sahoo, S.; McGuinness, D.; Belhajjame, K.; Cheney, J.; Corsar, D.; Garijo, D.; Soiland-Reyes, S.; Zednik, S.; Zhao, J. Prov-O: The Prov Ontology; W3C Recommendation; World Wide Web Consortium: Cambridge, MA, USA, 30 April 2013. [Google Scholar]
- Di, L.; Shao, Y.; Kang, L. Implementation of geospatial data provenance in a web service workflow environment with ISO 19115 and ISO 19115-2 lineage model. IEEE Trans. Geosci. Remote. Sens. 2013, 51, 5082–5089. [Google Scholar]
- Yue, P.; He, L. Geospatial data provenance in cyberinfrastructure. In Proceedings of the 2009 17th International Conference on Geoinformatics, Fairfax, VA, USA, 12–14 August 2009; pp. 1–4. [Google Scholar]
- Mocnik, F.B.; Mobasheri, A.; Griesbaum, L.; Eckle, M.; Jacobs, C.; Klonner, C. A grounding-based ontology of data quality measures. J. Spat. Inf. Sci. 2018, 2018, 1–25. [Google Scholar] [CrossRef]
- Mocnik, F.B. Linked Open Data Vocabularies for Semantically Annotated Repositories of Data Quality Measures (Short Paper). In Proceedings of the 10th International Conference on Geographic Information Science (GIScience 2018), Melbourne, Australia, 28 August 2018. [Google Scholar]
- Thrower, N.J. Maps and Civilization: Cartography in Culture and Society; University of Chicago Press: Chicago, IL, USA, 2008. [Google Scholar]
- Mauro, N.; Ardissono, L. Thematic Maps for Geographical Information Search. In Proceedings of the Adjunct Publication of the 25th Conference on User Modeling, Adaptation and Personalization, Bratislava, Slovakia, 9–12 July 2017; pp. 337–342. [Google Scholar]
- Rowell, J.; Ungar, S. Feeling our way: Tactile map user requirements-a survey. In Proceedings of the International Cartographic Conference, La Coruna, Spain, 9–16 July 2005. [Google Scholar]
- Ramm, F.; Topf, J.; Chilton, S. OpenStreetMap: Using and Enhancing the Free Map of the World; UIT Cambridge: Cambridge, UK, 2011. [Google Scholar]
- Pavlenko, A. Mapnik. Retrieved December 2019. 2012. Available online: https://github.com/mapnik/mapnik (accessed on 15 November 2020).
- Bärlocher, M. OpenSeaMap—Die freie seekarte. Hydrogr. Nachr. 2012, 91, 10–13. [Google Scholar]
- Battle, R.; Kolas, D. Enabling the geospatial semantic web with parliament and geosparql. Semant. Web 2012, 3, 355–370. [Google Scholar] [CrossRef]
- Meditskos, G.; Dasiopoulou, S.; Efstathiou, V.; Kompatsiaris, I. Ontology patterns for complex activity modelling. In Proceedings of the International Workshop on Rules and Rule Markup Languages for the semantic web, Seattle, WA, USA, 11–13 July 2013; pp. 144–157. [Google Scholar]
- Kokar, M.M.; Matheus, C.J.; Baclawski, K. Ontology-based situation awareness. Inf. Fusion 2009, 10, 83–98. [Google Scholar] [CrossRef]
- Matheus, C.J.; Kokar, M.M.; Baclawski, K. A core ontology for situation awareness. In Proceedings of the Sixth International Conference on Information Fusion, Cairns, Queensland, Australia, 8–11 July 2003; Volume 1, pp. 545–552. [Google Scholar]
- Debattista, J.; Lange, C.; Auer, S. DaQ, An Ontology for Dataset Quality Information; Linked Data On The Web Workshop: Seoul, Korea, 2014. [Google Scholar]
- Missier, P.; Belhajjame, K.; Cheney, J. The W3C PROV family of specifications for modelling provenance metadata. In Proceedings of the 16th International Conference on Extending Database Technology, Genoa, Italy, 18–22 March 2013; pp. 773–776. [Google Scholar]
- Yilmaz, C.; Comert, C.; Yildirim, D. SDQO and SfO, Ontologies for Spatial Data Quality Assessment. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2019, 42, 1275–1279. [Google Scholar] [CrossRef] [Green Version]
- Mondzech, J.; Sester, M. Quality analysis of OpenStreetMap data based on application needs. Cartogr. Int. J. Geogr. Inf. Geovisualization 2011, 46, 115–125. [Google Scholar] [CrossRef]
- Bruin, S.D.; Bregt, A.; Ven, M.V.d. Assessing fitness for use: The expected value of spatial data sets. Int. J. Geogr. Inf. Sci. 2001, 15, 457–471. [Google Scholar] [CrossRef]
- Agumya, A.; Hunter, G.J. A risk-based approach to assessing the ‘fitness for use’of spatial data. URISA J. 1999, 11, 33–44. [Google Scholar]
- Devillers, R.; Bédard, Y.; Jeansoulin, R.; Moulin, B. Towards spatial data quality information analysis tools for experts assessing the fitness for use of spatial data. Int. J. Geogr. Inf. Sci. 2007, 21, 261–282. [Google Scholar] [CrossRef]
- McBride, B. The resource description framework (RDF) and its vocabulary description language RDFS. In Handbook on Ontologies; Springer: Berlin/Heidelberg, Germany, 2004; pp. 51–65. [Google Scholar]
- Antoniou, G.; Van Harmelen, F. Web ontology language: Owl. In Handbook on Ontologies; Springer: Berlin/Heidelberg, Germany, 2004; pp. 67–92. [Google Scholar]
- Pérez, J.; Arenas, M.; Gutierrez, C. Semantics and Complexity of SPARQL. In Proceedings of the International semantic web Conference, Athens, GA, USA, 5–9 November 2006; pp. 30–43. [Google Scholar]
- Vrandečić, D.; Krötzsch, M. Wikidata: A free collaborative knowledge base. Commun. ACM 2014, 57, 78–85. [Google Scholar] [CrossRef]
- Meng, L.; Huang, R.; Gu, J. A review of semantic similarity measures in wordnet. Int. J. Hybrid Inf. Technol. 2013, 6, 1–12. [Google Scholar]
- World Wide Web Consortium. Data Catalog Vocabulary (DCAT). W3C Recommendation. 2014. Available online: https://www.w3.org/TR/vocab-dcat/ (accessed on 15 November 2020).
- Huttenlocher, D.P.; Klanderman, G.A.; Rucklidge, W.J. Comparing images using the Hausdorff distance. IEEE Trans. Pattern Anal. Mach. Intell. 1993, 15, 850–863. [Google Scholar] [CrossRef] [Green Version]
- Lassila, O.; Swick, R.R.; World Wide Web Consortium. Resource Description Framework (RDF) Model and Syntax Specification. W3C Recommendation. 1998. Available online: https://www.w3.org/TR/1998/WD-rdf-syntax-19980720/ (accessed on 15 November 2020).
- Ngan, L.D.; Hang, T.M.; Goh, A.E.S. Semantic similarity between concepts from different OWL ontologies. In Proceedings of the 2006 4th IEEE International Conference on Industrial Informatics, Singapore, 16–18 August 2006; pp. 618–623. [Google Scholar]
- Homburg, T.; Prudhomme, C.; Würriehausen, F.; Karmacharya, A.; Boochs, F.; Roxin, A.; Cruz, C. Interpreting heterogeneous geospatial data using semantic web technologies. In Proceedings of the International Conference on Computational Science and Its Applications, Beijing, China, 4–7 July 2016; pp. 240–255. [Google Scholar]
- Prudhomme, C.; Homburg, T.; Ponciano, J.J.; Boochs, F.; Roxin, A.; Cruz, C. Automatic Integration of Spatial Data into the semantic web. In Proceedings of the 13th International Conference on Web Information Systems and Technologies, Porto, Portugal, 25–27 April 2017; pp. 107–115. [Google Scholar]
- Prudhomme, C.; Homburg, T.; Ponciano, J.J.; Boochs, F.; Cruz, C.; Roxin, A.M. Interpretation and automatic integration of geospatial data into the semantic web. Computing 2020, 102, 365–391. [Google Scholar] [CrossRef]
- Bray, T. The Javascript Object Notation (Json) Data Interchange Format. RFC 8259; The Internet Engineering Task Force. 2017. Available online: https://tools.ietf.org/id/draft-ietf-jsonbis-rfc7159bis-04.html (accessed on 15 November 2020).
- Butler, H.; Daly, M.; Doyle, A.; Gillies, S.; Hagen, S.; Schaub, T. The Geojson Format. RFC 7946; The Internet Engineering Task Force. 2016. Available online: https://tools.ietf.org/html/rfc7946 (accessed on 15 November 2020).
- Takacs, B. Comparing face images using the modified Hausdorff distance. Pattern Recognit. 1998, 31, 1873–1881. [Google Scholar] [CrossRef]
- Horrocks, I.; Patel-Schneider, P.F.; Boley, H.; Tabet, S.; Grosof, B.; Dean, M. SWRL: A semantic web rule language combining OWL and RuleML. W3C Memb. Submiss. 2004, 21, 1–31. [Google Scholar]
- Sui, D.; Elwood, S.; Goodchild, M. Crowdsourcing Geographic Knowledge: Volunteered Geographic Information (VGI) in Theory and Practice; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
- Sahuguet, A.; Azavant, F. Wysiwyg Web Wrapper Factory (W4f); Working Paper, UPenn Engineering. 1999. Available online: https://www.semanticscholar.org/paper/WysiWyg-Web-Wrapper-Factory-(W4F)-Sahuguet-Azavant/fa6c421e483e012578642829524e5b2961e1c097 (accessed on 15 November 2020).
Classes | Definition |
---|---|
semgis:Situation | Defines a situation |
semgis:RequirementProfile | List of requirements |
semgis:ThematicMap | ThematicMap definition |
dq:Metric | Data Quality Metric |
dq:Observation | Data Quality Measurement |
dq:Dimension | Data Quality Dimension |
Class | Associated Property | Definitions |
---|---|---|
semgis:RequirementProfile | semgis:hasDealBreakAttribute | Dealbreaker Attribute Definition |
semgis:RequirementProfile | semgis:hasEligibleMetric | Defines an eligible metric |
semgis:RequirementProfile | semgis:hasRelevantMetric | Defines a relevant metric |
semgis:RequirementProfile | semgis:hasFeasibleMetric | Defines a feasible metric for a siuation |
dq:Metric | semgis:hasRange | Defines a metrics eligible range to be considered good |
dq:Metric | semgis:isCommonlyAssociatedWith | Defines concepts which commonly used this metric for evaluation |
dq:Metric | semgis:hasPriority | Defines the priority of the metric for the particular requirement profile |
semgis:ThematicMap | semgis:isAbout | Defines the topic of a Thematic Map as an owl:Class |
semgis:ThematicMap | semgis:isPartOf | ThematicMap as part of a situation |
semgis:ThematicMap | semgis:isEvaluatedBy | RequirementProfile |
semgis:Situation | semgis:hasPart | Thematic Map |
Properties | Definitions |
---|---|
semgis:isRelatedTo | Related Situation or Thematic Map |
semgis:isEvaluatedBy | Eligible data quality metric |
semgis:dependsOn | Required Datasets |
semgis:hasSubject | Related owl Class describing the knowledge domain of the activity |
Metric | Target | Priority | Range | Dealbreaker |
---|---|---|---|---|
Completeness | Number of Students | 1 | >0 | true |
Geometry Validity | Geometry | 1 | true | true |
Positional Accuracy | Geometry | 4 | >12 | false |
The_geom | Name | Capacity | Address |
---|---|---|---|
POINT(..) | Catholic Clinic Mainz | 717 | An der Goldgrube 11, Mainz, Germany |
Metric | Target | Priority | Range | Dealbreaker |
---|---|---|---|---|
Completeness | Number of Beds | 1 | N/A | true |
Geometry Validity | Geometry | 1 | N/A | true |
Positional Accuracy | Geometry | 4 | >12 | false |
Metric | Target | Priority | Range | Dealbreaker |
---|---|---|---|---|
Geometry Validity | Geometry | 1 | true | true |
Positional Accuracy | Geometry | 2 | >12 | false |
HausdorffDistance | Geometry | 2 | >0.8 | false |
Metric | Target | Priority | Range | Dealbreaker |
---|---|---|---|---|
Completeness | Elevation | 1 | >0 | true |
Completeness | Capacity | 1 | >0 | true |
Geometry Validity | Geometry | 1 | true | true |
Positional Accuracy | Geometry | 4 | >12 | false |
Metric | Dataset | Target | Priority | Range | Dealbreaker |
---|---|---|---|---|---|
Completeness | Hospital | Number of Beds | 1 | >300 | true |
Completeness | Rescue Points | Elevation | 1 | >0 | true |
Completeness | Rescue Points | Operator | 1 | >90% | true |
Completeness | Rescue Points | Capacity | 1 | >0 | true |
Geometry Validity | All | Geometry | 1 | true | true |
Freshness | All | Geometry + Attributes | 3 | <365 days | false |
Positional Accuracy | All | Geometry | 4 | >12 | false |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Homburg, T. Connecting Semantic Situation Descriptions with Data Quality Evaluations—Towards a Framework of Automatic Thematic Map Evaluation. Information 2020, 11, 532. https://doi.org/10.3390/info11110532
Homburg T. Connecting Semantic Situation Descriptions with Data Quality Evaluations—Towards a Framework of Automatic Thematic Map Evaluation. Information. 2020; 11(11):532. https://doi.org/10.3390/info11110532
Chicago/Turabian StyleHomburg, Timo. 2020. "Connecting Semantic Situation Descriptions with Data Quality Evaluations—Towards a Framework of Automatic Thematic Map Evaluation" Information 11, no. 11: 532. https://doi.org/10.3390/info11110532
APA StyleHomburg, T. (2020). Connecting Semantic Situation Descriptions with Data Quality Evaluations—Towards a Framework of Automatic Thematic Map Evaluation. Information, 11(11), 532. https://doi.org/10.3390/info11110532