An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda
Next Article in Journal
A Study on Storm and Flood Insurance Management Mapping: Case Study of Incheon Metropolitan City
Next Article in Special Issue
Adaptive Choropleth Mapper: An Open-Source Web-Based Tool for Synchronous Exploration of Multiple Variables at Multiple Spatial Extents
Previous Article in Journal
An Efficient and Scene-Adaptive Algorithm for Vehicle Detection in Aerial Images Using an Improved YOLOv3 Framework
Previous Article in Special Issue
Performance Testing of istSOS under High Load Scenarios
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda

1
Department of Geomatics and Land Management, Makerere University, Kampala 00000, Uganda
2
Mediterranean Institute of Technology, South Mediterranean University, Tunis 1053, Tunisia
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
ISPRS Int. J. Geo-Inf. 2019, 8(11), 484; https://doi.org/10.3390/ijgi8110484
Submission received: 10 September 2019 / Revised: 17 October 2019 / Accepted: 22 October 2019 / Published: 26 October 2019
(This article belongs to the Special Issue Open Science in the Geospatial Domain)

Abstract

:
One of the grand challenges of disaster management is for stakeholders to be able to discover, access, integrate and analyze task-appropriate data together with their associated algorithms and work-flows. Even with a growing number of initiatives to publish data in the disaster management sector using open principles, integration and reuse are still difficult due to existing interoperability barriers within datasets. Several frameworks for assessing data interoperability exist but do not generate best practice solutions to existing barriers based on the assessment they use. In this study, we assess interoperability for datasets in the disaster management sector in Uganda and identify generic solutions to interoperability challenges in the context of disaster management. Semi-structured interviews and focus group discussions were used to collect qualitative data from sector stakeholders in Uganda. Data interoperability was measured to provide an understanding of interoperability in the sector. Interoperability maturity is measured using qualitative methods, while data compatibility metrics are computed from identifiers in the RDF-triple model. Results indicate high syntactic and technical interoperability maturity for data in the sector. On the contrary, there exists considerable semantic and legal interoperability barriers that hinder data integration and reuse in the sector. A mapping of the interoperability challenges in the disaster management sector to solutions reveals a potential to reuse established patterns for managing data interoperability. These include; the federated pattern, linked data patterns, broadcast pattern, rights and policy harmonization patterns, dissemination and awareness pattern, ontology design patterns among others. Thus a systematic approach to combining patterns is critical to managing data interoperability barriers among actors in the disaster management ecosystem.

1. Introduction

Disasters cause severe losses to communities, the effects of which are felt by the most vulnerable in communities. In order to cope with impacts of hazard events, organizations require resources and capabilities of other stakeholder institutions thus making effective collaboration and inter-operation essential. To make full use of existing data in the disaster management sector, the disaster community needs to adopt mechanisms that ensure resource findability, accessibility, interoperability, reuse and promote the growth of open data initiatives. Even with growing open data initiatives, integration and reuse of existing data by sector stakeholders is hindered by existing interoperability barriers. The barriers in sharing and coordinating information before and after hazard events often leads to a breakdown in prevention, preparedness and response to extreme hazard events [1]. Thus lack of interoperability for disaster/hazard related data is in itself an indicator of lack of resilience. Wilkinson et al. [2,3] discuss guiding principles to make data in repositories Findable, Accessible, Interoperable and Reusable (FAIR) and associated metrics for evaluating compliance. In the disaster domain, [4] have developed a framework for assessing interoperability as well as other domain interoperability frameworks listed by [5] can be reused in the disaster management sector.
While these frameworks provide a way of measuring interoperability, they lack approaches for identifying and development of best practice solutions to interoperability challenges based on interoperability measurement. A combination of interoperability concepts and patterns have been proposed as a way of overcoming the interoperability barriers [6,7]. Patterns provide reusable proven best practice solutions for recurring interoperability problems in a given context [8]. However, there still exists a gap in pattern demand and availability to enable domain experts to construct high quality interoperability solutions without requiring expert knowledge in interoperability.
Therefore, the contribution of this paper is: First, to provide an understanding of interoperability maturity and compatibility between existing data sets in the disaster management sector in Uganda. Second, from interoperability assessments, we identify potentially reusable interoperability patterns—also known as interoperability best practices. In this paper, we define interoperability patterns as typical solutions to commonly reoccurring interoperability problems within in the context of disaster management. These patterns can be systematically combined to manage interoperability problems in the disaster management sector in Uganda. The rest of the paper is structured as follows: Related work is presented in Section 2, materials and methods in Section 3 while results and discussions in Section 4 and Section 5 respectively. Section 6 contains the conclusions and future work.

2. Related Work

Currently, there exists is a strategic shift in disaster risk management with actors placing more attention on proactive disaster preparedness and prevention as opposed to reactive measures [9]. Some of the proactive measures include rigorous risk assessments, enhanced early warning systems, awareness and coordination of actors. Early warning typically constitute multiple actors (such as national sectors, private sector, civil society and communities) playing different yet complementary roles in disaster risk management [10,11]. The success of an enhanced early warning system typically depends on the ability of multiple actors to share data/ information (i.e., data about past events, real-time streams etc.) [12,13].

2.1. Disaster Data Interoperability

Vast amounts of hazard/disaster-related data existing today are typically dispersed geographically and owned by various entities making it difficult to access and reuse for disaster management. As such, Open Data initiatives aiming at increasing the availability of machine-readable data provided under an open license and facilitate data reuse have been advanced [14,15,16]. The FAIR principles defined by [2] provide a guide for ensuring such open disaster related data is findable, accessible, interoperable and reusable (FAIR) by machines/humans to support disaster management functions and so become as valuable as is possible [17]. The concept of interoperability serves to enable creation, exchange and consumption of data with explicit, shared expectation of context and meaning [18]. Different frameworks have been advanced to reconcile several perspectives on the notion of data interoperability [19]. These frameworks include the IMI [20], EIF [21] and LCIM [22] to mention but a few. After reviewing these frameworks, Rezaei et al. [23] categorises data interoperability into several distinct levels. These include: syntactic, semantic, technical and organizational interoperability. To ensure that data providers publish a machine-readable open data in an interoperable way, Barners-lee [24,25] introduces a five-star rating for data that relates to the levels of interoperability defined by Rezaei et al. [23].

2.2. Interoperability Assessment

Leal et al. [5] classify interoperability assessment types used by different frameworks into three categories. The first assessment type is the potentiality assessment that evaluates whether a system is mature enough to overcome existing interoperability barriers. Examples of such an assessment include [26] used in Uganda and [4] in the disaster domain. The second type is the compatibility assessment that evaluates the current state of data to identify potential conflicts that may cause interoperability problems as exemplified in  [22,27,28]. The last assessment type evaluates the quality of implementing an interoperable solution at run-time. While these frameworks provide a way to measure interoperability, Leal et al. [5] conclude that they lack approaches for identifying best practices necessary to improve interoperability based on assessment results.

2.3. Interoperability Patterns

To solve interoperability challenges, different authors [6,7,23] propose a combination of interoperability concepts and patterns (also known as information supply chain patterns). The concept of patterns emerges from software engineering [29], data modeling [30] and knowledge engineering [31] among others to interoperability. To overcome technical interoperability barriers, several patterns have been developed as best practice solutions. Examples of application domains where interoperability patterns have been applied to manage technical barriers include the Internet of Things (IoT) [32], enterprise information systems exchange (a catalog of patterns can be found here: http://project-interoperability.github.io/exchange-patterns/) [33] and health domain [7]. In the semantic ecosystem, Gangemi [34] proposes ontology design patterns (ODPs) as common strategies for implementing semantic interoperability  [8]. To solve organization interoperability barriers [6,7] identify several patterns. While the prospects for using interoperability patterns to manage interoperability barriers are enormous, there still exists a gap between their demand and availability [35]. Thus, there is a need to develop a critical amount of reusable patterns for managing disaster/ hazard related data interoperability.

3. Materials and Methods

In this study, we use the case of Uganda that experiences several disasters such as famine, floods, earthquakes, conflict, landslides and epidemics. With each hazard, there exists a lead institution (See the list of lead institutions in the Uganda disaster preparedness and management policy that can be downloaded from here: https://reliefweb.int/report/uganda/national-policy-disaster-preparedness-and-management) responsible for monitoring, assessing and reporting risk levels. For the institutions to achieve their mandates, they need to share, integrate and reuse data from other co-opted stakeholders. Through the disaster management sector working group, the Office of the Prime Minister (OPM) coordinates all stakeholders (data publishers and users) to promote integrated monitoring, reporting and actions to minimize hazard impacts. As a matter of fact, the current study investigates interoperability issues that affect the sharing and reuse of data among actors in the disaster management sector. In terms of theoretical scope, we investigate the data interoperability concern wherein two types of data are considered for the study (i.e., raw data and derived data like meteorological data obtained after processing raw sensor data). Therefore, this paper mainly addresses two research questions (RQ).
  • RQ1: To what extent is disaster/ hazard-related data interoperable in Uganda?
    Knowledge of data interoperability conflicts and maturity level are needed to guide the mapping of appropriate solutions to barriers in sharing/reusing  data in the disaster management sector.
  • RQ2: Are there emerging interoperability patterns for managing data interoperability barriers in the disaster sector?
    Pattern instances revealed through the mapping of local use case interoperability problems in the sector to generic solutions are critical for stakeholders to implement appropriate solutions with ease.
To answer RQ1, two assessment types (i.e., the maturity and compatibility assessment) are used to explore data interoperability in the disaster sector as shown in Figure 1. For the maturity test, data are collected using semi-structured interviews from seven data points representing disaster sector stakeholders (see Table A3) that generate and reuse data in the sector. To gain more insight into interoperability issues in the disaster sector, data from members of the disaster sector working group was collected through the formulation of four Focused Group Discussions (FGD) constituting a total of 16 data users from 12 stakeholder institutions (see description of institutions in Table A4). The disaster sector working group is an inter-agency disaster risk reduction platform that constitutes the following categories of actors; lead Ministries, UN agencies, NGOs, CSOs, researchers and other relevant stakeholders as defined in the disaster policy. In this study, we endeavor to cover all actors by selecting at least a data point in each category to guarantee good representation. Thus a total of 15 institutions were selected to participate in the interviews and FGD based on purposive sampling. This represents about 24.6% of the total number of sector stakeholders (The derived percentage is based on 61 institutions included in the DRR platform/sector mailing list of 20th July 2018). The compatibility assessment utilizes structured data obtained during prioritization in FGDs. However, disaster sector data is both derived and raw (i.e., requires pre-processing for use as input in assessment models e.g., extracting indicator data from imagery). To avoid researcher bias that could affect the internal validity of the study, we do not pre-process any raw data into derived data but rather use existing data.
Qualitative data collected from interviews and FGD were transcribed into text and later coded in atlas-Ti. The codes were then grouped into code groups and analyzed using content thematic analysis. To indirectly measure the interoperability maturity (i.e., the ability to overcome existing interoperability barriers), metrics in Table 1 were developed and classified according to data interoperability barriers presented by [23].
To generate metrics that are in line with recommendations of the FAIR framework, we combine FAIR metrics [3] with the 5-star rating for data [25]/ linked open data [24] as illustrated in Table A1. The decision of the final metric is based on a generalization of FAIR interoperability metrics and data rating concepts. Where no FAIR interoperability metrics exist, 5-star rating metrics are selected taking into consideration their compliance to findability, accessibility and reusability metrics of the FAIR framework e.g. in metric T1. A numerical measure of maturity is generated by assigning compliance value codes (where 0 means false, 1 for True and 0.5 for partially) based on metrics in Table 1. Finally, the maturity indicator is computed by averaging compliance values.
To measure interoperability compatibility, test datasets (research tools and data used can be found here https://github.com/mazimweal/mazimweal.github.io) are compared using the method in Colpaert et al. [27]. This method is best suited for evaluating semantic inter-linkedness between RDF datasets. However, it also provides a good experimentation on the semantic inter-linkedness and similarity of concepts in tabular datasets using simple RDF syntax.
Consider two datasets A and B, (seen in Figure 2) being assessed for compatibility. Four scenarios regarding sharing of identifiers and concepts are possible (see Figure 2). Below is an explanation of implications for scenarios in Figure 2 on interoperability.
  • Scenario 1: Does not affect interoperability since neither identifiers nor concepts are shared.
  • Scenario 2: Denotes true ID matches where identifiers increase interoperability since they represent different concepts that are shared by datasets.
  • Scenario 3: Two datasets share an identifier for which the concepts do not match. Therefore, it denotes a false ID match that negatively affects interoperability.
  • Scenario 4: Also negatively affects interoperability since concepts match in the two data sets yet the identifiers do not match.
In this method, a pair of identifiers (IDs) shared by two datasets is evaluated to determine whether they represent the same concept. Therefore, the interoperability calculation will only consider scenarios 2, 3, 4 and exclude 1.
The data (as illustrated in Figure 3) are transformed into the RDF format from which a list of identifiers is generated (see example in Figure 4). We then combine all identifiers from participating datasets into one file (totaling to 149 IDs).
Using Algorithm (Figure 5a), every pair of identifiers in any two datasets is compared to generate a list of questions of the form < i d 1 > ? u n k n o w n < i d 2 > (i.e., from 149 IDs, we generate 149 × 148 = 22,052 questions). The list of IDs are then compared for concept similarity by replacing the ? u n k n o w n predicate with the iiop:SameAs or iiop:notSameAs depending on real-world inputs crowd-sourced from domain experts. The relationship between two IDs is denoted by iiop:SameAs if concepts are the same and iiop:notSameAs (the identifier interoperability vocabulary can be found here; http://semweb.mmlab.be/ns/iiop#) when different in two datasets as shown below.
Prefix ns1: <https://w3id.org/gicentre/onto#>
@prefix iiop: <http://semweb.mmlab.be/ns/iiop#>
      <ns1:desinventar/Event>iiop:SameAs <ns1:response/Disaster_type>
<ns1:desinventar/Location>iiop:SameAs <ns1:response/Location>
<ns1:desinventar/Serial>iiop:SameAs <ns1:response/id>
The EYE reasoning server (find reasoning examples here https://n3.restdesc.org/n3/) [36] was then used to reason through the questions thus generating a list of answers (also known as interoperability statements) based on query Figure 5b. EYE is a semantic web reasoning engine that supports Euler paths. Through interoperability with Cwm via NTriple (N3), EYE provides a mechanism to query check, transform and filter information (See query in Figure 5b).
To measure semantic interoperability, the open DesInventar dataset is used as a reference for computing the interoperability ratio in Equation (1). DesInventar (click this link to access the Desinventar dataset: https://www.desinventar.net/DesInventar/profiletab.jsp?countrycode=uga) is an open disaster management information system managed by the mandated disaster management institution (OPM). The system enables collection, documentation and analysis of data about losses caused by disasters associated with natural hazards. Thus it helps to understand disaster trends and their impacts for better prevention, mitigation and preparedness among communities.
I d e n t i f i e r I n t e r o p e r a b i l i t y = ρ i d e n t i f i e r s ρ r e a l w o r l d
Consider a pair of datasets (see Figure 2), the ρ r e a l w o r l d relevance in Equation (1) is equal to the occurrence of consolidated identifiers in a joined dataset. While ρ i d e n t i f i e r relevance is calculated similarly as the real-world relevance except that the strings in the identifiers and the respective concepts they represent need to match. In other words, the ρ i d e n t i f i e r is the number of triples in a joined dataset with matching true positives (see scenario 2 in Figure 2). Based on expert opinion, the elicited datasets used in the compatibility assessment are rated to determine how easy it is to merge them with the reference dataset. These ratings are compared with interoperability metrics from Equation (1).
To answer RQ2, the resulting challenges from qualitative data analysis are matched with the problem part of pattern templates [37] to generate a list of relevant patterns in the disaster domain as illustrated in Figure 6. The solution space constitutes of generic patterns applied in other contexts other than disaster data interoperability. These were derived from literature and existing catalogs. The decision on the final pattern selected also takes into consideration the results from data interoperability assessment in the sector.

4. Results

4.1. To What Extent Is Disaster/Hazard-Related Data Interoperable among Stakeholders?

This section presents results of interoperability maturity level and compatibility measures for disaster data in Uganda. The results are organized based on the four interoperability levels defined by [23].

4.1.1. Technical Interoperability

Technical interoperability is reached when data/information can be shared and accessed using common standard protocols. For instance, a good indicator of technical interoperability is the ability to access disaster data via Http/URL and download it fully (see metric T1 in Table 2). All interviewed respondents manage web portals with downloadable disaster data/information except for one as shown in Table 2. This implies that technical interoperability can fully be achieved in the disaster sector in Uganda. In addition, sharing of disaster data is also achieved through email and flash-drives.

4.1.2. Syntactic Interoperability

From Table A2, all participating organization have unstructured data (e.g., pdf), structured data (S1) and machine readable data (S2). Data are encoded in different formats depending on intent for data/information use. For instance, the intent of keeping disaster data as spreadsheets best suits analysis whereas shape file format suits mapping. The only open machine readable format used is the CSV, and that no institution uses RDF and XML machine readable formats for sharing data. Interview and FGDs indicated that the respondents had no working knowledge of XML and RDF which are also semantic markup languages signaling a lack of capacity in linked data/semantic technology among data publishers and users (see Section 4.1.4). However, since there existed both structured and machine readable data all of which the serializations are common among all institutions, we can conclude from Table 2 that at 93%, syntactic interoperability for data in the disaster management sector is high.

4.1.3. Organization/Legal Interoperability

A logical indicator for measuring legal interoperability is whether data has an open license such that the machine can understand what the reuse rights are. Generally, disaster sector stakeholders have no licenses attached to data (O1 in Table 2) making it hard to determine the openness of the data available. However, most of this data can freely be accessed, downloaded and reused (O2). In some instances, only derived data are available rather than the raw data e.g., climatic data. Inter-institutional arrangements among users and publishers exist, as illustrated in O3 to enable sharing of disaster related data. The data publishers asserted that they have institutional policies on how data should be accessed. This makes it hard for data users to access the data because each institution has its own policy as illustrated in the quote below.
Respondent 4; “We know data is there but obtaining it is a problem. In most cases you need to use informal means to access it”
To solve these organizational interoperability barriers, there is a need to encourage license definition for data as well as harmonization of policies for maintaining and accessing data in the sector.

4.1.4. Semantic Interoperability

From Table 2, all respondents (data providers) have not used base URI for their data (Sem1) and neither do they have linked vocabularies (Sem2) describing concepts in their dataset. However, from qualitative results, semantic issues still exist while utilizing disaster/hazard-related data as illustrated in the quotes below.
Respondent 1: “The climate domain uses difficult terminology that is often understood differently in other domains or by the common man for example the concepts above average rainfall, late start, dry spell are understood differently for agricultural domain and yet for layman it could also mean something else”
Focus group discussion 2; “Since the desinventar database is populated by different volunteers, we are not sure whether all volunteers will conceptualize events for reporting purposes in the same way. For instance, there is no clear reporting on terms like rains, heavy rains, storms, windstorms, hailstorms.”
The quotes above imply that data users are aware that semantic issues exist when reusing disaster/hazard-related data. From Section 4.1.2, we note that there is a lack of capacity in using semantic markup. Therefore, the absence of implementations to manage semantic interoperability issues in data can be explained by the lack of capacity among data producers. An important indicator of whether data can be semantically integrated is through comparison of identifiers in different data sets to check for concepts that are similar in the real world. Upon experimenting with the RDF model, results in Table 3 generally indicate a high interoperability identifier ratio except for the land cover dataset. Since the land cover dataset does not share identifiers and concepts with any other dataset and vice-versa, it’s interoperability identifier value is zero. Thus has no impact on the interoperability of other datasets (see scenario 1 in Figure 2). Therefore, the results in Table 3 indicate that data in the disaster management sector is highly semantically inter-linked.
However, the semantic issue presented by respondent 1 requires a more expressive language to infer extreme weather events/conditions from rainfall data that can not be captured by simple RDF syntax used in the experiment above. While results in Table 3 suggest that linked data could improve semantic inter-linkedness between sector datasets, a more expressive way for representing explicit but also shared conceptualizations (i.e., through use of formal ontologies in OWL-DL) could be explored. Organizing sector data with ontologies requires an understanding of the complex and abstract ontology modeling notions expressed in OWL-DL. However, disaster domain experts are not often familiar with these constructs and will thus need a steep learning curve [38]. To mitigate the issue of lack of technical capacity in semantic technologies(as expressed in Section 4.1.2), best practice solutions (ontology design patterns and tools) can be adopted to make ontology modeling easy and intuitive enough for domain experts to develop ontologies for organising sector data efficiently and correctly without the help of Knowledge Engineers.

4.2. Are There Emerging Patterns for Managing Data Interoperability in the Disaster Management Sector?

Table 4 presents patterns (a description of revealed patterns is available here: https://github.com/mazimweal/mazimweal.github.io) obtained by mapping interoperability challenges with the problem part of the pattern template in the context of disaster management. Stakeholders in the disaster management sector share both raw and derived data. To help integrate and reuse data, respondents recommended the inclusion of metadata with pre-processing techniques. Also, respondents noted that shared data often has gaps, duplication and erroneous entries. To solve these data issues, respondents recommend best practices that will not only fill sector data gaps by enrichment but also ensure data validation with related datasets from other stakeholders. Linked data is a suitable technology for increasing data interoperability as explored by results in Table 3 not only by uniquely interlinking concepts but also through validation and enrichment of sector data with other sources. Therefore, several linked data patterns (a description of the patterns for modeling publishing and consuming of linked data are available at http://patterns.dataincubator.org/book/) for publishing and consuming linked data emerge. These include the “Follow Your Nose pattern” to find additional relevant data, “Progressive Enrichment pattern” for data quality and “Missing Isn’t Broken pattern” to handle messy and incomplete data.
A technical interoperability barrier identified in the sector is the inability to access data scattered in multiple disaster/hazard related repositories. The federated query pattern provides a solution to enable access to multiple data stores with a single query. Furthermore, disaster/early warning managers expressed need for a solution that can send geolocated alert or notification to stakeholders across multiple areas to communicate details of specific hazard events. The broadcast pattern would be instrumental in solving such a problem.
From Section 4.1.3, there is a difficulty in finding and accessing data from publishers as organizations have different policies and regulations for maintaining/accessing disaster-related data. Best practice solutions from respondents show the need for organizations to advertise their data. The political will of the government to invest in data is critical. Besides, mandated government institutions ought to take lead in the formulation and implementation of policies supporting the growth of open data initiatives and harmonization of data sharing practices. Thus the following patterns emerge for reuse: dissemination and awareness pattern, policy development and harmonization patterns (e.g. rights pattern, private policy pattern, policy override and conflict mediation patterns), coordination and implementation patterns. These patterns are synonymous with pattern instances described in a different context by Nehta [7].
To resolve semantic barriers identified in Section 4.1.4, respondents proposed a unified vocabulary across domains and institutions (see Table 4). Given that institutions maintain data based on different institutional policies as illustrated in Table 2, a bottom-up approach to ontology engineering that does not necessitate policy harmonization is preferable. As such, we identify content ontology design patterns for organizing data in the disaster management sector. Content ODPs allow institutions to align themselves to the interoperability infrastructure/global pattern schema on a gradual basis depending on the organizational capacity (see Section 4.1.4) and policies.

5. Discussion

5.1. Disaster Data Interoperability Evaluation

From the results above, there exists a high level of maturity for technical and syntactic interoperability for data in the sector. Data users can only access machine-readable CSV format (for instance, one can easily download disaster impact data from DesInventar database as CSV) implying that data is limited to the 3rd star of open data rating by BarnesLee [25]. On the contrary, semantic and organization interoperability is still a challenge in the sector. Since the study uses FAIR interoperability metrics for interoperability assessment, the maturity results show that disaster/hazard related data are far from fulfilling FAIR recommendations. In this paper, we explore a way of resolving interoperability issues through a mapping to best practice solutions. Therefore, future work could also focus on holistically exploring the FAIRness of disaster/hazard related data by further evaluating its findability, accessibility and reusability.
The identifier interoperability provides a mechanism to reason about the semantics of concepts in datasets based on the RDF data model. Results from compatibility measurement suggest that using the linked data can improve interoperability. However, users in the disaster management sector often use both structured, unstructured data (like pdf) and raw imagery that can be easily transformed using well-established procedures. A big challenge to this study is obtaining only the structured data available in the disaster sector to enable examination of conflicts in data. To address this problem, we triangulate data sources and interoperability assessment types to gain a bigger picture of data interoperability in the disaster sector. This allows demonstration of the fact that RDF alone is not sufficient to capture complex semantic notions (e.g. inferring extreme weather events/conditions from rainfall data/thresholds). This implies that a more expressive mechanism is required to capture such semantic constraints of data such as OWL-DL. However, there is a lack of technical capacity in semantic markup technologies such as XML/RDF/OWL. Thus, future work can focus on furthering work on intuitive RDF interfaces (such as RDF data generators https://rdforms.org/#!start.md), and ODPs [39] to support managing of data by users/ disaster domain experts.
From a legal perspective, clearly defined license (open/proprietary) increases interoperability for both humans and machine. In Table 2, most disaster-related data are freely available but have no license attached. Thus machines will not explicitly understand data reuse rights even when data are seemingly open. Even with inter-institutional collaborations, data interoperability in the sector will remain a challenge since institutions maintain data based on different policies/regulations. The mandated hazard lead government institutions have a role to develop policies that will open and harmonize data management among stakeholders. Since the role of policy development is outside the purview of a typical user/stakeholder in the disaster sector, there is a need to develop interoperable disaster management solutions that take into consideration changing policies and related constraints.

5.2. Interoperability Patterns

The interoperability assessment (in Section 4.1) together with elicited challenges enable identification of suitable interoperability best practices for managing data in the disaster management sector. Patterns identified can be grouped using different classification systems such as categorization by interoperability barrier (see Table 4), granularity (e.g., federated and broadcast patterns are more coarse than ODPs) etc. Construction of high-quality interoperability solutions by domain experts without expert knowledge entails a systematic effort to combine patterns such as those in Table 4. Figure 7 illustrates the concept of pattern combination in the context of a multi-hazard early warning system. This suggests that a catalog of patterns and tools for combining these patterns must be readily accessible. While patterns identified in Table 4 are specific to interoperability issues in Uganda’s disaster management sector, future work could also focus on the identification of new patterns at the different classification levels in a broader disaster community. The choice of pattern instances used in a composition is largely dependent on organizational policies and business processes. For instance, the selection of content ODPs in Section 4.2 to manage semantic barriers takes into consideration the fact that multi-hazard indicator data is managed based on different organizational policies. Therefore, applying patterns without reflecting on organizational policies and processes can lead to incorrectly modeled interoperability solutions.
At a coarse level(in this study, coarse level patterns are similar to architectural patterns), Figure 7 combines the federated and broadcast patterns to enable query multi-hazard data from different actors in the sector and generate alerts and notifications based on the queries. At a finer scale, willing institutions organise and publish hazard-related linked data using ontology design patterns as illustrated in [40]. Publishing data using ODPs allows for continuous connection to the unified global ontology pattern schema that is implicitly glued together with a foundational ontology. This linked hazard data can further be enriched and validated with related data on the web using linked-data patterns. A federated data query can then be posted onto data sources of various actors in the disaster management sector using interfaces such as that defined by [41]. The reasoning possibilities of ontologies facilitate the deduction of new facts during federated data queries that are critical for early warning.
Nonetheless, federated queries on linked data require that disaster data users and publishers are knowledgeable in the query technology in use e.g., SPARQL or GraphQL-LD. In Table 4, respondents recommended semantic systems with interfaces that hide such complexity. To overcome this challenge, future work could explore the use of natural language by data users/novices to make federated queries to decentralized repositories where semantic issues have been treated with ODPs. The work of [42,43] is useful to guide this conversation in the disaster management sector.

6. Conclusions

The success of disaster risk management efforts depends on the ability of multiple stakeholders to integrate and reuse existing disaster data. In this paper, we evaluate the extent to which data from multiple stakeholders in the disaster management sector is interoperable and identify patterns for reuse while solving interoperability challenges in the domain. Results indicate high technical and syntactic interoperability maturity for data in the sector. On the contrary, the maturity of semantic and legal interoperability is still low. The maturity results show that data in the disaster sector is far from fulfilling FAIR interoperability recommendations. Data users and publishers in the sector have a clear understanding of semantic issues existing in the datasets but lack the technical capacity to implement semantic technologies. Moreover, there also exists gaps in published data that are not easily accessed from multiple repositories. To solve these issues, we identify patterns that will enable disaster domain experts to construct highly interoperable solutions without the help of interoperability experts. Pattern identification takes into consideration the results of interoperability assessments. Pattern reuse requires a systematic effort to combine them to solve existing data interoperability barriers in the sector. The choice of patterns used will typically depend on organizational policies and business processes. In this study, we use a multi-hazard early warning solution to illustrate modular interoperability pattern reuse based on the assessment. The architecture of a multi-hazard early warning system will typically constitute the federated and broadcast patterns at a coarse level and linked data published using ODPs at a finer level. Publishing disaster data as linked data using ODPs is critical for data enrichment, validation and handling of semantic issues while taking into consideration the technical capacity and organizational constraints. Linked data from multiple publishers can, therefore, be integrated using federated queries. Thus future work could focus on the use of natural language for ODP based linked data query answering over federated repositories. This will enable data users to access data without technical competence in semantic technologies. More so, future work could consider identification of more interoperability patterns at different classification levels. For instance, at a finer level, researchers could identify specific ontology design patterns that can be reused to publish data in the disaster management sector. This study assesses interoperability and provides best practices solutions to data interoperability challenges based interoperability metrics of the FAIR framework. Future work could also focus on holistically exploring the FAIRness of disaster data/vocabularies in Uganda not only by further evaluating its Findability, Accessibility, and Reusability but also proposing ways to increase its FAIRness.

Author Contributions

Conceptualization, I.H., and A.M.; methodology, A.M. and I.H.; software, A.M., data curation, A.M. and A.G.; writing—original draft preparation, A.M.; writing—review and editing, I.H., A.G. and A.M.; supervision, I.H. and A.G.

Funding

This work is supported by the Swedish International Development Agency (SIDA) funding to Makerere University, Kampala in partnership with Chalmers and Gothenburg University, Sweden under the BRIGHT project 317.

Acknowledgments

We acknowledge support from Pieter Colpaert with regard to ideas on the FAIR framework and compatibility assessment.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CSOsCivil Society Organisations
CSVComma separated variable format
CwmCwm (pronounced coom) is a general-purpose data processor for the semantic web
DRRDisaster Risk Reduction
EWSEarly Warning Systems
EIFEuropean Interoperability Framework
EYEEuler Yet another proof Engine
FAIRFindable, Accessible, Interoperable and Reusable
FGDsFocused Group Discussions
GraphQL-LDGraphQL Query Language for linked data
HTMLHypertext Markup Language
HTTPHypertext Transfer Protocol
IMIInformation Modeling and Interoperability
LCIMLevel of Conceptual Interoperability Model
MDAMinistries, Departments and Agencies
NEWSNational Early Warning System
NGOsNon-Governmental Organisations
NOAANational Oceanic and Atmospheric Administration
ODPsOntology Design Patterns
OSMOpen Street Map
OWL_DLWeb Ontology Language -Description Logic profile
RDFResource description framework
SPARQLLanguage for querying linked data
UNUnited Nations
UNMAUganda National Meteorological Authority
URIUniform Resource Identifier
USGSUnited States Geological Survey (Context: data explorer)
XMLExtensible Markup Language

Appendix A. Interoperability Maturity

Appendix A.1. Interoperability Maturity Metrics

A full list of FAIR Metrics and corresponding descriptions can be found at https://github.com/FAIRMetrics/Metrics/tree/master/MaturityIndicators/Gen1.
Table A1. Defining metrics for interoperability maturity assessment.
Table A1. Defining metrics for interoperability maturity assessment.
Rezaei et al. (2014)5 Star Data Rating (Berners-Lee 2006)5 Stars of Linked Data Vocabulary Use (Janowicz et al. 2014)FAIR MetricsMetrics Derived for Study
Technical1st Star:Make your stuff available on the Web (whatever format) under an open license FM-A1.1: Access Protocol is open, free, and universally implementableT1: Data is published in any format and accessed via a common protocol such as Http URL
Syntactic2nd Star: make it available as structured data (e.g., Excel instead of image scan of a table) S1: Data is available as structured data e.g., excel instead of a scanned image
3rd Star: Make it available in a non-proprietary open format2nd Star: The information is available as machine-readable explicit axiomatization of the vocabulary. E.g OWL, RDF W3C standardsFM-RI: Meets Community Standards (meta)data meet domain-relevant community standards
FM-I1: Use a Knowledge Representation Language-(meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation
S2: Data is available in a machine-readable non-proprietary format e.g., CSV, XML, RDF, OWL, XML, GML etc
Semantic4th Star: use URIs to denote things, so that people can point at stuff1st Star: Linked Data without any vocabulary Sem 1: Use URI to denote concepts to enable unique identification
5th Star: link your data to other data to provide context3rd Star: The vocabulary is linked to other vocabularies
4th Star: Metadata about the vocabulary is available
5th Star: The vocabulary is linked to by other vocabularies (note: this is the reverse of 3rd star)
FM-I1: Use a Knowledge Representation Language: (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation
FM-I2: Use FAIR Vocabularies: IRIs representing the vocabularies used for (meta)data
FM-I3: Use Qualified References: (meta)data include qualified references to other (meta)data
Sem2: Data is available with vocabularies that are linked via unique and qualified references
OrganisationIst Star: Make your stuff available on the Web (whatever format) under an open license FM-R1.1: Accessible Usage License: (meta)data are released with a clear and accessible data usage licenseO1: License attached to data
O2: Data is accessed freely (but may does not have explicit license but it is openly accessible Observed in interview data)
FM-F1B: Identifier persistence-Whether there is a policy that describes what the provider will do in the event an identifier scheme becomes deprecated. O4: Uses a data
 policy that is binding
 to all institutions
O3: Presence of institutional arrangements (From interview data)

Appendix A.2. Summary of Data Used for Measuring Maturity

Table A2. Qualitative measurement of disaster data interoperability.
Table A2. Qualitative measurement of disaster data interoperability.
RespondentSyntaxTechnicalSemanticOrganisation
Respondent 1Excel/CSV, html, ppt, docx, pdf, shapefileEmail, websites-URL, DatabaseNo URI and
No linked data
Data License: not sure
Data is open: Partially
Policy: institutional
Institutional arrangements:yes
Respondent 2Excel/CSV, shapefile, pdf, htmlEmail, websites-URL, Flash DriveNo URI and
No linked data
Data License: No
Data is open: yes
Nature of Policy: institutional
Institutional arrangements
Respondent 3html, shapefile, OSMWebsite-URLNo URI and
No linked data
Data License: yes
Data is open: yes
Nature of Policy: institutional
Institutional arrangements:Yes
Respondent 4Excel/CSV, html, ppt, docx, pdf, shapefileWebsite, URLNo URI and
No linked data
Data License: yes
Data is open: yes
Nature of Policy: institutional
Institutional arrangements:Yes
Respondent 5Excel/CSV, html, ppt, docx, pdfDatabase, Website-URL, emailNo URI and
No linked data
Data License: No
Data is open: yes
Nature of Policy: institutional
Institutional arrangements:Yes
Respondent 6Excel/CSV, html, ppt, docx, pdfDatabase, Website-URL, emailNo URI and
No linked data
Data License: No
Data is open: partially
Nature of Policy: institutional
Institutional arrangements:Yes
Respondent 7ppt, docx, pdf, shapefileDatabase, flashdriveNo URI and
No linked data
Data License: No
Data is open: partially
Nature of Policy: institutional
Institutional arrangements:Yes

Appendix B. Nature of Participants in the Study

Appendix B.1. Summary of Respondents that Participated in Interviews

Table A3. Description of respondents in the interview.
Table A3. Description of respondents in the interview.
RespondentNature of OrganizationData Shared between Stakeholders
Respondent 1Institution interested in food insecurity scenario analysis and early warning based on relevant drivers.Collects primary data such as price data and volumes of informal trade but relies on a network of partners to access data on farm gain, Climate and weather data from NOAA and USGS, Vegetation conditions, Rainfall data from UNMA, consumer prices, nutrition data, conflict data
Respondent 2National statistical bodyProvides fundamental data sets like administrative boundary data upon which disaster data is overlayed, consumer prices, vulnerability indicators
Respondent 3Humanitarian that collects open data and maps it on OSM platformGenerates exposure data and maps post disaster impacts. Also relies on data from other institutions such as administration boundaries etc.
Respondent 4Humanitarian organization for disaster response and preparedness. Collaborates with lead agencies on Flood modeling and early warning, indigenous knowledge EWS, Forecast based financing (FbF)Generate rapid assessment reports/response data, relies on data from other institutions for joint Hazard, Vulnerability and Risk assessment. Such data include disaster Impact data, land cover, imagery etc
Respondent 5National Office mandated to coordinate disaster preparedness, prevention and emergency responseManages DesInventar impact data, Institution does not primarily collect data but rather relies on data from other institutions. Coordinates early warning efforts of other institutions e.g., weather/climate forecasts, food security, crop and pasture conditions, Multi hazard early warning (based on sector data) etc
Respondent 6Semi autonomous government authority for weather and climate servicesWeather data (Rainfall/ humidity/temperature data) and fore casts (daily, 10 day dekadal, monthly and seasonal forecasts), climatic statistics
Respondent 7Water Resource monitoring department in charge flood hazardsProvide data on water levels in various rivers and lakes being monitored; flood hazard and sanitation data

Appendix B.2. Description of Institutions that Participated in Focused Group Discussions

Table A4. Nature of organization where focused group discussions (FGD) participants were drawn.
Table A4. Nature of organization where focused group discussions (FGD) participants were drawn.
NoNature of Organization
1Non governmental organization offering relief services to Refugees in Uganda
2United Nations agency supporting disaster management in Uganda
3Office mandated to coordinate disaster management in Uganda
4Higher learning and research institution in Uganda
5International Non Governmental Organization that supports humanitarian efforts in Uganda. Also supports data management for community EWS
6Ministry in charge of health in Uganda (provides human epidemic data)
7Church based humanitarian organization in Uganda
8Ministry that provides National data and information on agriculture sector; department involved in food insecurity analysis and early warning
9Non governmental Organization involved in climate-smart disaster risk reduction and Forecast based financing
10Civil Society Organization helping reduce Community Vulnerability to Climate Change Induced Disasters
11Humanitarian Organization spear heading Forecast based Financing in Uganda.

Appendix C. Interoperability Patterns

Technical patterns include the federated and broadcast patterns. Semantic patterns include a list of Linked data patterns, and Ontology design patterns. Organisational patterns include the dissemination pattern, rights pattern among others. All Pattern templates and diagrams can be found here: https://github.com/mazimweal/mazimweal.github.io.

References

  1. Bharosa, N.; Lee, J.; Janssen, M. Challenges and obstacles in sharing and coordinating information during multi-agency disaster response: Propositions from field exercises. Inf. Syst. Front. 2010, 12, 49–65. [Google Scholar] [CrossRef]
  2. Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Wilkinson, M.D.; Sansone, S.A.; Schultes, E.; Doorn, P.; da Silva Santos, L.O.B.; Dumontier, M. A design framework and exemplar metrics for FAIRness. Sci. Data 2018, 5, 180118. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. da Silva Avanzi, D.; Foggiatto, A.; dos Santos, V.A.; Deschamps, F.; Loures, E.D. A framework for interoperability assessment in crisis management. J. Ind. Inf. Integr. 2017, 5, 26–38. [Google Scholar]
  5. Leal, G.D.; Guédria, W.; Panetto, H. Interoperability assessment: A systematic literature review. Comput. Ind. 2019, 106, 111–132. [Google Scholar] [CrossRef]
  6. Milosevic, Z. Addressing interoperability in E-health: An Australian Approach. In Proceedings of the 10th IEEE International Enterprise Distributed Object Computing Conference Workshops (EDOCW’06’), Hong Kong, China, 16–20 October 2006. [Google Scholar]
  7. NEHTA. Interoperability Framework V2. 2007. Available online: https://www.ghdonline.org/uploads/NEHTA_2007_-_InteroperabilityFramework.pdf (accessed on 24 October 2019).
  8. Fritzsche, D.; Grüninger, M.; Baclawski, K.; Bennett, M.; Berg-Cross, G.; Schneider, T.; Sriram, R.; Underwood, M.; Westerinen, A. Ontology Summit 2016 Communique: Ontologies within semantic interoperability ecosystems. Appl. Ontol. 2017, 12, 91–111. [Google Scholar] [CrossRef] [Green Version]
  9. Field, C.B.; Barros, V.; Stocker, T.F.; Dahe, Q. Managing the Risks of Extreme Events and Disasters to Advance Climate Change Adaptation: Special Report of the Intergovernmental Panel on Climate Change; Cambridge University Press: New York, NY, USA, 2012. [Google Scholar]
  10. Haigh, R.; Amaratunga, D.; Hemachandra, K. A capacity analysis framework for multi-hazard early warning in coastal communities. Procedia Eng. 2018, 212, 1139–1146. [Google Scholar] [CrossRef]
  11. Al Hmoudi, A.; El, R.; Aziz, Z. Integrated elements of early warning systems to enhance disaster resilience in the Arab region. J. Geod. Geomat. Eng. 2015, 2, 73–81. [Google Scholar] [CrossRef]
  12. Rogers, D.; Tsirkunov, V. Implementing Hazard Early Warning Systems. GFDRR WCIDS Report 11-03. 2011. Available online: https://www.preventionweb.net/files/24259_implementingearlywarningsystems1108.pdf (accessed on 17 September 2019).
  13. Samarasundera, E.; Hansell, A.; Leibovici, D.; Horwell, C.J.; Anand, S.; Oppenheimer, C. Geological hazards: From early warning systems to public health toolkits. Health Place 2014, 30, 116–119. [Google Scholar] [CrossRef]
  14. Yao, K. Open Government Data for Disaster Risk Reduction. In Proceedings of the Training Workshop on Knowledge and Policy Gaps in Disaster Risk Reduction and Development Planning, Bangkok, Thailand, 8–9 March 2016; pp. 8–9. [Google Scholar]
  15. Li, G.; Zhao, J.; Murray, V.; Song, C.; Zhang, L. Gap analysis on open data interconnectivity for disaster risk research. Geo-Spat. Inf. Sci. 2019, 22, 45–58. [Google Scholar] [CrossRef] [Green Version]
  16. Kucera, J.; Chlapek, D.; Klímek, J.; Necaskỳ, M. Methodologies and Best Practices for Open Data Publication; DATESO: Jičín, Czech Republic, 2015; pp. 52–64. [Google Scholar]
  17. Mons, B.; Neylon, C.; Velterop, J.; Dumontier, M.; da Silva Santos, L.O.B.; Wilkinson, M.D. Cloudy, increasingly FAIR; revisiting the FAIR Data guiding principles for the European Open Science Cloud. Inf. Serv. Use 2017, 37, 49–56. [Google Scholar] [CrossRef] [Green Version]
  18. Chen, D.; Daclin, N. Framework for enterprise interoperability. In Proceedings of the IFAC Workshop EI2N, Bordeaux, France, 22–24 March 2006; pp. 77–88. [Google Scholar]
  19. Ford, T.C.; Colombi, J.M.; Graham, S.R.; Jacques, D.R. Survey on Interoperability Measurement. In Proceedings of the International Command and Control Research and Technology Symposium (ICCRTS), Newport, RI, USA, 18–21 June 2007. [Google Scholar]
  20. Melnik, S.; Decker, S. A layered approach to information modeling and interoperability on the web. In Proceedings of the ECDL’00 Workshop on the Semantic Web, Lisbon, Portugal, 21 September 2000. [Google Scholar]
  21. Commission, E. European Interoperability Framework—Implementation Strategy. 2017. Available online: https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=COM:2017:134:FIN (accessed on 5 September 2019).
  22. Wang, W.; Tolk, A.; Wang, W. The levels of conceptual interoperability model: Applying systems engineering principles to M&S. arXiv 2009, arXiv:0908.0191. [Google Scholar]
  23. Rezaei, R.; Chiew, T.K.; Lee, S.P. A review on E-business Interoperability Frameworks. J. Syst. Softw. 2014, 93, 199–216. [Google Scholar] [CrossRef]
  24. Janowicz, K.; Hitzler, P.; Adams, B.; Kolas, D.; Vardeman, I. Five stars of linked data vocabulary use. Semant. Web 2014, 5, 173–176. [Google Scholar]
  25. Berners-Lee, T. Linked Data. Design Issues for the World Wide Web. World Wide Web Consortium, 2006. Available online: http://www.w3.org/DesignIssues/LinkedData.html? (accessed on 5 September 2019).
  26. Moses, M.; Stevens, T.; Bax, G. GIS Data Interoperability in Uganda. Int. J. Spat. Data Infrastruct. Res. 2012, 7, 488–507. [Google Scholar]
  27. Colpaert, P.; Van Compernolle, M.; De Vocht, L.; Dimou, A.; Vander Sande, M.; Verborgh, R.; Mechant, P.; Mannens, E. Quantifying the interoperability of open government datasets. Computer 2014, 47, 50–56. [Google Scholar] [CrossRef]
  28. Yahia, E.; Lezoche, M.; Aubry, A.; Panetto, H. Semantics enactment for interoperability assessment in Enterprise Information Systems. Annu. Rev. Control 2012, 36, 101–117. [Google Scholar] [CrossRef]
  29. Gamma, E.; Helm, R.; Johnson, R.E.; Vlissides, J. Design Patterns: Elements of Reusable Object-Oriented Software; Addison-Wesley: Reading, MA, USA, 1995. [Google Scholar]
  30. Hay, D.C. Data Model Patterns: A Metadata Map; Elsevier: Amsterdam, The Netherlands, 2010. [Google Scholar]
  31. Clark, P.; Thompson, J.; Porter, B. Knowledge Patterns. In Principles of Knowledge Representation and Reasoning; Springer: Berlin/Heidelberg, Germany, 2000; pp. 591–600. [Google Scholar]
  32. Bröring, A.; Schmid, S.; Schindhelm, C.K.; Khelil, A.; Kabisch, S.; Kramer, D.; Le Phuoc, D.; Mitic, J.; Anicic, D.; Teniente López, E. Enabling IoT ecosystems through platform interoperability. IEEE Softw. 2017, 34, 54–61. [Google Scholar] [CrossRef]
  33. Spalazzese, R.; Inverardi, P. Mediating connector patterns for components interoperability. In European Conference on Software Architecture; Springer: Berlin/Heidelberg, Germany, 2010; pp. 335–343. [Google Scholar]
  34. Gangemi, A.; Presutti, V. Ontology design patterns. In Handbook on Ontologies; Springer: Berlin/Heidelberg, Germany, 2009; pp. 221–243. [Google Scholar]
  35. Blomqvist, E.; Hitzler, P.; Janowicz, K.; Krisnadhi, A.; Narock, T.; Solanki, M. Considerations regarding Ontology Design Patterns. Semant. Web 2016, 7, 1–7. [Google Scholar] [CrossRef]
  36. Verborgh, R.; De Roo, J. Drawing conclusions from linked data on the web: The EYE reasoner. IEEE Softw. 2015, 32, 23–27. [Google Scholar] [CrossRef]
  37. Tešanovic, A. What Is a Pattern; Linköping Univeristy: Linköping, Sweden, 2004. [Google Scholar]
  38. Dimitrova, V.; Denaux, R.; Hart, G.; Dolbear, C.; Holt, I.; Cohn, A.G. Involving domain experts in authoring OWL ontologies. In International Semantic Web Conference; Springer: Berlin/Heidelberg, Germany, 2008; pp. 1–16. [Google Scholar]
  39. Hammar, K. Content Ontology Design Patterns: Qualities, Methods, and Tools; Linköping University Electronic Press: Linköping, Sweden, 2017; Volume 1879. [Google Scholar]
  40. Rodrıguez-Doncel, V.; Krisnadhi, A.A.; Hitzler, P.; Cheatham, M.; Karima, N.; Amini, R. Pattern-based linked data publication: The linked chess dataset case. In Proceedings of the 6th International Workshop on Consuming Linked Data co-located with 14th International Semantic Web Conference (ISWC 2105), Bethlehem, PA, USA, 12 October 2015. [Google Scholar]
  41. Taelman, R.; Van Herwegen, J.; Vander Sande, M.; Verborgh, R. Comunica: A modular SPARQL query engine for the web. In International Semantic Web Conference; Springer: Berlin/Heidelberg, Germany, 2018; pp. 239–255. [Google Scholar]
  42. Ngomo, N. 9th Challenge on Question Answering over Linked Data (QALD-9). Semdeep/NLIWoD@ ISWC. 2018, pp. 58–64. Available online: ceur-ws.org/Vol-2241/paper-06.pdf (accessed on 5 September 2019).
  43. Atzori, M.; Mazzeo, G.M.; Zaniolo, C. QA3: A natural language approach to question answering over RDF data cubes. Semant. Web 2019, 10, 587–604. [Google Scholar] [CrossRef]
Figure 1. General flow of methodology.
Figure 1. General flow of methodology.
Ijgi 08 00484 g001
Figure 2. Dataset comparison (adopted from Colpaert et al. [27]).
Figure 2. Dataset comparison (adopted from Colpaert et al. [27]).
Ijgi 08 00484 g002
Figure 3. Data extracts.
Figure 3. Data extracts.
Ijgi 08 00484 g003
Figure 4. NTriple (N3) and ID generation from datasets in Figure 3.
Figure 4. NTriple (N3) and ID generation from datasets in Figure 3.
Ijgi 08 00484 g004
Figure 5. Computing identifier interoperability [27].
Figure 5. Computing identifier interoperability [27].
Ijgi 08 00484 g005
Figure 6. Problem and solution matching resulting into reusable patterns, i.e., “interoperability patterns”.
Figure 6. Problem and solution matching resulting into reusable patterns, i.e., “interoperability patterns”.
Ijgi 08 00484 g006
Figure 7. Illustrative example of a modular pattern composition in an early warning system.
Figure 7. Illustrative example of a modular pattern composition in an early warning system.
Ijgi 08 00484 g007
Table 1. Interoperability maturity metrics.
Table 1. Interoperability maturity metrics.
Technical Interoperability
T1:Data is published in any format and accessed via a common protocol such as Http URL
Syntactic Interoperability
S1:Data is available as structured data e.g., shapefile, excel instead of scanned image
S2:Data is available in a machine readable non proprietary format e.g., CSV, XML, GML, RDF, OWL etc
Semantic Interoperability
Sem1:Use URI to denote concepts for ensuring unique identification
Sem2:Data is available with vocabularies that are linked
Organization Interoperability
O1:Licence attached to data
O2:Data can be accessed freely
O3:Have institutional arrangements to share data
O4:Uses a data policy that is binding to other stakeholders in the sector(one that is not only institutional)
Table 2. Interoperability maturity (see Table 1 and Table A2).
Table 2. Interoperability maturity (see Table 1 and Table A2).
Interoperability LevelTechnicalSyntaxSemanticOrganisation
MetricT1S1S2Sem1Sem2O1O2O3O4
Respondent 11100000.510
Respondent 2111000110
Respondent 3111001110
Respondent 4111001110
Respondent 51110000.510
Respondent 61110000.510
Respondent 70110000.510
Total676002570
Average metric86%100%86%0%0%14%71%100%0%
Maturity metric86%93%0%46%
Table 3. Interoperability identifier ratio with the DesInventar dataset as reference.
Table 3. Interoperability identifier ratio with the DesInventar dataset as reference.
DatasetRelevance of String Matching IDsRelevance of Real-World ConceptsInteroperability IdentifierRemark
Landcover000%low
Rainfall data2020100%high
Emdat_impact data20930069.7%medium
Risk data3232100%high
Response data27033081.8%high
Poverty data3838100%high
Table 4. Emerging interoperability patterns.
Table 4. Emerging interoperability patterns.
LevelInteroperability ChallengesRecommended SolutionEmerging Patterns
General1-Missing meta data
2-Missing, incomplete, and erroneous datasets
3-Data is not up to date
4-No/ immature standards
5-Duplication of data collection efforts.
1-Metadata documentation portal
2-Enriching and linking disaster/hazard related data to fill gaps and validation of data
3-Develop standards for sector data
1-Linked data patterns e.g.,
-Follow Your Nose pattern to find additional relevant data
-Progressive Enrichment pattern for data quality
-Missing Isn’t Broken pattern to handle messy and incomplete data
Technical6-Data is not easily retrievable because it is scattered4-Centralized access to repositories with linked documents2-Federated queries pattern
Technical7-Absence of directed alerts, notifications based sharing8-Geo-tagged alerts, and notifications
9-Embrace changing open architectures
3-Broadcast pattern
Syntax8-Lack of capacity in semantic mark-ups and technologies10-Semantic systems that hide their complexity behind an easy to use and intuitive interface
Semantic9-Terminology is not standard across datasets to be integrated11-Unified vocabulary across domains
e.g., glossary. However, the vocabulary should be expressive across different domains
4-Content Ontology design patterns (ODPs)
Organisation10-Data is not easily accessed and retrieved but it is available in different institutions
11-Limited funding for data collection and management in the sector -
12-Mandated institutions should take lead in policy formulation
13-Develop policies that Open up government data (i.e. embrace Open Government Data (OGD) principles)
14-Government should invest in data management
15-Create awareness and disseminate available disaster resources
16-organizations should add licenses to data
5-Coordination and implementation pattern
6-Dissemination
7-privacy policy pattern
8-Policy harmonization(i.e. policy
override and conflict mediation patterns)
9-Rights pattern

Share and Cite

MDPI and ACS Style

Mazimwe, A.; Hammouda, I.; Gidudu, A. An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda. ISPRS Int. J. Geo-Inf. 2019, 8, 484. https://doi.org/10.3390/ijgi8110484

AMA Style

Mazimwe A, Hammouda I, Gidudu A. An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda. ISPRS International Journal of Geo-Information. 2019; 8(11):484. https://doi.org/10.3390/ijgi8110484

Chicago/Turabian Style

Mazimwe, Allan, Imed Hammouda, and Anthony Gidudu. 2019. "An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda" ISPRS International Journal of Geo-Information 8, no. 11: 484. https://doi.org/10.3390/ijgi8110484

APA Style

Mazimwe, A., Hammouda, I., & Gidudu, A. (2019). An Empirical Evaluation of Data Interoperability—A Case of the Disaster Management Sector in Uganda. ISPRS International Journal of Geo-Information, 8(11), 484. https://doi.org/10.3390/ijgi8110484

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop