Search results

1 – 8 of 8
Article
Publication date: 1 February 2016

Chongchong Zhao, Chao Dong and Xiaoming Zhang

The integration and retrieval of the vast data have attracted sufficient attention, thus the W3C workgroup releases R2RML to standardize the transformation from relational data to…

2316

Abstract

Purpose

The integration and retrieval of the vast data have attracted sufficient attention, thus the W3C workgroup releases R2RML to standardize the transformation from relational data to semantic-aware data. However, it only provides a data transform mechanism to resource description framework (RDF). The generation of mapping alignments still needs manual work or other algorithms. Therefore, the purpose of this paper is to propose a domain-oriented automatic mapping method and an application of the R2RML standard.

Design/methodology/approach

In this paper, materials science is focussed to show an example of domain-oriented mapping. source field concept and M3B2 (Metal Materials Mapping Background Base) knowledge bases are established to support the auto-recommending algorithm. As for the generation of RDF files, the idea is to generate the triples and the links, respectively. The links of the triples follow the object-subject relationship, and the links of the object properties can be achieved by the range individuals and the trail path.

Findings

Consequently based on the previous work, the authors proposed Engine for Metal Materials Mapping Background Base (EM3B2), a semantic integration engine for materials science. EM3B2 not only offers friendly graphical interfaces, but also provides auto-recommending mapping based on materials knowledge to enable users to avoid vast manually work. The experimental result indicates that EM3B2 supplies accurate mapping. Moreover, the running time of E3MB2 is also competitive as classical methods.

Originality/value

This paper proposed EM3B2 semantic integration engine, which contributes to the relational database-to-RDF mapping by the application of W3C R2RML standard and the domain-oriented mapping.

Details

Program, vol. 50 no. 1
Type: Research Article
ISSN: 0033-0337

Keywords

Article
Publication date: 6 November 2017

Ademar Crotti Junior, Christophe Debruyne, Rob Brennan and Declan O’Sullivan

This paper aims to evaluate the state-of-the-art in CSV uplift tools. Based on this evaluation, a method that incorporates data transformations into uplift mapping languages by…

Abstract

Purpose

This paper aims to evaluate the state-of-the-art in CSV uplift tools. Based on this evaluation, a method that incorporates data transformations into uplift mapping languages by means of functions is proposed and evaluated. Typically, tools that map non-resource description framework (RDF) data into RDF format rely on the technology native to the source of the data when data transformation is required. Depending on the data format, data manipulation can be performed using underlying technology, such as relational database management system (RDBMS) for relational databases or XPath for XML. For CSV/Tabular data, there is no such underlying technology, and instead, it requires either a transformation of source data into another format or pre/post-processing techniques.

Design/methodology/approach

To evaluate the state-of-the-art in CSV uplift tools, the authors present a comparison framework and have applied it to such tools. A key feature evaluated in the comparison framework is data transformation functions. They argue that existing approaches for transformation functions are complex – in that a number of steps and tools are required. The proposed method, FunUL, in contrast, defines functions independent of the source data being mapped into RDF, as resources within the mapping itself.

Findings

The approach was evaluated using two typical real-world use cases. The authors have compared how well our approach and others (that include transformation functions as part of the uplift mapping) could implement an uplift mapping from CSV/Tabular into RDF. This comparison indicates that the authors’ approach performs well for these use cases.

Originality/value

This paper presents a comparison framework and applies it to the state-of-the-art in CSV uplift tools. Furthermore, the authors describe FunUL, which, unlike other related work, defines functions as resources within the uplift mapping itself, integrating data transformation functions and mapping definitions. This makes the generation of RDF from source data transparent and traceable. Moreover, as functions are defined as resources, these can be reused multiple times within mappings.

Details

International Journal of Web Information Systems, vol. 13 no. 4
Type: Research Article
ISSN: 1744-0084

Keywords

Article
Publication date: 3 November 2014

Nikolaos Konstantinou, Dimitrios-Emmanuel Spanos, Nikos Houssos and Nikolaos Mitrou

– This paper aims to introduce a transformation engine which can be used to convert an existing institutional repository installation into a Linked Open Data repository.

1773

Abstract

Purpose

This paper aims to introduce a transformation engine which can be used to convert an existing institutional repository installation into a Linked Open Data repository.

Design/methodology/approach

The authors describe how the data that exist in a DSpace repository can be semantically annotated to serve as a Semantic Web (meta)data repository.

Findings

The authors present a non-intrusive, standards-compliant approach that can run alongside with current practices, while incorporating state-of-the art methodologies.

Originality/value

Also, they propose a set of mappings between domain vocabularies that can be (re)used towards this goal, thus offering an approach that covers both the technical and semantic aspects of the procedure.

Details

The Electronic Library, vol. 32 no. 6
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 18 August 2021

Jameel Ahamed, Roohie Naaz Mir and Mohammad Ahsan Chishti

A huge amount of diverse data is generated in the Internet of Things (IoT) because of heterogeneous devices like sensors, actuators, gateways and many more. Due to assorted nature…

Abstract

Purpose

A huge amount of diverse data is generated in the Internet of Things (IoT) because of heterogeneous devices like sensors, actuators, gateways and many more. Due to assorted nature of devices, interoperability remains a major challenge for IoT system developers. The purpose of this study is to use mapping techniques for converting relational database (RDB) to resource directory framework (RDF) for the development of ontology. Ontology helps in achieving semantic interoperability in application areas of IoT which results in shared/common understanding of the heterogeneous data generated by the diverse devices used in health-care domain.

Design/methodology/approach

To overcome the issue of semantic interoperability in healthcare domain, the authors developed an ontology for patients having cardio vascular diseases. Patients located at any place around the world can be diagnosed by Heart Experts located at another place by using this approach. This mechanism deals with the mapping of heterogeneous data into the RDF format in an integrated and interoperable manner. This approach is used to integrate the diverse data of heart patients needed for diagnosis with respect to cardio vascular diseases. This approach is also applicable in other fields where IoT is mostly used.

Findings

Experimental results showed that the RDF works better than the relational database for semantic interoperability in the IoT. This concept-based approach is better than key-based approach and reduces the computation time and storage of the data.

Originality/value

The proposed approach helps in overcoming the demerits of relational database like standardization, expressivity, provenance and supports SPARQL. Therefore, it helps to overcome the heterogeneity, thereby enabling the semantic interoperability in IoT.

Details

International Journal of Pervasive Computing and Communications, vol. 17 no. 4
Type: Research Article
ISSN: 1742-7371

Keywords

Content available
Article
Publication date: 8 July 2022

Vania Vidal, Valéria Magalhães Pequeno, Narciso Moura Arruda Júnior and Marco Antonio Casanova

Enterprise knowledge graphs (EKG) in resource description framework (RDF) consolidate and semantically integrate heterogeneous data sources into a comprehensive dataspace…

Abstract

Purpose

Enterprise knowledge graphs (EKG) in resource description framework (RDF) consolidate and semantically integrate heterogeneous data sources into a comprehensive dataspace. However, to make an external relational data source accessible through an EKG, an RDF view of the underlying relational database, called an RDB2RDF view, must be created. The RDB2RDF view should be materialized in situations where live access to the data source is not possible, or the data source imposes restrictions on the type of query forms and the number of results. In this case, a mechanism for maintaining the materialized view data up-to-date is also required. The purpose of this paper is to address the problem of the efficient maintenance of externally materialized RDB2RDF views.

Design/methodology/approach

This paper proposes a formal framework for the incremental maintenance of externally materialized RDB2RDF views, in which the server computes and publishes changesets, indicating the difference between the two states of the view. The EKG system can then download the changesets and synchronize the externally materialized view. The changesets are computed based solely on the update and the source database state and require no access to the content of the view.

Findings

The central result of this paper shows that changesets computed according to the formal framework correctly maintain the externally materialized RDB2RDF view. The experiments indicate that the proposed strategy supports live synchronization of large RDB2RDF views and that the time taken to compute the changesets with the proposed approach was almost three orders of magnitude smaller than partial rematerialization and three orders of magnitude smaller than full rematerialization.

Originality/value

The main idea that differentiates the proposed approach from previous work on incremental view maintenance is to explore the object-preserving property of typical RDB2RDF views so that the solution can deal with views with duplicates. The algorithms for the incremental maintenance of relational views with duplicates published in the literature require querying the materialized view data to precisely compute the changesets. By contrast, the approach proposed in this paper requires no access to view data. This is important when the view is maintained externally, because accessing a remote data source may be too slow.

Details

International Journal of Web Information Systems, vol. 18 no. 5/6
Type: Research Article
ISSN: 1744-0084

Keywords

Article
Publication date: 18 August 2022

Henrik Dibowski

The curation of ontologies and knowledge graphs (KGs) is an essential task for industrial knowledge-based applications, as they rely on the contained knowledge to be correct and…

Abstract

Purpose

The curation of ontologies and knowledge graphs (KGs) is an essential task for industrial knowledge-based applications, as they rely on the contained knowledge to be correct and error-free. Often, a significant amount of a KG is curated by humans. Established validation methods, such as Shapes Constraint Language, Shape Expressions or Web Ontology Language, can detect wrong statements only after their materialization, which can be too late. Instead, an approach that avoids errors and adequately supports users is required.

Design/methodology/approach

For solving that problem, Property Assertion Constraints (PACs) have been developed. PACs extend the range definition of a property with additional logic expressed with SPARQL. For the context of a given instance and property, a tailored PAC query is dynamically built and triggered on the KG. It can determine all values that will result in valid property value assertions.

Findings

PACs can avoid the expansion of KGs with invalid property value assertions effectively, as their contained expertise narrows down the valid options a user can choose from. This simplifies the knowledge curation and, most notably, relieves users or machines from knowing and applying this expertise, but instead enables a computer to take care of it.

Originality/value

PACs are fundamentally different from existing approaches. Instead of detecting erroneous materialized facts, they can determine all semantically correct assertions before materializing them. This avoids invalid property value assertions and provides users an informed, purposeful assistance. To the author's knowledge, PACs are the only such approach.

Details

Data Technologies and Applications, vol. 57 no. 2
Type: Research Article
ISSN: 2514-9288

Keywords

Article
Publication date: 25 October 2022

Samir Sellami and Nacer Eddine Zarour

Massive amounts of data, manifesting in various forms, are being produced on the Web every minute and becoming the new standard. Exploring these information sources distributed in…

Abstract

Purpose

Massive amounts of data, manifesting in various forms, are being produced on the Web every minute and becoming the new standard. Exploring these information sources distributed in different Web segments in a unified way is becoming a core task for a variety of users’ and companies’ scenarios. However, knowledge creation and exploration from distributed Web data sources is a challenging task. Several data integration conflicts need to be resolved and the knowledge needs to be visualized in an intuitive manner. The purpose of this paper is to extend the authors’ previous integration works to address semantic knowledge exploration of enterprise data combined with heterogeneous social and linked Web data sources.

Design/methodology/approach

The authors synthesize information in the form of a knowledge graph to resolve interoperability conflicts at integration time. They begin by describing KGMap, a mapping model for leveraging knowledge graphs to bridge heterogeneous relational, social and linked web data sources. The mapping model relies on semantic similarity measures to connect the knowledge graph schema with the sources' metadata elements. Then, based on KGMap, this paper proposes KeyFSI, a keyword-based semantic search engine. KeyFSI provides a responsive faceted navigating Web user interface designed to facilitate the exploration and visualization of embedded data behind the knowledge graph. The authors implemented their approach for a business enterprise data exploration scenario where inputs are retrieved on the fly from a local customer relationship management database combined with the DBpedia endpoint and the Facebook Web application programming interface (API).

Findings

The authors conducted an empirical study to test the effectiveness of their approach using different similarity measures. The observed results showed better efficiency when using a semantic similarity measure. In addition, a usability evaluation was conducted to compare KeyFSI features with recent knowledge exploration systems. The obtained results demonstrate the added value and usability of the contributed approach.

Originality/value

Most state-of-the-art interfaces allow users to browse one Web segment at a time. The originality of this paper lies in proposing a cost-effective virtual on-demand knowledge creation approach, a method that enables organizations to explore valuable knowledge across multiple Web segments simultaneously. In addition, the responsive components implemented in KeyFSI allow the interface to adequately handle the uncertainty imposed by the nature of Web information, thereby providing a better user experience.

Details

International Journal of Web Information Systems, vol. 18 no. 5/6
Type: Research Article
ISSN: 1744-0084

Keywords

Article
Publication date: 21 November 2016

Young Man Ko, Min Sun Song and Seung Jun Lee

The purpose of this paper is to construct a structural definition-based terminology ontology system that defines the meanings of academic terms on the basis of properties and…

492

Abstract

Purpose

The purpose of this paper is to construct a structural definition-based terminology ontology system that defines the meanings of academic terms on the basis of properties and links terms with properties that are structured by conceptual categories (classes). This study also aims to test the possibility of semantic searches by generating inference rules and setting very complicated search scenarios.

Design/methodology/approach

For the study, 55,236 keywords from the articles of the “Korea Citation Index” were structurally defined and relationships among terms and properties were built. Then, the authors converted the RDB data into RDF and designed ontologies using the ontology developing tool Protégé. The authors also tested the designed ontology with the inference engine of the Protégé editor. The generated reference rules were tested by TBox and SPARQL queries.

Findings

The authors generated inference control rules targeting high-input-ratio data in the properties of classes by calculating the input ratio of real input data in the system, and then the authors executed a semantic search by SPARQL query by setting very complicated search scenarios, for which it would be difficult to deduce results via a simple keyword search. As a result, it was confirmed that the search results show the logical combination of semantically related term data.

Practical implications

The proposed terminology ontology system was constructed with the author keywords from research papers, it will be useful in searching the research papers which include the keywords as search results by the complex combination of semantic relation. And the Structural Terminology Net database could be utilized as an index database in retrieval services and the mining of informal big data through the application of well-defined semantic concepts to each term.

Originality/value

This paper presented a methodology for supporting IR using expanded queries based on a novel model of structural terminology-based ontology. The user who wants to access the specific topic can create query that brings the semantically relevant information. The search results show the logical combination of semantically related term data, which would be difficult to deduce results via traditional IR systems.

Access

Year

All dates (8)

Content type

1 – 8 of 8