To read this content please select one of the options below:

Pushing similarity joins down to the storage layer in XML databases

Leonardo Andrade Ribeiro (Instituto de Informática, Universidade Federal de Goiás, Goiânia, Goiás, Brazil)

Theo Härder (Technische Universitat Kaiserslautern Fachbereich Informatik, Kaiserslautern, Rheinland-Pfalz, Germany)

International Journal of Web Information Systems

ISSN: 1744-0084

Article publication date: 18 April 2017

Downloads

520

Abstract

Purpose

This article aims to explore how to incorporate similarity joins into XML database management systems (XDBMSs). The authors aim to provide seamless and efficient integration of similarity joins on tree-structured data into an XDBMS architecture.

Design/methodology/approach

The authors exploit XDBMS-specific features to efficiently generate XML tree representations for similarity matching. In particular, the authors push down a large part of the structural similarity evaluation close to the storage layer.

Findings

Empirical experiments were conducted to measure and compare accuracy, performance and scalability of the tree similarity join using different similarity functions and on the top of different storage models. The results show that the authors’ proposal delivers performance and scalability without hurting the accuracy.

Originality/value

Similarity join is a fundamental operation for data integration. Unfortunately, none of the XDBMS architectures proposed so far provides an efficient support for this operation. Evaluating similarity joins on XML is challenging, because it requires similarity matching on the text and structure. In this work, the authors integrate similarity joins into an XDBMS. To the best of the authors’ knowledge, this work is the first to leverage the storage scheme of an XDBMS to support XML similarity join processing.

Keywords

Citation

Andrade Ribeiro, L. and Härder, T. (2017), "Pushing similarity joins down to the storage layer in XML databases", International Journal of Web Information Systems, Vol. 13 No. 1, pp. 55-71. https://doi.org/10.1108/IJWIS-04-2016-0019

Publisher

:

Emerald Publishing Limited

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Pushing similarity joins down to the storage layer in XML databases

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

All feedback is valuable

Report an issue or find answers to frequently asked questions