To read this content please select one of the options below:

Effective keyword search on graph data using limited root redundancy of answer trees

Chang-Sup Park (Department of Computer Science, Dongduk Women’s University, Seoul, Korea)

International Journal of Web Information Systems

ISSN: 1744-0084

Article publication date: 20 August 2018

100

Abstract

Purpose

This paper aims to propose a new keyword search method on graph data to improve the relevance of search results and reduce duplication of content nodes in the answer trees obtained by previous approaches based on distinct root semantics. The previous approaches are restricted to find answer trees having different root nodes and thus often generate a result consisting of answer trees with low relevance to the query or duplicate content nodes. The method allows limited redundancy in the root nodes of top-k answer trees to produce more effective query results.

Design/methodology/approach

A measure for redundancy in a set of answer trees regarding their root nodes is defined, and according to the metric, a set of answer trees with limited root redundancy is proposed for the result of a keyword query on graph data. For efficient query processing, an index on the useful paths in the graph using inverted lists and a hash map is suggested. Then, based on the path index, a top-k query processing algorithm is presented to find most relevant and diverse answer trees given a maximum amount of root redundancy allowed for a set of answer trees.

Findings

The results of experiments using real graph datasets show that the proposed approach can produce effective query answers which are more diverse in the content nodes and more relevant to the query than the previous approach based on distinct root semantics.

Originality/value

This paper first takes redundancy in the root nodes of answer trees into account to improve the relevance and content nodes redundancy of query results over the previous distinct root semantics. It can satisfy the users’ various information need on a large and complex graph data using a keyword-based query.

Keywords

Citation

Park, C.-S. (2018), "Effective keyword search on graph data using limited root redundancy of answer trees", International Journal of Web Information Systems, Vol. 14 No. 3, pp. 299-316. https://doi.org/10.1108/IJWIS-10-2017-0070

Publisher

:

Emerald Publishing Limited

Copyright © 2018, Emerald Publishing Limited

Related articles