To read this content please select one of the options below:

FactQA: question answering over domain knowledge graph based on two-level query expansion

Xiaoming Zhang (School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang, China)
Mingming Meng (School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang, China)
Xiaoling Sun (School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang, China)
Yu Bai (School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang, China)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 10 December 2019

Issue publication date: 24 March 2020

311

Abstract

Purpose

With the advent of the era of Big Data, the scale of knowledge graph (KG) in various domains is growing rapidly, which holds huge amount of knowledge surely benefiting the question answering (QA) research. However, the KG, which is always constituted of entities and relations, is structurally inconsistent with the natural language query. Thus, the QA system based on KG is still faced with difficulties. The purpose of this paper is to propose a method to answer the domain-specific questions based on KG, providing conveniences for the information query over domain KG.

Design/methodology/approach

The authors propose a method FactQA to answer the factual questions about specific domain. A series of logical rules are designed to transform the factual questions into the triples, in order to solve the structural inconsistency between the user’s question and the domain knowledge. Then, the query expansion strategies and filtering strategies are proposed from two levels (i.e. words and triples in the question). For matching the question with domain knowledge, not only the similarity values between the words in the question and the resources in the domain knowledge but also the tag information of these words is considered. And the tag information is obtained by parsing the question using Stanford CoreNLP. In this paper, the KG in metallic materials domain is used to illustrate the FactQA method.

Findings

The designed logical rules have time stability for transforming the factual questions into the triples. Additionally, after filtering the synonym expansion results of the words in the question, the expansion quality of the triple representation of the question is improved. The tag information of the words in the question is considered in the process of data matching, which could help to filter out the wrong matches.

Originality/value

Although the FactQA is proposed for domain-specific QA, it can also be applied to any other domain besides metallic materials domain. For a question that cannot be answered, FactQA would generate a new related question to answer, providing as much as possible the user with the information they probably need. The FactQA could facilitate the user’s information query based on the emerging KG.

Keywords

Acknowledgements

This research was funded by the Natural Science Foundation of Hebei Province (Grant No. F2018208116), Hebei Science and Technology Support Program (No. 16210312D) and Key Project of Hebei Education Department (Grant No. ZD2015099).

Citation

Zhang, X., Meng, M., Sun, X. and Bai, Y. (2020), "FactQA: question answering over domain knowledge graph based on two-level query expansion", Data Technologies and Applications, Vol. 54 No. 1, pp. 34-63. https://doi.org/10.1108/DTA-02-2019-0029

Publisher

:

Emerald Publishing Limited

Copyright © 2019, Emerald Publishing Limited

Related articles