To read the full version of this content please select one of the options below:

Ontology-based heuristic patent search

Cong-Phuoc Phan (International University, Ho Chi Minh City, Vietnam)
Hong-Quang Nguyen (International University, Ho Chi Minh City, Vietnam)
Tan-Tai Nguyen (HCMI International University, Ho Chi Minh City, Vietnam)

International Journal of Web Information Systems

ISSN: 1744-0084

Article publication date: 3 December 2018

Issue publication date: 8 August 2019

Abstract

Purpose

Large collections of patent documents disclosing novel, non-obvious technologies are publicly available and beneficial to academia and industries. To maximally exploit its potential, searching these patent documents has increasingly become an important topic. Although much research has processed a large size of collections, a few studies have attempted to integrate both patent classifications and specifications for analyzing user queries. Consequently, the queries are often insufficiently analyzed for improving the accuracy of search results. This paper aims to address such limitation by exploiting semantic relationships between patent contents and their classification.

Design/methodology/approach

The contributions are fourfold. First, the authors enhance similarity measurement between two short sentences and make it 20 per cent more accurate. Second, the Graph-embedded Tree ontology is enriched by integrating both patent documents and classification scheme. Third, the ontology does not rely on rule-based method or text matching; instead, an heuristic meaning comparison to extract semantic relationships between concepts is applied. Finally, the patent search approach uses the ontology effectively with the results sorted based on their most common order.

Findings

The experiment on searching for 600 patent documents in the field of Logistics brings better 15 per cent in terms of F-Measure when compared with traditional approaches.

Research limitations/implications

The research, however, still requires improvement in which the terms and phrases extracted by Noun and Noun phrases making less sense in some aspect and thus might not result in high accuracy. The large collection of extracted relationships could be further optimized for its conciseness. In addition, parallel processing such as Map-Reduce could be further used to improve the search processing performance.

Practical implications

The experimental results could be used for scientists and technologists to search for novel, non-obvious technologies in the patents.

Social implications

High quality of patent search results will reduce the patent infringement.

Originality/value

The proposed ontology is semantically enriched by integrating both patent documents and their classification. This ontology facilitates the analysis of the user queries for enhancing the accuracy of the patent search results.

Keywords

Acknowledgements

This research is funded by Vietnam National University Ho Chi Minh City (VNUHCM) under grant number C2016-2808.

Citation

Phan, C.-P., Nguyen, H.-Q. and Nguyen, T.-T. (2019), "Ontology-based heuristic patent search", International Journal of Web Information Systems, Vol. 15 No. 3, pp. 258-284. https://doi.org/10.1108/IJWIS-06-2018-0053

Publisher

:

Emerald Publishing Limited

Copyright © 2018, Emerald Publishing Limited