To read this content please select one of the options below:

A simple relevancy‐ranking strategy for an interface to Boolean OPACs

Christopher S.G. Khoo (Division of Information Studies, School of Communication and Information, Nanyang Technological University, Singapore)
Kwok‐Wai Wan (Division of Information Studies, School of Communication and Information, Nanyang Technological University, Singapore)

The Electronic Library

ISSN: 0264-0473

Article publication date: 1 April 2004

930

Abstract

A relevancy‐ranking algorithm for a natural language interface to Boolean online public access catalogs (OPACs) was formulated and compared with that currently used in a knowledge‐based search interface called the E‐Referencer, being developed by the authors. The algorithm makes use of seven well‐known ranking criteria: breadth of match, section weighting, proximity of query words, variant word forms (stemming), document frequency, term frequency and document length. The algorithm converts a natural language query into a series of increasingly broader Boolean search statements. In a small experiment with ten subjects in which the algorithm was simulated by hand, the algorithm obtained good results with a mean overall precision of 0.42 and mean average precision of 0.62, representing a 27 percent improvement in precision and 41 percent improvement in average precision compared to the E‐Referencer. The usefulness of each step in the algorithm was analyzed and suggestions are made for improving the algorithm.

Keywords

Citation

Khoo, C.S.G. and Wan, K. (2004), "A simple relevancy‐ranking strategy for an interface to Boolean OPACs", The Electronic Library, Vol. 22 No. 2, pp. 112-120. https://doi.org/10.1108/02640470410533380

Publisher

:

Emerald Group Publishing Limited

Copyright © 2004, Emerald Group Publishing Limited

Related articles