To read this content please select one of the options below:

AN EVALUATION OF FEEDBACK IN DOCUMENT RETRIEVAL USING CO‐OCCURRENCE DATA

D.J. HARPER (Computer Laboratory, University of Cambridge)
C.J. VAN RIJSBERGEN (Computer Laboratory, University of Cambridge)

Journal of Documentation

ISSN: 0022-0418

Article publication date: 1 March 1978

160

Abstract

This paper reports experiments with a term weighting model incorporating relevance information in which it is assumed that index terms are distributed dependently. Initially this model was tested with complete relevance information against a similar model which assumes index terms are distributed independently. The experiments demonstrated conclusively that index terms are not independent for a number of diverse document collections. It was concluded that the use of relevance information together with dependence information could potentially improve retrieval effectiveness. As a result of further experiments the initial strict dependence model was modified and in particular a new relevance‐based term weight was developed. This modified dependence model was then used as the basis for relevance feedback, i.e. with partial relevance information only, and significant increases in retrieval effectiveness were achieved. The evaluation method used in the feedback experiments emphasized the effect of the feedback on documents which the potential user would not previously have seen. Finally the incorporation of relevance feedback in an operational system is considered and in particular it is argued that if high recall searches are required, relevance feedback based on the modified dependence model may be superior to the widely used Boolean search.

Citation

HARPER, D.J. and VAN RIJSBERGEN, C.J. (1978), "AN EVALUATION OF FEEDBACK IN DOCUMENT RETRIEVAL USING CO‐OCCURRENCE DATA", Journal of Documentation, Vol. 34 No. 3, pp. 189-216. https://doi.org/10.1108/eb026659

Publisher

:

MCB UP Ltd

Copyright © 1978, MCB UP Limited

Related articles