To read this content please select one of the options below:

An evaluation of conflation accuracy using finite‐state transducers

Carmen Galvez (Department of Information Science, University of Granada, Granada, Spain)

Félix de Moya‐Anegón (Department of Information Science, University of Granada, Granada, Spain)

Journal of Documentation

ISSN: 0022-0418

Article publication date: 1 May 2006

Downloads

514

Abstract

Purpose

–

To evaluate the accuracy of conflation methods based on finite‐state transducers (FSTs).

Design/methodology/approach

–

Incorrectly lemmatized and stemmed forms may lead to the retrieval of inappropriate documents. Experimental studies to date have focused on retrieval performance, but very few on conflation performance. The process of normalization we used involved a linguistic toolbox that allowed us to construct, through graphic interfaces, electronic dictionaries represented internally by FSTs. The lexical resources developed were applied to a Spanish test corpus for merging term variants in canonical lemmatized forms. Conflation performance was evaluated in terms of an adaptation of recall and precision measures, based on accuracy and coverage, not actual retrieval. The results were compared with those obtained using a Spanish version of the Porter algorithm.

Findings

–

The conclusion is that the main strength of lemmatization is its accuracy, whereas its main limitation is the underanalysis of variant forms.

Originality/value

–

The report outlines the potential of transducers in their application to normalization processes.

Keywords

Citation

Galvez, C. and de Moya‐Anegón, F. (2006), "An evaluation of conflation accuracy using finite‐state transducers", Journal of Documentation, Vol. 62 No. 3, pp. 328-349. https://doi.org/10.1108/00220410610666493

Publisher

:

Emerald Group Publishing Limited

To read this content please select one of the options below:

Please note you do not have access to teaching notes

An evaluation of conflation accuracy using finite‐state transducers

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

Something didn’t work…

All feedback is valuable

Platform update page

Questions & More Information

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

We’re listening — tell us what you think

Something didn’t work…

All feedback is valuable

Join us on our journey

Platform update page

Questions & More Information