To read this content please select one of the options below:

Duplicate detection algorithms of bibliographic descriptions

Anestis Sitas (School of Philosophy, Aristotle University of Thessaloniki, and School of Library Science, Technological Institute of Thessaloniki, Thessaloniki, Greece)

Sarantos Kapidakis (Archive and Library Sciences Department, Ionian University, Paleo Anaktoro, Greece)

Library Hi Tech

ISSN: 0737-8831

Article publication date: 13 June 2008

Downloads

785

Abstract

Purpose

–

The purpose of this paper is to focus on duplicate record detection algorithms used for detection in bibliographic databases.

Design/methodology/approach

–

Individual algorithms, their application process for duplicate detection and their results are described based on available literature (published articles), information found at various library web sites and follow‐up e‐mail communications.

Findings

–

Algorithms are categorized according to their application as a process of a single step or two consecutive steps. The results of deletion, merging, and temporary and virtual consolidation of duplicate records are studied.

Originality/value

–

The paper presents an overview of the duplication detection algorithms and an up‐to‐date state of their application in different library systems.

Keywords

Citation

Sitas, A. and Kapidakis, S. (2008), "Duplicate detection algorithms of bibliographic descriptions", Library Hi Tech, Vol. 26 No. 2, pp. 287-301. https://doi.org/10.1108/07378830810880379

Publisher

:

Emerald Group Publishing Limited

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Duplicate detection algorithms of bibliographic descriptions

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Abstract

Purpose

Design/methodology/approach

Findings

Originality/value

Keywords

Citation

Publisher

Related articles

All feedback is valuable

Report an issue or find answers to frequently asked questions