Search results

Article
Publication date: 13 June 2008

Anestis Sitas and Sarantos Kapidakis

Abstract

Purpose

The purpose of this paper is to focus on the algorithms used to detect duplicate records in bibliographic databases.

Design/methodology/approach

Individual algorithms, their application process for duplicate detection and their results are described based on available literature (published articles), information found at various library web sites and follow‐up e‐mail communications.

Findings

Algorithms are categorized according to whether they are applied as a single-step process or as two consecutive steps. The results of deletion, merging, and temporary and virtual consolidation of duplicate records are studied.

Originality/value

The paper presents an overview of duplicate detection algorithms and an up‐to‐date account of their application in different library systems.

Details

Library Hi Tech, vol. 26 no. 2
Type: Research Article
ISSN: 0737-8831

Article
Publication date: 1 July 2006

Anestis Sitas

Abstract

Purpose

The purpose of this article is to describe CDSware (CERN Document Server Software), a software package developed at CERN.

Design/methodology/approach

CDSware supports the creation of electronic preprint servers, Online Public Access Catalogs (OPACs) and document systems on the web. It complies with the OAI‐PMH (Open Archives Initiative Protocol for Metadata Harvesting) and uses MARC 21 as its underlying bibliographic standard. It is open source software, licensed under the terms of the GNU General Public License.

Findings

CDSware was created to handle large repositories containing various types of material, such as descriptions of museum objects and collections of confidential or public documents.

Practical implications

All technical details of the software are described to enable comparison with other open source software for the management and bibliographic organization of digitized content.

Originality/value

The paper presents a detailed description of highly developed software, both technically and bibliographically, to help libraries decide which open source software is most suitable for their digitization project.

Details

Library Hi Tech, vol. 24 no. 3
Type: Research Article
ISSN: 0737-8831
