Search results
Anestis Sitas and Sarantos Kapidakis
Abstract
Purpose
The purpose of this paper is to examine the algorithms used to detect duplicate records in bibliographic databases.
Design/methodology/approach
Individual algorithms, their application process for duplicate detection and their results are described based on available literature (published articles), information found at various library web sites and follow‐up e‐mail communications.
Findings
Algorithms are categorized by whether they are applied as a single-step process or as two consecutive steps. The outcomes of duplicate handling are also studied: deletion, merging, and temporary or virtual consolidation of duplicate records.
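As an illustration only, a two-step process of the kind described above might be sketched as follows. This is not taken from any of the surveyed library systems: the field names (`id`, `title`, `author`, `year`), the match key, and the 0.9 threshold are all assumptions made for the example. Step 1 groups records by a coarse match key; step 2 compares the candidates within each group in more detail.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations
import re


def normalize(text):
    """Lowercase and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def match_key(record):
    """Step 1 key (hypothetical): first 20 normalized title characters plus year."""
    return normalize(record["title"])[:20] + "|" + str(record["year"])


def similarity(a, b):
    """Similarity in [0, 1] between two strings after normalization."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def find_duplicates(records, threshold=0.9):
    """Return pairs of record ids judged to be duplicates."""
    # Step 1: group records that share a coarse match key.
    groups = defaultdict(list)
    for record in records:
        groups[match_key(record)].append(record)

    # Step 2: compare candidates within each group field by field.
    pairs = []
    for candidates in groups.values():
        for a, b in combinations(candidates, 2):
            if (similarity(a["title"], b["title"]) >= threshold
                    and similarity(a["author"], b["author"]) >= threshold):
                pairs.append((a["id"], b["id"]))
    return pairs


# Made-up sample records, for illustration only.
records = [
    {"id": 1, "title": "Introduction to Cataloguing", "author": "Smith, John", "year": 1999},
    {"id": 2, "title": "Introduction to cataloguing.", "author": "Smith, John", "year": 1999},
    {"id": 3, "title": "Introduction to Cataloguing", "author": "Jones, Mary", "year": 1999},
    {"id": 4, "title": "Library Automation", "author": "Smith, John", "year": 2001},
]

print(find_duplicates(records))  # [(1, 2)]
```

Records 1, 2 and 3 share a match key, but only records 1 and 2 pass the detailed comparison. A single-step algorithm would apply one of these checks on its own.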
Originality/value
The paper presents an overview of duplicate detection algorithms and the current state of their application in different library systems.
Abstract
Purpose
The purpose of this article is to describe CDSware (CERN Document Server Software), software developed at CERN.
Design/methodology/approach
CDSware supports the creation of electronic preprint servers, Online Public Access Catalogs (OPACs) and document systems on the web. It complies with the OAI‐PMH (Open Archives Initiative Protocol for Metadata Harvesting) and uses MARC 21 as its underlying bibliographic standard. It is open source software, licensed under the terms of the GNU General Public License.
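For readers unfamiliar with OAI‐PMH, a harvesting request is an ordinary HTTP request whose query string carries a protocol verb and its arguments. The sketch below builds a `ListRecords` request URL. The base URL is a placeholder, not a real CDSware endpoint, and the helper name `list_records_url` is made up for this example. The `verb`, `metadataPrefix`, `from` and `set` arguments and the `oai_dc` format come from the OAI‐PMH specification.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None, set_spec=None):
    """Build an OAI-PMH ListRecords request URL (hypothetical helper)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        params["from"] = from_date  # harvest only records changed since this date
    if set_spec:
        params["set"] = set_spec  # restrict the harvest to one set
    return base_url + "?" + urlencode(params)


# Placeholder base URL; a real repository publishes its own.
url = list_records_url("https://repository.example.org/oai", from_date="2005-01-01")
print(url)
# https://repository.example.org/oai?verb=ListRecords&metadataPrefix=oai_dc&from=2005-01-01
```

Every OAI‐PMH repository must support `oai_dc`. Whether a given repository also exposes a MARC-based format can be checked with the `ListMetadataFormats` verb.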
Findings
CDSware was created to handle large repositories containing various types of material, such as descriptions of museum objects and collections of confidential or public documents.
Practical implications
The technical details of the software are described to enable comparison with other open source software for the management and bibliographic organization of digitized content.
Originality/value
The paper presents a detailed description of technically and bibliographically advanced software, to help libraries decide which open source software is most suitable for their digitization project.