To read this content please select one of the options below:

A case study for understanding the nature of redundant entities in bibliographic digital libraries

Byung-Won On (Advanced Institutes of Convergence Technology, Seoul National University, Seoul, Republic of Korea)
Gyu Sang Choi (Department of Information and Communication Engineering, Yeungnam University, Gyeongsan, Republic of Korea)
Soo-Mok Jung (College of Science and Technology, Sahmyook University, Seoul, Republic of Korea)

Program: electronic library and information systems

ISSN: 0033-0337

Article publication date: 1 July 2014

287

Abstract

Purpose

The purpose of this paper is to collect and understand the nature of real cases of author name variants that have often appeared in bibliographic digital libraries (DLs) as a case study of the name authority control problem in DLs.

Design/methodology/approach

To find a sample of name variants across DLs (e.g. DBLP and ACM) and in a single DL (e.g. ACM), the approach is based on two bipartite matching algorithms: Maximum Weighted Bipartite Matching and Maximum Cardinality Bipartite Matching.

Findings

First, the authors validated the effectiveness and efficiency of the bipartite matching algorithms. The authors also studied the nature of real cases of author name variants that had been found across DLs (e.g. ACM, CiteSeer and DBLP) and in a single DL.

Originality/value

To the best of the authors knowledge, there is less research effort to understand the nature of author name variants shown in DLs. A thorough analysis can help focus research effort on real problems that arise when the authors perform duplicate detection methods.

Keywords

Acknowledgements

This work was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) grant funded by the Ministry of Science, ICT, and Future Planning (No. 2013012524) for the first author, by Basic Science Research Program through the National Research Foundation of Korea (NRF) grant funded by the Ministry of Education, Science, and Technology (No. 20110008686) for the second author, and by the Energy Efficiency & Resources of the Korea Institute of Energy Technology Evaluation and Planning (KETEP) grant funded by the Korea government Ministry of Knowledge Economy (No. 20132010101800) for the first and second authors.

Citation

On, B.-W., Sang Choi, G. and Jung, S.-M. (2014), "A case study for understanding the nature of redundant entities in bibliographic digital libraries", Program: electronic library and information systems, Vol. 48 No. 3, pp. 246-271. https://doi.org/10.1108/PROG-07-2012-0037

Publisher

:

Emerald Group Publishing Limited

Copyright © 2014, Emerald Group Publishing Limited

Related articles