The purpose of this paper is to analyse text mining (TM) literature indexed in the Web of Science (WoS) under the “Information Science Library Science” subcategory. More specifically, it analyses the chronological growth of TM literature and identifies the major countries, institutions, departments and individuals contributing to it. Collaboration in TM research is also analysed.
The bibliographic and citation data required for this research were retrieved from the WoS database. Because TM is a multidisciplinary field, the search was restricted to the “Information Science Library Science” subcategory in the WoS. A comprehensive query statement covering all synonyms of “text mining” was prepared using the Boolean operator “OR”. Microsoft Excel and HistCite software were used for data analysis. Pajek and VOSviewer were used for data visualization.
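As an illustration of how such an OR-combined query statement can be assembled, the following minimal Python sketch joins a list of search terms into one Boolean clause. The synonym list is hypothetical, since the terms actually used are not listed here, and the `TS` (topic) field tag is an assumption about the search field; the helper name `build_or_query` is likewise introduced only for this example.

```python
def build_or_query(terms, field_tag="TS"):
    """Join search terms into a single Boolean OR clause.

    Each term is wrapped in double quotes so that multi-word
    phrases are searched as exact phrases.
    """
    quoted = [f'"{term}"' for term in terms]
    return f"{field_tag}=(" + " OR ".join(quoted) + ")"


# Hypothetical synonyms of "text mining"; not the study's actual term list.
synonyms = ["text mining", "text data mining", "text analytics"]

query = build_or_query(synonyms)
print(query)
```

The resulting string could then be pasted into an advanced search form and combined with a subject-category filter; the exact filter syntax depends on the database interface and is not shown here.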
It was found that the USA is the major producer of TM research literature, and that the highest number of papers was published in the Journal of the American Medical Informatics Association. Among the top ten institutions publishing TM literature, Columbia University ranked first both in number of articles and in citations received. It was also observed that six of the top ten subdivisions of institutions belong to medicine, medical informatics or biomedical information. H.C. Chen and C. Friedman were found to be the most prolific authors.
The paper analyses only articles on TM published during 1999-2013 and indexed in WoS under the subcategory ‘Information Science Library Science’.
The paper is based on empirical data gathered exclusively for this research.
Nagarkar, S.P. and Kumbhar, R. (2015), "Text mining: An analysis of research published under the subject category ‘Information Science Library Science’ in Web of Science Database during 1999-2013", Library Review, Vol. 64 No. 3, pp. 248-262. https://doi.org/10.1108/LR-08-2014-0091
Emerald Group Publishing Limited
Copyright © 2015, Emerald Group Publishing Limited