Search results
1 – 4 of 4Koraljka Golub, Osma Suominen, Ahmed Taiye Mohammed, Harriet Aagaard and Olof Osterman
In order to estimate the value of semi-automated subject indexing in operative library catalogues, the study aimed to investigate five different automated implementations of an…
Abstract
Purpose
In order to estimate the value of semi-automated subject indexing in operative library catalogues, the study aimed to investigate five different automated implementations of an open source software package on a large set of Swedish union catalogue metadata records, with Dewey Decimal Classification (DDC) as the target classification system. It also aimed to contribute to the body of research on aboutness and related challenges in automated subject indexing and evaluation.
Design/methodology/approach
On a sample of over 230,000 records with close to 12,000 distinct DDC classes, an open source tool Annif, developed by the National Library of Finland, was applied in the following implementations: lexical algorithm, support vector classifier, fastText, Omikuji Bonsai and an ensemble approach combing the former four. A qualitative study involving two senior catalogue librarians and three students of library and information studies was also conducted to investigate the value and inter-rater agreement of automatically assigned classes, on a sample of 60 records.
Findings
The best results were achieved using the ensemble approach that achieved 66.82% accuracy on the three-digit DDC classification task. The qualitative study confirmed earlier studies reporting low inter-rater agreement but also pointed to the potential value of automatically assigned classes as additional access points in information retrieval.
Originality/value
The paper presents an extensive study of automated classification in an operative library catalogue, accompanied by a qualitative study of automated classes. It demonstrates the value of applying semi-automated indexing in operative information retrieval systems.
Details
Keywords
Marina Salse, Javier Guallar-Delgado, Núria Jornet-Benito, Maria Pilar Mateo Bretos and Josep Oriol Silvestre-Canut
The purpose of this study is to determine which metadata schemas are used in the museums and university collections of the main universities in Spain and other European countries…
Abstract
Purpose
The purpose of this study is to determine which metadata schemas are used in the museums and university collections of the main universities in Spain and other European countries. Although libraries and archives are also university memory institutions (according to a Galleries, Libraries, Archives and Museums perspective), their collections are not included in this study because their metadata systems are highly standardized and their inclusion would, therefore, skew our understanding of the diverse realities that the study aims to capture.
Design/methodology/approach
The analysis has three components. The first is a bibliographic review based on Web of Science. The second is a direct survey of the individuals responsible for university collections to understand their internal work and documentation systems. Finally, the results obtained are complemented by an analysis of collective university heritage portals in Europe.
Findings
The results of this study confirmed the hypothesis that isolation and a lack of resources are still major issues in many cases. Increasing digitalization and the desire to participate in content aggregation systems are forcing change, although the responsibility for that change at universities is still vague.
Originality/value
Universities, particularly those with a long history, have an important heritage whose parts are often scattered or hidden. Although many contemporary academic publications have focused on the dissemination of university collections, this study focuses on the representation of information based on the conviction that good metadata are essential for dissemination.
Details
Keywords
Lala Hajibayova, Mallory McCorkhill and Timothy D. Bowman
In this study, STEM resources reviewed in Goodreads were investigated to determine their authorship, linguistic characteristics and impact. The analysis reveals gender disparity…
Abstract
Purpose
In this study, STEM resources reviewed in Goodreads were investigated to determine their authorship, linguistic characteristics and impact. The analysis reveals gender disparity favoring titles with male authors.
Design/methodology/approach
This paper applies theoretical concepts of knowledge commons to understand how individuals leverage the affordances of the Goodreads platform to share their perceptions of STEM-related books.
Findings
The analysis reveals gender disparity favoring titles with male authors. Female-authored STEM publications represent popular science nonfiction and juvenile genres. Analysis of the scholarly impact of the reviewed titles revealed that Google Scholar provides broader and more diverse coverage than Web of Science. Linguistic analysis of the reviews revealed the relatively low aesthetic disposition of reviewers with an emphasis on embodied experiences that emerged from the reading.
Originality/value
This study contributes to the understanding of the impact of popular STEM resources as well as the influence of the language of user-generated reviews on production, consumption and discoverability of STEM titles.
Details
Keywords
Farshid Danesh and Somayeh Ghavidel
The purpose of this study was a longitudinal study on knowledge organization (KO) realm structure and cluster concepts and emerging KO events based on co-occurrence analysis.
Abstract
Purpose
The purpose of this study was a longitudinal study on knowledge organization (KO) realm structure and cluster concepts and emerging KO events based on co-occurrence analysis.
Design/methodology/approach
This longitudinal study uses the co-occurrence analysis. This research population includes keywords of articles indexed in the Web of Science Core Collection 1975–1999 and 2000–2018. Hierarchical clustering, multidimensional scaling and co-occurrence analysis were used to conduct the present research. SPSS, UCINET, VOSviewer and NetDraw were used to analyze and visualize data.
Findings
The “Information Technology” in 1975–1999 and the “Information Literacy” in 2000–2018, with the highest frequency, were identified as the most widely used keywords of KO in the world. In the first period, the cluster “Knowledge Management” had the highest centrality, the cluster “Strategic Planning” had the highest density in 2000–2018 and the cluster “Information Retrieval” had the highest centrality and density. The two-dimensional map of KO’s thematic and clustering of KO topics by cluster analysis method indicates that in the periods examined in this study, thematic clusters had much overlap in terms of concept and content.
Originality/value
The present article uses a longitudinal study to examine the KO’s publications in the past half-century. This paper also uses hierarchical clustering and multidimensional scaling methods. Studying the concepts and thematic trends in KO can impact organizing information as the core of libraries, museums and archives. Also, it can scheme information organizing and promote knowledge management. Because the results obtained from this article can help KO policymakers determine and design the roadmap, research planning, and micro and macro budgeting processes.
Details