Search results
1 – 10 of 44Koraljka Golub, Pawel Michal Ziolkowski and Goran Zlodi
The study aims to paint a representative picture of the current state of search interfaces of Swedish online museum collections, focussing on search functionalities with…
Abstract
Purpose
The study aims to paint a representative picture of the current state of search interfaces of Swedish online museum collections, focussing on search functionalities with particular reference to subject searching, as well as the use of controlled vocabularies, with the purpose of identifying which improvements of the search interfaces are needed to ensure high-quality information retrieval for the end user.
Design/methodology/approach
In the first step, a set of 21 search interface criteria was identified, based on related research and current standards in the domain of cultural heritage knowledge organization. Secondly, a complete set of Swedish museums that provide online access to their collections was identified, comprising nine cross-search services and 91 individual museums' websites. These 100 websites were each evaluated against the 21 criteria, between 1 July and 31 August 2020.
Findings
Although many standards and guidelines are in place to ensure quality-controlled subject indexing, which in turn support information retrieval of relevant resources (as individual or full search results), the study shows that they are not broadly implemented, resulting in information retrieval failures for the end user. The study also demonstrates a strong need for the implementation of controlled vocabularies in these museums.
Originality/value
This study is a rare piece of research which examines subject searching in online museums; the 21 search criteria and their use in the analysis of the complete set of online collections of a country represents a considerable and unique contribution to the fields of knowledge organization and information retrieval of cultural heritage. Its particular value lies in showing how the needs of end users, many of which are documented and reflected in international standards and guidelines, should be taken into account in designing search tools for these museums; especially so in subject searching, which is the most complex and yet the most common type of search. Much effort has been invested into digitizing cultural heritage collections, but access to them is hindered by poor search functionality. This study identifies which are the most important aspects to improve.
Details
Keywords
Martin Nečaský, Petr Škoda, David Bernhauer, Jakub Klímek and Tomáš Skopal
Semantic retrieval and discovery of datasets published as open data remains a challenging task. The datasets inherently originate in the globally distributed web jungle, lacking…
Abstract
Purpose
Semantic retrieval and discovery of datasets published as open data remains a challenging task. The datasets inherently originate in the globally distributed web jungle, lacking the luxury of centralized database administration, database schemes, shared attributes, vocabulary, structure and semantics. The existing dataset catalogs provide basic search functionality relying on keyword search in brief, incomplete or misleading textual metadata attached to the datasets. The search results are thus often insufficient. However, there exist many ways of improving the dataset discovery by employing content-based retrieval, machine learning tools, third-party (external) knowledge bases, countless feature extraction methods and description models and so forth.
Design/methodology/approach
In this paper, the authors propose a modular framework for rapid experimentation with methods for similarity-based dataset discovery. The framework consists of an extensible catalog of components prepared to form custom pipelines for dataset representation and discovery.
Findings
The study proposes several proof-of-concept pipelines including experimental evaluation, which showcase the usage of the framework.
Originality/value
To the best of authors’ knowledge, there is no similar formal framework for experimentation with various similarity methods in the context of dataset discovery. The framework has the ambition to establish a platform for reproducible and comparable research in the area of dataset discovery. The prototype implementation of the framework is available on GitHub.
Details
Keywords
Marina Salse, Javier Guallar-Delgado, Núria Jornet-Benito, Maria Pilar Mateo Bretos and Josep Oriol Silvestre-Canut
The purpose of this study is to determine which metadata schemas are used in the museums and university collections of the main universities in Spain and other European countries…
Abstract
Purpose
The purpose of this study is to determine which metadata schemas are used in the museums and university collections of the main universities in Spain and other European countries. Although libraries and archives are also university memory institutions (according to a Galleries, Libraries, Archives and Museums perspective), their collections are not included in this study because their metadata systems are highly standardized and their inclusion would, therefore, skew our understanding of the diverse realities that the study aims to capture.
Design/methodology/approach
The analysis has three components. The first is a bibliographic review based on Web of Science. The second is a direct survey of the individuals responsible for university collections to understand their internal work and documentation systems. Finally, the results obtained are complemented by an analysis of collective university heritage portals in Europe.
Findings
The results of this study confirmed the hypothesis that isolation and a lack of resources are still major issues in many cases. Increasing digitalization and the desire to participate in content aggregation systems are forcing change, although the responsibility for that change at universities is still vague.
Originality/value
Universities, particularly those with a long history, have an important heritage whose parts are often scattered or hidden. Although many contemporary academic publications have focused on the dissemination of university collections, this study focuses on the representation of information based on the conviction that good metadata are essential for dissemination.
Details
Keywords
Koraljka Golub, Osma Suominen, Ahmed Taiye Mohammed, Harriet Aagaard and Olof Osterman
In order to estimate the value of semi-automated subject indexing in operative library catalogues, the study aimed to investigate five different automated implementations of an…
Abstract
Purpose
In order to estimate the value of semi-automated subject indexing in operative library catalogues, the study aimed to investigate five different automated implementations of an open source software package on a large set of Swedish union catalogue metadata records, with Dewey Decimal Classification (DDC) as the target classification system. It also aimed to contribute to the body of research on aboutness and related challenges in automated subject indexing and evaluation.
Design/methodology/approach
On a sample of over 230,000 records with close to 12,000 distinct DDC classes, an open source tool Annif, developed by the National Library of Finland, was applied in the following implementations: lexical algorithm, support vector classifier, fastText, Omikuji Bonsai and an ensemble approach combing the former four. A qualitative study involving two senior catalogue librarians and three students of library and information studies was also conducted to investigate the value and inter-rater agreement of automatically assigned classes, on a sample of 60 records.
Findings
The best results were achieved using the ensemble approach that achieved 66.82% accuracy on the three-digit DDC classification task. The qualitative study confirmed earlier studies reporting low inter-rater agreement but also pointed to the potential value of automatically assigned classes as additional access points in information retrieval.
Originality/value
The paper presents an extensive study of automated classification in an operative library catalogue, accompanied by a qualitative study of automated classes. It demonstrates the value of applying semi-automated indexing in operative information retrieval systems.
Details
Keywords
Koraljka Golub, Jenny Bergenmar and Siska Humelsjö
This article aims to help ensure high-quality subject access to Swedish lesbian, gay, bisexual, transgender, queer and intersexual (LGBTQI) fiction, and aims to identify…
Abstract
Purpose
This article aims to help ensure high-quality subject access to Swedish lesbian, gay, bisexual, transgender, queer and intersexual (LGBTQI) fiction, and aims to identify challenges that librarians consider important to address, on behalf of themselves and end users.
Design/methodology/approach
A web-based questionnaire comprising 35 closed and open questions, 22 of which were required, was sent via online channels in January 2022. By the survey closing date, 20 March 2022, 82 responses had been received. The study was intended to complement an earlier study targeting end users.
Findings
Both this study of librarians and the previous study of end users have painted a dismal image of online search services when it comes to searching for LGBTQI fiction. The need to consult different channels (e.g. social media, library catalogues and friends), the inability to search more specifically than for the broad LGBTQI category and suboptimal search interfaces were among the commonly reported issues. The results of these studies are used to inform the development of a dedicated Swedish LGBTQI fiction database with an online search interface.
Originality/value
The subject searching of fiction via online services is usually limited to genre with facets for time and place, while users are often seeking characteristics such as pacing, characterization, storyline, frame/setting, tone and language/style. LGBTQI fiction is even more challenging to search because indexing practices are not really being standardized or disseminated worldwide. This study helps address this important gap, in both research and practical applications.
Details
Keywords
Elisa Banfi and Arnaud Gaudinat
The purpose of this paper is to investigate how Swiss public libraries are experiencing a normative revolution connected to new cataloging standards, such as RDA and the…
Abstract
Purpose
The purpose of this paper is to investigate how Swiss public libraries are experiencing a normative revolution connected to new cataloging standards, such as RDA and the FRBRization of catalogs.
Design/methodology/approach
Thanks to semi-structured interviews, the paper analyzes the current positioning of Swiss public libraries on the “bibliographic transition” issue by using a case study of the network of municipal libraries in Geneva.
Findings
In Switzerland, the federal and multi-linguistic structure of the library networks increases the organizational obstacles to the adoption of new cataloging principles and formats. At the local level, the Swiss municipal libraries have to cope with this complexity to transform their structures and continue to offer competitive and effective services to their users.
Practical implications
The paper proposes six scenarios of technology watershed for the analyzed case study and their consequences for cataloging standards and rules.
Social implications
The paper shows how the adoption of technological and conceptual innovations has to be done in the face of real organizational and administrative constraints, especially in the case of public lending libraries.
Originality/value
The paper analyzes at the empirical and theoretical levels how, especially in Switzerland, the variety of governance levels and linguistic areas have made strategizing more complex for public lending libraries.
Details
Keywords
Claudia Grisales Bohorquez, Lian Ruan and Kate Williams
This paper aims to understand how a special library helped firefighters in Illinois navigate the digital revolution by evidencing the elements and forms of work that made its…
Abstract
Purpose
This paper aims to understand how a special library helped firefighters in Illinois navigate the digital revolution by evidencing the elements and forms of work that made its innovative services possible.
Design/methodology/approach
The authors examine the history of a special library through a community informatics lens, drawing from sociomaterial perspectives to highlight forms of work often invisible in digital innovation. Data was collected through documentary revision, oral histories and semi-structured interviews. Deductive-inductive coding and constant comparative analysis was used in the analysis.
Findings
A historical narrative of the library between 1990 and 2021 highlights three sociotechnical innovations that assisted firefighters through the digital revolution: the facilitated collection, the co-created collection and the inside-out library. To develop these innovations the library drew from institutional relations, personal relations, grants, labor, knowledge of firefighters and technology. Various forms of articulation work brought these elements together to create innovative services.
Originality/value
The role of special libraries in addressing the digital divide has not been sufficiently detailed so far; this paper is a contribution in that direction. It also has practical value for professionals working in specialized libraries and information centers.
Details
Keywords
Genya Morgan O’Gara, Liz Woolcott, Elizabeth Joan Kelly, Caroline Muglia, Ayla Stein and Santi Thompson
The purpose of this paper is to highlight the initial top-level findings of a year-long comprehensive needs assessment, conducted with the digital library community, to reveal…
Abstract
Purpose
The purpose of this paper is to highlight the initial top-level findings of a year-long comprehensive needs assessment, conducted with the digital library community, to reveal reuse assessment practices and requirements for digital assets held by cultural heritage and research organizations. The type of assessment examined is in contrast to traditional library analytics, and does not focus on access statistics, but rather on how users utilize and transform unique materials from digital collections.
Design/methodology/approach
This paper takes a variety of investigative approaches to explore the current landscape, and future needs, of digital library reuse assessment. This includes the development and analysis of pre- and post-study surveys, in-person and virtual focus group sessions, a literature review, and the incorporation of community and advisory board feedback.
Findings
The digital library community is searching for ways to better understand how materials are reused and repurposed. This paper shares the initial quantitative and qualitative analysis and results of a community needs assessment conducted in 2017 and 2018 that illuminates the current and hoped for landscape of digital library reuse assessment, its strengths, weaknesses and community applications.
Originality/value
In so far as the authors are aware, this is the first paper to examine with a broad lens the reuse assessment needs of the digital library community. The preliminary analysis and initial findings have not been previously published.
Details
Keywords
The purpose of this study is to survey the landscape of online collections of digital games.
Abstract
Purpose
The purpose of this study is to survey the landscape of online collections of digital games.
Design/methodology/approach
First, the study identifies existing sites hosting collections and criteria that make a collection valuable for research, then it reports on sites that fit the criteria and analyzes trends.
Findings
Most sites provide simple binary downloads, but some choose encapsulation. Common metadata terms consistently include genre, year of release and publisher. Most sites claim the right to provide their collections as “abandonware,” but remove games if they are asked to.
Research limitations/implications
This study was conducted using a very limited subcategory of digital games, which could be expanded in other studies. Future research may require a multilingual team to account for collections based in non–English-speaking countries. Direct communication with sites’ management may be valuable in the future as well, but was not conducted in this study.
Practical implications
The study identifies practices that have developed organically in this field without any guiding standards. Understanding these may aid in Humanities research into digital games, as well as potential collection development in the future.
Social implications
Digital games are increasingly important as cultural artifacts, and there is a growing effort to preserve them for the future, but there are no standards for collecting and providing them. Understanding how this is currently done can help in providing access into the future for both casual and analytical use.
Originality/value
While game preservation is a growing and active field of research, no study has been published in recent years on this particular subject. It will be valuable for the development of future collections and for research using current ones.
Details
Keywords
Matjaž Kragelj and Mirjana Kljajić Borštnar
The purpose of this study is to develop a model for automated classification of old digitised texts to the Universal Decimal Classification (UDC), using machine-learning methods.
Abstract
Purpose
The purpose of this study is to develop a model for automated classification of old digitised texts to the Universal Decimal Classification (UDC), using machine-learning methods.
Design/methodology/approach
The general research approach is inherent to design science research, in which the problem of UDC assignment of the old, digitised texts is addressed by developing a machine-learning classification model. A corpus of 70,000 scholarly texts, fully bibliographically processed by librarians, was used to train and test the model, which was used for classification of old texts on a corpus of 200,000 items. Human experts evaluated the performance of the model.
Findings
Results suggest that machine-learning models can correctly assign the UDC at some level for almost any scholarly text. Furthermore, the model can be recommended for the UDC assignment of older texts. Ten librarians corroborated this on 150 randomly selected texts.
Research limitations/implications
The main limitations of this study were unavailability of labelled older texts and the limited availability of librarians.
Practical implications
The classification model can provide a recommendation to the librarians during their classification work; furthermore, it can be implemented as an add-on to full-text search in the library databases.
Social implications
The proposed methodology supports librarians by recommending UDC classifiers, thus saving time in their daily work. By automatically classifying older texts, digital libraries can provide a better user experience by enabling structured searches. These contribute to making knowledge more widely available and useable.
Originality/value
These findings contribute to the field of automated classification of bibliographical information with the usage of full texts, especially in cases in which the texts are old, unstructured and in which archaic language and vocabulary are used.
Details