Search results
1 – 10 of over 10000This paper seeks to examine image retrieval within two different contexts: a monolingual context where the language of the query is the same as the indexing language and a…
Abstract
Purpose
This paper seeks to examine image retrieval within two different contexts: a monolingual context where the language of the query is the same as the indexing language and a multilingual context where the language of the query is different from the indexing language. The study also aims to compare two different approaches for the indexing of ordinary images representing common objects: traditional image indexing with the use of a controlled vocabulary and free image indexing using uncontrolled vocabulary.
Design/methodology/approach
This research uses three data collection methods. An analysis of the indexing terms was employed in order to examine the multiplicity of term types assigned to images. A simulation of the retrieval process involving a set of 30 images was performed with 60 participants. The quantification of the retrieval performance of each indexing approach was based on the usability measures, that is, effectiveness, efficiency and satisfaction of the user. Finally, a questionnaire was used to gather information on searcher satisfaction during and after the retrieval process.
Findings
The results of this research are twofold. The analysis of indexing terms associated with all the 3,950 images provides a comprehensive description of the characteristics of the four non‐combined indexing forms used for the study. Also, the retrieval simulation results offers information about the relative performance of the six indexing forms (combined and non‐combined) in terms of their effectiveness, efficiency (temporal and human) and the image searcher's satisfaction.
Originality/value
The findings of the study suggest that, in the near future, the information systems could benefit from allowing an increased coexistence of controlled vocabularies and uncontrolled vocabularies, resulting from collaborative image tagging, for example, and giving the users the possibility to dynamically participate in the image‐indexing process, in a more user‐centred way.
Details
Keywords
Mari Vállez, Rafael Pedraza-Jiménez, Lluís Codina, Saúl Blanco and Cristòfol Rovira
Controlled vocabularies play an important role in information retrieval. Numerous studies have shown that conceptual searches based on vocabularies are more effective than keyword…
Abstract
Purpose
Controlled vocabularies play an important role in information retrieval. Numerous studies have shown that conceptual searches based on vocabularies are more effective than keyword searches, at least in certain contexts. Consequently, new ways must be found to improve controlled vocabularies. The purpose of this paper is to present a semi-automatic model for updating controlled vocabularies through the use of a text corpus and the analysis of query logs.
Design/methodology/approach
An experimental development is presented in which, first, the suitability of a controlled vocabulary to a text corpus is examined. The keywords entered by users to access the text corpus are then compared with the descriptors used to index it. Finally, both the query logs and text corpus are processed to obtain a set of candidate terms to update the controlled vocabulary.
Findings
This paper describes a model applicable both in the context of the text corpus of an online academic journal and to repositories and intranets. The model is able to: first, identify the queries that led users from a search engine to a relevant document; and second, process these queries to identify candidate terms for inclusion in a controlled vocabulary.
Research limitations/implications
Ideally, the model should be used in controlled web environments, such as repositories, intranets or academic journals.
Social implications
The proposed model directly improves the indexing process by facilitating the maintenance and updating of controlled vocabularies. It so doing, it helps to optimise access to information.
Originality/value
The proposed model takes into account the perspective of users by mining queries in order to propose candidate terms for inclusion in a controlled vocabulary.
Details
Keywords
Brian Matthews, Catherine Jones, Bartłomiej Puzoń, Jim Moon, Douglas Tudhope, Koraljka Golub and Marianne Lykke Nielsen
Traditional subject indexing and classification are considered infeasible in many digital collections. This paper seeks to investigate ways of enhancing social tagging via…
Abstract
Purpose
Traditional subject indexing and classification are considered infeasible in many digital collections. This paper seeks to investigate ways of enhancing social tagging via knowledge organization systems, with a view to improving the quality of tags for increased information discovery and retrieval performance.
Design/methodology/approach
Enhanced tagging interfaces were developed for exemplar online repositories, and trials were undertaken with author and reader groups to evaluate the effectiveness of tagging augmented with control vocabulary for subject indexing of papers in online repositories.
Findings
The results showed that using a knowledge organisation system to augment tagging does appear to increase the effectiveness of non‐specialist users (that is, without information science training) in subject indexing.
Research limitations/implications
While limited by the size and scope of the trials undertaken, these results do point to the usefulness of a mixed approach in supporting the subject indexing of online resources.
Originality/value
The value of this work is as a guide to future developments in the practical support for resource indexing in online repositories.
Details
Keywords
Koraljka Golub, Pawel Michal Ziolkowski and Goran Zlodi
The study aims to paint a representative picture of the current state of search interfaces of Swedish online museum collections, focussing on search functionalities with…
Abstract
Purpose
The study aims to paint a representative picture of the current state of search interfaces of Swedish online museum collections, focussing on search functionalities with particular reference to subject searching, as well as the use of controlled vocabularies, with the purpose of identifying which improvements of the search interfaces are needed to ensure high-quality information retrieval for the end user.
Design/methodology/approach
In the first step, a set of 21 search interface criteria was identified, based on related research and current standards in the domain of cultural heritage knowledge organization. Secondly, a complete set of Swedish museums that provide online access to their collections was identified, comprising nine cross-search services and 91 individual museums' websites. These 100 websites were each evaluated against the 21 criteria, between 1 July and 31 August 2020.
Findings
Although many standards and guidelines are in place to ensure quality-controlled subject indexing, which in turn support information retrieval of relevant resources (as individual or full search results), the study shows that they are not broadly implemented, resulting in information retrieval failures for the end user. The study also demonstrates a strong need for the implementation of controlled vocabularies in these museums.
Originality/value
This study is a rare piece of research which examines subject searching in online museums; the 21 search criteria and their use in the analysis of the complete set of online collections of a country represents a considerable and unique contribution to the fields of knowledge organization and information retrieval of cultural heritage. Its particular value lies in showing how the needs of end users, many of which are documented and reflected in international standards and guidelines, should be taken into account in designing search tools for these museums; especially so in subject searching, which is the most complex and yet the most common type of search. Much effort has been invested into digitizing cultural heritage collections, but access to them is hindered by poor search functionality. This study identifies which are the most important aspects to improve.
Details
Keywords
MaryEllen Sievert and Bert R. Boyce
The traditional role of the controlled vocabulary has been to bring together items scattered by a multiplicity of natural language terms. Therefore, it was considered a recall…
Abstract
The traditional role of the controlled vocabulary has been to bring together items scattered by a multiplicity of natural language terms. Therefore, it was considered a recall device. The file structure of current online structure systems now leads to the use of such vocabularies as a precision device. Because of the levels of specificity in the controlled vocabulary, the search for broader concepts means several terms may be ncessary. In this case, the file structure of the retrieval systems requires that for efficient searching the entry points be analyzed and minimized. Complete, rather than partial, entry of controlled terms appears to be inefficient.
Michelle Dalmau, Randall Floyd, Dazhi Jiao and Jenn Riley
Seeks to share with digital library practitioners the development process of an online image collection that integrates the syndetic structure of a controlled vocabulary to…
Abstract
Purpose
Seeks to share with digital library practitioners the development process of an online image collection that integrates the syndetic structure of a controlled vocabulary to improve end‐user search and browse functionality.
Design/methodology/approach
Surveys controlled vocabulary structures and their utility for catalogers and end‐users. Reviews research literature and usability findings that informed the specifications for integration of the controlled vocabulary structure into search and browse functionality. Discusses database functions facilitating query expansion using a controlled vocabulary structure, and web application handling of user queries and results display. Concludes with a discussion of open‐source alternatives and reuse of database and application components in other environments.
Findings
Affirms that structured forms of browse and search can be successfully integrated into digital collections to significantly improve the user's discovery experience. Establishes ways in which the technologies used in implementing enhanced search and browse functionality can be abstracted to work in other digital collection environments.
Originality/value
Significant amounts of research on integrating thesauri structures into search and browse functionalities exist, but examples of online resources that have implemented this approach are few in comparison. The online image collection surveyed in this paper can serve as a model to other designers of digital library resources for integrating controlled vocabularies and metadata structures into more dynamic search and browse functionality for end‐users.
Details
Keywords
Koraljka Golub, Xu Tan, Ying-Hsang Liu and Jukka Tyrkkö
This exploratory study aims to help contribute to the understanding of online information search behaviour of PhD students from different humanities fields, with a focus on…
Abstract
Purpose
This exploratory study aims to help contribute to the understanding of online information search behaviour of PhD students from different humanities fields, with a focus on subject searching.
Design/methodology/approach
The methodology is based on a semi-structured interview within which the participants are asked to conduct both a controlled search task and a free search task. The sample comprises eight PhD students in several humanities disciplines at Linnaeus University, a medium-sized Swedish university from 2020.
Findings
Most humanities PhD students in the study have received training in information searching, but it has been too basic. Most rely on web search engines like Google and Google Scholar for publications' search, and university's discovery system for known-item searching. As these systems do not rely on controlled vocabularies, the participants often struggle with too many retrieved documents that are not relevant. Most only rarely or never use disciplinary bibliographic databases. The controlled search task has shown some benefits of using controlled vocabularies in the disciplinary databases, but incomplete synonym or concept coverage as well as user unfriendly search interface present hindrances.
Originality/value
The paper illuminates an often-forgotten but pervasive challenge of subject searching, especially for humanities researchers. It demonstrates difficulties and shows how most PhD students have missed finding an important resource in their research. It calls for the need to reconsider training in information searching and the need to make use of controlled vocabularies implemented in various search systems with usable search and browse user interfaces.
Details
Keywords
This chapter aims to discuss the issues associated with social indexing as a solution to the challenges of current information organization systems by investigating the quality…
Abstract
Purpose
This chapter aims to discuss the issues associated with social indexing as a solution to the challenges of current information organization systems by investigating the quality and efficacy of social indexing.
Design/methodology/approach
The chapter focuses on the study which compared indexing similarity between two professional groups and also compared social tagging and professional indexing. The study employed the method of the modified vector-based Indexing Consistency Density (ICD) with three different similarity measures: cosine similarity, dot product similarity, and Euclidean distance metric.
Findings
The investigation of social indexing in comparison of professional indexing demonstrates that social tags are more accurate descriptions of resources and reflection of more current terminology than controlled vocabulary. Through the characteristics of social tagging discussed in this chapter, we have a clearer understanding of the extent to which social indexing can be used to replace and improve upon professional indexing.
Research limitations/implications
As investment in professionally developed web directories diminishes, it becomes even more critical to understand the characteristics of social tagging and to obtain benefit from it. In future research, the examination of subjective tags needs to be conducted. A survey or user study on tagging behavior also would help to extend understanding of social indexing practices.
George Macgregor and Emma McCulloch
The purpose of the paper is to provide an overview of the collaborative tagging phenomenon and explore some of the reasons for its emergence.
Abstract
Purpose
The purpose of the paper is to provide an overview of the collaborative tagging phenomenon and explore some of the reasons for its emergence.
Design/methodology/approach
The paper reviews the related literature and discusses some of the problems associated with, and the potential of, collaborative tagging approaches for knowledge organisation and general resource discovery. A definition of controlled vocabularies is proposed and used to assess the efficacy of collaborative tagging. An exposition of the collaborative tagging model is provided and a review of the major contributions to the tagging literature is presented.
Findings
There are numerous difficulties with collaborative tagging systems (e.g. low precision, lack of collocation, etc.) that originate from the absence of properties that characterise controlled vocabularies. However, such systems can not be dismissed. Librarians and information professionals have lessons to learn from the interactive and social aspects exemplified by collaborative tagging systems, as well as their success in engaging users with information management. The future co‐existence of controlled vocabularies and collaborative tagging is predicted, with each appropriate for use within distinct information contexts: formal and informal.
Research limitations/implications
Librarians and information professional researchers should be playing a leading role in research aimed at assessing the efficacy of collaborative tagging in relation to information storage, organisation, and retrieval, and to influence the future development of collaborative tagging systems.
Practical implications
The paper indicates clear areas where digital libraries and repositories could innovate in order to better engage users with information.
Originality/value
At time of writing there were no literature reviews summarising the main contributions to the collaborative tagging research or debate.
Details
Keywords
The ASM Information Retrieval System is currently in process of redesign to incorporate major refinements and improvements resulting from five years of operating experience as…
Abstract
The ASM Information Retrieval System is currently in process of redesign to incorporate major refinements and improvements resulting from five years of operating experience as well as experience and research of others in the field. Principles and methodology underlying these changes are explored for various components of this system, principally vocabulary, links and roles, and computer hardware and programming. The major change is from semantic code to thesaurus as the system vocabulary, and the relative efficiencies and inherent capabilities of each are compared and related to the general principles of vocabulary control.