Search results

1 – 10 of 46
Article
Publication date: 13 October 2023

Judit Gárdos, Julia Egyed-Gergely, Anna Horváth, Balázs Pataki, Roza Vajda and András Micsik

The present study is about generating metadata to enhance thematic transparency and facilitate research on interview collections at the Research Documentation Centre, Centre for…

Abstract

Purpose

The present study is about generating metadata to enhance thematic transparency and facilitate research on interview collections at the Research Documentation Centre, Centre for Social Sciences (TK KDK) in Budapest. It explores the use of artificial intelligence (AI) in producing, managing and processing social science data and its potential to generate useful metadata to describe the contents of such archives on a large scale.

Design/methodology/approach

The authors combined manual and automated/semi-automated methods of metadata development and curation. The authors developed a suitable domain-oriented taxonomy to classify a large text corpus of semi-structured interviews. To this end, the authors adapted the European Language Social Science Thesaurus (ELSST) to produce a concise, hierarchical structure of topics relevant in social sciences. The authors identified and tested the most promising natural language processing (NLP) tools supporting the Hungarian language. The results of manual and machine coding will be presented in a user interface.

Findings

The study describes how an international social scientific taxonomy can be adapted to a specific local setting and tailored to be used by automated NLP tools. The authors show the potential and limitations of existing and new NLP methods for thematic assignment. The current possibilities of multi-label classification in social scientific metadata assignment are discussed, i.e. the problem of automated selection of relevant labels from a large pool.

Originality/value

Interview materials have not yet been used for building manually annotated training datasets for automated indexing of scientifically relevant topics in a data repository. Comparing various automated-indexing methods, this study shows a possible implementation of a researcher tool supporting custom visualizations and the faceted search of interview collections.

Article
Publication date: 6 February 2024

Somayeh Tamjid, Fatemeh Nooshinfard, Molouk Sadat Hosseini Beheshti, Nadjla Hariri and Fahimeh Babalhavaeji

The purpose of this study is to develop a domain independent, cost-effective, time-saving and semi-automated ontology generation framework that could extract taxonomic concepts…

Abstract

Purpose

The purpose of this study is to develop a domain independent, cost-effective, time-saving and semi-automated ontology generation framework that could extract taxonomic concepts from unstructured text corpus. In the human disease domain, ontologies are found to be extremely useful for managing the diversity of technical expressions in favour of information retrieval objectives. The boundaries of these domains are expanding so fast that it is essential to continuously develop new ontologies or upgrade available ones.

Design/methodology/approach

This paper proposes a semi-automated approach that extracts entities/relations via text mining of scientific publications. Text mining-based ontology (TmbOnt)-named code is generated to assist a user in capturing, processing and establishing ontology elements. This code takes a pile of unstructured text files as input and projects them into high-valued entities or relations as output. As a semi-automated approach, a user supervises the process, filters meaningful predecessor/successor phrases and finalizes the demanded ontology-taxonomy. To verify the practical capabilities of the scheme, a case study was performed to drive glaucoma ontology-taxonomy. For this purpose, text files containing 10,000 records were collected from PubMed.

Findings

The proposed approach processed over 3.8 million tokenized terms of those records and yielded the resultant glaucoma ontology-taxonomy. Compared with two famous disease ontologies, TmbOnt-driven taxonomy demonstrated a 60%–100% coverage ratio against famous medical thesauruses and ontology taxonomies, such as Human Disease Ontology, Medical Subject Headings and National Cancer Institute Thesaurus, with an average of 70% additional terms recommended for ontology development.

Originality/value

According to the literature, the proposed scheme demonstrated novel capability in expanding the ontology-taxonomy structure with a semi-automated text mining approach, aiming for future fully-automated approaches.

Details

The Electronic Library , vol. 42 no. 2
Type: Research Article
ISSN: 0264-0473

Keywords

Article
Publication date: 13 December 2023

Sofia Martynovich

The interpretation of any emerging form or period in art history was never a trivial task. However, in the case of digital art, technology, becoming an integral part, multiplied…

Abstract

Purpose

The interpretation of any emerging form or period in art history was never a trivial task. However, in the case of digital art, technology, becoming an integral part, multiplied the complexity of describing, systematizing and evaluating it. This article investigates the most common metadata standards for the documentation of art as a broad category and suggests possible next steps toward an extended metadata standard for digital art.

Design/methodology/approach

Describing several techno-cultural phenomena formed in the last decade, manifesting the extendibility of digital art (its ability to be easily extended across multiple modalities), the article, at first, points to the long overdue need to re-evaluate the standards around it. Then it suggests a deeper analysis through a comparative study. In the scope of the study three artworks, The Arnolfini Portrait (Jan van Eyck), an iconic example of the early Renaissance, The World's First Collaborative Sentence (Douglas Davis), a classic example of early Internet art and Fake It Till You Make It (Maya Man), a prominent example of the blockchain art, are examined following the structure of the VRA Core 4.0 standard.

Findings

The comparative study demonstrates that digital art is more multi-semantic than traditional physical art, and requires new taxonomies as well as approaches for data acquisition.

Originality/value

Acknowledging that digital art simply has not yet evolved to the stage of being systematically collected by cultural institutions for documentation, curation and preservation, but otherwise, in the past few years, it has been at the front-center of social, economic and technological trends, the article suggests looking for hints on the future-proof extended metadata standard in some of those trends.

Details

Journal of Documentation, vol. 80 no. 2
Type: Research Article
ISSN: 0022-0418

Keywords

Article
Publication date: 24 January 2023

Hossein Motahari-Nezhad

No study has investigated the effects of different parameters on publication bias in meta-analyses using a machine learning approach. Therefore, this study aims to evaluate the…

Abstract

Purpose

No study has investigated the effects of different parameters on publication bias in meta-analyses using a machine learning approach. Therefore, this study aims to evaluate the impact of various factors on publication bias in meta-analyses.

Design/methodology/approach

An electronic questionnaire was created according to some factors extracted from the Cochrane Handbook and AMSTAR-2 tool to identify factors affecting publication bias. Twelve experts were consulted to determine their opinion on the importance of each factor. Each component was evaluated based on its content validity ratio (CVR). In total, 616 meta-analyses comprising 1893 outcomes from PubMed that assessed the presence of publication bias in their reported outcomes were randomly selected to extract their data. The multilayer perceptron (MLP) technique was used in IBM SPSS Modeler 18.0 to construct a prediction model. 70, 15 and 15% of the data were used for the model's training, testing and validation partitions.

Findings

There was a publication bias in 968 (51.14%) outcomes. The established model had an accuracy rate of 86.1%, and all pre-selected nine variables were included in the model. The results showed that the number of databases searched was the most important predictive variable (0.26), followed by the number of searches in the grey literature (0.24), search in Medline (0.17) and advanced search with numerous operators (0.13).

Practical implications

The results of this study can help clinical researchers minimize publication bias in their studies, leading to improved evidence-based medicine.

Originality/value

To the best of the author’s knowledge, this is the first study to model publication bias using machine learning.

Details

Aslib Journal of Information Management, vol. 76 no. 2
Type: Research Article
ISSN: 2050-3806

Keywords

Open Access
Article
Publication date: 5 April 2024

Miquel Centelles and Núria Ferran-Ferrer

Develop a comprehensive framework for assessing the knowledge organization systems (KOSs), including the taxonomy of Wikipedia and the ontologies of Wikidata, with a specific…

Abstract

Purpose

Develop a comprehensive framework for assessing the knowledge organization systems (KOSs), including the taxonomy of Wikipedia and the ontologies of Wikidata, with a specific focus on enhancing management and retrieval with a gender nonbinary perspective.

Design/methodology/approach

This study employs heuristic and inspection methods to assess Wikipedia’s KOS, ensuring compliance with international standards. It evaluates the efficiency of retrieving non-masculine gender-related articles using the Catalan Wikipedian category scheme, identifying limitations. Additionally, a novel assessment of Wikidata ontologies examines their structure and coverage of gender-related properties, comparing them to Wikipedia’s taxonomy for advantages and enhancements.

Findings

This study evaluates Wikipedia’s taxonomy and Wikidata’s ontologies, establishing evaluation criteria for gender-based categorization and exploring their structural effectiveness. The evaluation process suggests that Wikidata ontologies may offer a viable solution to address Wikipedia’s categorization challenges.

Originality/value

The assessment of Wikipedia categories (taxonomy) based on KOS standards leads to the conclusion that there is ample room for improvement, not only in matters concerning gender identity but also in the overall KOS to enhance search and retrieval for users. These findings bear relevance for the design of tools to support information retrieval on knowledge-rich websites, as they assist users in exploring topics and concepts.

Article
Publication date: 18 March 2024

Shiv Shakti Ghosh and Sunil Kumar Chatterjee

This study presents a review based research framework that aims to influence memory institutions in their projects on digital storytelling from digitized ancient travel records…

Abstract

Purpose

This study presents a review based research framework that aims to influence memory institutions in their projects on digital storytelling from digitized ancient travel records. This study aims to influence research and policymaking related to design and delivery of services based on memory institutions’ collections of historical records.

Design/methodology/approach

The demonstrated research framework has been synthesized using inputs from a review of existing studies on the domain accompanied by a short survey created for collecting the opinion of selected experts. Studies demonstrating utilization of semantic web technologies and those that can influence policymaking related to digital storytelling were primarily reviewed.

Findings

The core tasks behind digital storytelling vary depending on the project goals. So, a two-part framework had to be proposed that covers the generic fundamental tasks with diverse applicability and digital storytelling related specific tasks separately. Also during the review, it was found that studies demonstrating the use of travel records for digital storytelling were less in number compared to studies using digital storytelling for tourism in general.

Originality/value

The demonstrated research framework can guide memory institutions in exposing their travel-related holdings to a wider audience using innovative semantic web technologies and open up avenues for future empirical research thereby adding to the novelty of the presented research. Also, reviews of articles on digital storytelling or digital humanities in general exist, but, review of digital storytelling initiatives focusing specifically on tourism and travel literature is scarce.

Details

Global Knowledge, Memory and Communication, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2514-9342

Keywords

Book part
Publication date: 31 January 2024

Georg Grossmann, Alice Beale, Harkaran Singh, Ben Smith and Julie Nichols

Cultural heritage archiving is experiencing an increase in digitalisations of artefacts in the last 15 years. The reason behind this trend is a demand for providing information…

Abstract

Cultural heritage archiving is experiencing an increase in digitalisations of artefacts in the last 15 years. The reason behind this trend is a demand for providing information about the artefact in a more accessible way to the audience, for example, through online delivery or virtual reality. Other reasons might be to simplify and automate the management of artefacts. Having a ‘digital copy’ of artefacts, allows one to search an archive and plan its storage and dissemination in a comprehensive manner. With the increased digitalisation comes an increased use of artificial intelligence [AI] applications. AI can be very beneficial in classifying artefacts automatically through machine learning [ML] and natural language processing [NLP]. For example, an algorithm can identify the source and age of artefacts based on an image and can do this much faster for a large collection of photos than a human. Although AI provides many benefits, it also presents challenges: Sophisticated AI techniques require certain insights on how they work, need specialists to customise a solution, and require an existing large dataset to train an algorithm. Another challenge is that typical AI techniques are regarded as black boxes, which means they decide, but it is not obvious why a decision has been made. This chapter describes a project in collaboration with the South Australian Museum [SAM] on the application of AI to extract material lists from a description of artefacts. A large dataset to train an algorithm did not exist, and hence, a customised approach was required. The outcome of the project was the application of NLP in combination with easy-to-customise rules that can be applied by non-IT specialists. The resulting prototype achieved the extraction of materials from a large list of artefacts within seconds and a flexible solution that can be applied on other collections in the future.

Details

Data Curation and Information Systems Design from Australasia: Implications for Cataloguing of Vernacular Knowledge in Galleries, Libraries, Archives, and Museums
Type: Book
ISBN: 978-1-80455-615-3

Keywords

Article
Publication date: 3 October 2023

Tatjana Aparac-Jelušić

The main purpose of the paper is to offer a personal view on the development of documentation/information and documentation (IuD) in Germany, while pointing out the need to…

Abstract

Purpose

The main purpose of the paper is to offer a personal view on the development of documentation/information and documentation (IuD) in Germany, while pointing out the need to further investigate the specific features of its development paths. The methodology is based on critical review of the available literature sources in the German language.

Design/methodology/approach

The paper uses the method of critical review of published documents in journals (especially in Nachrichten für Dokumentation), books and reports of state and provincial administrations that are directly related to monitoring and/or encouraging the development of the young field of documentation.

Findings

The paper offers a review and interpretation of the most significant development phases, the contributions of individuals and the influence of the official state and information policy based on the consulted sources.

Research limitations/implications

This research is limited to the literature written in German language.

Practical implications

The paper could be of interest to researchers and professionals who are interested in the development of documentation.

Social implications

The paper covers the period after the World War II until the end of 1980s that is especially interesting from the social point of view in divided Germany.

Originality/value

To the author’s knowledge, there is no comprehensive history of documentation in German-speaking countries written in English. This paper is the result of a research project started three years ago with colleagues from Germany, Austria and Switzerland, that aims to cover all phases of the appearance and development of information science in German-speaking countries and could be understood as a kind of introduction to papers planned to follow.

Details

Journal of Documentation, vol. 80 no. 3
Type: Research Article
ISSN: 0022-0418

Keywords

Book part
Publication date: 7 February 2024

Clóvis Reis and Yanet María Reimondo Barrios

This chapter presents a comparative study of the trends and patterns of communication and tourism research in Brazil and the United States over the last 20 years. Through a…

Abstract

This chapter presents a comparative study of the trends and patterns of communication and tourism research in Brazil and the United States over the last 20 years. Through a bibliometric analysis of the CAPES and EBSCO databases, the study identifies the main theoretical and methodological references, classifies the fundamental themes in the area, and describes the role of communication for tourism. The results indicate the predominance in North American scientific literature of research related to the image and the brand of the tourist destinations, as well as the measurement and the evaluation of the communicative strategies. On the other hand, Brazilian research presents a greater diversity of approaches: destination image studies, tourism consumption, tourist narrative analysis, identities, social networks, community-based tourism, sports, and ecological tourism, with an explicit recognition of the dangers of sexual objectification and dehumanization within tourism. The survey showed that the scientific community has a strong interest in this area, signaling a search for knowledge to deepen the conceptual understanding of the subject. Thus, this chapter provides insights regarding the opportunities and directions for the next decades of research in this field of study.

Details

Creating Culture Through Media and Communication
Type: Book
ISBN: 978-1-80071-602-5

Keywords

Book part
Publication date: 20 March 2024

Reetika Dadheech and Dhiraj Sharma

Purpose: Preserving a country’s culture is crucial for its sustainability. Handicraft is a key draw for tourism destinations; it protects any civilisation’s indigenous knowledge…

Abstract

Purpose: Preserving a country’s culture is crucial for its sustainability. Handicraft is a key draw for tourism destinations; it protects any civilisation’s indigenous knowledge and culture by managing the historical, economic, and ecological ecosystems and perfectly aligns with sustainable development. It has a significant role in creating employment, especially in rural regions and is an essential contributor to the export economy, mainly in developing nations. The study focuses on the skills required and existing gaps in the handicraft industry, its development and prospects by considering women and their role in preserving and embodying the traditional art of making handicrafts.

Approach: A framework has been developed for mapping and analysing the skills required in the handicraft sector using econometric modelling; an enormous number of skills have been crowdsourced from the respondents, and machine learning techniques have been used.

Findings: The findings of the study revealed that employment in this area is dependent not only on general or specialised skills but also on complex matrix skills ranging from punctuality to working in unclean and unsafe environments, along with a set of personal qualities, such as taking initiatives and specific skills, for example polishing and colour coding.

Implications: The skills mapping technique utilised in this study is applicable globally, particularly for women indulged in casual work in developing nations’ handicrafts industry. The sustainable development goals, tourism, and handicrafts are all interconnected. The research includes understanding skills mapping, which provides insights into efficient job matching by incorporating preferences and studying the demand side of casual working by women in the handicraft sector from a skills perspective.

Details

Contemporary Challenges in Social Science Management: Skills Gaps and Shortages in the Labour Market
Type: Book
ISBN: 978-1-83753-165-3

Keywords

1 – 10 of 46