Search results

1 – 10 of 27
Article
Publication date: 27 February 2023

Dilawar Ali, Kenzo Milleville, Steven Verstockt, Nico Van de Weghe, Sally Chambers and Julie M. Birkholz

Abstract

Purpose

Historical newspaper collections provide a wealth of information about the past. Although the digitization of these collections significantly improves their accessibility, a large portion of digitized historical newspaper collections, such as those of KBR, the Royal Library of Belgium, are not yet searchable at article-level. However, recent developments in AI-based research methods, such as document layout analysis, have the potential for further enriching the metadata to improve the searchability of these historical newspaper collections. This paper aims to discuss the aforementioned issue.

Design/methodology/approach

In this paper, the authors explore how existing computer vision and machine learning approaches can be used to improve access to digitized historical newspapers. To do this, the authors propose a workflow, using computer vision and machine learning approaches to (1) provide article-level access to digitized historical newspaper collections using document layout analysis, (2) extract specific types of articles (e.g. feuilletons – literary supplements from Le Peuple from 1938), (3) conduct image similarity analysis using (un)supervised classification methods and (4) perform named entity recognition (NER) to link the extracted information to open data.
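
Step (3) of the workflow, image similarity analysis, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each extracted image has already been reduced to a numeric feature vector (the vectors, image identifiers and the 0.9 threshold below are hypothetical), and it uses a simple greedy grouping by cosine similarity in place of whatever (un)supervised method the paper applies.

```python
from math import sqrt


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def group_similar(features, threshold=0.9):
    """Greedy grouping: an image joins the first group whose first member
    (the group's representative) is at least `threshold` similar to it;
    otherwise it starts a new group."""
    groups = []
    for image_id, vector in features.items():
        for group in groups:
            representative = features[group[0]]
            if cosine_similarity(vector, representative) >= threshold:
                group.append(image_id)
                break
        else:
            groups.append([image_id])
    return groups


# Hypothetical feature vectors, invented for illustration only.
features = {
    "page1_img1": [0.90, 0.10, 0.00],
    "page2_img3": [0.88, 0.12, 0.01],
    "page5_img2": [0.00, 0.20, 0.95],
}
groups = group_similar(features)
```

With these invented vectors, the first two images fall into one group and the third forms its own. A real pipeline would more likely derive the vectors from a pretrained vision model and use a proper clustering algorithm; the greedy pass is chosen here only because it is short and easy to follow.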

Findings

The results show that the proposed workflow improves the accessibility and searchability of digitized historical newspapers, and also contributes to the building of corpora for digital humanities research. The AI-based methods enable automatic extraction of feuilletons, clustering of similar images and dynamic linking of related articles.

Originality/value

The proposed workflow enables automatic extraction of articles, including detection of a specific type of article, such as a feuilleton or literary supplement. This is particularly valuable for humanities researchers as it improves the searchability of these collections and enables corpora to be built around specific themes. Article-level access to, and improved searchability of, KBR's digitized newspapers are demonstrated through the online tool (https://tw06v072.ugent.be/kbr/).

Article
Publication date: 10 January 2024

Mario Gonzalez-Fuentes, Jonathan Ross Gilbert, Robert F. Scherer and Carlos Iglesias-Fernandez

Abstract

Purpose

A pronounced rise in postpandemic immigration is creating consumption opportunities and challenges for countries worldwide. Past research has shown that immigrant homeownership indicates advanced consumer acculturation. However, critical factors which differentiate immigrant decisions to purchase a home remain underexplored. This study aims to examine the importance of different identity resources in determining homeownership gaps between immigrant groups in Spain during a dynamic decade.

Design/methodology/approach

A mixed-methods research design with triangulation was used. First, the critical “historical research method” was used to empirically assess 15,465 household-level microdata files from the National Immigrant Survey of Spain. Second, the analysis was corroborated through informant interviews, an evaluation of digital news archives and other historical traces, such as relevant advertisements in Spain from 2000 to 2009.

Findings

Results provided an account of immigrant homeownership whereby foreign-born consumers leveraged resources to promote social identities aligned with an advanced level of acculturation through housing investment during this period. Furthermore, marketing focused on specific targets of ethnic minority consumers coupled with government policies to promote immigrant homeownership reinforced the “Spanish Dream” as a new paradigm for housing market integration.

Originality/value

Spain provides an unprecedented historical context to explain marketing-related phenomena due to a perfect storm of immigration, job availability and integration supports. Contrary to popular wisdom, immigrant consumer homeownership gaps are not solely a result of differences in income and economic mobility, but rather an advanced acculturation outcome driven by personal and social investments in resources that lead to consumer identities.

Details

Journal of Historical Research in Marketing, vol. 16 no. 3
Type: Research Article
ISSN: 1755-750X

Content available

Details

Journal of Documentation, vol. 80 no. 5
Type: Research Article
ISSN: 0022-0418

Article
Publication date: 9 April 2024

Pia Borlund, Nils Pharo and Ying-Hsang Liu

Abstract

Purpose

The PICCH research project contributes to opening a dialogue between cultural heritage archives and users. Hence, the users are identified and their information needs, the search strategies they apply and the search challenges they experience are uncovered.

Design/methodology/approach

A combination of questionnaires and interviews was used to collect data. Questionnaire data were collected from users of three different audiovisual archives. Semi-structured interviews were conducted with two user groups: (1) scholars searching for information for research projects and (2) archivists who perform their own scholarly work and search for information on behalf of others.

Findings

The questionnaire results show that the archive users mainly have an academic background. Hence, scholars and archivists constitute the target group for in-depth interviews. The interviews reveal that their information needs are multi-faceted and match the information need typology by Ingwersen. The scholars mainly apply collection-specific search strategies but share a primary reliance on keyword searching, which they typically plan in advance. The archivists do less planning owing to their knowledge of the collections. All interviewees demonstrate domain knowledge, archival intelligence and artefactual literacy in their use and mastery of the archives. The search challenges they experience can be characterised as search system complexity challenges, material challenges and metadata challenges.

Originality/value

The paper provides a rare insight into the complexity of the search situation of cultural heritage archives and the users’ multi-faceted information needs, and hence contributes to the dialogue between the archives and the users.

Details

Journal of Documentation, vol. 80 no. 4
Type: Research Article
ISSN: 0022-0418

Article
Publication date: 1 August 2024

Mingxia Jia, Yuxiang Chris Zhao, Xiaoyu Zhang and Dawei Wu

Abstract

Purpose

In the era of digital intelligence, individuals are increasingly interacting with digital information in their daily lives and work, and a growing phenomenon known as digital hoarding is becoming more prevalent. Prior research suggests that humanities researchers have unique and longstanding information interaction and management practices in the digital scholarship context. This study therefore aims to understand how digital hoarding manifests in humanities researchers’ behavior, identify the influencing factors associated with it, and explore how they perceive and respond to digital hoarding behavior.

Design/methodology/approach

Qualitative research methods enable us to acquire a rich insight and nuanced understanding of digital hoarding practices. In this study, semi-structured interviews were conducted with 20 humanities researchers who were pre-screened for a high propensity for digital hoarding. Thematic analyses were then used to analyze the interview data.

Findings

Three main characteristics of digital hoarding were identified. Further, the research paradigm, digital affordance, and personality traits and habits, collectively influencing the emergence and development of digital hoarding behaviors, were examined. The subtle influence of traditional Chinese culture was encountered. Interestingly, this study found that humanists perceive digital hoarding as a positive expectation (associated with inspiration, aesthetic pursuit, and uncertainty avoidance). Meanwhile, humanists' problematic perception of this behavior is more widely observed — they experience what we conceptualize as an “expectation-perception” gap. Three specific information behaviors related to avoidance were identified as aggravating factors for digital hoarding.

Originality/value

The findings deepen the understanding of digital hoarding behaviors and personal information management among humanities researchers within the LIS field, and implications for humanities researchers, digital scholarship service providers, and digital tool developers are discussed.

Details

Journal of Documentation, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 0022-0418

Open Access
Article
Publication date: 18 April 2024

Joseph Nockels, Paul Gooding and Melissa Terras

Abstract

Purpose

This paper focuses on image-to-text manuscript processing through Handwritten Text Recognition (HTR), a Machine Learning (ML) approach enabled by Artificial Intelligence (AI). With HTR now achieving high levels of accuracy, we consider its potential impact on our near-future information environment and knowledge of the past.

Design/methodology/approach

In undertaking a more constructivist analysis, we identified gaps in the current literature through a Grounded Theory Method (GTM). This guided an iterative process of concept mapping through writing sprints in workshop settings. We identified, explored and confirmed themes through group discussion and a further interrogation of relevant literature, until reaching saturation.

Findings

Catalogued as part of our GTM, 120 published texts underpin this paper. We found that HTR facilitates accurate transcription and dataset cleaning, while facilitating access to a variety of historical material. HTR contributes to a virtuous cycle of dataset production and can inform the development of online cataloguing. However, current limitations include dependency on digitisation pipelines, potential archival history omission and entrenchment of bias. We also cite near-future HTR considerations. These include encouraging open access, integrating advanced AI processes and metadata extraction; legal and moral issues surrounding copyright and data ethics; crediting individuals’ transcription contributions and HTR’s environmental costs.

Originality/value

Our research produces a set of best practice recommendations for researchers, data providers and memory institutions, surrounding HTR use. This forms an initial, though not comprehensive, blueprint for directing future HTR research. In pursuing this, the narrative that HTR’s speed and efficiency will simply transform scholarship in archives is deconstructed.

Article
Publication date: 28 June 2024

Haihua Chen, Jeonghyun (Annie) Kim, Jiangping Chen and Aisa Sakata

Abstract

Purpose

This study aims to explore the applications of natural language processing (NLP) and data analytics in understanding large-scale digital collections in oral history archives.

Design/methodology/approach

NLP and data analytics were used to analyse the oral interview transcripts of 904 survivors of the Japanese American incarceration camps collected from Densho Digital Repository, relying specifically on descriptive analysis, keyword extraction, topic modelling and sentiment analysis (SA).
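
Of the techniques listed above, keyword extraction is the easiest to sketch. The abstract does not say which extraction method the researchers used, so the example below assumes a plain TF-IDF ranking as a stand-in. The three "transcripts" are made-up token strings for illustration, not text from the Densho Digital Repository, and the stopword list and function names are likewise hypothetical.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not a standard resource).
STOPWORDS = {"the", "a", "of", "and", "to", "in", "we", "was", "were", "our"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def tfidf_keywords(documents, top_n=3):
    """Return the top_n terms per document, ranked by TF-IDF.

    TF is the term's share of the document's tokens; IDF is
    log(number of documents / number of documents containing the term).
    Ties are broken alphabetically so the ranking is deterministic.
    """
    tokenized = [tokenize(d) for d in documents]
    n_docs = len(tokenized)
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    results = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty documents
        scores = {
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
        ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        results.append([term for term, _ in ranked[:top_n]])
    return results


# Made-up placeholder "transcripts"; not real interview data.
transcripts = [
    "camp barracks camp fence",
    "farm harvest farm market",
    "redress hearing redress rights camp",
]
keywords = tfidf_keywords(transcripts)
```

In the first placeholder document, "camp" is the most frequent term but is ranked below "barracks" and "fence" because it also occurs in another document. That down-weighting of corpus-wide terms is the reason for using TF-IDF rather than raw counts.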

Findings

The researchers found multiple geographic areas of large residential communities of ethnic Japanese people and the place names of the concentration camps. The keywords and topics extracted reflect the deplorable conditions and militaristic nature of the camps and the forced labour of the internees. When remembering history, the main focus for the narrators remains the redress and reparation movement to obtain the restitution of their civil rights. SA further found that the forcible removal and incarceration of Japanese Americans during the Second World War negatively impacted and brought deep trauma to the narrators.

Originality/value

This case study demonstrated how NLP and data analytics could be applied to analyse oral history archives and open avenues for discovery. Archival researchers and the general public may benefit from this type of analysis in making connections between temporal, spatial and emotional elements, which will contribute to a holistic understanding of individuals and communities in terms of their collective memory.

Details

The Electronic Library, vol. 42 no. 4
Type: Research Article
ISSN: 0264-0473

Article
Publication date: 3 September 2024

Marco Humbel, Julianne Nyhan, Nina Pearlman, Andreas Vlachidis, JD Hill and Andrew Flinn

Abstract

Purpose

This paper aims to explore the accelerations and constraints libraries, archives, museums and heritage organisations (“collections-holding organisations”) face in their role as collection data providers for digital infrastructures. To date, digital infrastructures operate within the cultural heritage domain typically as data aggregation platforms, such as Europeana or Art UK.

Design/methodology/approach

Semi-structured interviews were conducted with 18 individuals in 8 UK collections-holding organisations and 2 international aggregators.

Findings

Discussions about digital infrastructure development often lay great emphasis on questions and problems that are technical and legal in nature. As important as technical and legal matters are, more latent yet potent challenges exist too. Though less discussed in the literature, collections-holding organisations' capacity to participate in digital infrastructures is dependent on a complex interplay of funding allocation across the sector, divergent traditions of collection description and disciplinary idiosyncrasies. Accordingly, we call for better social-cultural and trans-sectoral (collections-holding organisations, universities and technological providers) understandings of collection data infrastructure development.

Research limitations/implications

The authors recommend developing more understanding of the social-cultural aspects (e.g. disciplinary conventions) and their impact on collection data dissemination. More studies on the impact and opportunities of unified collections for different audiences and collections-holding organisations themselves are required too.

Practical implications

Sustainable financial investment across the heritage sector is required to address the discrepancies between different organisation types in their capacity to deliver collection data. Smaller organisations play a vital role in diversifying the (digital) historical canon, but they often struggle to digitise collections and bring catalogues online in the first place. In addition, investment in existing infrastructures for collection data dissemination and unification is necessary, instead of creating new platforms, with various levels of uptake and longevity. Ongoing investments in collections curation and high-quality cataloguing are prerequisites for a sustainable heritage sector and collection data infrastructures. Investments in the sustainability of infrastructures are not a replacement for research and vice versa.

Social implications

The authors recommend establishing networks where collections-holding organisations, technology providers and users can communicate their experiences and needs in an ongoing way and influence policy.

Originality/value

To date, the research focus on developing collection data infrastructures has tended to be on the drive to adopt specific technological solutions and copyright licensing practices. This paper offers a critical and holistic analysis of the dispersed experience of collections-holding organisations in their role as data providers for digital infrastructures. The paper contributes to the emerging understanding of the latent factors that make infrastructural endeavours in the heritage sector complex undertakings.

Article
Publication date: 2 August 2023

Atika Ahmad Kemal and Mahmood Hussain Shah

Abstract

Purpose

While the potential for digital innovation (DI) to transform organizational practices is widely acknowledged in the information systems (IS) literature, there is very limited understanding of the socio-political nature of institutional interactions that determine DI and affect organizational practices in social cash organizations. Drawing on the neo-institutionalist vision, the purpose of the study is to examine the unique set of institutional exchanges that influence the transition to digital social cash payments that give rise to new institutional arrangements in social cash organizations.

Design/methodology/approach

The paper draws on an in-depth case study of a government social cash organization in Pakistan. Qualitative data were collected using 30 semi-structured interviews from key organizational members and stakeholders.

Findings

The results suggest that DI is determined by the novel intersections between the coercive (techno-economic, regulatory), normative (socio-organizational), mimetic (international) and covert power (political) forces. Hence, DI is not a technologically deterministic output, but rather a complex socio-political process enacted through dialogue, negotiation and conflict between institutional actors. Technology is socially embedded through the process of institutionalization that is coupled with the deinstitutionalization of established organizational practices for progressive transformation.

Research limitations/implications

The research has implications for government social cash organizations, especially in the Global South. Empirically, the authors gained rare access to, and support from, a government-backed social cash organization in Pakistan (an understudied country in the Global South), which made the data and the consequent analyses invaluable. This made the empirical contribution within this geographical setting even more worthwhile, since this case study has received little attention from indigenous scholars in the past. The empirical findings showcased a unique set of contextual factors that were subject to BISP and interpreted through an account of socio-cultural sensitivities.

Practical implications

The paper provides practical implications for policymakers and practitioners, emphasizing the need to address institutional challenges, including covert power, during the implementation of digitalization projects in the public sector. The paper has some potential for inspiring future e-government related (or public sector focused) studies. The paper may guide both private and government policymakers and practitioners by presenting how to overcome certain institutional challenges while planning and implementing large-scale multi-stakeholder digitization projects in similar country contexts. So while there is scope for linking the digitization of public sector organizations to anti-corruption measures in other Global South countries, this may be less straightforward where the private sector is involved.

Social implications

The paper offers rich social insights into the institutional interchanges that occur between the social actors for the innovation of technology. In particular, the paper highlights the socially embedded nature of technology that underpins the institutionalization of new organizational practices. These have implications for how DI is viewed as a socio-political process of change.

Originality/value

This study contributes to neo-institutional theory by theorizing covert power as a political force that complements the neo-institutional framework. This force is subtle but also resistive for some political actors as the force shifts the equilibrium of power between different institutional actors. Furthermore, the paper presents the social and practical implications that guide policymakers and practitioners by taking into consideration the unique institutional challenges, such as covert power, while implementing large scale digital projects in the social cash sector.

Article
Publication date: 1 March 2024

Kai Naumann and Andreas Neuburger

Abstract

Purpose

Starting from the status quo, the paper outlines perspectives and challenges for the connection and interlinking of digitised and digital archival data. The following topics are addressed: Where are fields of action and what are the means of archives? Which functional and technical requirements are to be considered, and what is the role of portal infrastructures linking together various different institutions?

Design/methodology/approach

Considering the needs of users and general framework conditions, the paper examines new approaches emerging in Germany. It outlines recent projects and considerations aiming to improve the services and visibility of archives within the national data infrastructure in Germany.

Findings

Cross-connections are not a new phenomenon, but their appearance changes significantly in a digital context. In this respect, both smaller and larger archives profit from participation in larger digital networks. Furthermore, archives need to review the quality of their digital (meta)data regularly and to offer or join systems that functionally and technically support cross-connection and interlinking of data.

Originality/value

The paper endeavours to show the importance of digital cross-connections and the role of portal infrastructures for visibility, online-distribution and use of digital archival metadata and data.

Details

Journal of Documentation, vol. 80 no. 5
Type: Research Article
ISSN: 0022-0418
