Search results

1 – 2 of 2
Article
Publication date: 3 September 2024

Marco Humbel, Julianne Nyhan, Nina Pearlman, Andreas Vlachidis, JD Hill and Andrew Flinn

This paper aims to explore the accelerations and constraints libraries, archives, museums and heritage organisations (“collections-holding organisations”) face in their role as…

Abstract

Purpose

This paper aims to explore the accelerations and constraints libraries, archives, museums and heritage organisations (“collections-holding organisations”) face in their role as collection data providers for digital infrastructures. To date, digital infrastructures operate within the cultural heritage domain typically as data aggregation platforms, such as Europeana or Art UK.

Design/methodology/approach

Semi-structured interviews with 18 individuals in 8 UK collections-holding organisations and 2 international aggregators.

Findings

Discussions about digital infrastructure development often lay great emphasis on questions and problems that are technical and legal in nature. As important as technical and legal matters are, more latent, yet potent challenges exist too. Though less discussed in the literature, collections-holding organisations' capacity to participate in digital infrastructures is dependent on a complex interplay of funding allocation across the sector, divergent traditions of collection description and disciplinaries’ idiosyncrasies. Accordingly, we call for better social-cultural and trans-sectoral (collections-holding organisations, universities and technological providers) understandings of collection data infrastructure development.

Research limitations/implications

The authors recommend developing more understanding of the social-cultural aspects (e.g. disciplinary conventions) and their impact on collection data dissemination. More studies on the impact and opportunities of unified collections for different audiences and collections-holding organisations themselves are required too.

Practical implications

Sustainable financial investment across the heritage sector is required to address the discrepancies between different organisation types in their capacity to deliver collection data. Smaller organisations play a vital role in diversifying the (digital) historical canon, but they often struggle to digitise collections and bring catalogues online in the first place. In addition, investment in existing infrastructures for collection data dissemination and unification is necessary, instead of creating new platforms, with various levels of uptake and longevity. Ongoing investments in collections curation and high-quality cataloguing are prerequisites for a sustainable heritage sector and collection data infrastructures. Investments in the sustainability of infrastructures are not a replacement for research and vice versa.

Social implications

The authors recommend establishing networks where collections-holding organisations, technology providers and users can communicate their experiences and needs in an ongoing way and influence policy.

Originality/value

To date, the research focus on developing collection data infrastructures has tended to be on the drive to adopt specific technological solutions and copyright licensing practices. This paper offers a critical and holistic analysis of the dispersed experience of collections-holding organisations in their role as data providers for digital infrastructures. The paper contributes to the emerging understanding of the latent factors that make infrastructural endeavours in the heritage sector complex undertakings.

Article
Publication date: 7 June 2021

Marco Humbel, Julianne Nyhan, Andreas Vlachidis, Kim Sloan and Alexandra Ortolja-Baird

By mapping-out the capabilities, challenges and limitations of named-entity recognition (NER), this article aims to synthesise the state of the art of NER in the context of the…

Abstract

Purpose

By mapping-out the capabilities, challenges and limitations of named-entity recognition (NER), this article aims to synthesise the state of the art of NER in the context of the early modern research field and to inform discussions about the kind of resources, methods and directions that may be pursued to enrich the application of the technique going forward.

Design/methodology/approach

Through an extensive literature review, this article maps out the current capabilities, challenges and limitations of NER and establishes the state of the art of the technique in the context of the early modern, digitally augmented research field. It also presents a new case study of NER research undertaken by Enlightenment Architectures: Sir Hans Sloane's Catalogues of his Collections (2016–2021), a Leverhulme funded research project and collaboration between the British Museum and University College London, with contributing expertise from the British Library and the Natural History Museum.

Findings

Currently, it is not possible to benchmark the capabilities of NER as applied to documents of the early modern period. The authors also draw attention to the situated nature of authority files, and current conceptualisations of NER, leading them to the conclusion that more robust reporting and critical analysis of NER approaches and findings is required.

Research limitations/implications

This article examines NER as applied to early modern textual sources, which are mostly studied by Humanists. As addressed in this article, detailed reporting of NER processes and outcomes is not necessarily valued by the disciplines of the Humanities, with the result that it can be difficult to locate relevant data and metrics in project outputs. The authors have tried to mitigate this by contacting projects discussed in this paper directly, to further verify the details they report here.

Practical implications

The authors suggest that a forum is needed where tools are evaluated according to community standards. Within the wider NER community, the MUC and ConLL corpora are used for such experimental set-ups and are accompanied by a conference series, and may be seen as a useful model for this. The ultimate nature of such a forum must be discussed with the whole research community of the early modern domain.

Social implications

NER is an algorithmic intervention that transforms data according to certain rules-, patterns- or training data and ultimately affects how the authors interpret the results. The creation, use and promotion of algorithmic technologies like NER is not a neutral process, and neither is their output A more critical understanding of the role and impact of NER on early modern documents and research and focalization of some of the data- and human-centric aspects of NER routines that are currently overlooked are called for in this paper.

Originality/value

This article presents a state of the art snapshot of NER, its applications and potential, in the context of early modern research. It also seeks to inform discussions about the kinds of resources, methods and directions that may be pursued to enrich the application of NER going forward. It draws attention to the situated nature of authority files, and current conceptualisations of NER, and concludes that more robust reporting of NER approaches and findings are urgently required. The Appendix sets out a comprehensive summary of digital tools and resources surveyed in this article.

Details

Journal of Documentation, vol. 77 no. 6
Type: Research Article
ISSN: 0022-0418

Keywords

1 – 2 of 2