Search results

1 – 5 of 5
Open Access
Article
Publication date: 31 July 2023

Sara Lafia, David A. Bleckley and J. Trent Alexander

Many libraries and archives maintain collections of research documents, such as administrative records, with paper-based formats that limit the documents' access to in-person use…

Abstract

Purpose

Many libraries and archives maintain collections of research documents, such as administrative records, with paper-based formats that limit the documents' access to in-person use. Digitization transforms paper-based collections into more accessible and analyzable formats. As collections are digitized, there is an opportunity to incorporate deep learning techniques, such as Document Image Analysis (DIA), into workflows to increase the usability of information extracted from archival documents. This paper describes the authors' approach using digital scanning, optical character recognition (OCR) and deep learning to create a digital archive of administrative records related to the mortgage guarantee program of the Servicemen's Readjustment Act of 1944, also known as the G.I. Bill.

Design/methodology/approach

The authors used a collection of 25,744 semi-structured paper-based records from the administration of G.I. Bill Mortgages from 1946 to 1954 to develop a digitization and processing workflow. These records include the name and city of the mortgagor, the amount of the mortgage, the location of the Reconstruction Finance Corporation agent, one or more identification numbers and the name and location of the bank handling the loan. The authors extracted structured information from these scanned historical records in order to create a tabular data file and link them to other authoritative individual-level data sources.

Findings

The authors compared the flexible character accuracy of five OCR methods. The authors then compared the character error rate (CER) of three text extraction approaches (regular expressions, DIA and named entity recognition (NER)). The authors were able to obtain the highest quality structured text output using DIA with the Layout Parser toolkit by post-processing with regular expressions. Through this project, the authors demonstrate how DIA can improve the digitization of administrative records to automatically produce a structured data resource for researchers and the public.

Originality/value

The authors' workflow is readily transferable to other archival digitization projects. Through the use of digital scanning, OCR and DIA processes, the authors created the first digital microdata file of administrative records related to the G.I. Bill mortgage guarantee program available to researchers and the general public. These records offer research insights into the lives of veterans who benefited from loans, the impacts on the communities built by the loans and the institutions that implemented them.

Details

Journal of Documentation, vol. 79 no. 7
Type: Research Article
ISSN: 0022-0418

Keywords

Open Access
Article
Publication date: 13 October 2023

Stefano Francesco Musso and Giovanna Franco

This article sets out to show how principles and questions about method that underlie a way of interpreting the discipline of conservation and restoration can find results in…

Abstract

Purpose

This article sets out to show how principles and questions about method that underlie a way of interpreting the discipline of conservation and restoration can find results in research and studies, aiming at achieving even conscious reuse process. The occasion is the very recent research performed on the former Church of Saints Gerolamo and Francesco Saverio in Genoa, Italy, the Jesuit church annexed to the 17th-century College of the order. It is a small Baroque jewel in the heart of the ancient city, former University Library and actually abandoned, forgotten for years, inaccessible and awaiting a new use.

Design/methodology/approach

The two-year work carried out on the monumental building was conducted according to a study and research methodology developed and refined over the years within the activities of the School of Specialisation in Architectural Heritage and Landscape of the University of Genoa. It is a multidisciplinary and rigorous approach, which aims to train high-level professionals, up-to-date and aware of the multiple problems that interventions on existing buildings, especially of a monumental nature, involve.

Findings

The biennal study has been carried out within the activities of the Post-Graduate Programme in Architectural Heritage and Landscape of the University of Genoa. The work methodology faces the challenges of the contemporary complexity, raised by the progressive broadening of the concept of cultural “heritage” and by the problems of its conservation, its active safeguard and its reuse: safety in respect of seismic risk, fire and hydro geological instability, universal accessibility – cognitive, physical and alternative – resource efficiency, comfort and savings in energy consumption, sustainability, communication and involvement of local communities and stakeholders.

Originality/value

The goals of the work were the following: understanding of the architectural heritage, through the correlated study of its geometries, elements and construction materials, surfaces, structures, spaces and functions; understanding of the transformations that the building has undergone over time, relating the results of historical reconstructions from indirect sources and those of direct archaeological analysis; assessment of the state of conservation of the building recognising phenomena of deterioration, damage, faults and deficits that affect materials, construction elements, systems and structures; identification of the causes and extent of damage, faults and deficits, assessing the vulnerability and level of exposure of the asset to the aggression of environmental factors and related risks; evaluation of the compatibility between the characteristics of the available spaces, the primary needs of conservation, the instance of regeneration and possible new uses; the definition of criteria and guidelines for establishing the planning of conservation, restoration and redevelopment interventions.

Details

Journal of Cultural Heritage Management and Sustainable Development, vol. ahead-of-print no. ahead-of-print
Type: Research Article
ISSN: 2044-1266

Keywords

Open Access
Book part
Publication date: 14 October 2021

Lisa Sugiura

Abstract

Details

The Incel Rebellion: The Rise of the Manosphere and the Virtual War Against Women
Type: Book
ISBN: 978-1-83982-257-5

Open Access
Article
Publication date: 8 June 2022

Federico Brunetti, Angelo Bonfanti, Andrea Chiarini and Virginia Vannucci

This paper explores how digitalization affects the academic research publication process by taking into account the perspective of management scholars. It provides an overview of…

2674

Abstract

Purpose

This paper explores how digitalization affects the academic research publication process by taking into account the perspective of management scholars. It provides an overview of the digital professional services dedicated to academic research, and investigates academics' awareness of, the impact on the publication process of, and scholars' expectations regarding digital services and software.

Design/methodology/approach

This explorative study adopted a qualitative approach by performing direct observations of websites regarding digital professional research services and in-depth interviews with national and international management scholars.

Findings

The multiple digital professional services dedicated to academic research enable authors to develop a scientific paper independently or with the support of professionals. The scholars' awareness regarding the digital services and software was limited, because of both the plethora of options on the market and the frequent use of the same digital tools over time. In impact terms, these tools enable scholars to improve research quality and to increase productivity. However, the negative effects led scholars to express different expectations about how they can be improved and what difficulties should be overcome to favor the publication process.

Practical implications

The results of this study provide suggestions both for scholars who engage in academic research and digital services and software providers.

Originality/value

To the best of the authors' knowledge, this is the first study to examine the ongoing development of digitalization in support of the research publication process from the perspective of academics.

Open Access
Article
Publication date: 28 November 2017

Mansoor Alghamdi and William Teahan

The aim of this paper is to experimentally evaluate the effectiveness of the state-of-the-art printed Arabic text recognition systems to determine open areas for future…

6582

Abstract

Purpose

The aim of this paper is to experimentally evaluate the effectiveness of the state-of-the-art printed Arabic text recognition systems to determine open areas for future improvements. In addition, this paper proposes a standard protocol with a set of metrics for measuring the effectiveness of Arabic optical character recognition (OCR) systems to assist researchers in comparing different Arabic OCR approaches.

Design/methodology/approach

This paper describes an experiment to automatically evaluate four well-known Arabic OCR systems using a set of performance metrics. The evaluation experiment is conducted on a publicly available printed Arabic dataset comprising 240 text images with a variety of resolution levels, font types, font styles and font sizes.

Findings

The experimental results show that the field of character recognition for printed Arabic still requires further research to reach an efficient text recognition method for Arabic script.

Originality/value

To the best of the authors’ knowledge, this is the first work that provides a comprehensive automated evaluation of Arabic OCR systems with respect to the characteristics of Arabic script and, in addition, proposes an evaluation methodology that can be used as a benchmark by researchers and therefore will contribute significantly to the enhancement of the field of Arabic script recognition.

Details

PSU Research Review, vol. 1 no. 3
Type: Research Article
ISSN: 2399-1747

Keywords

1 – 5 of 5