To read this content please select one of the options below:

Optical character recognition (OCR) using partial least square (PLS) based feature reduction: an application to artificial intelligence for biometric identification

Zainab Akhtar (COMSATS University Islamabad, Wah Campus, Islamabad, Pakistan)
Jong Weon Lee (Department of Software, Sejong University, Seoul, South Korea)
Muhammad Attique Khan (Department of Computer Science, HITEC University, Taxila, Pakistan)
Muhammad Sharif (COMSATS University Islamabad, Wah Campus, Islamabad, Pakistan)
Sajid Ali Khan (Foundation University Islamabad, Islamabad, Pakistan)
Naveed Riaz (SEECS, National University of Sciences and Technology, Islamabad, Pakistan)

Journal of Enterprise Information Management

ISSN: 1741-0398

Article publication date: 31 July 2020

Issue publication date: 24 April 2023

550

Abstract

Purpose

In artificial intelligence, the optical character recognition (OCR) is an active research area based on famous applications such as automation and transformation of printed documents into machine-readable text document. The major purpose of OCR in academia and banks is to achieve a significant performance to save storage space.

Design/methodology/approach

A novel technique is proposed for automated OCR based on multi-properties features fusion and selection. The features are fused using serially formulation and output passed to partial least square (PLS) based selection method. The selection is done based on the entropy fitness function. The final features are classified by an ensemble classifier.

Findings

The presented method was extensively tested on two datasets such as the authors proposed and Chars74k benchmark and achieved an accuracy of 91.2 and 99.9%. Comparing the results with existing techniques, it is found that the proposed method gives improved performance.

Originality/value

The technique presented in this work will help for license plate recognition and text conversion from a printed document to machine-readable.

Keywords

Acknowledgements

This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2020-2016-0-00312) supervised by the IITP (Institute for Information & communications Technology Planning & Evaluation). Zainab Akhtar and Jong Weon Lee are the co-first authors and contributed significantly in this article.

Citation

Akhtar, Z., Lee, J.W., Attique Khan, M., Sharif, M., Ali Khan, S. and Riaz, N. (2023), "Optical character recognition (OCR) using partial least square (PLS) based feature reduction: an application to artificial intelligence for biometric identification", Journal of Enterprise Information Management, Vol. 36 No. 3, pp. 767-789. https://doi.org/10.1108/JEIM-02-2020-0076

Publisher

:

Emerald Publishing Limited

Copyright © 2020, Emerald Publishing Limited

Related articles