Optical character recognition (OCR) using partial least square (PLS) based feature reduction: an application to artificial intelligence for biometric identification
Journal of Enterprise Information Management
ISSN: 1741-0398
Article publication date: 31 July 2020
Issue publication date: 24 April 2023
Abstract
Purpose
In artificial intelligence, the optical character recognition (OCR) is an active research area based on famous applications such as automation and transformation of printed documents into machine-readable text document. The major purpose of OCR in academia and banks is to achieve a significant performance to save storage space.
Design/methodology/approach
A novel technique is proposed for automated OCR based on multi-properties features fusion and selection. The features are fused using serially formulation and output passed to partial least square (PLS) based selection method. The selection is done based on the entropy fitness function. The final features are classified by an ensemble classifier.
Findings
The presented method was extensively tested on two datasets such as the authors proposed and Chars74k benchmark and achieved an accuracy of 91.2 and 99.9%. Comparing the results with existing techniques, it is found that the proposed method gives improved performance.
Originality/value
The technique presented in this work will help for license plate recognition and text conversion from a printed document to machine-readable.
Keywords
Acknowledgements
This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2020-2016-0-00312) supervised by the IITP (Institute for Information & communications Technology Planning & Evaluation). Zainab Akhtar and Jong Weon Lee are the co-first authors and contributed significantly in this article.
Citation
Akhtar, Z., Lee, J.W., Attique Khan, M., Sharif, M., Ali Khan, S. and Riaz, N. (2023), "Optical character recognition (OCR) using partial least square (PLS) based feature reduction: an application to artificial intelligence for biometric identification", Journal of Enterprise Information Management, Vol. 36 No. 3, pp. 767-789. https://doi.org/10.1108/JEIM-02-2020-0076
Publisher
:Emerald Publishing Limited
Copyright © 2020, Emerald Publishing Limited