A speech and character combined recognition engine for mobile devices

Soo‐Young Suk (Information Technology Research Institute, AIST, Tsukuba, Japan)
Hyun‐Yeol Chung (School of EECS, Yeungnam University, Gyung‐San, Republic of Korea)

International Journal of Pervasive Computing and Communications

ISSN: 1742-7371

Article publication date: 27 June 2008

Abstract

Purpose

The purpose of this paper is to describe a speech and character combined recognition engine (SCCRE) developed to run on personal digital assistants (PDAs) and other mobile devices. The architecture of a distributed recognition system that provides a more convenient user interface is also discussed.

Design/methodology/approach

In SCCRE, feature extraction for speech and for characters is carried out separately, but recognition is performed in a single engine. The client recognition engine employs a continuous hidden Markov model (CHMM) structure with a variable parameter topology, in order to minimize the number of model parameters and to reduce recognition time. The model also adopts the proposed successive state and mixture splitting (SSMS) method for generating context-independent models. SSMS optimizes the number of mixtures by splitting in the mixture domain and the number of states by splitting in the time domain.
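The abstract does not give the SSMS splitting rule, so the following is only a minimal sketch of what a single split in the mixture domain could look like, assuming a common heuristic from HMM training: the heaviest Gaussian of a state is replaced by two children with half its weight and means shifted in opposite directions. The names `Gaussian` and `split_heaviest` and the offset `eps` are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Gaussian:
    weight: float
    mean: List[float]
    var: List[float]  # diagonal covariance (assumed for this sketch)


def split_heaviest(mixture: List[Gaussian], eps: float = 0.2) -> List[Gaussian]:
    """Return a new mixture in which the heaviest component is split in two.

    Each child keeps the parent's variances, takes half of its weight, and
    has its mean moved by +eps or -eps standard deviations in every dimension.
    This is a generic heuristic, not the criterion used by SSMS.
    """
    idx = max(range(len(mixture)), key=lambda i: mixture[i].weight)
    parent = mixture[idx]
    offsets = [eps * v ** 0.5 for v in parent.var]
    children = [
        Gaussian(
            weight=parent.weight / 2,
            mean=[m + sign * o for m, o in zip(parent.mean, offsets)],
            var=list(parent.var),
        )
        for sign in (1, -1)
    ]
    return mixture[:idx] + children + mixture[idx + 1:]


# One state that starts with a single two-dimensional Gaussian.
state = [Gaussian(1.0, [0.0, 2.0], [1.0, 4.0])]
state = split_heaviest(state)
```

In a full training loop each split would normally be followed by re-estimation of the model, and splitting would stop once the chosen criterion no longer improves; a split in the time domain would duplicate a state rather than a mixture component. Both steps are outside the scope of this sketch.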

Findings

The recognition results show that, when applied to speech recognition for mobile devices, the developed engine can reduce the total number of Gaussians by up to 40 per cent compared with fixed parameter models at the same recognition performance. The results also show that SSMS can reduce the memory needed for the models to 65 per cent and the memory needed for processing to 82 per cent. Moreover, the recognition time decreases by 17 per cent with the SSMS model while the recognition rate is maintained.

Originality/value

The proposed system will be very useful for many on‐line multimodal interfaces such as PDAs and mobile applications.

Citation

Suk, S. and Chung, H. (2008), "A speech and character combined recognition engine for mobile devices", International Journal of Pervasive Computing and Communications, Vol. 4 No. 2, pp. 232-249. https://doi.org/10.1108/17427370810890409

Publisher

Emerald Group Publishing Limited

Copyright © 2008, Emerald Group Publishing Limited