To read this content please select one of the options below:

Improving ASR performance using context‐dependent phoneme models

Husniza Husni (UUM College of Arts and Sciences, Universiti Utara Malaysia, Sintok, Malaysia)
Zulikha Jamaludin (UUM College of Arts and Sciences, Universiti Utara Malaysia, Sintok, Malaysia)

Journal of Systems and Information Technology

ISSN: 1328-7265

Article publication date: 2 February 2010

445

Abstract

Purpose

The purpose of this paper is to present evidence of the need to have a carefully designed lexical model for speech recognition for dyslexic children reading in Bahasa Melayu (BM).

Design/methodology/approach

Data collection is performed to obtain the most frequent reading error patterns and the reading recordings. Design and development of the lexical model considers the errors for better recognition accuracy.

Findings

It is found that the recognition accuracy is increased to 75 percent when using context‐dependent (CD) phoneme model and phoneme refinement rule. Comparison between context‐independent phoneme models and CD phoneme model is also presented.

Research limitations/implications

The most frequent errors recognized and obtained from data collection and analysis illustrate and support that phonological deficit is the major factor for reading disabilities in dyslexics.

Practical implications

This paper provides the first step towards materializing an automated speech recognition (ASR)‐based application to support reading for BM, which is the first language in Malaysia.

Originality/value

The paper contributes to the knowledge of the most frequent error patterns for dyslexic children's reading in BM and to the knowledge that a CD phoneme model together with the phoneme refinement rule can built up a more fine‐tuned lexical model for an ASR specifically for dyslexic children's reading isolated words in BM.

Keywords

Citation

Husni, H. and Jamaludin, Z. (2010), "Improving ASR performance using context‐dependent phoneme models", Journal of Systems and Information Technology, Vol. 12 No. 1, pp. 56-69. https://doi.org/10.1108/13287261011032652

Publisher

:

Emerald Group Publishing Limited

Copyright © 2010, Emerald Group Publishing Limited

Related articles