Improving ASR performance using context‐dependent phoneme models

Husniza Husni (UUM College of Arts and Sciences, Universiti Utara Malaysia, Sintok, Malaysia)
Zulikha Jamaludin (UUM College of Arts and Sciences, Universiti Utara Malaysia, Sintok, Malaysia)

Journal of Systems and Information Technology

ISSN: 1328-7265

Publication date: 2 February 2010

Abstract

Purpose

The purpose of this paper is to present evidence of the need to have a carefully designed lexical model for speech recognition for dyslexic children reading in Bahasa Melayu (BM).

Design/methodology/approach

Data collection is performed to obtain the most frequent reading error patterns and the reading recordings. Design and development of the lexical model considers the errors for better recognition accuracy.

Findings

It is found that the recognition accuracy is increased to 75 percent when using context‐dependent (CD) phoneme model and phoneme refinement rule. Comparison between context‐independent phoneme models and CD phoneme model is also presented.

Research limitations/implications

The most frequent errors recognized and obtained from data collection and analysis illustrate and support that phonological deficit is the major factor for reading disabilities in dyslexics.

Practical implications

This paper provides the first step towards materializing an automated speech recognition (ASR)‐based application to support reading for BM, which is the first language in Malaysia.

Originality/value

The paper contributes to the knowledge of the most frequent error patterns for dyslexic children's reading in BM and to the knowledge that a CD phoneme model together with the phoneme refinement rule can built up a more fine‐tuned lexical model for an ASR specifically for dyslexic children's reading isolated words in BM.

Keywords

Citation

Husni, H. and Jamaludin, Z. (2010), "Improving ASR performance using context‐dependent phoneme models", Journal of Systems and Information Technology, Vol. 12 No. 1, pp. 56-69. https://doi.org/10.1108/13287261011032652

Download as .RIS

Publisher

:

Emerald Group Publishing Limited

Copyright © 2010, Emerald Group Publishing Limited

Please note you might not have access to this content

You may be able to access this content by login via Shibboleth, Open Athens or with your Emerald account.
If you would like to contact us about accessing this content, click the button and fill out the form.
To rent this content from Deepdyve, please click the button.