Files
wright1176169264.pdf (1.01 MB)
Multilingual Articulatory Features for Speech Recognition
Author Info
Ore, Brian M.
Permalink: http://rave.ohiolink.edu/etdc/view?acc_num=wright1176169264
Year and Degree
2007, Master of Science in Engineering (MSEgr), Wright State University, Electrical Engineering.
Abstract
Articulatory features describe the way in which the speech organs are used when producing speech sounds. Research has shown that incorporating this information into speech recognizers can lead to an improvement in system performance. The majority of previous work, however, has been limited to detecting articulatory features in a single language. In this thesis, Gaussian Mixture Models (GMMs) and Multi-Layer Perceptrons (MLPs) were used to detect articulatory features in English, German, Spanish, and Japanese. The outputs of the detectors were used to form the feature set for a Hidden Markov Model (HMM)-based phoneme recognizer. The best overall detection and recognition performance was obtained using MLPs with context. Compared to Mel-Frequency Cepstral Coefficient (MFCC)-based systems, the proposed feature sets yielded an increase of up to 4.39% correct and 5.37% accuracy when using monophone models, and an increase of up to 3.22% correct and 2.60% accuracy with triphone models. On a word recognition task, however, the MFCC systems performed better. Multilingual articulatory feature detectors were also created for all four languages using MLPs. An additional feature set was created using the multilingual detectors and evaluated on the same phoneme recognition task. Compared to the feature sets created with the language-dependent MLP detectors, the maximum decrease in system performance with monophone models was 1.44% correct and 1.72% accuracy on Japanese, and the maximum improvement in system performance with triphone models was 0.75% correct and 0.40% accuracy on Spanish. On a word recognition task, the feature sets created with the multilingual MLP detectors yielded a decrease of up to 3.75% correct and 6.01% accuracy. As a final experiment, two different procedures were investigated for combining the scores from the English GMM and MLP articulatory feature detectors. 
It was found that the detection performance for each articulatory feature can be improved by combining the scores from all GMM and MLP detectors.
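The abstract reports results as both "% correct" and "% accuracy" without defining them. The sketch below assumes the conventional scoring definitions used in HMM-based recognition: percent correct counts only deletions and substitutions as errors, while accuracy also penalizes insertions. The function names and the error counts are hypothetical and chosen only for illustration; they are not taken from the thesis.

```python
def percent_correct(n_ref: int, deletions: int, substitutions: int) -> float:
    """Percent correct: share of reference labels recognized correctly.

    Assumed definition: 100 * (N - D - S) / N. Insertions are ignored.
    """
    if n_ref <= 0:
        raise ValueError("n_ref must be positive")
    hits = n_ref - deletions - substitutions
    return 100.0 * hits / n_ref


def percent_accuracy(n_ref: int, deletions: int, substitutions: int,
                     insertions: int) -> float:
    """Accuracy: like percent correct, but insertions also count as errors.

    Assumed definition: 100 * (N - D - S - I) / N.
    """
    if n_ref <= 0:
        raise ValueError("n_ref must be positive")
    hits = n_ref - deletions - substitutions
    return 100.0 * (hits - insertions) / n_ref


# Hypothetical counts for illustration only (not results from the thesis).
n_ref, deletions, substitutions, insertions = 1000, 50, 150, 40
print(percent_correct(n_ref, deletions, substitutions))               # 80.0
print(percent_accuracy(n_ref, deletions, substitutions, insertions))  # 76.0
```

Under these assumed definitions accuracy can never exceed percent correct, which is why a single system change can produce a different gain or loss in each figure.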
Committee
Brian Rigling (Advisor)
Pages
101 p.
Keywords
Speech recognition; Articulatory features
Recommended Citations
Citations
Ore, B. M. (2007). Multilingual Articulatory Features for Speech Recognition [Master's thesis, Wright State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=wright1176169264
APA Style (7th edition)
Ore, Brian. Multilingual Articulatory Features for Speech Recognition. 2007. Wright State University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=wright1176169264.
MLA Style (8th edition)
Ore, Brian. "Multilingual Articulatory Features for Speech Recognition." Master's thesis, Wright State University, 2007. http://rave.ohiolink.edu/etdc/view?acc_num=wright1176169264
Chicago Manual of Style (17th edition)
Document number: wright1176169264
Download Count: 1,073
Copyright Info
© 2007, all rights reserved.
This open access ETD is published by Wright State University and OhioLINK.