Skip to Main Content
 

Global Search Box

 
 
 
 

ETD Abstract Container

Abstract Header

Deriving Novel Posterior Feature Spaces For Conditional Random Field - Based Phone Recognition

Mohapatra, Prateeti

Abstract Details

2009, Master of Science, Ohio State University, Computer Science and Engineering.

Conditional Random Fields (CRFs) are undirected graphical models that can be used to define the joint probability distribution over a label sequences given a set of observation sequences to be labeled. A key advantage of CRFs is their great flexibility to include a wide variety of non-independent features of the input. Faced with this freedom, an important question remains: what features should be used?

This thesis describes two techniques for deriving novel features for use in Conditional Random Fields-based phone recognition, extending previous techniques that incorporated multiclass posteriors of phone classes or phonological features estimated by Multi-Layer Perceptrons.

The first technique investigates the integration of suprasegmental knowledge into the MLP classification system that is part of the CRF recognizer. CRFs are used to integrate MLP posterior estimates, particularly of phonological features or phonetic classes, which stand in as representations of the acoustics; this thesis shows that incorporating suprasegmental information as part of the MLP classification system augments the acoustic space in a beneficial way for phonological feature based CRF models. TIMIT phone recognition experiments show a small but statistically significant improvement due to both techniques.

The second experiment combines phonological feature scores from two different systems that gives a statistically significant improvement in Conditional Random Field-based TIMIT phone recognition, despite a standalone system based on their features performing significantly worse. We then explore the reasons for this improvement by examining different representations of phonological attribute classifiers, in terms of what they are classifying (binary versus n-ary features), the feature definition, the training paradigm and the representation of scoring functions. The analysis leads to the conclusions that different databases gives robustness, and that binary-ness, feature definition and score representation do not help in the improvement of the performance.

Eric Fosler-Lussier (Advisor)
Chris Brew (Committee Member)
64 p.

Recommended Citations

Citations

  • Mohapatra, P. (2009). Deriving Novel Posterior Feature Spaces For Conditional Random Field - Based Phone Recognition [Master's thesis, Ohio State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=osu1236784133

    APA Style (7th edition)

  • Mohapatra, Prateeti. Deriving Novel Posterior Feature Spaces For Conditional Random Field - Based Phone Recognition. 2009. Ohio State University, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=osu1236784133.

    MLA Style (8th edition)

  • Mohapatra, Prateeti. "Deriving Novel Posterior Feature Spaces For Conditional Random Field - Based Phone Recognition." Master's thesis, Ohio State University, 2009. http://rave.ohiolink.edu/etdc/view?acc_num=osu1236784133

    Chicago Manual of Style (17th edition)