Theses and Dissertations
Issuing Body
Mississippi State University
Advisor
Picone, Joseph
Committee Member
Lazarou, Georgios Y.
Committee Member
Younan, Nicolas H.
Date of Degree
12-13-2002
Original embargo terms
MSU Only Indefinitely
Document Type
Graduate Thesis - Campus Access Only
Major
Electrical Engineering
Degree Name
Master of Science
College
College of Engineering
Department
Department of Electrical and Computer Engineering
Abstract
Rapid advances in speech recognition theory, as well as computing hardware, have led to the development of machines that can take human speech as input, decode the information content of the speech, and respond accordingly. Real-time performance of such systems is often dominated by the evaluation of likelihoods in the statistical modeling component of the system. Statistical models are typically implemented using Gaussian mixture distributions. The primary objective of this thesis was to develop an extension of the Bucket Box Intersection algorithm in which the dimension with the optimal number of splits can be selected when multiple minima are present. The effects of normalization of mixture weights and Gaussian clipping have also been investigated. We show that the Extended BBI algorithm (EBBI) reduces run-time by 21% without introducing any approximation error. EBBI also produced a 12% lower word error rate than Gaussian clipping for the same computational complexity. These approaches were evaluated on a wide variety of tasks including conversational speech.
URI
https://hdl.handle.net/11668/19055
Recommended Citation
Srivastava, Shivali, "Fast Gaussian Evaluations in Large Vocabulary Continous Speech Recognition" (2002). Theses and Dissertations. 2237.
https://scholarsjunction.msstate.edu/td/2237