Large vocabulary speech recognition in noisy environments

Jabloun, Firas

Large vocabulary speech recognition in noisy environments

buir.advisor	Çetin, A. Enis
dc.contributor.author	Jabloun, Firas
dc.date.accessioned	2016-01-08T20:15:20Z
dc.date.available	2016-01-08T20:15:20Z
dc.date.issued	1998
dc.description	Cataloged from PDF version of article.	en_US
dc.description	Includes bibliographical references leaves 48-52	en_US
dc.description.abstract	A ІКПѴ set of speech feature parameters based on multirate subband analysis and the Teager Energy Operator (TEO) is developed. The speech signal is first divided into nonuniform subbands in mel-scale using a multirate filter-bank, then the Teager energies of the subsignals are estimated. Finally, the feature vector is constructed by logcompression and inverse DOT computation. The new feature parameters (TEOCEP) have a robust speech recognition performance in car engine noise which has a low pass nature. In this thesis, we also present some solutions to the problem of large vocabulary speech recognition. Triphone-based Hidden Markov. Models (HMM) are used to model the vocabulary words. Although the straight forward parallel search strategy gives good recognition performance, the processing time required is found to be long and impractical. Therefore another search strategy with similar performance is described. Subvocabularies are developed during the training session to reduce the total number of words considered in the search process. The search is then performed in a tree structure by investigating one subvocabulary instead of all the words.	en_US
dc.description.statementofresponsibility	Jabloun, Firas	en_US
dc.format.extent	52 leaves	en_US
dc.identifier.uri	http://hdl.handle.net/11693/18003
dc.language.iso	English	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Speech recognition	en_US
dc.subject	Multirate subband anttlysis	en_US
dc.subject	Teager Energy Operator	en_US
dc.subject	Nonlinear speech modeling.	en_US
dc.subject	Triphones	en_US
dc.subject	Tree structure search strategy	en_US
dc.subject.lcc	TK7895.S65 J33 1998	en_US
dc.subject.lcsh	Automatic speech recognition.	en_US
dc.subject.lcsh	Natural language processing(Computer science.	en_US
dc.title	Large vocabulary speech recognition in noisy environments	en_US
dc.type	Thesis	en_US
thesis.degree.discipline	Electrical and Electronic Engineering
thesis.degree.grantor	Bilkent University
thesis.degree.level	Master's
thesis.degree.name	MS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: B043204.pdf
Size:: 1.78 MB
Format:: Adobe Portable Document Format
Description:: Full printable version

Download

Collections

Graduate School of Engineering and Science