Large vocabulary speech recognition in noisy environments
| buir.advisor | Çetin, A. Enis | |
| dc.contributor.author | Jabloun, Firas | |
| dc.date.accessioned | 2016-01-08T20:15:20Z | |
| dc.date.available | 2016-01-08T20:15:20Z | |
| dc.date.issued | 1998 | |
| dc.description | Cataloged from PDF version of article. | en_US | 
| dc.description | Includes bibliographical references leaves 48-52 | en_US | 
| dc.description.abstract | A ІКПѴ set of speech feature parameters based on multirate subband analysis and the Teager Energy Operator (TEO) is developed. The speech signal is first divided into nonuniform subbands in mel-scale using a multirate filter-bank, then the Teager energies of the subsignals are estimated. Finally, the feature vector is constructed by logcompression and inverse DOT computation. The new feature parameters (TEOCEP) have a robust speech recognition performance in car engine noise which has a low pass nature. In this thesis, we also present some solutions to the problem of large vocabulary speech recognition. Triphone-based Hidden Markov. Models (HMM) are used to model the vocabulary words. Although the straight forward parallel search strategy gives good recognition performance, the processing time required is found to be long and impractical. Therefore another search strategy with similar performance is described. Subvocabularies are developed during the training session to reduce the total number of words considered in the search process. The search is then performed in a tree structure by investigating one subvocabulary instead of all the words. | en_US | 
| dc.description.statementofresponsibility | Jabloun, Firas | en_US | 
| dc.format.extent | 52 leaves | en_US | 
| dc.identifier.uri | http://hdl.handle.net/11693/18003 | |
| dc.language.iso | English | en_US | 
| dc.rights | info:eu-repo/semantics/openAccess | en_US | 
| dc.subject | Speech recognition | en_US | 
| dc.subject | Multirate subband anttlysis | en_US | 
| dc.subject | Teager Energy Operator | en_US | 
| dc.subject | Nonlinear speech modeling. | en_US | 
| dc.subject | Triphones | en_US | 
| dc.subject | Tree structure search strategy | en_US | 
| dc.subject.lcc | TK7895.S65 J33 1998 | en_US | 
| dc.subject.lcsh | Automatic speech recognition. | en_US | 
| dc.subject.lcsh | Natural language processing(Computer science. | en_US | 
| dc.title | Large vocabulary speech recognition in noisy environments | en_US | 
| dc.type | Thesis | en_US | 
| thesis.degree.discipline | Electrical and Electronic Engineering | |
| thesis.degree.grantor | Bilkent University | |
| thesis.degree.level | Master's | |
| thesis.degree.name | MS (Master of Science) | 
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- B043204.pdf
- Size:
- 1.78 MB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version