New methods for robust speech recognition
buir.advisor | Çetin, A. Enis | |
dc.contributor.author | Erzin, Engin | |
dc.date.accessioned | 2016-01-08T20:20:08Z | |
dc.date.available | 2016-01-08T20:20:08Z | |
dc.date.issued | 1995 | |
dc.description | Cataloged from PDF version of article. | en_US |
dc.description | Includes bibliographical references leaves 86-92. | en_US |
dc.description.abstract | New methods of feature extraction, end-point detection and speech enhcincement are developed for a robust speech recognition system. The methods of feature extraction and end-point detection are based on wavelet analysis or subband analysis of the speech signal. Two new sets of speech feature parameters, SUBLSF’s and SUBCEP’s, are introduced. Both parameter sets are based on subband analysis. The SUBLSF feature parameters are obtained via linear predictive analysis on subbands. These speech feature parameters can produce better results than the full-band parameters when the noise is colored. The SUBCEP parameters are based on wavelet analysis or equivalently the multirate subband analysis of the speech signal. The SUBCEP parameters also provide robust recognition performance by appropriately deemphasizing the frequency bands corrupted by noise. It is experimentally observed that the subband analysis based feature parameters are more robust than the commonly used full-band analysis based parameters in the presence of car noise. The a-stable random processes can be used to model the impulsive nature of the public network telecommunication noise. Adaptive filtering are developed for Q-stable random processes. Adaptive noise cancelation techniques are used to reduce the mismacth between training and testing conditions of the recognition system over telephone lines. Another important problem in isolated speech recognition is to determine the boundaries of the speech utterances or words. Precise boundary detection of utterances improves the performance of speech recognition systems. A new distance measure based on the subband energy levels is introduced for endpoint detection. | en_US |
dc.description.statementofresponsibility | Erzin, Engin | en_US |
dc.format.extent | xv, 93 leaves | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/18526 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Speech recognition | en_US |
dc.subject | Linear prediction | en_US |
dc.subject | Endpoint detection | en_US |
dc.subject | Speech enhancement | en_US |
dc.subject | Subband decomposition | en_US |
dc.subject | Wavelet transform | en_US |
dc.subject | Line spectrum frequencies | en_US |
dc.subject | A-stable distributions | en_US |
dc.subject.lcc | TK7882.S65 E79 1995 | en_US |
dc.subject.lcsh | Speech processing systems. | en_US |
dc.subject.lcsh | Speech recognition systems. | en_US |
dc.title | New methods for robust speech recognition | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Electrical and Electronic Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Ph.D. (Doctor of Philosophy) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- B032619.pdf
- Size:
- 2.91 MB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version