Speaker-dependent speech coding

Date
2006
Advisor
Çetin, A. Enis
Publisher
Bilkent University
Language
English
Abstract

With the rapid expansion of the Internet, low-cost and secure telephone calls over the Internet have become feasible, and new digital speech compression standards have been developed. Digital speech codecs can be used both in regular telephone networks and in Internet-based systems. For a secure call, speech data are first compressed by a digital speech codec, and the compressed packets are then encrypted with a method such as the Triple Data Encryption Standard (3DES) [1] or the Advanced Encryption Standard (AES) [2]. Compression and encryption are computationally demanding and may even require high-frequency processors and DSPs to encrypt the binary speech data. Current speech coders are speaker independent, i.e., they do not perform any speaker-specific operations; they do not even distinguish between male and female speakers. An interesting way to address this problem is to encode speech with a system that is built around a specific user before sending it. Such a system, which can be called speaker-dependent speech coding, provides a computationally efficient and relatively secure VoIP call with high quality and without any separate encryption, relative to standard speech codecs at the same bit rate. Its disadvantages are that all the speech characteristics of each user must be acquired in advance and that extra storage space is needed. Its advantages are that communication is secure, because the speech characteristics of the speaker are unknown to other users, and that the synthesized speech has higher quality than LPC-compressed speech at the same bit rate.
