Dhiman, J K
(2013)
Prosody Modifications for Voice Conversion.
Masters thesis, Indian Institute of Technology Hyderabad.
Abstract
Generally defined, speech modification is the process of changing certain perceptual properties of speech while
leaving other properties unchanged. Among the many types of speech information that may be altered are rate
of articulation, pitch and formant characteristics.Modifying the speech parameters like pitch, duration and strength
of excitation by desired factor is termed as prosody modification. In this thesis prosody modifications for voice
conversion framework are presented. Among all the speech modifications for prosody two things are important
firstly modification of duartion and pauses (Time scale modification) in a speech utterance and secondly
modification of the pitch(pitch scale modification).Prosody modification involves changing the pitch and duration
of speech without affecting the message and naturalness.In this work time scale and pitch scale modifications
of speech are discussed using two methods Time Domain Pitch Synchronous Overlapped-Add (TD-PSOLA) and
epoch based approach.In order to apply desired speech modifications TD-PSOLA discussed in this thesis works
directly on speech in time domian although there are many variations of TD-PSOLA.The epoch based approach
involves modifications of LP-residual.
Actions (login required)
|
View Item |