Vijayan, K and Kodukula, Sri Rama Murty
(2014)
Comparative Study of Spectral Mapping Techniques for Enhancement of Throat Microphone Speech.
In: 2014 20th National Conference on Communications, NCC 2014, 28 February 2014 through 2 March 2014, Kanpur; India;.
Full text not available from this repository.
(
Request a copy)
Abstract
The objective of this work is to study the suitability of existing spectral mapping methods for enhancement of throat microphone (TM) speech, and propose a more elegant method for spectral mapping. Gaussian mixture models (GMM) and neural networks (NN) have been used for spectral mapping. Though GMM-based mapping captures the variability among speech sounds through multiple mixtures, it can only provide a linear map between the source and the target. On the other hand, NN-based mapping is capable of providing a nonlinear map but a single mapping scheme may not handle variability across different speech sounds. Incorporating the advantages from these approaches, we propose a spectral mapping method using multiple neural networks. Speech data is clustered using k-means algorithm, and a separate neural network is employed to capture the mapping within each cluster. Objective evaluation has shown that proposed method is better than both GMM-base and NN-base mapping schemes
Actions (login required)
|
View Item |