Logo Kérwá
 

An Experimental Study on Speech Enhancement Based on a Combination of Wavelets and Deep Learning

Loading...
Thumbnail Image

Authors

Gutiérrez Muñoz, Michelle
Coto Jiménez, Marvin

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

The purpose of speech enhancement is to improve the quality of speech signals degraded by noise, reverberation, or other artifacts that can affect the intelligibility, automatic recognition, or other attributes involved in speech technologies and telecommunications, among others. In such applications, it is essential to provide methods to enhance the signals to allow the understanding of the messages or adequate processing of the speech. For this purpose, during the past few decades, several techniques have been proposed and implemented for the abundance of possible conditions and applications. Recently, those methods based on deep learning seem to outperform previous proposals even on real-time processing. Among the new explorations found in the literature, the hybrid approaches have been presented as a possibility to extend the capacity of individual methods, and therefore increase their capacity for the applications. In this paper, we evaluate a hybrid approach that combines both deep learning and wavelet transformation. The extensive experimentation performed to select the proper wavelets and the training of neural networks allowed us to assess whether the hybrid approach is of benefit or not for the speech enhancement task under several types and levels of noise, providing relevant information for future implementations.

Description

Keywords

speech enhancement, denoising, Signal processing, Deep learning, wavelets

Citation

https://www.mdpi.com/2079-3197/10/6/102?type=check_update&version=1

Endorsement

Review

Supplemented By

Referenced By