Assessing the robustness of recurrent neural networks to enhance the spectrum of reverberated speech

Paniagua Peñaranda, Carolina; Zeledón Córdoba, Marisol; Coto Jiménez, Marvin

Assessing the robustness of recurrent neural networks to enhance the spectrum of reverberated speech

dc.creator	Paniagua Peñaranda, Carolina
dc.creator	Zeledón Córdoba, Marisol
dc.creator	Coto Jiménez, Marvin
dc.date.accessioned	2022-03-23T21:24:13Z
dc.date.available	2022-03-23T21:24:13Z
dc.date.issued	2020
dc.description	Part of the Communications in Computer and Information Science book series (CCIS, volume 1087).	es_ES
dc.description.abstract	Implementing voice recognition systems and voice analysis in real-life contexts present important challenges, especially when signal recording/registering conditions are adverse. One of the conditions that produce signal degradation, which has also been studied in recent years is reverberation. Reverberation is produced by the sound wave reflections that travel through the microphone from multiple directions. Several Deep Learning-based methods have been proposed to improve speech signals that have been degraded with reverberation and are proven to be effective. Recently, recurrent neural networks, especially those with short and long term memory (LSTM), have presented surprising results in those tasks. In this work, a proposal to evaluate the robustness of these neural networks to learn different reverberation conditions without any previous information is presented. The results show the necessity to train fewer sets of LSTM networks to improve speech signals, since a single network can learn several conditions simultaneously, in contrast with the current method of training a network for every single condition or noise level. The evaluation has been made based on quality measurements of the signal’s spectrum (distance and perceptual quality), in comparison with the reverberated version. Results help to affirm the fact that LSTM networks are able to enhance the signal in any of five conditions, where all of them were trained simultaneously, with equivalent results as if to train a network for every single condition of reverberation.	es_ES
dc.description.procedence	UCR::Vicerrectoría de Docencia::Ingeniería::Facultad de Ingeniería::Escuela de Ingeniería Eléctrica	es_ES
dc.description.sponsorship	Universidad de Costa Rica/[322-B9-105]/UCR/Costa Rica	es_ES
dc.identifier.citation	https://link.springer.com/chapter/10.1007/978-3-030-41005-6_19	es_ES
dc.identifier.codproyecto	322-B9-105
dc.identifier.doi	10.1007/978-3-030-41005-6_19
dc.identifier.isbn	978-3-030-41005-6
dc.identifier.uri	https://hdl.handle.net/10669/86270
dc.language.iso	eng	es_ES
dc.source	High Performance Computing (pp.276-290).Turrialba, Costa Rica: Springer, Cham	es_ES
dc.subject	Speech enhancement	es_ES
dc.subject	Reverberation	es_ES
dc.subject	Deep learning	es_ES
dc.subject	Long short-term memory (LSTM)	es_ES
dc.title	Assessing the robustness of recurrent neural networks to enhance the spectrum of reverberated speech	es_ES
dc.type	comunicación de congreso	es_ES

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Springer2.pdf
Size:: 190.35 KB
Format:: Adobe Portable Document Format
Description:: Artículo principal

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 3.5 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Ingeniería eléctrica