Kérwá
Enhancing speech recorded from a wearable sensor using a collection of autoencoders

dc.creator: González Salazar, Astryd
dc.creator: Gutiérrez Muñoz, Michelle
dc.creator: Coto Jiménez, Marvin
dc.date.accessioned: 2022-03-23T21:33:01Z
dc.date.available: 2022-03-23T21:33:01Z
dc.date.issued: 2020
dc.description: Part of the Communications in Computer and Information Science book series (CCIS, volume 1087).
dc.description.abstract: Assistive Technology (AT) encompasses the use of technological devices to improve the learning process or the general capabilities of people with disabilities. One of the major tasks in AT is the development of devices that offer alternative or augmentative communication. In this work, we implemented a simple AT device with a low-cost sensor for registering speech signals; the recorded sound is of low quality and corrupted, and is therefore not suitable for integration into speech recognition systems, automatic transcription, or general recognition of vocal-tract sounds for people with disabilities. We propose the use of a group of artificial neural networks, each improving a different aspect of the signal. Studies of speech enhancement normally focus on improving signals degraded under specific, known conditions, such as background noise, reverberation, or natural noises. In our case, the conditions that degrade the sound are unknown, an uncertainty that makes enhancement considerably more challenging in a real-life application. The results show the capacity of the artificial neural networks to enhance the quality of the sound under several objective evaluation measures. This proposal can therefore become a way of treating these kinds of signals to improve robust speech recognition systems and increase the real possibilities of implementing low-cost AT devices.
dc.description.procedence: UCR::Vicerrectoría de Docencia::Ingeniería::Facultad de Ingeniería::Escuela de Ingeniería Eléctrica
dc.description.sponsorship: Universidad de Costa Rica/[322-B9-105]/UCR/Costa Rica
dc.description.sponsorship: Universidad de Costa Rica/[ED-3416]/UCR/Costa Rica
dc.identifier.citation: https://link.springer.com/chapter/10.1007/978-3-030-41005-6_26
dc.identifier.codproyecto: 322-B9-105
dc.identifier.codproyecto: ED-3416
dc.identifier.doi: 10.1007/978-3-030-41005-6_26
dc.identifier.isbn: 978-3-030-41005-6
dc.identifier.uri: https://hdl.handle.net/10669/86272
dc.language.iso: eng
dc.source: High Performance Computing (pp. 383-397). Turrialba, Costa Rica: Springer, Cham
dc.subject: Artificial neural networks
dc.subject: Assistive Technology
dc.subject: Long short-term memory (LSTM)
dc.subject: Speech enhancement
dc.title: Enhancing speech recorded from a wearable sensor using a collection of autoencoders
dc.type: conference paper
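The abstract describes passing corrupted sensor speech through neural networks trained to reconstruct clean signals. As a rough illustration of that denoising-autoencoder idea only, the sketch below trains a tiny feedforward autoencoder on synthetic noisy frames; the paper itself uses LSTM-based autoencoders on real wearable-sensor recordings, and every signal, dimension, and hyperparameter here is a hypothetical stand-in, not the authors' setup.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic "clean" frames: sinusoids of varying frequency, a toy proxy for
# speech features. The additive noise stands in for the unknown degradation.
t = np.linspace(0.0, 1.0, 64)
clean = np.array([np.sin(2 * np.pi * f * t) for f in rng.uniform(2, 8, 200)])
noisy = clean + 0.3 * rng.standard_normal(clean.shape)

# Tiny denoising autoencoder: 64 -> 32 -> 64, tanh hidden layer, linear output.
n_in, n_h = 64, 32
W1 = 0.1 * rng.standard_normal((n_in, n_h)); b1 = np.zeros(n_h)
W2 = 0.1 * rng.standard_normal((n_h, n_in)); b2 = np.zeros(n_in)

lr = 0.05
for _ in range(500):
    h = np.tanh(noisy @ W1 + b1)                 # encoder: noisy frame -> code
    out = h @ W2 + b2                            # decoder: code -> frame estimate
    g_out = 2.0 * (out - clean) / len(noisy)     # MSE gradient; target is CLEAN
    g_W2 = h.T @ g_out
    g_b2 = g_out.sum(axis=0)
    g_h = (g_out @ W2.T) * (1.0 - h ** 2)        # backprop through tanh
    g_W1 = noisy.T @ g_h
    g_b1 = g_h.sum(axis=0)
    W1 -= lr * g_W1; b1 -= lr * g_b1
    W2 -= lr * g_W2; b2 -= lr * g_b2

denoised = np.tanh(noisy @ W1 + b1) @ W2 + b2
mse_noisy = float(np.mean((noisy - clean) ** 2))
mse_denoised = float(np.mean((denoised - clean) ** 2))
print(f"MSE vs. clean - noisy input: {mse_noisy:.4f}, autoencoder output: {mse_denoised:.4f}")
```

The key point the sketch shares with the paper is the training objective: the network receives the degraded signal as input but is penalized against the clean signal, so it learns a mapping that removes the corruption rather than merely copying its input.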

Files

Original bundle

Name: Springer3.pdf
Size: 1.82 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 3.5 KB
Format: Item-specific license agreed upon to submission