Measuring the effect of reverberation on statistical parametric speech synthesis
dc.creator | Coto Jiménez, Marvin | |
dc.date.accessioned | 2022-03-22T21:59:29Z | |
dc.date.available | 2022-03-22T21:59:29Z | |
dc.date.issued | 2020 | |
dc.description | Part of the Communications in Computer and Information Science book series (CCIS, volume 1087). | es_ES |
dc.description.abstract | Text-to-speech (TTS) synthesis is the technique of generating intelligible speech from a given text. The most recent TTS techniques are based on machine learning, implementing systems that learn the mapping between linguistic specifications and the corresponding parameters of the speech signal. Given the growing interest in implementing verbal communication systems in different devices, such as cell phones, car navigation systems and personal assistants, it is important to use speech data from many sources. The speech recordings available for this purpose are not always of the best quality. This is the case, for example, when an artificial voice is created from historical recordings, or from a person for whom only a small set of recordings exists. In these cases, there is an additional challenge due to the adverse conditions in the data. Reverberation is one of the conditions that can be found in these cases, a product of the different trajectories that a speech signal can take in an environment before being registered by a microphone. In the present work, we quantitatively explore the effect of different levels of reverberation on the quality of the artificial voice generated from such data. The results show that the quality of the generated artificial speech is affected considerably by any level of reverberation. Thus, the application of speech enhancement algorithms must always be taken into consideration before and after any TTS process. | es_ES |
dc.description.procedence | UCR::Vicerrectoría de Docencia::Ingeniería::Facultad de Ingeniería::Escuela de Ingeniería Eléctrica | es_ES |
dc.description.sponsorship | Universidad de Costa Rica/[322-B9-105]/UCR/Costa Rica | es_ES |
dc.identifier.citation | https://link.springer.com/chapter/10.1007/978-3-030-41005-6_25 | es_ES |
dc.identifier.codproyecto | 322-B9-105 | |
dc.identifier.doi | 10.1007/978-3-030-41005-6_25 | |
dc.identifier.isbn | 978-3-030-41005-6 | |
dc.identifier.uri | https://hdl.handle.net/10669/86265 | |
dc.language.iso | eng | es_ES |
dc.source | High Performance Computing (pp. 369-382). Turrialba, Costa Rica: Springer, Cham | es_ES |
dc.subject | Hidden Markov Models | es_ES |
dc.subject | PESQ | es_ES |
dc.subject | Reverberation | es_ES |
dc.subject | Speech Synthesis | es_ES |
dc.title | Measuring the effect of reverberation on statistical parametric speech synthesis | es_ES |
dc.type | conference paper | es_ES |