Improving automatic speech recognition containing additive noise using deep denoising autoencoders of lstm networks
dc.creator | Coto Jiménez, Marvin | |
dc.creator | Goddard Close, John | |
dc.creator | Martínez Licona, Fabiola | |
dc.date.accessioned | 2022-03-28T19:45:54Z | |
dc.date.available | 2022-03-28T19:45:54Z | |
dc.date.issued | 2016 | |
dc.description | Part of the Lecture Notes in Computer Science book series (LNCS, volume 9811). | es_ES |
dc.description.abstract | Automatic speech recognition systems (ASR) suffer from performance degradation under noisy conditions. Recent work, using deep neural networks to denoise spectral input features for robust ASR, have proved to be successful. In particular, Long Short-Term Memory (LSTM) autoencoders have outperformed other state of the art denoising systems when applied to the mfcc’s of a speech signal. In this paper we also consider denoising LSTM autoencoders (DLSTMA), but instead use three different DLSTMAs and apply each to the mfcc’s, fundamental frequency, and energy features, respectively. Results are given using several kinds of additive noise at different intensity levels, and show how this collection of DLSTMA’s improves the performance of the ASR in comparison with the LSTM autoencoder. | es_ES |
dc.description.procedence | UCR::Vicerrectoría de Docencia::Ingeniería::Facultad de Ingeniería::Escuela de Ingeniería Eléctrica | es_ES |
dc.description.sponsorship | Universidad de Costa Rica/[]/UCR/Costa Rica | es_ES |
dc.description.sponsorship | Consejo Nacional de Ciencia y Tecnología/[CB-2012-01, No.182432]/CONACyT/México | es_ES |
dc.identifier.citation | https://link.springer.com/chapter/10.1007/978-3-319-43958-7_42 | es_ES |
dc.identifier.doi | 10.1007/978-3-319-43958-7_42 | |
dc.identifier.isbn | 978-3-319-43958-7 | |
dc.identifier.uri | https://hdl.handle.net/10669/86306 | |
dc.language.iso | eng | es_ES |
dc.rights | acceso abierto | |
dc.source | Speech and Computer (pp.354-361).Budapest, Hungría: Springer, Cham | es_ES |
dc.subject | Long short-term memory (LSTM) | es_ES |
dc.subject | Deep learning | es_ES |
dc.subject | Denoising autoencoders | es_ES |
dc.title | Improving automatic speech recognition containing additive noise using deep denoising autoencoders of lstm networks | es_ES |
dc.type | comunicación de congreso | es_ES |
Archivos
Bloque original
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- 10.1007@978-3-319-43958-742.pdf
- Tamaño:
- 1.91 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Artículo principal
Bloque de licencias
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- license.txt
- Tamaño:
- 3.5 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción: