Buscar

Mostrando ítems 1-10 de 15

Auto-Associative Initialization of LSTM Neural Networks for Fundamental Frequency Detection in Noisy Speech Signals

Coto Jiménez, Marvin (2018)

In this paper, we present a new approach for fundamental frequency detection in noisy speech, based on Long Short-term Memory Neural Networks (LSTM). Fundamental frequency is one of the most important parameters of human ...

Evaluation of denoising algorithms for footsteps sound classification in noisy environments

Brenes Jiménez, Carlos; Caravaca Mora, Ronald; Coto Jiménez, Marvin (2021)

Identifying a person using footsteps sounds is part of the recent research in developing biometrics, systems designed to identify an individual in a group using body measurements. The sound of footsteps has a short ...

Experimental study on transfer learning in denoising autoencoders for speech enhancement

Coto Jiménez, Marvin (2020)

The quality of speech signals is affected by a combination of background noise, reverberation, and other distortions in real-life environments. The processing of such signals presents important challenges for tasks such ...

Robustness of LSTM neural networks for the enhancement of spectral parameters in noisy speech signals

Coto Jiménez, Marvin (2019)

In this paper, we carry out a comparative performance analysis of Long Short-term Memory (LSTM) Neural Networks for the task of noise reduction. Recent work in this area has shown the advantages of this kind of network for ...

Hidden Markov Models for artificial voice production and accent modification

Coto Jiménez, Marvin; Goddard Close, John (2016)

In this paper, we consider the problem of accent modification between Castilian Spanish and Mexican Spanish. This is an interesting application area for tasks such as the automatic dubbing of pictures and videos with ...

An experimental study on fundamental frequency detection in reverberated speech with pre-trained recurrent neural networks

Alfaro Picado, Andrei Fabian; Solís Cerdas, Stacy Daniela; Coto Jiménez, Marvin (2020)

The detection of the fundamental frequency (f0) in speech signals is relevant in areas such as automatic speech recognition and identification, with multiple potential applications. For example, in virtual assistants, ...

Buscar

Auto-Associative Initialization of LSTM Neural Networks for Fundamental Frequency Detection in Noisy Speech Signals

Evaluation of denoising algorithms for footsteps sound classification in noisy environments

Experimental study on transfer learning in denoising autoencoders for speech enhancement

Robustness of LSTM neural networks for the enhancement of spectral parameters in noisy speech signals

Hidden Markov Models for artificial voice production and accent modification

An experimental study on fundamental frequency detection in reverberated speech with pre-trained recurrent neural networks

Assessing the robustness of recurrent neural networks to enhance the spectrum of reverberated speech

Pre-training Long Short-term Memory neural networks for efficient regression in artificial speech postfiltering

A performance evaluation of several artificial neural networks for mapping speech spectrum parameters

Enhancing speech recorded from a wearable sensor using a collection of autoencoders

Buscar

Filtros

Auto-Associative Initialization of LSTM Neural Networks for Fundamental Frequency Detection in Noisy Speech Signals

Evaluation of denoising algorithms for footsteps sound classification in noisy environments

Experimental study on transfer learning in denoising autoencoders for speech enhancement

Robustness of LSTM neural networks for the enhancement of spectral parameters in noisy speech signals

Hidden Markov Models for artificial voice production and accent modification

An experimental study on fundamental frequency detection in reverberated speech with pre-trained recurrent neural networks

Assessing the robustness of recurrent neural networks to enhance the spectrum of reverberated speech

Pre-training Long Short-term Memory neural networks for efficient regression in artificial speech postfiltering

A performance evaluation of several artificial neural networks for mapping speech spectrum parameters

Enhancing speech recorded from a wearable sensor using a collection of autoencoders