Speech synthesis based on Hidden Markov Models and deep learning

Coto Jiménez, Marvin; Goddard Close, John

dc.creator	Coto Jiménez, Marvin
dc.creator	Goddard Close, John
dc.date.accessioned	2022-03-28T20:05:06Z
dc.date.available	2022-03-28T20:05:06Z
dc.date.issued	2016
dc.description.abstract	Speech synthesis based on Hidden Markov Models (HMM) and other statistical parametric techniques has been a hot topic for some time. Using these techniques, speech synthesizers are able to produce intelligible and flexible voices. Despite this progress, the quality of voices produced by statistical parametric synthesis has not yet reached that of the currently predominant unit-selection approaches, which select and concatenate recordings of real speech. Researchers now strive to create models that more accurately mimic human voices. In this paper, we present our proposal to incorporate recent deep learning algorithms, especially Long Short-Term Memory (LSTM) networks, to improve the quality of HMM-based speech synthesis. Thus far, the results indicate that this approach can improve the spectral characteristics of HMM voices, but additional research is needed to improve other parameters of the voice signal, such as energy and fundamental frequency, to obtain more natural-sounding voices.	es
dc.description.procedence	UCR::Vicerrectoría de Docencia::Ingeniería::Facultad de Ingeniería::Escuela de Ingeniería Eléctrica	es
dc.description.sponsorship	Universidad de Costa Rica/[]/UCR/Costa Rica	es
dc.description.sponsorship	Consejo Nacional de Ciencia y Tecnología/[CB-2012-01, No.182432]/CONACyT/México	es
dc.identifier.citation	https://www.rcs.cic.ipn.mx/2016_112/
dc.identifier.doi	https://doi.org/10.13053/rcs-112-1-2
dc.identifier.issn	1870-4069
dc.identifier.uri	https://hdl.handle.net/10669/86307
dc.language.iso	eng
dc.source	Research in Computing Science, vol. 112, pp. 19-28.	es
dc.subject	Long short-term memory (LSTM)	es
dc.subject	Hidden Markov Models (HMM)	es
dc.subject	Speech synthesis	es
dc.subject	Statistical parametric speech synthesis	es
dc.subject	Deep learning	es
dc.title	Speech synthesis based on Hidden Markov Models and deep learning	es
dc.type	original article	es

Files

Original bundle

Name: rcs-112-1-2-with-cover-page-v2.pdf
Size: 1.24 MB
Format: Adobe Portable Document Format
Description: Main article

Collections

Electrical Engineering