Speech synthesis based on Hidden Markov Models and deep learning

dc.creator: Coto Jiménez, Marvin
dc.creator: Goddard Close, John
dc.date.accessioned: 2022-03-28T20:05:06Z
dc.date.available: 2022-03-28T20:05:06Z
dc.date.issued: 2016
dc.description.abstract: Speech synthesis based on Hidden Markov Models (HMM) and other statistical parametric techniques has been a hot topic for some time. Using these techniques, speech synthesizers are able to produce intelligible and flexible voices. Despite this progress, the quality of voices produced using statistical parametric synthesis has not yet reached the level of the currently predominant unit-selection approaches, which select and concatenate recordings of real speech. Researchers now strive to create models that mimic human voices more accurately. In this paper, we present our proposal to incorporate recent deep learning algorithms, especially Long Short-Term Memory (LSTM) networks, to improve the quality of HMM-based speech synthesis. Thus far, the results indicate that this approach can improve the spectral characteristics of HMM voices, but additional research is needed to improve other parameters of the voice signal, such as energy and fundamental frequency, to obtain more natural-sounding voices.
dc.description.procedence: UCR::Vicerrectoría de Docencia::Ingeniería::Facultad de Ingeniería::Escuela de Ingeniería Eléctrica
dc.description.sponsorship: Universidad de Costa Rica/[]/UCR/Costa Rica
dc.description.sponsorship: Consejo Nacional de Ciencia y Tecnología/[CB-2012-01, No.182432]/CONACyT/México
dc.identifier.citation: https://www.rcs.cic.ipn.mx/2016_112/
dc.identifier.doi: https://doi.org/10.13053/rcs-112-1-2
dc.identifier.issn: 1870-4069
dc.identifier.uri: https://hdl.handle.net/10669/86307
dc.language.iso: eng
dc.source: Research in Computing Science, vol. 112, pp. 19-28.
dc.subject: Long short-term memory (LSTM)
dc.subject: Hidden Markov Models (HMM)
dc.subject: Speech synthesis
dc.subject: Statistical parametric speech synthesis
dc.subject: Deep learning
dc.title: Speech synthesis based on Hidden Markov Models and deep learning
dc.type: original article

Files

Original bundle

Name: rcs-112-1-2-with-cover-page-v2.pdf
Size: 1.24 MB
Format: Adobe Portable Document Format
Description: Main article

License bundle

Name: license.txt
Size: 3.5 KB
Format: Item-specific license agreed upon to submission
Description: