MAP speaker adaptation of state duration distributions for speech recognition

Yoma, Néstor Becerra; Sánchez, Jorge Silva

Author	dc.contributor.author	Yoma, Néstor Becerra
Author	dc.contributor.author	Sánchez, Jorge Silva
Admission date	dc.date.accessioned	2019-01-29T17:51:50Z
Available date	dc.date.available	2019-01-29T17:51:50Z
Publication date	dc.date.issued	2002
Cita de ítem	dc.identifier.citation	IEEE Transactions on Speech and Audio Processing, Volumen 10, Issue 7, 2018, Pages 443-450
Identifier	dc.identifier.issn	10636676
Identifier	dc.identifier.other	10.1109/TSA.2002.803441
Identifier	dc.identifier.uri	https://repositorio.uchile.cl/handle/2250/163580
Abstract	dc.description.abstract	This paper presents a framework for maximum a posteriori (MAP) speaker adaptation of state duration distributions in hidden Markov models (HMM). Four key issues of MAP estimation, namely analysis and modeling of state duration distributions, the choice of prior distribution, the specification of the parameters of the prior density and the evaluation of the MAP estimates, are tackled. Moreover, a comparison with an adaptation procedure based on maximum likelihood (ML) estimation is presented, and the problem of truncation of the state duration distribution is addressed from the statistical point of view. The results shown in this paper suggest that the speaker adaptation of temporal restrictions substantially improves the accuracy of speaker-independent (SI) HMM with clean and noisy speech. The method requires a low computational load and a small number of adapting utterances, and can be useful to follow the dynamics of the speaking rate in speech recognition.
Lenguage	dc.language.iso	en
Type of license	dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Chile
Link to License	dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/cl/
Source	dc.source	IEEE Transactions on Speech and Audio Processing
Keywords	dc.subject	Speaker adaptation
Keywords	dc.subject	Speech recognition
Keywords	dc.subject	State duration modeling
Título	dc.title	MAP speaker adaptation of state duration distributions for speech recognition
Document type	dc.type	Artículo de revista
Cataloguer	uchile.catalogador	SCOPUS
Indexation	uchile.index	Artículo de publicación SCOPUS
uchile.cosecha	uchile.cosecha	SI

Files in this item

Name:: item_0036815682.pdf
Size:: 1.956Kb
Format:: PDF

This item appears in the following Collection(s)

Artículos de revistas
Artículos de revistas

Show simple item record

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile