Optimization of the parameters characterizing sigmoidal rate-level functions based on acoustic features
Author
dc.contributor.author
Poblete Ramírez, Víctor
Author
dc.contributor.author
Becerra Yoma, Néstor
es_CL
Author
dc.contributor.author
Stern, Richard M.
es_CL
Admission date
dc.date.accessioned
2014-12-16T19:07:46Z
Available date
dc.date.available
2014-12-16T19:07:46Z
Publication date
dc.date.issued
2014
Cita de ítem
dc.identifier.citation
Speech Communication 56 (2014) 19–34
en_US
Identifier
dc.identifier.other
dx.doi.org/10.1016/j.specom.2013.07.006
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/126652
General note
dc.description
Artículo de publicación ISI
en_US
Abstract
dc.description.abstract
This paper describes the development of an optimal sigmoidal rate-level function that is a component of many models of the peripheral
auditory system. The optimization makes use of a set of criteria defined exclusively on the basis of physical attributes of the input
sound that are inspired by physiological evidence. The criteria developed attempt to discriminate between a degraded speech signal and
noise to preserve the maximum amount of information in the linear region of the sigmoidal curve, and to minimize the effects of distortion
in the saturating regions. The performance of the proposed optimal sigmoidal function is validated by text-independent speakerverification
experiments with signals corrupted by additive noise at different SNRs. The experimental results suggest that the approach
presented in combination with cepstral variance normalization can lead to relative reductions in equal error rate as great as 40% when
compared with the use of baseline MFCC coefficients for some SNRs.
en_US
Patrocinador
dc.description.sponsorship
This research was funded by Conicyt-Chile under grants
Fondecyt 1100195 and Team Research in Science and
Technology ACT 1120.