
Author (dc.contributor.author): Poblete Ramírez, Víctor
Author (dc.contributor.author): Espic, Felipe
Author (dc.contributor.author): King, Simon
Author (dc.contributor.author): Stern, Richard M.
Author (dc.contributor.author): Huenupán, Fernando
Author (dc.contributor.author): Fredes Sandoval, Josué Abraham
Author (dc.contributor.author): Becerra Yoma, Néstor
Admission date (dc.date.accessioned): 2015-08-17T20:21:41Z
Available date (dc.date.available): 2015-08-17T20:21:41Z
Publication date (dc.date.issued): 2015
Item citation (dc.identifier.citation): Computer Speech and Language 31 (2015) 1–27 [en_US]
Identifier (dc.identifier.other): DOI: 10.1016/j.csl.2014.10.006
Identifier (dc.identifier.uri): https://repositorio.uchile.cl/handle/2250/132800
General note (dc.description): ISI publication article [en_US]
Abstract (dc.description.abstract): This paper proposes a new set of speech features called Locally-Normalized Cepstral Coefficients (LNCC) that are based on Seneff’s Generalized Synchrony Detector (GSD). First, an analysis of the GSD frequency response is provided to show that it generates spurious peaks at harmonics of the detected frequency. Then, the GSD frequency response is modeled as a quotient of two filters centered at the detected frequency. The numerator is a triangular band-pass filter centered around a particular frequency, similar to the ordinary Mel filters. The denominator term is a filter that responds maximally to frequency components on either side of the numerator filter. As a result, a local normalization is performed without the spurious peaks of the original GSD. Speaker verification results demonstrate that the proposed LNCC features are of low computational complexity and far more effectively compensate for spectral tilt than ordinary MFCC coefficients. LNCC features do not require the computation and storage of a moving average of the feature values, and they provide relative reductions in Equal Error Rate (EER) as high as 47.7%, 34.0%, or 25.8% when compared with MFCC, MFCC + CMN, or MFCC + RASTA, respectively, in one case of variable spectral tilt. [en_US]
Sponsor (dc.description.sponsorship): CONICYT-ANILLO ACT 1120; CONICYT-FONDECYT 1100195; EPSRC EP/I031022/1; Defense Advanced Research Projects Agency (DARPA) D10PC20024 [en_US]
Language (dc.language.iso): en [en_US]
Publisher (dc.publisher): Elsevier [en_US]
Type of license (dc.rights): Atribución-NoComercial-SinDerivadas 3.0 Chile [*]
Link to License (dc.rights.uri): http://creativecommons.org/licenses/by-nc-nd/3.0/cl/ [*]
Keywords (dc.subject): Channel robust feature extraction [en_US]
Keywords (dc.subject): Auditory models [en_US]
Keywords (dc.subject): Spectral local normalization [en_US]
Keywords (dc.subject): Synchrony detection [en_US]
Title (dc.title): A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification [en_US]
Document type (dc.type): Journal article
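
Illustrative note: the abstract describes the local normalization as a quotient of two filters centered at the same frequency. The Python sketch below is only a rough illustration of that idea, not the authors' implementation. The V-shaped denominator weight (`side_filter`), its small `floor`, the linear spacing of the centers, the fixed `half_bw`, and the synthetic spectrum are all assumptions made for this example. The paper's actual filter shapes, its Mel spacing, and the cepstral (DCT) step that would turn these log ratios into LNCC features are not reproduced here.

```python
import numpy as np

def triangular_filter(freqs, fc, half_bw):
    """Triangular band-pass weight: 1 at fc, falling linearly to 0 at fc +/- half_bw."""
    return np.clip(1.0 - np.abs(freqs - fc) / half_bw, 0.0, None)

def side_filter(freqs, fc, half_bw, floor=0.01):
    """Hypothetical denominator weight (an assumption, not the paper's filter):
    smallest at fc, largest at the band edges fc +/- half_bw, zero outside the band."""
    inside = np.abs(freqs - fc) <= half_bw
    return np.where(inside, np.abs(freqs - fc) / half_bw + floor, 0.0)

def log_filterbank_energies(power_spec, freqs, centers, half_bw):
    """Ordinary log energies of triangular filters (no normalization)."""
    return np.array([
        np.log(np.sum(triangular_filter(freqs, fc, half_bw) * power_spec))
        for fc in centers
    ])

def locally_normalized_log_energies(power_spec, freqs, centers, half_bw):
    """Log of the numerator-to-denominator energy ratio for each center frequency."""
    feats = []
    for fc in centers:
        num = np.sum(triangular_filter(freqs, fc, half_bw) * power_spec)
        den = np.sum(side_filter(freqs, fc, half_bw) * power_spec)
        feats.append(np.log(num / den))
    return np.array(feats)

# Synthetic, strictly positive power spectrum on a 0-4000 Hz grid (illustrative data only).
freqs = np.linspace(0.0, 4000.0, 257)
rng = np.random.default_rng(0)
spec = 1.0 + rng.random(freqs.size)

centers = np.array([500.0, 1000.0, 2000.0, 3000.0])
half_bw = 300.0

# A smooth spectral tilt of -6 dB per kHz, expressed as a power gain.
tilt = 10.0 ** (-0.6 * freqs / 1000.0)

plain_change = np.max(np.abs(
    log_filterbank_energies(spec * tilt, freqs, centers, half_bw)
    - log_filterbank_energies(spec, freqs, centers, half_bw)))
local_change = np.max(np.abs(
    locally_normalized_log_energies(spec * tilt, freqs, centers, half_bw)
    - locally_normalized_log_energies(spec, freqs, centers, half_bw)))

print("max change in plain log energies under tilt:", plain_change)
print("max change in locally normalized values under tilt:", local_change)
```

Because each value is a ratio of two energies taken from the same narrow band and the same frame, a constant channel gain cancels exactly and a slowly varying tilt largely cancels. No running average over past frames is involved, which is consistent with the abstract's remark that no moving average of the feature values needs to be computed or stored.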


Except where otherwise noted, this item's license is described as Atribución-NoComercial-SinDerivadas 3.0 Chile