An empirical comparison of latent sematic models for applications in industry
Author
dc.contributor.author
Contreras Piña, Constanza
Author
dc.contributor.author
Ríos, Sebastián
Admission date
dc.date.accessioned
2016-06-13T18:50:19Z
Available date
dc.date.available
2016-06-13T18:50:19Z
Publication date
dc.date.issued
2016
Cita de ítem
dc.identifier.citation
Neurocomputing179(2016)176–185
en_US
Identifier
dc.identifier.other
dx.doi.org/10.1016/j.neucom.2015.11.080
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/138761
General note
dc.description
Artículo de publicación ISI
en_US
Abstract
dc.description.abstract
In recent years, topic models have been gaining popularity to perform classification of text from several web sources (from social networks to digital media). However, after working for many years in the web text mining area we have notice that assessing the quality of topics discovered is still an open problem, quite hard to solve. In this paper, we evaluated four latent semantic models using two metrics: coherence and interpretability which are the most used. We show how these pure mathematical metrics fall short to asses topics quality. Experiments were performed over a dataset of 21,863 text reclamation.