An efficient algorithm for approximated self-similarity joins in metric spaces

Ferrada Aliaga, Sebastián; Bustos Cárdenas, Benjamín; Reyes, Nora

Author	dc.contributor.author	Ferrada Aliaga, Sebastián
Author	dc.contributor.author	Bustos Cárdenas, Benjamín
Author	dc.contributor.author	Reyes, Nora
Admission date	dc.date.accessioned	2020-05-29T19:05:08Z
Available date	dc.date.available	2020-05-29T19:05:08Z
Publication date	dc.date.issued	2020
Cita de ítem	dc.identifier.citation	Information Systems. 91: (2020): 101510	es_ES
Identifier	dc.identifier.other	10.1016/j.is.2020.101510
Identifier	dc.identifier.uri	https://repositorio.uchile.cl/handle/2250/175109
Abstract	dc.description.abstract	Similarity join is a key operation in metric databases. It retrieves all pairs of elements that are similar. Solving such a problem usually requires comparing every pair of objects of the datasets, even when indexing and ad hoc algorithms are used. We propose a simple and efficient algorithm for the computation of the approximated k nearest neighbor self-similarity join. This algorithm computes Theta(n(3/2)) distances and it is empirically shown that it reaches an empirical precision of 46% in real-world datasets. We provide a comparison to other common techniques such as Quickjoin and Locality-Sensitive Hashing and argue that our proposal has a better execution time and average precision.	es_ES
Patrocinador	dc.description.sponsorship	Millennium Institute for Foundational Research on Data, Chile CONICYT-PFCHA, Argentina 2017-21170616	es_ES
Lenguage	dc.language.iso	en	es_ES
Publisher	dc.publisher	Elsevier	es_ES
Type of license	dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Chile	*
Link to License	dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/cl/	*
Source	dc.source	Information Systems	es_ES
Keywords	dc.subject	Similarity joins	es_ES
Keywords	dc.subject	kNN	es_ES
Keywords	dc.subject	Approximated nearest neighbors	es_ES
Keywords	dc.subject	Algorithms	es_ES
Keywords	dc.subject	Metric spaces	es_ES
Título	dc.title	An efficient algorithm for approximated self-similarity joins in metric spaces	es_ES
Document type	dc.type	Artículo de revista	es_ES
dcterms.accessRights	dcterms.accessRights	Acceso Abierto
Cataloguer	uchile.catalogador	ctc	es_ES
Indexation	uchile.index	Artículo de publicación ISI
Indexation	uchile.index	Artículo de publicación SCOPUS

Files in this item

Name:: An-efficient-algorithm.pdf
Size:: 2.661Mb
Format:: PDF

This item appears in the following Collection(s)

Artículos de revistas
Artículos de revistas

Show simple item record

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile