Document listing on repetitive collections with guaranteed performance
Author
dc.contributor.author
Navarro, Gonzalo
Admission date
dc.date.accessioned
2019-05-29T13:39:02Z
Available date
dc.date.available
2019-05-29T13:39:02Z
Publication date
dc.date.issued
2017
Cita de ítem
dc.identifier.citation
Leibniz International Proceedings in Informatics, LIPIcs, Volumen 78,
Identifier
dc.identifier.issn
18688969
Identifier
dc.identifier.other
10.4230/LIPIcs.CPM.2017.4
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/168998
Abstract
dc.description.abstract
We consider document listing on string collections, that is, finding in which strings a given pattern appears. In particular, we focus on repetitive collections: a collection of size N over alphabet [1, σ] is composed of D copies of a string of size n, and s single-character edits are applied on the copies. We introduce the first document listing index with size Õ(n+s), precisely O((n lg σ+s lg2 N) lg D) bits, and with useful worst-case time guarantees: Given a pattern of length m, the index reports the ndoc strings where it appears in time O(m2 + m lg N(lgD + lgϵ N).
Lenguage
dc.language.iso
en
Publisher
dc.publisher
Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing