En: Combinatorial Pattern Matching 18th Annual Symposium, CPM 2007, London, Canada, July 9-11, 2007. Proceedings. pp. 216-227
en_US
Identifier
dc.identifier.other
10.1007/978-3-540-73437-6_23
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/120331
Abstract
dc.description.abstract
Compressed text (self-)indexes have matured up to a point
where they can replace a text by a data structure that requires less
space and, in addition to giving access to arbitrary text passages, support
indexed text searches. At this point those indexes are competitive with
traditional text indexes (which are very large) for counting the number
of occurrences of a pattern in the text. Yet, they are still hundreds to
thousands of times slower when it comes to locating those occurrences in
the text. In this paper we introduce a new compression scheme for suffix
arrays which permits locating the occurrences extremely fast, while still
being much smaller than classical indexes. In addition, our index permits
a very efficient secondary memory implementation, where compression
permits reducing the amount of I/O needed to answer queries.
en_US
Patrocinador
dc.description.sponsorship
Mecesup Grant UCH 0109, Chile
en_US
Lenguage
dc.language.iso
en
en_US
Publisher
dc.publisher
Springer Berlin Heidelberg
en_US
Serie
dc.relation.ispartofseries
Lecture Notes in Computer Science;Volume 4580, 2007;