Show simple item record

Authordc.contributor.authorArroyuelo, Diego 
Authordc.contributor.authorNavarro, Gonzalo es_CL
Admission datedc.date.accessioned2013-12-20T15:21:35Z
Available datedc.date.available2013-12-20T15:21:35Z
Publication datedc.date.issued2007
Identifierdc.identifier.urihttps://repositorio.uchile.cl/handle/2250/125817
General notedc.descriptionArtículo de publicación ISIen_US
Abstractdc.description.abstractFull-text searching consists in locating the occurrences of a given pattern P[1::m] in a text T[1::u], both sequences over an alpha- bet of size ¾. In this paper we de¯ne a new index for full-text searching on secondary storage, based on the Lempel-Ziv compression algorithm and requiring 8uHk +o(u log ¾) bits of space, where Hk denotes the k-th order empirical entropy of T, for any k = o(log¾ u). Our experimental re- sults show that our index is signi¯cantly smaller than any other practical secondary-memory data structure: 1.4{2.3 times the text size including the text, which means 39%{65% the size of traditional indexes like String B-trees [Ferragina and Grossi, JACM 1999]. In exchange, our index re- quires more disk access to locate the pattern occurrences. Our index is able to report up to 600 occurrences per disk access, for a disk page of 32 kilobytes. If we only need to count pattern occurrences, the space can be reduced to about 1.04{1.68 times the text size, requiring about 20{60 disk accesses, depending on the pattern length.en_US
Lenguagedc.language.isoenen_US
Type of licensedc.rightsAttribution-NonCommercial-NoDerivs 3.0 Chile*
Link to Licensedc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/cl/*
Keywordsdc.subjectA Lempel-Ziv Text Indexen_US
Títulodc.titleA Lempel-Ziv Text Index on Secondary Storageen_US
Document typedc.typeArtículo de revista


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Chile
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile