Show simple item record

Authordc.contributor.authorGrabowski, Szymon 
Authordc.contributor.authorMäkinen, Veli es_CL
Authordc.contributor.authorNavarro, Gonzalo es_CL
Authordc.contributor.authorSalinger, Alejandro es_CL
Admission datedc.date.accessioned2009-04-08T09:34:50Z
Available datedc.date.available2009-04-08T09:34:50Z
Publication datedc.date.issued2006-12
Cita de ítemdc.identifier.citationINTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE Volume: 17 Issue: 6 Pages: 1365-1384 Published: DEC 2006en
Identifierdc.identifier.issn0129-0541
Identifierdc.identifier.urihttps://repositorio.uchile.cl/handle/2250/124880
Abstractdc.description.abstractWe design a succinct full-text index based on the idea of Huffman-compressing the text and then applying the Burrows-Wheeler transform over it. The resulting structure can be searched as an FM-index, with the benefit of removing the sharp dependence on the alphabet size, sigma, present in that structure. On a text of length n with zero-order entropy H-0, our index needs O(n(H-0 + 1)) bits of space, without any significant dependence on or. The average search time for a pattern of length m is O(m(H-0 + 1)), under reasonable assumptions. Each position of a text occurrence can be located in worst case time O((H-0 + 1) log n), while any text substring of length L can be retrieved in O((H-0 + 1)L) average time in addition to the previous worst case time. Our index provides a relevant space/time tradeoff between existing succinct data structures, with the additional interest of being easy to implement. We also explore other coding variants alternative to Huffman and exploit their synchronization properties. Our experimental results on various types of texts show that our indexes are highly competitive in the space/time tradeoff map.en
Lenguagedc.language.isoenen
Publisherdc.publisherWORLD SCIENTIFIC PUBL CO PTE LTDen
Keywordsdc.subjectSUFFIX ARRAYSen
Títulodc.titleA simple alphabet-independent FM-indexen
Document typedc.typeArtículo de revista


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record