About
Contact
Help
Sending publications
How to publish
Advanced Search
View Item 
  •   Home
  • Facultad de Ciencias Físicas y Matemáticas
  • Artículos de revistas
  • View Item
  •   Home
  • Facultad de Ciencias Físicas y Matemáticas
  • Artículos de revistas
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse byCommunities and CollectionsDateAuthorsTitlesSubjectsThis CollectionDateAuthorsTitlesSubjects

My Account

Login to my accountRegister
Biblioteca Digital - Universidad de Chile
Revistas Chilenas
Repositorios Latinoamericanos
Tesis LatinoAmericanas
Tesis chilenas
Related linksRegistry of Open Access RepositoriesOpenDOARGoogle scholarCOREBASE
My Account
Login to my accountRegister

A simple alphabet-independent FM-index

Artículo
Thumbnail
Open/Download
IconGrabowski_Szymon.pdf (272.5Kb)
Publication date
2006-12
Metadata
Show full item record
Cómo citar
Grabowski, Szymon
Cómo citar
A simple alphabet-independent FM-index
.
Copiar
Cerrar

Author
  • Grabowski, Szymon;
  • Mäkinen, Veli;
  • Navarro, Gonzalo;
  • Salinger, Alejandro;
Abstract
We design a succinct full-text index based on the idea of Huffman-compressing the text and then applying the Burrows-Wheeler transform over it. The resulting structure can be searched as an FM-index, with the benefit of removing the sharp dependence on the alphabet size, sigma, present in that structure. On a text of length n with zero-order entropy H-0, our index needs O(n(H-0 + 1)) bits of space, without any significant dependence on or. The average search time for a pattern of length m is O(m(H-0 + 1)), under reasonable assumptions. Each position of a text occurrence can be located in worst case time O((H-0 + 1) log n), while any text substring of length L can be retrieved in O((H-0 + 1)L) average time in addition to the previous worst case time. Our index provides a relevant space/time tradeoff between existing succinct data structures, with the additional interest of being easy to implement. We also explore other coding variants alternative to Huffman and exploit their synchronization properties. Our experimental results on various types of texts show that our indexes are highly competitive in the space/time tradeoff map.
Identifier
URI: https://repositorio.uchile.cl/handle/2250/124880
ISSN: 0129-0541
Quote Item
INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE Volume: 17 Issue: 6 Pages: 1365-1384 Published: DEC 2006
Collections
  • Artículos de revistas
xmlui.footer.title
31 participating institutions
More than 73,000 publications
More than 110,000 topics
More than 75,000 authors
Published in the repository
  • How to publish
  • Definitions
  • Copyright
  • Frequent questions
Documents
  • Dating Guide
  • Thesis authorization
  • Document authorization
  • How to prepare a thesis (PDF)
Services
  • Digital library
  • Chilean academic journals portal
  • Latin American Repository Network
  • Latin American theses
  • Chilean theses
Dirección de Servicios de Información y Bibliotecas (SISIB)
Universidad de Chile

© 2020 DSpace
  • Access my account