About
Contact
Help
Sending publications
How to publish
Advanced Search
View Item 
  •   Home
  • Facultad de Ciencias Físicas y Matemáticas
  • Artículos de revistas
  • View Item
  •   Home
  • Facultad de Ciencias Físicas y Matemáticas
  • Artículos de revistas
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse byCommunities and CollectionsDateAuthorsTitlesSubjectsThis CollectionDateAuthorsTitlesSubjects

My Account

Login to my accountRegister
Biblioteca Digital - Universidad de Chile
Revistas Chilenas
Repositorios Latinoamericanos
Tesis LatinoAmericanas
Tesis chilenas
Related linksRegistry of Open Access RepositoriesOpenDOARGoogle scholarCOREBASE
My Account
Login to my accountRegister

Dynamic entropy-compressed sequences and full-text indexes

Artículo
Thumbnail
Open/Download
IconMakinen_Veli.pdf (143.1Kb)
Publication date
2006
Metadata
Show full item record
Cómo citar
Mäkinen, Veli
Cómo citar
Dynamic entropy-compressed sequences and full-text indexes
.
Copiar
Cerrar

Author
  • Mäkinen, Veli;
  • Navarro, Gonzalo;
Abstract
Given a sequence of n bits with binary zero-order entropy H-o, we present a dynamic data structure that requires nH(o) + o(n) bits of space, which is able of performing rank and select, as well as inserting and deleting bits at arbitrary positions, in O(log n) worst-case time. This extends previous results by Hon et al. [ISAAC 2003] achieving O(log n/log log n) time for rank and select but Theta(polylog(n)) amortized time for inserting and deleting bits, and requiring n + o(n) bits of space; and by Raman et al. [SODA 2002] which have constant query time but a static structure. In particular, our result becomes the first entropy-bound dynamic data structure for rank and select over bit sequences. We then show how the above result can be used to build a dynamic full-text self-index for a collection of texts over an alphabet of size sigma, of overall length n and zero-order entropy H-o. The index requires nHo + o(n log o) bits of space, and can count the number of occurrences of a pattern of length m in time O(m log n log sigma). Reporting the occ occurrences can be supported in O(occ log(2) n log sigma) time, paying O(n) extra space. Insertion of text to the collection takes O(log n log sigma) time per symbol, which becomes O(log(2) n log sigma) for deletions. This improves a previous result by Chan et al. [CPM 2004]. As a consequence, we obtain an O(n log n log sigma) time construction algorithm for a compressed self-index requiring nH(o) + o(n log sigma) bits working space during construction.
Identifier
URI: https://repositorio.uchile.cl/handle/2250/124917
ISSN: 0302-9743
Quote Item
COMBINATORIAL PATTERN MATCHING, PROCEEDINGS Book Series: LECTURE NOTES IN COMPUTER SCIENCE Volume: 4009 Pages: 306-317 Published: 2006
Collections
  • Artículos de revistas
xmlui.footer.title
31 participating institutions
More than 73,000 publications
More than 110,000 topics
More than 75,000 authors
Published in the repository
  • How to publish
  • Definitions
  • Copyright
  • Frequent questions
Documents
  • Dating Guide
  • Thesis authorization
  • Document authorization
  • How to prepare a thesis (PDF)
Services
  • Digital library
  • Chilean academic journals portal
  • Latin American Repository Network
  • Latin American theses
  • Chilean theses
Dirección de Servicios de Información y Bibliotecas (SISIB)
Universidad de Chile

© 2020 DSpace
  • Access my account