About
Contact
Help
Sending publications
How to publish
Advanced Search
View Item 
  •   Home
  • Facultad de Ciencias Físicas y Matemáticas
  • Artículos de revistas
  • View Item
  •   Home
  • Facultad de Ciencias Físicas y Matemáticas
  • Artículos de revistas
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse byCommunities and CollectionsDateAuthorsTitlesSubjectsThis CollectionDateAuthorsTitlesSubjects

My Account

Login to my accountRegister
Biblioteca Digital - Universidad de Chile
Revistas Chilenas
Repositorios Latinoamericanos
Tesis LatinoAmericanas
Tesis chilenas
Related linksRegistry of Open Access RepositoriesOpenDOARGoogle scholarCOREBASE
My Account
Login to my accountRegister

A metric index for approximate string matching

Artículo
Thumbnail
Open/Download
IconChavez_Edgar.pdf (780.9Kb)
Publication date
2006-03-07
Metadata
Show full item record
Cómo citar
Chávez, Edgar
Cómo citar
A metric index for approximate string matching
.
Copiar
Cerrar

Author
  • Chávez, Edgar;
  • Navarro, Gonzalo;
Abstract
We present a radically new indexing approach for approximate string matching. The scheme uses the metric properties of the edit distance and can be applied to any other metric between strings. We build a metric space where the sites are the nodes of the suffix tree of the text, and the approximate query is seen as a proximity query on that metric space. This permits us finding the occ occurrences of a pattern of length in, permitting up to r differences, in a text of length n over an alphabet of size a, in average time O(m(1+epsilon) + occ) for any epsilon > 0, if r = o(m/log, m) and m > ((1 + epsilon)/epsilon)log(sigma) n. The index works well up to r < (3 - root 2)m/log(sigma) m, where it achieves its maximum average search complexity O(m(1+root 2+epsilon) + occ). The construction time of the index is O(m(1+root 2+epsilon) n log n) and its space is O(m(1+root 2+epsilon) n). This is the first index achieving average search time polynomial in in and independent of n, for r = O(m /log(sigma) m). Previous methods achieve this complexity only for r = O(m/log(sigma) m). We also present a simpler scheme needing O(n) space.
Identifier
URI: https://repositorio.uchile.cl/handle/2250/124793
ISSN: 0304-3975
Quote Item
THEORETICAL COMPUTER SCIENCE Volume: 352 Issue: 1-3 Pages: 266-279 Published: MAR 7 2006
Collections
  • Artículos de revistas
xmlui.footer.title
31 participating institutions
More than 73,000 publications
More than 110,000 topics
More than 75,000 authors
Published in the repository
  • How to publish
  • Definitions
  • Copyright
  • Frequent questions
Documents
  • Dating Guide
  • Thesis authorization
  • Document authorization
  • How to prepare a thesis (PDF)
Services
  • Digital library
  • Chilean academic journals portal
  • Latin American Repository Network
  • Latin American theses
  • Chilean theses
Dirección de Servicios de Información y Bibliotecas (SISIB)
Universidad de Chile

© 2020 DSpace
  • Access my account