Show simple item record

Authordc.contributor.authorFredriksson, Kimmo 
Authordc.contributor.authorNavarro, Gonzalo es_CL
Admission datedc.date.accessioned2007-05-07T14:18:59Z
Available datedc.date.available2007-05-07T14:18:59Z
Publication datedc.date.issued2004
Cita de ítemdc.identifier.citationCOMBINATORIAL PATTERN MATCHING, PROCEEDINGS LECTURE NOTES IN COMPUTER SCIENCE 3109: 457-471 2004en
Identifierdc.identifier.issn0302-9743
Identifierdc.identifier.urihttps://repositorio.uchile.cl/handle/2250/124546
Abstractdc.description.abstractWe present a new algorithm for multiple approximate string matching. It is based on reading backwards enough l-grams from text windows so as to prove that no occurrence can contain the part of the window read, and then shifting the window. Three variants of the algorithm are presented, which give different tradeoffs between how much they work in the window and how much they shift it. We show analytically that two of our algorithms are optimal on average. Compared to the first average-optimal multipattern approximate string matching algorithm [Fredriksson and Navarro, CPM 2003], the new algorithms are much faster and axe optimal up to difference ratios of 1/2, contrary to the maximum of 1/3 that could be reached in previous work. This is also a contribution to the area of single-pattern approximate string matching, as the only average-optimal algorithm [Chang and Marr, CPM 1994] also reached a difference ratio of 1/3. We show experimentally that our algorithms are very competitive, displacing the long-standing best algorithms for this problem. On real life texts, our algorithms are especially interesting for computational biology applications.en
Lenguagedc.language.isoenen
Publisherdc.publisherSPRINGER-VERLAG BERLINen
Keywordsdc.subjectPARALLELen
Títulodc.titleImproved single and multiple approximate string matchingen
Document typedc.typeArtículo de revista


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record