Repetition-free longest common subsequence of random sequences
Author
dc.contributor.author
Fernandes, Cristina G.
Author
dc.contributor.author
Kiwi Krauskopf, Marcos
Admission date
dc.date.accessioned
2016-12-13T19:39:39Z
Available date
dc.date.available
2016-12-13T19:39:39Z
Publication date
dc.date.issued
2016
Cita de ítem
dc.identifier.citation
Discrete Applied Mathematics 210 (2016) 75–87
es_ES
Identifier
dc.identifier.other
10.1016/j.dam.2015.07.005
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/141850
Abstract
dc.description.abstract
A repetition-free Longest Common Subsequence (LCS) of two sequences x and y is an LCS of x and y where each symbol may appear at most once. Let R denote the length of a repetition free LCS of two sequences of n symbols each one chosen randomly, uniformly, and independently over a k-ary alphabet. We study the asymptotic, inn and k, behavior of R and establish that there are three distinct regimes, depending on the relative speed of growth of n and k. For each regime we establish the limiting behavior of R. In fact, we do more, since we actually establish tail bounds for large deviations of R from its limiting behavior.
Our study is motivated by the so called exemplar model proposed by Sankoff (1999) and the related similarity measure introduced by Adi et al. (2010). A natural question that arises in this context, which as we show is related to long standing open problems in the area of probabilistic combinatorics, is to understand the asymptotic, in n and k, behavior of parameter R. (C) 2015 Elsevier B.V. All rights reserved