The language of plain SO-tgds: Composition, inversion and structural properties
Artículo
![Thumbnail](/themes/Mirage2/images/cubierta.jpg)
Open/ Download
Publication date
2013Metadata
Show full item record
Cómo citar
Arenas, Marcelo
Cómo citar
The language of plain SO-tgds: Composition, inversion and structural properties
Abstract
The problems of composing and inverting schema mappings specified by source-to-target
tuple-generating dependencies (st-tgds) have attracted a lot of attention, as they are
of fundamental importance for the development of Bernstein’s metadata management
framework. In the case of the composition operator, a natural semantics has been proposed
and the language of second-order tuple generating dependencies (SO-tgds) has been
identified as the right language to express it. In the case of the inverse operator, several
semantics have been proposed, most notably the maximum recovery, the only inverse
notion that guarantees that every mapping specified by st-tgds is invertible. Unfortunately,
less attention has been paid to combining both operators, which is the motivation of this
paper. More precisely, we start our investigation by showing that SO-tgds are not good
for inversion, as there exist mappings specified by SO-tgds that are not invertible under
any of the notions of inversion proposed in the literature. To overcome this limitation, we
borrow the notion of CQ-composition, which is a relaxation obtained by parameterizing
the composition of mappings by the class of conjunctive queries (CQ), and we propose
a restriction over the class of SO-tgds that gives rise to the language of plain SO-tgds.
Then we show that plain SO-tgds are the right language to express the CQ-composition
of mappings given by st-tgds, in the same sense that SO-tgds are the right language
to express the composition of st-tgds, and we prove that every mapping specified by a
plain SO-tgd admits a maximum recovery, thus showing that plain SO-tgds have a good
behavior w.r.t. inversion. Moreover, we show that the language of plain SO-tgds shares
some fundamental structural properties with the language of st-tgds, but being much more
expressive, and we provide a polynomial-time algorithm to compute maximum recoveries
for mappings specified by plain SO-tgds (which can also be used to compute maximum
recoveries for mappings given by st-tgds). All these results suggest that the language
of plain SO-tgds is a good alternative to be implemented in data exchange and data
integration applications.
General note
Artículo de publicación ISI
Identifier
URI: https://repositorio.uchile.cl/handle/2250/126321
DOI: DOI: 10.1016/j.jcss.2013.01.002
Quote Item
Journal of Computer and System Sciences 79 (2013) 763–784
Collections