XXS: Efficient XPath Evaluation on Compressed XML Documents
Author
dc.contributor.author
Brisaboa, Nieves R.
Author
dc.contributor.author
Cerdeira Pena, Ana
es_CL
Author
dc.contributor.author
Navarro, Gonzalo
es_CL
Admission date
dc.date.accessioned
2015-01-08T19:20:15Z
Available date
dc.date.available
2015-01-08T19:20:15Z
Publication date
dc.date.issued
2014-06
Cita de ítem
dc.identifier.citation
ACM Transactions on Information Systems, Vol. 32, No. 3, Article 13,
en_US
Identifier
dc.identifier.other
DOI: http://dx.doi.org/10.1145/2629554
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/127016
General note
dc.description
Artículo de publicación ISI
en_US
Abstract
dc.description.abstract
The eXtensible Markup Language (XML) is acknowledged as the de facto standard for semistructured data
representation and data exchange on the Web and many other scenarios. A well-known shortcoming of XML
is its verbosity, which increases manipulation, transmission, and processing costs. Various structure-blind
and structure-conscious compression techniques can be applied to XML, and some are even access-friendly,
meaning that the documents can be efficiently accessed in compressed form. Direct access is necessary to
implement the query languages XPath and XQuery, which are the standard ones to exploit the expressiveness
of XML. While a good deal of theoretical and practical proposals exist to solve XPath/XQuery operations on
XML, only a few ones are well integrated with a compression format that supports the required access
operations on the XML data. In this work we go one step further and design a compression format for XML
collections that boosts the performance of XPath queries on the data. This is done by designing compressed
representations of the XML data that support some complex operations apart from just accessing the data,
and those are exploited to solve key components of the XPath queries. Our system, called XXS, is aimed
at XML collections containing natural language text, which are compressed to within 35%–50% of their
original size while supporting a large subset of XPath operations in time competitive with, and many times
outperforming, the best state-of-the-art systems that work on uncompressed representations.
en_US
Patrocinador
dc.description.sponsorship
Funded in part by MICINN grants (PGE and FEDER) TIN2009-14560-C03-02- and TIN2010-21246-C02-01,
Xunta de Galicia grants (co-funded with FEDER) GRC2013/053 and CN 2012/211, and MINECO grants
(co-funded with CDTI and GAIN) CDTI EXP 00064563 and ITC-20133062 (for the Spanish group); and by
Fondecyt grants 1-080019 and 1-110066, Chile (G.N.).