Show simple item record

Authordc.contributor.authorArroyuelo, Diego 
Authordc.contributor.authorClaude, Francisco 
Authordc.contributor.authorManeth, Sebastian 
Authordc.contributor.authorMäkinen, Veli 
Authordc.contributor.authorNavarro, Gonzalo 
Authordc.contributor.authorNguyen, Kim 
Authordc.contributor.authorSirén, Jouni 
Authordc.contributor.authorVälimäki, Niko 
Admission datedc.date.accessioned2015-08-25T02:21:52Z
Available datedc.date.available2015-08-25T02:21:52Z
Publication datedc.date.issued2015
Cita de ítemdc.identifier.citationSoftware-Practice and Experience. Volumen: 45 Número: 3 Páginas: 399-434en_US
Identifierdc.identifier.otherDOI: 10.1002/spe.2227
Identifierdc.identifier.urihttps://repositorio.uchile.cl/handle/2250/133086
General notedc.descriptionArtículo de publicación ISIen_US
Abstractdc.description.abstractExtensible Markup Language (XML) documents consist of text data plus structured data (markup). XPath allows to query both text and structure. Evaluating such hybrid queries is challenging. We present a system for in-memory evaluation of XPath search queries, that is, queries with text and structure predicates, yet without advanced features such as backward axes, arithmetics, and joins. We show that for this query fragment, which contains Forward Core XPath, our system, dubbed Succinct XML Self-Index (‘SXSI’), outperforms existing systems by 1–3 orders of magnitude. SXSI is based on state-of-the-art indexes for text and structure data. It combines two novelties. On one hand, it represents the XML data in a compact indexed form, which allows it to handle larger collections in main memory while supporting powerful search and navigation operations over the text and the structure. On the other hand, it features an execution engine that uses tree automata and cleverly chooses evaluation orders that leverage the speeds of the respective indexes. SXSI is modular and allows seamless replacement of its indexes. This is demonstrated through experiments with (1) a text index specialized for search of bio sequences, and (2) a word-based text index specialized for natural language search.en_US
Patrocinadordc.description.sponsorshipFondecyt, Chile 1-110066en_US
Lenguagedc.language.isoenen_US
Publisherdc.publisherWiley-Blackwellen_US
Type of licensedc.rightsAtribución-NoComercial-SinDerivadas 3.0 Chile*
Link to Licensedc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/cl/*
Keywordsdc.subjectXMLen_US
Keywordsdc.subjectSuccinct data structuresen_US
Keywordsdc.subjectXPathen_US
Keywordsdc.subjectTree automataen_US
Títulodc.titleFast in-memory XPath search using compressed indexesen_US
Document typedc.typeArtículo de revista


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record

Atribución-NoComercial-SinDerivadas 3.0 Chile
Except where otherwise noted, this item's license is described as Atribución-NoComercial-SinDerivadas 3.0 Chile