Show simple item record

Authordc.contributor.authorFerrada, Héctor 
Authordc.contributor.authorNavarro, Gonzalo 
Admission datedc.date.accessioned2019-10-15T12:23:50Z
Available datedc.date.available2019-10-15T12:23:50Z
Publication datedc.date.issued2019
Cita de ítemdc.identifier.citationInformation and Computation, Volumen 265,
Identifierdc.identifier.issn10902651
Identifierdc.identifier.issn08905401
Identifierdc.identifier.other10.1016/j.ic.2019.01.006
Identifierdc.identifier.urihttps://repositorio.uchile.cl/handle/2250/171625
Abstractdc.description.abstractDocument retrieval structures index a collection of string documents, to retrieve those that are relevant to query strings p: document listing retrieves all documents where p appears; top-k retrieval retrieves the k most relevant of those. Classical structures use too much space in practice. Most current research uses compressed suffix arrays, but fast indices still use 17–21 bpc (bits per character), whereas small ones take milliseconds per returned answer. We present the first document retrieval structures based on Lempel–Ziv compression, precisely LZ78. Our structures use 7–10 bpc and dominate a large part of the space/time tradeoffs. They also enable more efficient partial or approximate answers: our document listing outputs the first 75%–80% of the answers at a rate of one per microsecond; for top-k retrieval we return a result of 90% quality at the same rate and using just 4–6 bpc. This outperforms current indices by a wide margin.
Lenguagedc.language.isoen
Publisherdc.publisherElsevier Inc.
Type of licensedc.rightsAttribution-NonCommercial-NoDerivs 3.0 Chile
Link to Licensedc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/cl/
Sourcedc.sourceInformation and Computation
Keywordsdc.subjectCompressed data structures
Keywordsdc.subjectDocument listing
Keywordsdc.subjectDocument retrieval
Keywordsdc.subjectString databases
Keywordsdc.subjectTop-k queries
Títulodc.titleLempel–Ziv compressed structures for document retrieval
Document typedc.typeArtículo de revista
Catalogueruchile.catalogadorSCOPUS
Indexationuchile.indexArtículo de publicación SCOPUS
uchile.cosechauchile.cosechaSI


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Chile
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile