Show simple item record

Authordc.contributor.authorMunro, J. Ian 
Authordc.contributor.authorNavarro, Gonzalo 
Authordc.contributor.authorNielsen, Jesper Sindahl 
Authordc.contributor.authorShah, Rahul 
Authordc.contributor.authorThankachan, Sharma V. 
Admission datedc.date.accessioned2019-05-29T13:29:54Z
Available datedc.date.available2019-05-29T13:29:54Z
Publication datedc.date.issued2017
Cita de ítemdc.identifier.citationAlgorithmica, Volumen 78, Issue 2, 2017, Pages 379-393
Identifierdc.identifier.issn14320541
Identifierdc.identifier.issn01784617
Identifierdc.identifier.other10.1007/s00453-016-0167-2
Identifierdc.identifier.urihttps://repositorio.uchile.cl/handle/2250/168876
Abstractdc.description.abstractLetD={T1,T2,...,TD}be a collection ofDstring doc-uments ofncharacters in total, that are drawn from an alphabet setΣ= [σ]. Thetop-kdocument retrieval problemis to preprocessDintoa data structure that, given a query (P[1..p],k), can return thekdocu-ments ofDmost relevant to patternP. The relevance is captured usinga predefined ranking function, which depends on the set of occurrencesofPinTd. For example, it can be the term frequency (i.e., the num-ber of occurrences ofPinTd), or it can be the term proximity (i.e., thedistance between the closest pair of occurrences ofPinTd), or a pattern-independent importance score ofTdsuch as PageRank. Linear space andoptimal query time solutions already exist for this problem. Compressedand compact space solutions are also known, but only for a few rank-ing functions such as term frequency and importance. However, spaceefficient data structures for term proximity based retrieval have beenevasive. In this paper we present the first sub-linear space data structurefor this relevance function, which uses onlyo(n) bits on top of any com-pressed suffix array ofDand solves queries in timeO((p+k) polylogn).
Lenguagedc.language.isoen
Publisherdc.publisherSpringer
Type of licensedc.rightsAttribution-NonCommercial-NoDerivs 3.0 Chile
Link to Licensedc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/cl/
Sourcedc.sourceAlgorithmica
Keywordsdc.subjectCompact data structures
Keywordsdc.subjectCompressed data structures
Keywordsdc.subjectDocument indexing
Keywordsdc.subjectProximity search
Keywordsdc.subjectRanked document retrieval
Keywordsdc.subjectSuccinct data structures
Keywordsdc.subjectTop-k document retrieval
Títulodc.titleTop-k Term-Proximity in Succinct Space
Document typedc.typeArtículo de revista
Catalogueruchile.catalogadorlaj
Indexationuchile.indexArtículo de publicación SCOPUS
uchile.cosechauchile.cosechaSI


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Chile
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile