Author (dc.contributor.author): Moffat, Alistair [es_CL]
Author (dc.contributor.author): Webber, William [es_CL]
Author (dc.contributor.author): Zobel, Justin [es_CL]
Author (dc.contributor.author): Baeza Yates, Ricardo [es_CL]
Accession date (dc.date.accessioned): 2008-05-14T14:08:14Z
Available date (dc.date.available): 2008-05-14T14:08:14Z
Publication date (dc.date.issued): 2007 [es_CL]
Item citation (dc.identifier.citation): Information Retrieval, Vol. 10, No. 3, Jun 2007, pp. 205-231 [es_CL]
Identifier (dc.identifier.uri): https://repositorio.uchile.cl/handle/2250/124708
General note (dc.description): ISI publication [es_CL]
Abstract (dc.description.abstract): Two principal query-evaluation methodologies have been described for cluster-based implementation of distributed information retrieval systems: document partitioning and term partitioning. In a document-partitioned system, each of the processors hosts a subset of the documents in the collection, and executes every query against its local sub-collection. In a term-partitioned system, each of the processors hosts a subset of the inverted lists that make up the index of the collection, and serves them to a central machine as they are required for query evaluation. In this paper we introduce a pipelined query-evaluation methodology, based on a term-partitioned index, in which partially evaluated queries are passed amongst the set of processors that host the query terms. This arrangement retains the disk read benefits of term partitioning, but more effectively shares the computational load. We compare the three methodologies experimentally, and show that term distribution is inefficient and scales poorly. The new pipelined approach offers efficient memory utilization and efficient use of disk accesses, but suffers from problems with load balancing between nodes. Until these problems are resolved, document partitioning remains the preferred method. [es_CL]
Language (dc.language.iso): en [es_CL]
Keywords (dc.subject): distributed retrieval [es_CL]
Subject area (dc.subject.other): Computer Science, Information Systems [es_CL]
Title (dc.title): A pipelined architecture for distributed text query evaluation [es_CL]
Document type (dc.type): Journal article
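
Illustrative note on the abstract: the three evaluation strategies it describes can be sketched on a toy collection. The code below is a minimal sketch and not the authors' implementation. The documents, the number of nodes, the term-to-node placement rule (term_node), and the scoring (a plain sum of term frequencies) are all assumptions made for illustration. It only shows where the work happens in each scheme, and that all three produce the same scores.

```python
from collections import Counter, defaultdict

# Toy collection and cluster size (hypothetical, for illustration only).
DOCS = {
    0: "distributed text retrieval",
    1: "pipelined query evaluation",
    2: "distributed query evaluation on a cluster",
    3: "text index partitioning",
}
NUM_NODES = 2


def build_index(docs):
    """Build an inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, tf in Counter(text.split()).items():
            index[term][doc_id] = tf
    return index


def add_postings(accumulators, postings):
    """Add one inverted list's contributions to the score accumulators."""
    for doc_id, tf in postings.items():
        accumulators[doc_id] = accumulators.get(doc_id, 0) + tf
    return accumulators


def term_node(term):
    """Assign a term to a node (assumed, deterministic placement rule)."""
    return sum(map(ord, term)) % NUM_NODES


def term_partitions():
    """Split the global index so each node hosts a subset of inverted lists."""
    index = build_index(DOCS)
    return [
        {t: p for t, p in index.items() if term_node(t) == node}
        for node in range(NUM_NODES)
    ]


def document_partitioned(query):
    """Each node indexes its own documents and runs the whole query locally."""
    merged = {}
    for node in range(NUM_NODES):
        local_docs = {d: t for d, t in DOCS.items() if d % NUM_NODES == node}
        local_index = build_index(local_docs)
        local_scores = {}
        for term in query:
            add_postings(local_scores, local_index.get(term, {}))
        merged.update(local_scores)  # document sets are disjoint across nodes
    return merged


def term_partitioned(query):
    """Nodes only serve inverted lists; a central machine does all scoring."""
    nodes = term_partitions()
    scores = {}
    for term in query:
        postings = nodes[term_node(term)].get(term, {})  # list sent to centre
        add_postings(scores, postings)
    return scores


def pipelined(query):
    """The partially evaluated query visits each node hosting a query term."""
    nodes = term_partitions()
    route = sorted({term_node(t) for t in query})
    scores = {}  # accumulators travel with the query from node to node
    for node in route:
        for term in query:
            if term_node(term) == node:
                add_postings(scores, nodes[node].get(term, {}))
    return scores


if __name__ == "__main__":
    q = ["distributed", "query"]
    print(document_partitioned(q), term_partitioned(q), pipelined(q))
```

In this sketch the only difference between the term-partitioned and pipelined variants is where add_postings runs: on the central machine in the first case, and on the node that hosts each term in the second, which is the load-sharing idea the abstract describes.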

