Show simple item record

Authordc.contributor.authorBaeza Yates, Ricardo 
Authordc.contributor.authorCastillo Ocaranza, Carlos es_CL
Authordc.contributor.authorMarín, Mauricio es_CL
Authordc.contributor.authorRodríguez, Andrea es_CL
Admission datedc.date.accessioned2013-12-20T18:39:49Z
Available datedc.date.available2013-12-20T18:39:49Z
Publication datedc.date.issued2005
Cita de ítemdc.identifier.citationWWW 2005 May 10–14, 2005, Chiba, Japanen_US
Identifierdc.identifier.urihttps://repositorio.uchile.cl/handle/2250/125821
General notedc.descriptionArtículo de publicación ISIen_US
Abstractdc.description.abstractThis article compares several page ordering strategies for Web crawling under several metrics. The objective of these strategies is to download the most \important" pages \early" during the crawl. As the coverage of modern search engines is small compared to the size of the Web, and it is impossi- ble to index all of the Web for both theoretical and practical reasons, it is relevant to index at least the most important pages. We use data from actual Web pages to build Web graphs and execute a crawler simulator on those graphs. As the Web is very dynamic, crawling simulation is the only way to ensure that all the strategies considered are compared un- der the same conditions. We propose several page ordering strategies that are more e cient than breadth- rst search and strategies based on partial Pagerank calculations.en_US
Lenguagedc.language.isoenen_US
Type of licensedc.rightsAttribution-NonCommercial-NoDerivs 3.0 Chile*
Link to Licensedc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/cl/*
Keywordsdc.subjectweb crawleren_US
Títulodc.titleCrawling a Country: Better Strategies than BreadthFirst for Web Page Orderingen_US
Document typedc.typeArtículo de revista


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Chile
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile