Show simple item record

Professor Advisordc.contributor.advisorSaavedra Rondo, José
Professor Advisordc.contributor.advisorRíos Pérez, Sebastián
Authordc.contributor.authorÚbeda Soto, Ignacio Andrés 
Associate professordc.contributor.otherHeutte, Laurent
Associate professordc.contributor.otherSauré Valenzuela, Denis
Admission datedc.date.accessioned2020-10-15T02:27:39Z
Available datedc.date.available2020-10-15T02:27:39Z
Publication datedc.date.issued2020
Identifierdc.identifier.urihttps://repositorio.uchile.cl/handle/2250/177136
General notedc.descriptionTesis para optar al grado de Magíster en Gestión de Operacioneses_ES
General notedc.descriptionMemoria para optar al título de Ingeniero Civil Industrial
Abstractdc.description.abstractPattern spotting consists of locating different instances of a given object (i.e. an image query) in a collection of document images. Contrary to object detection, no prior information is given about the patterns that can be searched, hence no training can be done for localization. The queried patterns may vary in shape, size, color, context and even style, which makes pattern spotting a difficult task. To tackle this problem, we propose a convolutional neural network approach based on Feature Pyramid Networks (FPN) as the feature extractor of our system. Using FPN allows us to extract descriptors of local regions of the documents to be indexed and queries, at multiple scales with a single forward pass. Our main hypothesis is that deep features are more discriminatory than classical descriptors for pattern localization. Experiments conducted on DocExplore (a historical document dataset for pattern spotting evaluation) show that the proposed system improves mAP by 73% (from 0.157 to 0.272) in pattern localization compared with state-of-the-art results, even when the feature extractor is not trained with domain-specific data. Memory requirement and computation time are also decreased since the descriptor dimension used for distance computation is reduced by a factor of 16. We conclude that CNN-based local descriptors are better than VLAD (classical) descriptors at locating patterns and we use them to propose a system for pattern localization. Limitation of our approach is that it struggles with non-square patterns. We also propose a solution to address this issue extracting multiple descriptors per query. Although it improves results in retrieving documents, it loses precision in locating patterns. Aggregation on those descriptors is proposed as interesting future work in order to improve the system.es_ES
Lenguagedc.language.isoenes_ES
Publisherdc.publisherUniversidad de Chilees_ES
Type of licensedc.rightsAttribution-NonCommercial-NoDerivs 3.0 Chile*
Link to Licensedc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/cl/*
Keywordsdc.subjectRedes neuronales (Ciencia de la computación)es_ES
Keywordsdc.subjectRecuperación de imágeneses_ES
Keywordsdc.subjectPattern spottinges_ES
Títulodc.titlePattern spotting in historical documents using convolutional modelses_ES
Document typedc.typeTesis
Catalogueruchile.catalogadorgmmes_ES
Departmentuchile.departamentoDepartamento de Ingeniería Industriales_ES
Facultyuchile.facultadFacultad de Ciencias Físicas y Matemáticases_ES
uchile.titulacionuchile.titulacionDoble Titulaciónes_ES


Files in this item

Icon
Icon

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Chile
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile