Improving pattern spotting in historical documents using feature pyramid networks
Author
dc.contributor.author
Úbeda, Ignacio
Author
dc.contributor.author
Saavedra, José M.
Author
dc.contributor.author
Nicolas, Stéphane
Author
dc.contributor.author
Petitjean, Caroline
Author
dc.contributor.author
Heutte, Laurent
Accession date
dc.date.accessioned
2020-04-29T15:07:05Z
Available date
dc.date.available
2020-04-29T15:07:05Z
Publication date
dc.date.issued
2020
Item citation
dc.identifier.citation
Pattern Recognition Letters 131: 398-404
es_ES
Identifier
dc.identifier.other
10.1016/j.patrec.2020.02.002
Identifier
dc.identifier.uri
https://repositorio.uchile.cl/handle/2250/174224
Abstract
dc.description.abstract
Pattern spotting consists of locating different instances of a given object (i.e., an image query) in a collection of historical document images. Because these patterns are hand-drawn, they may vary in shape, size, color, context, and even style, which makes pattern spotting a difficult task. To tackle this problem, we propose a Convolutional Neural Network (CNN) approach that uses a Feature Pyramid Network (FPN) as the feature extractor of our system. Using an FPN allows us to extract descriptors of local regions, both of the documents to be indexed and of the queries, at multiple scales with a single forward pass. Experiments conducted on the DocExplore dataset show that the proposed system improves mAP in pattern localization by 73% in relative terms (from 0.157 to 0.272) compared with state-of-the-art results, even when the feature extractor is not trained with domain-specific data. Memory requirements and computation time also decrease, since the dimension of the descriptors used for distance computation is reduced by a factor of 16.
es_ES
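The two quantitative claims in the abstract can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not the authors' code: the function names, the baseline descriptor dimension of 4096, the count of one million indexed regions, and the 4-byte float storage are all assumptions made for the example. Only the mAP values (0.157 and 0.272) and the reduction factor of 16 come from the abstract.

```python
def relative_improvement(baseline: float, new: float) -> float:
    """Relative gain of `new` over `baseline`, as a fraction."""
    return (new - baseline) / baseline


def index_size_bytes(num_descriptors: int, dim: int, bytes_per_value: int = 4) -> int:
    """Memory needed to store `num_descriptors` dense descriptors of size `dim`."""
    return num_descriptors * dim * bytes_per_value


# mAP values reported in the abstract.
gain = relative_improvement(0.157, 0.272)
print(f"relative mAP gain: {gain:.1%}")

# Hypothetical figures, chosen only to illustrate a 16x reduction.
baseline_dim = 4096               # assumed baseline descriptor dimension
reduced_dim = baseline_dim // 16  # dimension after the 16x reduction
num_regions = 1_000_000           # assumed number of indexed local regions

before = index_size_bytes(num_regions, baseline_dim)
after = index_size_bytes(num_regions, reduced_dim)
print(f"index size: {before / 1e9:.2f} GB -> {after / 1e9:.2f} GB")
```

For dense descriptors compared with a brute-force distance such as Euclidean or cosine, both storage and per-comparison cost grow linearly with the descriptor dimension, so a 16-fold smaller descriptor gives roughly a 16-fold smaller index under these assumptions.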
Sponsor
dc.description.sponsorship
Comision Nacional de Investigacion Cientifica y Tecnologica (CONICYT): PFCHA/MAGISTER NACIONAL/2018 -22180111, STIC-Amsud 19-STIC-04
European Union (EU)
European Union (EU): HN0005604
Normandy Region