thesis

Détection de patterns dans les documents anciens

Defense date:

Jan. 1, 2016

Edit

Institution:

Rouen

Disciplines:

Authors:

Abstract EN:

This thesis addresses the problem of retrieving and spotting patterns in historical document images. In particular, we are interested in searching for small graphical objects (20#20 pixels) in degraded, noisy and unconstrained layout document images. In addition, the hand-drawn patterns in historical document images makes our problem become even more challenging due to intra-class variabilities. Searching for generic graphical patterns in unconstrained layout document images requires an exhaustive matching at every possible size and location. This exhaustive search is not only computationally expensive but produces also usually a lot of false alarms. To overcome this problem, we propose an efficient indexing strategy based on a background removal component followed by region proposal to estimate if a given region contains or not an object. This enables us to reduce the number of sub-windows by 7 times while maintaining a high level of recall. Then, based on an exhaustive experimentation between recent feature extraction techniques, VLAD is chosen as our image representation. Finally, by observing that the conventional distance measures (e. G. Cosine) can not cope well with image variabilities, we propose an adaptive distance function learned on the fly at almost no cost and without the need of labeled data. Our historical document image retrieval system is then extended by integrating a localization component, thus turning the retrieval system into a pattern spotting system, enhancing its capability to locate more precisely objects of interest. While our system produces some meaningful results, we go beyond by coping with scalability issues. We show that we can efficiently retrieve/spot an object in less than a second in up to millions of subwindows. We also developed two other systems from these ideas. First, we show on various public datasets that our adaptive distance is more powerful than the conventional distance functions for natural scene image retrieval. Second, we show that our system can be turned into a word spotting system with only few complimentary components, thus demonstrating the robustness of the proposed method. All our experiments have been carried out on a new dataset called "DocExplore". Experiments show the robustness of our systems to retrieve/spot graphical objects in noisy and degraded document images. Finally, our last contribution is the release of our DocExplore dataset for public along with the experimental protocol and evaluation metrics to further encourage other researchers to continue tackle these problems.

Abstract FR:

Pas de résumé disponible.