Réseau de neurones dynamique perceptif - Application à la reconnaissance de structures logiques de documents. (Dynamic and perceptive neural network applied to document logical structure recognition)
نویسنده
چکیده
Logical structure extraction of documents remains a challenging problem due to their inherent complexityand the gap between the physical features extracted from the image and their corresponding logicalinterpretation. Most of the literature approaches propose model-driven approaches which are not genericenough to handle complex and noisy documents. They do not use intermediate interpretation steps anddo not explain the relationships between the physical blocks and the corresponding logical labels. Themain objective of this thesis is to develop a hybrid method, using both data-driven and model-drivenapproach, which is capable to learn the relationships and simulate human perception during the logicalrecognition task. We have proposed a Dynamic Perceptive Neural Network which can handle drawbacksof previous systems. Four main points have been developed:– a special network topology based on local representation where the knowledge can be integrated.The logical interpretation is unfolded along the layers of the network and a training stage isperformed to find the weights for each link;– perceptive cycles (several bottom-up and top-down processes) perform the recognition. The net-work is able to generate hypothesis, validate them and detect ambiguous patterns. The contextmanages the correction of the input features to improve the recognition rate;– an input feature clustering has been proposed to speed-up the recognition. Subsets of featuresare automatically computed and are given progressively to feed the network in order to adapt theamount of computations according to the pattern complexity;– dynamic integration in the network that make it possible to integrate the data correction infor-mation during the training stage to have more appropriate behavior during the recognition. Theimprovement uses a Time Delay Neural Network architecture to take into account the input datavariations after each perceptive cycle while the recognition step is quite similar to the static one.
منابع مشابه
Un modèle neuro markovien profond pour l'extraction de séquences dans des documents manuscrits
RÉSUMÉ. Dans cet article, nous proposons un système d’extraction de mots clés dans des documents manuscrits. Notre approche est basée sur la reconnaissance des lignes de texte à l’aide d’un modèle HMM capable de rejeter les mots n’appartenant pas à un lexique prédéfini. Afin d’être plus discriminant, nous avons remplacé les mélanges de gaussiennes des HMM par un réseau de neurones profond pour ...
متن کاملRate versus synchrony code for human action recognition
We propose a bio-inspired feedforward spiking network modeling two brain areas dedicated to motion (V1 and MT), and we show how the spiking output can be exploited in a computer vision application: action recognition. In order to analyze spike trains, we consider two characteristics of the neural code: mean firing rate of each neuron and synchrony between neurons. Interestingly, we show that th...
متن کاملFusion des connaissances en analyse de documents - Exemples sur des documents d'archives
RÉSUMÉ. La reconnaissance de collections de documents structurés numérisés et notamment de documents d’archives est difficile non seulement par la complexité de l’organisation des documents, mais aussi par la dégradation des documents (tâches, déchirures, encre traversant le papier, courbures produites à la numérisation. . . ). Afin d’améliorer la qualité de la reconnaissance tout en gérant le ...
متن کاملSegmentation des fichiers logs
Résumé. Avec la méthode de segmentation appelée passages de discours, la reconnaissance des divisions logiques de documents est essentielle. Cela s’avère plus difficile dans les documents ayant des unités logiques différentes de celles trouvées dans les textes classiques comme les paragraphes ou les sections. Ainsi, nous proposons une méthode automatique pour caractériser les unités logiques co...
متن کاملNetwork Coding for Wireless Broadcast: Rate Selection with Dynamic Heuristics
Network coding is a novel method for transmitting data, which has been recently proposed, and has been shown to have potential to improve wireless network performance. In this article, we study using network coding for one specific case of multicast, broadcasting. Precisely, we focus on (energy-)efficient broadcasting in a multi-hop wireless networks: transmitting data from one source to all no...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007