Main Navigation
Main Content
Sidebar

Russian Digital Libraries Journal

Home
About
Current
Archives
Register
Login
Search

Published since 1998

ISSN 1562-5419

16+

Language

Русский
English

Search

Search articles for

Advanced filters

Published After

Published Before

By Author

Search Results

Data Extraction from Similarly Structured Scanned Documents

Rustem Damirovich Saitgareev, Bulat Rifatovich Giniyatullin, Vladislav Yurievich Toporov, Artur Aleksandrovich Atnagulov, Farid Radikovich Aglyamov

667-688

Abstract:

Currently, the major part of transmitted and stored data is unstructured, and the amount of unstructured data is growing rapidly each year, although it is hardly searchable, unqueryable, and its processing is not automated. At the same time, there is a growth of electronic document management systems. This paper proposes a solution for extracting data from paper documents considering their structure and layout based on document photos. By examining different approaches, including neural networks and plain algorithmic methods, we present their results and discuss them.

Keywords: neural networks, document structure.

1 - 1 of 1 items

Information