Authors:
Gustavo Cunha Lacerda
and
Raimundo C. S. Vasconcelos
Affiliation:
Instituto Federal de Brasília, Taguatinga - DF, Brazil
Keyword(s):
OCR, Text Recognition, Historical Manuscripts, Neural Networks.
Abstract:
The creation of writing has facilitated the humanity’s accumulation and sharing of knowledge, being a vital part of what differentiates humans from other animals and has a high importance for the culture of all peoples. Thus, the first human records (manuscripts), historical documents of organizations and families, began to have new perspectives with the digital age accumulation. These handwritten records remained the primary source for the history of countries, including Brazil before the period of independence, until the Gutenberg movable type printing press dominated the archival world. Thus, over the decades, these handwritten documents, due to their fragility, became difficult to access and manipulate. This has changed, with the possibility of digitization and, consequently, its distribution over the internet. Therefore, this work shows a solution for transcribing historical texts written in Portuguese, bringing accessibility, searchability, sharing and preservation to these rec
ords, which achieved a result of 97% of letters recognized in the database used.
(More)