YOLOv3-Tesseract Model for Improved Intelligent form Recognition

Zhang Yun-An; Pan Ziheng; Dui Hongyan; Bai Guanghan

doi:10.2174/2666255813666191204141610

ISSN: 2666-2558
E-ISSN: 2666-2566

YOLOv3-Tesseract Model for Improved Intelligent form Recognition
By Zhang Yun-An, Pan Ziheng, Dui Hongyan and Bai Guanghan
Source: Recent Advances in Computer Science and Communications, Volume 14, Issue 6, Aug 2021, p. 1833 - 1842
DOI: https://doi.org/10.2174/2666255813666191204141610
- Available online: 01 Aug 2021

Abstract

Background: YOLOv3-Tesseract is widely used for the intelligent form recognition because it exhibits several attractive properties. It is important to improve the accuracy and efficiency of the optical character recognition. Methods: The YOLOv3 exhibits the classification advantages for the object detection. Tesseract can effectively recognize regular characters in the field of the optical character recognition. In this study, a YOLOv3 and Tesseract-based model of improved intelligent form recognition is proposed. Results: First, YOLOv3 is trained to detect the position of the text in the table and to subsequently segment text blocks. Second, Tesseract is used to individually detect text blocks and combine YOLOv3 and Tesseract to achieve the goal of table character recognition. Conclusion: Based on the Tianchi big data, experimental simulation is used to demonstrate the proposed method. The YOLOv3-Tesseract model is trained and tested to effectively accomplish the recognition task.

Article metrics loading...

/content/journals/rascs/10.2174/2666255813666191204141610

2021-08-01

2026-03-03

From This Site

/content/journals/rascs/10.2174/2666255813666191204141610

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/content/journals/rascs/10.2174/2666255813666191204141610

Article Type: Research Article

Keyword(s): Character recognition; deep learning; form image; Optical Character Recognition (OCR); Python3.5.4; YOLOv3-tesseract

YOLOv3-Tesseract Model for Improved Intelligent form Recognition

Abstract

From This Site

Most Read This Month

Most Cited Most Cited RSS feed

Key Issues in Software Reliability Growth Models

An Ensemble of Bacterial Foraging, Genetic, Ant Colony and Particle Swarm Approach EB-GAP: A Load Balancing Approach in Cloud Computing

Remaining Useful Life Prediction of Lithium-ion Batteries Using Multiple Kernel Extreme Learning Machine

ROUGE-SS: A New ROUGE Variant for the Evaluation of Text Summarization

Extensive Review of Literature on Explainable AI (XAI) in Healthcare Applications

An Analog Circuit Fault Diagnosis Approach Based on Wavelet-based Fractal Analysis and Multiple Kernel SVM

Research on Monitoring System of Daily Statistical Indexes Through Big Data

A Study on E-Learning and Recommendation System

Container Elasticity: Based on Response Time using Docker

Revolutionizing Agriculture: A Comprehensive Review of IoT Farming Technologies