yaliqin / text_recognition Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

text recognition and paragraph analysis

0 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
AioOcr.py		AioOcr.py
README.md		README.md
generate_key.py		generate_key.py
split_image.py		split_image.py

Repository files navigation

text_recognition

text recognition and paragraph analysis

what the project does:

scan input image and recognize the texts in the image
based on input text pattern to split the image into small images and store the texts in each small image to separate text files

input and output examples:

input image

output image

output text files

you can see the output files in
https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png0.txt https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png1.txt https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png2txt https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png3.txt https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png4.txt https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png5.txt https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png6.txt

how to use the project

AioOcr.py is the main file. Change the input file name and split image pattern in the main function of AioOcr.py

About

text recognition and paragraph analysis

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%