text recognition and paragraph analysis
- scan input image and recognize the texts in the image
- based on input text pattern to split the image into small images and store the texts in each small image to separate text files
you can see the output files in
https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png0.txt
https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png1.txt
https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png2txt
https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png3.txt
https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png4.txt
https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png5.txt
https://github.com/yaliqin/text_recognition/blob/master/data/output/split_amc8.png6.txt
AioOcr.py is the main file. Change the input file name and split image pattern in the main function of AioOcr.py