Skip to content

Latest commit

 

History

History
16 lines (11 loc) · 368 Bytes

Readme.md

File metadata and controls

16 lines (11 loc) · 368 Bytes

For PDF image extraction use following command

python pdf_utils.py <input-pdf> <pages-count>

Use lesser pages-count for testing purpose


For applying OCR on the extracted images use following command

python main.py <input-image>

NOTE: only the images extracted by pdf_utils having name *_1.png can be used as input to main.py