An OCR project as a Proof of Concept to deploy a machine learning project using Streamlit. The text extraction process is accompanied with image processing for better text recognition and achieving higher efficiency.
-
- Streamlit - UI
- Tkinter - File download setup
- Pytesseract
- Pdf2Image
- OpenCV - Image Processing
-
- Make sure you have Python 3.8 installed on the system.
- Open cmd where the contents are unzipped.
- Run the below command to install required packages.
pip install -r requirements.txt
- Install Tesseract setup file
- Once all packages are installed, run the following command:
streamlit run OCR.py
If facing any issues, contact me
Bounding Box around recognized text