Restaurant Bill Reader

Table of Content:

Problem Statement
Modules Used
Solution Approach
Installation
Execution Instructions
License
Version
Author

Problem Statement

In this problem we are given a bunch of resturant bills in pdf format. We have to extract text from the images of bills given in ".pdf" files.

Modules Used

The major python modules used for solving the above mentioned problem are as follows:

Wand
OpenCV
PyTesseract OCR

Solution Approach

Use Wand to convert pdf to image of any resolution (here we have used 700 x 700) and save it in images folder.
Read the generated image using OpenCV
Use PyTesseract to read text from the images and save the data obtained in a json file (here json/img-to-text.json ).

Installation

In this project Python version 3.7.7 is used.

First create a new anaconda environment and then activate the environment:

# Create environmemt.
conda create -n bill-reader python=3.7
# Activate environment.
conda activate bill-reader

Then install the following python packages using pip:

$ pip install wand

$ pip install pytesseract

$ pip install opencv-python

Checking Wand

STEP-1

Open Python terminal by typing the following command in anaconda command prompt: $ python

This will open a python terminal.

STEP-2

from wand.image import Image as wi

If you get error any error proceed to Step-3:

Visit the following link and follow the instructions given for your respective OS.

For Wndows.

Checkboxes that must be ticked while installing are as follows:

And then check again repeat Steps 1 and 2. Hopefully it will solve the import error with wand module.

STEP-3

If there is no error, then wand module is working fine. And we will exit the terminal.

quit()

STEP-4 Now open 01-pdf-to-image.ipynb file and run the cells in your jupyter-notebook.

If you get DelegateError, do the follows:

INSTALL GHOSTSCRIPT

Checking pytesseract

STEP-1

Open Python terminal by typing the following command in anaconda command prompt: $ python

This will open a python terminal.

Step-2:

Visit this link and download the write installer according to your python architecture (32 or 64). Then install it and make a note of the installation location.

Then open '02-image-to-text.ipynb' file and in cell 1 update the path mensioned to your installing location.

Checking OpenCV

STEP-1

Open Python terminal by typing the following command in anaconda command prompt: $ python

This will open a python terminal.

STEP-2

import cv2

If you get error any error proceed to Step-3:

Refer this answer -- https://stackoverflow.com/questions/19876079/cannot-find-module-cv2-when-using-opencv

else refer this: opencv_installation_instructions

It worked for me.

Execution Instructions

First run 01-pdf-to-image.ipynb.

It will take some time to execute completely depending upon your computer hardware.
Now run 02-image-to-text.ipynb.

It will also take some time to execute.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
Dataset		Dataset
json		json
readme-assets		readme-assets
.gitignore		.gitignore
01-pdf-to-image.ipynb		01-pdf-to-image.ipynb
02-image-to-text.ipynb		02-image-to-text.ipynb
LICENSE.md		LICENSE.md
README.md		README.md
opencv_installation_instructions.txt		opencv_installation_instructions.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Restaurant Bill Reader

Table of Content:

Problem Statement

Modules Used

Solution Approach

Installation

Checking Wand

Checking pytesseract

Checking OpenCV

Execution Instructions

License

Version

Author

The author of this project is Deepankar.

About

Releases

Packages

Languages

License

Deepankar-98/Restaurant-Bill-Reader

Folders and files

Latest commit

History

Repository files navigation

Restaurant Bill Reader

Table of Content:

Problem Statement

Modules Used

Solution Approach

Installation

Checking Wand

Checking pytesseract

Checking OpenCV

Execution Instructions

License

Version

Author

The author of this project is Deepankar.

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages