GitHub - cuongngm/text-in-image: learn to build OCR and image understanding system

Quickstart

pip install torch==1.7.0+cu101 torchvision==0.8.1+cu101 -f https://download.pytorch.org/whl/torch_stable.html
pip install --upgrade ultocr  # install our project with package

# for inference phase
from ultocr.inference import OCR
from PIL import Image
model = OCR(det_model='DB', reg_model='MASTER')
image = Image.open('..')  # ..is the path of image
result = model.get_result(image)

Or view in google colab demo

Install

git clone https://github.com/cuongngm/text-in-image
pip install torch==1.7.0+cu101 torchvision==0.8.1+cu101 -f https://download.pytorch.org/whl/torch_stable.html
pip install -r requirements.txt
bash scripts/download_weights.sh

Prepare data

Pretrained model

Model	size(MB)
DB	140
MASTER	261

Train

Custom params in each config file of config folder then:

Single gpu training:

python train.py --config config/db_resnet50.yaml --use_dist False
# tracking with mlflow
mlflow run text-in-image -P config=config/db_resnet50.yaml -P use_dist=False -P device=1

Multi gpu training:

# assume we have 2 gpu
python -m torch.distributed.launch --nnodes=1 --node_rank=0 --nproc_per_node=2 --master_addr=127.0.0.1 --master_post=5555 train.py --config config/db_resnet50.yaml

Serve and Inference

python run.py

Then, open your browser at http://127.0.0.1:8000/docs. Request url of the image, the result is as follows:

Todo

Multi gpu training
Tracking experiments with Mlflow
Model serving with FastAPI
Add more text detection and recognition model

Name		Name	Last commit message	Last commit date
Latest commit History 130 Commits
assets		assets
config		config
scripts		scripts
ultocr		ultocr
.gitignore		.gitignore
Dockerfile		Dockerfile
MLproject		MLproject
README.md		README.md
conda.yaml		conda.yaml
requirements.txt		requirements.txt
run.py		run.py
serve.py		serve.py
setup.py		setup.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Quickstart

Install

Prepare data

Pretrained model

Train

Serve and Inference

Todo

Reference

About

Releases 2

Packages

Languages

cuongngm/text-in-image

Folders and files

Latest commit

History

Repository files navigation

Quickstart

Install

Prepare data

Pretrained model

Train

Serve and Inference

Todo

Reference

About

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages