Skip to content
This repository has been archived by the owner on Nov 10, 2023. It is now read-only.

This work is an adaptation of CNN+Transformer architecture to training text recognition models for Yorùbá & Igbo Languages

Notifications You must be signed in to change notification settings

ToluClassics/LowResourceOCR

Repository files navigation

OCR For Low Resource Languages

This work is an adaptation of CNN+Transformer architecture to training text recognition models for Yorùbá & Igbo.

Architecture

alt text

Setup

  • Clone the github repository

    $ git clone https://github.com/ToluClassics/LowResourceOCR.git
    
  • Create a virtual environment and install dependencies

    $ python2 -m venv venv 
    $ (venv) pip install -r requirements.txt
    
  • Download TextRecognition Model from google drive

  • Run Inference:

    alt text

    python3 inference.py --lang yor 
        --image_path samples/yor_sample.jpg 
        --checkpoint_path run/checkpoint_weights_igbo_trdg.pt
    

    Output

    [INFO] Load pretrained model
    [INFO] Predicted text is: Àwọn Ohun Tó Wà
    

Reference::

  • Scene Text Recognition via Transformer, Paper

About

This work is an adaptation of CNN+Transformer architecture to training text recognition models for Yorùbá & Igbo Languages

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •