Perseo: Spanish Optical Character Recognition (OCR)

Perseo is a vision transformer based OCR for the Spanish language.

The architecture is based on TrOCR. It is trained on the Spanish Wikipedia dataset, using trdg to generate the images of the sentences. The model's encoder is initialized with the small version of the encoder described in the TrOCR paper, while the decoder in initialized with the RoBERTa Spanish model available in Hugging Face.

Status

Version 0.0 is trained using machine typed characters to evaluate its performance. In future versions handwritten characters will be used.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Perseo: Spanish Optical Character Recognition (OCR)

Status

Files

README.md

Latest commit

History

README.md

File metadata and controls

Perseo: Spanish Optical Character Recognition (OCR)

Status