Skip to content

Latest commit

 

History

History
10 lines (5 loc) · 759 Bytes

README.md

File metadata and controls

10 lines (5 loc) · 759 Bytes

Perseo: Spanish Optical Character Recognition (OCR)

Perseo is a vision transformer based OCR for the Spanish language.

The architecture is based on TrOCR. It is trained on the Spanish Wikipedia dataset, using trdg to generate the images of the sentences. The model's encoder is initialized with the small version of the encoder described in the TrOCR paper, while the decoder in initialized with the RoBERTa Spanish model available in Hugging Face.

Status

Version 0.0 is trained using machine typed characters to evaluate its performance. In future versions handwritten characters will be used.