Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data for Brazilian Portuguese #91

Open
reletreby opened this issue Apr 3, 2023 · 2 comments
Open

Data for Brazilian Portuguese #91

reletreby opened this issue Apr 3, 2023 · 2 comments

Comments

@reletreby
Copy link

Where can I find the data used to train:
https://huggingface.co/Helsinki-NLP/opus-mt-tc-big-en-pt
?

When I use the local make data and specify pob to be a target language, it doesn't do anything. In particular, this location has nothing about pob
https://object.pouta.csc.fi/Tatoeba-Challenge-v2021-08-07/

I would like to know how the data for this particular model looks like as I would like to fine-tune it.

@reletreby
Copy link
Author

@jorgtied would really appreciate your help!

@jorgtied
Copy link
Member

There is Brazilian Portuguese in the eng-por package. You need to look at the language label file in the package to see which instance is Brazilian Portuguese.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants