Update languages metadata file and use of it throughout project #293
Comments
I'm interested in this issue. 😃 |
This would be a really good one for you, @OmarAI2003 😊 Let us know if you have any questions! |
Replacing the dependency on |
In talking about this a bit, @OmarAI2003, we might not be able to do this. @SethiShreya and I were talking and as you said we need the QIDs as well so that we can do calls for the CLI based on QIDs as well. Without a central store of languages and their QIDs, maybe it can't work? |
Maybe we could use the directory structure just for language names, but still keep |
Is an interesting idea, but then say that we rely on the structure and then we don't get a QID added and then some functionality is broken 🤔 |
So will this issue be closed, or is there anything that needs to be addressed? |
I'm thinking that for this one we can convert the functionality of the languages metadata file? I don't think we need the header key for it or the "languages" key where all the languages are? You can remove the header and put all the language objects at the top level. You can also remove all of the keys that aren't the language name, iso and qid? Then from there we need to rework the references to this metadata file throughout the project and fix the tests 😇 How does this sound, @OmarAI2003? :) |
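For illustration, the flattening proposed above could look roughly like the sketch below. The input shape (a header key plus a "languages" list whose entries carry "language", "iso" and "qid" alongside other keys) and the helper name flatten_language_metadata are assumptions made for this example, not the project's actual file layout or code.

```python
import json


def flatten_language_metadata(metadata: dict) -> dict:
    """Move language entries to the top level, keeping only iso and qid.

    Hypothetical helper: assumes each entry under "languages" has
    "language", "iso" and "qid" keys.
    """
    return {
        entry["language"]: {"iso": entry["iso"], "qid": entry["qid"]}
        for entry in metadata["languages"]
    }


# Illustrative input only; the real file's keys may differ.
old_metadata = {
    "description": "example header",
    "languages": [
        {"language": "english", "iso": "en", "qid": "Q1860", "ignore-words": []},
        {"language": "french", "iso": "fr", "qid": "Q150", "ignore-words": []},
    ],
}

print(json.dumps(flatten_language_metadata(old_metadata), indent=2))
```

Keying the top level by language name is one possible choice; it keeps lookups by name direct while still leaving the ISO code and QID available for QID-based CLI calls.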
Sounds nice @andrewtavis, but I will need to engage in several discussions here and there along the way to make sure I'm on the same page. |
Sure thing, @OmarAI2003! Just start with getting the file down to just objects with languages, ISO-2s and QIDs at the base level, and then we can discuss from there. Happy to help as needed! |
hi @andrewtavis |
@OmarAI2003, can you send along a snippet of the current version of the file so we can all take a look? :) |
This is the current version of the JSON file. I'm telling you not to worry about the sub-languages file path because there will be a |
Thank you. Sounds great 😃 |
Closed by #402 :) Thanks for the great work @OmarAI2003 and for the great conversation all! |
You're welcome! It was a great experience working on this, and I appreciate all the valuable feedback and discussions. |
Description
As of now the Scribe-Data CLI options are determined based on the language_metadata.json file. To make maintenance of the package easier, it would be great if the options of the CLI were instead determined by the directory structure of src/scribe_data/language_data_extraction so that the code doesn't need to be updated each time new queries are being added in.
Of key importance is also that the options of the CLI would allow for dialects as well, so for Norwegian we'd like to see Norwegian - Bokmål and Norwegian - Nynorsk, for example. How this will be achieved is open for discussion!
Contribution
Happy to discuss how best to read in dialect sub directories and review the changes here when the PR is up!
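As a starting point for that discussion, here is a minimal sketch of deriving CLI options from a directory tree. The function name list_cli_languages is hypothetical, and so is the layout rule it relies on: a sub-directory counts as a dialect only if it itself contains directories, while a data type directory (nouns, verbs and so on) is assumed to contain only files.

```python
from pathlib import Path


def list_cli_languages(root: Path) -> list[str]:
    """Derive CLI language options from a directory tree (hypothetical helper)."""
    options = []
    for lang_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        # Assumption: a child that itself contains directories is a dialect,
        # while a data type directory (e.g. nouns) contains only files.
        dialects = sorted(
            child.name
            for child in lang_dir.iterdir()
            if child.is_dir() and any(g.is_dir() for g in child.iterdir())
        )
        if dialects:
            # Emit one option per dialect, e.g. "Norwegian - Nynorsk".
            options.extend(f"{lang_dir.name} - {d}" for d in dialects)
        else:
            options.append(lang_dir.name)
    return options
```

It could be called as list_cli_languages(Path("src/scribe_data/language_data_extraction")). Note that this only recovers names: as raised in the comments, QIDs would still need a central store, so a scan like this would complement a metadata file rather than replace it.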