Remove current Scribe-Data translations process #292

Closed
2 tasks done
andrewtavis opened this issue Oct 9, 2024 · 7 comments
Assignees
Labels

  • good first issue: Good for newcomers
  • hacktoberfest: Included as a part of Hacktoberfest
  • help wanted: Extra attention is needed
  • refactor: Refactor code to improve quality

Comments

@andrewtavis
Member

Terms

Issue

This issue would go through the project and remove the current machine learning based translation process from Scribe-Data. We'd remove the following dependencies:

  • torch
  • pyarrow
  • sentencepiece
  • tabulate
  • transformers

All references to these dependencies should be removed. We'd also remove all translations directories from src/scribe_data/language_data_extraction. We'll know this is complete when translations are no longer listed as a data type option for the CLI 😊
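As a starting point for finding those references, here is a minimal sketch that scans Python files for imports of the five packages listed above. The function names `find_target_imports` and `scan_directory` and the `src` root are assumptions made for illustration, not part of Scribe-Data itself.

```python
import ast
from pathlib import Path

# Packages named in the issue description.
TARGET_PACKAGES = {"torch", "pyarrow", "sentencepiece", "tabulate", "transformers"}


def find_target_imports(source: str, targets=TARGET_PACKAGES) -> set[str]:
    """Return the target packages imported anywhere in the given Python source."""
    found = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            names = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            # Absolute "from package import name" statements only.
            names = [node.module]
        else:
            continue
        for name in names:
            top_level = name.split(".")[0]
            if top_level in targets:
                found.add(top_level)
    return found


def scan_directory(root: str) -> dict[str, set[str]]:
    """Map each .py file under root to the target packages it imports."""
    results = {}
    for path in Path(root).rglob("*.py"):
        try:
            hits = find_target_imports(path.read_text(encoding="utf-8"))
        except (SyntaxError, UnicodeDecodeError, ValueError):
            continue  # skip files that cannot be read or parsed
        if hits:
            results[str(path)] = hits
    return results


if __name__ == "__main__":
    # "src" is an assumed root directory; adjust to the local checkout.
    for file_path, packages in sorted(scan_directory("src").items()):
        print(file_path, sorted(packages))
```

This only catches import statements in Python files. Mentions in requirements files, packaging configuration, or documentation would still need a plain text search.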

@andrewtavis andrewtavis added help wanted Extra attention is needed good first issue Good for newcomers refactor Refactor code to improve quality hacktoberfest Included as a part of Hacktoberfest labels Oct 9, 2024
@swarupn17

Hey @andrewtavis, I can do this task. Could you please assign it to me?
Thanks!

@andrewtavis
Member Author

@KesharwaniArpita had expressed interest in this issue, @swarupn17 :) Let's wait for her to confirm whether she'd like to work on this, and in the meantime you can check many of the issues that have a list of data types at the top that you can write Wikidata queries for. If the data type hasn't been checked and no one has asked to work on it, then feel free to ask to work on it 😊

What I mean is the Expand LANGUAGE data queries issues :)

@KesharwaniArpita
Contributor

Hi @andrew, can you assign me this task?

@andrewtavis andrewtavis moved this from Todo to In Progress in Scribe Board Oct 9, 2024
@andrewtavis
Member Author

Sure thing, @KesharwaniArpita! Let us know if you need any support :)

@swarupn17, feel free to check the Expand LANGUAGE data queries issues for data types that are needed. These are the unchecked boxes in the issue description. Write in the issue and we can assign it to you as well 😊

@KesharwaniArpita
Contributor

Hi @andrewtavis, I went through the code looking for occurrences of the dependencies listed above. It appears that only torch and transformers are actually used. Next I'll remove these dependencies and check that nothing else breaks.

@andrewtavis
Member Author

Sounds good, @KesharwaniArpita!

@andrewtavis
Member Author

Closed by #309 :) Thanks for your support here, @KesharwaniArpita!

@github-project-automation github-project-automation bot moved this from In Progress to Done in Scribe Board Oct 12, 2024
andrewtavis added a commit that referenced this issue Oct 12, 2024
3 participants