[Contributhon] updating available languages in "example_text_classification" #166

Open
JoJeongJin opened this issue Aug 12, 2020 · 3 comments
Labels
Contributhon OpenSource Contributhon 2020

Comments

@JoJeongJin

Problem
Currently, the "example_text_classification" example supports only English, so I would like to add code for testing the example with Korean text.

Solution
I will first write example code in Python that classifies Korean text, and then port it to C.
I plan to preprocess the Korean text with the KoNLPy library.

@taos-ci
Collaborator

taos-ci commented Aug 12, 2020

:octocat: cibot: Thank you for posting issue #166. The person in charge will reply soon.

@myungjoo myungjoo added the Contributhon OpenSource Contributhon 2020 label Aug 12, 2020
@dongju-chae
Member

dongju-chae commented Aug 12, 2020

As far as I know, KoNLPy is a preprocessor that tokenizes Korean text. With those tokens, you still have to train the neural network model on a dataset of meaningful Korean texts. So... if no pre-trained model is available, I think this is a non-trivial task (and possibly out of scope for nnstreamer?).
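
To make the point above concrete, here is a minimal sketch of what sits between a tokenizer and a trainable model: tokens have to be mapped to integer ids and padded to a fixed length before any network can consume them. The helper names (`whitespace_tokenize`, `build_vocab`, `encode`), the two sample sentences, and the vocabulary and length limits are all hypothetical and not taken from nnstreamer or the existing example. The tokenizer is a whitespace stand-in so the sketch runs without KoNLPy; a KoNLPy tagger's `morphs` method could presumably be dropped in instead.

```python
from collections import Counter


def whitespace_tokenize(text):
    """Stand-in tokenizer (assumption); real Korean text needs a morphological analyzer."""
    return text.split()


def build_vocab(token_lists, max_size=10000):
    """Map the most frequent tokens to integer ids; 0 is padding, 1 is unknown."""
    counts = Counter(tok for toks in token_lists for tok in toks)
    vocab = {"<PAD>": 0, "<UNK>": 1}
    for tok, _ in counts.most_common(max_size - 2):
        vocab[tok] = len(vocab)
    return vocab


def encode(tokens, vocab, max_len=8):
    """Convert tokens to ids, truncating or zero-padding to max_len."""
    ids = [vocab.get(tok, vocab["<UNK>"]) for tok in tokens][:max_len]
    return ids + [vocab["<PAD>"]] * (max_len - len(ids))


if __name__ == "__main__":
    # Hypothetical two-sentence corpus, for illustration only.
    corpus = ["이 영화 정말 재미있다", "이 영화 너무 지루하다"]

    tokenize = whitespace_tokenize
    # With KoNLPy installed (it requires a JVM), one could try instead:
    #   from konlpy.tag import Okt
    #   tokenize = Okt().morphs

    token_lists = [tokenize(sentence) for sentence in corpus]
    vocab = build_vocab(token_lists)
    for sentence, tokens in zip(corpus, token_lists):
        print(sentence, "->", encode(tokens, vocab))
```

The id sequences are only model input. A classifier still has to be trained on labeled Korean text using this same vocabulary, which is the non-trivial part.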

@JoJeongJin
Author

I'll try to write the code using an existing model if possible. Thank you.


4 participants