-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[bugfix][WIP] Fix the bug of Chinese tokenization in 1.10.12 #6755
[bugfix][WIP] Fix the bug of Chinese tokenization in 1.10.12 #6755
Conversation
Thanks for submitting a pull request 🚀 @rctatman will take a look at it as soon as possible ✨ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for tackling this issue!
I am not quite sure I fully understand the problem. Added some questions in the comments.
Also we would need to have a changelog entry and some tests. Thanks.
…into bugfix/chinese_tokenization_in_rasa_v1
Looks good so far! Can you please add a changelog entry? Thanks. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great! 👍 Thanks for tackling all my comments.
Can you please merge the latest version of 1.10.x
into this branch as the convert test fixes are merged there? Thanks.
…hinese_tokenization_in_rasa_v1
Proposed changes:
[UNK]
as 1, to fix the OOV issuefix #6754
Status (please check what you already did):
black
(please check Readme for instructions)