
Problem with languages that have unicode characters #420

Closed
Gvantsats opened this issue Oct 24, 2019 · 2 comments

Comments

@Gvantsats

Duckling doesn't find 'ten' in, for example, 'ptenz', because 'ten' is just a substring there; the same holds for Russian. But in Georgian, 'ათი' means ten, and Duckling finds 10 in every word that contains 'ათი' as a substring. For example, this is the result for the text 'პათიზ':

[ { "body": "ათი", "start": 1, "value": { "value": 10, "type": "value" }, "end": 4, "dim": "number", "latent": false } ]

I commented out all the rules and left only the one that maps 'ათი' to ten, but I still get the same result. So I think it's not caused by the rules. Could it be a problem with the encoding?
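The expected behaviour can be sketched outside Duckling. The snippet below is only an illustration of a Unicode-aware whole-word check, not Duckling's actual matching code; the helper names `is_whole_word` and `find_whole_words` are hypothetical. It assumes a match should be rejected whenever the character just before or just after it is a letter or digit in any script, which is what happens for 'ten' in 'ptenz' and what one would expect for 'ათი' in 'პათიზ' (reported above at offsets 1 to 4, inside the word).

```python
def is_whole_word(text: str, start: int, end: int) -> bool:
    """Return True if text[start:end] is not flanked by a letter or digit.

    str.isalnum() is Unicode-aware, so Georgian and Cyrillic letters
    count as word characters just like Latin ones.
    """
    before_ok = start == 0 or not text[start - 1].isalnum()
    after_ok = end == len(text) or not text[end].isalnum()
    return before_ok and after_ok


def find_whole_words(text: str, needle: str) -> list[tuple[int, int]]:
    """Return (start, end) offsets of needle where it stands as a whole word."""
    hits = []
    pos = text.find(needle)
    while pos != -1:
        end = pos + len(needle)
        if is_whole_word(text, pos, end):
            hits.append((pos, end))
        # Continue searching one character further along.
        pos = text.find(needle, pos + 1)
    return hits


if __name__ == "__main__":
    print(find_whole_words("ptenz", "ten"))        # substring inside a Latin word
    print(find_whole_words("პათიზ", "ათი"))        # substring inside a Georgian word
    print(find_whole_words("მე ათი წლის", "ათი"))  # standalone Georgian word
```

With a check like this, the first two calls yield no matches and only the standalone 'ათი' is reported. If a matcher's boundary test only recognises some scripts or character classes, the behaviour described in this issue would be one plausible outcome, though that is an assumption rather than a confirmed diagnosis.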

@Bagdu

Bagdu commented Nov 28, 2019

Same for me!

@chessai
Contributor

chessai commented Nov 6, 2020

closing in favour of #439
