Problem with languages that have unicode characters #420

Gvantsats · 2019-10-24T07:02:07Z

Duckling doesn't find 'ten' in for example 'ptenz' because ten is just a substring same happens with russian language. But in georgian 'ათი' means ten and Duckling finds 10 in every word that has 'ათი' as a substring. for example this is the answer on the text 'პათიზ'

[ { "body": "ათი", "start": 1, "value": { "value": 10, "type": "value" }, "end": 4, "dim": "number", "latent": false } ]

I commented all the rules and I only left one which knows that 'ათი' is ten but I still get the same result. So I thinks it's not because of the rules could it be problem with the encoding?

The text was updated successfully, but these errors were encountered:

Bagdu · 2019-11-28T14:19:35Z

Same to me!

chessai · 2020-11-06T05:30:12Z

closing in favour of #439

Bagdu mentioned this issue Dec 12, 2019

Possible solution for the issue #420 #442

Closed

chessai added duplicate KA (Georgian) labels Nov 6, 2020

chessai closed this as completed Nov 6, 2020

This was referenced Nov 11, 2020

test suite failure, Georgian, GHC >= 8.8.x #541

Open

ghc88x compat #550

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problem with languages that have unicode characters #420

Problem with languages that have unicode characters #420

Gvantsats commented Oct 24, 2019

Bagdu commented Nov 28, 2019

chessai commented Nov 6, 2020

Problem with languages that have unicode characters #420

Problem with languages that have unicode characters #420

Comments

Gvantsats commented Oct 24, 2019

Bagdu commented Nov 28, 2019

chessai commented Nov 6, 2020