Hi, I am working on Duckling and I have the following problem.
Duckling doesn't find 'ten' in, for example, 'ptenz', because 'ten' is just a substring there. The same holds for Russian. But in Georgian 'ათი' means ten, and Duckling finds 10 in every word that contains 'ათი' as a substring. For example, this is the answer for the text 'პათიზ':
[ { "body": "ათი", "start": 1, "value": { "value": 10, "type": "value" }, "end": 4, "dim": "number", "latent": false } ]
I commented out all the rules and left only the one that maps 'ათი' to ten, but I still get the same result. So I think it is not because of the rules. Could it be a problem with the encoding?
Did you regenerate the classifiers after commenting out all those rules? You will typically want to regenerate the classifiers after making changes to rules/corpora.
If your locale is set to Russian (e.g. via makeLocale RU Nothing), Duckling will surely not find the numeral 10 in the string ptenz, because ten should only resolve to the numeral 10 when the locale is EN (English).
Here is an example showing that you have indeed uncovered an issue, because EN does not behave this way (nor should it):
> debug (makeLocale EN Nothing) "ptenz" [Seal Numeral]
[]
> debug (makeLocale KA Nothing) "პათიზ" [Seal Numeral]
integer (0..19) (ათი)
-- regex (ათი)
[Entity {dim = "number", body = "\4304\4311\4312", value = RVal Numeral (NumeralValue {vValue = 10.0}), start = 1, end = 4, latent = False, enode = Node {nodeRange = Range 1 4, token = Token Numeral (NumeralData {value = 10.0, grain = Nothing, multipliable = False, okForAnyTime = True}), children = [Node {nodeRange = Range 1 4, token = Token RegexMatch (GroupMatch ["\4304\4311\4312",""]), children = [], rule = Nothing}], rule = Just "integer (0..19)"}}]
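To make the expected behavior concrete, here is a minimal, self-contained sketch of the kind of boundary check one would expect to reject both matches. This is not Duckling's implementation: isStandalone is a hypothetical helper written only for illustration, and it assumes that a match should be dropped whenever the character immediately before or after it is a letter or digit according to Data.Char.isAlphaNum.

```haskell
import Data.Char (isAlphaNum)

-- | Hypothetical helper (not part of Duckling): True when the half-open
-- character range [start, end) is not glued to a neighbouring letter or
-- digit on either side.
isStandalone :: String -> Int -> Int -> Bool
isStandalone s start end = leftOk && rightOk
  where
    -- No character to the left, or the left neighbour is not a word character.
    leftOk  = start <= 0 || not (isAlphaNum (s !! (start - 1)))
    -- No character to the right, or the right neighbour is not a word character.
    rightOk = end >= length s || not (isAlphaNum (s !! end))

main :: IO ()
main = do
  print (isStandalone "ptenz" 1 4)  -- False: "ten" sits between 'p' and 'z'
  print (isStandalone "პათიზ" 1 4)  -- False: "ათი" sits between 'პ' and 'ზ'
  print (isStandalone "ათი" 0 3)    -- True: the match spans the whole input
```

Under this assumption the KA match at Range 1 4 would be rejected just like the EN one. If the real check classifies characters differently (for example by case rather than by "is a letter"), Georgian letters may fall outside the classes it recognizes, which could explain the difference; that is a guess to verify against the Duckling source.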
Hi! Thank you for your response.
On the first question: it was a long time ago, but as far as I remember, I regenerated the classifiers after commenting out all those rules.
On the second question: when I was testing, I had set the locale to English, so that's not the case either.
Maybe this issue can help you: #442