Token_pattern is incorrectly applied on incoming text during inference #5905

dakshvar22 · 2020-05-27T07:37:35Z

Rasa version: 1.10.0

Python version: 3.6.5

Operating system (windows, osx, ...): MacOS

Issue: The original problem is explained here. It is not a quick fix because since we use sequence in our downstream models, we cannot alter the length of the sequence in a featurizer. Hence, token_pattern should be added as a parameter inside a tokenizer and removed from countvectorsfeaturizer.

The text was updated successfully, but these errors were encountered:

dakshvar22 added type:bug 🐛 Inconsistencies or issues which will cause an issue or problem for users or implementors. area:rasa-oss 🎡 Anything related to the open source Rasa framework labels May 27, 2020

tabergma self-assigned this Jun 26, 2020

tabergma mentioned this issue Jun 26, 2020

Move token_pattern to tokenizers #6073

Merged

4 tasks

tabergma closed this as completed in #6073 Jul 7, 2020

tabergma mentioned this issue Jul 7, 2020

[Diet Classifier] ValueError: Number of examples should be the same for all data. #5508

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Token_pattern is incorrectly applied on incoming text during inference #5905

Token_pattern is incorrectly applied on incoming text during inference #5905

dakshvar22 commented May 27, 2020

Token_pattern is incorrectly applied on incoming text during inference #5905

Token_pattern is incorrectly applied on incoming text during inference #5905

Comments

dakshvar22 commented May 27, 2020