Token_pattern is incorrectly applied on incoming text during inference #5905
Labels
area:rasa-oss 🎡
Anything related to the open source Rasa framework
type:bug 🐛
Inconsistencies or issues which will cause an issue or problem for users or implementors.
Rasa version: 1.10.0
Python version: 3.6.5
Operating system (windows, osx, ...): MacOS
Issue: The original problem is explained here. This is not a quick fix: our downstream models operate on token sequences, so a featurizer cannot alter the length of the sequence. Hence, `token_pattern` should be added as a parameter of the tokenizer and removed from `CountVectorsFeaturizer`.
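To illustrate the sequence-length problem, here is a minimal sketch that uses only Python's `re` module. It assumes the pattern in question is scikit-learn's default `token_pattern`, `(?u)\b\w\w+\b`; the helper names `whitespace_tokenize` and `apply_token_pattern` are hypothetical and are not Rasa's actual implementation.

```python
import re

# Assumption: scikit-learn's default CountVectorizer token pattern,
# which keeps only runs of two or more word characters.
TOKEN_PATTERN = r"(?u)\b\w\w+\b"


def whitespace_tokenize(text):
    """Hypothetical stand-in for a tokenizer: split on whitespace."""
    return text.split()


def apply_token_pattern(tokens, pattern=TOKEN_PATTERN):
    """Hypothetical stand-in for a featurizer that re-tokenizes each token."""
    regex = re.compile(pattern)
    result = []
    for token in tokens:
        # A token may yield zero matches (dropped) or several (split).
        result.extend(regex.findall(token))
    return result


text = "I am a state-of-the-art bot"
tokens = whitespace_tokenize(text)
filtered = apply_token_pattern(tokens)

print(len(tokens), tokens)
print(len(filtered), filtered)
```

The single-character tokens are dropped and the hyphenated token is split, so the featurizer sees a sequence of a different length than the one the tokenizer produced. Applying the pattern inside the tokenizer would keep both in step.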