Parameter "share_hidden_layers" not compatible with RegexFeaturizer/LexicalSyntacticFeaturizer #5528

hotzenklotz · 2020-03-30T15:15:48Z

Rasa version:
1.9.3

Rasa SDK version (if used & relevant):
1.9.0

Rasa X version (if used & relevant):
not used

Python version:
3.7

Operating system (windows, osx, ...):
OSX

Issue:
Training a response selector / DIET classifier with the parameter share_hidden_layers: true leads to the following error, even though the text and label dimensions were configured to be of equal size in the config:

ValueError: If embeddings are shared text features and label features must coincide. Check the output dimensions of previous components.

Suggested fix by @Ghostvv: Removing the RegexFeaturizer and the LexicalSyntacticFeaturizer from my config solved the issue. It looks like Rasa internally attaches some features to either the text or label vectors, which breaks the share_hidden_layers parameter.

Breaking NLU config:

pipeline:
  - name: HFTransformersNLP
    model_name: "bert"
    model_weights: "bert-base-german-cased"
  - name: "LanguageModelTokenizer"
  - name: RegexFeaturizer
  - name: LexicalSyntacticFeaturizer
  - name: LanguageModelFeaturizer
  - name: CountVectorsFeaturizer
    lowercase: false
    use_shared_vocab: true
  - name: DIETClassifier
    epochs: 50
  - name: EntitySynonymMapper
  - name: ResponseSelector
    epochs: 500
    share_hidden_layers: true
    hidden_layers_sizes:
      text: [256, 128]
      label: [256, 128]

Error (including full traceback):

ValueError: If embeddings are shared text features and label features must coincide. Check the output dimensions of previous components.

in diet_classifier.py, line ~400
see method _check_input_dimension_consistency
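
For readers unfamiliar with the check, here is a minimal sketch of the kind of comparison that raises this error. It is not the Rasa source: the function signature and all feature widths except the BERT hidden size are hypothetical, and it assumes the RegexFeaturizer and LexicalSyntacticFeaturizer add columns to the text features only.

```python
# Simplified stand-in for a dimension consistency check; NOT the Rasa implementation.
def check_input_dimension_consistency(
    share_hidden_layers: bool, text_dim: int, label_dim: int
) -> None:
    """Raise if hidden layers are shared but the input widths differ."""
    if share_hidden_layers and text_dim != label_dim:
        raise ValueError(
            "If embeddings are shared text features and label features must "
            "coincide. Check the output dimensions of previous components."
        )


# Feature widths per featurizer. Only LM_DIM reflects a known value
# (hidden size of bert-base models); the rest are made up for illustration.
LM_DIM = 768
COUNT_VECTORS_DIM = 1000  # hypothetical
REGEX_DIM = 5  # hypothetical
LEX_SYN_DIM = 40  # hypothetical

# Assumption: the two extra featurizers contribute to the text side only.
text_dim = LM_DIM + COUNT_VECTORS_DIM + REGEX_DIM + LEX_SYN_DIM
label_dim = LM_DIM + COUNT_VECTORS_DIM

try:
    check_input_dimension_consistency(True, text_dim, label_dim)
except ValueError as error:
    print(f"text={text_dim}, label={label_dim}: {error}")

# Without the two featurizers both sides have the same width and the check passes.
check_input_dimension_consistency(True, label_dim, label_dim)
```

Note that under this assumption hidden_layers_sizes does not help, because the mismatch is in the input width of the first shared layer, not in the sizes of the hidden layers themselves.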

Command or request that led to error:

rasa train / rasa train with cross_validation


sara-tagger · 2020-03-31T06:00:06Z

Thanks for raising this issue, @alwx will get back to you about it soon✨

Please also check out the docs and the forum in case your issue was raised there too 🤗

dakshvar22 · 2020-04-14T14:28:19Z

Don’t think this can be solved right now without implementing #5510 because:

1. It doesn’t make sense to process responses through LexicalSyntacticFeaturizer.
2. Inside ResponseSelector there is no way of finding which feature set corresponds to the ones coming from LexicalSyntacticFeaturizer for text, so that they can be left out.
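
To illustrate the second point, here is a toy sketch of how provenance is lost once featurizer outputs are concatenated. It assumes each featurizer simply appends its columns to the features already on the message; the helper name and the vectors are hypothetical and only stand in for real featurizer output.

```python
from typing import List


def add_features(existing: List[float], new: List[float]) -> List[float]:
    """Append one featurizer's output to the features already collected."""
    return existing + new


# Hypothetical outputs, in pipeline order.
text_features: List[float] = []
text_features = add_features(text_features, [1.0, 0.0])  # stand-in for RegexFeaturizer
text_features = add_features(text_features, [0.0, 1.0, 1.0])  # stand-in for LexicalSyntacticFeaturizer
text_features = add_features(text_features, [0.3, 0.7, 0.1, 0.9])  # stand-in for CountVectorsFeaturizer

# A downstream component only sees one flat vector of width 9; nothing in it
# records which columns came from which featurizer.
print(len(text_features))
```

With only the flat vector available, a downstream component cannot drop the columns of one specific featurizer unless the origin of each block is tracked separately.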

Ghostvv · 2020-04-14T21:59:58Z

@dakshvar22 I think it doesn't even make sense to have sentence-level features in LexicalSyntacticFeaturizer

dakshvar22 · 2020-05-18T08:35:34Z

@Ghostvv I agree, they should be excluded. cc @tabergma

tabergma · 2020-06-12T14:40:26Z

@dakshvar22 Can this be closed as you can now exclude specific features from the ResponseSelector? (#5863)

hotzenklotz added the type:bug 🐛 Inconsistencies or issues which will cause an issue or problem for users or implementors. label Mar 30, 2020

dakshvar22 self-assigned this Apr 6, 2020

Ghostvv added the area:rasa-oss 🎡 Anything related to the open source Rasa framework label Apr 14, 2020

tabergma closed this as completed Jun 24, 2020
