Make NLU configuration more flexible #5510

dakshvar22 · 2020-03-27T08:59:28Z

Description of Problem:
Currently, the NLU pipeline is interpreted as a flat pipeline where each component produces annotations, i.e. tokens, sparse features, dense features, etc. and then they are finally consumed by all the ML components present in the pipeline. This used to work well when we were dealing with a smaller number of ML tasks in the same pipeline, i.e. intent classification and entity recognition. As we keep adding more ML tasks as part of our NLU pipeline, like response selection, the idea doesn't scale well because the same set of featurizers are not useful for every ML task and may actually degrade the performance for some.
It would be better if each ML component can take as input which featurizers should be used for its modelling.

Overview of the Solution:
We add an extra parameter to each component which sets an alias for it in the configuration. That alias can be used in the downstream ML component.

For example -

- pipeline:
   - name: WhitespaceTokenizer
   - name: CountVectorsFeaturizer
      alias: cvf_word
   - name: CountVectorsFeaturizer
      analyzer: char
      min_ngram: 1
      max_ngram: 4
      alias: cvf_char
   - name: DIETClassifier
      in: [cvf_word, cvf_char]
   - name: ResponseSelector
      in: [cvf_word]

Definition of Done:

Tests are added
Feature described the docs
Feature mentioned in the changlog

The text was updated successfully, but these errors were encountered:

dakshvar22 added the type:enhancement ✨ Additions of new features or changes to existing ones, should be doable in a single PR label Mar 27, 2020

tabergma added the area:rasa-oss 🎡 Anything related to the open source Rasa framework label Mar 27, 2020

dakshvar22 mentioned this issue Apr 14, 2020

Parameter "share_hidden_layers" not compatible with RegexFeaturizer/LexicalSyntacticFeaturizer #5528

Closed

tabergma self-assigned this May 4, 2020

tabergma mentioned this issue May 20, 2020

Flexible NLU pipeline #5863

Merged

4 tasks

tabergma closed this as completed in #5863 Jun 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make NLU configuration more flexible #5510

Make NLU configuration more flexible #5510

dakshvar22 commented Mar 27, 2020 •

edited

Loading

Make NLU configuration more flexible #5510

Make NLU configuration more flexible #5510

Comments

dakshvar22 commented Mar 27, 2020 • edited Loading

dakshvar22 commented Mar 27, 2020 •

edited

Loading