Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor CountVectorizer and ML components to support incremental training #7413

Closed
3 tasks done
dakshvar22 opened this issue Nov 30, 2020 · 0 comments
Closed
3 tasks done
Assignees
Labels
area:rasa-oss/ml 👁 All issues related to machine learning type:enhancement ✨ Additions of new features or changes to existing ones, should be doable in a single PR

Comments

@dakshvar22
Copy link
Contributor

dakshvar22 commented Nov 30, 2020

Part of #6971

  • Add buffer tokens to vocabulary of CountVectorsFeaturizer and RegexFeaturizer which can be used during incremental training.
  • Load DIET, ResponseSelector and TED from previously trained checkpoints so get them ready for further fine-tuning.
  • Add tests for the above.

The logic for first two are already implemented in this working version branch. However, the implementation needs to be adapted to changes introduced by related PRs - #7328 , #7329 and #7330

@dakshvar22 dakshvar22 added type:enhancement ✨ Additions of new features or changes to existing ones, should be doable in a single PR area:rasa-oss 🎡 Anything related to the open source Rasa framework area:rasa-oss/ml 👁 All issues related to machine learning and removed area:rasa-oss 🎡 Anything related to the open source Rasa framework labels Nov 30, 2020
@dakshvar22 dakshvar22 self-assigned this Nov 30, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area:rasa-oss/ml 👁 All issues related to machine learning type:enhancement ✨ Additions of new features or changes to existing ones, should be doable in a single PR
Projects
None yet
Development

No branches or pull requests

1 participant