Tracker featurization during batch generation #9022
Labels
area:rasa-oss 🎡
Anything related to the open source Rasa framework
area:rasa-oss/ml 👁
All issues related to machine learning
research:end-to-end
type:enhancement ✨
Additions of new features or changes to existing ones, should be doable in a single PR
Description of Problem: Currently every tracker is featurized before the training and kept around in memory. This takes a lot of time, but also a lot of memory, keeping all the featurized trackers around at the same time. This issue is particularly big for large e2e datasets such as multiwoz.
Overview of the Solution: The trackers can be featurized during the batch generation. This leads to much lower memory consumption. However, featurization from the ground up is slower than the model training loop. This is why this issue first needs the lookup table issue to be resolved to be viable.
There is a prototypical implementation of this feature in the combined e2e fixes branch where the trackers are encoded inside the data loader class.
Open Issues
Blockers #9020 is needed for this proposal to be computationally viable
Definition of Done:
The text was updated successfully, but these errors were encountered: