Load models in finetune mode based on command line parameters. #7329

Closed · 3 tasks
wochinge opened this issue Nov 20, 2020 · 9 comments · Fixed by #7456
Labels: area:rasa-oss 🎡 (Anything related to the open source Rasa framework), type:enhancement ✨ (Additions of new features or changes to existing ones, should be doable in a single PR)

Comments

@wochinge (Contributor)

Load models for incremental training.

Depends on #7328.

  • load the NLU pipeline in fine-tune mode as shown here
  • load the Core policies in fine-tune mode as shown here
  • pass in the new number of epochs (a rough sketch of the intended entry point follows below)
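
A rough sketch of what the loading entry point could look like, with the fine-tune flag and new epoch count arriving from the command line. This is illustrative only: the function name and the helpers `read_metadata`/`load_components` are assumptions, not the actual Rasa API.

```python
from typing import Any, Dict, Optional, Text


def load_model_for_finetuning(
    model_path: Text,
    finetune: bool = False,        # would come from a CLI flag
    epochs: Optional[int] = None,  # would come from a CLI parameter
) -> Any:
    """Load a trained model, optionally in fine-tune mode.

    With finetune=True, components/policies are loaded so that their
    weights can be trained further, and `epochs` overrides the epoch
    count stored in the model's metadata.
    """
    metadata: Dict[Text, Any] = read_metadata(model_path)  # hypothetical helper
    if finetune and epochs is not None:
        metadata["epochs"] = epochs
    return load_components(metadata, should_finetune=finetune)  # hypothetical helper
```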
wochinge added the type:enhancement ✨, area:rasa-oss 🎡, and priority:high labels on Nov 20, 2020
@dakshvar22 (Contributor)

@wochinge @joejuzl Do you folks know how the load method of each ML component will be called during fine-tuning? For example, would an extra parameter be passed to the load function of the components to load them in "finetune" mode, along with other configuration parameters such as the number of epochs? Here's how I had done it in the working version branch, but I assume it won't be exactly the same. If we can plan this ahead, it would unblock some of the implementation due in my PR. 🙏

@joejuzl (Contributor) commented Dec 1, 2020

@dakshvar22 I haven't started looking at this area of the code yet (I'm working on #7330 first), so it's hard for me to say. @wochinge any opinions/thoughts?

@dakshvar22 (Contributor)

@joejuzl @wochinge Making a proposal to see if we can reach a consensus quickly on the above question:

  1. The new number of epochs gets set inside the meta parameter that is passed to the load method of all components.
  2. Add a boolean parameter, finetune_mode, to the load method, which is set to True if the component is loaded in finetune mode (see the sketch after this list).

What do you folks think?
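
A minimal sketch of what this proposal might look like on a component; signatures are simplified assumptions, and only the meta dict and the finetune_mode flag come from the proposal above.

```python
from typing import Any, Dict, Optional, Text


class MLComponentSketch:
    """Illustrative component showing the two proposed changes."""

    def __init__(self, meta: Dict[Text, Any], finetune_mode: bool = False) -> None:
        # proposal 1: the (possibly updated) number of epochs lives in `meta`
        self.epochs = meta.get("epochs", 300)
        self.finetune_mode = finetune_mode

    @classmethod
    def load(
        cls,
        meta: Dict[Text, Any],
        model_dir: Optional[Text] = None,
        finetune_mode: bool = False,  # proposal 2: explicit boolean flag
        **kwargs: Any,
    ) -> "MLComponentSketch":
        component = cls(meta, finetune_mode=finetune_mode)
        # ... restore persisted weights from model_dir so that training can
        # continue from them instead of starting from scratch ...
        return component
```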

@joejuzl (Contributor) commented Dec 3, 2020

@dakshvar22 So the solution we have come up with (today!) is as follows:

For NLU:

  • In Interpreter.load we pass in the new config along with the old model.
  • Interpreter.load updates the old metadata with the new epochs and calls Interpreter.create with a should_finetune flag.
  • Interpreter.create sets should_finetune in the context, which gets passed to each component.
  • Trainer.__init__ now optionally takes the old loaded model and reuses its pipeline, e.g. self.pipeline = old_model.pipeline (a condensed sketch follows below).
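
A condensed sketch of that flow, assuming simplified signatures and hypothetical helpers (`load_metadata`, `load_component`, `build_pipeline`); the real Interpreter and Trainer take more arguments than shown here.

```python
from typing import Any, Dict, List, Optional, Text


class InterpreterSketch:
    def __init__(self, pipeline: List[Any]) -> None:
        self.pipeline = pipeline

    @classmethod
    def load(cls, model_dir: Text, new_config: Dict[Text, Any]) -> "InterpreterSketch":
        metadata = load_metadata(model_dir)  # hypothetical helper
        # update the old metadata with the new epochs from the new config
        metadata["epochs"] = new_config.get("epochs", metadata.get("epochs"))
        return cls.create(metadata, should_finetune=True)

    @classmethod
    def create(cls, metadata: Dict[Text, Any], should_finetune: bool = False) -> "InterpreterSketch":
        # the context dict is handed to every component while it loads
        context: Dict[Text, Any] = {"should_finetune": should_finetune}
        pipeline = [load_component(c, context) for c in metadata["pipeline"]]  # hypothetical helper
        return cls(pipeline)


class TrainerSketch:
    def __init__(self, config: Dict[Text, Any], old_model: Optional[InterpreterSketch] = None) -> None:
        if old_model is not None:
            # fine-tuning: reuse the pipeline of the already-loaded model
            self.pipeline = old_model.pipeline
        else:
            self.pipeline = build_pipeline(config)  # hypothetical helper
```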

@dakshvar22 (Contributor)

@joejuzl Perfect! Small clarification:

Interpreter.create sets should_finetune in the context which gets passed to each component.

The context dictionary is what will be passed as part of the kwargs argument of the load() method of each component?

@joejuzl (Contributor) commented Dec 3, 2020

The context dictionary is what will be passed as part of the kwargs argument of the load() method of each component?

Yes, exactly: via component_builder.load_component.
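
Illustratively (the real component_builder.load_component takes different arguments), the context dict ends up splatted into each component's load() as keyword arguments:

```python
from typing import Any, Dict, Text


def load_component_sketch(
    component_meta: Dict[Text, Any],
    model_dir: Text,
    context: Dict[Text, Any],
) -> Any:
    component_class = resolve_component_class(component_meta)  # hypothetical helper
    # the context (including should_finetune) arrives in load() via **kwargs
    return component_class.load(component_meta, model_dir, **context)
```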

@joejuzl (Contributor) commented Dec 3, 2020

For Core:

  • The new config is passed from Agent.load -> PolicyEnsemble.load.
  • Then the part of the new config belonging to each policy is passed into its respective load, which passes the new epochs into its constructor.
  • It's still not clear exactly how the should_finetune flag will be passed around; I guess through the same path (a sketch follows below), @wochinge?
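
A sketch of the Core path under the same caveat: simplified signatures and hypothetical helpers, not the actual rasa.core API.

```python
from typing import Any, Dict, List, Optional, Text


class PolicyEnsembleSketch:
    def __init__(self, policies: List[Any]) -> None:
        self.policies = policies

    @classmethod
    def load(cls, model_dir: Text, new_config: Optional[Dict[Text, Any]] = None) -> "PolicyEnsembleSketch":
        policies = []
        for name, policy_cls, path in persisted_policies(model_dir):  # hypothetical helper
            # hand each policy only its slice of the new config
            policy_config = (new_config or {}).get(name, {})
            policies.append(policy_cls.load(path, **policy_config))
        return cls(policies)


class AgentSketch:
    @classmethod
    def load(cls, model_dir: Text, new_config: Optional[Dict[Text, Any]] = None) -> PolicyEnsembleSketch:
        # the new config travels from Agent.load into PolicyEnsemble.load
        return PolicyEnsembleSketch.load(model_dir, new_config)


class TEDPolicySketch:
    def __init__(self, epochs: int = 1, **kwargs: Any) -> None:
        self.epochs = epochs

    @classmethod
    def load(cls, path: Text, epochs: int = 1, **kwargs: Any) -> "TEDPolicySketch":
        # the new epoch count flows from load() into the constructor
        policy = cls(epochs=epochs, **kwargs)
        # ... restore persisted weights from `path` ...
        return policy
```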

@wochinge (Contributor, Author) commented Dec 3, 2020

@dakshvar22 Our current approach would be to provide should_finetune through the constructor. The alternative would be to do so when calling train. Do you have a preference?

@dakshvar22 (Contributor)

@wochinge Do you mean for Core specifically, or for NLU components as well?
Either way, we don't need it when calling train. It definitely has to be passed to the load() method of the ML components (DIETClassifier, TEDPolicy) so that the model weights can be instantiated in "training" mode; load() would then pass this parameter on to the constructor anyway. If you want to see an example of what I mean, here is how I do it for ML components inside NLU (a sketch of the idea follows below).
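
A sketch of the point being made, assuming simplified signatures and hypothetical helpers (`build_model`, `restore_weights`); the real DIETClassifier.load is considerably more involved.

```python
from typing import Any, Dict, Optional, Text


class DIETClassifierSketch:
    def __init__(self, meta: Dict[Text, Any], should_finetune: bool = False) -> None:
        self.should_finetune = should_finetune
        # build the model graph in trainable mode when fine-tuning, so its
        # weights stay updatable after the persisted values are restored
        self.model = build_model(meta, trainable=should_finetune)  # hypothetical helper

    @classmethod
    def load(
        cls,
        meta: Dict[Text, Any],
        model_dir: Optional[Text] = None,
        should_finetune: bool = False,
        **kwargs: Any,
    ) -> "DIETClassifierSketch":
        component = cls(meta, should_finetune=should_finetune)
        restore_weights(component.model, model_dir)  # hypothetical helper
        return component
```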

joejuzl linked a pull request on Dec 4, 2020 that will close this issue (4 tasks)
joejuzl added a commit that referenced this issue on Dec 7, 2020
joejuzl added a commit that referenced this issue on Dec 9, 2020:
* Load core model in fine-tuning mode

* Core finetune loading test

* Test and PR comments

* Fallback to default epochs

* Test policy and ensemble fine-tuning exception cases

* Remove epoch_override from Policy.load

* use kwargs

* fix

* fix train tests

* More test fixes

* Apply suggestions from code review

Co-authored-by: Daksh Varshneya <[email protected]>

* remove unneeded sklearn epochs

* Apply suggestions from code review

Co-authored-by: Tobias Wochinger <[email protected]>

* PR comments for warning strings

* Add typing

* add back invalid model tests

* small comments

Co-authored-by: Daksh Varshneya <[email protected]>
Co-authored-by: Tobias Wochinger <[email protected]>