Hello @sudharsan13296
Thank you for this very interesting work.
I have a question regarding section 6.3, "MAML in Supervised Learning".
While Step 3 (the inner loop) is quite clear in the supervised learning setting, I'm still not sure how to implement it in the reinforcement learning setting. There, Di consists of K trajectories, each with horizon H. How should theta'_i be computed?
A - With a gradient step for each of the K trajectories?
B - Once, at the end of training on all K trajectories?
In either case, do you have an idea of how the gradient-descent steps and losses should be computed (and possibly aggregated) to obtain theta'_i? A sketch of what I mean by option B follows below.
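To make option B concrete, here is a minimal PyTorch-style sketch. The names (`PolicyNet`, `discounted_return`, `inner_loop_update`, `alpha`, `gamma`) are my own illustrative choices, not from the book: the REINFORCE losses from all K trajectories are averaged, then a single gradient step produces theta'_i.

```python
import torch
import torch.nn as nn

# Illustrative policy: a single linear layer producing action logits.
class PolicyNet(nn.Module):
    def __init__(self, state_dim, n_actions):
        super().__init__()
        self.fc = nn.Linear(state_dim, n_actions)

    def forward(self, state):
        return torch.distributions.Categorical(logits=self.fc(state))

def discounted_return(rewards, gamma=0.99):
    # Discounted return-to-go at each step of the horizon H.
    g, out = 0.0, []
    for r in reversed(rewards):
        g = r + gamma * g
        out.append(g)
    return torch.tensor(list(reversed(out)))

def inner_loop_update(policy, trajectories, alpha=0.1):
    # Option B: aggregate the REINFORCE loss over ALL K trajectories,
    # then take one gradient step to obtain theta'_i.
    # Each trajectory is a list of (state, action, reward) tuples.
    losses = []
    for traj in trajectories:  # K trajectories
        states, actions, rewards = zip(*traj)
        returns = discounted_return(rewards)
        log_probs = torch.stack(
            [policy(s).log_prob(a) for s, a in zip(states, actions)]
        )
        losses.append(-(log_probs * returns).sum())
    loss = torch.stack(losses).mean()  # average over the K trajectories
    # create_graph=True keeps this step differentiable so the outer
    # (meta) update can backpropagate through it.
    grads = torch.autograd.grad(loss, list(policy.parameters()),
                                create_graph=True)
    return [p - alpha * g for p, g in zip(policy.parameters(), grads)]
```

Option A would instead move the `autograd.grad` step inside the loop, applying a separate update after each trajectory, which is why I'm unsure which of the two the chapter intends.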
Best Regards,