Two issues #9

LindgeW · 2021-08-30T10:19:33Z

If Adam optimizer is used, can it still work ? (Line 7. indicates a standard gradient desent method) Or, this just fit into the SGD based optimizer?
2. I would like to use this reweighting strategy in more complicated neural framework such as LSTM, BERT for other downstream tasks. Whether I must modify these to the 'meta-style' structures ? It seems to be trivial.

LindgeW · 2021-08-30T10:24:02Z

@mengye-ren

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Two issues #9

Two issues #9

LindgeW commented Aug 30, 2021

LindgeW commented Aug 30, 2021

Two issues #9

Two issues #9

Comments

LindgeW commented Aug 30, 2021

LindgeW commented Aug 30, 2021