How to get intermediate values of pretrained models? #2527

snie2012 · 2019-02-19T21:52:53Z

Is it possible to get the intermediate values of the hidden layers for the pretrained sequence tagging models when making prediction? For example, the output of the LSTM layer of a pretrained tagging model. If so, how?

joelgrus · 2019-02-19T22:57:15Z

I have an experimental PR that does this:

https://github.com/allenai/allennlp/pull/2211/files#diff-a64f3186684b9877702c7c4e2540950aR63

basically you can register a hook on the LSTM layer and use that hook to grab its output. that PR is still kind of half-baked, but it should point you in the right direction.

HarshTrivedi · 2019-02-19T23:24:47Z

If the model is your own code, then you can also change that to have hidden state in the output dict in forward method. Eg. output_dict["hidden_state"] = my_lstm_state. (Make sure first dim of my_lstm_state is batch_size). Even if the pretrained model was generated with old model code, when it's loaded the new code will be used. If you are using default predict_instance in your predictor, it will have the santized hidden state in output already. If not you need to make sure that key hidden_state key is actually passed on in the returned json dict of predict_instance.

HarshTrivedi · 2019-02-19T23:34:15Z

However, what @joelgrus is suggesting is obviously cleaner, since that way you don't make hardcoded changes in model code just to tweak what predictor needs to give out at prediction time.

snie2012 · 2019-02-19T23:38:13Z

Thanks for your replies @joelgrus @HarshTrivedi . I am also looking at using forward hooks to get intermediate values mainly from pretrained models. It works pretty well:)

MeiqiGuo · 2019-03-13T19:49:14Z

@snie2012 Hi, could you please specify how did you solve this issue? I am also looking for intermediate values of pertained model on new data. Thanks in advance!

joelgrus · 2019-03-13T19:58:22Z

I have a new PR that's even cleaner, but it's not merged yet:

https://github.com/allenai/allennlp/pull/2581/files

snie2012 · 2019-03-13T20:44:32Z

@MeiqiGuo As @joelgrus says, use hooks can do the job.
In addition from the above example, here is another one flairNLP/flair#524 (comment)

MeiqiGuo · 2019-03-13T20:57:06Z

@snie2012 @joelgrus Thanks for your help! I tried changing the forward function and it works. I will try this cleaner version with hooks later.

schmmd · 2019-03-15T22:33:33Z

Closing since #2581 is merged.

schmmd assigned joelgrus Mar 14, 2019

schmmd closed this as completed Mar 15, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get intermediate values of pretrained models? #2527

How to get intermediate values of pretrained models? #2527

snie2012 commented Feb 19, 2019 •

edited

Loading

joelgrus commented Feb 19, 2019

HarshTrivedi commented Feb 19, 2019

HarshTrivedi commented Feb 19, 2019

snie2012 commented Feb 19, 2019

MeiqiGuo commented Mar 13, 2019

joelgrus commented Mar 13, 2019

snie2012 commented Mar 13, 2019

MeiqiGuo commented Mar 13, 2019

schmmd commented Mar 15, 2019

How to get intermediate values of pretrained models? #2527

How to get intermediate values of pretrained models? #2527

Comments

snie2012 commented Feb 19, 2019 • edited Loading

joelgrus commented Feb 19, 2019

HarshTrivedi commented Feb 19, 2019

HarshTrivedi commented Feb 19, 2019

snie2012 commented Feb 19, 2019

MeiqiGuo commented Mar 13, 2019

joelgrus commented Mar 13, 2019

snie2012 commented Mar 13, 2019

MeiqiGuo commented Mar 13, 2019

schmmd commented Mar 15, 2019

snie2012 commented Feb 19, 2019 •

edited

Loading