[TF]Refine LSTMBlockCell to support dynamic rnn #5963

lixiaoquan · 2020-06-30T08:29:51Z

Refine conversion of LSTMBlockCell
1. Make its output follow the definition in TensorFlow
2. Avoid introducing variables which don't match any placeholder nodes in the TensorFlow graph
About the change in test_forward_ptb
The state nodes of LSTMBlockCell in this PB file are actually Constant nodes.
TF can feed data to those Constant nodes but Relay can't, so the current conversion of LSTMBlockCell introduces extra variables to work around this.
As a result, the Relay IR doesn't match the original TF graph. This PR solves the issue by converting those state nodes into placeholders.
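A toy sketch of the idea in the last sentence, using plain dicts rather than real TensorFlow `NodeDef` objects. The node layout (`name`, `op`, `dtype`, `shape`, `value`) and the function name `constants_to_placeholders` are assumptions made for illustration, not the code in this PR.

```python
def constants_to_placeholders(nodes, state_names):
    """Return a copy of `nodes` in which the named Const nodes become Placeholders.

    `nodes` is a list of dicts standing in for graph nodes (hypothetical layout).
    Only nodes whose name is in `state_names` are rewritten; weights stay Const.
    """
    rewritten = []
    for node in nodes:
        node = dict(node)  # shallow copy so the input list is left untouched
        if node["op"] == "Const" and node["name"] in state_names:
            node.pop("value", None)      # a placeholder carries no baked-in value
            node["op"] = "Placeholder"   # dtype and shape are kept as-is
        rewritten.append(node)
    return rewritten


graph = [
    {"name": "state_c", "op": "Const", "dtype": "float32", "shape": [1, 2], "value": [[0.0, 0.0]]},
    {"name": "kernel", "op": "Const", "dtype": "float32", "shape": [1], "value": [1.0]},
]
converted = constants_to_placeholders(graph, {"state_c"})
```

After such a rewrite, each state input corresponds to exactly one graph input, so an importer would not need to invent extra variables for it.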

cc @kevinthesun @srkreddy1238 @joyalbin Please help to review, thanks

lixiaoquan · 2020-07-06T02:40:45Z

also cc @zhiics @yongwww Could you please help to review? Thanks

yongwww · 2020-07-08T07:14:32Z

python/tvm/relay/frontend/tensorflow.py

+        # Return dummy for those unused values
+        dummy = tvm.relay.const(0)
+        return tvm.relay.TupleWrapper(
+            tvm.relay.Tuple([dummy, next_c, dummy, dummy, dummy, dummy, next_h]), 7)


I am not sure whether the dummy nodes might be used in some cases. It would be better to generate the real nodes, as TF does, instead of the dummies.

def lstm_block_cell(x, cs_prev, h_prev, w, wci, wcf, wco, b,
                    forget_bias=1, cell_clip=3, use_peephole=False, name=None):
  r"""Computes the LSTM cell forward propagation for 1 time step.

  This implementation uses 1 weight matrix and 1 bias vector, and there's an
  optional peephole connection.

  This kernel op implements the following mathematical equations:

  xh = [x, h_prev]
  [i, f, ci, o] = xh * w + b
  f = f + forget_bias

  if not use_peephole:
    wci = wcf = wco = 0

  i = sigmoid(cs_prev * wci + i)
  f = sigmoid(cs_prev * wcf + f)
  ci = tanh(ci)

  cs = ci .* i + cs_prev .* f
  cs = clip(cs, cell_clip)

  o = sigmoid(cs * wco + o)
  co = tanh(cs)
  h = co .* o
  ...
  Returns:
    A tuple of `Tensor` objects (i, cs, f, o, ci, co, h).
  ...
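As a reading aid, the equations in the quoted docstring can be sketched in plain Python for a single sample. This is not the TensorFlow kernel or the Relay conversion: `lstm_block_cell_ref` and `sigmoid` are hypothetical helper names, the gate split order follows the docstring as quoted (the real kernel's weight layout may differ), and treating a non-positive `cell_clip` as "no clipping" is an assumption.

```python
import math


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def lstm_block_cell_ref(x, cs_prev, h_prev, w, b, wci=None, wcf=None, wco=None,
                        forget_bias=1.0, cell_clip=3.0, use_peephole=False):
    """One LSTM step for a single sample, following the quoted docstring.

    x: list of input_size floats; cs_prev, h_prev: lists of n floats.
    w: (input_size + n) rows by 4*n columns; b: list of 4*n floats.
    Returns the 7-tuple (i, cs, f, o, ci, co, h).
    """
    n = len(cs_prev)
    xh = list(x) + list(h_prev)                       # xh = [x, h_prev]
    gates = [sum(xh[k] * w[k][j] for k in range(len(xh))) + b[j]
             for j in range(4 * n)]                   # xh * w + b
    # Split into four blocks of n, in the order given by the docstring (assumption).
    i_raw, f_raw, ci_raw, o_raw = (gates[g * n:(g + 1) * n] for g in range(4))
    if not use_peephole:
        wci = wcf = wco = [0.0] * n
    i = [sigmoid(cs_prev[j] * wci[j] + i_raw[j]) for j in range(n)]
    f = [sigmoid(cs_prev[j] * wcf[j] + f_raw[j] + forget_bias) for j in range(n)]
    ci = [math.tanh(v) for v in ci_raw]
    cs = [ci[j] * i[j] + cs_prev[j] * f[j] for j in range(n)]
    if cell_clip > 0:                                 # assumption: <= 0 disables clipping
        cs = [max(-cell_clip, min(cell_clip, v)) for v in cs]
    o = [sigmoid(cs[j] * wco[j] + o_raw[j]) for j in range(n)]
    co = [math.tanh(v) for v in cs]
    h = [co[j] * o[j] for j in range(n)]
    return i, cs, f, o, ci, co, h


# Zero weights and bias: every raw gate value is 0 before forget_bias is added.
outputs = lstm_block_cell_ref(x=[1.0], cs_prev=[0.5], h_prev=[0.0],
                              w=[[0.0] * 4 for _ in range(2)], b=[0.0] * 4)
```

A reference like this could be used to check all seven outputs of a converted op numerically, including the ones the Python cell API never exposes.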

I added all the return objects, but I can't test them, because tensorflow.contrib.rnn.LSTMBlockCell.call() only returns h and new_state.

I think that if users construct the graph through the API, the other values won't be used.

(cs_prev, h_prev) = state
(_, cs, _, _, _, _, h) = _lstm_block_cell(
    inputs,
    cs_prev,
    h_prev,
    self._kernel,
    self._bias,
    wci=wci,
    wcf=wcf,
    wco=wco,
    forget_bias=self._forget_bias,
    cell_clip=self._cell_clip,
    use_peephole=self._use_peephole)
new_state = rnn_cell_impl.LSTMStateTuple(cs, h)
return h, new_state
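To illustrate why only two of the seven outputs matter on this code path, here is a small sketch that names the tuple positions. `LSTMBlockCellOutputs` and `select_step_result` are hypothetical names used for illustration; the field order is taken from the `Returns:` line of the docstring quoted earlier.

```python
from collections import namedtuple

# Field order as listed in the quoted docstring: (i, cs, f, o, ci, co, h).
LSTMBlockCellOutputs = namedtuple(
    "LSTMBlockCellOutputs", ["i", "cs", "f", "o", "ci", "co", "h"])


def select_step_result(raw_outputs):
    """Mimic the cell wrapper: keep only cs (index 1) and h (index 6)."""
    out = LSTMBlockCellOutputs(*raw_outputs)
    new_state = (out.cs, out.h)
    return out.h, new_state


# Strings stand in for tensors so the selection is easy to see.
h, new_state = select_step_result(("i", "cs", "f", "o", "ci", "co", "h"))
```

Graphs built through the cell API only ever read positions 1 and 6, which is why the remaining outputs are hard to exercise in a test.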

cc @yongwww Could you take another look? Thanks

1. Refine conversion of `LSTMBlockCell`
   1) Make its output follow the definition in TensorFlow
   2) Avoid introducing variables which don't match any placeholder nodes in the TensorFlow graph
2. About the change in test_forward_ptb
   The state nodes of LSTMBlockCell in this PB file are actually Constant nodes. TF can feed data to those Constant nodes but Relay can't, so the current conversion of LSTMBlockCell introduces extra variables to work around this. As a result, the Relay IR doesn't match the original TF graph. This PR solves the issue by converting those state nodes into placeholders.

yongwww

LGTM

lixiaoquan · 2020-07-16T07:12:32Z

ping @zhiics Could you help to review and merge? Thanks

kevinthesun

LGTM

kevinthesun · 2020-07-16T22:22:45Z

Thanks @lixiaoquan @yongwww

apivovarov · 2021-10-02T01:26:50Z

Tensorflow2 LSTM failed - the discussion is here https://discuss.tvm.apache.org/t/tensorflow2-lstm-failed/11174

lixiaoquan changed the title ~~Refine LSTMBlockCell to support dynamic rnn~~ [TF]Refine LSTMBlockCell to support dynamic rnn Jun 30, 2020

lixiaoquan force-pushed the lstm branch 3 times, most recently from 7460648 to 9acbe2c Compare June 30, 2020 23:22

yongwww requested changes Jul 8, 2020

lixiaoquan force-pushed the lstm branch from 9acbe2c to 053ee50 Compare July 8, 2020 11:46

tqchen added the status: need review label Jul 10, 2020

yongwww approved these changes Jul 14, 2020

kevinthesun approved these changes Jul 16, 2020

kevinthesun merged commit b85c239 into apache:master Jul 16, 2020

ZihengJiang mentioned this pull request Sep 25, 2020

TVM v0.7 Release Note Candidate #6486

Closed
