[TOPI] FIFO buffer op, to accelerate sequence modeling with dilated convolutions #4039
Conversation
TODO.
cc @vinx13 @merrymercy It would be great if you could help comment and review.
Thanks for the contribution. I will have to look into the details to understand the compute, but overall looks good to me. Will do one more round by tomorrow.
Thanks for the contribution. I left some minor reviews. Otherwise, looks good to me.
LGTM
LGTM. @vinx13 Can you take another look?
…onvolutions (apache#4039)
* Add FIFO buffer op to enable explicit computation re-use in convolution
* Add a test
* Add end-to-end test with 1D convolution
* Add a stub in MXNet frontend
* Address reviewer comments
* Add back stub for MXNet frontend
Motivation. Dilated convolutions have emerged as an effective alternative to recurrent units for modeling sequences. For example, WaveNet [1] uses a stack of dilated convolutional layers to generate raw audio waveforms from text. Snips [2] modifies the WaveNet architecture to detect a keyword in an audio stream.
In order to capture temporal context, the WaveNet architecture feeds a sliding window over the input sequence into the first convolutional layer. As noted in [2] and [3], computing convolution over the sliding window results in redundant computation, since consecutive windows overlap in all but a few samples.
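To make the overlap concrete, here is a small illustration in plain NumPy (not TVM code; `conv1d_valid` is a hypothetical helper, not part of this PR): convolving each window of a stream independently recomputes almost all of the previous window's outputs.

```python
import numpy as np

def conv1d_valid(window, kernel):
    """Plain 'valid' 1D convolution (cross-correlation) over one window."""
    k = len(kernel)
    return np.array([window[i:i + k] @ kernel
                     for i in range(len(window) - k + 1)])

stream = np.arange(10, dtype=np.float64)
kernel = np.array([1.0, 2.0, 3.0])

out_a = conv1d_valid(stream[0:8], kernel)   # window at step t
out_b = conv1d_valid(stream[1:9], kernel)   # window at step t + 1

# All but one output of the new window was already computed for the old one.
assert np.allclose(out_a[1:], out_b[:-1])
```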
This pull request implements a FIFO buffer operator that caches intermediate outputs from each convolutional layer, eliminating the redundant computation. This is similar in spirit to [4], except that here the re-use is explicit and inherent in the model. Note that caching is applicable only at inference time, not during training.
Semantics. The FIFO buffer op should behave like concat(buffer, data, axis=axis) followed by a slice along axis that keeps only the trailing buffer.shape[axis] elements, so the output has the same shape as the buffer.
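A minimal NumPy reference sketch of these semantics (an illustration only, assuming the concat-then-slice behavior described above; `fifo_buffer_ref` is a hypothetical name, not the TOPI API):

```python
import numpy as np

def fifo_buffer_ref(data, buffer, axis=0):
    """Reference semantics: append `data` to `buffer` along `axis`,
    then keep only the trailing buffer.shape[axis] elements."""
    combined = np.concatenate((buffer, data), axis=axis)
    begin = data.shape[axis]
    end = begin + buffer.shape[axis]
    index = [slice(None)] * combined.ndim
    index[axis] = slice(begin, end)
    return combined[tuple(index)]

# Push three single-element chunks through a length-4 buffer.
buf = np.zeros(4)
for chunk in (np.array([1.0]), np.array([2.0]), np.array([3.0])):
    buf = fifo_buffer_ref(chunk, buf)
print(buf)  # [0. 1. 2. 3.]
```

The oldest elements fall off the front as new data is appended at the back, which is exactly the state a convolutional layer needs in order to produce one new output per incoming sample.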
Usage. See topi/tests/python/test_fifo_buffer.py.
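To sketch the end-to-end idea behind the 1D-convolution test (a plain-NumPy illustration under assumed semantics, not the actual test code): feeding one sample at a time through a FIFO buffer sized to the receptive field reproduces the batch result of a dilated convolution over the whole sequence, with one new output per step.

```python
import numpy as np

def fifo_push(buffer, sample):
    """Shift the FIFO buffer left by one and append the new sample."""
    return np.concatenate((buffer[1:], [sample]))

def dilated_conv1d(x, w, dilation):
    """'Valid' dilated 1D convolution over a full sequence."""
    k = len(w)
    rf = (k - 1) * dilation + 1              # receptive field
    return np.array([sum(x[t + j * dilation] * w[j] for j in range(k))
                     for t in range(len(x) - rf + 1)])

w = np.array([0.5, -1.0, 2.0])
dilation = 2
rf = (len(w) - 1) * dilation + 1             # = 5

x = np.random.default_rng(0).standard_normal(20)

# Batch result over the whole sequence.
batch_out = dilated_conv1d(x, w, dilation)

# Streaming: one sample in, one output out, no recomputation of past samples.
buf = np.zeros(rf)
stream_out = []
for t, sample in enumerate(x):
    buf = fifo_push(buf, sample)
    if t >= rf - 1:                          # buffer is fully primed
        stream_out.append(sum(buf[j * dilation] * w[j] for j in range(len(w))))

assert np.allclose(batch_out, np.array(stream_out))
```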
Limitation. Currently, the buffer op exists only in TOPI. To make it useful in practice, we want to upstream it to MXNet and other frameworks. Alternatively, we could implement a custom Relay pass so that users can annotate a stack of convolutional layers.
References
[1] "WaveNet: A Generative Model for Raw Audio." https://arxiv.org/abs/1609.03499
[2] "Efficient keyword spotting using dilated convolutions and gating." https://arxiv.org/abs/1811.07684
[3] "Fast Wavenet Generation Algorithm." https://arxiv.org/abs/1611.09482
[4] "Deep reuse: streamline CNN inference on the fly via coarse-grained computation reuse." https://dl.acm.org/citation.cfm?id=3330384
Special thanks to Thibaud Senechal (Amazon) for initially suggesting the concept of FIFO buffer.
cc @yongwww @wweic @zhiics @kevinthesun @anijain2305