Fast exponent #4790
Conversation
topi/include/topi/elemwise.h (Outdated)

        std::string name = "T_exp",
        std::string tag = kElementWise) {
      if (x->dtype == DataType::Float(32)) {
        return fast_exp(x, name, tag);
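For context, a typical `fast_exp` of this kind uses Cephes-style range reduction: split x = n·ln2 + r, approximate exp(r) with a short polynomial, and scale by 2^n through direct exponent-field construction. The scalar sketch below is hypothetical (not the actual TVM kernel), just to illustrate the technique and why it vectorizes well:

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Hypothetical scalar sketch of a Cephes-style fast exp, NOT the TVM kernel.
// exp(x) = 2^n * exp(r), where x = n*ln2 + r and |r| <= ~0.5*ln2,
// with exp(r) approximated by a degree-7 Taylor polynomial.
float fast_exp_sketch(float x) {
  // Clamp to avoid overflow/underflow in the 2^n scaling step.
  x = std::fmin(88.0f, std::fmax(-88.0f, x));
  const float log2e = 1.442695040f;
  const float ln2 = 0.6931472f;
  float n = std::floor(x * log2e + 0.5f);  // nearest-integer exponent
  float r = x - n * ln2;                   // reduced argument
  // Horner evaluation of 1 + r + r^2/2 + ... + r^7/5040.
  float p = 1.0f / 5040.0f;
  p = p * r + 1.0f / 720.0f;
  p = p * r + 1.0f / 120.0f;
  p = p * r + 1.0f / 24.0f;
  p = p * r + 1.0f / 6.0f;
  p = p * r + 0.5f;
  p = p * r + 1.0f;
  p = p * r + 1.0f;
  // Build 2^n by writing the biased exponent field of a float directly.
  uint32_t bits = static_cast<uint32_t>(static_cast<int32_t>(n) + 127) << 23;
  float scale;
  std::memcpy(&scale, &bits, sizeof(scale));
  return p * scale;
}
```

Every step here is branch-free straight-line arithmetic, which is what makes the op vectorizable, unlike a libc `exp` call.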
Unless this `fast_exp` is guaranteed to give bit-identical output to libc `exp`, I don't think it is a good idea to use it by default. I recommend using something like an env var to enable this.
@masahi It's not identical.
Relative fast_exp error vs. TensorFlow exp is in [-4.52e-06, 4.17e-06].
Relative fast_exp error vs. NumPy exp is in [-3.11e-06, 3.10e-06].
How about using it only if enabled via a cmake option?
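Error bounds like those quoted above come from comparing the approximation against a reference `exp` over a dense grid. A quick NumPy check along those lines, using a hypothetical polynomial `fast_exp` as a stand-in for the topi kernel:

```python
import numpy as np

def fast_exp(x):
    # Hypothetical stand-in for topi's fast_exp, NOT the actual kernel:
    # exp(x) = 2^n * p(r), where x = n*ln2 + r and p is a Taylor polynomial.
    x = np.clip(x.astype(np.float32), -88.0, 88.0)
    n = np.floor(x * np.float32(1.442695) + np.float32(0.5))
    r = x - n * np.float32(0.6931472)
    p = np.full_like(r, np.float32(1.0 / 5040.0))
    for c in (1/720, 1/120, 1/24, 1/6, 0.5, 1.0, 1.0):
        p = p * r + np.float32(c)
    return p * np.exp2(n.astype(np.float32))

# Measure relative error against NumPy's float32 exp on a dense grid.
x = np.linspace(-10.0, 10.0, 100001).astype(np.float32)
rel_err = (fast_exp(x) - np.exp(x)) / np.exp(x)
print(rel_err.min(), rel_err.max())
```

The exact bounds depend on the polynomial used; the point is that the error is measured relative to a trusted reference over the range of interest.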
Perhaps a better way would be to have a separate operator `fast_exp`, then have a pass (fast-math) in Relay that rewrites `exp` into `fast_exp`.
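The rewrite pass being proposed amounts to a tree traversal that swaps the operator while leaving everything else intact. A minimal sketch of the idea, using an illustrative `Call` node class rather than the actual Relay IR:

```python
from dataclasses import dataclass
from typing import List

# Illustrative expression node, NOT the Relay IR; real passes would use
# Relay's ExprMutator infrastructure instead.
@dataclass
class Call:
    op: str
    args: List["Call"]

def fast_math_rewrite(expr: Call) -> Call:
    """Recursively rewrite exp -> fast_exp, as a fast-math pass would."""
    args = [fast_math_rewrite(a) for a in expr.args]
    op = "fast_exp" if expr.op == "exp" else expr.op
    return Call(op, args)

# add(exp(x), y) becomes add(fast_exp(x), y); the rest is untouched.
e = Call("add", [Call("exp", [Call("x", [])]), Call("y", [])])
print(fast_math_rewrite(e).args[0].op)
```

Keeping the rewrite in a pass (instead of a cmake flag or env var) means the accuracy/speed trade-off stays a per-compilation decision.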
I like @tqchen's solution. If you use a cmake option, it is not configurable after libtvm.so is built. It requires more work, but it can be done in a later PR. This PR can be merged with a topi-only change, including test cases.
I know what I am talking about here because I also implemented fast_exp for my internal work in the past. Accurate exp is very slow, and high accuracy is not required for inference. The biggest benefit is that it enables vectorization when it is written in topi (in my case it was HalideIR). Vectorizing exp was the main reason to introduce the op-fusion improvement in #1548.
How about having 3 new Relay contrib operators: contrib.fast_exp, contrib.fast_tanh, contrib.fast_softmax. We can then add a Relay pass at opt_level 4 that legalizes these functions to their approximate counterparts.
Edit - Sorry, I should have explained why these 3. For softmax, we are essentially playing with the exp op. Softmax takes substantial time in SSD models, where the input shape is very large. For tanh, we already have a fast_tanh that is enabled by default. We should change that.
@alexgl-github Test cases are absolutely required for a new operator like this.
@masahi @anijain2305 @FrozenGene Would you mind reviewing again?
I have some silly questions: when should we switch to fast_exp, given that it lives in topi? Do we expect users to select it? Does this mean the op is only available in topi, but not in Relay?
@zhiics In a separate PR we'll introduce a Relay optimization pass that selects fast_exp when opt_level=4.
LGTM
Can this get in? I will work on the Relay changes.
@tqchen @FrozenGene Can you please check whether the changes you requested are addressed?
Overall looks OK; it would be great if we can decide on a consistent naming convention. In this case, we can have
Right. I think
LGTM. Minor comment. Please fix it.
@tqchen please give an approval.
Let's get this in - @tqchen
ping @tqchen
lgtm from my side
Thanks @alexgl-github @anijain2305 @masahi @FrozenGene !
Thanks for contributing to TVM! Please refer to guideline https://docs.tvm.ai/contribute/ for useful information and tips. After the pull request is submitted, please request code reviews from Reviewers by @ them in the pull request thread.