
[Relay] Handle float16 constants & fix BatchNorm #3260

Merged
1 commit merged into apache:master on May 31, 2019

Conversation

@cbalint13 (Contributor) commented May 30, 2019

This PR fixes:

  • constant expressions targeted as float16 in Relay;
  • the batchnorm module is no longer hard-coded to float32;
  • the Relay testcase is extended to cover float16 as well.

It was tested on a real Yolo-Tiny net targeted as float16; it now works well, including auto-tuning.

It is based on @anijain2305's suggestion from this discuss.tvm.ai thread.
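For context, a minimal sketch of the kind of coverage the extended testcase aims at, using the Relay Python API (shapes, names, and the exact pass invocations here are illustrative assumptions, not the PR's actual test):

import numpy as np
import tvm
from tvm import relay

dtype = "float16"
x = relay.var("x", shape=(1, 4, 8, 8), dtype=dtype)
gamma = relay.var("gamma", shape=(4,), dtype=dtype)
beta = relay.var("beta", shape=(4,), dtype=dtype)
mean = relay.var("mean", shape=(4,), dtype=dtype)
var = relay.var("var", shape=(4,), dtype=dtype)

# batch_norm returns a tuple; [0] is the normalized output.
y = relay.nn.batch_norm(x, gamma, beta, mean, var)[0]
# A float16 constant expression of the kind this PR makes representable.
y = relay.add(y, relay.const(np.float16(1.0)))

mod = tvm.IRModule.from_expr(relay.Function([x, gamma, beta, mean, var], y))
mod = relay.transform.InferType()(mod)
# SimplifyInference unpacks batch_norm into elementwise ops.
mod = relay.transform.SimplifyInference()(mod)
print(mod)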

@tqchen (Member) commented May 30, 2019

Thanks for the contribution. Please request reviews from reviewers.

@cbalint13 (Contributor, Author)

cc @jroesch @MarisaKirisame @wweic @icemelon9

Please help with the review.

@@ -52,7 +53,6 @@ Expr BatchNormToInferUnpack(const Attrs attrs,
}

int axis = param->axis;
auto ttype = tdata.as<TensorTypeNode>();
CHECK(ttype);
Review comment (Member):

Let's also move this CHECK to the beginning of the function.

// convert to float16
// storage is uint16_t
*static_cast<DType*>(arr->data) =
__truncXfYf2__<float, uint32_t, 23, uint16_t, uint16_t, 10>(static_cast<float>(value));
@vinx13 (Member):

Please add an assertion that T is float32.

@cbalint13 (Contributor, Author) commented May 30, 2019

@vinx13

  • static_cast<float>(value) makes sure __truncXfYf2__() is fed a float32 (a rough illustration of what that conversion computes follows this list);
  • unsure: do we still want to assert that T is float32, or should we let T be any type?
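For illustration only, a rough numpy equivalent of what the C++ call above computes (assuming numpy's float32-to-float16 rounding matches the intent of __truncXfYf2__; this is not code from the PR):

import numpy as np

value = 0.1
f32 = np.float32(value)       # static_cast<float>(value)
f16 = f32.astype(np.float16)  # float32 -> float16 conversion
bits = f16.view(np.uint16)    # the raw uint16_t bit pattern that gets stored
print(f32, f16, hex(int(bits)))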

Review comment (Contributor):

This bit manipulation seems to bring a certain level of tediousness/ugliness into the TVM codebase.

A way to resolve this would be to keep the float constants always in FP32 and insert a Relay cast operation to FP16 if need be. The cast would then become part of fold_constant in the Relay graph, and LLVM would generate the FP32-to-FP16 conversion (hiding the bit manipulation). A sketch of this alternative follows below.

I do not have a strong opinion either way. What do you think @vinx13?
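A rough sketch of that alternative (illustrative only, using the standard Relay FoldConstant pass; not code from this PR): the literal stays in FP32, an explicit cast to FP16 is inserted, and constant folding pre-computes the FP16 value at compile time.

import tvm
from tvm import relay

c32 = relay.const(3.14159, dtype="float32")  # keep the literal in FP32
c16 = relay.cast(c32, "float16")             # explicit Relay cast to FP16

mod = tvm.IRModule.from_expr(relay.Function([], c16))
mod = relay.transform.FoldConstant()(mod)    # folds the cast at compile time
print(mod)  # the body becomes a single float16 constant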

@jroesch (Member):
Because we support arbitrary-bit types in the IR, we should really use gmp and mpfr for storing literals. It is bad practice to store them in finite types, as truncation may occur and it is impossible to test programmatically whether truncation occurred without weird hacks. This is out of the scope of this PR, unfortunately.

@vinx13 (Member) commented May 31, 2019

@cbalint13 ignore my comments :) the static cast is sufficient here.

I agree with @jroesch that we should do the pre-computation in higher precision; we can do this in the future.


@vinx13 merged commit 584a32a into apache:master on May 31, 2019
@vinx13 (Member) commented May 31, 2019

Thanks @cbalint13 @wweic @jroesch @anijain2305, this is merged.
