enable TF remapper optimizer #1418
Conversation
TF supports a remapper optimizer, which remaps subgraphs onto more efficient implementations by replacing commonly occurring subgraphs with optimized, fused monolithic kernels. However, its support is limited: (1) the pattern is MatMul + BiasAdd (not Add) + Activation; (2) the dtype is float32 (not float64); (3) the activation is Tanh; (4) TF is built with MKL and MKL is in use. This commit replaces Add with BiasAdd in the NN. The speed of a single op can be improved by about 20% when TF uses MKL and precision is set to float32. One can find the `_MklNativeFusedMatMul` op in the profiler.

See also:
- https://www.tensorflow.org/guide/graph_optimization
- https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/grappler/optimizers/remapper.cc

(cherry picked from commit 8f2dc44)
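To make the pattern concrete, here is a minimal sketch of the kind of change involved (not the project's actual code; the shapes and variable names are made up for illustration):

```python
import tensorflow as tf

w = tf.Variable(tf.random.normal([64, 32], dtype=tf.float32))
b = tf.Variable(tf.zeros([32], dtype=tf.float32))

@tf.function  # grappler (including the remapper) runs on traced graphs
def layer(x):
    # Before: "tf.matmul(x, w) + b" lowers to AddV2, which the remapper
    # does not match.
    # return tf.math.tanh(tf.matmul(x, w) + b)

    # After: BiasAdd completes the MatMul + BiasAdd + Tanh pattern that
    # the remapper (TF >= 2.4, MKL build, float32) can fuse into a
    # single _MklNativeFusedMatMul kernel.
    return tf.math.tanh(tf.nn.bias_add(tf.matmul(x, w), b))

y = layer(tf.random.normal([128, 64], dtype=tf.float32))
```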
Codecov Report
@@            Coverage Diff             @@
##            devel    #1418      +/-   ##
==========================================
+ Coverage   74.55%   75.72%   +1.17%
==========================================
  Files          92       92
  Lines        7623     7650      +27
==========================================
+ Hits         5683     5793     +110
+ Misses       1940     1857      -83
Is this change compatible with TF 1.x?
This PR only replaces Add with BiasAdd, so it should be compatible. The support for tanh + MKL was introduced in tensorflow/tensorflow#42173 (v2.4). I don't know why only FP32 is supported.
I found that Intel oneDNN does not support fp64 at all...
I have another idea: we could implement a custom remapper optimizer. TF provides the interface for this.
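As a related, simpler knob (not a custom grappler pass), the built-in remapping can be toggled through `tf.config`, which is handy for A/B profiling; a sketch:

```python
import tensorflow as tf

# Turn the built-in remapper off to get a baseline trace, then back on
# to verify that the fused _MklNativeFusedMatMul op appears in the
# profiler when remapping is active.
tf.config.optimizer.set_experimental_options({"remapping": False})
# ... run and profile the model here ...
tf.config.optimizer.set_experimental_options({"remapping": True})
```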
[Figure: original graph; ops include MklMatMul, AddV2, and Tanh.]

[Figure: new graph; `_MklNativeFusedMatMul` is used here.]