Fix to threshold's width optimization #866

mmrahorovic · 2023-07-31T11:31:06Z

The width of the thresholds is based on the widths of the input datatype and the threshold datatype. In most cases the input comes from an MVU, whose output datatype defaults to 32 bits. Since MinimizeAccumulatorWidth is applied iteratively in a loop, the output datatype of the MVU is minimized before the Thresholding_Batch layer is considered. However, the InferDataTypes transformation must be run in each loop iteration to propagate this information, so that the succeeding Thresholding_Batch layer sees the minimized bit-width of its input and does not default to 32-bit thresholds.
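To illustrate the ordering problem, here is a minimal toy sketch in plain Python. It does not use the FINN or QONNX APIs: `Node`, `infer_datatypes`, `minimize_widths`, and `threshold_width` are hypothetical names, the 14-bit accumulator is an arbitrary example value, and the rule that the threshold width simply equals the input tensor width is a simplifying assumption.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    op_type: str            # "MVAU" or "Thresholding_Batch"
    inp: str                # input tensor name
    out: str                # output tensor name
    acc_bits: int = 32      # MVAU only: accumulator/output width
    thresh_bits: int = 32   # Thresholding_Batch only: threshold width


def infer_datatypes(nodes, tensor_bits):
    """Toy stand-in for datatype inference: publish each MVAU's
    accumulator width as the width of its output tensor."""
    for node in nodes:
        if node.op_type == "MVAU":
            tensor_bits[node.out] = node.acc_bits


def minimize_widths(nodes, tensor_bits, needed_acc_bits, propagate):
    """Visit nodes in order and shrink their widths (toy model)."""
    for node in nodes:
        if node.op_type == "MVAU":
            node.acc_bits = needed_acc_bits[node.name]
        elif node.op_type == "Thresholding_Batch":
            # Simplification: threshold width follows the input tensor width.
            node.thresh_bits = tensor_bits[node.inp]
        if propagate:
            # Refresh tensor widths after every node, as in the fix.
            infer_datatypes(nodes, tensor_bits)


def threshold_width(propagate):
    nodes = [
        Node("mvau0", "MVAU", inp="x", out="acc"),
        Node("thres0", "Thresholding_Batch", inp="acc", out="y"),
    ]
    # The MVAU output tensor starts at the 32-bit default.
    tensor_bits = {"x": 8, "acc": 32, "y": 4}
    minimize_widths(nodes, tensor_bits, {"mvau0": 14}, propagate)
    return nodes[1].thresh_bits


print(threshold_width(propagate=False))  # 32: stale tensor width is used
print(threshold_width(propagate=True))   # 14: minimized width is visible
```

Without propagation, the thresholding node still reads the stale 32-bit tensor annotation even though the preceding node was already minimized. With propagation after each node, it sees the reduced width.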


mmrahorovic added 4 commits July 31, 2023 12:25

[custom op]: set output datatype MVAU given no activation function

5615d8d

[custom op]: update tensor datatype for consistency

153c2d4

[minimize acc width]: apply InferDataTypes to propagate changes in each loop iteration

f367a5a

[custom op]: set outputDataType in case of no activation

763fa48

auphelia marked this pull request as ready for review August 4, 2023 15:45

auphelia added 2 commits August 4, 2023 17:09

Merge upstream/dev into fix/minimize_bitwidth_thresholds

6bf77f6

[Custom Op] Delete obsolete lines after merging with dev

f52871d

auphelia merged commit fd72d48 into Xilinx:dev Aug 4, 2023
