Fix the FP16 precision problem of add_n. #50129
Conversation
Your PR was submitted successfully. Thank you for contributing to this open source project!
The branch was force-pushed from c309274 to 94ea8a5.
      }
    }
-   out[id] = total;
+   out[id] = static_cast<T>(total);
    id += blockDim.x * gridDim.x;
  }
}
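For context, a minimal sketch of the pattern this hunk is part of (the kernel name, signature, and MPType default below are assumptions for illustration; in the actual PR the change lands in Paddle's multi-input sum kernel, where the accumulator type is selected via a trait such as MPTypeTrait): the per-thread sum is kept in a wider type such as float even when T is __half, and is narrowed back to T only on the final store, so the intermediate additions do not round at fp16 precision.

#include <cstdint>
#include <cuda_fp16.h>

// Sketch: sum n_arrays input arrays element-wise into out, accumulating in MPType.
template <typename T, typename MPType = float>
__global__ void SumArraysSketch(const T* const* ins,
                                T* out,
                                int64_t n,
                                int n_arrays) {
  int64_t id = blockIdx.x * blockDim.x + threadIdx.x;
  while (id < n) {
    MPType total = static_cast<MPType>(0);           // wide accumulator
    for (int i = 0; i < n_arrays; ++i) {
      if (ins[i] != nullptr) {
        total += static_cast<MPType>(ins[i][id]);    // widen each operand before adding
      }
    }
    out[id] = static_cast<T>(total);                 // single rounding back to T, as in the diff
    id += blockDim.x * gridDim.x;
  }
}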
Changing only this place is probably not enough. Some branches in AddNKernel dispatch to other implementations; for example, when there are exactly 2 tensors, the Eigen-based implementation is called, and the addition there also needs to be promoted to fp32.
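To illustrate the point for the two-tensor branch (in Paddle that branch goes through an Eigen-based implementation; the hand-written kernel below, with assumed names, is only a sketch of the same promotion idea): both operands are widened to float before the add, and the result is narrowed back to T once.

#include <cstdint>
#include <cuda_fp16.h>   // needed when T = __half

// Sketch: element-wise out = x + y with the addition performed in float.
template <typename T>
__global__ void AddTwoSketch(const T* x, const T* y, T* out, int64_t n) {
  for (int64_t id = blockIdx.x * blockDim.x + threadIdx.x; id < n;
       id += blockDim.x * gridDim.x) {
    float sum = static_cast<float>(x[id]) + static_cast<float>(y[id]);
    out[id] = static_cast<T>(sum);
  }
}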
Resolved.
LGTM
* Fix scale kernel for low precision, cherry pick #50998.
* Fix the FP16 precision problem of add_n. (#50129)
* Change squared_l2_norm to reuse ReduceKernel, and register fp16 and bf16 kernel, which is cherry pick #48315.
* Cherry-pick the fix of MPTypeTrait in KP, which is implemented in #50993.
* Cherry-pick the multi-precision support of AdamW for bf16, #48041.
* Fix compiling error.
* Cherry-pick the fix of CubTensorReduceImpl for bfloat16 in #50993.
* Fix unittest.

Co-authored-by: liuruyan <[email protected]>
PR types
Performance optimization
PR changes
Others
Describe
Fix the FP16 precision problem of add_n.
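As a standalone illustration of the precision problem being fixed (this demo is not part of the PR and assumes a GPU with native fp16 arithmetic, sm_53 or newer): summing many fp16 values with an fp16 accumulator stalls once the running total's spacing exceeds the addends, while accumulating in fp32 and casting back once at the end stays exact for this input.

#include <cstdio>
#include <cuda_fp16.h>

__global__ void SumHalfNaive(const __half* in, int n, float* out) {
  __half total = __float2half(0.f);  // fp16 accumulator: rounds after every add
  for (int i = 0; i < n; ++i) total = __hadd(total, in[i]);
  *out = __half2float(total);
}

__global__ void SumHalfPromoted(const __half* in, int n, float* out) {
  float total = 0.f;                 // fp32 accumulator: only the final cast rounds to fp16
  for (int i = 0; i < n; ++i) total += __half2float(in[i]);
  *out = __half2float(__float2half(total));
}

int main() {
  const int n = 4096;  // exact sum of n ones is 4096
  __half* in = nullptr;
  float* out = nullptr;
  cudaMallocManaged(&in, n * sizeof(__half));
  cudaMallocManaged(&out, 2 * sizeof(float));
  for (int i = 0; i < n; ++i) in[i] = __float2half(1.0f);

  SumHalfNaive<<<1, 1>>>(in, n, &out[0]);
  SumHalfPromoted<<<1, 1>>>(in, n, &out[1]);
  cudaDeviceSynchronize();

  // Typical output: "fp16 accumulator: 2048, fp32 accumulator: 4096".
  // The fp16 sum stops growing at 2048 because 2048 + 1 rounds back to 2048
  // in half precision, which is the kind of error add_n hit before this fix.
  printf("fp16 accumulator: %g, fp32 accumulator: %g\n", out[0], out[1]);

  cudaFree(in);
  cudaFree(out);
  return 0;
}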