【PaddlePaddle Hackathon 3 No.37】为 Paddle 优化 argmin_argmax op 在 GPU 上的计算性能 #46655

thunder95 · 2022-09-29T13:07:45Z

PR types

Performance optimization

PR changes

OPs

Describe

目前 Paddle 内 argmax\argmin 算子的 GPU 实现采用了 Cub 库实现，可以用 Reduce 替换，Reduce 模块的性能需进一步提升。
设计文档: PaddlePaddle/community#256

开发环境：

设备：RTX 2070s
环境：CUDA10.2，cuDNN 7

优化方法
对kps::Reduce进行改写，支持索引返回

完成优化后，Paddle与优化前的Paddle的性能对比效果:

Case No.	input_shape	dtype	axis	Paddle Perf(ms) argmin	Old Paddle Perf(ms) argmin	diff
0	[-1L, 513L, 513L, 19L]	float32	3	15.0504	15.0504	15.0504
1	[-1L, 513L, 513L, 19L]	float32	1	20.0625	20.0625	15.0504
2	[1000L, 1000L]	float32	-1	0.16095	0.16095	15.0504
3	[1000L, 1000L]	float32	0	0.7225	0.7225	15.0504

完成优化后，Paddle与Pytorch的性能对比效果如下:

Case No.	input_shape	dtype	axis	Paddle Perf(ms) argmin	Pytorh Perf(ms) argmin	diff
0	[-1L, 513L, 513L, 19L]	float32	3	10.426	10.426	15.0504
1	[-1L, 513L, 513L, 19L]	float32	1	2.4442	2.4442	15.0504
2	[1000L, 1000L]	float32	-1	0.03902	0.03902	15.0504
3	[1000L, 1000L]	float32	0	0.04725	0.04725	15.0504

… argmin_max_perf

ZzSean · 2022-10-13T07:14:03Z

paddle/phi/kernels/gpu/arg_min_max_kernel.cu

@@ -14,212 +14,295 @@

 #include "paddle/phi/kernels/arg_min_max_kernel.h"

+#include "paddle/fluid/platform/device/gpu/gpu_launch_config.h"


使用phi目录下的文件，这些头文件应该都是有的

ZzSean · 2022-10-13T07:40:59Z

paddle/phi/kernels/gpu/arg_min_max_kernel.cu

+};
+
+template <>
+struct SharedMemory<float> {


这个结构体是必要的吗？为什么不使用extern __shared__ T

ZzSean · 2022-10-13T07:47:06Z

paddle/phi/kernels/gpu/arg_min_max_kernel.cu

+}
+
+template <typename Context, typename T, typename IndType, typename CompOp>
+void ArgCUDAImpl(const Context& dev_ctx,


函数命名最好还是能够清晰释义

ZzSean · 2022-10-13T07:48:55Z

除此以外需要先解决下CI问题

paddle-bot · 2023-10-17T06:33:01Z

Since you haven't replied for more than a year, we have closed this issue/pr.
If the problem is not solved or there is a follow-up one, please reopen it at any time and we will continue to follow up.
由于您超过一年未回复，我们将关闭这个issue/pr。
若问题未解决或有后续问题，请随时重新打开，我们会继续跟进。

thunder95 added 2 commits September 29, 2022 12:56

first try

db84c0d

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

50fc605

… argmin_max_perf

thunder95 changed the title ~~【PaddlePaddle Hackathon 3 No.31】为 Paddle 优化 argmin_argmax op 在 GPU 上的计算性能~~ 【PaddlePaddle Hackathon 3 No.37】为 Paddle 优化 argmin_argmax op 在 GPU 上的计算性能 Sep 29, 2022

thunder95 mentioned this pull request Sep 29, 2022

【PaddlePaddle Hackathon 第三期】任务总览 #43938

Closed

paddle-bot-old bot added the contributor External developers label Sep 29, 2022

luotao1 assigned luotao1, ZzSean and Ligoml Sep 30, 2022

thunder95 added 2 commits October 1, 2022 17:54

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

66d855d

… argmin_max_perf

second try, not well as expected

3bcccbe

PaddlePaddle locked and limited conversation to collaborators Oct 13, 2022

PaddlePaddle unlocked this conversation Oct 13, 2022

ZzSean reviewed Oct 13, 2022

View reviewed changes

luotao1 mentioned this pull request Aug 10, 2023

算子GPU性能优化 #56119

Closed

paddle-bot bot closed this Oct 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

【PaddlePaddle Hackathon 3 No.37】为 Paddle 优化 argmin_argmax op 在 GPU 上的计算性能 #46655

【PaddlePaddle Hackathon 3 No.37】为 Paddle 优化 argmin_argmax op 在 GPU 上的计算性能 #46655

thunder95 commented Sep 29, 2022 •

edited

Loading

ZzSean Oct 13, 2022

ZzSean Oct 13, 2022

ZzSean Oct 13, 2022

ZzSean commented Oct 13, 2022

paddle-bot bot commented Oct 17, 2023

		@@ -14,212 +14,295 @@

		#include "paddle/phi/kernels/arg_min_max_kernel.h"

		#include "paddle/fluid/platform/device/gpu/gpu_launch_config.h"

【PaddlePaddle Hackathon 3 No.37】为 Paddle 优化 argmin_argmax op 在 GPU 上的计算性能 #46655

【PaddlePaddle Hackathon 3 No.37】为 Paddle 优化 argmin_argmax op 在 GPU 上的计算性能 #46655

Conversation

thunder95 commented Sep 29, 2022 • edited Loading

PR types

PR changes

Describe

ZzSean Oct 13, 2022

Choose a reason for hiding this comment

ZzSean Oct 13, 2022

Choose a reason for hiding this comment

ZzSean Oct 13, 2022

Choose a reason for hiding this comment

ZzSean commented Oct 13, 2022

paddle-bot bot commented Oct 17, 2023

thunder95 commented Sep 29, 2022 •

edited

Loading