At present, Paddle has only sgemm-based and cuDNN-based convolution implementations. When training or running inference without a GPU, only the sgemm-based convolution can be selected, and its performance is not optimal in many scenarios. See here.

We also ran into convolution performance problems when deploying Paddle in some production environments. Since there are many excellent convolution libraries available, I think we could try to integrate them into Paddle to improve its convolution performance.
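For context, the sgemm-based convolution mentioned above is typically implemented as im2col followed by a single matrix multiplication. The following is a minimal numpy sketch of that scheme (stride 1, no padding; function names are mine, not Paddle's), illustrating why its performance depends entirely on the underlying GEMM:

```python
import numpy as np

def im2col(x, kh, kw):
    # x: (C, H, W) single image; unfold each kh x kw patch into a column.
    C, H, W = x.shape
    out_h, out_w = H - kh + 1, W - kw + 1
    cols = np.empty((C * kh * kw, out_h * out_w), dtype=x.dtype)
    idx = 0
    for c in range(C):
        for i in range(kh):
            for j in range(kw):
                cols[idx] = x[c, i:i + out_h, j:j + out_w].reshape(-1)
                idx += 1
    return cols

def conv2d_gemm(x, w):
    # w: (F, C, kh, kw) filters. One GEMM computes all output channels:
    # (F, C*kh*kw) @ (C*kh*kw, out_h*out_w) -> (F, out_h*out_w).
    F, C, kh, kw = w.shape
    cols = im2col(x, kh, kw)
    out = w.reshape(F, -1) @ cols  # this is the sgemm call in fp32
    out_h = x.shape[1] - kh + 1
    out_w = x.shape[2] - kw + 1
    return out.reshape(F, out_h, out_w)
```

The im2col step duplicates input data kh*kw times, which is one reason this approach can lose to direct or Winograd convolutions on memory-constrained devices.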
Convolution layer performance for the 3x3-kernel layers of a ResNet model, measured on a Raspberry Pi with Paddle built against NNPACK. Some layers perform better with the gemm algorithm, while others perform better with the wt8x8 (Winograd 8x8) algorithm.
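Since neither algorithm wins for every layer, one option is to pick the algorithm per layer by benchmarking each candidate on that layer's shapes, similar to cuDNN's algorithm search. A minimal sketch (the wrapper callables and shape dict are hypothetical, not an existing Paddle API):

```python
import timeit
import numpy as np

def pick_conv_algorithm(layer_shape, algorithms, repeats=3):
    """Return the name of the fastest algorithm for one layer.

    layer_shape: {"input": (C, H, W), "kernel": (F, C, kh, kw)}
    algorithms:  dict mapping a name (e.g. "gemm", "wt8x8") to a
                 callable(x, w) wrapping that convolution backend.
    """
    x = np.random.rand(*layer_shape["input"]).astype(np.float32)
    w = np.random.rand(*layer_shape["kernel"]).astype(np.float32)
    timings = {
        # best-of-N wall time per algorithm on this layer's shapes
        name: min(timeit.repeat(lambda f=f: f(x, w), number=1, repeat=repeats))
        for name, f in algorithms.items()
    }
    return min(timings, key=timings.get)
```

The selection result could be cached per layer shape at model-load time, so the benchmark cost is paid once rather than on every forward pass.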