high-performance SingleThreadedWorkQueue #35086

liutiexing · 2021-08-23T09:05:49Z

PR types

New features

PR changes

Others

Describe

Builds on Eigen's implementation and refines its internals to provide a high-performance SingleThreadedWorkQueue.

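Changes relative to Eigen's original implementation:
1) Changed memory alignment and memory allocation to further improve performance.
2) Changed the EventCount interface to make it easier to use.
3) Added a WaitQueueEmpty interface so that users can wait for tasks to finish without tracking the tasks themselves.
4) Replaced Eigen's custom macros with C++ standard library macros and functions.
5) Planned follow-up: replace the std::mutex in RunQueue with a spinlock to improve performance.
6) Planned follow-up: substantially rework the multi-threaded branch of ThreadPool, including the spin-wait logic and the spin conditions, to improve performance.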
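
To illustrate what item 3 offers callers, here is a minimal sketch of a single-threaded work queue with AddTask and WaitQueueEmpty. It is not the code from this PR: it assumes a plain std::mutex and std::condition_variable design instead of the Eigen-derived RunQueue and EventCount, and the names SimpleSingleThreadedWorkQueue and RunCounterDemo are hypothetical.

```cpp
#include <cassert>
#include <condition_variable>
#include <deque>
#include <functional>
#include <mutex>
#include <thread>
#include <utility>

// Hypothetical illustration only; not the class added by this PR.
class SimpleSingleThreadedWorkQueue {
 public:
  SimpleSingleThreadedWorkQueue() : worker_([this] { WorkerLoop(); }) {}

  ~SimpleSingleThreadedWorkQueue() {
    {
      std::lock_guard<std::mutex> lock(mu_);
      stop_ = true;
    }
    cv_task_.notify_one();
    worker_.join();  // The worker drains remaining tasks before exiting.
  }

  SimpleSingleThreadedWorkQueue(const SimpleSingleThreadedWorkQueue&) = delete;
  SimpleSingleThreadedWorkQueue& operator=(
      const SimpleSingleThreadedWorkQueue&) = delete;

  // Enqueue a task; it runs later on the single worker thread.
  void AddTask(std::function<void()> fn) {
    {
      std::lock_guard<std::mutex> lock(mu_);
      tasks_.push_back(std::move(fn));
      ++pending_;
    }
    cv_task_.notify_one();
  }

  // Block until every task submitted so far has finished running.
  void WaitQueueEmpty() {
    std::unique_lock<std::mutex> lock(mu_);
    cv_empty_.wait(lock, [this] { return pending_ == 0; });
  }

 private:
  void WorkerLoop() {
    for (;;) {
      std::function<void()> task;
      {
        std::unique_lock<std::mutex> lock(mu_);
        cv_task_.wait(lock, [this] { return stop_ || !tasks_.empty(); });
        if (tasks_.empty()) return;  // Stop requested and nothing left.
        task = std::move(tasks_.front());
        tasks_.pop_front();
      }
      task();  // Run outside the lock so AddTask is never blocked by a task.
      {
        std::lock_guard<std::mutex> lock(mu_);
        if (--pending_ == 0) cv_empty_.notify_all();
      }
    }
  }

  std::mutex mu_;
  std::condition_variable cv_task_;   // Signals "a task is available".
  std::condition_variable cv_empty_;  // Signals "all tasks have completed".
  std::deque<std::function<void()>> tasks_;
  int pending_ = 0;  // Tasks queued or currently running.
  bool stop_ = false;
  std::thread worker_;  // Declared last so the other members exist first.
};

// Usage: submit tasks, then wait once instead of tracking each task.
int RunCounterDemo() {
  SimpleSingleThreadedWorkQueue queue;
  int counter = 0;
  for (int i = 0; i < 100; ++i) {
    queue.AddTask([&counter] { ++counter; });
  }
  queue.WaitQueueEmpty();
  return counter;
}
```

The caller never keeps a handle per task; the queue's own pending count is enough to know when everything has run, which is the convenience item 3 describes.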

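Performance test conclusion:
Performance is better than both the ThreadPool that Paddle previously used and TFRT's SingleThreadedWorkQueue.
Test method:
1) Dump the OP computation graph of the PTB model to a file, then reconstruct the graph in the test program.
2) Simulate OP execution with a piece of pure CPU computation (a for loop that counts).
3) Execute the computation graph in topological order, submitting each operator to the SingleThreadedWorkQueue through the AddTask method.
4) Run 2000 batches, executing the computation graph once per batch, to simulate training.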
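
The test method above can be sketched roughly as follows. This is an assumption-laden illustration, not the author's benchmark: the Graph type, TopologicalOrder, SimulateOp, RunBatches, and RunInlineDemo are hypothetical names, the graph is a tiny hand-written diamond rather than the dumped PTB graph, and the submit callback stands in for the queue's AddTask.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <queue>
#include <vector>

// Hypothetical graph representation: successors[i] lists the ops that
// depend on op i.
using Graph = std::vector<std::vector<int>>;

// Kahn's algorithm: returns the ops in a valid topological order.
std::vector<int> TopologicalOrder(const Graph& successors) {
  std::vector<int> in_degree(successors.size(), 0);
  for (const auto& outs : successors) {
    for (int v : outs) ++in_degree[v];
  }
  std::queue<int> ready;
  for (int i = 0; i < static_cast<int>(successors.size()); ++i) {
    if (in_degree[i] == 0) ready.push(i);
  }
  std::vector<int> order;
  while (!ready.empty()) {
    int u = ready.front();
    ready.pop();
    order.push_back(u);
    for (int v : successors[u]) {
      if (--in_degree[v] == 0) ready.push(v);
    }
  }
  return order;
}

// Stand-in for an OP: pure CPU work in the form of a counting loop.
// volatile keeps the compiler from removing the loop.
std::uint64_t SimulateOp(std::uint64_t iterations) {
  volatile std::uint64_t count = 0;
  for (std::uint64_t i = 0; i < iterations; ++i) count = count + 1;
  return count;
}

// Stand-in for the queue's AddTask: accepts one task to run.
using AddTaskFn = std::function<void(std::function<void()>)>;

// Submits every op of the graph once per batch, in topological order.
// Returns the number of tasks submitted. With an asynchronous queue the
// caller must still wait for the queue to drain afterwards.
std::uint64_t RunBatches(const Graph& graph, int num_batches,
                         std::uint64_t iterations_per_op,
                         const AddTaskFn& add_task) {
  const std::vector<int> order = TopologicalOrder(graph);
  std::uint64_t submitted = 0;
  for (int batch = 0; batch < num_batches; ++batch) {
    for (std::size_t i = 0; i < order.size(); ++i) {
      add_task([iterations_per_op] { SimulateOp(iterations_per_op); });
      ++submitted;
    }
  }
  return submitted;
}

// Usage with an inline executor (runs each task immediately) so the
// sketch stays single-threaded and deterministic.
std::uint64_t RunInlineDemo() {
  Graph diamond = {{1, 2}, {3}, {3}, {}};  // 0 -> {1, 2} -> 3
  std::uint64_t executed = 0;
  AddTaskFn run_inline = [&executed](std::function<void()> task) {
    task();
    ++executed;
  };
  RunBatches(diamond, 5, 1000, run_inline);
  return executed;
}
```

In a real measurement the callback would forward to the work queue under test, and the timed region would end only after the queue reports that it is empty.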

paddle-bot-old · 2021-08-23T09:05:55Z

Thanks for your contribution!
Please wait for the CI results first. See Paddle CI Manual for details.

Aurelius84

Great work. LGTM.

Aurelius84 · 2021-08-23T13:26:39Z

paddle/fluid/framework/new_executor/event_count.h

+
+  EventCount(const EventCount&) = delete;
+
+  void operator=(const EventCount&) = delete;


Not important, but we do already have a DISABLE_COPY_AND_ASSIGN macro.
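
For readers unfamiliar with this kind of macro, here is a minimal sketch of how such a helper typically replaces the two hand-written deleted members. The macro below is a hypothetical stand-in named DEMO_DISABLE_COPY_AND_ASSIGN, and DemoEventCount is an illustrative class; neither is taken from Paddle, whose actual macro definition may differ.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical stand-in macro; Paddle's real DISABLE_COPY_AND_ASSIGN may be
// defined differently (for example, it may also delete move operations).
#define DEMO_DISABLE_COPY_AND_ASSIGN(classname) \
  classname(const classname&) = delete;         \
  classname& operator=(const classname&) = delete

// Illustrative class, not the EventCount from this PR.
class DemoEventCount {
 public:
  DemoEventCount() = default;

 private:
  // One line replaces the two explicit "= delete" declarations.
  DEMO_DISABLE_COPY_AND_ASSIGN(DemoEventCount);
};

// Compile-time checks that copying is really disabled.
static_assert(!std::is_copy_constructible<DemoEventCount>::value,
              "copy construction should be disabled");
static_assert(!std::is_copy_assignable<DemoEventCount>::value,
              "copy assignment should be disabled");
```

Either form yields the same non-copyable type; the macro only saves repetition across classes.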

zhiqiu

LGTM for const_cast

wanghuancoder

LGTM

high-performance SingleThreadedWorkQueue

bc79b3f

Merge branch 'PaddlePaddle:develop' into workqueue_develop

aea921f

Aurelius84 approved these changes Aug 23, 2021

zhiqiu approved these changes Aug 23, 2021

wanghuancoder approved these changes Aug 24, 2021

raindrops2sea approved these changes Aug 24, 2021

wanghuancoder merged commit 751a794 into PaddlePaddle:develop Aug 25, 2021

liutiexing deleted the workqueue_develop branch August 25, 2021 14:31

liutiexing restored the workqueue_develop branch August 26, 2021 04:20
