
【PIR Dist Op Reg No.4 and No.26】 reg global_scatter and limit_by_capacity #62579

Merged (7 commits into PaddlePaddle:develop, Mar 20, 2024)

Conversation

xiaoyewww (Contributor)

PR types

Others

PR changes

Others

Description

#60436
Register the operators global_scatter and limit_by_capacity.

paddle-bot bot commented Mar 9, 2024

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@xingmingyyj (Contributor) left a comment:

- op: limit_by_capacity
  inputs:
    {expert_count : expert_count, capacity : capacity}
  outputs :
    out : Out

The indentation here needs to be adjusted.
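A correctly aligned version of this op_compat.yaml entry might look like the following sketch, using the two-space block-mapping indentation that the surrounding entries in the file use:

```yaml
# Sketch: keys of the op entry indented two spaces under the list item,
# with the mapping values indented a further two spaces.
- op : limit_by_capacity
  inputs :
    {expert_count : expert_count, capacity : capacity}
  outputs :
    out : Out
```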

@@ -875,6 +884,15 @@
inplace: (x -> out)
interfaces : paddle::dialect::InferSymbolicShapeInterface

- op : limit_by_capacity
args : (Tensor expert_count, Tensor capacity, int n_worker)
output : Tensor(Out)

Suggested change
output : Tensor(Out)
output : Tensor(out)

paddle/fluid/pir/dialect/operator/ir/ops.yaml (resolved)
@@ -1604,6 +1604,12 @@
attrs :
{pre_nms_top_n : pre_nms_topN, post_nms_top_n : post_nms_topN}

- op : global_scatter(global_scatter)

Suggested change
- op : global_scatter(global_scatter)
- op : global_scatter

@xiaoyewww (Contributor, Author):

@xingmingyyj Could you tell me where the problem is here? Is something in the yaml filled in incorrectly?


xingmingyyj commented Mar 13, 2024

> @xingmingyyj Could you tell me where the problem is here? Is something in the yaml filled in incorrectly?

You can check whether the inputs in the limit_by_capacity yaml are misaligned; the indentation looks off.

@@ -3730,6 +3736,12 @@
outputs :
{param_out: ParamOut, velocity_out: VelocityOut, master_param_out: MasterParamOut}

- op: limit_by_capacity
inputs:
{expert_count : expert_count, capacity : capacity}

Suggested change
{expert_count : expert_count, capacity : capacity}

If the names are exactly the same, no mapping is needed here.

@@ -1604,6 +1604,12 @@
attrs :
{pre_nms_top_n : pre_nms_topN, post_nms_top_n : post_nms_topN}

- op : global_scatter
inputs :
{x : X, local_count : local_count, global_count : global_count, ring_id : ring_id, use_calc_stream : use_calc_stream}

Suggested change
{x : X, local_count : local_count, global_count : global_count, ring_id : ring_id, use_calc_stream : use_calc_stream}
{x : X}

If the names are exactly the same, no mapping is needed here.

@@ -1019,6 +1019,15 @@
func : gelu_grad
composite: gelu_grad(x, out_grad, approximate, x_grad)

- backward_op : global_scatter_grad
forward : (Tensor x, Tensor local_count, Tensor global_count, int ring_id=0, bool use_calc_stream=false) -> Tensor(out)

Suggested change
forward : (Tensor x, Tensor local_count, Tensor global_count, int ring_id=0, bool use_calc_stream=false) -> Tensor(out)
forward : global_scatter (Tensor x, Tensor local_count, Tensor global_count, int ring_id=0, bool use_calc_stream=false) -> Tensor(out)

This is likely the cause of the current compilation error: the matching forward op for this backward op could not be found.
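In the PHI yaml convention, the `forward` field of a backward op must begin with the forward op's name so the code generator can pair the two definitions. A minimal sketch of the corrected entry (only the header lines are shown; the remaining fields are omitted):

```yaml
# Sketch: the forward op's name prefixes its signature, letting the
# generator locate the matching forward definition for this backward op.
- backward_op : global_scatter_grad
  forward : global_scatter (Tensor x, Tensor local_count, Tensor global_count, int ring_id=0, bool use_calc_stream=false) -> Tensor(out)
```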

@xiaoyewww (Contributor, Author):

@xingmingyyj @kangguangli Could you please review this again to check whether there are any remaining issues?

func : GlobalScatterInferMeta
kernel :
func : global_scatter
data_type : dtype

Suggested change
data_type : dtype
data_type : x

The data_type here should align with that of the old IR:

  phi::KernelKey GetExpectedKernelType(
      const framework::ExecutionContext& ctx) const override {
    return phi::KernelKey(OperatorWithKernel::IndicateVarDataType(ctx, "X"),
                          ctx.GetPlace());
  }
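Under the old IR, `IndicateVarDataType(ctx, "X")` derives the kernel key's dtype from the input variable `X`; in the new-IR yaml the same choice is expressed by pointing `data_type` at that input. A sketch of the kernel block, assuming this correspondence:

```yaml
# Sketch: data_type names the input tensor whose dtype selects the kernel,
# mirroring IndicateVarDataType(ctx, "X") in the old-IR GetExpectedKernelType.
  kernel :
    func : global_scatter
    data_type : x
```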

@xiaoyewww (Contributor, Author) replied:

Fixed, thanks.

@kangguangli kangguangli merged commit 4024e45 into PaddlePaddle:develop Mar 20, 2024
30 checks passed
@xiaoyewww xiaoyewww deleted the pir-pr branch May 10, 2024 15:10
Labels
contributor (External developers), HappyOpenSource (Happy Open Source activity issues and PRs)
5 participants