Add multi-task models Dev zsx mtl1 #255

zanshuxun · 2022-08-12T08:30:39Z

Add multi-task models: SharedBottom, ESMM, MMOE, PLE
Bugfix:
Getting "RuntimeError: 'lengths' argument should be a 1D CPU int64 tensor, but got 1D cuda:0 Long tensor" #240
DIEN example fails with a tensor type error when run on GPU #232
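
The error message quoted in #240 states the constraint directly: `lengths` must be a 1D CPU int64 tensor. Below is a minimal sketch of the usual remedy, assuming the error is raised by `torch.nn.utils.rnn.pack_padded_sequence` (the commit titled `dien lengths .cpu()` points that way, but the actual change to dien.py is not reproduced on this page). The tensor shapes and variable names are made up for illustration.

```python
import torch
from torch.nn.utils.rnn import pack_padded_sequence

# Run on GPU when one is available; the sketch also works on CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"

# Illustrative batch: 3 sequences, padded to length 4, 2 features each.
seqs = torch.randn(3, 4, 2, device=device)
lengths = torch.tensor([4, 2, 1], device=device)  # int64, lives on `device`

# The data may stay on the GPU, but `lengths` is moved to the CPU first.
packed = pack_padded_sequence(
    seqs, lengths.cpu(), batch_first=True, enforce_sorted=False
)

print(packed.data.shape)   # 4 + 2 + 1 = 7 valid time steps, 2 features
print(packed.batch_sizes)  # number of sequences still active at each step
```

Calling `.cpu()` on a tensor that is already on the CPU is a no-op, so the same line is safe on both device types.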

codecov · 2022-08-12T08:33:28Z

Codecov Report

Merging #255 (90cae06) into master (2cd84f3) will increase coverage by 0.26%.
The diff coverage is 93.23%.

@@            Coverage Diff             @@
##           master     #255      +/-   ##
==========================================
+ Coverage   92.31%   92.58%   +0.26%     
==========================================
  Files          30       35       +5     
  Lines        2069     2333     +264     
==========================================
+ Hits         1910     2160     +250     
- Misses        159      173      +14
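
The percentages in the table follow from the Hits and Lines rows. The short check below reproduces them; truncating to two decimals (rather than rounding) is an assumption inferred from these figures, not documented Codecov behavior.

```python
import math

def coverage_pct(hits, lines):
    """Coverage as the percentage of tracked lines that were hit."""
    return 100.0 * hits / lines

def truncate2(x):
    """Truncate to two decimals (assumed display rule, see lead-in)."""
    return math.floor(x * 100) / 100

before = coverage_pct(1910, 2069)  # master
after = coverage_pct(2160, 2333)   # this PR

print(truncate2(before), truncate2(after), truncate2(after - before))
# prints: 92.31 92.58 0.26
```

Hits plus Misses also add up to Lines in both columns (1910 + 159 = 2069 and 2160 + 173 = 2333).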

Flag	Coverage Δ
pytest	`92.58% <93.23%> (+0.26%)`	⬆️

Flags with carried forward coverage won't be shown.

Impacted Files	Coverage Δ
deepctr_torch/models/basemodel.py	`86.45% <87.50%> (+1.35%)`	⬆️
deepctr_torch/models/multitask/esmm.py	`90.24% <90.24%> (ø)`
deepctr_torch/models/multitask/sharedbottom.py	`91.30% <91.30%> (ø)`
deepctr_torch/models/multitask/mmoe.py	`92.53% <92.53%> (ø)`
deepctr_torch/models/multitask/ple.py	`95.87% <95.87%> (ø)`
deepctr_torch/models/__init__.py	`100.00% <100.00%> (ø)`
deepctr_torch/models/dcnmix.py	`88.37% <100.00%> (ø)`
deepctr_torch/models/dien.py	`97.23% <100.00%> (ø)`
deepctr_torch/models/multitask/__init__.py	`100.00% <100.00%> (ø)`

shenweichen · 2022-08-12T08:54:52Z

deepctr_torch/models/dcnmix.py

@@ -70,10 +70,9 @@ def __init__(self, linear_feature_columns,
                                    layer_num=cross_num, device=device)
        self.add_regularization_weight(
            filter(lambda x: 'weight' in x[0] and 'bn' not in x[0], self.dnn.named_parameters()), l2=l2_reg_dnn)
+        self.add_regularization_weight(


Identifying parameters by checking for '_list' in their names is likely to cause problems in later maintenance and iteration.
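
A minimal plain-Python sketch of the concern. The (name, parameter) pairs are hypothetical stand-ins for what `named_parameters()` might yield and are not taken from the PR; the point is only to contrast a substring match with a selection tied to explicit prefixes.

```python
# Hypothetical (name, parameter) pairs standing in for named_parameters().
named_params = [
    ("crossnet.U_list.0", "U0"),
    ("crossnet.V_list.0", "V0"),
    ("dnn.linears.0.weight", "W0"),
    # A parameter added later whose name happens to contain "_list".
    ("history_list_embedding.weight", "E0"),
]

# Fragile: any future name containing "_list" is picked up silently.
by_substring = [p for name, p in named_params if "_list" in name]

# More explicit: state which owners should be regularized.
cross_prefixes = ("crossnet.U_list", "crossnet.V_list")
by_prefix = [p for name, p in named_params if name.startswith(cross_prefixes)]

print(by_substring)  # ['U0', 'V0', 'E0']
print(by_prefix)     # ['U0', 'V0']
```

Passing the parameter containers themselves, instead of filtering by name at all, would avoid the problem entirely; whether that fits the existing `add_regularization_weight` call sites is for the author to judge.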

shenweichen · 2022-08-12T08:55:30Z

deepctr_torch/models/multitask/ple.py

+                filter(lambda x: 'weight' in x[0] and 'bn' not in x[0], self.specific_gate_dnn.named_parameters()),
+                l2=l2_reg_dnn)
+        else:
+            self.specific_gate_dnn_final_layer = nn.ModuleList(


The self.specific_gate_dnn_final_layer definitions in the if and else branches can be merged further.
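
One possible shape for the merge, as a sketch only. It assumes the two branches differ solely in the input width of the final layer (the last gate hidden size when a gate DNN exists, otherwise the raw input width). The helper name and its arguments are hypothetical and do not come from ple.py.

```python
import torch.nn as nn

def build_gate_final_layers(input_dim, gate_dnn_hidden_units, num_gates, num_outputs):
    """Build the final gate layers once, whichever branch applies."""
    # Only the input width depends on whether a gate DNN is configured.
    if len(gate_dnn_hidden_units) > 0:
        in_dim = gate_dnn_hidden_units[-1]
    else:
        in_dim = input_dim
    # Single construction site shared by both cases.
    return nn.ModuleList(
        [nn.Linear(in_dim, num_outputs, bias=False) for _ in range(num_gates)]
    )

with_gate_dnn = build_gate_final_layers(8, (4,), num_gates=2, num_outputs=3)
without_gate_dnn = build_gate_final_layers(8, (), num_gates=2, num_outputs=3)
print(with_gate_dnn[0].in_features, without_gate_dnn[0].in_features)  # 4 8
```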

shenweichen · 2022-08-12T08:55:58Z

deepctr_torch/models/multitask/ple.py

+
+        self.out = nn.ModuleList([PredictionLayer(task) for task in task_types])
+
+        self.add_regularization_weight(


The logic for adding multiple regularization weights can be optimized.
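
A sketch of one way to do that: keep the filter predicate in a single place and loop over the sub-networks instead of repeating the call. `RegularizationRegistry` is a hypothetical stand-in that only records what it receives, and the sub-network names and parameters are invented for the example.

```python
def is_regularized(name):
    """Same predicate as the repeated lambdas: weights only, no batch norm."""
    return "weight" in name and "bn" not in name

class RegularizationRegistry:
    """Hypothetical stand-in for the model; it only records its inputs."""
    def __init__(self):
        self.regularization_weight = []

    def add_regularization_weight(self, weight_list, l2=0.0):
        self.regularization_weight.append((list(weight_list), l2))

# Invented named-parameter lists, one per sub-network.
sub_networks = {
    "expert_dnn": [("linears.0.weight", "We"), ("linears.0.bias", "be")],
    "gate_dnn": [("linears.0.weight", "Wg"), ("bn.0.weight", "bn_g")],
    "tower_dnn": [("linears.0.weight", "Wt")],
}

model = RegularizationRegistry()
l2_reg_dnn = 1e-5
for named_params in sub_networks.values():
    # One call site replaces one hand-written call per sub-network.
    model.add_regularization_weight(
        [(n, p) for n, p in named_params if is_regularized(n)], l2=l2_reg_dnn
    )

print(len(model.regularization_weight))  # 3
```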

shenweichen · 2022-08-12T08:56:31Z

tests/models/multitask/MMOE_test.py

+    'num_experts, expert_dnn_hidden_units, gate_dnn_hidden_units, tower_dnn_hidden_units, task_types, '
+    'sparse_feature_num, dense_feature_num',
+    [
+        (3, (256, 128), (64,), (64,), ['binary', 'binary'], 3, 3),


The dimensions in several of the test files can be reduced.
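
A rough illustration of why this matters for test speed, counting only the dense weights of one MLP and ignoring biases. The input width of 32 and the smaller hidden sizes are hypothetical; the sizes actually chosen in the follow-up commit are not shown on this page.

```python
def mlp_weight_count(input_dim, hidden_units):
    """Number of weight entries in a plain MLP (biases ignored)."""
    dims = (input_dim,) + tuple(hidden_units)
    return sum(a * b for a, b in zip(dims, dims[1:]))

input_dim = 32  # hypothetical input width

large = mlp_weight_count(input_dim, (256, 128))  # sizes from the test above
small = mlp_weight_count(input_dim, (8, 4))      # hypothetical smaller sizes

print(large, small)  # 40960 288
```

The test still exercises the same code paths with the smaller network, since layer count and structure are unchanged.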

zanshuxun added 30 commits June 26, 2022 16:33

mtl

0671b59

inplace operation

5267cf8

1

25cf516

1

567576a

1

30d0fc7

11

71cf99a

111

29f177f

if self.num_tasks <= 1:

cbd9eea

add_regularization_weight

c00ff9f

1

0c70377

add_regularization_weight

2340721

1

1663dbc

byterec_sample.txt

64ed8fa

format

f370701

format

9f3a51c

format

a1de4a9

Improve hyperparameters and comments

9eee512

cgc pole

4d38beb

ple

113d90b

ple

ae8b626

mtl

269a305

dim

0d0ec88

1

ee23764

1

14ec373

test

310c043

dien lengths .cpu()

0f56a63

docs

5660a4b

docs

6e4611d

eg

4c73e7e

comments

80552b6

zanshuxun and others added 8 commits July 2, 2022 16:06

comments

d9d45cb

format

924dcf7

format

1efafb3

byterec_sample.txt 200

18f04ed

byterec_sample.txt 200

40b7d4c

multi_module_list

5e59953

format

7dff848

1

1c71508

shenweichen reviewed Aug 12, 2022

zanshuxun changed the title ~~Dev zsx mtl1~~ Add multi-task models Dev zsx mtl1 Aug 12, 2022

shenweichen reviewed Aug 12, 2022

wuhen added 9 commits August 12, 2022 17:00

dcnmix

f11a00f

Reduce test dimensions

3e20cb8

add_regularization_weight

2232321

data url; loss_func

4b054bd

final layer

a35ee3f

format

13de334

format

cdbfb16

add_regularization_weight

05f134c

add_regularization_weight dcnmix

90cae06

shenweichen changed the base branch from master to release August 15, 2022 12:15

shenweichen merged commit 19e09e4 into release Aug 15, 2022

shenweichen deleted the dev-zsx-mtl1 branch October 23, 2022 14:40
