Add inference program for Transformer. #727
Conversation
Force-pushed from 38a8d43 to 60dba9f.
Force-pushed from 60dba9f to ff80721.
Thank you for this work.
@@ -15,6 +15,23 @@ class TrainTaskConfig(object):
    # the params for learning rate scheduling
    warmup_steps = 4000

    # the directory for saving inference models
for saving inference models --> for saving trained models
Done.
class InferTaskConfig(object):
    use_gpu = False
    # number of sequences contained in a mini-batch
- number of sequences contained in a mini-batch --> the number of examples in one run for sequence generation.
- Please add a comment here to warn users that currently the batch size can only be set to 1.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done.
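For concreteness, a minimal sketch of how the config might read after this comment is addressed; only use_gpu and the original comment line are quoted in the diff, so the batch_size field name is an assumption:

```python
class InferTaskConfig(object):
    use_gpu = False
    # the number of examples in one run for sequence generation.
    # NOTE: currently the batch size can only be set to 1.
    batch_size = 1  # assumed field name, not quoted in the diff
```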
    # the params for beam search
    beam_size = 5
    max_length = 30
    n_best = 1
Please add a comment for n_best. It is confusing to me what the difference is between beam_size and n_best.
Done.
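A hedged sketch of how the requested comment might distinguish the two settings; the wording reflects the standard beam-search meaning of these parameters, not necessarily the author's final comment:

```python
class InferTaskConfig(object):
    # the params for beam search
    beam_size = 5    # number of candidate sequences kept at each decoding step
    max_length = 30  # decoding stops once a sequence reaches this length
    # n_best: how many of the beam_size finished candidates are written out
    # for each source sentence; it should satisfy 1 <= n_best <= beam_size.
    n_best = 1
```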
@@ -15,6 +15,23 @@ class TrainTaskConfig(object):
    # the params for learning rate scheduling
    warmup_steps = 4000

    # the directory for saving inference models
    model_dir = "transformer_model"
change the name to "trained_models"
Done.
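Putting both review comments on this hunk together, the result might read as follows (a sketch; the comment rewording and the rename to "trained_models" follow the suggestions above):

```python
class TrainTaskConfig(object):
    # the params for learning rate scheduling
    warmup_steps = 4000

    # the directory for saving trained models
    model_dir = "trained_models"
```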
                max_length,
                slf_attn_bias_flag,
                src_attn_bias_flag,
                pos_flag=1):
change "pos_flag" into "is_pos=True"
Done.
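A sketch of the signature after this change; the parameter list is assembled from the fragments quoted in this review, so the exact order and names before max_length are assumptions:

```python
def make_inputs(input_data_names,
                n_head,
                d_model,
                batch_size,
                max_length,
                slf_attn_bias_flag,
                src_attn_bias_flag,
                is_pos=True):
    # Boolean flags let call sites pass True/False and let the body test
    # `if slf_attn_bias_flag:` instead of comparing against 1.
    pass
```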
dtype="float32", | ||
append_batch_size=False) | ||
enc_input_layers = make_inputs(encoder_input_data_names, n_head, d_model, | ||
batch_size, max_length, 1, 0) |
0 --> False
Done.
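With the flags made boolean, this call would presumably become:

```python
enc_input_layers = make_inputs(encoder_input_data_names, n_head, d_model,
                               batch_size, max_length, True, False)
```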
dec_input_layers = make_inputs(decoder_input_data_names, n_head, d_model,
                               batch_size, max_length, 1, 1)
the last two 1s --> True
Done.
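Likewise for the decoder inputs, the last two arguments would become booleans:

```python
dec_input_layers = make_inputs(decoder_input_data_names, n_head, d_model,
                               batch_size, max_length, True, True)
```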
# Padding indices do not contribute to the total loss. The weights are used
# to cancel out padding indices when calculating the loss.
gold, weights = make_inputs(label_data_names, n_head, d_model, batch_size,
                            max_length, 0, 0, 0)
Make the last three parameters of make_inputs boolean parameters.
Done.
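And the label inputs, with all three trailing flags as booleans:

```python
gold, weights = make_inputs(label_data_names, n_head, d_model, batch_size,
                            max_length, False, False, False)
```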
        name=input_data_names[2]
        if slf_attn_bias_flag == 1 else input_data_names[-1],
        shape=[batch_size, n_head, max_length, max_length]
        if slf_attn_bias_flag == 1 else [batch_size, max_length, d_model],
Make src_attn_bias_flag a boolean parameter.
Done.
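With boolean flags, the `== 1` comparisons in this hunk can be dropped. A sketch under the assumption that the surrounding call is fluid's layers.data, as suggested by the dtype and append_batch_size arguments quoted earlier:

```python
import paddle.fluid.layers as layers  # assumed import, matching the fluid-era API

input_layer = layers.data(
    name=input_data_names[2]
    if slf_attn_bias_flag else input_data_names[-1],
    shape=[batch_size, n_head, max_length, max_length]
    if slf_attn_bias_flag else [batch_size, max_length, d_model],
    dtype="float32",
    append_batch_size=False)
```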
LGTM.