Add simple wrapper for beam search. #675
base: dev-static
Conversation
Provide a simple seq2seq model.
It seems that this PR may conflict with #729. Has the work in this PR been finished, so that we can begin reviewing it now? Thank you for this work.
@lcy-seso This work is almost finished except for some refinement and cleaning. However, the API for beam search is ready for review, so please feel free to start.
I see. Thank you.
High-level comments: please add plenty of comments, especially to all public methods. For methods that don't need to be exposed, please make them private.
"(default: %(default)d)")
def lstm_step(x_t, hidden_t_prev, cell_t_prev, size):
Do you need to implement a separate LSTM here? If so, should it be made private?
forget_gate = fluid.layers.sigmoid(x=linear([hidden_t_prev, x_t]))
input_gate = fluid.layers.sigmoid(x=linear([hidden_t_prev, x_t]))
output_gate = fluid.layers.sigmoid(x=linear([hidden_t_prev, x_t]))
Are all these gates the same? I remember that in TF's LSTM implementation you don't need to do 3 separate fc layers.
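The fused approach the reviewer alludes to can be sketched in plain NumPy (a minimal sketch, not the PR's code; the function name `lstm_step_fused` and the weight layout are assumptions): concatenate the input and previous hidden state, do one matmul, then slice the result into the four gate pre-activations instead of running separate fc layers per gate.

```python
import numpy as np

def _sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

def lstm_step_fused(x_t, h_prev, c_prev, w, b):
    """One LSTM step with a single fused gate projection.

    w has shape (input_size + hidden_size, 4 * hidden_size), so one
    matmul produces all four gate pre-activations, which are then
    split into input/forget/cell/output slices.
    """
    z = np.concatenate([x_t, h_prev], axis=-1) @ w + b  # one matmul for all gates
    i, f, g, o = np.split(z, 4, axis=-1)                # slice into four gates
    c_t = _sigmoid(f) * c_prev + _sigmoid(i) * np.tanh(g)
    h_t = _sigmoid(o) * np.tanh(c_t)
    return h_t, c_t
```

The single projection computes the same quantities as per-gate fc layers with shared inputs, but in one kernel launch, which is why fused implementations prefer it.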
return translation_ids, translation_scores, feeding_list
def to_lodtensor(data, place, dtype='int64'):
Can this become a general library?
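The conversion this helper performs can be illustrated framework-free (a sketch of the LoD layout, not the PR's implementation; `to_flat_with_lod` is a hypothetical name): a batch of variable-length sequences becomes one flat array plus a list of cumulative offsets, which is the level-of-detail (LoD) representation a LoDTensor stores.

```python
import numpy as np

def to_flat_with_lod(seqs, dtype='int64'):
    """Flatten variable-length sequences into (flat_data, lod_offsets).

    Example layout: [[1, 2], [3, 4, 5]] -> data [1, 2, 3, 4, 5],
    offsets [0, 2, 5]; sequence i occupies flat[lod[i]:lod[i+1]].
    """
    lengths = [len(s) for s in seqs]
    lod = np.cumsum([0] + lengths).tolist()  # cumulative offsets
    flat = np.concatenate([np.asarray(s, dtype=dtype) for s in seqs])
    return flat, lod
```

Because the logic depends only on sequence lengths, not on the model, moving it into a shared utility module (as the reviewer suggests) is straightforward.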
return lod_t, lod[-1]
def lodtensor_to_ndarray(lod_tensor):
Can this become a general library? It also needs comments.
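The inverse direction can be sketched the same way (again framework-free; `flat_to_sequences` is a hypothetical name): given a flat array and its LoD offsets, slicing between consecutive offsets recovers the per-sequence arrays.

```python
import numpy as np

def flat_to_sequences(flat, lod):
    """Split a flat array back into per-sequence arrays using LoD offsets."""
    return [flat[lod[i]:lod[i + 1]] for i in range(len(lod) - 1)]
```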
@@ -0,0 +1,458 @@
"""seq2seq model for fluid."""
To me, this seems to be an example? Perhaps the main file should live in the model zoo, examples, or tests, and some general utility methods can live here?
if __name__ == '__main__':
    #train_main()
Should this be controlled by a flag here instead of being commented out?
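One common way to address this comment is a command-line flag selecting the entry point, rather than commenting calls in and out. A minimal sketch with argparse (the `--mode` flag and `infer_main` name are assumptions for illustration; only `train_main` appears in the snippet above):

```python
import argparse

def train_main():
    print("training")

def infer_main():
    print("inference")

def parse_args(argv=None):
    """Parse the run mode; defaults to training."""
    parser = argparse.ArgumentParser()
    parser.add_argument('--mode', choices=['train', 'infer'], default='train',
                        help='run training or beam-search inference')
    return parser.parse_args(argv)

if __name__ == '__main__':
    args = parse_args()
    {'train': train_main, 'infer': infer_main}[args.mode]()
```

This keeps both code paths exercised from the same file and makes the choice explicit on the command line.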
@@ -0,0 +1,245 @@
# Copyright (c) 2018 PaddlePaddle Authors. All Rights Reserved.
Should this file live in examples, tests, or the model zoo?
return self._need_reorder
class MemoryState(object):
Should this class and many others be private, e.g. _MemoryState?
self._counter.stop_gradient = True

# write initial state
block.append_op(
Why is this an append_op and not a layers.array_write?
self._switched_decoder = False
class TrainingDecoder(object):
This needs comments.
Fix part of #674.