Attention visualization #121

Merged: 10 commits into ufal:bahdanau, Nov 8, 2016

Conversation

@cifkao (Member) commented Oct 30, 2016

Soft alignment visualization (#108) for the bahdanau branch.

@jlibovicky (Contributor) left a comment

Good job, thanks a lot! Please pay attention to the tests. One of the unit tests is failing because of the TensorBoard summaries (the issue may also be in the test itself). The integration test is failing because it cannot download the data it should work with; that should get fixed when you rebase onto master.

@@ -47,6 +48,12 @@ def attention_decoder(decoder_inputs, initial_state, attention_objects,
         outputs.append(output)
         states.append(state)
 
+    if summary_collections:
+        for i, a in enumerate(attention_objects):
+            alignments = tf.expand_dims(tf.transpose(tf.pack(a.attentions_in_time), perm=[1, 2, 0]), -1)
Contributor:

This line is too long. We check the style of the files using pylint. If you run tests/lint_run.sh, it will check all Python files and give you a detailed report.
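
One possible way to satisfy the limit is to split the expression into named intermediate steps. This is only a sketch with made-up intermediate names, not necessarily the fix that ended up in the branch:

    packed = tf.pack(a.attentions_in_time)
    transposed = tf.transpose(packed, perm=[1, 2, 0])
    alignments = tf.expand_dims(transposed, -1)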

Contributor:

I wonder if this is general enough. If you look, e.g., at the Recurrent Neural Machine Translation paper, they use a GRU network to do the attention instead of the softmax weighting, and they come up with a clever way of visualizing this attention. I think that reserving the attentions_in_time field for visualization purposes should work even for the recurrent attention, but please check whether that is the case.
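
For reference, a minimal NumPy sketch of the shape manipulation the summary code performs, assuming attentions_in_time holds one [batch, input_length] alignment matrix per decoder step (all dimension sizes here are made up):

    import numpy as np

    batch_size, decoder_steps, input_length = 16, 20, 30

    # One alignment distribution per decoder step (assumed shapes).
    attentions_in_time = [np.random.rand(batch_size, input_length)
                          for _ in range(decoder_steps)]

    # tf.pack stacks the list along a new leading axis:
    # [decoder_steps, batch_size, input_length].
    packed = np.stack(attentions_in_time)

    # perm=[1, 2, 0] reorders the axes to
    # [batch_size, input_length, decoder_steps].
    transposed = packed.transpose(1, 2, 0)

    # The trailing singleton axis yields an image-like tensor
    # [batch_size, input_length, decoder_steps, 1]: one greyscale
    # alignment image per sentence in the batch.
    alignments = transposed[..., np.newaxis]
    assert alignments.shape == (16, 30, 20, 1)

Any attention variant that appends one weight vector per decoder step to attentions_in_time would produce the same image layout, so the field looks reusable for the recurrent case as well.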

Member:

Just FYI, the line length limit I use is 80 characters. I will add it to the documentation for developers.

@@ -223,9 +223,10 @@ def training_loop(sess, saver,
 
         if step % validation_period == validation_period - 1:
             decoded_val_sentences, decoded_raw_val_sentences, \
-                val_evaluation = run_on_dataset(
+                val_evaluation, val_images = run_on_dataset(
Contributor:

Considering that the inputs of NM can also be images (we did image captioning with it previously), the name images may be confusing. What about plots, visualizations, ...? I don't know; maybe even images is OK.

Member (Author):

What about out_images?

Member:

I'd stick with val_plots.

@@ -29,7 +29,7 @@ name=translation
 output=tests/tmp-test-output
 overwrite_output_dir=True
 batch_size=16
-epochs=2
+epochs=5
Contributor:

Why 5?

@jlibovicky (Contributor):

@cifkao: Can you rebase the branch onto master and find out whether it fixes the failing unit tests?

@jindrahelcl (Member) commented Nov 5, 2016

@jlibovicky @cifkao I think the best solution is maybe to cherry-pick the commit that adds the testing data to the repository: ab0a0ba

@cifkao (Member, Author) commented Nov 7, 2016

@jindrahelcl I agree, rebasing onto master would be difficult. But maybe it's better to cherry-pick ab0a0ba into bahdanau directly?

Apart from that, I found that the visualization is incorrect because attentions_in_time accumulates attention vectors from both training and runtime decoding (is that intended?). Given that I'm visualizing the validation data, I think I want to use only the second half of the list. But what's the right way to separate it from the rest?

@jindrahelcl (Member):

The data should already be in the bahdanau branch (commit 5d88de3).

The graph contains the attention mechanism ops twice: one set for training and the other for runtime. During validation, the training ops are not executed, so there should not be a problem.

@jindrahelcl (Member):

@cifkao If you don't want to cherry-pick the commit (which is IMHO the best option), merge the bahdanau branch into this one. (Or rebase this onto bahdanau, but rebasing something that has already been pushed to GitHub should be avoided.)

@cifkao (Member, Author) commented Nov 8, 2016

The problem is that the same Attention object is used for constructing both the training and runtime parts of the graph, and therefore attentions_in_time contains tensors from both parts. Because I'm using attentions_in_time for visualization, I was getting images like this:

[image: tensorboard, https://cloud.githubusercontent.com/assets/8046580/20091403/dac7641a-a592-11e6-9777-ce513c1e0bb1.png]

This was, of course, easily fixed by taking only the last len(decoder_inputs) elements of the list (i.e. the ones that were just appended).
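
A minimal sketch of that fix applied to the summary code from the diff above; the actual committed change may differ in details:

    if summary_collections:
        for i, a in enumerate(attention_objects):
            # attentions_in_time has accumulated alignments from both
            # graph constructions (training first, then runtime), so
            # keep only the entries appended by this call.
            attentions = a.attentions_in_time[-len(decoder_inputs):]
            alignments = tf.expand_dims(
                tf.transpose(tf.pack(attentions), perm=[1, 2, 0]), -1)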

@jindrahelcl (Member):

Huh, this looks very much like some sort of bug. I'll look into it.


@jlibovicky (Contributor):

It's definitely a bug. Luckily, it does not influence the performance of most of the models, but it is probably the reason why the coverage model worked so poorly.

@jlibovicky merged commit bbca906 into ufal:bahdanau on Nov 8, 2016