Rearrange attention output to have batch dimension on the 0th axis #8591

dakshvar22 · 2021-05-03T11:44:43Z

Proposed changes:

...

Status (please check what you already did):

added some tests for the functionality
updated the documentation
updated the changelog (please check changelog for instructions)
reformat files using black (please check Readme for instructions)

JEM-Mosig

Looks good! Just one small thing in the changelog. ... And you'll have to fix that test for output shapes.

JEM-Mosig · 2021-05-06T11:58:06Z

changelog/8591.misc.md

@@ -0,0 +1,3 @@
+Tensorflow models now return batch dimension on the first axis and number of layers on the second axis for output array associated with `attention_output` key.


The key would be attention_weights, not attention_output.
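For readers following the shape change, here is a minimal sketch of what "transpose first and second axis" means for such an array. It uses NumPy rather than the project's TensorFlow code, and the sizes and trailing axes (heads, sequence length) are assumptions made for illustration only, not taken from the PR.

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
num_layers, batch_size, num_heads, seq_len = 2, 4, 3, 5

# Assumed layout before the change: layers on axis 0, batch on axis 1.
layers_first = np.zeros((num_layers, batch_size, num_heads, seq_len, seq_len))
layers_first[1, 3] = 1.0  # mark layer 1 of batch item 3

# Swap the first two axes so the batch dimension comes first.
batch_first = np.swapaxes(layers_first, 0, 1)

print(batch_first.shape)  # (4, 2, 3, 5, 5)
```

After the swap, the entry marked at `[1, 3]` (layer, batch item) is found at `[3, 1]` (batch item, layer), so downstream code can index by example first.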

github-actions · 2021-05-12T15:14:43Z

Hey @dakshvar22! 👋 To run model regression tests, comment with the /modeltest command and a configuration.

Tips 💡: The model regression tests run on push events. You can re-run them by re-adding the status:model-regression-tests label or by using the Re-run jobs button in the GitHub Actions workflow.

Tips 💡: Whenever you want to change the configuration, edit the comment that contains the previous configuration.

You can copy this in your comment and customize:

/modeltest

```yml
##########
## Available datasets
##########
# - "Carbon Bot"
# - "Hermit"
# - "Private 1"
# - "Private 2"
# - "Private 3"
# - "Sara"

##########
## Available configurations
##########
# - "BERT + DIET(bow) + ResponseSelector(bow)"
# - "BERT + DIET(seq) + ResponseSelector(t2t)"
# - "Spacy + DIET(bow) + ResponseSelector(bow)"
# - "Spacy + DIET(seq) + ResponseSelector(t2t)"
# - "Sparse + BERT + DIET(bow) + ResponseSelector(bow)"
# - "Sparse + BERT + DIET(seq) + ResponseSelector(t2t)"
# - "Sparse + DIET(bow) + ResponseSelector(bow)"
# - "Sparse + DIET(seq) + ResponseSelector(t2t)"
# - "Sparse + Spacy + DIET(bow) + ResponseSelector(bow)"
# - "Sparse + Spacy + DIET(seq) + ResponseSelector(t2t)"

## Example configuration
#################### syntax #################
## include:
##   - dataset: ["<dataset_name>"]
##     config: ["<configuration_name>"]
#
## Example:
## include:
##  - dataset: ["Carbon Bot"]
##    config: ["Sparse + DIET(bow) + ResponseSelector(bow)"]
#
## Shortcut:
## You can use the "all" shortcut to include all available configurations or datasets
#
## Example: Use the "Sparse + DIET(bow) + ResponseSelector(bow)" configuration
## for all available datasets
## include:
##  - dataset: ["all"]
##    config: ["Sparse + DIET(bow) + ResponseSelector(bow)"]
#
## Example: Use all available configurations for the "Carbon Bot" and "Sara" datasets
## and for the "Hermit" dataset use the "Sparse + DIET(seq) + ResponseSelector(t2t)" and
## "BERT + DIET(seq) + ResponseSelector(t2t)" configurations:
## include:
##  - dataset: ["Carbon Bot", "Sara"]
##    config: ["all"]
##  - dataset: ["Hermit"]
##    config: ["Sparse + DIET(seq) + ResponseSelector(t2t)", "BERT + DIET(seq) + ResponseSelector(t2t)"]
#
## Example: Define a branch name to check out for a dataset repository. The default branch is 'main'
## dataset_branch: "test-branch"
## include:
##  - dataset: ["Carbon Bot", "Sara"]
##    config: ["all"]


include:
 - dataset: ["Carbon Bot"]
   config: ["Sparse + DIET(bow) + ResponseSelector(bow)"]

```
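As a rough illustration of the `include` syntax described in the comments above, the sketch below expands a list of entries, including the "all" shortcut, into individual (dataset, config) runs. The function name `expand_include` and the shortened dataset and configuration lists are hypothetical; how the actual workflow resolves the shortcut is not shown in this thread.

```python
from itertools import product


def expand_include(include, datasets, configs):
    """Expand "all" and return every (dataset, config) pair to run."""
    pairs = []
    for entry in include:
        # ["all"] stands for every known dataset or configuration.
        ds = datasets if entry["dataset"] == ["all"] else entry["dataset"]
        cf = configs if entry["config"] == ["all"] else entry["config"]
        pairs.extend(product(ds, cf))
    return pairs


# Shortened, hypothetical subsets of the lists in the comment above.
DATASETS = ["Carbon Bot", "Hermit", "Sara"]
CONFIGS = [
    "Sparse + DIET(bow) + ResponseSelector(bow)",
    "Sparse + DIET(seq) + ResponseSelector(t2t)",
]

include = [{"dataset": ["Carbon Bot", "Sara"], "config": ["all"]}]
pairs = expand_include(include, DATASETS, CONFIGS)
print(len(pairs))  # 4
```

Under this reading, each `include` entry is a small cross product, so two datasets with `config: ["all"]` and two known configurations yield four runs.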

github-actions · 2021-05-12T15:14:46Z

/modeltest

include:
 - dataset: ["Carbon Bot"]
   config: ["Sparse + DIET(bow) + ResponseSelector(bow)", "Sparse + DIET(seq) + ResponseSelector(t2t)"]

github-actions · 2021-05-12T15:14:48Z

The model regression tests have started. It might take a while, please be patient.
As soon as results are ready you'll see a new comment with the results.

The configuration used can be found in the comment.

github-actions · 2021-05-12T15:35:51Z

Commit: 3c1c5d6, The full report is available as an artifact.

Dataset: Carbon Bot, Dataset repository branch: main, commit: c3e1ed09c204a1be311c61320c8defcf0ee1a7dd

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 |
|---|---|---|---|
| `Sparse + DIET(bow) + ResponseSelector(bow)` test: `36s`, train: `2m33s`, total: `3m9s` | 0.7748 (0.04) | 0.7529 (0.00) | 0.4702 (-0.01) |
| `Sparse + DIET(seq) + ResponseSelector(t2t)` test: `55s`, train: `3m46s`, total: `4m41s` | 0.7359 (-0.01) | 0.6724 (-0.01) | 0.5033 (-0.04) |

dakshvar22 added 3 commits May 3, 2021 13:36

transpose first and second axis

3da32ea

add changelog

042180d

typo

0e9c3d8

dakshvar22 requested a review from JEM-Mosig May 3, 2021 11:50

This was referenced May 3, 2021

Update Attention Viewer RasaHQ/rasalit#57

Closed

DIETLanguage needs to be double checked. koaning/whatlies#306

Closed

JEM-Mosig approved these changes May 6, 2021

dakshvar22 added 2 commits May 12, 2021 17:02

Merge branch 'main' into rearrange_attention_output

488fb04

fix tests

d4ba7a6

dakshvar22 added runner:gpu status:model-regression-tests and removed status:model-regression-tests labels May 12, 2021

github-actions bot deleted a comment from dakshvar22 May 12, 2021

github-actions bot removed status:model-regression-tests runner:gpu labels May 12, 2021

dakshvar22 enabled auto-merge (squash) May 12, 2021 15:43

dakshvar22 merged commit 03b3236 into main May 12, 2021

dakshvar22 deleted the rearrange_attention_output branch May 12, 2021 16:08
