
Fix the EvaluationStore to match prediction and target entities #6419

Merged

Conversation

davidezanella
Contributor

@davidezanella davidezanella commented Aug 14, 2020

Proposed changes:

  • Fix the EvaluationStore serialise method: the prediction and target entities were not aligned correctly. Previously, the shorter of the two lists was simply padded with None at the end instead of aligning each entity with its counterpart by position in the text.

Practical example:

target entities: [
   {'text': 'hi, how are you', 'start': 0, 'end': 2, 'entity': 'verb'},
   {'text': 'hi, how are you', 'start': 4, 'end': 7, 'entity': 'noun'}
]
predicted entities: [
   {'text': 'hi, how are you', 'start': 4, 'end': 7, 'entity': 'noun'}
]

Padding with None at the end, as done originally, leads to wrong metrics: in the example above, the correct 'noun' prediction is compared against the wrong target and therefore also counted as wrong.
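For illustration, here is a minimal sketch of the intended alignment logic (the helper name align_entity_lists is hypothetical and only illustrates the idea, not the actual Rasa implementation): both lists are sorted by their start offset and walked in parallel, and None is inserted on whichever side has no entity at a given position, so the two lists stay index-aligned.

from typing import Any, Dict, List, Optional, Text, Tuple

Pair = Tuple[Optional[Dict[Text, Any]], Optional[Dict[Text, Any]]]

def align_entity_lists(
    predictions: List[Dict[Text, Any]], targets: List[Dict[Text, Any]]
) -> List[Pair]:
    # Illustrative sketch only: pair predicted and target entities by character offset.
    predictions = sorted(predictions, key=lambda e: e["start"])
    targets = sorted(targets, key=lambda e: e["start"])

    aligned: List[Pair] = []
    index_prediction, index_target = 0, 0
    while index_prediction < len(predictions) or index_target < len(targets):
        prediction = (
            predictions[index_prediction] if index_prediction < len(predictions) else None
        )
        target = targets[index_target] if index_target < len(targets) else None

        if prediction is not None and target is not None and prediction["start"] == target["start"]:
            aligned.append((prediction, target))  # same span: compare directly
            index_prediction += 1
            index_target += 1
        elif target is not None and (prediction is None or target["start"] < prediction["start"]):
            aligned.append((None, target))  # target with no prediction at this span
            index_target += 1
        else:
            aligned.append((prediction, None))  # prediction with no target at this span
            index_prediction += 1
    return aligned

For the example above this yields [(None, verb target), (noun prediction, noun target)], so the correct 'noun' prediction is compared against its matching target instead of against None.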

Status (please check what you already did):

  • added some tests for the functionality
  • updated the documentation
  • updated the changelog (please check changelog for instructions)
  • reformat files using black (please check Readme for instructions)

@davidezanella davidezanella force-pushed the fix-EvaluationStore-predicted-entities branch from 04cbacd to c052e5b Compare August 14, 2020 21:41
@sara-tagger sara-tagger requested a review from degiz August 17, 2020 06:00
@sara-tagger
Collaborator

Thanks for submitting a pull request 🚀 @degiz will take a look at it as soon as possible ✨

@davidezanella
Contributor Author

@tabergma can you review this PR?

Contributor

@degiz degiz left a comment

Hey @davidezanella

Thanks a lot for the PR! Could you please also write a unit test for the change? I believe you can use the existing tests/test_test.py file.

@davidezanella davidezanella requested a review from degiz August 24, 2020 21:31
@tabergma
Contributor

@davidezanella Thanks for addressing this. I think your concern is valid and should be fixed, but I am not sure if your method fixes the complete problem. Currently you are just iterating over the gold entities to see if there are matching predicted entities, and you add None in case no matching entity could be found. That way we ensure that the gold and predicted entity lists have the same size. However, I think we also need to consider the case where there are more predicted entities than gold entities. For example, take a sentence like "Tanja is currently in Munich, but she lives in Berlin" where the annotator made a mistake and only annotated "Tanja" as a person, forgetting to tag "Munich" and "Berlin" as cities. However, the model predicts all three of them. If I understand your code correctly, you would only consider the gold entity "Tanja" and ignore the two additional entities "Munich" and "Berlin", which is not correct, as those are false positive predictions in this case. What do you think?
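For concreteness, reusing the hypothetical align_entity_lists sketch from the PR description (character offsets are illustrative), the two extra predictions should surface as (prediction, None) pairs, i.e. false positives, rather than being dropped:

text = "Tanja is currently in Munich, but she lives in Berlin"
targets = [
    {'text': text, 'start': 0, 'end': 5, 'entity': 'person'},  # only "Tanja" was annotated
]
predictions = [
    {'text': text, 'start': 0, 'end': 5, 'entity': 'person'},   # Tanja
    {'text': text, 'start': 22, 'end': 28, 'entity': 'city'},   # Munich
    {'text': text, 'start': 47, 'end': 53, 'entity': 'city'},   # Berlin
]

align_entity_lists(predictions, targets)
# -> [(person, person), (Munich prediction, None), (Berlin prediction, None)]
# The last two pairs count as false positives instead of being silently ignored.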

@davidezanella
Contributor Author

Hey @tabergma, you are right! I've fixed the code and the tests.
Let me know what you think about it.

@degiz degiz requested review from tabergma and degiz and removed request for degiz August 27, 2020 12:20
Contributor

@degiz degiz left a comment

It doesn't seem like I can cancel my "changes requested" 😞
@tabergma will review the code.

@tabergma
Contributor

tabergma commented Sep 1, 2020

@davidezanella Sorry for the late reply, I was offline for a couple of days. I will take a look at the PR today. Can you please resolve the conflicts in the meantime? Thanks.

Contributor

@tabergma tabergma left a comment

Looks great 🚀 Added a few comments. Can you please also add a changelog entry? Thanks.

rasa/core/test.py — five review threads, all resolved (four marked outdated)
Contributor

@tabergma tabergma left a comment

Looks great 💯 Thanks for fixing this!

key=lambda x: x.get("start"),
)

i_pred, i_target = 0, 0
Contributor

Minor: We try to not use abbreviations in names. I would rename this to index_prediction and index_target.

i_pred, i_target = 0, 0

while i_pred < len(entity_predictions) or i_target < len(entity_targets):
    cmp = self._compare_entities(
Contributor

Minor: I would rename this to comparison_result.

def _compare_entities(
    entity_predictions: List[Dict[Text, Any]],
    entity_targets: List[Dict[Text, Any]],
    i_pred: int,
Contributor

Minor: Rename to index_prediction.

    entity_predictions: List[Dict[Text, Any]],
    entity_targets: List[Dict[Text, Any]],
    i_pred: int,
    i_target: int,
Contributor

Minor: Rename to index_target.

@tabergma
Contributor

tabergma commented Sep 2, 2020

@degiz I think you need to update your review, otherwise we won't be able to merge this PR.

@degiz degiz merged commit be72c78 into RasaHQ:master Sep 2, 2020