[ML] fix NLP question_answering task when best answer is only one token #88347

benwtrent · 2022-07-07T14:19:59Z

There are scenarios when question_answering find the best start/end token and they are the same token. An example of this is:

context: "My name is Ben and I live in London" question: "Where do I live?"

The correct answer here is London and its a single token. Without this fix, we will return in London with a lower probability.

elasticmachine · 2022-07-07T14:20:04Z

Pinging @elastic/ml-core (Team:ML)

elasticsearchmachine · 2022-07-07T14:20:26Z

Hi @benwtrent, I've created a changelog YAML for you.

benwtrent · 2022-07-07T14:45:47Z

@elasticmachine update branch

droberts195 · 2022-07-08T12:05:15Z

...in/ml/src/main/java/org/elasticsearch/xpack/ml/inference/nlp/QuestionAnsweringProcessor.java

@@ -212,7 +212,7 @@ static void topScores(
                if (startNormalized[i] == 0) {
                    continue;
                }
-                for (int j = i + 1; j < (maxAnswerLength + i) && j < tokenSize; j++) {
+                for (int j = i; j < (maxAnswerLength + i) && j < tokenSize; j++) {


Line 227 still has i + 1. Is that correct? If it is then it would be good to add a comment explaining why a different start point is used in the two places.

…sticsearch into bugfix/ml-q_and_a-nlp-task

benwtrent · 2022-07-08T13:21:23Z

@elasticmachine update branch

droberts195

LGTM

[ML] fix NLP question_answering task when best answer is only one token

8dcf8fe

benwtrent added >bug :ml Machine learning v8.4.0 labels Jul 7, 2022

elasticmachine added the Team:ML Meta label for the ML team label Jul 7, 2022

Update docs/changelog/88347.yaml

7f66d5e

Merge branch 'master' into bugfix/ml-q_and_a-nlp-task

d21f9a7

droberts195 reviewed Jul 8, 2022

View reviewed changes

benwtrent added 2 commits July 8, 2022 08:56

fixing bug for split input as well

1f2119b

Merge branch 'bugfix/ml-q_and_a-nlp-task' of github.com:benwtrent/ela…

70ceedb

…sticsearch into bugfix/ml-q_and_a-nlp-task

Merge branch 'master' into bugfix/ml-q_and_a-nlp-task

852b3ae

droberts195 approved these changes Jul 8, 2022

View reviewed changes

benwtrent merged commit 9abfe4b into elastic:master Jul 8, 2022

benwtrent deleted the bugfix/ml-q_and_a-nlp-task branch July 8, 2022 14:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] fix NLP question_answering task when best answer is only one token #88347

[ML] fix NLP question_answering task when best answer is only one token #88347

benwtrent commented Jul 7, 2022

elasticmachine commented Jul 7, 2022

elasticsearchmachine commented Jul 7, 2022

benwtrent commented Jul 7, 2022

droberts195 Jul 8, 2022

benwtrent commented Jul 8, 2022

droberts195 left a comment

[ML] fix NLP question_answering task when best answer is only one token #88347

[ML] fix NLP question_answering task when best answer is only one token #88347

Conversation

benwtrent commented Jul 7, 2022

elasticmachine commented Jul 7, 2022

elasticsearchmachine commented Jul 7, 2022

benwtrent commented Jul 7, 2022

droberts195 Jul 8, 2022

Choose a reason for hiding this comment

benwtrent commented Jul 8, 2022

droberts195 left a comment

Choose a reason for hiding this comment