[Upgrade Assistant] Update logic for handling reindex failures #124571

alisonelizabeth · 2022-02-03T18:31:50Z

The Upgrade Assistant was incorrectly marking a reindexing process as a failure. This PR adjusts the logic for determining a reindex failure to only check for the existence of failures in the task API response.

It also fixes a minor spacing issue when rendering an error callout.

How to test

This is a hard one to add automated tests for (open to suggestions!), as it involves restarting Kibana to reproduce.

I was able to reproduce the bug (pre change) fairly consistently using the manual steps documented in #123817 (comment). With this change, the reindexing process should no longer fail.

To trigger an actual reindexing failure (post change):

Start up 6.x Elasticsearch and create some indices that contain a property with a text type. For example:

PUT /test/_doc/1
{
  "field": "my_string"
}

Alternatively, you can use this zip file, which already contains some indices: 6.8.16-data.zip

Start Elasticsearch with the 6.x snapshot created in step 1: yarn es snapshot --license=trial -E path.data=./path_to_6.x
Hack the UA code so that it creates the destination index with a bad mapping. This will trigger a reindex failure:

      createIndex = await esClient.indices.create({
        index: newIndexName,
        body: {
          settings,
          mappings: {
            properties: {
              field: { type: 'date' },
            },
          },
        },
      });

Start Kibana and navigate to Stack Management -> Upgrade Assistant -> ES deprecations
Start reindexing one of the indices. Reindexing should fail and trigger the error callout.

Screenshots

Before/after of spacing issue.

Before:

After:

alisonelizabeth · 2022-02-03T18:38:55Z

x-pack/plugins/upgrade_assistant/server/lib/reindexing/reindex_service.ts

-        body: { count },
-      } = await esClient.count({ index: reindexOp.attributes.indexName });
-
-      if (taskResponse.task.status!.created < count) {


This is not a reliable check, as documents can end up counted as updated rather than created. This can occur if Kibana is restarted and a user resumes the reindex process in Upgrade Assistant, which actually kicks off the reindex again. With the code as it was, we would throw an error even though no failure had occurred.

More info on the task response body here: https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-reindex.html#docs-reindex-api-response-body
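To make the difference between the two checks concrete, here is a minimal sketch. The `ReindexTaskResponse` interface and both helper functions are hypothetical, trimmed-down illustrations and are not the actual code in `reindex_service.ts`; the sample numbers are made up.

```typescript
// Hypothetical, trimmed-down shape of a completed reindex task response.
interface ReindexTaskResponse {
  completed: boolean;
  task: { status?: { total: number; created: number; updated: number } };
  response?: { failures?: unknown[] };
}

// Old check (removed here): compare created docs against the source doc count.
function failedByCreatedCount(task: ReindexTaskResponse, sourceCount: number): boolean {
  const created = task.task.status ? task.task.status.created : 0;
  return created < sourceCount;
}

// New check: only failures reported in the task response mark the reindex as failed.
function failedByReportedFailures(task: ReindexTaskResponse): boolean {
  const failures = task.response && task.response.failures ? task.response.failures : [];
  return failures.length > 0;
}

// A resumed reindex (made-up numbers): 60 docs already existed in the
// destination index, so they are reported as updated instead of created.
const resumed: ReindexTaskResponse = {
  completed: true,
  task: { status: { total: 100, created: 40, updated: 60 } },
  response: { failures: [] },
};

const oldResult = failedByCreatedCount(resumed, 100); // flags a failure that did not happen
const newResult = failedByReportedFailures(resumed); // no failures reported
```

With these sample values the old check reports a failure while the new one does not, which matches the behavior described in the comment above.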

alisonelizabeth · 2022-02-07T15:52:54Z

@elasticmachine merge upstream

elasticmachine · 2022-02-07T16:02:29Z

Pinging @elastic/platform-deployment-management (Team:Deployment Management)

alisonelizabeth · 2022-02-07T17:15:55Z

@elasticmachine merge upstream

alisonelizabeth · 2022-02-09T19:11:27Z

@elasticmachine merge upstream

kibana-ci · 2022-02-09T20:35:16Z

💚 Build Succeeded

Metrics [docs]

✅ unchanged

History

💚 Build #22199 succeeded 9554953
💔 Build #22164 failed 8ea0b6d
💔 Build #22139 failed a32ac7e
💚 Build #21689 succeeded 519cb6c

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

sebelga

Code LGTM! I tried to reproduce the issue locally by reverting your changes, but could not get the error to be thrown. I tried stopping Kibana both at the beginning of the reindexing and in the middle of the 3rd step ("Reindex documents"). In both cases UA resumed and completed the reindex.

The code change makes sense to me, though. It would be great to get to the bottom of exactly when this issue occurs. Could it be ES related?

alisonelizabeth · 2022-02-10T13:14:54Z

Thanks for the review @sebelga!

The code change makes sense to me, though. It would be great to get to the bottom of exactly when this issue occurs. Could it be ES related?

The issue is likely occurring because we actually kick off another reindex operation when the user clicks "resume" in the UI. The documents may have already been created by the first run, so they end up in an updated state in the ES task response.
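As a rough illustration of that explanation, the toy simulation below restarts a copy into a destination that already holds some documents. `toyReindex` and its types are hypothetical stand-ins, not Elasticsearch or Upgrade Assistant code. The sketch assumes a write to an existing id is counted as updated and a write to a new id as created.

```typescript
// Toy stand-ins for an index: document id -> document body.
type ToyDoc = Record<string, unknown>;
type ToyIndex = Record<string, ToyDoc>;

interface ToyReindexStatus {
  total: number;
  created: number;
  updated: number;
}

// Copies up to `limit` docs from source to dest, counting each write as
// created (id was absent in dest) or updated (id was already present).
function toyReindex(source: ToyIndex, dest: ToyIndex, limit: number = Infinity): ToyReindexStatus {
  const status: ToyReindexStatus = { total: 0, created: 0, updated: 0 };
  for (const id of Object.keys(source)) {
    if (status.total >= limit) {
      break;
    }
    if (Object.prototype.hasOwnProperty.call(dest, id)) {
      status.updated += 1;
    } else {
      status.created += 1;
    }
    dest[id] = { ...source[id] };
    status.total += 1;
  }
  return status;
}

const source: ToyIndex = {};
for (let i = 0; i < 5; i++) {
  source["doc-" + i] = { field: "my_string" };
}
const dest: ToyIndex = {};

// First run is interrupted after 3 docs (standing in for the Kibana restart).
const firstRun = toyReindex(source, dest, 3);
// "Resume" starts the copy over from the beginning.
const secondRun = toyReindex(source, dest);
```

In the second run `created` is lower than the source document count even though every document was copied, which is the situation the removed check mistook for a failure.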

adjust logic for determining reindex failures

519cb6c

alisonelizabeth added the release_note:fix, Team:Kibana Management, Feature:Upgrade Assistant, and v7.17.1 labels on Feb 3, 2022

Merge branch '7.17' of https://github.com/elastic/kibana into ua/reindex_error_handling

a32ac7e

Merge branch '7.17' into ua/reindex_error_handling

8ea0b6d

alisonelizabeth marked this pull request as ready for review February 7, 2022 16:02

alisonelizabeth requested a review from sebelga February 7, 2022 16:02

Merge branch '7.17' into ua/reindex_error_handling

9554953

Merge branch '7.17' into ua/reindex_error_handling

e3a0b1f

sebelga approved these changes Feb 10, 2022

alisonelizabeth merged commit 2212285 into elastic:7.17 Feb 10, 2022

alisonelizabeth deleted the ua/reindex_error_handling branch February 10, 2022 13:15

This was referenced Feb 10, 2022

[Upgrade Assistant] Reindexing error after restart #124153

Closed

[Upgrade Assistant] Update logic for handling reindex failures #125246

Merged

jloleysens mentioned this pull request Dec 9, 2024

[UA] Remove delete from .tasks permission check #203379

Merged
