Immediately upgrading a downgraded tsdb data stream fails #96163

martijnvg · 2023-05-16T13:44:07Z

Immediately upgrading a data stream to tsdb after is has been downgraded from a tsdb data stream fails the execute.
This is because there already exists a tsdb backing index and the rollover doesn't detects that, because the data stream is non tsdb.

Note that after waiting ~4hrs the rollover should succeed.

Reproduction:

PUT _index_template/1
{
  "index_patterns": [
    "test*"
  ],
  "template": {
    "settings": {
      "index": {
        "mode": "time_series"
      }
    },
    "mappings": {
      "properties": {
          "my_field": {
              "time_series_dimension": true,
              "type": "keyword"
          }
      }
    }
  },
  "data_stream": {}
}

POST test1/_doc
{
  "@timestamp": "2023-05-16T11:49:50.599Z",
  "my_field": "value"
}

PUT _index_template/1
{
    "index_patterns": [
        "test*"
    ],
    "template": {
        "settings": {
            "index": {
                "mode": null
            }
        },
        "mappings": {
            "properties": {
                "my_field": {
                    "time_series_dimension": true,
                    "type": "keyword"
                }
            }
        }
    },
    "data_stream": {}
}

POST test1/_rollover

PUT _index_template/1
{
    "index_patterns": [
        "test*"
    ],
    "template": {
        "settings": {
            "index": {
                "mode": "time_series"
            }
        },
        "mappings": {
            "properties": {
                "my_field": {
                    "time_series_dimension": true,
                    "type": "keyword"
                }
            }
        }
    },
    "data_stream": {}
}

POST test1/_rollover

The text was updated successfully, but these errors were encountered:

elasticsearchmachine · 2023-05-16T13:44:31Z

Pinging @elastic/es-analytics-geo (Team:Analytics)

… installed (#157869) ## Summary Fixes #157345 When a package with a changed `index.mode` or `source.mode` setting is installed, Fleet will now automatically perform a rollover to ensure the correct setting is present on the resulting backing index. There is an issue with Elasticsearch wherein toggling these settings back and forth will incur a backing index range overlap error. See elastic/elasticsearch#96163. To test 1. Install the `system` integration at version `1.28.0` 2. Create an integration policy for the `system` integration (a standard default agent policy will do) 3. Enroll an agent in this policy, and allow it to ingest some data 4. Confirm that there are documents present in the `metrics-system.cpu-default` data stream, and note its backing index via Stack Management 5. Create a new `1.28.1` version of the `system` integration where `elasticsearch.index_mode: time_series` is set and install it via `elastic-package install --zip` 6. Confirm that a rollover occurs and the backing index for the `metrics-system.cpu-default` data stream has been updated ### Checklist Delete any items that are not applicable to this PR. - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios --------- Co-authored-by: Kibana Machine <[email protected]>

… installed (elastic#157869) ## Summary Fixes elastic#157345 When a package with a changed `index.mode` or `source.mode` setting is installed, Fleet will now automatically perform a rollover to ensure the correct setting is present on the resulting backing index. There is an issue with Elasticsearch wherein toggling these settings back and forth will incur a backing index range overlap error. See elastic/elasticsearch#96163. To test 1. Install the `system` integration at version `1.28.0` 2. Create an integration policy for the `system` integration (a standard default agent policy will do) 3. Enroll an agent in this policy, and allow it to ingest some data 4. Confirm that there are documents present in the `metrics-system.cpu-default` data stream, and note its backing index via Stack Management 5. Create a new `1.28.1` version of the `system` integration where `elasticsearch.index_mode: time_series` is set and install it via `elastic-package install --zip` 6. Confirm that a rollover occurs and the backing index for the `metrics-system.cpu-default` data stream has been updated ### Checklist Delete any items that are not applicable to this PR. - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios --------- Co-authored-by: Kibana Machine <[email protected]> (cherry picked from commit 22e3847)

…ged is installed (#157869) (#157916) # Backport This will backport the following commits from `main` to `8.8`: - [[Fleet] Rollover data streams when package w/ TSDB setting changed is installed (#157869)](#157869)  ### Questions ? Please refer to the [Backport tool documentation](https://github.com/sqren/backport)  Co-authored-by: Kyle Pollich <[email protected]>

mlunadia · 2023-05-17T07:52:27Z

@martijnvg what is the recommended cooling time between changing index for seeing no rollover issues? Do you think this and the manual change mechanism should be documented by Kibana or ES?

martijnvg · 2023-05-17T08:32:41Z

what is the recommended cooling time between changing index for seeing no rollover issues?

I think by default the recommended cooling time should be 4 hours. But this could be less if downgrading happened some time later after the last tsdb rollover.

But this also depends on whether a custom index.look_ahead_time has been set. This default to 2 hours. The first backing index will have start time of now - look_ahead_time and end time of now + look_ahead_time.

Do you think this and the manual change mechanism should be documented by Kibana or ES?

I don't think we have a documented upgrading to and downgrading from tsdb. I think Elasticsearch should docs around this, but I think Kibana too (a minimised version of it).

felixbarny · 2023-05-17T10:39:27Z

Is there a way for ES to automatically adjust the end time to the max @timestamp when rolling over a data stream? That would eliminate the issue assuming that the actual timestamps in that index are lower than the current time. Alternatively, could ES create a new backing index that has a start time that's higher than the existing backing indices' end time?

martijnvg · 2023-05-17T10:50:15Z

Is there a way for ES to automatically adjust the end time to the max @timestamp when rolling over a data stream?

In the context of the rollover operation the information required to update the index.time_series.end_time isn't available.
Maybe on a downgraded data stream we could trim the index.time_series.end_time index setting based on the highest @timestamp in the backing index. But this would need to be done in a separate api call. This api doesn't exist today.

Alternatively, could ES create a new backing index that has a start time that's higher than the existing backing indices' end time?

Yes, but that index could be up to 4 hours in the future and will not end up getting used. Meanwhile current writes will go to the older tsdb backing index. And my concern is that if this downgrade and upgrade cycle happens again then we end up with another tsdb backing index but then up to 8 hours in the future.

… installed (#157869) ## Summary Fixes #157345 When a package with a changed `index.mode` or `source.mode` setting is installed, Fleet will now automatically perform a rollover to ensure the correct setting is present on the resulting backing index. There is an issue with Elasticsearch wherein toggling these settings back and forth will incur a backing index range overlap error. See elastic/elasticsearch#96163. To test 1. Install the `system` integration at version `1.28.0` 2. Create an integration policy for the `system` integration (a standard default agent policy will do) 3. Enroll an agent in this policy, and allow it to ingest some data 4. Confirm that there are documents present in the `metrics-system.cpu-default` data stream, and note its backing index via Stack Management 5. Create a new `1.28.1` version of the `system` integration where `elasticsearch.index_mode: time_series` is set and install it via `elastic-package install --zip` 6. Confirm that a rollover occurs and the backing index for the `metrics-system.cpu-default` data stream has been updated ### Checklist Delete any items that are not applicable to this PR. - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios --------- Co-authored-by: Kibana Machine <[email protected]>

elasticsearchmachine · 2023-05-26T09:45:31Z

Pinging @elastic/es-docs (Team:Docs)

martijnvg · 2023-05-26T09:45:44Z

This issue was discussed in yesterday's tsdb integration sync. The fact that due to how downgrading from tsdb and upgrading to tsdb works causes this bug isn't ideal, but isn't something that we will address. This is because immediately upgrading a downgrade data stream to tsdb isn't a use case we need to support. It is okay if there is some wait time before again upgrading to tsdb.

We do need to document this as part of upgrading to tsdb and downgrading from tsdb.

elasticsearchmachine · 2024-03-20T16:33:46Z

Pinging @elastic/es-storage-engine (Team:StorageEngine)

nchaulet · 2024-10-07T12:05:24Z

It happens in Fleet we have to rollback our integrations and Fleet can trigger automatic upgrades and having to wait n hours to be able to upgrade again because of that behaviour, it is not optimal, both for Fleet and for our users, should/could this be automatically handled by elasticsearch?

martijnvg · 2024-10-07T13:08:49Z

It happens in Fleet we have to rollback our integrations and Fleet can trigger automatic upgrades

Our assumption was that rollbacks should occur rarely. Typically an integration's template / mapping has to be modified in order to be ready for tsdb. After testing, the chance of rolling back should be small. Unless there is some unforeseen bug or the tradeoffs that come with tsdb don't work out well. In that case the second upgrade to tsdb could be days / weeks after the rollback.

and having to wait n hours to be able to upgrade again because of that behaviour,

On recent versions, the wait time is lower now. If index.look_back_time setting is set to 1 minute that migrating to tsdb after a rollback can occur as soon as 31 minutes after rollback.

it is not optimal, both for Fleet and for our users, should/could this be automatically handled by elasticsearch?

This is something we can address, but it had always lower priority over other work. This mainly was based on the fact that we assumed that upgrading minutes to hours after a rollback isn't a common scenario.

martijnvg added >bug :StorageEngine/TSDB You know, for Metrics labels May 16, 2023

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label May 16, 2023

kpollich mentioned this issue May 16, 2023

[Fleet] Rollover data streams when package w/ TSDB setting changed is installed elastic/kibana#157869

Merged

1 task

lalit-satapathy mentioned this issue May 18, 2023

[Meta] Observability TSDB packages migration elastic/integrations#5233

Closed

45 tasks

martijnvg self-assigned this May 23, 2023

martijnvg added >docs General docs changes and removed >bug labels May 26, 2023

elasticsearchmachine added the Team:Docs Meta label for docs team label May 26, 2023

wchaparro removed the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Mar 20, 2024

elasticsearchmachine added the Team:StorageEngine label Mar 20, 2024

martijnvg mentioned this issue Sep 27, 2024

Enabling tsdb on a standard datastream that have been a tsdb datastream previously fail with overlapping backing index #113480

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Immediately upgrading a downgraded tsdb data stream fails #96163

Immediately upgrading a downgraded tsdb data stream fails #96163

martijnvg commented May 16, 2023

elasticsearchmachine commented May 16, 2023

mlunadia commented May 17, 2023

martijnvg commented May 17, 2023

felixbarny commented May 17, 2023

martijnvg commented May 17, 2023

elasticsearchmachine commented May 26, 2023

martijnvg commented May 26, 2023

elasticsearchmachine commented Mar 20, 2024

nchaulet commented Oct 7, 2024

martijnvg commented Oct 7, 2024

Immediately upgrading a downgraded tsdb data stream fails #96163

Immediately upgrading a downgraded tsdb data stream fails #96163

Comments

martijnvg commented May 16, 2023

elasticsearchmachine commented May 16, 2023

mlunadia commented May 17, 2023

martijnvg commented May 17, 2023

felixbarny commented May 17, 2023

martijnvg commented May 17, 2023

elasticsearchmachine commented May 26, 2023

martijnvg commented May 26, 2023

elasticsearchmachine commented Mar 20, 2024

nchaulet commented Oct 7, 2024

martijnvg commented Oct 7, 2024