
esArchiver datastream support #132853

Merged (20 commits) · Jun 2, 2022

Conversation

@klacabane klacabane (Contributor) commented May 24, 2022

Summary

Fixes #69061

Adds support for archiving/loading/unloading data streams.

When archiving indices we now check whether an index is a backing index of a data stream. When it is, we build the data stream's index template, resolving any component template links, and save it as a data_stream record type in the mappings.json file:

{
  "type": "data_stream",
  "value": {
    "data_stream": "my-data-stream-one",
    "template": {
      "index_patterns": ["my-data-stream-*"],
      "mappings": { ... },
      "settings": { ... },
      ...
    }
  }
}
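
For illustration only (this is not the PR's implementation), here is a minimal TypeScript sketch of how such a record could be assembled with the @elastic/elasticsearch client, assuming a local cluster and an existing data stream; getDataStream, getIndexTemplate and simulateTemplate are used to resolve the component templates into the effective mappings and settings:

import { Client } from '@elastic/elasticsearch';

// Hypothetical standalone setup; es_archiver wires its client from the --es-url flag.
const client = new Client({ node: 'http://elastic:changeme@localhost:9200' });

async function buildDataStreamRecord(dataStreamName: string) {
  // The data stream API reports which composable index template it was created from.
  const { data_streams: dataStreams } = await client.indices.getDataStream({
    name: dataStreamName,
  });
  const templateName = dataStreams[0].template;

  // The index template provides the index patterns...
  const { index_templates: indexTemplates } = await client.indices.getIndexTemplate({
    name: templateName,
  });
  const indexPatterns = indexTemplates[0].index_template.index_patterns;

  // ...while the simulate API resolves its component templates into
  // effective mappings and settings.
  const { template } = await client.indices.simulateTemplate({ name: templateName });

  return {
    type: 'data_stream' as const,
    value: {
      data_stream: dataStreamName,
      template: {
        index_patterns: indexPatterns,
        mappings: template.mappings,
        settings: template.settings,
      },
    },
  };
}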

While multiple backing indices of the same data stream can be returned, we only output a single entry, containing the latest mappings and settings.

The documents associated with a data stream keep the same doc type but carry an additional data_stream property, used at load time to index them to the appropriate target (we can't write directly to a backing index) and to pick the correct bulk operation (data streams only accept create).
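
A rough sketch of that load-time decision (illustrative only; the DocRecord shape below is an assumption, not the archiver's actual types):

interface DocRecord {
  index: string;
  data_stream?: string;
  source: Record<string, unknown>;
}

// Data streams must be addressed by name and only accept the `create` bulk op;
// plain indices keep the default `index` op targeting the stored index name.
function toBulkOperation(doc: DocRecord) {
  const op = doc.data_stream ? 'create' : 'index';
  const target = doc.data_stream ?? doc.index;
  return [{ [op]: { _index: target } }, doc.source];
}

// e.g. await client.bulk({ operations: docs.flatMap(toBulkOperation) });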

Note:
The index template could reference an ILM policy, which we currently don't save or create when loading the archive. We can add that if there are use cases that would benefit from it.

Testing

  • the new data streams paths are unit tested
  • functional test suites are green
  • archived local and cloud data streams

Manual steps

  • Create a data stream or target an existing one (see Set up a data stream)
  • Archive it
    node scripts/es_archiver.js save ~/my-data-stream my-data-stream --es-url=http://elastic:changeme@localhost:9200 --kibana-url=http://elastic:changeme@localhost:5601/pat
  • Load the archive (although loading removes existing resources, testing against a clean cluster is recommended)
    node scripts/es_archiver.js load ~/my-data-stream --es-url=http://elastic:changeme@localhost:9200 --kibana-url=http://elastic:changeme@localhost:5601/pat
  • Inspect the loaded data stream and template
  • Unload the archive
    node scripts/es_archiver.js unload ~/my-data-stream --es-url=http://elastic:changeme@localhost:9200 --kibana-url=http://elastic:changeme@localhost:5601/pat
  • Verify the resources are gone (see the verification sketch below)
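
A possible way to script that verification step (a sketch assuming a local cluster; the template name below is hypothetical, and curl or Kibana Dev Tools work just as well):

import { Client } from '@elastic/elasticsearch';

const client = new Client({ node: 'http://elastic:changeme@localhost:9200' });

async function verifyUnloaded(dataStream: string, templateName: string) {
  // `indices.exists` resolves data streams as well as indices and aliases.
  const streamExists = await client.indices.exists({ index: dataStream });
  const templateExists = await client.indices.existsIndexTemplate({ name: templateName });
  // Both should be false after a successful unload.
  console.log({ streamExists, templateExists });
}

verifyUnloaded('my-data-stream', 'my-data-stream-template').catch(console.error);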

@klacabane klacabane changed the title aliases fallback esArchiver datastream support May 25, 2022
@klacabane (Contributor, Author)

@elasticmachine merge upstream

@klacabane klacabane added the Team:Operations and v8.3.0 labels May 29, 2022
@klacabane klacabane self-assigned this May 29, 2022
@klacabane klacabane marked this pull request as ready for review May 29, 2022 18:54
@klacabane klacabane requested a review from a team as a code owner May 29, 2022 18:54
@elasticmachine (Contributor)

Pinging @elastic/kibana-operations (Team:Operations)

@spalger spalger (Contributor) left a comment

LGTM, thank you so much for getting this in!

Comment on lines 78 to +82
// if keepIndexNames is false, rewrite the .kibana_* index to .kibana_1 so that
// when it is loaded it can skip migration, if possible
index:
hit._index.startsWith('.kibana') && !keepIndexNames ? '.kibana_1' : hit._index,
data_stream: dataStream,
@spalger spalger (Contributor) commented Jun 2, 2022

Nit: Part of me would prefer that docs either had an index or a data_stream, but I'm not opposed to keeping the index if there's some use for it.

@klacabane (Contributor, Author) replied:

I mainly kept it for traceability when debugging or inspecting archived data; besides that, there's no real use for it :)

@klacabane (Contributor, Author)

@elasticmachine merge upstream

@kibana-ci (Collaborator)

💚 Build Succeeded

Metrics [docs]

✅ unchanged


To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

cc @klacabane

@klacabane klacabane merged commit 4c4f0f5 into elastic:main Jun 2, 2022
@kibanamachine (Contributor)

Friendly reminder: Looks like this PR hasn’t been backported yet.
To create backports automatically, add the auto-backport label, or prevent reminders by adding the backport:skip label.
You can also create backports manually by running node scripts/backport --pr 132853 locally

@kibanamachine kibanamachine added the backport missing label Jun 6, 2022
@spalger spalger added v8.4.0 and removed v8.3.0 labels Jun 6, 2022
@kibanamachine kibanamachine added the backport:skip label and removed the backport missing label Jun 6, 2022
@klacabane klacabane added the auto-backport and v8.3.1 labels and removed the backport:skip label Jun 24, 2022
kibanamachine pushed a commit that referenced this pull request Jun 24, 2022
* aliases fallback

* nasty datastream support implementation

* datastreams stats method

* update filter stream

* datastream support for unload action

* create-index datastream support

* index records data stream support

* doc records data streams support

* [CI] Auto-commit changed files from 'node scripts/eslint --no-cache --fix'

* lint

* pull composable templates

* set data_stream as a separate property on documents

* force create bulk operation when datastream record

* [CI] Auto-commit changed files from 'node scripts/eslint --no-cache --fix'

* lint

* getIndexTemplate tests

* [CI] Auto-commit changed files from 'node scripts/precommit_hook.js --ref HEAD~1..HEAD --fix'

* share cache across transform executions

Co-authored-by: kibanamachine <[email protected]>
(cherry picked from commit 4c4f0f5)
@kibanamachine (Contributor)

💚 All backports created successfully

Branch: 8.3 (backport created)

Note: Successful backport PRs will be merged automatically after passing CI.

Questions?

Please refer to the Backport tool documentation

kibanamachine added a commit that referenced this pull request Jun 24, 2022

Co-authored-by: Kevin Lacabane <[email protected]>
Labels
auto-backport · release_note:enhancement · Team:Operations · v8.3.1 · v8.4.0

Successfully merging this pull request may close these issues.

Add support for data streams in ES Archiver