Add known issue docs for #75598 #79221

DaveCTurner · 2021-10-15T08:02:11Z

Adds a description of #75598, and the mitigation, to the release notes
of versions 7.13.2 through 7.14.0.

Adds a description of elastic#75598, and the mitigation, to the release notes of versions 7.13.2 through 7.14.0.

elasticmachine · 2021-10-15T09:08:02Z

Pinging @elastic/es-docs (Team:Docs)

elasticmachine · 2021-10-15T09:08:03Z

Pinging @elastic/es-distributed (Team:Distributed)

jrodewig

LGTM. Aside from some minor wording nits, I think we should include a snippet for the setting update. Thanks @DaveCTurner!

jrodewig · 2021-10-15T13:51:00Z

docs/reference/release-notes/7.13.asciidoc

+causing future restore operations to fail. To mitigate this problem, prevent
+concurrent snapshot operations by setting
+`snapshot.max_concurrent_operations: 1`.
+


Since the remediation step is a single API call, I'd include it here. If you'd rather not do that, I'd at least state you can update snapshot.max_concurrent_operations using the update cluster settings API (with a link).

Suggested change

+

+

[source,console]

----

PUT _cluster/settings

{

"persistent" : {

"snapshot.max_concurrent_operations" : 1

}

}

----

+

👍 good idea.

jrodewig · 2021-10-15T13:56:08Z

docs/reference/release-notes/7.13.asciidoc

+* Snapshot and restore: If a running snapshot is cancelled while a
+previously-started snapshot is still ongoing and a later snapshot is enqueued
+then there is a risk that some shard data may be lost from the repository,
+causing future restore operations to fail. To mitigate this problem, prevent
+concurrent snapshot operations by setting
+`snapshot.max_concurrent_operations: 1`.


Minor edits to reword some passive voice. There is still some passive voice in here, but I think this reads better. Feel free to ignore if wanted tho.

Suggested change

* Snapshot and restore: If a running snapshot is cancelled while a

previously-started snapshot is still ongoing and a later snapshot is enqueued

then there is a risk that some shard data may be lost from the repository,

causing future restore operations to fail. To mitigate this problem, prevent

concurrent snapshot operations by setting

`snapshot.max_concurrent_operations: 1`.

* Snapshot and restore: If you cancel a running snapshot while a

previously-started snapshot is still ongoing and a later snapshot is enqueued,

the repository may lose some shard data. This can cause future restore

operations to fail. To mitigate this problem, set

`snapshot.max_concurrent_operations` to `1` to prevent concurrent snapshot

operations.

I've left the first bit of passive voice in there ("if a running snapshot is cancelled" etc) since users will typically hit this when snapshots are being run by other components (SLM or ILM for instance) rather than when running snapshots themselves.

DaveCTurner · 2021-10-15T14:27:25Z

Sorry, I messed up a merge and brought in some commits from a different branch. Force-pushed to fix it, but didn't change any reviewed commits.

jrodewig

No worries at all. Still looks good. Thanks!

Adds a description of #75598, and the mitigation, to the release notes of versions 7.13.2 through 7.14.0.

The known-issue docs give the impression that an upgrade will restore the lost data in the repository. This isn't the case, so this commit clarifies this in the docs. Relates elastic#73456 Relates elastic#75598 Relates elastic#79221

The known-issue docs give the impression that an upgrade will restore the lost data in the repository. This isn't the case, so this commit clarifies this in the docs. Relates #73456 Relates #75598 Relates #79221

DaveCTurner added >docs General docs changes :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs v7.13.5 v7.16.0 v7.14.3 v7.15.2 labels Oct 15, 2021

Add known issue docs for elastic#75598

4ca78e9

Adds a description of elastic#75598, and the mitigation, to the release notes of versions 7.13.2 through 7.14.0.

DaveCTurner force-pushed the 2021-10-15-75598-known-issue-docs branch from e690314 to 4ca78e9 Compare October 15, 2021 08:56

DaveCTurner marked this pull request as ready for review October 15, 2021 09:07

elasticmachine added Team:Docs Meta label for docs team Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. labels Oct 15, 2021

DaveCTurner requested a review from jrodewig October 15, 2021 09:08

jrodewig approved these changes Oct 15, 2021

View reviewed changes

DaveCTurner added 2 commits October 15, 2021 15:25

Merge branch '7.x' into 2021-10-15-75598-known-issue-docs

f76962a

Review comments

b211069

DaveCTurner force-pushed the 2021-10-15-75598-known-issue-docs branch from e673f36 to b211069 Compare October 15, 2021 14:26

jrodewig approved these changes Oct 15, 2021

View reviewed changes

DaveCTurner added the auto-backport-and-merge label Oct 15, 2021

DaveCTurner merged commit afc3814 into elastic:7.x Oct 15, 2021

DaveCTurner deleted the 2021-10-15-75598-known-issue-docs branch October 15, 2021 14:40

DaveCTurner added the backport pending label Oct 15, 2021

DaveCTurner added a commit that referenced this pull request Oct 15, 2021

Add known issue docs for #75598 (#79221)

fecc105

Adds a description of #75598, and the mitigation, to the release notes of versions 7.13.2 through 7.14.0.

DaveCTurner added a commit that referenced this pull request Oct 15, 2021

Add known issue docs for #75598 (#79221)

850ce64

Adds a description of #75598, and the mitigation, to the release notes of versions 7.13.2 through 7.14.0.

DaveCTurner added a commit that referenced this pull request Oct 15, 2021

Add known issue docs for #75598 (#79221)

5f8cb09

Adds a description of #75598, and the mitigation, to the release notes of versions 7.13.2 through 7.14.0.

DaveCTurner mentioned this pull request Nov 11, 2021

Add docs about repair of repo affected by corruption bug #80662

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add known issue docs for #75598 #79221

Add known issue docs for #75598 #79221

DaveCTurner commented Oct 15, 2021

elasticmachine commented Oct 15, 2021

elasticmachine commented Oct 15, 2021

jrodewig left a comment

jrodewig Oct 15, 2021

DaveCTurner Oct 15, 2021

jrodewig Oct 15, 2021

DaveCTurner Oct 15, 2021

DaveCTurner commented Oct 15, 2021

jrodewig left a comment

-+
++
+[source,console]
+----
+PUT _cluster/settings
+{
+  "persistent" : {
+    "snapshot.max_concurrent_operations" : 1
+  }
+}
+----
++

Add known issue docs for #75598 #79221

Add known issue docs for #75598 #79221

Conversation

DaveCTurner commented Oct 15, 2021

elasticmachine commented Oct 15, 2021

elasticmachine commented Oct 15, 2021

jrodewig left a comment

Choose a reason for hiding this comment

jrodewig Oct 15, 2021

Choose a reason for hiding this comment

DaveCTurner Oct 15, 2021

Choose a reason for hiding this comment

jrodewig Oct 15, 2021

Choose a reason for hiding this comment

DaveCTurner Oct 15, 2021

Choose a reason for hiding this comment

DaveCTurner commented Oct 15, 2021

jrodewig left a comment

Choose a reason for hiding this comment