[META] Insufficient guardrails leading to disk going full on nodes #5712

RS146BIJAY · 2023-01-05T14:33:49Z

Is your feature request related to a problem? Please describe.
Currently, we are observing multiple instances where data volume is getting 100% filled up on one or more node. We have guardrails like flood stage watermark in place which ensures that OpenSearch put blocks at the right time and enough amount of space is available for OpenSearch to perform internal operations (like segment merge, cluster state update etc.). Still, we sometime observe that available space on few or all data nodes of a domain goes to 0. This can cause node from getting removed from cluster (by either FSHealthService checks or due to cluster state update) which may ultimately result in red clusters (if it contains active primary).

Describe the solution you'd like
OpenSearch should ensure that guardrails like FloodStage watermarks are applied correctly and enough amount of space is available for OpenSearch to perform internal operations (like segment merge, cluster state update etc.).

OpenSearch Subtasks

RS146BIJAY added enhancement Enhancement or improvement to existing feature or request untriaged labels Jan 5, 2023

Bukhtawar added Meta Meta issue, not directly linked to a PR distributed framework and removed untriaged labels Jan 5, 2023

Bukhtawar changed the title ~~[META] Disk space full issue in OpenSearch~~ [META] Insufficient Guardrails leading to disk going full on nodes Jan 5, 2023

Bukhtawar changed the title ~~[META] Insufficient Guardrails leading to disk going full on nodes~~ [META] Insufficient guardrails leading to disk going full on nodes Jan 5, 2023

andrross added the Roadmap:Stability/Availability/Resiliency Project-wide roadmap label label May 31, 2024

Pallavi-AWS added this to OpenSearch Roadmap May 31, 2024

github-project-automation bot moved this to Planned work items in OpenSearch Roadmap May 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[META] Insufficient guardrails leading to disk going full on nodes #5712

[META] Insufficient guardrails leading to disk going full on nodes #5712

RS146BIJAY commented Jan 5, 2023 •

edited

Loading

[META] Insufficient guardrails leading to disk going full on nodes #5712

[META] Insufficient guardrails leading to disk going full on nodes #5712

Comments

RS146BIJAY commented Jan 5, 2023 • edited Loading

RS146BIJAY commented Jan 5, 2023 •

edited

Loading