Expose index health and status to the _stats API #81954

gmarouli · 2021-12-20T17:44:33Z

Expose the index health and status to the _stats API. We only enrich the IndicesStatsResponse with the new fields when it is called by the API in order to not affect the performance during internal usage.

Resolve #80413

dakrone

Thanks for working on this, Mary! I left some comments about the implementation.

server/src/main/java/org/elasticsearch/action/admin/indices/stats/IndicesStatsResponse.java

dakrone · 2021-12-20T20:48:10Z

    ) {
        super(totalShards, successfulShards, failedShards, shardFailures);
        this.shards = shards;
+        this.clusterState = clusterState;


This isn't serialized in the constructor and writeTo methods, so it's going to be null quite frequently.

I think we have at least two alternative options:

The first is to continue to pass the cluster state into this method, but construct a map of indexName -> state & health, which is serialized in the IndicesStatsResponse so that it's available on every node. Constructing this map would happen in the constructor, so that we don't hold on to a reference to the cluster state in the class itself.

The second would be to push the health and state into the ShardStats object itself, so that we could collate the index health and open/closed state from the ShardStats the way we calculate other states in getIndices().

I think I lean slightly towards the second option, because it would allow us to expose this information (state and health) in the shard stats API as well, re-using it for this particular use case. What do you think?
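A minimal sketch of the first option, to make the trade-off concrete. It is not the PR's code: `StatsResponseSketch`, `IndexInfo`, `infoFor`, and `roundTrip` are hypothetical names, plain `java.io` streams stand in for Elasticsearch's stream classes, and a simple map stands in for the cluster state. The constructor copies only the per-index state and health it needs, so no cluster state reference is retained, and `writeTo` plus the reading constructor carry the map across the wire.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical, simplified stand-in for IndicesStatsResponse (not the real class).
class StatsResponseSketch {
    enum Health { GREEN, YELLOW, RED }
    enum State { OPEN, CLOSE }

    // Per-index metadata copied out of the (assumed) cluster state view.
    record IndexInfo(State state, Health health) {}

    private final Map<String, IndexInfo> indexInfoByName;

    // Copies entries only for indices that have shards in this response;
    // the cluster state view itself is not stored.
    StatsResponseSketch(List<String> shardIndexNames, Map<String, IndexInfo> clusterStateView) {
        Map<String, IndexInfo> copy = new HashMap<>();
        for (String index : shardIndexNames) {
            IndexInfo info = clusterStateView.get(index);
            if (info != null) {
                copy.put(index, info);
            }
        }
        this.indexInfoByName = copy;
    }

    // Reading constructor: must mirror writeTo exactly.
    StatsResponseSketch(DataInputStream in) throws IOException {
        int size = in.readInt();
        Map<String, IndexInfo> read = new HashMap<>();
        for (int i = 0; i < size; i++) {
            String name = in.readUTF();
            State state = State.values()[in.readByte()];
            Health health = Health.values()[in.readByte()];
            read.put(name, new IndexInfo(state, health));
        }
        this.indexInfoByName = read;
    }

    void writeTo(DataOutputStream out) throws IOException {
        out.writeInt(indexInfoByName.size());
        for (Map.Entry<String, IndexInfo> entry : indexInfoByName.entrySet()) {
            out.writeUTF(entry.getKey());
            out.writeByte(entry.getValue().state().ordinal());
            out.writeByte(entry.getValue().health().ordinal());
        }
    }

    // Returns null when the index was unknown at construction time.
    IndexInfo infoFor(String index) {
        return indexInfoByName.get(index);
    }

    // Serializes and deserializes, simulating a hop to another node.
    static StatsResponseSketch roundTrip(StatsResponseSketch original) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (DataOutputStream out = new DataOutputStream(bytes)) {
                original.writeTo(out);
            }
            return new StatsResponseSketch(
                new DataInputStream(new ByteArrayInputStream(bytes.toByteArray())));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Because the map is written and read explicitly, the health and state survive the round trip instead of arriving as null on the receiving node.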

I was having a hard time deciding how to approach it as well. That's why this draft contains the least invasive option to start with.

I agree these are the most obvious alternatives. I had the following concerns:

1) Keep a map with index state and health
Pros:

Minimal penalty during serialization, it's just a map with the metadata

This feels to me like the most logical place

Cons:

Calculation overhead in the constructor since we would have to go through the shards to determine the relevant indices

2) Add it to the ShardStats
Pros:

Nicer code-wise, and the new information will be available right where we need it, with no need for extra maps etc.

Cons:

This feels more "convenient" than "right" to me. What I mean is that an index has one or more shards and the ShardStats class contains information about a single shard (if I am not mistaken), so adding index information feels like stretching the scope of ShardStats to also cover the index.

I would like to put out there one more option, again with some trade-offs:

3) Initialize indices in the constructor and add it in the serialization
Pros:

No intermediary maps, the information is available right where we need it

This feels to me like the most logical place

Cons:

Calculation overhead in the constructor, since we would have to go through the shards to determine the relevant indices (if we need this information often, that's not a big deal because we go through the shards only once).

Serialization penalty, with this option we are increasing the serialized object significantly.

What do you think? Based on this I am kind of leaning towards the first or the third.

I gave it a try and implemented the extra maps because it felt like the best option. Re-reading your initial comment, you say:

it would allow us to expose this information (state and health) in the shard stats API as well, re-using it for this particular use case

This is not covered in this solution. Do you think this benefit is strong enough to justify adding information about the whole index to the stats of a single shard?

server/src/main/java/org/elasticsearch/action/admin/indices/stats/IndicesStatsResponse.java

server/src/main/java/org/elasticsearch/cluster/health/ClusterHealthStatus.java

gmarouli · 2021-12-22T10:16:54Z

@dakrone Thank you for the comments, they were really helpful for me!

gmarouli · 2021-12-22T10:27:53Z

@elasticmachine update branch

dakrone

Thanks Mary! I think this is a better implementation. I left a couple of comments and answers to your comment here (because GitHub won't let me attach a comment to your response for some reason):

2) Add it to the ShardStats
Cons:

This feels more "convenient" than "right" to me. What I mean is that an index has one or more shards and the ShardStats class contains information about a single shard (if I am not mistaken), so adding index information feels like stretching the scope of ShardStats to also cover the index.

Well, a shard does itself have the concept of state (open or closed), though it's confusing to re-use IndexMetadata.State for a shard, since it says index metadata and not shard metadata. We don't have an enum/state at the shard level because we don't really expose it outside of the concept of an index (yet).

As for health, we have the concept of assigned, initializing, or unassigned for a shard, which very roughly maps on to the index health (I guess it would be either red or green in all cases), so it's almost-but-not-quite pertinent to a shard in addition to an index.

I think it's fine to wait on this sort of fine-grained state for a single shard until we figure out exactly what we want from the "fine-grained health API" project.
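A rough sketch of the shard-to-health mapping described above, under loudly stated assumptions: `ShardHealthSketch`, `ShardAllocation`, `shardHealth`, and `indexHealth` are hypothetical names, an assigned copy counts as green and anything else as red, and the index takes the worst value among its shards. The sketch ignores the primary/replica distinction that real index health depends on, which is why yellow never comes out of it.

```java
import java.util.List;

// Hypothetical illustration only; not Elasticsearch's health computation.
class ShardHealthSketch {
    enum Health { GREEN, YELLOW, RED }

    // The three shard-level conditions named in the comment above.
    enum ShardAllocation { ASSIGNED, INITIALIZING, UNASSIGNED }

    // Assumption: only a fully assigned shard copy counts as healthy.
    static Health shardHealth(ShardAllocation allocation) {
        return allocation == ShardAllocation.ASSIGNED ? Health.GREEN : Health.RED;
    }

    // The index is as healthy as its least healthy shard (enum order: GREEN < YELLOW < RED).
    static Health indexHealth(List<ShardAllocation> shards) {
        Health worst = Health.GREEN;
        for (ShardAllocation allocation : shards) {
            Health health = shardHealth(allocation);
            if (health.compareTo(worst) > 0) {
                worst = health;
            }
        }
        return worst;
    }
}
```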

3) Initialize indices in the constructor and add it in the serialization
Pros:

No intermediary maps, the information is available right where we need it

This feels to me like the most logical place

Cons:

Calculation overhead in the constructor, since we would have to go through the shards to determine the relevant indices (if we need this information often, that's not a big deal because we go through the shards only once).

Serialization penalty, with this option we are increasing the serialized object significantly.

I agree this is the most logical from a behavior point of view; however, I am also concerned about the overhead for serialization and construction. I took a look at the git-blame output for this, and apparently it wasn't a performance optimization added later; it has been implemented that way (calculated on getIndices()) the entire time.

What do you think? Based on this I am kind of leaning towards the first or the third.

I agree, I also would lean towards the first or the third. I think the indices stats API is hit fairly heavily from the monitoring side, so for that reason, I think we should stick with the first option for now. We can always change it later since it is an implementation detail and wouldn't break anything to change.
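For illustration, a sketch of what "calculated on getIndices()" amounts to: the per-index grouping is built on first access rather than in the constructor or during serialization. `LazyIndicesSketch`, `ShardStat`, and the `computations` counter are hypothetical, and caching the result after the first call is an assumption of this sketch rather than something stated in the thread.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in; shows lazy grouping of shard stats by index.
class LazyIndicesSketch {
    record ShardStat(String index, int shardId, long docs) {}

    private final List<ShardStat> shards;
    private Map<String, List<ShardStat>> byIndex; // built on first getIndices() call
    int computations = 0;                          // exposed only to observe the laziness

    LazyIndicesSketch(List<ShardStat> shards) {
        this.shards = shards; // cheap: no grouping work happens here
    }

    Map<String, List<ShardStat>> getIndices() {
        if (byIndex != null) {
            return byIndex; // reuse the earlier result
        }
        computations++;
        Map<String, List<ShardStat>> grouped = new LinkedHashMap<>();
        for (ShardStat shard : shards) {
            grouped.computeIfAbsent(shard.index(), name -> new ArrayList<>()).add(shard);
        }
        byIndex = Collections.unmodifiableMap(grouped);
        return byIndex;
    }
}
```

Callers that only forward or serialize the response never pay for the grouping, which is the property the first option preserves.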

dakrone · 2021-12-22T19:05:34Z

server/src/main/java/org/elasticsearch/action/admin/indices/stats/IndicesStatsResponse.java

    ) {
        super(totalShards, successfulShards, failedShards, shardFailures);
        this.shards = shards;
+        if (clusterState != null) {


When do we expect the cluster state to be null? As far as I can tell it would only be in the tests.

I think we should require it to be non-null (with Objects.requireNonNull(clusterState)) and gracefully handle the case when clusterState.getMetadata().index(index) returns null. Then in tests, you can pass ClusterState.EMPTY_STATE if you don't need to construct a cluster state for anything.
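A small sketch of that suggestion, assuming a simplified model: `NonNullStateSketch`, `StateView`, and `collectHealth` are hypothetical, and `StateView.EMPTY` plays the role that ClusterState.EMPTY_STATE would play in tests. Only `Objects.requireNonNull` is the real JDK call.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Objects;

// Hypothetical stand-in for the null handling discussed above.
class NonNullStateSketch {
    enum Health { GREEN, YELLOW, RED }

    // Simplified cluster state view; EMPTY is the test-friendly default.
    record StateView(Map<String, Health> healthByIndex) {
        static final StateView EMPTY = new StateView(Map.of());
    }

    static Map<String, Health> collectHealth(List<String> indices, StateView state) {
        // Fail fast on a missing state instead of silently skipping the enrichment.
        Objects.requireNonNull(state, "cluster state must not be null");
        Map<String, Health> result = new HashMap<>();
        for (String index : indices) {
            Health health = state.healthByIndex().get(index);
            if (health != null) { // index absent from the metadata: skip it gracefully
                result.put(index, health);
            }
        }
        return result;
    }
}
```

With this shape, a test that does not care about health can pass the empty view and get an empty map back, while a null state is rejected immediately.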

server/src/main/java/org/elasticsearch/action/admin/indices/stats/IndexStats.java

gmarouli · 2021-12-23T13:50:19Z

@elasticmachine update branch

gmarouli · 2021-12-23T14:55:31Z

Thanks for thinking along with me. I am looking forward to seeing how the fine-grained health API takes shape!

elasticmachine · 2021-12-23T19:19:09Z

Pinging @elastic/es-data-management (Team:Data Management)

dakrone

LGTM, thanks for iterating on this Mary!

Replace switch with switch expression (JAVA 17)

4b603ab

elasticsearchmachine added the v8.1.0 label Dec 20, 2021

Expose index health and status via _stat api

e0cc3f1

dakrone requested changes Dec 20, 2021

Use ClusterIndexHealth to determine index health

bf256d6

elasticmachine and others added 2 commits December 22, 2021 02:27

Merge branch 'master' into expose-state-health-in-index-stats

63b3573

Serialize the index health and state info

2569c7f

gmarouli requested a review from dakrone December 22, 2021 14:38

dakrone requested changes Dec 22, 2021

Polishing

c15d5c0

Merge branch 'master' into expose-state-health-in-index-stats

4673889

gmarouli marked this pull request as ready for review December 23, 2021 14:53

gmarouli requested a review from dakrone December 23, 2021 14:55

gmarouli added the :Data Management/Indices APIs APIs to create and manage indices and templates label Dec 23, 2021

elasticmachine added the Team:Data Management Meta label for data/management team label Dec 23, 2021

gmarouli added the >enhancement label Dec 23, 2021

dakrone approved these changes Jan 4, 2022

gmarouli merged commit b118d84 into elastic:master Jan 10, 2022

gmarouli deleted the expose-state-health-in-index-stats branch January 10, 2022 08:59

This was referenced Jan 12, 2022

Add health and status properties to the Indices Stats API spec elastic/elasticsearch-specification#1253

Closed

[Index Management] Remove _cat/indices API from server code elastic/kibana#122867

Merged

3kt mentioned this pull request Nov 8, 2024

Index stats enhancement: creation date and tier_preference #116339

Merged

DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this pull request Nov 13, 2024

Add YAML test for status in indices stats

56dd10b

The feature added in elastic#81954 lacks coverage in BwC situations. This commit adds a YAML test to address that.

DaveCTurner mentioned this pull request Nov 13, 2024

Add YAML test for status in indices stats #116711

Merged

DaveCTurner added a commit that referenced this pull request Nov 29, 2024

Add YAML test for status in indices stats (#116711)

17d2803

The feature added in #81954 lacks coverage in BwC situations. This commit adds a YAML test to address that.
