[WIP] Operator and plugin read-only settings #86224

grcevski · 2022-04-27T14:15:06Z

Introduction

This PR introduces the concept of operator settings. Operator settings have the following properties:

They are read-only settings that cannot be changed by the REST API.
They can be cluster settings or entity settings like ILM policies.
Operator settings can be created, modified, or deleted only through 'operator mode'. The only operator mode available in this PR is file based settings; the intent, however, is to enable plugins and modules to set their own 'immutable' settings.

Implementation details

We have multiple components introduced by this PR:

File Settings Service

Watches a directory under the Elasticsearch config directory for changes to a file called settings.json. This JSON file describes the settings that the operator wants to set and that will be immutable through the REST API. Changes to these settings are only allowed by modifying the file. By default we watch the path operator/settings.json under the config directory. The name of the operator directory can be changed via settings.

The File Settings Service notices updates to the file and applies them. The operator directory doesn't have to exist up front; the File Settings Service will notice when it's created. The recommended way to change operator file based settings is to write the modified settings.json file in a brand new location and then create a symlink to that location under the Elasticsearch config directory.
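The symlink step recommended above could be sketched as follows. This is a minimal illustration, not code from the PR: the class name OperatorDirSwap, the temporary link name operator.tmp, and the choice to symlink the whole operator directory are assumptions. It also assumes a POSIX-like filesystem where renaming a symlink over an existing symlink replaces it; on other platforms the ATOMIC_MOVE call may fail instead.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical helper for the operator side; not part of Elasticsearch.
final class OperatorDirSwap {

    /**
     * Points configDir/operator at newSettingsDir, which is expected to already
     * contain a complete settings.json. The link is created under a temporary
     * name and then renamed into place, so a reader never sees a partial file.
     * Assumes configDir/operator is either absent or already a symlink.
     */
    static void publish(Path configDir, Path newSettingsDir) {
        Path link = configDir.resolve("operator");
        Path tmpLink = configDir.resolve("operator.tmp"); // assumed scratch name
        try {
            Files.deleteIfExists(tmpLink);
            Files.createSymbolicLink(tmpLink, newSettingsDir.toAbsolutePath());
            // Moves the link itself (not its target) over the previous link.
            Files.move(tmpLink, link, StandardCopyOption.ATOMIC_MOVE);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Each new revision of the settings would be written to a fresh directory and then published with a single call, leaving the previous directory untouched for rollback.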

Operator Cluster State Controller

This is a reusable component that has a number of Operator Handler components that know how to update parts of the Elasticsearch state. This component is only used by the File Settings Service at the moment, but the intent is for modules and plugins to be able to call it to save settings and entities in the cluster state that cannot be changed by the end user.
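To make the controller/handler split more concrete, here is a rough sketch of a registry that routes each named section to a handler. All names (SketchStateController, SketchHandler, keysOf, topLevelKeys) are hypothetical and do not reflect the actual OperatorHandler interface from this PR; the sketch only illustrates dispatch by section name.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch; the real controller and handler classes may look different.
final class SketchStateController {

    /** Assumed handler contract: a name plus the keys it would own for a section. */
    interface SketchHandler {
        String name();
        Set<String> keysOf(Map<String, Object> section);
    }

    private final Map<String, SketchHandler> handlers = new LinkedHashMap<>();

    void register(SketchHandler handler) {
        if (handlers.putIfAbsent(handler.name(), handler) != null) {
            throw new IllegalArgumentException("duplicate handler [" + handler.name() + "]");
        }
    }

    /** Routes every top-level section to the handler registered under the same name. */
    Map<String, Set<String>> process(Map<String, Map<String, Object>> state) {
        Map<String, Set<String>> keysByHandler = new LinkedHashMap<>();
        for (Map.Entry<String, Map<String, Object>> section : state.entrySet()) {
            SketchHandler handler = handlers.get(section.getKey());
            if (handler == null) {
                throw new IllegalArgumentException("no handler for section [" + section.getKey() + "]");
            }
            keysByHandler.put(section.getKey(), handler.keysOf(section.getValue()));
        }
        return keysByHandler;
    }

    /** Trivial handler for illustration: it owns the top-level keys of its section. */
    static SketchHandler topLevelKeys(final String name) {
        return new SketchHandler() {
            @Override
            public String name() {
                return name;
            }

            @Override
            public Set<String> keysOf(Map<String, Object> section) {
                return new TreeSet<>(section.keySet());
            }
        };
    }
}
```

Because callers only hand over a map of named sections, the same entry point could serve the file watcher today and modules or plugins later.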

Operator Metadata

This is an extension to the ClusterState Metadata object where we store various information about the operator-updated state. This Operator Metadata object contains the version and the 'keys' that were set in the cluster state through the operator interface. We use this information to coordinate updates and to ensure that the keys of the cluster state set by the operator are not overwritten by the REST API.

Example operator state to be set by file based settings

{
    "metadata": {
        "version": "1234",
        "compatibility": "8.4.0"
    },
    "state": {
        "cluster_settings": {
            "indices.recovery.max_bytes_per_sec": "50mb"
        },
        "ilm": {
            "my_timeseries_lifecycle": {
                "policy": {
                    "phases": {
                        "warm": {
                            "min_age": "10s",
                            "actions": {
                            }
                        },
                        "delete": {
                            "min_age": "30s",
                            "actions": {
                            }
                        }
                    }
                }
            }
        }
    }
}

In the example above we can see the following details:

Metadata information

We show an example of how the version can be set along with the Elasticsearch version for compatibility. The version metadata field is assumed to be of type long, and it's encoded as a JSON string to avoid precision errors in certain JSON implementations (e.g. JavaScript). For proper usage, the version must be bumped each time the cluster state is updated. Each new version has to be greater than the previous one, but the values need not be consecutive, e.g. we can use a consistent epoch timestamp generator for versioning purposes.
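The version rule can be illustrated with a small check. This is a sketch under assumptions: the class and method names are made up, and "apply only if strictly greater than the last applied version" is my reading of the description rather than confirmed behavior.

```java
// Hypothetical sketch of the version gate; names are not from the PR.
final class OperatorVersionCheck {

    /** Parses the string-encoded version; throws NumberFormatException if it is not a long. */
    static long parse(String raw) {
        return Long.parseLong(raw);
    }

    /** Assumed rule: apply a file only when its version is greater than the last applied one. */
    static boolean shouldApply(long lastApplied, String incoming) {
        return parse(incoming) > lastApplied;
    }

    /** Shows why a JSON number is risky: values above 2^53 do not survive a trip through a double. */
    static boolean survivesDouble(long value) {
        return (long) (double) value == value;
    }
}
```

Gaps between versions are fine under this rule, so epoch timestamps work, while re-reading a file with an unchanged version is a no-op.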

The compatibility field is used to enforce a minimum Elasticsearch version that can understand the file content. We can use this field to avoid trying to apply the cluster settings to an incompatible Elasticsearch version in mixed version clusters.
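A minimal sketch of how such a compatibility gate might look is shown below. The class name, the plain dotted-number comparison, and the idea of checking every node in a mixed version cluster are assumptions for illustration; Elasticsearch has its own version type, which this does not use.

```java
import java.util.List;

// Hypothetical sketch; not the PR's implementation.
final class CompatibilityCheck {

    /** Compares dotted numeric versions such as "8.4.0" component by component. */
    static int compareVersions(String a, String b) {
        String[] x = a.split("\\.");
        String[] y = b.split("\\.");
        for (int i = 0; i < Math.max(x.length, y.length); i++) {
            int xi = i < x.length ? Integer.parseInt(x[i]) : 0;
            int yi = i < y.length ? Integer.parseInt(y[i]) : 0;
            if (xi != yi) {
                return Integer.compare(xi, yi);
            }
        }
        return 0;
    }

    /** A node can read the file if it is at least the version named in "compatibility". */
    static boolean canApply(String fileCompatibility, String nodeVersion) {
        return compareVersions(nodeVersion, fileCompatibility) >= 0;
    }

    /** Assumed policy for mixed clusters: every node must be able to read the file. */
    static boolean canApplyToCluster(String fileCompatibility, List<String> nodeVersions) {
        for (String nodeVersion : nodeVersions) {
            if (!canApply(fileCompatibility, nodeVersion)) {
                return false;
            }
        }
        return true;
    }
}
```

Comparing components numerically rather than as strings matters once minor versions reach two digits, e.g. 8.10.0 is newer than 8.4.0.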

State information

In this case we show an example of two separate operator state handlers: cluster settings and ilm.

This cluster state information will be applied as one single cluster state update. Even though the two transport actions are separate when we use the REST API, with operator state updates we first validate everything up-front and then we use a joint cluster state update.
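The "validate everything up-front, then apply once" flow could look roughly like this. It is a simplified sketch: the state is modeled as a flat string map, and JointStateUpdate, Handler, and the prefixed factory are hypothetical names, not the PR's classes.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of a two-phase update; the real cluster state is far richer.
final class JointStateUpdate {

    /** Assumed handler shape: a side-effect-free validation step and an apply step. */
    interface Handler {
        void validate(Map<String, String> section);
        void applyTo(Map<String, String> workingState, Map<String, String> section);
    }

    /** Toy handler: rejects blank values and stores entries under a prefix. */
    static Handler prefixed(final String prefix) {
        return new Handler() {
            @Override
            public void validate(Map<String, String> section) {
                for (Map.Entry<String, String> e : section.entrySet()) {
                    if (e.getValue().trim().isEmpty()) {
                        throw new IllegalArgumentException("empty value for [" + e.getKey() + "]");
                    }
                }
            }

            @Override
            public void applyTo(Map<String, String> workingState, Map<String, String> section) {
                for (Map.Entry<String, String> e : section.entrySet()) {
                    workingState.put(prefix + e.getKey(), e.getValue());
                }
            }
        };
    }

    static Map<String, String> update(Map<String, String> current,
                                      Map<String, Handler> handlers,
                                      Map<String, Map<String, String>> sections) {
        // Phase 1: validate every section before touching any state.
        for (Map.Entry<String, Map<String, String>> s : sections.entrySet()) {
            Handler handler = handlers.get(s.getKey());
            if (handler == null) {
                throw new IllegalArgumentException("no handler for [" + s.getKey() + "]");
            }
            handler.validate(s.getValue());
        }
        // Phase 2: apply all sections to one working copy and return it as a single new state.
        Map<String, String> next = new HashMap<>(current);
        for (Map.Entry<String, Map<String, String>> s : sections.entrySet()) {
            handlers.get(s.getKey()).applyTo(next, s.getValue());
        }
        return Collections.unmodifiableMap(next);
    }
}
```

With this shape, an invalid ilm section prevents the cluster_settings change from being applied as well, which is the all-or-nothing behavior the description implies.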

In the example above, the final OperatorMetadata will contain information about two OperatorHandlers, cluster_settings and ilm, which individually will contain the keys of the settings that are updated (indices.recovery.max_bytes_per_sec and my_timeseries_lifecycle respectively). If these two settings were set as operator state, any REST API request that tries to modify indices.recovery.max_bytes_per_sec or to mutate/delete the my_timeseries_lifecycle policy would be rejected.
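The rejection of REST requests can be pictured as a key intersection against the stored handler keys. The sketch below is an assumption-laden illustration: ReservedKeysGuard and its methods are invented names, and the exception type is a guess.

```java
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical guard built from the per-handler keys kept in the operator metadata.
final class ReservedKeysGuard {

    private final Map<String, Set<String>> keysByHandler;

    ReservedKeysGuard(Map<String, Set<String>> keysByHandler) {
        this.keysByHandler = keysByHandler;
    }

    /** Returns the requested keys owned by the operator; empty means the request may proceed. */
    Set<String> conflicts(String handlerName, Set<String> requestedKeys) {
        Set<String> result = new TreeSet<>(requestedKeys);
        result.retainAll(keysByHandler.getOrDefault(handlerName, Set.of()));
        return result;
    }

    /** Throws if a REST request touches any operator-owned key (exception type is assumed). */
    void ensureAllowed(String handlerName, Set<String> requestedKeys) {
        Set<String> owned = conflicts(handlerName, requestedKeys);
        if (!owned.isEmpty()) {
            throw new IllegalArgumentException(
                "keys " + owned + " are set in operator mode and cannot be changed through the REST API");
        }
    }
}
```

A request that touches only keys outside the stored sets passes through unchanged, so ordinary REST updates keep working for everything the operator has not claimed.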

Scope of the PR

This PR doesn't yet handle updates to role mappings, which are not stored in the cluster state. Given the PR is large as-is, the ability to set role mappings in operator mode will be added in a follow-up PR.

Documentation will also be done as a separate PR.

This PR will be used to finalize the file based settings changes after the following sub PRs are finished:

Metadata classes for Operator state (Add immutable 'operator' metadata classes for cluster state #87763)
OperatorHandler and its SPI (Add OperatorHandler interface #87767)

TODO

Unit tests for everything
Integration tests
Docker tests
Use SPI instead of extending the plugin interface

elasticmachine · 2022-04-27T14:15:08Z

Pinging @elastic/es-core-infra (Team:Core/Infra)

Extract execute logic from the transport actions for cluster update settings and ILM put/delete to support future reuse for operator file based updates. Relates to elastic#86224

grcevski · 2022-06-09T10:17:57Z

@elasticmachine run elasticsearch-ci/part-2

grcevski · 2022-07-12T14:32:53Z

Closing this Draft PR, all of the changes are either in or in waiting on PR review.

Nikola Grcevski added 7 commits April 21, 2022 12:42

WIP.

74dc598

Fix compile error.

44fc2d0

More WIP

9244455

More refactoring

764505e

Make ILM do processing

31b5ae3

Merge master

7f4898e

Fix merge

682dec4

grcevski added >enhancement, WIP, :Core/Infra/Core, Team:Core/Infra, v8.3.0 labels Apr 27, 2022

grcevski marked this pull request as draft April 27, 2022 14:15

Nikola Grcevski added 6 commits April 28, 2022 19:24

Add file watcher service

5db45de

Add additional exception handling

fc45962

Fix issue with watcher starting after service is stopped

273e56d

Add delete paths

2533448

One more thread to skip

86ec3b4

Merge master

38ebdad

grcevski mentioned this pull request May 19, 2022

Extract transport cluster settings/ilm execute logic #86941

Merged

Merge master

13160ea

craigtaverner added v8.4.0 and removed v8.3.0 labels May 25, 2022

Nikola Grcevski added 3 commits May 26, 2022 13:53

Merge branch 'master' into feature/bulk_settings

5dfc772

Refactor for modules

dc21c39

Refactor for modules

ae19af7

Nikola Grcevski added 3 commits June 8, 2022 16:14

Apply PR suggestions

fd9a1a0

Make metadata hashing consistent

503b496

Merge branch 'master' into feature/bulk_settings

ebbcdfa

grcevski added >feature and removed >enhancement labels Jun 9, 2022

Nikola Grcevski added 9 commits June 9, 2022 11:25

Support async error reporting in the controller

c89dccc

Merge branch 'master' into feature/bulk_settings

4180739

Merge branch 'master' into feature/bulk_settings

f7eee71

Apply martijnvg's suggestions

e4b5938

Replace registration with SPI

5630b52

fix javadoc

eee70fb

Change to illegalargument exception

602b33c

Fail start on bad operator config

f61b5b7

Add version check on error metadata

e81d779

grcevski mentioned this pull request Jun 16, 2022

Modularize Elasticsearch (with Java Modules) #78744

Open

93 tasks

Nikola Grcevski added 2 commits June 16, 2022 10:10

Refactor ilm section to exclude policy

4272392

Merge master

26e60a6

This was referenced Jun 16, 2022

Add immutable 'operator' metadata classes for cluster state #87763

Merged

Add OperatorHandler interface #87767

Merged

Modularize ILM/SLM #87769

Merged

This was referenced Jun 27, 2022

Add generic interface for loading service providers from plugins #88082

Merged

Implement few operator handlers #88097

Merged

grcevski added a commit that referenced this pull request Jun 29, 2022

Add immutable 'operator' metadata classes for cluster state (#87763)

e1d03ef

This commit only introduces the storage classes, unused for now. Relates to #86224

grcevski mentioned this pull request Jun 30, 2022

Immutable cluster state controller #88224

Closed

grcevski added a commit that referenced this pull request Jul 4, 2022

Implement ILM/settings operator handlers (#88097)

fc93f77

Relates to #86224

grcevski mentioned this pull request Jul 6, 2022

File Settings Service #88329

Merged

grcevski closed this Jul 12, 2022

grcevski mentioned this pull request Jul 13, 2022

Reserved cluster state service #88527

Merged
