Add the shared state to the global scope to get previous data #8447

essobedo · 2020-11-21T16:51:36Z

Required for all PRs:

Signed CLA.
Associated README.md updated.
Has appropriate unit tests.

Fix for #7793

Motivation

We would like to have a way to compare the current metric with the previous one

Modifications:

Adds the shared state to the global scope
Adds a new simple script to show how to use the shared state
Adds a link to the new script into the README.md

srebhan · 2020-11-23T08:22:19Z

While the code looks good, I'm not sure this is actually a good thing to do. There are multiple reasons IMO:

I think it is not guaranteed that the metrics are sent to apply in a time-ordered way. That means you will soon want to extend this to store N past metrics to sort them and might create all kind of corner-cases.
I'd rather think of an option to make the global scope read/write to carry over information between apply calls e.g. to have a counter or similar. However, this will probably create hard-to-debug problems as the plugin becomes stateful.
This circumvents the one-metric in multiple-metrics out scheme. Wouldn't it be better to add another function e.g. applyAll that you can use in cases where you need to process over multiple metrics? We could even copy the starlark processor and add a starlark aggregator...

@ssoroka what is your opinion on this?

ssoroka

Code review, will respond to the other comments separately.

plugins/processors/starlark/starlark.go

ssoroka · 2020-11-23T18:13:01Z

@srebhan

I think it is not guaranteed that the metrics are sent to apply in a time-ordered way. That means you will soon want to extend this to store N past metrics to sort them and might create all kind of corner-cases.

Generally they are as most inputs place them in order and Telegraf keeps them in order, but you're right that it's not a guarantee. It depends on the setup. Other than the complexity, I think storing a set of metrics should still work.

I'd rather think of an option to make the global scope read/write to carry over information between apply calls e.g. to have a counter or similar. However, this will probably create hard-to-debug problems as the plugin becomes stateful.

I considered that, but it might be better for avoiding bugs to make it explicit.

This circumvents the one-metric in multiple-metrics out scheme. Wouldn't it be better to add another function e.g. applyAll that you can use in cases where you need to process over multiple metrics? We could even copy the starlark processor and add a starlark aggregator...

Processing over multiple metrics with batches doesn't really work (where do you draw that batch border, what if you need a metric outside that grouping?). In the future all aggregators are probably going to be reimplemented as processors (likely transparently), as processors + state are exactly aggregators.

plugins/processors/starlark/starlark.go

oplehto · 2020-11-25T07:00:20Z

plugins/processors/starlark/starlark.go

+
+// Store the pair (key, value) into the shared state. If the value is None, the pair (key, value) will be
+// removed from the shared state if it exists
+func (s *State) Store(key string, value starlark.Value) {


Should there be a configurable capacity limit here to prevent OOMs?

I can foresee cases where the state store usage might blow up, for example if the name is defined based on tag values and the cardinality unexpectedly increases.

that's a good question. I'm not sure if we should handle this or just add a warning to watch out for this.

One alternative is to just the ability to check the size of the state and clear it if it exceeds a threshold. This could be implemented as additional built-in Starlark functions or just a configuration setting.

Running into an OOM is unlikely but for some of my use cases I need to trust that there is some reasonable guaranteed upper bound for memory usage when dealing with high-volume, high-cardinality data.

I could see how it might be difficult to manage this in Starlark itself.. I'll give some thought to that.

srebhan · 2020-11-26T18:49:31Z

@ssoroka well we currently have the global scope locked as Daniel didn't want the plugin to be stateful to avoid all those nasty hard-to-debug problems. This now adds a state to the plugin. Fair enough, there are plenty of use-cases for it.

I considered that, but it might be better for avoiding bugs to make it explicit.

I don't see how this helps. We still have a state.

We can do the same with inserting globals["state"] = starlark.NewDict(0) just before globals.Freeze() (line 80) and the have a script:

state = {
  "counter": 0,
  "cache": []
}

def apply(metric):
  metric.fields["lala"] = state["counter"]
  metric.fields["bubu"] = len(state["cache"])
  state["counter"] += 1
  state["cache"].append(deepcopy(metric))

  return metric

That's 1 line of code...

ssoroka · 2020-11-26T19:24:28Z

@ssoroka well we currently have the global scope locked as Daniel didn't want the plugin to be stateful to avoid all those nasty hard-to-debug problems. This now adds a state to the plugin. Fair enough, there are plenty of use-cases for it.

All the plugins have state, everything in the plugin struct is stateful.

We can do the same with inserting globals["state"] = starlark.NewDict(0) just before globals.Freeze() (line 80) and the have a script:
state = {
  "counter": 0,
  "cache": []
}

I like this too, but it's maybe a little less obvious that something special is happening with this variable as opposed to other global variables. There's another consideration that I want to soon be persisting plugin state externally to Telegraf, and it would be interesting if this plugin could take advantage of that. If it's a global var I need to reach into the plugin to get the values, but if it's functions they can easily hook into an external state store/load interface.

I'm not sure what you're saying with your example. Here's the same code with Store/Load:

def apply(metric):
  counter = Load("counter") || 0  # we should change this to support defaults, like Load("counter", 0)
  cache = Load("cache") || [] # prefer Load("cache", [])
  metric.fields["lala"] = counter
  metrics.fields["bubu"] = len(cache)
  Store("counter", counter + 1)
  Store("cache", cache.append(deepcopy(metric)))

  return metric

That's 1 line of code...

I count the state global version at 11 lines and the Store/Load funcs at 9; I don't see a huge difference.

plugins/processors/starlark/starlark.go

thatsafunnyname · 2021-02-24T20:02:35Z

Thanks for adding the shared state, I found it useful in #8903 .

I think this common question in the Starlark Processor README.md should be updated to reflect the newly available shared state:

https://github.com/influxdata/telegraf/blame/8ddbab47a46e256281392ad0aac876715189c117/plugins/processors/starlark/README.md#L166

How can I save values across multiple calls to the script?

Telegraf freezes the global scope, which prevents it from being modified.
Attempting to modify the global scope will fail with an error.

Maybe add:

A shared global dictionary named state exists, this can be used by the apply function.
See an example of this in plugins/processors/starlark/testdata/compare_metrics.star

Thanks.

essobedo · 2021-02-25T08:37:00Z

@thatsafunnyname Makes sense indeed, feel free to create a dedicated ticket for it

For influxdata#8907 . After influxdata#8447 , this common question in the Starlark Processor README.md should be updated to reflect the newly available shared state.

…data#8447)

essobedo mentioned this pull request Nov 21, 2020

Starlark Processor: Access to previous metrics, store variables #7793

Closed

sjwang90 added area/starlark feat Improvement on an existing feature such as adding a new setting/mode to an existing plugin labels Nov 23, 2020

srebhan self-assigned this Nov 23, 2020

ssoroka reviewed Nov 23, 2020

View reviewed changes

essobedo changed the title ~~Add a shared cache to get previous data~~ Add a shared state to get previous data Nov 23, 2020

Add a shared state to get previous data

7f35558

essobedo requested a review from ssoroka November 23, 2020 18:43

ssoroka reviewed Nov 23, 2020

View reviewed changes

plugins/processors/starlark/starlark.go Outdated Show resolved Hide resolved

oplehto reviewed Nov 25, 2020

View reviewed changes

Allow to store all existing types in shared state

164bc77

essobedo requested a review from ssoroka November 27, 2020 22:05

ssoroka reviewed Nov 27, 2020

View reviewed changes

plugins/processors/starlark/starlark.go Outdated Show resolved Hide resolved

Add the shared state to the global scope to get previous data

2e9994e

essobedo changed the title ~~Add a shared state to get previous data~~ Add the shared state to the global scope to get previous data Nov 30, 2020

essobedo requested a review from ssoroka November 30, 2020 20:20

ssoroka approved these changes Nov 30, 2020

View reviewed changes

ssoroka merged commit 01fc69d into influxdata:master Nov 30, 2020

essobedo deleted the 7793/shared_cache branch December 5, 2020 09:38

thatsafunnyname mentioned this pull request Feb 25, 2021

Correct common question in the Starlark Processor README about shared state. #8907

Closed

thatsafunnyname mentioned this pull request Mar 1, 2021

Correct Q+A about state #8918

Merged

arstercz pushed a commit to arstercz/telegraf that referenced this pull request Mar 5, 2023

Add the shared state to the global scope to get previous data (influx…

1be33f8

…data#8447)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add the shared state to the global scope to get previous data #8447

Add the shared state to the global scope to get previous data #8447

essobedo commented Nov 21, 2020 •

edited

Loading

srebhan commented Nov 23, 2020

ssoroka left a comment

ssoroka commented Nov 23, 2020

oplehto Nov 25, 2020

ssoroka Nov 26, 2020

oplehto Nov 27, 2020

ssoroka Nov 27, 2020

srebhan commented Nov 26, 2020 •

edited

Loading

ssoroka commented Nov 26, 2020 •

edited

Loading

thatsafunnyname commented Feb 24, 2021

essobedo commented Feb 25, 2021

Add the shared state to the global scope to get previous data #8447

Add the shared state to the global scope to get previous data #8447

Conversation

essobedo commented Nov 21, 2020 • edited Loading

Required for all PRs:

Motivation

Modifications:

srebhan commented Nov 23, 2020

ssoroka left a comment

Choose a reason for hiding this comment

ssoroka commented Nov 23, 2020

oplehto Nov 25, 2020

Choose a reason for hiding this comment

ssoroka Nov 26, 2020

Choose a reason for hiding this comment

oplehto Nov 27, 2020

Choose a reason for hiding this comment

ssoroka Nov 27, 2020

Choose a reason for hiding this comment

srebhan commented Nov 26, 2020 • edited Loading

ssoroka commented Nov 26, 2020 • edited Loading

thatsafunnyname commented Feb 24, 2021

essobedo commented Feb 25, 2021

essobedo commented Nov 21, 2020 •

edited

Loading

srebhan commented Nov 26, 2020 •

edited

Loading

ssoroka commented Nov 26, 2020 •

edited

Loading