
[Question] Prometheus Gauge in OpenTelemetry? #1139

Closed
otherview opened this issue Sep 8, 2020 · 4 comments · Fixed by #1165
Labels: area:metrics, enhancement, help wanted, pkg:SDK

Comments

otherview commented Sep 8, 2020

Hi!

Similar to #708, I'm trying to model a Gauge using otel-go. I'm finding it hard to wrap my head around it; can someone shed some light on this?

The results seem to be cumulative either way I do it 🤔

Using prometheus client:

func recordMetrics() {
	go func() {
		for {
			rtt.Set(rand.Float64() * 100)
			time.Sleep(2 * time.Second)
		}
	}()
}

var (
	rtt = promauto.NewGauge(prometheus.GaugeOpts{
		Name: "RTT_things",
	})
)

func main() {
	recordMetrics()
	http.Handle("/metrics", promhttp.Handler())
	http.ListenAndServe(":2222", nil)
}

Using otel-go:

func initMeter() {

	exporter, err := prometheus.InstallNewPipeline(prometheus.Config{
		DefaultHistogramBoundaries: []float64{1, 100, 250, 500},
	})
	if err != nil {
		log.Panicf("failed to initialize prometheus exporter %v", err)
	}
	http.HandleFunc("/metrics", exporter.ServeHTTP)
	go func() {
		_ = http.ListenAndServe(":2222", nil)
	}()

	fmt.Println("Prometheus server running on :2222")
}

func main() {
	initMeter()

	meter := global.Meter("basicTest")
	ctx := context.Background()

	// RTT
	rttValue := metric.Must(meter).NewInt64UpDownCounter("rtt_meter")

	for {
		randVal := rand.Int63n(100)
		fmt.Println("rtt-> ", randVal)

		// either this
		// rttValue.Add(ctx, randVal)

		// or this
		meter.RecordBatch(ctx,
			[]label.KeyValue{},
			rttValue.Measurement(randVal),
		)

		time.Sleep(10 * time.Second)
	}
}

Rendering:

Prometheus server running on :2222
rtt->  10
rtt->  21
rtt->  37
----------
# HELP rtt_meter
# TYPE rtt_meter counter
rtt_meter 10
----------
# HELP rtt_meter
# TYPE rtt_meter counter
rtt_meter 31
----------
# HELP rtt_meter
# TYPE rtt_meter counter
rtt_meter 68

Thanks! 😃

jmacd (Contributor) commented Sep 10, 2020

Ah, now that I see your example, the ValueObserver instrument (asynchronous) is well suited to your needs. Instead of defining a loop that periodically Sets a gauge, you define a ValueObserver callback that is called once per collection interval and outputs a classic Gauge in both the OTLP and Prometheus exporters.

You'll have fewer goroutines and no arbitrary sleep statements that way. See open-telemetry/opentelemetry-specification#834
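
For reference, here is a minimal sketch of that pattern, using the same metric API as the snippets in this thread; measureRTT() is a hypothetical helper that returns the current round-trip time:

meter := global.Meter("basicTest")

// The callback is invoked once per collection interval, so no loop,
// extra goroutine, or Sleep is needed.
_ = metric.Must(meter).NewInt64ValueObserver(
	"rtt_meter",
	func(_ context.Context, result metric.Int64ObserverResult) {
		result.Observe(measureRTT()) // report the latest value each time collection runs
	},
)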

otherview (Author) commented Sep 10, 2020

Thanks for the reply @jmacd! ☺️

I'm still not able to achieve it, though...
This is returning a cumulative histogram, whereas I just want it to spit out the value that is "set".

I feel like my approach to this is wrong 🤔. Can you take a look?

Given:

func main() {
	initMeter()

	meter := global.Meter("basicTest")
	ctx := context.Background()

	observerLock := new(sync.RWMutex)
	observerValueToReport := int64(0)

	// Latency
	cb := func(_ context.Context, result metric.Int64ObserverResult) {
		observerLock.RLock()
		value := observerValueToReport
		observerLock.RUnlock()
		result.Observe(value)
	}

	_ = metric.Must(meter).NewInt64ValueObserver("latency_meter", cb)

	for {
		randVal := rand.Int63n(100)
		fmt.Println("rtt-> ", randVal)

		observerLock.Lock() // write lock, not RLock: this updates the value the callback reads
		observerValueToReport = randVal
		observerLock.Unlock()

		time.Sleep(10 * time.Second)
	}
}

This renders:

Prometheus server running on :2222
rtt->  10
rtt->  51
rtt->  21

--------
# HELP latency_meter
# TYPE latency_meter histogram
latency_meter_bucket{le="1"} 0
latency_meter_bucket{le="100"} 1
latency_meter_bucket{le="250"} 1
latency_meter_bucket{le="500"} 1
latency_meter_bucket{le="+Inf"} 1
latency_meter_sum 10
latency_meter_count 1

--------
# HELP latency_meter
# TYPE latency_meter histogram
latency_meter_bucket{le="1"} 0
latency_meter_bucket{le="100"} 3
latency_meter_bucket{le="250"} 3
latency_meter_bucket{le="500"} 3
latency_meter_bucket{le="+Inf"} 3
latency_meter_sum 71       <- 10 + 10 + 51 (the first observed value, 10, was collected twice; the second, 51, once)
latency_meter_count 3

nilebox (Member) commented Sep 10, 2020

This is probably because the Prometheus exporter is using the simple selector:

simple.NewWithHistogramDistribution(config.DefaultHistogramBoundaries),

which uses a histogram distribution for ValueObserver:

case metric.ValueObserverKind, metric.ValueRecorderKind:
aggs := histogram.New(len(aggPtrs), descriptor, s.boundaries)

The workaround for now is to implement a custom selector that uses "LastValue" aggregation for ValueObserver; this is what we did in Cloud Monitoring.
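
For illustration, a rough sketch of such a selector; the import paths and method names below are assumptions based on the SDK of that era, and it simply delegates to the histogram selector quoted above while overriding ValueObserver with a last-value aggregator:

import (
	"go.opentelemetry.io/otel/api/metric"
	export "go.opentelemetry.io/otel/sdk/export/metric"
	"go.opentelemetry.io/otel/sdk/metric/aggregator/lastvalue"
	"go.opentelemetry.io/otel/sdk/metric/selector/simple"
)

// gaugeObserverSelector wraps the default histogram-based selector and
// switches ValueObserver instruments to LastValue aggregation.
type gaugeObserverSelector struct {
	export.AggregatorSelector
}

func NewGaugeObserverSelector(boundaries []float64) export.AggregatorSelector {
	return gaugeObserverSelector{simple.NewWithHistogramDistribution(boundaries)}
}

func (s gaugeObserverSelector) AggregatorFor(descriptor *metric.Descriptor, aggPtrs ...*export.Aggregator) {
	if descriptor.MetricKind() == metric.ValueObserverKind {
		// LastValue keeps only the most recent observation, which can be
		// exported as a gauge instead of a histogram.
		aggs := lastvalue.New(len(aggPtrs))
		for i := range aggPtrs {
			*aggPtrs[i] = &aggs[i]
		}
		return
	}
	// Everything else keeps the default behaviour.
	s.AggregatorSelector.AggregatorFor(descriptor, aggPtrs...)
}

Note that the Prometheus exporter itself still needs a way to accept a custom selector, which is what the discussion below is about.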

MrAlias (Contributor) commented Sep 10, 2020

Talking about this in the SIG meeting today, the proposal put forth was to include in the main project a selector similar to the one @nilebox showed from the Google Cloud Monitoring exporter, and additionally to let the prometheus and cortex exporters be configured with a user-defined selector. That way, even though the default will still be the histogram selector, users will be able to solve this problem in a more standard way.

It was also brought up that this is likely something the Views API is well suited to address. The long-term solution would be to handle this configuration there.
