Improve Prometheus integration documentation #9792

tbragin · 2018-12-25T18:12:29Z

Current docs on the Metricbeat Prometheus module are reference-style and quite minimal. The following blog post expands on how to use this module and how it compares with other ways of integrating with Prometheus:
https://discuss.elastic.co/t/dec-23rd-2018-en-observability-querying-metrics-from-prometheus/161609

We should consider improving the docs, or linking to this blog.

cc: @roncohen @ruflin

hodgesds · 2019-01-25T00:53:37Z

Can you also mention that exporting Beat metrics to Prometheus is not supported?

exekias · 2019-01-29T20:47:38Z

Docs have improved after #9948: https://www.elastic.co/guide/en/beats/metricbeat/master/metricbeat-metricset-prometheus-collector.html.

@tbragin do you think this is enough, or should we keep this issue open?

tbragin · 2019-02-01T01:46:18Z

We are in the process of converting the discuss blog to a blog on elastic.co. How do folks feel about linking the reference-style Prometheus documentation to that blog? Also, do we feel we should evolve our module documentation in general to include richer information, beyond simply how to configure a module? Perhaps I'll add a discuss label to this.

tbragin · 2019-02-01T01:46:32Z

cc: @makwarth @alvarolobato @DanRoscigno

ruflin · 2019-02-04T17:56:59Z

+1 on having links to prometheus docs. I think it's important to link to the reference docs.

++ on having richer module docs. All the information a user needs to learn more about the module, how it works, etc. should be there.

makwarth · 2019-02-07T13:12:31Z

++ absolutely. We probably need an owner per module who takes care of the Kibana and Elastic.co documentation.

DanRoscigno · 2019-02-25T14:40:53Z

I have to test this, but I think the first YAML excerpt on the docs would be better like so (I was confused by the port 9090 on localhost because the example is supposed to be about grabbing from an exporter, not from the Prometheus server):

- module: prometheus
  period: 10s
  hosts:
    - nodehost:9100  # the existing port 9090 in the docs is confusing, as that is the Prometheus server port; 9100 is the default Node exporter port
    - haproxyhost:9101
    - mysqlhost:9104
  metrics_path: /metrics

DanRoscigno · 2019-02-25T14:43:02Z

@makwarth please bring me in when the discussion of what should be in module docs starts up. I approach this from the sysadmin point of view, since I spent years in ops/services and in training people who sell to ops.

DanRoscigno · 2019-03-19T19:20:28Z

I opened this issue around scraping from exporters.

DanRoscigno · 2019-04-19T19:42:29Z

@tbragin @makwarth is the agenda set for EAH at Orlando? If there is time, can we have a slot to talk through what info Ops needs about a module? I have some strong opinions. Would be a good session for DeDe, Karen, dev, me, infra, SREs. If there is not time there, then we should do it after.

tbragin · 2019-04-28T19:25:26Z

@DanRoscigno You can suggest your topic here: https://docs.google.com/document/d/1-zyHzxk-s8MgqrjmV1OXI4TwP3vJmUkZSkVOHphNn4A/edit#

cc @alvarolobato Not sure if this would fall under plenary vs breakout.

tbragin · 2019-06-26T02:16:03Z

@sorantis I'd love your thoughts on this issue. We've had it open for a while, and some aspects have been addressed. Perhaps, as you set up our Prometheus module yourself and walk through the experience, you could suggest where these reference docs can be shored up?

One improvement I can think of is documenting the expected document schema a bit better. I think it's one doc per poll interval per unique label combination (@exekias can confirm).

We should improve what we can and close soon.
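
The schema guess above (one document per poll interval per unique label combination) can be illustrated with a minimal Python sketch. This is not the module's actual implementation, and the behavior is still unconfirmed in this thread. The function name `group_samples`, the sample values, and the timestamp are made up for illustration, and the `prometheus.labels` / `prometheus.metrics` layout is an assumption based on the field names mentioned in this issue.

```python
from collections import defaultdict


def group_samples(samples, timestamp):
    """Group (name, labels, value) samples from one scrape into documents.

    Hypothetical helper: samples that share exactly the same label set
    end up in the same document, so one scrape yields one document per
    unique label combination.
    """
    grouped = defaultdict(dict)
    for name, labels, value in samples:
        # A frozenset of label pairs is hashable and order-independent.
        key = frozenset(labels.items())
        grouped[key][name] = value
    return [
        {
            "@timestamp": timestamp,
            "prometheus": {"labels": dict(sorted(key)), "metrics": metrics},
        }
        for key, metrics in grouped.items()
    ]


# Made-up samples, as they might be parsed from a node exporter's /metrics page.
samples = [
    ("node_cpu_seconds_total", {"cpu": "1", "mode": "idle"}, 1234.5),
    ("node_cpu_seconds_total", {"cpu": "1", "mode": "user"}, 56.7),
    ("node_load1", {}, 0.42),
    ("node_load5", {}, 0.37),
]
docs = group_samples(samples, "2019-06-26T02:16:03Z")
```

Under this assumption the four samples become three documents: one per `cpu`/`mode` combination, plus one holding both unlabeled load metrics.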

sorantis · 2019-07-01T07:56:27Z

@tbragin sure, I can share my experience with the module so far.
One of the things I noticed when setting up Prometheus with two node_exporters scraped by Metricbeat's Prometheus module was that there was no straightforward way to filter metrics by node. When multiple exporters are used, the Prometheus server adds additional labels to each metric. E.g. for the two running node_exporters, Prometheus added instance and job labels. Because these labels are added by the Prometheus server itself, they're not exposed by the service, and Metricbeat can't scrape them (thanks @exekias for explaining this). So, as a devops engineer used to Prometheus' labels, I find it hard to separate metrics per node_exporter in Metrics Explorer because the labels I rely on are missing. Instead, as @exekias mentioned, the service.address filter should be used.
I think our documentation can mention this, or similar cases, where the module's behavior diverges from the expected one.
Another thing that IMHO is not clear: in Prometheus' data model (<metric name>{<label name>=<label value>, ...}), each metric has a set of labels associated with it. In our Prometheus module, labels and metrics are two separate fields, which have to be intersected in order to replicate PromQL behavior. E.g. the result of the PromQL expression node_cpu_seconds_total{cpu="1", mode="idle"} can be achieved in Metrics Explorer by selecting the prometheus.metrics.node_cpu_seconds_total base metric and applying two additional filters on the labels cpu and mode.

Makes sense, but as someone who's just getting started with the module's capabilities, and has prior experience with Prometheus, I would've loved documentation for such implementation differences.
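
As a rough sketch of the difference described above, the PromQL selector could be translated into an Elasticsearch bool filter along these lines. The helper name `promql_selector_to_filter` is hypothetical, the `prometheus.labels.<name>` field layout is an assumption (only `prometheus.metrics.<name>` and `service.address` are named in this thread), and the `service.address` value is a placeholder.

```python
def promql_selector_to_filter(metric, labels, service_address=None):
    """Build an Elasticsearch bool filter that mimics a PromQL selector.

    Hypothetical helper: the metric must exist as a field under
    prometheus.metrics, and every label matcher becomes a term filter
    on the (assumed) prometheus.labels.<name> field.
    """
    filters = [{"exists": {"field": f"prometheus.metrics.{metric}"}}]
    for name, value in sorted(labels.items()):
        filters.append({"term": {f"prometheus.labels.{name}": value}})
    if service_address is not None:
        # Stand-in for the server-side `instance` label, which the
        # exporter itself does not expose.
        filters.append({"term": {"service.address": service_address}})
    return {"bool": {"filter": filters}}


# Rough equivalent of node_cpu_seconds_total{cpu="1", mode="idle"},
# restricted to one exporter. The address is a placeholder value.
query = promql_selector_to_filter(
    "node_cpu_seconds_total",
    {"cpu": "1", "mode": "idle"},
    service_address="nodehost:9100",
)
```

The same intersection is what the Metrics Explorer filters express: pick the base metric, then narrow by label fields, and use `service.address` where PromQL users would reach for `instance`.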

roncohen · 2019-07-01T11:14:30Z

@exekias could we add a labels.instance or similar in the Prometheus collector to make this a slightly smoother experience?

exekias · 2019-07-01T15:44:06Z

SGTM, I opened a new issue for that (#12739), let's move the discussion there and keep this one for docs.

botelastic · 2022-01-27T17:47:22Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

tbragin added the docs and Team:Integrations labels Dec 25, 2018

exekias assigned exekias and unassigned exekias Jan 7, 2019

tbragin added the discuss label Feb 1, 2019

alvarolobato removed the discuss label Feb 18, 2019

andresrc added the [zube]: Backlog label Jul 22, 2019

botelastic bot added the Stalled label Jan 27, 2022

botelastic bot closed this as completed Jul 26, 2022

zube bot added [zube]: Done and removed [zube]: Backlog labels Jul 26, 2022

zube bot removed the [zube]: Done label Oct 25, 2022
