Problems using UDP influxdb output #2862

mcfedr · 2017-05-29T09:34:30Z

Looks like telegraf is splitting a udp message and this means its not getter parsed properly by influxdb.

This has started happening since update to telegraf 1.3

Relevant telegraf.conf:

[[outputs.influxdb]]
  urls = ["udp://influxdb:8089"]
  database = "telegraf"
  retention_policy = ""
  write_consistency = "any"
  timeout = "5s

[[inputs.system]]

System info:

Telegraf 1.3 docker image
Influxdb 1.2 docker image

Log message

influxdb_1    | [I] 2017-05-29T09:25:00Z Failed to parse points: unable to parse 'system,host=proxy-docker uptime_format="5 days 1496049900000000000': unbalanced quotes service=udp
influxdb_1    | [I] 2017-05-29T09:25:00Z Failed to parse points: unable to parse 'system,host=proxy-docker   1:26" 1496049900000000000': invalid field format service=udp

The text was updated successfully, but these errors were encountered:

oplehto · 2017-05-29T12:06:54Z

I am seeing something which may be related with the socket writer plugin with UDP: Unrelated metrics are getting randomly dropped when there are measurements active which require a split into multiple UDP packets. The amount of drops seems to correlate with the amount of splits required.

I haven't had time to look in any deeper but I suspect that this loop breaks prematurely:

telegraf/plugins/outputs/influxdb/client/udp.go

Line 75 in f74687d

func (c *udpClient) WriteStream(r io.Reader, contentLength int) (int, error) {

sebito91 · 2017-05-29T13:38:18Z

The splitter was updated before the v1.3.0 release here: #2795. The splits actually take place in the influxdb Write function (the conn in the udp.go file).

Split function -- https://github.com/influxdata/telegraf/blob/master/metric/metric.go#L220

danielnelson · 2017-05-30T22:01:33Z

I think maybe we shouldn't determine the buffer length until after splitting https://github.com/influxdata/telegraf/blob/master/plugins/outputs/influxdb/influxdb.go#L189

sebito91 · 2017-05-30T23:15:07Z

I see what happened, it was refactored slightly on #2799 since we merged #2795! @danielnelson, didn't realize we actually merged that...I think we built with just #2795 but will confirm in the morning.

danielnelson · 2017-05-30T23:23:59Z

I'm pretty sure this is it, working on PR now.

danielnelson · 2017-06-08T19:37:40Z

The invalid points was fixed in 1.3.1, but there was still an edge case where long point are not split correctly and they would be dropped. This is fixed in 1.3.2.

danielnelson added bug unexpected problem or unintended behavior regression something that used to work, but is now broken labels May 30, 2017

danielnelson added this to the 1.3.1 milestone May 30, 2017

danielnelson self-assigned this May 30, 2017

danielnelson mentioned this issue May 30, 2017

Metric corruption when using http2 nginx reverse proxy with influxdb output #2854

Closed

danielnelson mentioned this issue May 30, 2017

Fix length calculation of split metric buffer #2869

Merged

2 tasks

121watts added the in progress label May 30, 2017

danielnelson closed this as completed in #2869 May 31, 2017

121watts removed the in progress label May 31, 2017

danielnelson reopened this Jun 2, 2017

This was referenced Jun 3, 2017

Fix udp metric splitting #2880

Merged

Possible truncation outputting to udp in influx format #2881

Closed

Fix metric splitting edge cases #2896

Merged

danielnelson closed this as completed Jun 8, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problems using UDP influxdb output #2862

Problems using UDP influxdb output #2862

mcfedr commented May 29, 2017

oplehto commented May 29, 2017 •

edited

Loading

sebito91 commented May 29, 2017 •

edited

Loading

danielnelson commented May 30, 2017

sebito91 commented May 30, 2017

danielnelson commented May 30, 2017

danielnelson commented Jun 8, 2017

Problems using UDP influxdb output #2862

Problems using UDP influxdb output #2862

Comments

mcfedr commented May 29, 2017

Relevant telegraf.conf:

System info:

Log message

oplehto commented May 29, 2017 • edited Loading

sebito91 commented May 29, 2017 • edited Loading

danielnelson commented May 30, 2017

sebito91 commented May 30, 2017

danielnelson commented May 30, 2017

danielnelson commented Jun 8, 2017

oplehto commented May 29, 2017 •

edited

Loading

sebito91 commented May 29, 2017 •

edited

Loading