Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

derivative broken after upgrade to 0.10.0-0.rc1 #5474

Closed
ljagiello opened this issue Jan 28, 2016 · 19 comments
Closed

derivative broken after upgrade to 0.10.0-0.rc1 #5474

ljagiello opened this issue Jan 28, 2016 · 19 comments

Comments

@ljagiello
Copy link

Hi,

It looks like derivative is broken in the latest InfluxDB version (0.10.0-0.rc1)

with Influxdb 0.10.0-0.beta2:

SELECT derivative(mean(reads), 1s) FROM telegraf."default".diskio WHERE time > now() - 12h GROUP BY time(1m), host limit 5
name: diskio
tags: host=rado01
time            derivative
----            ----------
1453977720000000000    2.3527777777751906
1453977780000000000    2.7111111111162853
1453977840000000000    2.486111111108524
1453977900000000000    2.038888888891476

with 0.10.0-0.rc1:

SELECT derivative(mean(reads), 1s) FROM telegraf."default".diskio WHERE time > now() - 12h GROUP BY time(1m), host limit 5
name: diskio
tags: host=rado01
time            derivative
----            ----------
1453978200000000000
1453978260000000000
1453978320000000000
1453978380000000000
@jwilder
Copy link
Contributor

jwilder commented Jan 28, 2016

What does your query return without the derivative?

SELECT mean(reads) FROM telegraf."default".diskio WHERE time > now() - 12h GROUP BY time(1m), host limit 5

@ljagiello
Copy link
Author

with 0.10.0-0.beta2:

InfluxDB shell 0.10.0
> SELECT mean(reads) FROM telegraf."default".diskio WHERE time > now() - 12h GROUP BY time(1m), host limit 5
name: diskio
tags: host=dev-a
time            mean
----            ----
1453978560000000000 104647
1453978620000000000 104647
1453978680000000000 104647
1453978740000000000 104647
1453978800000000000 104647


name: diskio
tags: host=dev-b
time            mean
----            ----
1453978560000000000 278992
1453978620000000000 278992
1453978680000000000 278992
1453978740000000000 278992
1453978800000000000 278992


name: diskio
tags: host=dev-c
time            mean
----            ----
1453978560000000000 480867.2
1453978620000000000 480867.2
1453978680000000000 480867.2
1453978740000000000 480867.2
1453978800000000000 480868.6666666667


name: diskio
tags: host=dev-d
time            mean
----            ----
1453978560000000000 469380.4
1453978620000000000 469380.4
1453978680000000000 469380.4
1453978740000000000 469380.4
1453978800000000000 469382.2


name: diskio
tags: host=dev-e
time            mean
----            ----
1453978560000000000 1.75580955e+07
1453978620000000000 1.75580955e+07
1453978680000000000 1.7558102444444444e+07
1453978740000000000 1.7558103833333332e+07
1453978800000000000 1.7558103833333332e+07


name: diskio
tags: host=dev-f
time            mean
----            ----
1453978560000000000 1.72238304e+07
1453978620000000000 1.72238306e+07
1453978680000000000 1.72238363e+07
1453978740000000000 1.7223838e+07
1453978800000000000 1.7223838266666666e+07


name: diskio
tags: host=dev-g
time            mean
----            ----
1453978560000000000 68279.42857142857
1453978620000000000 68279.42857142857
1453978680000000000 68279.42857142857
1453978740000000000 68279.42857142857
1453978800000000000 68279.42857142857

with 0.10.0-0.rc1:

InfluxDB shell 0.10.0rc1
> SELECT mean(reads) FROM telegraf."default".diskio WHERE time > now() - 12h GROUP BY time(1m), host limit 5
name: diskio
tags: host=dev-a
time            mean
----            ----
1453978680000000000 104647
1453978740000000000 104647
1453978800000000000 104647
1453978860000000000 104647
1453978920000000000 104647


name: diskio
tags: host=dev-b
time            mean
----            ----
1453978680000000000 278992
1453978740000000000 278992
1453978800000000000 278992
1453978860000000000 278992
1453978920000000000 278992


name: diskio
tags: host=dev-c
time            mean
----            ----
1453978680000000000 480867.2
1453978740000000000 480867.2
1453978800000000000 480868.6666666667
1453978860000000000 480871.2
1453978920000000000 480871.2


name: diskio
tags: host=dev-d
time            mean
----            ----
1453978680000000000 469380.4
1453978740000000000 469380.4
1453978800000000000 469382.2
1453978860000000000 469385.2
1453978920000000000 469385.2


name: diskio
tags: host=dev-e
time            mean
----            ----
1453978680000000000 1.7558103833333332e+07
1453978740000000000 1.7558103833333332e+07
1453978800000000000 1.7558103833333332e+07
1453978860000000000 1.7558104444444444e+07
1453978920000000000 1.7558104833333332e+07


name: diskio
tags: host=dev-f
time            mean
----            ----
1453978680000000000 1.72238375e+07
1453978740000000000 1.7223838e+07
1453978800000000000 1.7223838266666666e+07
1453978860000000000 1.7223838933333334e+07
1453978920000000000 1.7223843266666666e+07


name: diskio
tags: host=dev-g
time            mean
----            ----
1453978680000000000 68279.42857142857
1453978740000000000 68279.42857142857
1453978800000000000 68279.42857142857
1453978860000000000 68279.42857142857
1453978920000000000 68279.42857142857

@jwilder
Copy link
Contributor

jwilder commented Jan 28, 2016

@benbjohnson Could #5463 have caused this?

@benbjohnson
Copy link
Contributor

@ljagiello I've been trying to reproduce this issue with a few other people here and we can't for the life of us cause it to happen. Have you made changes to the configuration file? What kind of retention policy are you using?

@ljagiello
Copy link
Author

@benbjohnson

> SHOW RETENTION POLICIES ON "telegraf"
name    duration    replicaN    default
default 504h0m0s    1       true

Configuration is fairly standard, nothing fancy - https://gist.github.com/ljagiello/32778372df8daecc176b

I test that issue with 2 different servers and it's the same behaviour.

@benbjohnson
Copy link
Contributor

@ljagiello Hmm, do you have any sample raw data you can provide or a test case that reproduces it?

@max3163
Copy link

max3163 commented Jan 29, 2016

I've the same issue since I upraded to rc1 :
SELECT derivative(mean("value"), 10s) FROM "interface_rx" WHERE "type" = 'if_octets' AND "host" = 'toto' AND time > now() - 24h GROUP BY time(10s) fill(null)

Time derivative
2016-01-28T21:36:40Z
2016-01-28T21:36:50Z
2016-01-28T21:37:00Z

Before all was OK

@jwilder
Copy link
Contributor

jwilder commented Jan 29, 2016

@max3163 What version did you upgrade from?

@ljagiello
Copy link
Author

0.10.0-0.beta2

> SELECT mean(reads) FROM telegraf."default".diskio WHERE time > now() - 1h AND host = 'dev-s1' GROUP BY time(10s),host limit 10
name: diskio
tags: host=dev-s1
time            mean
----            ----
1454100740000000000
1454100750000000000 5.68124583e+08
1454100760000000000 5.68124583e+08
1454100770000000000 5.68124583e+08
1454100780000000000 5.681245842857143e+08
1454100790000000000 5.681245842857143e+08
1454100800000000000 5.681245842857143e+08
1454100810000000000 5.681245842857143e+08
1454100820000000000 5.681245842857143e+08
1454100830000000000 5.681245842857143e+08
> SELECT derivative(mean(reads)) FROM telegraf."default".diskio WHERE time > now() - 1h AND host = 'dev-s1' GROUP BY time(10s),host limit 10
name: diskio
tags: host=dev-s1
time            derivative
----            ----------
1454100750000000000
1454100760000000000 0
1454100770000000000 0
1454100780000000000 1.2857142686843872
1454100790000000000 0
1454100800000000000 0
1454100810000000000 0
1454100820000000000 0
1454100830000000000 0
> SELECT reads FROM telegraf."default".diskio WHERE time > now() - 1h AND host = 'dev-s1' limit 100
name: diskio
------------
time            reads
1454100790000000000 1325098574
1454100790000000000 787310
1454100790000000000 1324946344
1454100790000000000 1589
1454100790000000000 1325889358
1454100790000000000 147173
1454100790000000000 1742
1454100800000000000 1325098574
1454100800000000000 1325889358
1454100800000000000 1742
1454100800000000000 1324946344
1454100800000000000 147173
1454100800000000000 1589
1454100800000000000 787310
1454100810000000000 1325098574
1454100810000000000 787310
1454100810000000000 1742
1454100810000000000 1325889358
1454100810000000000 1589
[…]

On 0.10.0-0.rc1:

> SELECT mean(reads) FROM telegraf."default".diskio WHERE time > now() - 1h AND host = 'dev-s1' GROUP BY time(10s),host limit 10
name: diskio
tags: host=dev-s1
time            mean
----            ----
1454100610000000000
1454100620000000000 5.68124559e+08
1454100630000000000 5.68124559e+08
1454100640000000000 5.68124559e+08
1454100650000000000 5.68124559e+08
1454100660000000000 5.68124559e+08
1454100670000000000 5.68124559e+08
1454100680000000000 5.68124559e+08
1454100690000000000 5.681245641428572e+08
1454100700000000000 5.681245658571428e+08

> SELECT derivative(mean(reads)) FROM telegraf."default".diskio WHERE time > now() - 1h AND host = 'dev-s1' GROUP BY time(10s),host limit 10
name: diskio
tags: host=dev-s1
time            derivative
----            ----------
1454100630000000000
1454100640000000000
1454100650000000000
1454100660000000000
1454100670000000000
1454100680000000000
1454100690000000000
1454100700000000000
1454100710000000000
> SELECT reads FROM telegraf."default".diskio WHERE time > now() - 1h AND host = 'dev-s1' limit 1000
name: diskio
------------
time            reads
1454100590000000000 1589
1454100590000000000 1.325098513e+09
1454100590000000000 787310
1454100590000000000 1.324946283e+09
1454100590000000000 1742
1454100590000000000 147173
1454100590000000000 1.325889297e+09
1454100600000000000 1.325098514e+09
1454100600000000000 1742
1454100600000000000 1.325889298e+09
1454100600000000000 147173
1454100600000000000 1589
1454100600000000000 787310
1454100600000000000 1.324946284e+09
1454100610000000000 787310
1454100610000000000 1.324946285e+09
1454100610000000000 1742
1454100610000000000 1.325098515e+09
1454100610000000000 1589
1454100610000000000 1.325889299e+09
1454100610000000000 147173
1454100620000000000 1.325889299e+09
1454100620000000000 147173
1454100620000000000 1.324946285e+09
1454100620000000000 787310
1454100620000000000 1589
1454100620000000000 1742
[…]

I'm using in both cases the same database just switching influxdb binary.

@max3163
Copy link

max3163 commented Jan 29, 2016

@jwilder From 0.10.beta2

@jwilder
Copy link
Contributor

jwilder commented Jan 29, 2016

@ljagiello @max3163 Can you show the output of show servers and show diagnostics?

@ljagiello
Copy link
Author

@max3163
Copy link

max3163 commented Jan 30, 2016

@jwilder

> show servers
name: data_nodes
----------------
id      http_addr       tcp_addr
1       localhost:8086  localhost:8088


name: meta_nodes
----------------
id      http_addr       tcp_addr
1       localhost:8091  localhost:8088

> show diagnostics
name: build
-----------
Branch  Build Time                      Commit                                          Version
0.10.0  2016-01-28T19:36:51.857673      834744a4c30e7f06a7c5fa8d0376de0d1930a318        0.10.0rc1


name: hh
--------
node    active  last modified   head    tail


name: network
-------------
hostname
toto


name: runtime
-------------
GOARCH  GOMAXPROCS      GOOS    version
amd64   2               linux   go1.4.3


name: system
------------
PID     currentTime                     started                         uptime
26454   2016-01-30T09:40:13.132278806Z  2016-01-29T17:07:56.714125425Z  16h32m16.418153612s

@jwilder
Copy link
Contributor

jwilder commented Jan 31, 2016

@max3163 @ljagiello Thanks. Can you also show me the contents of your meta/node.json file?

@max3163
Copy link

max3163 commented Feb 1, 2016

@jwilder

My node.json contain only this line :

[/var/lib/influxdb] $ cat node.json
{"ID":2,"MetaServers":["localhost:8091"]}

@ljagiello
Copy link
Author

@jwilder

 % cat node.json
{"ID":2,"MetaServers":["localhost:8091"]}

@jwilder
Copy link
Contributor

jwilder commented Feb 1, 2016

I believe this is an upgrade issue with beta2. If you both manually edit your node.json file and change the ID to 1, that might resolve it.

@max3163
Copy link

max3163 commented Feb 1, 2016

@jwilder : I just made the change and now all is ok back !

Thanks for your help !

@jwilder jwilder closed this as completed Feb 1, 2016
@ljagiello
Copy link
Author

@jwilder It's working perfectly fine after that change. Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants