Wrong metrics on timed out HTTP requests #925

Closed
na-- opened this issue Feb 13, 2019 · 5 comments · Fixed by #1047
Comments

@na--
Member

na-- commented Feb 13, 2019

When we time out an HTTP request like this:

import http from "k6/http";

export default function () {
    http.get("https://httpbin.org/delay/10", { timeout: 3000 });
}

The metrics k6 emits don't make sense:

WARN[0003] Request Failed                                error="Get https://httpbin.org/delay/10: net/http: request canceled (Client.Timeout exceeded while awaiting headers)"
    done [==========================================================] 1 / 1

    data_received..............: 3.1 kB 1.0 kB/s
    data_sent..................: 455 B  151 B/s
    http_req_blocked...........: avg=501.1ms  min=501.1ms  med=501.1ms  max=501.1ms  p(90)=501.1ms  p(95)=501.1ms 
    http_req_connecting........: avg=128.62ms min=128.62ms med=128.62ms max=128.62ms p(90)=128.62ms p(95)=128.62ms
    http_req_duration..........: avg=100.02µs min=100.02µs med=100.02µs max=100.02µs p(90)=100.02µs p(95)=100.02µs
    http_req_receiving.........: avg=0s       min=0s       med=0s       max=0s       p(90)=0s       p(95)=0s      
    http_req_sending...........: avg=100.02µs min=100.02µs med=100.02µs max=100.02µs p(90)=100.02µs p(95)=100.02µs
    http_req_tls_handshaking...: avg=312.07ms min=312.07ms med=312.07ms max=312.07ms p(90)=312.07ms p(95)=312.07ms
    http_req_waiting...........: avg=0s       min=0s       med=0s       max=0s       p(90)=0s       p(95)=0s      
    http_reqs..................: 1      0.333313/s
    iteration_duration.........: avg=3s       min=3s       med=3s       max=3s       p(90)=3s       p(95)=3s      
    iterations.................: 1      0.333313/s
    vus........................: 1      min=1 max=1
    vus_max....................: 1      min=1 max=1

The http_req_* metrics should add up roughly to 3 seconds, but they're very far from it...
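
As far as I understand our own metrics, for an iteration that makes a single request they should roughly satisfy:

    http_req_duration  ≈ http_req_sending + http_req_waiting + http_req_receiving
    iteration_duration ≈ http_req_blocked + http_req_duration (+ script overhead)

so with a request that's killed by a 3s timeout, blocked plus duration should land somewhere near those 3 seconds, not at ~500ms plus a few hundred microseconds.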

@na-- na-- added this to the v1.0.0 milestone Feb 13, 2019
@mstoykov
Contributor

I would guess that we don't calculate http_req_receiving and http_req_waiting at all. I would say that the waiting should get all the difference?

@na--
Member Author

na-- commented Feb 13, 2019

Yes, seems to me that waiting should soak up the difference, unless the server delay was implemented in such a way that it didn't even start reading the request headers and body. That wasn't the case here apparently, but it could happen. Also, http_req_blocked seems quite high in this case, which is also strange...

@na--
Member Author

na-- commented Feb 14, 2019

The receiving times here also seem suspiciously low: https://serverfault.com/questions/952025/not-understanding-metrics-of-k6/
I'm not sure how to unit-test these things without introducing any more flaky tests... maybe we can spin up an HTTP server and intercept the network traffic to it, injecting some sleep times into either the network connection or the HTTP response, so we can simulate different aspects of a shitty network. Then, if we check that, say, http_req_connecting or http_req_receiving or whatever is more than the sleep we injected, this test should be pretty resilient and shouldn't fail randomly in CI runs...
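
Something along the lines of this rough sketch is what I have in mind (test name, handler, and delay value are all made up, not actual code from our test suite):

package metrics_test

import (
    "net/http"
    "net/http/httptest"
    "testing"
    "time"
)

// Rough sketch only: delay the response body by a known amount, then assert
// that the emitted http_req_receiving sample is at least that big.
func TestHTTPReqReceivingDelay(t *testing.T) {
    const injectedDelay = 300 * time.Millisecond

    srv := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
        w.WriteHeader(http.StatusOK)
        w.(http.Flusher).Flush()  // headers are sent, so "waiting" should be over here
        time.Sleep(injectedDelay) // the body is late => should show up in http_req_receiving
        _, _ = w.Write([]byte("hello"))
    }))
    defer srv.Close()

    // ... run a script that does http.get(srv.URL) and check that the
    // collected http_req_receiving value is >= injectedDelay ...
}

The point is that we'd only assert a lower bound, so scheduling noise in CI can only make the measured value bigger, never smaller.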

@mstoykov
Contributor

This test looks like it will probably need some pretty heavy setup, which might require something like docker-compose to run it more easily. Although my experience with making the network shitty is that VMs and docker have some random problems... Maybe we should first try to make a simple setup that we can run and test stuff with, and then figure out if we can make it into a test?

@na--
Member Author

na-- commented Feb 14, 2019

That's how I started the issue 😜 and I'm all for that approach, both for finding the bugs and for the final verification that we've fixed them. But for the actual fixing, I'd really like it if we had a repeatable automated test that should work according to our expectations but currently fails, at least until we fix the bug. Ideally, we should have at least one such test for each http_req_* metric, just so we're reasonably sure that we won't break something else by fixing the bugs we know about...

And locally, we should be able to simulate most of the delays without any docker containers, just by spinning up a Go HTTP server in the test and either messing with the connection or with the server response. For sure, we won't be able to simulate every possible shitty network, and we're bound to have a ton of issues with HTTP/2, but we should be able to have one simple test per metric...
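
For the connection-level delays, something like a wrapped net.Listener should be enough (again just a sketch with made-up names, assuming we build on httptest):

package metrics_test

import (
    "net"
    "time"
)

// slowConn injects a fixed latency into every Read on the server side of
// the connection, to simulate a slow network without docker or VMs.
type slowConn struct {
    net.Conn
    perReadDelay time.Duration
}

func (c slowConn) Read(b []byte) (int, error) {
    time.Sleep(c.perReadDelay)
    return c.Conn.Read(b)
}

// slowListener wraps every accepted connection in a slowConn.
type slowListener struct {
    net.Listener
    perReadDelay time.Duration
}

func (l slowListener) Accept() (net.Conn, error) {
    conn, err := l.Listener.Accept()
    if err != nil {
        return nil, err
    }
    return slowConn{Conn: conn, perReadDelay: l.perReadDelay}, nil
}

// An httptest server can then be started on top of it, e.g.:
//
//    srv := httptest.NewUnstartedServer(handler)
//    srv.Listener = slowListener{Listener: srv.Listener, perReadDelay: 100 * time.Millisecond}
//    srv.Start()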

na-- added a commit that referenced this issue Jun 10, 2019
This should fix #1041, #1044, #925, and a potential minor data race! Hopefully, without introducing new bugs or performance regressions...
na-- added a commit that referenced this issue Jun 13, 2019
Fix a bunch of HTTP measurement and handling issues

This should fix #1041, #1044, #925, and a potential minor data race! Hopefully, without introducing new bugs or performance regressions...
@na-- na-- removed this from the v1.0.0 milestone Jul 13, 2020