Skip to content
This repository has been archived by the owner on Aug 7, 2023. It is now read-only.

connect_and_idle scenario drops bulk connections (intermittent) #88

Closed
rpappalax opened this issue Jan 31, 2017 · 3 comments
Closed

connect_and_idle scenario drops bulk connections (intermittent) #88

rpappalax opened this issue Jan 31, 2017 · 3 comments

Comments

@rpappalax
Copy link
Contributor

rpappalax commented Jan 31, 2017

On long test runs (>30 mins), ap-loadtester will drop a large number of connections (not usually all). This usually happens at least once on long-running tests, but can happen multiple times.

I've never been able to get a connection test to exceed 11M connections.
https://github.com/mozilla-services/ap-loadtester/blob/master/aplt/scenarios.py#L44

connections-dropped

@rpappalax rpappalax added this to the PUSHSVC-0: quality milestone Jan 31, 2017
@rpappalax rpappalax changed the title Determine why connect_and_idle scenario drops bulk connections (intermittent) connect_and_idle scenario drops bulk connections (intermittent) Jan 31, 2017
@rpappalax
Copy link
Contributor Author

NOTE: this test completed successfully after 3 hours reaching the 100K conns / node threshold on each.
so, how to make the ap-loadtester more robust to handle network hiccups?

@pjenvey
Copy link
Member

pjenvey commented Feb 10, 2017

The dip here turned out to be due to a redeploy of stage we weren't aware of, so ap-loadtester actually handled it pretty well except for lack of feedback (#73).

I'll note one more thing -- we discussed that the connect_and_idle testplan took a while to create so many connections. Per datadog, this run did about 25 conns per second. We should able to greatly improve that with a higher aplt_testplan stagger value (how many scenarios to launch per second).

@rpappalax
Copy link
Contributor Author

@pjenvey ok cool. that's very useful info.
The README indicates that based on this, the 2nd param is stagger value (in this example, set to 1).
In my tests, I've played around with these values, but started setting them low to speed up the tests. I had the value set to 2, but I'll try setting it to 10 instead and try ratcheting it up progressively if needed.

Run 5 instances of the basic scenario, starting one every second, against the dev server:
$ aplt_testplan wss://autopush.dev.mozaws.net/ "aplt.scenarios:basic,5,1,0"

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

2 participants