Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve connection distribution in clusters to reduce latency and reactor utilization #12912

Closed
ballard26 opened this issue Aug 21, 2023 · 4 comments · Fixed by #12906
Closed
Assignees
Labels
core kind/bug Something isn't working

Comments

@ballard26
Copy link
Contributor

During testing with OMB on a 3x i3en.6xlarge cluster it was found that traffic from internal RPC clients can cause reactor utilization on some shards to be 40% higher than others. This largely caused by poor distribution of client connections to other brokers amongst shards.

Looking at RPC client in + out bytes in the graph below we can see that some shards are processing 4x the amount of throughput as others which can explain a lot of the extra reactor utilization on some shards.
image

In the chart below the allowed connections was set to equal the number of shards on a given broker. This allows each shard to have its own connections to all of the brokers in the cluster. And we can see in the chart that the standard deviation for reactor utilization is reduced by 50%.
image

@mattschumpert
Copy link

@StephanDollberg can we update the title of this issue to represent the improvement the PR(s) will make please?

@piyushredpanda
Copy link
Contributor

That's on @ballard26 to do. This will help with reducing variance in latencies.

@ballard26 ballard26 changed the title Poor connection distribution in clusters with more than 8 shards a node Poor connection distribution in cluster Aug 30, 2023
@ballard26 ballard26 changed the title Poor connection distribution in cluster Improve connection distribution in clusters to reduce latency and reactor utilization Aug 30, 2023
@ballard26
Copy link
Contributor Author

I've updated the title let me know if it's adequate.

@mattschumpert
Copy link

💥

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
core kind/bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants