You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
During testing with OMB on a 3x i3en.6xlarge cluster it was found that traffic from internal RPC clients can cause reactor utilization on some shards to be 40% higher than others. This largely caused by poor distribution of client connections to other brokers amongst shards.
Looking at RPC client in + out bytes in the graph below we can see that some shards are processing 4x the amount of throughput as others which can explain a lot of the extra reactor utilization on some shards.
In the chart below the allowed connections was set to equal the number of shards on a given broker. This allows each shard to have its own connections to all of the brokers in the cluster. And we can see in the chart that the standard deviation for reactor utilization is reduced by 50%.
The text was updated successfully, but these errors were encountered:
That's on @ballard26 to do. This will help with reducing variance in latencies.
ballard26
changed the title
Poor connection distribution in clusters with more than 8 shards a node
Poor connection distribution in cluster
Aug 30, 2023
ballard26
changed the title
Poor connection distribution in cluster
Improve connection distribution in clusters to reduce latency and reactor utilization
Aug 30, 2023
During testing with OMB on a 3x
i3en.6xlarge
cluster it was found that traffic from internal RPC clients can cause reactor utilization on some shards to be 40% higher than others. This largely caused by poor distribution of client connections to other brokers amongst shards.Looking at RPC client in + out bytes in the graph below we can see that some shards are processing 4x the amount of throughput as others which can explain a lot of the extra reactor utilization on some shards.
In the chart below the allowed connections was set to equal the number of shards on a given broker. This allows each shard to have its own connections to all of the brokers in the cluster. And we can see in the chart that the standard deviation for reactor utilization is reduced by 50%.
The text was updated successfully, but these errors were encountered: