-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance degradation (small blocks) after updating from 1.8 to 1.9 #24163
Comments
CC @taozhu-chicago |
Thank you @antonya86 for sharing the finding. Interesting that increased |
@taozhu-chicago when we restored banking thread to default value
After updating to v1.9 our validator started to produce blocks like that (a bunch of 4 slots):
Next, I lowered
Finally, I set |
Interesting that your examples happen to show that the first block often has the lowest amount of transactions, as well as the last one. I think I've noticed a similar pattern when scanning blocks... any theories Tao? It seems like leaders are being sent transactions quite ahead of their leader slots these days I think (someone from Mango said they forward 8 slots ahead I think), which makes it odd that their first slot would have so few txns? Might also interest @ckamm Also, maybe if yall have the time @antonya86 it could be interesting to look at mean txns/block across the four leader slots with stats to see if it's a legitimate phenomenon 😅 |
found the issue last night after some discussion with @sakridge and @carllin if cost model is exceeded, banking stage leaves those packets in the buffer. this causes the the same situation could also happen with lots of txs locking the same account returning AccountInUse error, but more likely to resolve itself after many iterations of that loop. |
Problem
After upgrading our mainnet-beta validator from v1.8 to v1.9 we found that our validator started to produce blocks with a really low amount of transactions against average. We investigated the situation, fixed the issue and revealed a significant bunch of validators whose amount of transactions per leader block dropped by 25% after upgrading to v1.9. We posted our investigation on solana forum here
Although the situation is not a disaster, it would be great if someone from the Solana foundation responsible for cluster/validator performance looked at the data we gathered. Thanks in advance.
The text was updated successfully, but these errors were encountered: