[Node Operator Question] Uneven archive node performance under load #45
Replies: 4 comments 15 replies
-
They're archive nodes and have nothing to do with optimism actually, it is a op-geth issue. You may try out some alternative EL nodes but notice that they are unstable, unproven to run for mission critical tasks. https://github.com/testinprod-io/op-erigon |
Beta Was this translation helpful? Give feedback.
-
Hey @jakobilobi, can you provide the following:
|
Beta Was this translation helpful? Give feedback.
-
btw, how do you monitor these status data? Have you started a dedicated monitoring program (this is another topic and is essentially irrelevant to the current problem) |
Beta Was this translation helpful? Give feedback.
-
Found that there was a more probable external reason for the issues we saw with the node, thanks for your help though! |
Beta Was this translation helpful? Give feedback.
-
Issue Description
Hello everyone,
I was referred here from the Optimism Discord after asking this question, so I'll pose it here as well:
We're running two Optimism archive nodes (legacy node + Bedrock/OP node) and have run into an elusive issue. One of our nodes is faring much worse under load than the other, becoming slower to respond and even falling behind in block height. The underlying hardware/infra is exactly the same for both nodes, and it is well above and minimum requirements listed in the docs. Both nodes are also running the same versions,
op-geth v1.101304.0
andop-node v1.3.0
, at the moment, though the issue started to appear before they were upgraded to the latest version.I've tried experimenting with the
op-geth
settings--snapshot=... --cache=... --maxpeers=...
but they don't seem to have any effect. Particularly the--snapshot
setting I thought would have at least some kind of impact, from what I understand that it does (I'm just a node runner though, actual blockchain client knowledge isn't very deep). On the affected node, the--snapshot
setting wastrue
, with state snapshot generated, when the issue appeared while it on the unaffected node wasfalse
.Does this ring a bell for anyone? Any advice on what other configurations I might try out?
Cheers,
Jakob
Node Logs
No response
Beta Was this translation helpful? Give feedback.
All reactions