-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
checker memory leak #921
Comments
@ihard Привет, подскажи, пожалуйста, график с очередью - это в момент проблемы? сможешь еще дополнить, пожалуйста, графиками?
|
@kissken added to first post |
In pprof heap - problem function:
|
@ihard Could you, please, added info are triggers tagged or flat? I'm just guessing, in this case when many pods down and many up in one trigger, the problem appears or nor? |
99% are flat triggers |
Memory consumption when checking each trigger in debug mode or in metrics would greatly help in analyzing such problems. |
hello, could you tell us, please, how many metrics match for pattern, when open /patterns page and sort by desc value at field metrics? |
~ 100 000 at the time of analyzing the problems, now the number has grown to ~ 250,000 |
Updating to 2.9.0 revealed about 100 division-by-0 triggers that were removed.
Triggers with warning errors: |
We removed a large pack of triggers for which no metrics were received, the problem stopped. |
In addition, I'd like to add that we found a potential issue that may be causing increased memory consumption - the Temporary fix for the problem:
|
It's very similar to our problem, after deleting the metrics, everything goes back to normal for a while, but then the problem returns. |
Hi! We added the |
Will the Cli clean the keys in the current database, and the leak itself was previously fixed in some release? |
Yes, the command in the cli will clean up the current garbage, the leak fix will come in the next release. We recommend adding |
BUG REPORT
What version of Moira are you using (
[binary] --version
)?Configuration
checker.yaml
pprof heap + goroutine
pprof.zip
Metric trigger queue
Grafana Dashboard
What did you expect to see?
no memory leak
What did you see instead?
usage memory peak to 60GB and OOM
The text was updated successfully, but these errors were encountered: