-
Notifications
You must be signed in to change notification settings - Fork 133
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feature request: cloudwatch_logs templating failure can lead to a very large number of warn logs #599
Comments
For 2.0 there are workarounds: fluent/fluent-bit#6918 (comment) |
This is a concern for the upcoming Daemon support launch, since it makes extensive use of templating which can fail . |
Yes we also had this issue costing quite a bit (and yes we now have alerts). Would be great to have a solution indeed! |
It's not only templating that can fail with this impact, but also access denied when attempting to (say) create a log stream (where fluent bit can create the log stream for its own logs for whatever misconfiguration reason). |
The new log suppress feature in 2.0 is good for the CW access denied and other errors. Which suppresses the same message from a single plugin. For the templating failures, the message actually comes from a core library which I think is not part of the log suppress feature. |
we ran into this as well and racked up massive cloudwatch changes. the default values should not have any template variables that aren't guaranteed to be there. people will probably keep tripping over this. 😐 |
aws/aws-for-fluent-bit#599 Signed-off-by: Wesley Pettit <[email protected]>
When Fluent Bit is deployed as a kubernetes daemonset pod, it collects its own logs, and thus, when it emits an error message, it will collect its own error message, potentially leading to a cycle of log spam in which each of its own error logs cause it to produce another error log. Key cases:
|
See one option for workaround here: fluent/fluent-bit#6918 (comment) |
aws/aws-for-fluent-bit#599 Signed-off-by: Wesley Pettit <[email protected]>
aws/aws-for-fluent-bit#599 Signed-off-by: Wesley Pettit <[email protected]>
See: fluent/fluent-bit#6918
The warning for record_accessor failure here can be emitted for every single log record processed: https://github.com/fluent/fluent-bit/blob/master/plugins/out_cloudwatch_logs/cloudwatch_api.c#L1084
It'd be ideal if this was only emitted once per batch instead. This will require code changes.
The text was updated successfully, but these errors were encountered: