-
Notifications
You must be signed in to change notification settings - Fork 133
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Latest and Stable versions failing due to invalid config options #371
Comments
Same here. Using the provided params by AWS cause this to fail to start with a 255 error taking the rest of the task with it. When past default values causes crashes this should not be in "stable" |
@discovery-NukulSharma @JasonIamAUnixAdmin Thanks for report. Working on finding the root cause |
Also @discovery-NukulSharma @JasonIamAUnixAdmin , could you please explain more about what "failing" means? And could any of you confirm for me that if our dockerhub images have the same problem or the issue only happens with public ECR? https://hub.docker.com/r/amazon/aws-for-fluent-bit/tags |
Failing: the log container exits with a 255 error code, since it is tagged as essential it takes the whole task out. Pinning to the old "stable" image of 2.23.3 fixes the issue, using the current stable has the same issue. seems that our requirement of passing in |
@JasonIamAUnixAdmin I see, so the root cause is that the latest few versions 2.25.1 and 2.26.0 have this unfixed bug in it right? Yesterday we advanced our stable version so I think that might be the problem.. |
correct. We changed to And since we are messing with logging I understand why it is hard to get the "why" out out to show us but man is this painful to debug on the user end. |
@JasonIamAUnixAdmin so from the #348 we can see that from fluent bit 1.9, it starts to do some input validation. Can you please share your config so that we can figure out which field is not allowed? |
Here is one task:
And another:
|
Thanks @JasonIamAUnixAdmin , so from the public repo: https://docs.fluentbit.io/manual/pipeline/outputs/datadog. not only |
All the options in use came from DD in the past. Looks like |
@JasonIamAUnixAdmin sorry I was wrong. So Thanks. |
Can you please revert the broken |
@nakulpathak3 apologies, right now we have not been able to conclude that the To be more specific, revert cannot resolve the issue because Fluent Bit will not revert their validation which means after Fluent Bit 1.9, the restriction will always there. I will recommend you to update your task definition config and if you have any problem, you can post it here and we are always willing to help you. Thanks |
We also were hit by this. Logs confirm its a config validation issue:
As said, reverting the |
My perspective on this recent issue is that I fully understand the pain and its unfortunate that folks were impacted by the addition of config validation; furthermore, fixing config validation is a good thing, and it's not a backwards breaking change. The config options that all of you had in your configs never had any effect- so no features were removed. I think of the fluent bit config options as an API contract. Imagine you were calling a service API without adhering to its contract- failing to validate and fail those requests would be a bug IMO. |
well its absolutely a contract, but I don't agree it is a non-breaking change to that contract. If I made a similar change to a REST API without a major version bump and brought down production, that's on me, not the consumer of the API even so it is a positive change to make and the real problem here is people (us included) not pinning the image version and expecting nothing to ever change under them. |
@james-skinner-deltatre has a good point at the end of his comment that I strongly agree with and that folks should check out- it is best to lock to a specific version tag. Rather than locking to our latest or stable, have a human check these files and then pick that version: |
For folks coming here having issues with DataDog the |
We are noticing stable version since yesterday are failing, seems previous version is working fine , if we just use specific tagof previous version.
Failed
Passed - older version
The text was updated successfully, but these errors were encountered: