-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chore: infer managed resources health from redis instead of storing it in CRD #10191
Conversation
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #10191 +/- ##
==========================================
+ Coverage 45.92% 45.94% +0.01%
==========================================
Files 229 229
Lines 28258 28268 +10
==========================================
+ Hits 12978 12987 +9
Misses 13510 13510
- Partials 1770 1771 +1 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should docs be added explaining the benefit of using the option? Is there any plan to deprecate the CRD-storage mechanism?
Also I'm not sure if this is really testable... seems like maybe at least inferResourcesStatusHealth
should have a test.
server/application/application.go
Outdated
@@ -1897,6 +1905,22 @@ func (s *Server) GetApplicationSyncWindows(ctx context.Context, q *application.A | |||
return res, nil | |||
} | |||
|
|||
func (s *Server) inferResourcesStatusHealth(app *v1alpha1.Application) { | |||
if !argocommon.PersistResourceStatusHealth { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this boolean be part of the Application cache key?
The upside would be that, after someone toggles on the option, there's no moment when health statuses are not returned.
The downside would be that, after someone toggles on the option, the entire Application cache is now out of date.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've got, hopefully, a good idea: introduced resourceHealthSource
field in application status that indicates if resources health is in CRD (by default) or in application tree.
This eliminates the requirement to coordinate new flag value between application controller & API server. Another benefit is that health status will be gradually removed from CRDs by controller.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should not remove the health status from the CRD, as many folks building solutions around Argo CD may use this information to feed their own systems instead of going through the API (which imho, is a perfectly fine use case).
So from my point of view, this is a heavily breaking change.
Just putting my Request for Change on here to prevent early merge and to facilitate further discussion on this :)
I think Alex intends this feature to be opt-in rather than opt-out. If we find it super valuable, we could consider enabling it by default in 3.0. But as the PR is written now, I don't think it's a breaking change. |
Yeah, I was a little quick with making my assumption. Sorry - I'm tired and stupid :) You're right. It's even mentioned it in the PR's description. |
Nah, just tired. And I can relate. 😴 |
2e08d42
to
1b1fdac
Compare
Thank you for review @jannfis and @crenshaw-dev ! PTAL |
@alexmt conflict time. :-) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One nitpick!
Should there be a note somewhere in documentation about when/why this might be helpful (and that we might make it the default in 3.0, if that's the plan)?
cmd/argocd-application-controller/commands/argocd_application_controller.go
Outdated
Show resolved
Hide resolved
1b1fdac
to
774915a
Compare
I could not find and dedicated file that would be appropriate. So documented the setting in |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm!
@jannfis PTAL |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
…t in CRD Signed-off-by: Alexander Matyushentsev <[email protected]>
Signed-off-by: Alexander Matyushentsev <[email protected]>
774915a
to
46cdd7d
Compare
…t in CRD (argoproj#10191) * chore: infer managed resources health from redis instead of storing it in CRD Signed-off-by: Alexander Matyushentsev <[email protected]> * apply reviewer notes Signed-off-by: Alexander Matyushentsev <[email protected]> Signed-off-by: Alexander Matyushentsev <[email protected]> Signed-off-by: Yi Cai <[email protected]>
* fix: missing actions (#10327) (#10359) Signed-off-by: CI <[email protected]> Signed-off-by: CI <[email protected]> Signed-off-by: Yi Cai <[email protected]> * chore: infer managed resources health from redis instead of storing it in CRD (#10191) * chore: infer managed resources health from redis instead of storing it in CRD Signed-off-by: Alexander Matyushentsev <[email protected]> * apply reviewer notes Signed-off-by: Alexander Matyushentsev <[email protected]> Signed-off-by: Alexander Matyushentsev <[email protected]> Signed-off-by: Yi Cai <[email protected]> * fix: Add logic to handle for fileHandle.Close() (#9963) (#10361) Signed-off-by: xin.li <[email protected]> Signed-off-by: xin.li <[email protected]> Signed-off-by: Yi Cai <[email protected]> * docs: fix typo in upgrade notes (#10377) Signed-off-by: Xijun Dai <[email protected]> Signed-off-by: Xijun Dai <[email protected]> Signed-off-by: Yi Cai <[email protected]> * fix: add space before prompt in CLI (#10362) Signed-off-by: xin.li <[email protected]> Signed-off-by: xin.li <[email protected]> Signed-off-by: Yi Cai <[email protected]> * docs: fix indentation of example AppProject in 'Sync Windows' documentation (#10388) Signed-off-by: Yi Cai <[email protected]> * fix: Correctly assume cluster-scoped resources to be self-referenced (#10390) Signed-off-by: jannfis <[email protected]> Signed-off-by: jannfis <[email protected]> Signed-off-by: Yi Cai <[email protected]> * ui-make-https-repo-credential-editable Signed-off-by: ciiay <[email protected]> Signed-off-by: Yi Cai <[email protected]> * Minor format fix Signed-off-by: ciiay <[email protected]> Signed-off-by: Yi Cai <[email protected]> * Minor fix for unclickable input field Signed-off-by: ciiay <[email protected]> Signed-off-by: Yi Cai <[email protected]> * Updates for comments Signed-off-by: ciiay <[email protected]> Signed-off-by: Yi Cai <[email protected]> * ui-make-https-repo-credential-editable Signed-off-by: ciiay <[email protected]> Signed-off-by: Yi Cai <[email protected]> * Minor format fix Signed-off-by: ciiay <[email protected]> Signed-off-by: Yi Cai <[email protected]> * Minor fix for unclickable input field Signed-off-by: ciiay <[email protected]> Signed-off-by: Yi Cai <[email protected]> * Updates for comments Signed-off-by: ciiay <[email protected]> Signed-off-by: Yi Cai <[email protected]> Signed-off-by: CI <[email protected]> Signed-off-by: Yi Cai <[email protected]> Signed-off-by: Alexander Matyushentsev <[email protected]> Signed-off-by: xin.li <[email protected]> Signed-off-by: Xijun Dai <[email protected]> Signed-off-by: jannfis <[email protected]> Signed-off-by: ciiay <[email protected]> Co-authored-by: Michael Crenshaw <[email protected]> Co-authored-by: Alexander Matyushentsev <[email protected]> Co-authored-by: my-git9 <[email protected]> Co-authored-by: Xijun Dai <[email protected]> Co-authored-by: Jun Duan <[email protected]> Co-authored-by: jannfis <[email protected]>
+1 on this. |
Hey @alexmt (and community)! May I ask, will setting I want to use the benefits of the flag, but my ci/cd pipeline depends on resource health outputs of I've confirmed that using the Can I trust that moving forward (even in v3.0), argocd cli output will stay the same? (Or at least that change in cli behavior will be properly documented/announced?) Thanks always! 🤩 |
Hello @dlemfh , CLI/API behavior won't change after setting You can assume that v2.x won't have breaking API/CLI changes. v3.0 might introduce it but we will make sure to notify about such change ahead of time and describe migration path. Currently there are no plans to introduce v3.0 |
@alexmt Awesome, thanks!! 🎉 |
Signed-off-by: Alexander Matyushentsev [email protected]
Argo CD historically stores managed resources' health details in Application CRD.
At some point, Argo CD started collecting health status for the whole resource hierarchy so now health status is duplicated in CRD and decided to store new health status in Redis to reduce application CRD side and reduce the number of updates.
This PR is the continuation of the original effort and introduces the following changes:
The change slightly reduces Application CRD but, most importantly, significantly reduces the number of application CRD updates. E.g. before, an application that manages Deployment with 10 pods would need to be updated 10+ times during rollout. The only change would be the change in health messages such as
Waiting for rollout to finish: 7 of 10 updated replicas are available...
. With changes, the application CRD would be updated just once.For backward compatibility controller, by default, keep storing health status in CRD. The new behavior can be enabled using
ARGOCD_PERSIST_RESOURCE_STATUS_HEALTH=true
env variable.