-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Clarify agent health reporting #136
Comments
@andykellr @PeterF778 what do you think? |
I agree that this is unclear in the spec. I think |
What do we do with |
Resolves open-telemetry#136 - Renamed `up` to `healthy`. - `start_time_unix_nano` is no longer tied to `up` and is set independently.
Resolves #136 - Renamed `up` to `healthy`. - `start_time_unix_nano` is no longer tied to `up` and is set independently.
The AgentHealth currently has an
up
field and alast_error
fields.It is not clear how to set fields if the agent process is started and running but it is unhealthy (e.g. we have a way to verify its health by polling a health check endpoint). Should we set
up
to true or false in this case?The
up
field definition isSo, it seems like we should set it to true. However, there is no other explicitly defined way to indicate unhealthiness, unless we assume the presence of
last_error
is that indicator.We need to either clarify the spec to say
last_error
is the indicator or add another field to indicate the unhealthiness (e.g.bool healthy
), or maybe renameup
tohealthy
?The text was updated successfully, but these errors were encountered: