-
Notifications
You must be signed in to change notification settings - Fork 773
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AdvancedCronJob支持展示上一次执行的成功job数 #519
Comments
@qiankunli |
DESIRED=52,SUCCEEDED=10,FAILED=2 中间的差值在哪看呢?一般有啥原因不? |
@qiankunli YAML please. |
这个acj的作用是,每天早上8点执行clean-log image 中包含的clean-log shell 清理物理机xx 天前的日志。我发现有一些pod 报Outpod(估计是物理机负载较大导致pod 创建失败),导致pod failed,如果 FailurePolicyType默认为FailFast,导致整个bj 执行失败? |
@qiankunli 是的,默认 |
@FillZpp 感谢提醒,我试试 |
@qiankunli 方便的话可以登记一下使用 #289 ,以便我们后续收集反馈 |
acj 常规输出
我们发现部分node 可能是因为负载较高的缘故,从未执行过job(pod 状态为OutOfpods),但一直没发现。
期待可以输出最近一次执行的 成功数/node节点数,来判断在所有节点上是否都执行,如果未执行,可以尽快去采取一些措施。该数据也建议与kube-state-metrics 集成,方便做监控。
The text was updated successfully, but these errors were encountered: