-
Notifications
You must be signed in to change notification settings - Fork 548
persistent log for PAI services #2129
Comments
partially resolved by #2244 |
@scarlett2018 recently, there are some wired issues in OpenPAI, as there is no log, so it's hard to trouble shooting. It should persistent log of all pods near realtime. |
We have two options:
|
It's better to centralize logs. It don't need to deploy Nginx services on host. |
@squirrelsc they have pros and cons, centralized will need fluentd service deployed on every hosts and elastic search deployed and a lot of space dedicated to the service. IMO, The local host is much more light weighted. |
we can just put all logs on HDFS, no elasticSearch is needed. If there is any alert, we have pod name, and it can be used to find log files. |
closed and tracked in #4992. |
What would you like to be added:
persistent log for key PAI services like RM, NameNode, etc.
Why is this needed:
the log will be useful for debug.
Without this feature, how does the current module work:
right now, when a pai service restarts, the log is gone. hard to debug
Components that may involve changes:
PAI key services.
The text was updated successfully, but these errors were encountered: