Evaluate Logging Options for provider and tenant (container) logs #250
Comments
I think we should look into managed, hassle-free solutions, so that any Akash Provider (K8s cluster) can install an agent and get all the logs available on a dashboard out of the box: https://newrelic.com/ Maintaining Elasticsearch is a big pain; a few companies use cloud.elastic.co, which is a managed ES solution. I think most (if not all) of them support K8s pod logging (including the akash-provider pod, etc.), so we probably just need to pick the one that:
@shimpa1 installed fluentd+Loki+grafana, everything looks to be working. He is now fixing a small issue with the fluentd not seeing |
Is your feature request related to a problem? Please describe.
When debugging customer issues today, we have limited logging capabilities available. Ideally, we need logs from two places - the provider (including the Kubernetes cluster) and the container (the tenant's application container) - so that we can determine the source of an issue. We also want the logs to be retained for a reasonable amount of time (at least a few hours, if not several days or weeks) so that we don't have only a short window of opportunity to catch issues.
This is what we have (TODAY) in terms of logs, retention and ability to query things:
Provider/Cluster:
Tenant/Container:
Limited query capabilities - mostly have to grep for anything we want from the above two logs.
Describe the solution you'd like
The logs from the provider software, the k8s control plane components and the tenant containers (apps) are collected, stored and made queryable through some logging platform (like Lodgy, Grafana/Loki, the ELK stack or similar).
The provider should be able to configure (via CLI or provider console) where to send provider and k8s logs.
The user (tenant) should be able to configure (via Console or API) where the tenant/ container logs should be sent.
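To make the split between provider-configured and tenant-configured destinations concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration only: the names `LogSink`, `LogRouter`, the `source` labels, the lease identifier and the example endpoints are hypothetical and do not correspond to any existing Akash API or CLI.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass(frozen=True)
class LogSink:
    """A hypothetical log destination (not an existing Akash type)."""
    kind: str      # illustrative label, e.g. "loki" or "elasticsearch"
    endpoint: str  # illustrative URL of the logging backend


@dataclass
class LogRouter:
    """Picks a destination per log source.

    Provider and k8s logs go to the sink chosen by the provider;
    tenant/container logs go to the sink chosen by that tenant.
    """
    provider_sink: Optional[LogSink] = None
    tenant_sinks: Dict[str, LogSink] = field(default_factory=dict)

    def set_provider_sink(self, sink: LogSink) -> None:
        # Would be driven by the provider's CLI or console setting.
        self.provider_sink = sink

    def set_tenant_sink(self, lease_id: str, sink: LogSink) -> None:
        # Would be driven by the tenant's Console or API setting.
        self.tenant_sinks[lease_id] = sink

    def route(self, source: str, lease_id: Optional[str] = None) -> Optional[LogSink]:
        """Return the configured sink, or None if nothing is configured."""
        if source in ("provider", "k8s"):
            return self.provider_sink
        if source == "tenant":
            return self.tenant_sinks.get(lease_id) if lease_id else None
        raise ValueError(f"unknown log source: {source!r}")


router = LogRouter()
router.set_provider_sink(LogSink("loki", "http://loki.provider.example:3100"))
router.set_tenant_sink("lease-1", LogSink("elasticsearch", "https://es.tenant.example:9200"))
```

With this shape, a tenant that has not configured a destination simply gets `None` back from `route`, so the collector can fall back to whatever default the provider chooses.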
As a precursor to implementing any UI and API changes to support this, we want to evaluate whether fluentd is a good option for us to use for log collection.
Benefits of using fluentd:
Goal of the exercise:
Describe alternatives you've considered
Continuing to grep logs from kubectl for the provider.
Additional context
No response