This projects deploy the prometheus monitor framework.
The monitor target contains:
- Ceph
- K8S
- Openstack
- Zabbix
TODO
We deploy the monitor framework base on the following architecture.
- Each k8s contains promethues+grafana locally, for example: dev k8s、b28 k8s、m6 k8s.
- Admin cluster contains the global prometheus+grafana. The global prometheus contains metric for openstack、ceph、zabbix.
- Each k8s promethues thanos sidecar uploads the metric to OOS(minio)
- Global grafana select the metric from thanos query which forward the request to thanos-storegateway、each k8s sidecar and local promethues.
it contains minio, query frontend, query and storegateway.
minio need to add dev-k8s user with 'readwrite,consoleAdmin,diagnostics' policy and create the thanos bucket auto cleanup lifecycle.
./mc --insecure ilm add --expiry-days "61" thanos-minio/thanos
the ilm can list with the following command:
./mc --insecure ilm ls thanos-minio/thanos --json
it contains grafana, prometheus-operator, prometheus.
TODO
1、The server reports the operation status through ZABBIX agent, including CPU, memory, hard disk, network status, equipment operation, etc.
2、Ensure the high availability of the server through bond.
3、Routing Switch&Ethernet Switch SNMP reports network status, equipment operation, network congestion,etc.
4、VSU ensures high availability of switch.
5、The loadbalance ensures high availability of services.
6、The firewall ensures the security and controllability of the data centers B28 and M6.