Hello,
I am looking for some advice on setting up Alerts for Docker Containers.
In my current environment, we have several Kubernetes containers running on several Linux servers.
I have been tasked to alert when a container restarts.
What I am seeing is the container in Exited status, and I alert on that status. However, the container takes 7 days to change to Deleted, and the alert does not reset until it does.
In the meantime, the server spins up another running container.
So far, the best I have been able to do is provide a count of containers in the Running state with a similar name on the same node, with instructions to Acknowledge the alert as long as a running container is seen.
I am hoping somebody has some better ideas to assist me in my quest to monitor containers.
I have considered alerting on a SWQL query, but I cannot seem to work out the logic or get it to fit into the default Container Select statement.
For each container, I filter for the Name to contain a certain string. I also filter out "PODS" to eliminate the paused containers.
I then have one or more "names" (names vary by container), which I parse out to get a simple name. From the resulting list, there should be 1 "running" container. When that container is no longer running, I want an alert, and when 1 is running again, the alert should clear.
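
To make the logic I am after concrete, here is a rough Python sketch (not SWQL). It is only an illustration under some assumptions: the field names `node`, `name`, and `status`, the helper names, and the sample data are all hypothetical, and the parsing assumes container names of the form `k8s_<container>_<pod>_<namespace>_<uid>_<attempt>`, which may not match every environment.

```python
from collections import Counter


def simple_name(container_name):
    """Reduce a full container name to a simple name.

    Assumption: names look like k8s_<container>_<pod>_<namespace>_<uid>_<attempt>.
    Anything that does not match is returned unchanged.
    """
    parts = container_name.split("_")
    if len(parts) > 1 and parts[0] == "k8s":
        return parts[1]
    return container_name


def groups_without_running(containers, name_filter, exclude="POD"):
    """Return the (node, simple name) pairs that have no Running container.

    containers  -- iterable of dicts with hypothetical keys node, name, status
    name_filter -- string the container name must contain
    exclude     -- string marking containers to skip (assumed marker for pause containers)
    """
    running = Counter()
    seen = set()
    for c in containers:
        name = c["name"]
        if name_filter not in name or exclude in name:
            continue
        key = (c["node"], simple_name(name))
        seen.add(key)
        if c["status"] == "Running":
            running[key] += 1
    # A group alerts only while its running count is zero, so the alert
    # clears as soon as a replacement container is Running again.
    return {key for key in seen if running[key] == 0}


# Hypothetical sample data: node1 has already restarted, node2 has not.
sample = [
    {"node": "node1", "name": "k8s_web_web-abc_default_uid1_0", "status": "Exited"},
    {"node": "node1", "name": "k8s_web_web-abc_default_uid1_1", "status": "Running"},
    {"node": "node1", "name": "k8s_POD_web-abc_default_uid1_0", "status": "Running"},
    {"node": "node2", "name": "k8s_web_web-def_default_uid2_0", "status": "Exited"},
]
alerting = groups_without_running(sample, "web")
```

The point of grouping by node and simple name is that an old Exited container no longer keeps the alert active on its own; only the absence of any Running container in the group does.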