Has anyone generated a report that shows the Application status PER Poller (SAM Monitor UP/DOWN/Unknown count)? The Event Summary shows these counts overall on the system but when 3k components are showing down, it's good to know what poller is having issues quickly.
I have multiple APEs and have an issue that a poller will stop processing the application monitors properly and most/all the applications will show DOWN or UNKNOWN. Sometime services will recover due to a crash, but most of the time a HA failover or server reboot is required to restart services.
My problem is when this happens at night and another tech gets flooded with alerts.
I'd love to be able easily show how many components are UP/DOWN/Warning on a per poller basis.
Any input appreciated.
Thanks!