Recently I encounter with an issue with 1 of my application server.
The server is being monitored in SAM so we're monitoring all it's processes & general server utilizations.
However 2 days back the server hung & nobody realized it until a couple of hours when more and more users complaining the application getting timed out.
When check, we're unable to RDP to the server, however ping works fine. We had to force restart to bring it back to normal.
Throughout this duration no down alert was generated by SAM until we restarted the server.
I'm assuming when the OS is hung, WMI processes all got stuck as well. Is there a way for SAM to alert us if the OS is hung? There should be a way right? Since the server's WMI hung, SAM no longer able to poll successfully then it should be able to send an alert for this scenario right?