I am rolling out APM 2.5 soon (an upgrade from the legacy Application Monitor) and need some input on the best strategy: should we email our admins on the application status, or just on the monitor status? I have all my templates built and assigned to the nodes.
Which do you alert on: application, monitor, or both?
Do you create separate advanced alerts for each kind of monitor, or use one alert and accept a bunch of blank fields (say, the SNMP fields when a Windows service monitor is being alerted on)?
Is there a report that ties the status of the components to the status of their application (for example, components grouped by application template)?
Can anyone provide a sample of the email that they are sending to their admins? Mine so far is below.
Subject: Medium Alert - Application Monitor: ${ComponentName} on ${NodeName} is ${ComponentStatus}
**********************************************************************************************
Node Information for ${SQL:select caption from nodes where nodeid = ${NodeID}}
Name: ${NodeName}
Status: ${SQL:select StatusDescription from nodes where nodeid = ${NodeID}}
IP Address: ${SQL:select IP_Address from nodes where nodeid = ${NodeID}}
Last Boot: ${SQL:select lastboot from nodes where nodeid = ${NodeID}}
Fully Qualified Domain Name: ${SQL:select dns from nodes where nodeid = ${NodeID}}
Percent Memory Used: ${SQL:select PercentMemoryUsed from nodes where nodeid = ${NodeID}}
Node CPU Utilization: ${SQL:select cpuload from nodes where nodeid = ${NodeID}}
Node Page: SERVERNAME/.../NodeDetails.aspx
**********************************************************************************************
**********************************************************************************************
Application Name (Vendor - Product - Protocol): ${ApplicationName}
Component Name (Monitoring Type - Service): ${ComponentName}
Display Type: ${DisplayType}
Application Availability: ${ApplicationAvailability}; Component Status: ${ComponentStatus}
Last Time Up: ${LastTimeUp}
Component Response Time: ${ComponentResponseTime}
**********************************************************************************************
**********************************************************************************************
Process Name: ${ProcessName}
Process Status: ${ProcessStatus}
Process Percent Memory: ${ProcessPercentMemory}
Process Memory Used: ${ProcessMemoryUsed}
Process Percent CPU: ${ProcessPercentCPU}
Process Instance Count: ${ProcessInstanceCount}
Process Percent Availability: ${Process_Percent_Availability}
**********************************************************************************************
**********************************************************************************************
Service Process Name: ${ServiceProcessName}
Service Status: ${Service_Status}
Service Percent Availability: ${Service_Percent_Availability}
Service Memory Used: ${ServiceMemoryUsed}
Service Percent Memory: ${ServicePercentMemory}
Service Virtual Memory Used: ${ServiceVirtualMemoryUsed}
Service Percent Virtual Memory: ${ServicePercentVirtualMemory}
Service Percent CPU: ${ServicePercentCPU}
**********************************************************************************************
For more information please visit: SERVERNAME/.../MonitorDetails.aspx
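To see which fields would come out blank for a given monitor type, a rough preview can be done outside of APM. The Python sketch below is only an illustration: the function name `render_preview` and the sample values are made up, and it does simple text substitution on `${Name}` placeholders. It does not evaluate the `${SQL:...}` macros and is not how APM itself resolves variables.

```python
import re

# Matches simple ${Name} placeholders only; ${SQL:...} macros are left untouched.
PLACEHOLDER = re.compile(r"\$\{(\w+)\}")


def render_preview(template, values):
    """Return (rendered_text, missing_names) for a template string.

    Placeholders without a sample value are rendered as empty strings,
    and their names are collected so the blank fields are easy to spot.
    """
    missing = []

    def substitute(match):
        name = match.group(1)
        if name in values:
            return values[name]
        if name not in missing:
            missing.append(name)
        return ""

    return PLACEHOLDER.sub(substitute, template), missing


# Hypothetical sample: a Windows service monitor, so process fields have no value.
template = (
    "Component Name: ${ComponentName}\n"
    "Process Name: ${ProcessName}\n"
    "Service Status: ${Service_Status}\n"
)
sample = {"ComponentName": "Spooler", "Service_Status": "Down"}

text, missing = render_preview(template, sample)
print(text)
print("Blank fields:", missing)
```

Running the full template through something like this once per monitor type would show how many lines end up empty in each email, which may help decide between one combined alert and one alert per monitor type.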
Thanks
Steve
Network Engineer