We've been experiencing a strange error with one of our monitored applications.
Our alert rule states to fire an email when the state of this application is NOT "Up". So an alert email notification will trigger even if the monitor is in an "Unknown" state.
Inside the alert rule we include the component status message. In this instance we see the following in the alert notification:
The monitor on NODE05 is Unknown for the following reason:
Unexpected error occurred. Out of memory
Alert was triggered at Monday, December 5, 2016 9:54 PM.
A few minutes later the alert resets and the APM monitor works normally for several hours before this happens again.
I found the APM logs for this particular application and have these entries:
2016-12-05 21:49:12,721 [STP Pool:184 Thread #0] [A4526] [C0] ERROR SolarWinds.APM.Probes.ProbeExecutors.WmiWindowsServiceBatchExecutor - Error in ExecuteGroup occured.
System.Management.ManagementException: Out of memory
at System.Management.ManagementException.ThrowWithExtendedInfo(ManagementStatus errorCode)
at System.Management.ManagementObjectCollection.ManagementObjectEnumerator.MoveNext()
at SolarWinds.APM.Probes.ProcessInfo.ProcessAllProcesses(Dictionary`2 processes, ManagementObjectSearcher searcher, Action`2 proc)
at SolarWinds.APM.Probes.ProcessInfo.GetPerfCounterDeltaInFiveSeconds(ManagementScopeContext scope, Dictionary`2 processes, EnumerationOptions options)
at SolarWinds.APM.Probes.ProcessInfo.GetProcessInfo(ManagementScopeContext scope, IEnumerable`1 pids, IList`1 processNames, IList`1 commandLineFilters)
at SolarWinds.APM.Probes.ProcessInfo.GetProcessInfo(ManagementScopeContext scope, IEnumerable`1 pids)
at SolarWinds.APM.Probes.ProbeExecutors.WmiWindowsServiceBatchExecutor.GetProcessInfoViaWmi(String ipAddress, UserNamePasswordCredential credential, List`1 services)
at SolarWinds.APM.Probes.ProbeExecutors.WmiWindowsServiceBatchExecutor.ExecuteGroup(List`1 serviceProbeInfos)
2016-12-05 21:49:12,737 [STP SmartThreadPool Thread #38] [A4526] [C(null)] ERROR SolarWinds.APM.Probes.ProbeExecutors.WmiWindowsServiceBatchExecutor - ManagementException caught. MamagementStatus code: OutOfMemory
System.Management.ManagementException: Out of memory
at System.Management.ManagementException.ThrowWithExtendedInfo(ManagementStatus errorCode)
at System.Management.ManagementObjectCollection.ManagementObjectEnumerator.MoveNext()
at SolarWinds.APM.Probes.ProcessInfo.ProcessAllProcesses(Dictionary`2 processes, ManagementObjectSearcher searcher, Action`2 proc)
at SolarWinds.APM.Probes.ProcessInfo.GetPerfCounterDeltaInFiveSeconds(ManagementScopeContext scope, Dictionary`2 processes, EnumerationOptions options)
at SolarWinds.APM.Probes.ProcessInfo.GetProcessInfo(ManagementScopeContext scope, IEnumerable`1 pids, IList`1 processNames, IList`1 commandLineFilters)
at SolarWinds.APM.Probes.ProcessInfo.GetProcessInfo(ManagementScopeContext scope, IEnumerable`1 pids)
at SolarWinds.APM.Probes.ProbeExecutors.WmiWindowsServiceBatchExecutor.GetProcessInfoViaWmi(String ipAddress, UserNamePasswordCredential credential, List`1 services)
at SolarWinds.APM.Probes.ProbeExecutors.WmiWindowsServiceBatchExecutor.ExecuteGroup(List`1 serviceProbeInfos)
2016-12-05 21:49:12,737 [STP SmartThreadPool Thread #38] [A4526] [C(null)] ERROR SolarWinds.APM.Probes.ProbeExecutors.WmiWindowsServiceBatchExecutor - ManagementException caught. MamagementStatus code: OutOfMemory
System.Management.ManagementException: Out of memory
at System.Management.ManagementException.ThrowWithExtendedInfo(ManagementStatus errorCode)
at System.Management.ManagementObjectCollection.ManagementObjectEnumerator.MoveNext()
at SolarWinds.APM.Probes.ProcessInfo.ProcessAllProcesses(Dictionary`2 processes, ManagementObjectSearcher searcher, Action`2 proc)
at SolarWinds.APM.Probes.ProcessInfo.GetPerfCounterDeltaInFiveSeconds(ManagementScopeContext scope, Dictionary`2 processes, EnumerationOptions options)
at SolarWinds.APM.Probes.ProcessInfo.GetProcessInfo(ManagementScopeContext scope, IEnumerable`1 pids, IList`1 processNames, IList`1 commandLineFilters)
at SolarWinds.APM.Probes.ProcessInfo.GetProcessInfo(ManagementScopeContext scope, IEnumerable`1 pids)
at SolarWinds.APM.Probes.ProbeExecutors.WmiWindowsServiceBatchExecutor.GetProcessInfoViaWmi(String ipAddress, UserNamePasswordCredential credential, List`1 services)
at SolarWinds.APM.Probes.ProbeExecutors.WmiWindowsServiceBatchExecutor.ExecuteGroup(List`1 serviceProbeInfos)
What concerns me is the entries that say "ManagementException caught. ManagementStatus code: OutOfMemory".
Would this indicate a problem with the Orion APM or the monitored Node?