Community
- Command Central
- MVP Program
- Monthly Mission
- Blogs
- Groups
- Events
- Media Vault
Products
- Observability
- Network Management
- Application Management
- IT Security
- IT Service Management
- System Management
- Database Management
Content Exchange
- SolarWinds Platform
- Server & Application Monitor
- Database Performance Analyzer
- Server Configuration Monitor
- Network Performance Monitor
- Network Configuration Manager
- SQL Sentry
- Web Help Desk
Free Tools & Trials

Custom SQL Alerts - Do reset conditions also need to be custom?

We have a custom SQL alert (well, we actually have a few of them) that does a calculation that uses SAM to capture processor queue length and compare it against the total number of available CPUs plus the actual CPU utilization. (Thanks Leon Adato!) We noticed last week that a bunch of these alerts weren't clearing out when the conditions in trigger were no longer true. Thinking it was a possible app and/or DB issue we opened an incident with support. This was the query that was in the database for our trigger reset (as recorded in the database) when we had selected "Reset when trigger conditions are no longer true". Note the WHERE NOT in the Original Query below.

To fix our problem we changed the reset condition to "Reset this alert when the following conditions are met" which copies the trigger query to the reset tab. We modified the query as follows:

1) Changed the (nodes.CPU_Crit is null AND nodes.CPULoad > 90) to (nodes.CPU_Crit is null AND nodes.CPULoad < 90)

2) Changed the (nodes.CPU_Crit is not null AND nodes.CPULoad > nodes.CPU_Crit) to (nodes.CPU_Crit is not null AND nodes.CPULoad < nodes.CPU_Crit)

3) Although we didn't have to change it when we copied it from the trigger actions, the reset query is now a WHERE not a WHERE NOT.

Question: How do you handle custom SQL alert reset conditions?

Original Query

SELECT Nodes.NodeID AS NetObjectID, Nodes.Caption AS Name

FROM Nodes /*SplitMarker*/inner join APM_AlertsAndReportsData on (Nodes.NodeID = APM_AlertsAndReportsData.NodeId)

INNER join (select c1.NodeID, COUNT(c1.CPUIndex) as CPUCount

from (select DISTINCT CPUMultiLoad.NodeID, CPUMultiLoad.CPUIndex

from CPUMultiLoad) c1

group by c1.NodeID) c2 on Nodes.NodeID = c2.NodeID

WHERE NOT

Nodes.n_mute <> 1

AND Nodes.Prod_State = 'PROD'

AND APM_AlertsAndReportsData.ComponentName = 'Win_Processor_Queue_Len'

AND APM_AlertsAndReportsData.StatisticData > c2.CPUCount

AND

(

(nodes.CPU_Crit is null

AND nodes.CPULoad > 90)

OR (nodes.CPU_Crit is not null

AND nodes.CPULoad > nodes.CPU_Crit)

)

Find more posts tagged with

custom_sql_alert

Accepted answers

HolyGuacamole

For Custom SQL alerts, generally the Reset condition also needs to be Custom SQL as well

See this excellent post from Richard Letts

Warning about custom SQL alerts (reset trigger)

All comments

HolyGuacamole

For Custom SQL alerts, generally the Reset condition also needs to be Custom SQL as well

See this excellent post from Richard Letts

Warning about custom SQL alerts (reset trigger)

jbiggley

Ahh, that's the ticket! Thanks for the link to the post from Richard Letts as well. That was exactly the detail I was looking for.

adatole

Great catch. I've updated the Ultimate CPU Alert (https://thwack.solarwinds.com/message/212028#212028) with this information, but for those who got here first:

You can't just select "reset when the condition is no longer true". The solution, as elaborated by Richard Letts here: Warning about custom SQL alerts (reset trigger), the reset trigger needs to be:

inner join APM_AlertsAndReportsData

on (Nodes.NodeID = APM_AlertsAndReportsData.NodeId)

INNER join (select c1.NodeID, COUNT(c1.CPUIndex) as CPUCount

from (select DISTINCT CPUMultiLoad.NodeID, CPUMultiLoad.CPUIndex

from CPUMultiLoad) c1

group by c1.NodeID) c2 on Nodes.NodeID = c2.NodeID

where

(APM_AlertsAndReportsData.ComponentName = 'Win_Processor_Queue_Len' AND APM_AlertsAndReportsData.StatisticData > c2.CPUCount)

OR nodes.CPULoad > 90

The key change here is that you want to reset when EITHER the processes are less than the number of CPU's, OR the CPU load is under the threshold