We have an environment where all routers sharing a particular custom property need to be monitored for a very specific condition.
Each of these routers has two virtual interfaces: Tunnel10 and Tunnel20
Each of these tunnels has a single EIGRP neighbor.
The EIGRP neighbor addresses themselves are DYNAMICALLY assigned, so the actual EIGRP neighbor address itself is not significant to us. Only the presence of an EIGRP neighbor is important.
This output shows a normal operating status:
someroutername#show ip int br | in Tunnel
Tunnel10 10.255.5.59 YES NVRAM up up
Tunnel20 10.255.9.223 YES NVRAM up up
someroutername#show ip eigrp nei
EIGRP-IPv4 Neighbors for AS(65001)
H Address Interface Hold Uptime SRTT RTO Q Seq
(sec) (ms) Cnt Num
1 10.255.5.1 Tu10 11 1d00h 92 1398 0 125120116
0 10.255.9.1 Tu20 10 5d04h 43 1398 0 96867940
someroutername#
The following output shows an ABNORMAL operating status if it persists for more than five minutes:
RT-841-VPA-IL-ELKGROVEVILLAGE#show ip int br | in Tunnel
Tunnel10 10.255.5.59 YES NVRAM up up
Tunnel20 10.255.9.223 YES NVRAM up up
RT-841-VPA-IL-ELKGROVEVILLAGE#show ip eigrp nei
EIGRP-IPv4 Neighbors for AS(65001)
H Address Interface Hold Uptime SRTT RTO Q Seq
(sec) (ms) Cnt Num
1 10.255.5.1 Tu10 11 1d00h 92 1398 0 125120116
Note that in the previous output, one of the tunnel neighbors is not working - there is no established neighbor.
I want to create an alert that flags the above condition.
To be specific: If one of the two neighbors is down for more than five minutes, trigger the alert. If that specific condition no longer exists, CLEAR the alert automatically.
I've asked for help on this type of alert before. Syslogs and trap messages don't work for us because they cannot automatically clear themselves and we really don't care if a neighbor disappears for a few seconds, we only care if a neighbor disappears for more than five minutes. We don't care about the actual neighbor IP addresses themselves, only that there IS one.
If neighbors on BOTH tunnels disappear, the router (entire node) goes offline and we have a different alert for that.
Out of the box "routing neighbor" conditions don't work for this set of conditions, many have been tried with the assistance of Solarwinds technical support. They recommend that I come here for attempting to create a custom SQL or SWQL alert.
Can anyone assist?