I need to resolve an issue that been reported on a node, either a router has an issue, I am gathering something unimportant or incorrectly, or I need to understand it better. Likely 2 of the 3. One interface of this router (which contents this offsite via GigaMAN service back to our primary datacenter). InErrorThisHour values range from 1 million to 4 million. I think there is clearly something wrong, but when the network engineer goes to check stats from the router (his counters haven't been reset in about 2 months) ther appear to be no errs. I feel like we aren't speaking the same language, any suggestions?
Pull these values using custom query for my network team:
SELECT
'' AS n,
'/Orion/images/StatusIcons/small-' + ToString(AdminStatusLED) AS [_IconFor_n],
FullName as [INTERFACE],
'/Orion/Interfaces/InterfaceDetails.aspx?NetObject=I%3a' + ToString(InterfaceID) AS [_LinkFor_INTERFACE],
MAC,
InErrorsThisHour as ['In Err HR'],
InErrorsToday as ['In Err Today'],
OutErrorsThisHour as ['Out Err HR'],
OutErrorsToday as ['Out Err Today']
FROM
Orion.NPM.Interfaces
Where
(OutErrorsToday + InErrorsToday) > 0
Order By (OutErrorsToday + InErrorsToday) Desc
This is the top entry all the time by a wide margin.