7 Replies Latest reply on May 10, 2018 10:54 AM by bobmarley

    NPM - monitoring Polling engine down

    wlouisharris

      I've searched a few threads on this and can't find a simple solution to monitor "Polling engine down" and "last database synce" greater than 5 minutes.  We have had problems with our polling engines showing "Polling engine down."  The alerts <polling engine> - minutes since last keep alive and <polling engine>  "polling engine completion rate" are not capturing the condition.  The alert <polling engine> "last database sync" only allows date values and not minute values. It seems like a simple solution would be to have a column in the database that measures last database sync.  We are looking at collecting the collector.service.log data and looking for this pattern:

       

      2018-04-30 11:15:55,579 [7] ERROR SolarWinds.Collector.OrionCommon.SWEventLogging - Service was unable to open new database connection when requested.

       

       

      We do not have a SAM license.  This seems like a primary component that Solarwinds should be able to monitor.  We do have 2 silos so not sure if we can setup some type of cross polling.  We're trying to avoid having to write a customer management pack with SCOM or powershell code.  If that is the only way, then we'll go that route.

       

      I've attached a screenshot.

       

      Thanks in advance.