6 Replies Latest reply on Jul 25, 2016 10:05 AM by cahunt

    Polling Engine Status Alerts

    k1gaudineer

      We have several additional polling engines in our environment.  Recently we have been having a problem with one or two of the additional polling engines stop polling, and when we check the status of the polling engine in the web console the status is Down.  The fix is to shut down all services and restart all services. 

       

      My question is...  Is there a way to create an alert for an additional polling engine status?  It would be nice to get an alert when the polling engine status changes so I can correct the problem before the customer calls and tells me that there is data missing from their graphs....  Any hints on how to create the alert are greatly appreciated.

        • Re: Polling Engine Status Alerts
          cjfranca

          You can create just as it creates to another node when the status is anything other than up

          Now I have no way to send a picture of it.

          This here can help you SolarWinds Online Help  to have a more accurate idea of the solution.




          .

          • Re: Polling Engine Status Alerts
            rschroeder

            If you're running NPM's newer versions, consider enabling WMI on the servers hosting the pollers, then using admin credentials to add QoE to them. 

             

            Then you'll have a more granular picture of what's actually going on inside them, and be able to alert on that.

             

            NPM can alert on services, if I recall correctly, and if you have WMI enabled on the server, and NPM polling them.

             

            A starting place for customizing your pollers is here:

            http://www.solarwinds.com/resources/videos/universal-device-pollers-undp-solarwinds-training-network-performance-monitor.html

            • Re: Polling Engine Status Alerts
              prawij

              Assuming you have SAM, you can assign the Orion Server template to each poller, and configure an alert to page you if the app monitor shows as down (which it will do if any of the Orion services shows as down).

               

              Now what this doesn't detect is a case where the Orion services are up, but not actually working correctly. I'm working on an alert for this scenario now, where I leverage WPM to playback a transaction where I look up the node details of a specific node on a given poller, and set it to verify that we get back the expected text (WPm - text match on node name etc). Set one of these up for each poller you have.

               

              So yes, you can monitor the pollers automatically, as long as you have paid the bucks for SAM and WPM....

              • Re: Polling Engine Status Alerts
                cahunt

                select

                Engines.ServerName,Engines.IP, Engines.ServerType,

                convert(varchar, round(nodes.systemuptime/60/60, 2, 1))+' hrs' as Uptime,

                Engines.Elements as Elmts, Engines.Nodes, Engines.Interfaces as Int, Engines.Volumes as Vol,

                c.custpolls as UnDP, a.samct as SAM,

                N.Down_node, I.Down_Int, V.Down_vol, A2.Down_sam,

                s.failed as noSNMP,

                Engines.PollingCompletion as "%complt",

                nodes.nodeid, nodes.CPUload as "%CPU", nodes.percentmemoryused as "%RAM",

                e1.PropertyValue as NPM_Rate, e2.PropertyValue as SAM_Rate

                from Engines

                join nodes on engines.ip = nodes.ip_address

                 

                 

                left join (select engineproperties.engineid, EngineProperties.PropertyValue from EngineProperties where engineproperties.propertyname = 'Orion.Standard.Polling') e1

                  on engines.engineid = e1.engineid

                left join (select engineproperties.engineid, EngineProperties.PropertyValue from EngineProperties where engineproperties.propertyname = 'APM.Components.Polling') e2

                  on engines.engineid = e2.engineid

                • Re: Polling Engine Status Alerts
                  cahunt

                  Check each server logs for issues either with resources or the application itself. Check your polling rate/Job Weight as well in settings and off load nodes from your higest offender ---- if this problem node is carrying the most in job weight there is another underlying issue with it.

                  • Re: Polling Engine Status Alerts
                    cahunt

                    also found this, may provide you some insight -> UDT Job Status report by polling engine