prawij

Comments

  • On a side note, not sure if you are aware, but in NPM 11.5 the permission to unmanage/remanage nodes is now a separate account privilege you can assign to certain users (whereas previously they would have needed the more powerful "Node Management" option granted).
  • "I am trying to allow our NOC to access "Manage Nodes" so they can "reconnect passive agents" and "restart agents" but it doesn't look like there is a way to do that without giving them admin access." Can you give them these permissions for SAM? Service Control Manager - Yes and Allow Service Actions Rights - Yes. Then they…
  • You need to use the SolarWinds Unmanage Scheduling Utility: "Configure Orion to unmanage items on a scheduled basis" (SolarWinds Worldwide, LLC. Help and Support). This is what we use to ignore nodes during their regularly scheduled weekly reboots etc.
  • We've seen this ourselves with large Unix volumes. I believe it's to do with 16-bit counter limitations and SNMP agents. Here's a post regarding a similar issue with VMware volumes: "8TB limit bug on ESXi 5.1.0 datastore polling?"
  • This is what we use for our Top 10 Windows volumes:
  • Looking at IPAM: "Orion IP Address Manager (IPAM) Latest version: 4.3.1, Compatible with NPM 11.x". Does this mean we cannot upgrade to 12.x if we are using IPAM? Is there a new 12.x compatible release of IPAM imminent?
  • This is a known bug. Multiple buddy drops have failed to resolve the issue. Devs are still working on it.
  • With the new online installer, where you can choose the multiple Orion components you want to deploy, the Config Wizard will run once at the end. Makes things much faster/smoother.
  • Seems to work for me. However, does your account have Node Management rights enabled? It looks like you could have Admin rights turned on and Node Management off in your permissions, in which case you may not be able to see the Manage Nodes page.
  • There's a canned report in the alert tool: "Alert me when a neighbor goes down", which by default writes an entry in the SW log. You just need to add an additional action to that alert to send an email to you, and make sure you have the "BGP neighbors" and "EIGRP neighbors" resources selected under "Routing" via List…
  • Someone had mentioned on the first page: "My team and I have found 2 other bugs so far. 1.) In alert actions sending an email using the ${DefaultEmailTo} or ${DefaultEmailCC} or ${DefaultEmailBCC} or ${DefaultEmailFrom} these values get corrupted in the database. The issue is something is causing the values to start…
  • Upgraded last Tuesday, been having struggles since Thursday. The upgrade process itself went smoothly, but the stability of the environment can now be described as "a hot mess". Basically need to reboot primary and additional pollers every two days to stop them from locking up hard and needing reboots via ILO. Bunch of…
  • Here's how I do it: 1) Configure the LEM appliance to do a daily backup for the logs - archiveconfig option under cmc->manager. This will kick off at 6:25AM daily. 2) Then it's a simple matter of browsing to the network share you configured archiveconfig to point to with Windows Explorer, go into the…
  • Does Array Licensed show as "Yes" below that, at the bottom of the Array Polling Details resource? When I've seen problems like that in the past it's because SRM for some reason decided that I wasn't licensed correctly. Fix/workaround was to remove the SRM integration to our standalone SRM-Profiler instance and then…
  • This is a known issue with that release of IOS: "Hardware Health Temp Sensor Status 'Major' for Cisco C3650". You can upgrade your version of IOS or, as a workaround, use the "Manage Sensor Status" feature in 11.5 to set custom warning/critical temp thresholds for the affected sensors. So in your example: "Inlet Temperature…
  • Looks like they updated the notes: SolarWinds SAM 6.2.2 Release Notes. Fixed Issues and New Additions for Server & Application Monitor (Version 6.2.2). Fixed Issues (with Customer Case Number): A problem with reporting the correct serial number for Dell nodes configured to use WMI via Asset Inventory has been fixed (790564). An issue…
  • Go to Settings->Polling Settings. Under "DATABASE SETTINGS" adjust the EVENTS RETENTION field to the value in days that you would like to retain. Once nightly DB maintenance runs, you should see that any events older than your new retention date have been cleared out. BTW the maximum number of events that can be displayed…
  • Depending on how busy your network gear is, ICMP is treated as low priority and could potentially be dropped if the router/switch is under heavy load, causing a false down node alert. Just something to keep in mind. Set the alert to not fire until after, say, 5 minutes to avoid any false alerts.
  • Do you have a lot of events/long retention time for events set? I've had that behavior before; for some reason the Last 25 events query is more resource intensive than viewing all events using the Events resource.
  • My thoughts on this: Assuming one has poller A, B and C running normally. Have three alternate pollers, Alternate-A, Alternate-B, Alternate-C waiting as standby (passive) pollers. In the case where something happens to pollers A, B or C, assuming that Windows itself is still booting, deactivate the licenses on the primary…
  • I have a single instance of Orion, and we monitor it via the "Orion Services" SAM template, applied to the primary and additional pollers, configured to page out to my cellphone if there's an issue. However this won't catch an issue where the alerting service on the primary goes offline. But at that point one of the staff…
  • Case #355574 has been opened for this issue. Thanks.
  • Anyone seeing unexplained growth in their NetPerfMon DB size since Hotfix 4 was applied?
  • The LEM appliance is running Debian so it doesn't have a C: drive. I'm guessing this is telling you one of the nodes you have a LEM agent deployed on is approaching 100% utilization on its C:\ drive. When you log in to the LEM GUI, go to "Monitor" and then choose Rule Activity on the left side menu; this should show you…
  • Can you use different zoom levels in the browsers depending on the resolution of the screens attached?
  • That's the correct package to install on any additional pollers if you are only upgrading NPM 12 on your primary poller (obviously upgrade the primary first).
  • For NPM 11.x I believe that issue of time offset from UTC was fixed in a hotfix. Edit: sorry, that was a hotfix for incorrect times shown for transaction events for WPM: "Web Performance Monitor v2.0.1 - Hot Fix 5"
  • If you are using the "Pause actions of all alerts" function under All Active Alerts, you'll need to give your monitors some time to poll and go into an Up status, before you deselect the pause option. Basically if you pause actions at say 10am, and do your maintenance, and maintenance completes at 11am, you would want to…
  • Assuming you have SAM, you can assign the Orion Server template to each poller, and configure an alert to page you if the app monitor shows as down (which it will do if any of the Orion services shows as down). Now what this doesn't detect is a case where the Orion services are up, but not actually working correctly. I'm…
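
The comment above about ICMP being dropped under load suggests holding a node-down alert until the condition has lasted about 5 minutes. As a rough illustration of that idea only (this is not how Orion implements its alert delay; the function name `should_alert`, the sample format, and the default hold time are all assumptions made for this sketch), the logic can be written as:

```python
from datetime import datetime, timedelta


def should_alert(samples, now, hold=timedelta(minutes=5)):
    """Return True only if the node has been down continuously for `hold`.

    `samples` is a hypothetical list of (timestamp, is_up) poll results in
    chronological order; `now` is the time the alert condition is evaluated.
    """
    down_since = None
    for ts, is_up in samples:
        if is_up:
            # Any successful poll resets the outage window.
            down_since = None
        elif down_since is None:
            # First failed poll of a new outage window.
            down_since = ts
    return down_since is not None and now - down_since >= hold


t0 = datetime(2024, 1, 1, 10, 0)
# A single dropped ping followed by a good poll never fires.
blip = [(t0, False), (t0 + timedelta(minutes=2), True)]
# Three consecutive failed polls spanning the hold period do fire.
outage = [(t0 + timedelta(minutes=m), False) for m in (0, 2, 4)]

print(should_alert(blip, t0 + timedelta(minutes=6)))
print(should_alert(outage, t0 + timedelta(minutes=5)))
```

The trade-off is the usual one: a longer hold time suppresses more false alerts from dropped pings but delays notification of a real outage by the same amount.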