tigger2

Comments

  • Like! As long as you like scripting/coding... I've almost always found the need for a secondary set of processing/scripts to "enhance" or "band-aid" monitoring systems' giant pile of alerting gibberish, since they all don't focus on event management very well, just condition detection and data collection, then burp out an…
  • Awesome! Thanks for posting these!
  • This may possibly be of interest as it's a modification of the "check for node not responding" you originally posted that I'm using to catch when Windows servers go into a "soft fault" condition (WMI not responding, cannot login via Remote Desktop or VMware console...but server pings, application may be working, and…
  • If it's of any additional use for detecting WMI/polling anomalies: I've got a cheap method that only works for WMI, but it's helped catch a few times a device was pingable and Orion *was* polling correctly but the server had issues (you need to have SAM installed). I call it a "canary" alert because you really don't know…
  • I am doing MD5 hashing of some files, one file at a time, with a custom script I wrote in ....vbscript. I'm also not doing it all in 1 component monitor. If you're more of a vbscript guy (and I hope not) I'm using the MD5 hashing algorithm from here: How to generate MD5 using VBScript in classic ASP? - Stack Overflow BUT if…
  • Follow up, in case anyone wants it: for Part B above, this may suffice using CPULoad_Detail. It is a SQL query against the NPM database that returns data from the Nodes table that might be useful in tracking down why the node is missing data. Selects "last day" of data, not 24 hours but works if you run it after each…
  • One thing I've used often, even though it's kinda a pain, but works for how we manage things: Make a custom node property called something like "AllowAlertNotifications", and set it to a "Yes/No" (Boolean) value. Then in all your alerts, have a top level condition of "AllowAlertNotifications" = Yes Then if you need to turn…
  • I agree with mesverrum, and I haven't gone out on Thwack and read anything, and it may be hackish, but if you search Thwack for "Ultimate CPU Alert" you may find something. There are several posts and people have been trying to build a cool/better CPU alert. I haven't kept up with the posts (I just know it's out there as…
  • My $0.02... Always have more than one method/protocol other than SMTP to send notifications out, if you can, to alleviate issues with email (latency, email maintenance or email system down). Usually just one additional method like SNPP/SMS or anything non-SMTP will work "well enough" as a backup method for "critical"…
  • Some things I've done: - Remove as much data as you can on pages people use (by doing the things GoldTipu (Malik) mentioned above) - I had some issues where I tried to make some views that, behind the scenes, were making a *lot* of queries to determine what access to what devices a user had (think lots of account…
  • Something to check. It won't help your problem, but it might give a more accurate file count: - I believe that Orion monitor has an attribute/config of "INCLUDE SUBDIRECTORIES" set to "yes" by default, so the file count may be all files in C:\Windows\temp. I've overridden mine to "no" - The "file extension filter" in that…
  • If you set the alert condition so it's "if any of the below condition is not met" the node can be up and CPU be > 80% and the alert will trigger. You may not want this all in one alert (due to reasons below) because it's putting a lot of separate logic/conditions that are not really related into a single alert, making the…
  • "when I try to add the node it fails the pre-test every time" I've recently had some issues with SNMP and adding nodes. Here's one post I had that might have some helpful info/things to try: Is there a way to troubleshoot why the the "test" button fails when adding a new node using via SNMP
  • I really, really, really wish this was built-in functionality. I believe finding "dead/changed" interfaces has been added to the newest versions of NPM, but not volumes. I've seen this question asked before on the forums, and the only answer I personally have is still mostly a manual process: 1. To find new volumes: To run…
  • Off the top of my head, I can think of 5 - Creating "new" templates with one or more component monitors is relatively easy. Even if your component monitor is literally a script you have to create from scratch, it doesn't take too long to go from "I have written this script" to "and now Orion is running it for me every X…
  • I'm using NPM version 12.1. (Maybe an older version doesn't work this way.) It's possible that this setting may need to be set: Admin -> Settings -> Web Console Settings -> check "ENABLE AUDIT TRAILS"? I don't know if this turns on extended auditing or if it's literally an "all auditing ON/OFF" switch. I have this set in my…
  • Update, for those who may end up here: Issue still not fixed, BUT it has been more concretely determined that the issue exists on the Exchange server, specifically with the Exchange "https://${IP}/powershell/" component. Next steps: The only thing that the net seems to agree on is "restarting" and "rebooting" and…
  • Update: Since I do have some servers working, I started trying to find differences in them. One that jumps out: Navigating to https://<IP address>/Powershell on the servers where the Exchange template configuration "worked" brings up the "certificate not trusted" warning page. Accepting to continue just goes to a blank…
  • My 2 cents: 1. Get NPM "up and running" (as suggested above) if you don't already have it or if no one is monitoring the servers/network for "the basics". At least scope out if this is "a needed thing" from your team. 2. For application level monitoring, it would be "nice" to try to scope out how many apps and "things"…
  • Also wondering this....bump
  • It looks like you just select and "delete it" via "remove DNS zones". These hanging around may be an issue with IPAM not scanning anymore and not performing the updates itself, or, in my case, there WAS a single config somewhere that was not cleaned up on one of the DNS servers.
  • The post is a little old...but not marked "answered" so .... maybe here is another way this can occur, or at least some more info on what to look out for. I ran two upgrades on NPM 11.0.1 to upgrade IPAM from 4.1 to 4.2, then another to upgrade to 4.3. Everything looked "ok"...except it wasn't. I saw some application event…
  • Curious, whether to use the "online" vs "offline" installer for this upgrade. It seems the consensus from this thread (I've tried to catch up on it since the 2018.2 platform came out) is to use the "offline" installer even though the "online" installer is recommended by SolarWinds. Is there any difference between them…
  • Bump. I know this is an old post but.... We're on a single server install of: NPM 12.1 with Orion Platform 2017.1.3 SP3, NCM 7.6, SAM 6.4.0. And we just had this issue, and the "unmanage" + "remanage" appears to have fixed our issues. I'm posting to add some info, if it helps anyone: 1. CPU on 1 node in Orion went from…
  • Sorry for the long post/typos.. The problem, as I see it, is many-fold, but it starts with the problem that you don't have the data in the monitoring system(s) that you wish you had, even if you passed that data to a script. The data fields used to generate an alert, and the data fields used to send the alert (or pass it…
  • I think you may be right about how Orion interprets $1. Just a hunch as well. I have not tested this, but you should be able to get rid of $1 and $2 by doing this: my ($timeServerTime,$drift) = $rawDrift =~ /(\d+[ ]\w{3}[ ][\d:]{8}) .*? offset[ ]-?(\d+)[.].*? /xms; then, if you don't want to cram int() into the call…
  • Update: It looks like the vendor does not have and is not going to add MIB-2 to their device. Their reasoning is "MIB-1 ought to be enough for anybody", which is all I received via a hand written, wax sealed folded scrap of parchment that was hand delivered by a messenger riding a horse. I'm pretty sure they're not…
  • Thanks for all the input! It's been awhile since I've had to do anything too deep in SNMP so I'm a little rusty on... everything. I don't have any command line tools installed, etc other than the Engineers Toolset. There's several questions above so I'll try to sum it all up so far: - SNMP is enabled on the target device,…
  • Update: I migrated both IPAM and UDT to their own servers. Basically, it appears the instructions above to "Migrate to a new server/IP" are the ones to use, with the added step of "when Orion is shut down, make a copy of the database to use for the new system". I ran into a few "gotchas". Since UDT/IPAM have a base of Orion…
  • Update: I had a new cert issued, and it "looks better" than the first (the first was probably exported incorrectly) but the new cert is still not showing up in the Orion config wizard as a choice when I set the "Enable HTTPS" option. FYI: The cert does show up in the IIS manager as a cert I can use/select when binding, but…
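
For the regex comment above (capturing straight into named variables instead of reading $1 and $2), here is a rough Python sketch of the same idea. The pattern is carried over from that comment; `DRIFT_RE`, `parse_drift`, and the sample line are made up for illustration, and the real script output may well look different, so treat it as a starting point rather than a drop-in.

```python
import re

# Python rendering of the pattern quoted in the comment.
# re.VERBOSE mirrors Perl's /x, re.DOTALL mirrors /s, re.MULTILINE mirrors /m.
DRIFT_RE = re.compile(
    r"""
    (\d+[ ]\w{3}[ ][\d:]{8})   # timestamp: day, month abbreviation, HH:MM:SS
    .*?                        # anything up to the offset field
    offset[ ]-?(\d+)[.]        # whole-second part of the offset, sign dropped
    """,
    re.VERBOSE | re.DOTALL | re.MULTILINE,
)


def parse_drift(raw_drift):
    """Return (timestamp, whole_seconds) or None if the text does not match."""
    match = DRIFT_RE.search(raw_drift)
    if match is None:
        return None
    # Unpack both capture groups at once, the analogue of
    # my ($timeServerTime, $drift) = $rawDrift =~ /.../ in Perl.
    time_server_time, drift = match.groups()
    return time_server_time, int(drift)


# Hypothetical input line, only here to exercise the pattern.
sample = "27 Nov 10:15:00 ntpdate[1234]: adjust time server 192.0.2.10 offset -3.002345 sec"

if __name__ == "__main__":
    print(parse_drift(sample))
```

Unpacking the groups in one step avoids relying on match variables that something else might interpret or overwrite, which seems to be the concern in that comment.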