mesverrum · Observability Architect · ✭✭✭✭✭

Comments

  • There wasn't a later incident of the netflow service failing closer to ~9:35, was there? If you review the logs for the HA service, you will see it normally checks all the services every minute (maybe it was 2 minutes, I can't recall for sure off the top of my head). When I'm investigating things like this I typically would open…
  • When you say coming through, do you mean they are triggering the rule after you have put this in place? Regardless of the rule's existence, all incoming traps will be visible in the log viewer. Or are you saying that even with this updated logic the events are showing as triggering the rule and taking whatever action you…
  • In case anybody wants to see an example of how you would turn a grid of info like this into a SWQL report this is basically how you would get it done. I'll caution you this is absolutely not "ready for prod" code since I don't have most of these types of systems in my lab, but just an example of the logic one could write.…
  • The first time I did a SolarWinds upgrade years ago, we called up support and they sat on the line with us through the whole process. I'm not sure how often they do that, but it has happened.
  • To take advantage of the LEM rule you need to populate a user-defined list of the allowed DNS servers, and you have to be getting the logs from your firewalls. Once you have those in place you just enable the built-in DNS rule (can't recall the exact name off the top of my head, unfortunately).
  • I have also seen it struggle with LWAPs, marking them as the parent of the switch they connect to.
  • So this one actually has 2 queries, one for the 7 day count and one for the 24 hour count. Just filter both parts like so: select distinct ac.Name as [Alert Name] ,'/Orion/NetPerfMon/ActiveAlertDetails.aspx?NetObject=AAT:'+ToString(AlertObjectID) as [_linkfor_Alert Name] --,ah.Message ,count(ah.message) as [Alert Count 30…
  • Potentially, but I'm pretty sure inbound SNMP traps are a feature of NPM. Assuming you have that module, you'd need to add all your array controllers as Nodes there for this trick to work.
  • Your inference is incorrect. Ncentral is a product built from an acquisition of a client hosted remote management platform, formerly known as N-Able. It stands entirely on its own and has never integrated with Orion in a meaningful way. Likewise they bought GFI Logicnow, which was integrated with Ncentral. They are…
  • The module and snapin are the same; the snapin is just the old way to load the module, and it doesn't cause any operational differences (at least until some as yet unannounced day when the snapin may get deprecated).
  • There's also another section in there, I believe called virtual machine disks or something similar; you might have some luck there.
  • So to address your question about the avg/peak values: to get the correct ones you would go to add columns, make sure to click the button on the left that says interface traffic history, and check the boxes like this: Then you need to set the aggregation on that column like this: Then you go back to the column picker,…
  • Create a custom HTML resource on your view and paste this in <a href="javascript:;" onclick="return SW.Orion.SSH.NodeManagement.openTerminalWindow(${nodeid});"><img width="16px" height="16px" src="/Orion/images/ToolsetIntegration/Small.SSH.mo.gif" alt=""> SSH</a>
  • That's the problem: the API doesn't support direct SQL queries, it has to be SWQL.
  • SWQL is a lot more bare-bones than full T-SQL. It seems like your query would work syntax-wise if you wrote it like below, but in my lab the query itself times out. Not sure if it's a problem specific to my lab or with the way you wrote this all. What is the actual datapoint you are trying to gather? Could be more efficient…
  • Unfortunately none of what you are asking is doable. The contents of the list resource page are generated dynamically when you click the button, and there is no mechanism to filter away the types of objects it tries to find or add in objects that it does not find. If a feature like asset inventory does not come up then…
  • Do you have VMAN? Datastores are only collected if you have a VMAN license. If you do, then are your vCenters all being monitored, and does your account have the browse datastore permission?
  • They should only show up if they are assigned to the node. To clear them out, go to the node management resource, click the pollers button, and unselect the ones you need to remove.
  • You will need to check with your application and its args. I have worked with places where we could use a template for all servers in a cluster because the processes were effectively the same; other times we have had to build templates for each server.
  • I wouldn't know the command line arguments for your processes, but you can see them if you use the process explorer on your server; I think you have to add the column if you haven't done that before. And yes, you will need to either create two templates or disable the 4th process on any instances where it…
  • The method shown in there is to use PowerShell locally on the Orion server to remove any nodes you want. Obviously you can access the API from any server in your environment and use whatever scripting/programming languages you are more familiar with. I would assume that if you are programmatically managing these systems…
  • The view limitation does change every tab on a single view group. If you manipulate them in the db you can have separate limitations on each tab but the GUI always applies the same limitation to the whole group.
  • There is a lag time between changes to dynamic group memberships and the group actually checking if there is anyone new. Editing the membership in the GUI forces an update immediately, otherwise you can just wait and it will usually catch up later.
  • Try it like this and see what you get
  • It is a bit complicated, but it is looking for a series of down type events (down, unknown, unreachable, suspended, as commented in lines 88-90), then looks for the next event that could imply the status has improved from those (as commented in lines 25-28), and then it figures out how long passed between those events. It is honestly…
  • The only way I could think of would be to write it such that the script of the SAM component doesn't actually contain the real meat of the scripts. Instead you create it as a sort of pointer and placeholder: have the script in SAM point to another file on your repo and display the second script's output. Thinking it through…
  • Maybe I'm not seeing it, but tdanner's original solution seems like it would still work as long as you are making sure to actually allocate addresses to any large subnets that you really use. Addresses automatically allocate for any /21 or smaller, so you only need to take an extra step for bigger subnets that are actually…
  • So the variables that are available are all from the database table that these objects represent. So the node details pages can use any variable from the nodes table, volume details can use volumes, etc. At this point none of the variables pull data from the browser or session in the way you are asking, so the only way to…
  • To be accurate, adding the polling wouldn't be entirely without harm. Unless they are doing some kind of data validation in the job engine I suspect you would be getting jobs added in for your new pollers and then those jobs would be erroring out, possibly retrying, possibly increasing your polling loads and CPU use. If…
  • Actually SWIS exists on all SW servers that have the information service running, so the AWE does in fact provide that option. The part where things can get ugly is that the SWIS schema is not 100% consistent between the various flavors of Orion servers, so querying against an AWE will give you slightly different tables…
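
One of the comments above describes an outage-duration calculation: find a run of down type events (down, unknown, unreachable, suspended), find the next event that implies the status has improved, and measure the time between the two. The query that comment refers to is not shown here, so the following is only a minimal sketch of that same logic in plain Python, not the original SWQL. The function name, the tuple layout, and the sample status labels are assumptions made for illustration.

```python
from datetime import datetime, timedelta

# Status labels treated as "down type" events. The four names come from the
# comment; matching them case-insensitively is an assumption of this sketch.
DOWN_TYPES = {"down", "unknown", "unreachable", "suspended"}


def outage_durations(events):
    """Pair each run of down-type events with the next improving event.

    events: iterable of (timestamp, status) tuples in any order.
    Returns a list of (start, end, duration) tuples, one per completed outage.
    An outage that has not recovered yet is left out.
    """
    outages = []
    start = None  # timestamp of the first down-type event in the current run
    for ts, status in sorted(events):
        is_down = status.lower() in DOWN_TYPES
        if is_down and start is None:
            start = ts  # a new outage begins
        elif not is_down and start is not None:
            outages.append((start, ts, ts - start))  # outage ends here
            start = None
    return outages


if __name__ == "__main__":
    # Hypothetical sample data standing in for rows from an events table.
    t0 = datetime(2020, 1, 1, 9, 0)
    sample = [
        (t0, "Up"),
        (t0 + timedelta(minutes=5), "Down"),
        (t0 + timedelta(minutes=7), "Unknown"),
        (t0 + timedelta(minutes=20), "Up"),
    ]
    for begin, end, length in outage_durations(sample):
        print(begin, end, length)
```

Consecutive down type events are folded into a single outage, so a node that goes from down to unknown before recovering is counted once, from the first down type event to the first improving one.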