marcrobinson

unfortunately, the packet loss that I have been getting from Microsoft endpoints limits the utility of the data. You can resolve all the paths and public IPs of a public service endpoint. Everything up to that point is pretty solid. So it is still helpful, but you don't get the "hey there is latency at Microsoft Service…

in Proactive Monitoring of Microsoft M365 Services using Solarwinds HCO Comment by marcrobinson May 2023

@"adam.beedell" A positive experience with azure? Ok, it is not all bad, but Microsoft has not standardized security across the azure/o365 space - and that is a big part of the headache. Using the built in API monitoring templates, there are some limitations, that come to mind as it wants numeric data or formulaic words…

in Proactive Monitoring of Microsoft M365 Services using Solarwinds HCO Comment by marcrobinson May 2023

Generally, it has been chased down to whitelisting email from external senders (your front end mail filtering) and client side email filtering rules. There have been a couple rare instances with Pingdom email, but you can check here Pingdom Service Status for status on their internal services. As far as tracking email…

in Need help troubleshooting Pingdom alert notifications Comment by marcrobinson May 2023

I like your modern dashboard. I have played with it for a NOC, but wish I could have tabs with it. I use a the classic for a NOC to rotate through several screens. One thing I have started to play with are timely charts/tables - if X happened within last Y hours, display what X is. Other than that - maps can be useful to…

in Network Dashboard - What are you doing? Comment by marcrobinson May 2023

When you go into settings, then account management and edit a user or group, It would be the Default Summary View. Changing that view changes where users go when they click on the logo in the top left. The "Home Page View" sets the view they see after they login.

in Go to Orion Home (when you click the top left icon) - How to change the home page tab? Comment by marcrobinson May 2023

I agree with @"bobmarley". Your polling rate should be above 75% typically. I tend to use the main polling engine just for discovery and to park nodes for later migration also. The main poller is responsible for a lot of the solarwinds platform duties - ie alerts etc. Keeping it happy is honestly a good thing. If you have…

in Network Discovery cannot start on the Main Poller Comment by marcrobinson May 2023

Yes - you edit that copy. I kind of follow what you are saying as this is the next step: You can assign as the default view for that type of device. When editing this copied (and renamed) dashboard, select Add Widgets and search for 'universal' and check the following ones. These should be some of the more useful: *…

in How to monitor UPS Comment by marcrobinson May 2023

@"fop" Please add the link to this feature request, so we can all bump it up. Calling the bumpsquad! As to your pain with Microsoft and Azure - umm you are not alone. I get very similar reactions when I talk to the Azure Architects. This is often mentioned with a comment to wait two weeks, Microsoft will update Azure to…

in Anyone tried to use "Component Monitor Wizard" to monitor Azure Application Gateway before? Comment by marcrobinson May 2023

RabbitMQ is still part of the SolarWinds/Orion platform. I would not disable it. I have not run into your issue either. Do you have a method to whitelist it in Trigeo? SolarWinds Platform 2023.2 System Requirements - look for traffic on port 5671.

in RabbitMQ Tripping PowerShell Monitoring Alerts Comment by marcrobinson May 2023

Copy the default node view. Rename it something like Vertiv Details. You will need to add widgets based on the undp data. You can take the swql and create widgets like the radial dials or charts based on things like time on battery too. The swql that you use for the reports can be modified for the alerts. Universal Device…

in How to monitor UPS Comment by marcrobinson May 2023

2023.2 includes fixes for the retention, and I believe a few other fixes for the maintenance. 2023.1.1 also included some fixes. As far as monitoring, I have heard of people monitoring this log with a SAM component. Lemme see if I can find a quick pointer. EDIT: I recall this being done with powershell in a template. But…

in Database maintenance ... has finished with errors. Comment by marcrobinson May 2023

separate syslog db is one of the recommendations, if for no other reason than this.

in DATABASE FULL Comment by marcrobinson May 2023

Agreed. Also check latency/port exhaustion to your SQL server. There are some windows events to look for, in particular Event ID 4001 from SWService. The message will be something like the system is unable to open a connection to the sql server. Since those are logged in windows events, you can search after the fact for…

in Tools to check for windows server port congestion?? Comment by marcrobinson May 2023

Your post just made my day! Thank you for posting this and suggesting DPA as an eval to get to the root of the issue. This is definitely an issue for those of us with large environments or limited compute power.

in Easy way to increase SQL performance in multiple APEs environment Comment by marcrobinson May 2023

Hmm, this might explain somethings.. Good call.

in Anyone tried to use "Component Monitor Wizard" to monitor Azure Application Gateway before? Comment by marcrobinson May 2023

gotcha. then ignore my thoughts about changing scope.

in How do I get an alert to fire off during a specific time period? Comment by marcrobinson April 2023

Looking at similar that I have - I used component status = down/up for trigger and reset and specified the component name or id in the scope. This does change it from and application to a component alert though.

in How do I get an alert to fire off during a specific time period? Comment by marcrobinson April 2023

Lets start by changing the evaluation to 5 minutes, and setting the trigger condition must exist for more than – to 45 minutes in your test system. You can play with that persistence trigger to force the alert to make it is doing something too. Otherwise, you can reverse the logic - if a file shows up - alert. I am…

in How do I get an alert to fire off during a specific time period? Comment by marcrobinson April 2023

Check the latency to the sql server. Also check to see db maintenance is running. Maybe have a dba check to see if additional indexes should be created too. (IT shouldn't but your mileage may vary as much as mine has on this last option.) What modules do you have? If you have NPM or HCO, then you can use netpath to dig…

in Has this Orion latency symptom been experienced and remedied before? Comment by marcrobinson April 2023

Hmm Could this affect how items are searched and reported upon in SWQL/web interface. I am noticing some odd results with underscores and hyphens/dashes in results against orion 2023.1. It looks like the two characters are treated as the same by swql now.

in Change Orion SQL Collation Comment by marcrobinson April 2023

I have seen this done with multiple nics. Hateful, but it worked. A real network engineer may provide insight into using acls and routing that system to where it needs to go. Note: I am not a network engineer. So I will not try to answer this one.

in Monitoring machine connecting to multiple closed networks Comment by marcrobinson April 2023

Good description, the actual logic may help if you can post it. Second - you have a fun one. Ok, fun is not quite accurate. You may be able to solve this with alert trigger and reset persistence settings and changing the alert frequency. Create a copy of the alert and have it email just you for testing. Based on the…

in How do I get an alert to fire off during a specific time period? Comment by marcrobinson April 2023

Excellent Dashboard. @"Slamdance" beyond setting up that dashboard, some of those metrics are available in the admin section. Go to All Settings, then scroll down to the Polling Engine for quick stats on polling levels, and then the My Deployment (also seen under Settings) and go to Deployment Health. You will want to look…

in Polling server constant 90-100% CPU usage Comment by marcrobinson April 2023

Click on the Store link after the Free Tools and Trials. You can get SOCKS! And other bits of goodness, such as exam vouchers if you get enough points.

in Create multiple User accounts in SolarWinds Comment by marcrobinson April 2023

Start with the usual things - is that one poller running significantly more components? Are there large numbers of components/templates/nodes with higher than normal polling rates on that single poller? 4 cpu? It might need more cpu, sounds like you are on the verge of 8 cpu.

in Polling server constant 90-100% CPU usage Comment by marcrobinson April 2023

I just poked around google on that json sql error and this is one for SolarWinds support. Mostly because it looks like you are getting an error on some configuration json sent to the orion db. Not sure if I would want to troubleshoot that on a whim. Good luck and please post, it sounds like this might the first big bug…

in Orion Database Failed (trying to update to 2022.3) JSON_MODIFY must be a string literal Comment by marcrobinson April 2023

Awesome, and you hit the issue on the head. How to do that with the api poller is the next question as you stated. I am able to test it with token also, but the token expires hourly - so another challenge there.

in Microsoft Azure Application Gateway API Comment by marcrobinson April 2023

The Orion.Cman.Container entity has provided me with most of stuff I need for reporting. The Orion.Cman group of entities is probably where you found the data?

in Swis API call for current Container Status Comment by marcrobinson April 2023

I am glad that you sorted this out. When you talk about the fields not populating on the dummy node - I assume that is the external node? The data will show up in the api poller. There 'should' be a widget on your default node details that will display the data. If not, I would check the default nodes page and look for the…

in Azure API monitoring Comment by marcrobinson April 2023

There were some bugs in 2022.4.x. I would consider waiting for 2023.2, as .2 should be going from RC to GA within the next couple of weeks. It includes fixes for 2023.1 and 2023.1.1. I am planning to upgrade from 2023.1 to 2023.2 and skip 2023.1.1 at this point. There is a fix I need in 23.2 for a bug in both 23.1…

in NPM 2023.1.0 Upgrade Feedback Thread Comment by marcrobinson April 2023

marcrobinson · Observability Detective · ✭✭✭✭✭

Comments