lcsw2013 ✭✭✭✭✭

Comments

  • I would say change *I want to alert on* to "Node" instead of "Volume", as this is more of a node-related alert than a volume-related one. The condition, at least to me, appears to be sound. And you have it set to be active for at least 10 minutes before firing an alert, which normally should be enough to avoid…
  • Initially we had 2 sockets and 8 cores: 2 CPUs with 4 cores each, for a total of 8 cores. Performance in this configuration was unusable, and CPU was pegged at 100% nearly all the time. We spent more time rebooting the VM than anything else, just to gain console access. Then we switched to 4…
  • The only way I’m aware of is to edit each group and look to see if individual nodes were added or if there is a dynamic query to have SolarWinds automatically locate nodes for the group. In our case, with my previous employer, I looked at system logs, I forget the exact one, and in there it showed a super expansive query that…
  • You have no idea how much this is helping us! Big thank you! I'll continue to keep this opened a little longer just in case more questions come in. We appreciate you helping us out!
  • My team asked me to send you a big thanks. Let me know if you have the additional information I requested. If not, I think we can use the initial information sent. I opened this thread because we urgently needed this help.
  • That may very well be the fix. That's the reason why I suggested support! Let us know if that ends up being the fix!
  • Let me bring on adatole or aLTeReGo to help answer this specific question. I'm not 100% sure and wanted to make sure you're given a correct answer on this. I myself had similar experiences and that's why I can't answer this specific question myself. Hopefully we'll have an answer from two amazing experts, gurus in…
  • These settings should be in the wizard. When you use the wizard to set up a discovery you should see them as part of the setup. If they happen not to be there, they will be on the screen after the discovery is finished. The screen before importing allows you to check if you want to import duplicates or not. I know that by…
  • SolarWinds device groups. We had set up groups within SolarWinds to show our different datacenters and device groups. Instead of adding all network devices manually into the network group, for example, there was a dynamic query to pull that information up. My advice is to look at the Information Service v3 logs in the…
  • Sir, I understand and thank you for your help. I really do. Disappointment stems from distrust and people stepping over each other here and rejecting the truth. But that's a whole different story. My directors told me they called 2 consultancy companies at least twice without answer. They left a few messages but by the…
  • In that case I don't see why the out-of-the-box alerts wouldn't work for you. You might want to duplicate and edit them so you can make minor modifications and have them alert properly. By default I believe the out-of-the-box alerts should be able to work on the remaining items you've listed. Have…
  • The table actually had nearly 20 million rows and was over 3 GB in size. That is why it was hanging. Every time anything needed to be done to this table it had to go through so much data that it would time out or run into out-of-memory exceptions. It was very inefficient because it was not being…
  • Sure! our case number is 00196158. Let them know your issue has similarities to ours. This should help you out.
  • Exactly!! It took me reading this post twice to realize... NICs have MAC addresses. Systems have serial numbers. But either way, asset inventory should be the best place to get either of these data points. Wouldn't you say?
  • Rich, the queries helped greatly. We've removed all dups from the system and are better organized now. As for the stale polling, we've had help from loop 1 to get a report that uses the CPU resource: if it hasn't been updated in x number of days, the node shows up as a potential polling failure in our report. It needs…
  • Marc, the issue presents itself as slowness. Database maintenance that takes way longer than it should, among other performance-related problems. Messages hung up in RabbitMQ. Deadlocks. Etc. Plenty of operation timeout error messages in nearly every log. Commit errors and retry errors. I mean there is more than enough…
  • @"bobmarley" What you have is what I'm trying to accomplish haha. I know it's possible. But proving it and getting approvals etc. becomes an act of Congress when the metrics do not clearly spell out trouble. More often I get the numbers thrown at me and people saying, you see, there are no problems. Not approved. Typically these…
  • @"mesverrum" Very interesting! But I have to ask what is GCP? Not sure I follow. Thanks!
  • Hey Tony, thank you! I have admin privileges on nearly all my systems, including SolarWinds. I just don't have admin access to the database. That's why I was asking about getting reports through SolarWinds itself, if it were an option. We have DPA. And we have SAM as well, both collecting DB information. It's one of…
  • That is interesting! It appears the macro parser is parsing the information a bit differently after the upgrade. I would say in this case open a ticket with support and give them this thwack post. They may have to investigate this one a bit further in the macro parser logs to see why the information is being handled…
  • Serena, can you point me towards some further guides to help me learn more about the newer capabilities, please? Right now I'm in a data gathering and research mode. I just took over a brand new environment. And right now I'm building diagrams and design plans to improve this aging old handicapped environment. And I'm more used…
  • Exactly what I was about to mention. I've seen a few posts around the forum stating something similar to this, which is why I brought a few devs into the topic to shed some light and let them know about it.
  • I second this. Sometimes the SDF files for these services can become corrupt or have something going on with them, and uninstalling/reinstalling clears this out. Normally it does the trick and restores normal monitoring.
  • Normally I believe in that same screen from the screenshot above there should also be a little icon to view the report from the last run. Basically it's just a log that shows what happened during that run. In my experience if the log is updated and has information normally it indicates the job completed. If the log does…
  • In my case it turned out to be a corrupted IPAM view. I had bad data in my database as well. So during a change control I was able to do some maintenance. I called SolarWinds and they helped me identify the bad view. Once all this was removed I ran a reindex of my database and my problem was fixed. I still do have a bit of…
  • No, my issue is different. In my environment, Information Service v3 will slowly creep up on CPU and memory usage, then just stay at high usage and never release. I don't see any logs about the service restarting or acting in any weird way. It stays on. It functions but it just kills my resources. We…
  • The out-of-the-box node reboot alert only looks at the sysUpTime counter. If the value is lower than at the previous poll, it triggers as a reboot. The problem with that is that restarting SNMP clears that counter, I believe, and there are other things too that can clear it, causing SolarWinds to falsely trigger a reboot alert. We have…
  • Great answer! And I don't know how you did it, but you guessed a lot of things right. I inherited a mess. No SOPs, no documentation at all. Everything thrown all over the place. No documentation of what is used and what is not and what was done with the environment. Call it the perfect storm if you will. lol. I think since…
  • Our environment is a one-off special case problem child. We were close to wiping these servers clean and starting from ground zero, but decided we wanted to chase the root cause instead, to prevent the problem from ever happening again. The only reason I posted this to the forum is because I was curious in…
  • This is something I was looking into. My thought was that something was enabled that shouldn't be or there are old resources that need to be turned off. But again I wasn't sure.
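
For anyone following the node reboot comment above, here is a rough sketch of the "uptime lower than at the previous poll" check it describes. This is only an illustration of that comparison, not SolarWinds' actual alert logic; the function name and the sample poll values are made up for the example.

```python
def looks_like_reboot(previous_ticks: int, current_ticks: int) -> bool:
    """Return True when the uptime counter went backwards between two polls.

    Hypothetical helper: it only compares two sysUpTime readings, so anything
    that resets the counter (such as an SNMP restart, as the comment suggests)
    would be flagged exactly like a real reboot.
    """
    return current_ticks < previous_ticks


# Made-up sysUpTime readings from five consecutive polls.
polls = [1000, 61000, 121000, 500, 60500]

# Compare each poll with the one before it.
flags = [looks_like_reboot(prev, curr) for prev, curr in zip(polls, polls[1:])]
print(flags)
```

The counter drop between the third and fourth readings is the only one flagged, and the check cannot tell whether the device rebooted or only the counter was cleared, which is where the false alerts come from.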