Hi,
I work for an MSP and we put all our customer systems in this SolarWinds environment to keep track of (application) performance and to back up all network devices. Right now it feels like we're hitting a wall with how we should set things up and scale the environment further, so I'm looking for some advice from fellow Thwackers.
Currently I'm working with an environment of about 9500 nodes. They're spread across 6 polling engines, and right now all nodes are configured to be able to connect to all polling engines.
The environment has NPM, NCM and SAM (all with unlimited licensing), and all polling engines are set up with High Availability.
But I'm not sure how we should continue with the environment in terms of scaling and infrastructure. Would it be better to switch things up and let certain devices connect to only 1 poller instead of all pollers?
We also have about 2500 agents which we need to run with server-initiated communication, since we connect to all devices through NAT. But that also seems to lead to disconnected agents when the target server reboots.
Kinda looking for fellow Thwackers who have experience with the same things I'm facing and have successfully found a way to make things work without too much hassle.