Hi,
We have IPAM running as a module within the NPM.
IPAM is failing with SQL errors. Is it possible to re-configure the IPAM to access a different brand new database ?
Currently, the IPAM info tables are within the NPM database.
Best Regards,
A
You'd have to break out IPAM I believe onto it's own server. There wouldn't be a way to configure it to another SQL server while still installed on the NPM server.
Can't I uninstall it and install it on the same server and point it to a new database. I don't mind redoing the whole IPAM setup again.
Regards,
Athreya
Also, if I install it on a different server, can I make it the link available from the NPM admin page ?
Thanks and Regards,
No on both. If you uinstall and reinstall it on the same server, it's going to simply add as a module. If you try on a separate server, it will be it's own app separate from the SolarWinds website. There's no way to integrate it without having it installed as a module using the same database.
The IPAM 4.2 is failing to after 67%, with the error,
Command FailedAn unexpected error has occurred when processing DHCP scopes results for server '1703'
Solarwinds support is pointing it Storage latency. the Database is on SSD and so far no issues we have seen
So, I thought it will be possible to move to another Database.
Any other ideas I can have ?
Also, it is after an upgrade, I am clueless what to do with it.
So why are they pointing to Storage latency? Did they give you a reason, or just that it has to be storage latency? How many scopes are on that server? What other modules do you have installed other than IPAM?
there is nothing really pointing to the latency.
from the IPM business layer log, this was seen,
2014-10-21 05:22:35,390 [5] ERROR SolarWinds.IPAM.ScanningEngine.JobProcessor.JobProcessorBase`1 (null) - An unexpected error has occurred when processing scan results for subnetid '3409'
System.Data.SqlClient.SqlException (0x80131904): Timeout expired. The timeout period elapsed prior to completion of the operation or the server is not responding.
The statement has been terminated.
There only NPM 11.0.1 ,NCM 7.3.2 and IPAM 4.2.
However the IPAM was working and some intermittent issues used to get reported, that used to vanish. A month ago it stopped working completely and then it was upgraded to fix. Since then it stopped working altogether.
OK that would point to a connection timeout. I'm assuming the database and application servers are in the same location. An old schole trick that used to work on the SolarWinds server was finding the SWNetPerfMon.db file, opening it in notepad and changing the software's connection and command timeouts from 20 and 90 to 600. From there, stop the services, save the change, and restart the services. This will at least make sure the application is on the same timeout as SQL is. This might help. How many scopes are on that server? Are your subnet scans working ok?
there are 150 server/scopes on the IPAM, 61805 IPs.
When Select a DHCP server and select Edit and then test, it is successful.
However, when I select the Same server and SCAN. fails with the error,
Sorry,
Pops up this
How long after you perform the scan? I'd be more apt to thinking it's something wrong with the install. You may want to do the uninstall/reinstall to see if that resolves it. All you information will still be in the database. If things still fail, I'd recommend requesting escalation from support.
it scans for almost 5 mins
First it stops at 33%, then it moves 67% immediately and then a long wait, then the error.
ERROR SolarWinds.IPAM.ScanningEngine.JobProcessor.JobProcessorBase`1 (null) - Error occurred when processing DHCP results - serverGroupId: 2497, serverNodeId: 4022
at System.Data.SqlClient.SqlConnection.OnError(SqlException exception, Boolean breakConnection)
at System.Data.SqlClient.SqlInternalConnection.OnError(SqlException exception, Boolean breakConnection)
at System.Data.SqlClient.TdsParser.ThrowExceptionAndWarning()
at System.Data.SqlClient.TdsParser.Run(RunBehavior runBehavior, SqlCommand cmdHandler, SqlDataReader dataStream, BulkCopySimpleResultSet bulkCopyHandler, TdsParserStateObject stateObj)
at System.Data.SqlClient.SqlCommand.RunExecuteNonQueryTds(String methodName, Boolean async)
at System.Data.SqlClient.SqlCommand.InternalExecuteNonQuery(DbAsyncResult result, String methodName, Boolean sendToPipe)
at System.Data.SqlClient.SqlCommand.ExecuteNonQuery()
at SolarWinds.IPAM.Storage.DAL.ConflictDAL.PurgeConflictData()
at SolarWinds.IPAM.Storage.DAL.ConflictDAL.StoreDhcpConflicts(Int32 scopeId, List`1 conflicts, Boolean updateConflictData)
at SolarWinds.IPAM.ScanMerge.DhcpDiscoveryMerger.UpdateScope(Int32 serverId, Int32 scopeId, Int32 groupId, String scopeAddress, Boolean includingGroupTable, DhcpServerType serverType)
at SolarWinds.IPAM.ScanMerge.DhcpDiscoveryMerger.UpdateScopesState(Int32 serverId, Int32 groupId, Boolean addAutomaticallyNewScopes)
at SolarWinds.IPAM.ScanningEngine.JobProcessor.JobProcessorOneTimeDhcpServerSync.SubmitResults(UIJob job)
ClientConnectionId:e4634d91-5859-4b86-90d4-0bcf39d4f0ad
This is what I see from the Logs.
what ever we find the root cause, thanks lot to you for spending your valuable time on this.
I really appreciate it.
Regards
All I can recommend is what I did previously, reinstall then request escalation if there's still a failure.
Sure, I am doing a Storage firmware upgrade this weekend. I will try out
1. The timeout settings change
2. Reinstall steps
I surely update the outcome.
We had to escalate it. It turned out to be an issue with the database, where the old data was not getting purged.
Support managed to provide us a stored Procedure which is doing that now.
All is working now. Thanks a lot for your inputs
Glad to help out where I could and thank you for sharing what was done to get this corrected.