This discussion has been locked. The information referenced herein may be inaccurate due to age, software updates, or external references.
You can no longer post new replies to this discussion. If you have a similar question you can start a new discussion in this forum.

Alerts on the age of the most recent NCM backup?

Is there a way to create an alert on the age of the most recent NCM config backups?

we had a massive failure with our ASA today, but luckily (or so I thought) we had been getting backups of their configs every single day.

but when i went to look at the most recent one it looks like the NCM job, for whatever reason stopped taking backups on 9/1. I

just want to be notified if there is an issue with these jobs not running correctly, to avoid another disaster in the future...

  • You can configure NCM to send you an e-mail each time a backup job fails.  Your task then becomes to open the e-mail every time it's sent to you, and then troubleshoot the error.  If NCM fails to backup your ASA even once, you'd have an alert, and know you have something on your to-do list.

    From my NCM alert e-mails:

    +++++++++++++++(e-mail begins)+++++++++++++++++++++

      SolarWinds
    Network Configuration Manager

      Scheduled Job
    Notification


    ___________________________________________________________________________

      Job Engine:
    SolarWindsServer

      9/27/2016
    10:00:03 AM : Started  running-config Backup Job :
    JobDescription_(long cryptic name)

      Download Configs
    from Devices

      149 devices
    selected

      Devices : 149

      Errors  : 4

    (Device name and IP address):

    ERROR: Cannot download running-config :
    Connection Refused by (IP address of node)

    (List of failed devices' names and their IP addresses appears below, as well as successfully completed nodes.)

    +++++++++++++++++++++++(e-mailed alert ends)++++++++++++++++++++

    To set this up, or to verify you have it configured correctly for notifications of NCM backup job failures:

    pastedImage_0.png

    Find the job that does your running-config backup.

    pastedImage_0.png

    In the "Enter Notification Details" tab:

    Verify you have the circled items selected and filled in with the correct information:

    pastedImage_2.png

    In the "Add Job Specific Details" section, choose the config type and add any notifications you want:

    pastedImage_0.png

    Review the job.  Verify your nodes are selected, notification is enabled, and Click Finish.

    pastedImage_1.png

    You should now receive an e-mail every time the job runs and one or more nodes fails to backup successfully.  The only thing left is for you to follow through and troubleshoot every time a node backup fails.

    If you worry that you might not have received the information you need, you can run a report to show the status of all devices' config backup.  Just open "All Reports" and search for "backup".

    pastedImage_2.png

    Select the report you wish to view and run it.

    The far right column lists the last successful running config of every device:

    pastedImage_6.png

    It's easy to export to Excel and sort on the last column, or simply scroll down through the list and see any devices that were not backed up recently.

    pastedImage_7.png

  • *facepalm* i forgot about that setting, and over thought the solution again emoticons_silly.png thanks for the help!

  • Thank you for this post almost made my own for something similar. Looking for a way to have in the default report NCM has for nightly backups to have the last successful so I can have a better idea of new issues rather than known ones. Not sure it's possible yet and think that would take tinkering with things I shouldn't yet. Reportwish.png

  • There are some alternate options, if you don't want to rely solely on the process I provided initially.

    1.  Go to your NPM My Dashboards > Configs > Config Summary.  It should already be configured for some useful information, but you can customize the page to what you don't see present already.

    Here's mine:   The image in the upper left makes seeing what's not backed up very simple.

    pastedImage_0.png

    I've found that image so useful that I added it to NPM's front page default fiew, along with the other things I find most useful:

    But I still rely on those daily reports in my e-mail to see which devices did NOT backup last night.

    2.  There are at least two Reports available that can tell you the status of any device's backup for Startup-config and Running-config:

    pastedImage_4.png

    When you run them, you'll see that last time each device was successfully backed up:

    pastedImage_5.png

    Better still, you can use this report in a schedule, adjust it to only show those devices that haven't been backed up in X Days, e-mail it regularly / automatically to anyone you need on a scheduled basis, and even export it to Excel or PDF.  Once I have it in Excel I sort by the last Running Backup date column and see if anything's out of line.

    3.  Make your own reports with NPM's Web-based Report Builder, or with the on-server Report-Writer application.

    Here's hoping one of these is right for you!

  • I wasn't able to find that report until your post.(Was trying to go in and do the query digging in the database). I actually copied it and edited that report so the oldest dates are at the top. That way it is at the top of the report when it hits my email. The config summary page stuff is awesome too. Mine is looking much better with your additions. Here is the screenshots showing what I meant by changing the order:

    SortedPrior.png

    Then the setting change:

    OrderByascending.png

    Thank you again for the assistance, it is very much appreciated. Should have come to thwack first would have saved me sometime.

  • I'm glad I was able to help.  Thanks for the response, and for showing your process and methods.  Others will benefit from your sharing.

    Swift Packets!

    Rick Schroeder

  • Hi, I've been trying to work out a way of setting up an alert so that instead of sending an email to the team, it sends an SNMP trap to our Netcool system, which in terms auto generates an incident ticket.  I've spent a few hours trying to work find the correct fields to use for the tigger conditions, and also the alert action text.

    Cheers,


    Phil

  • Found this post in the hunt for a good solution on how to create normal orion alerts when backup in NCM fails. I was not satisfied with the solution with reports, wanted a proper alert. So I created on.

    Below script looks at when last backup job was done, taking into account if the config was unchanged and backup file was not saved. It also checks if the node is up. In the query you can change how old the backup may be. For me it's ok if it misses one backup job but not two.

    So, create a custom SWQL alert with "Node" as target:

    Seashore_0-1592380670725.png

    Paste in the below code: (observe that some code is "hard coded" in the alert and cant be changed)

    INNER JOIN Cirrus.Nodes AS NCM ON Nodes.Nodeid=NCM.CoreNodeID
    INNER JOIN 
    (
    SELECT 
        CA.NodeID AS NodeID, 
        MAX(CA.AttemptedDownloadTime) as LastBackup
    FROM Cirrus.ConfigArchive AS CA
    GROUP BY CA.NodeID
    HAVING MAX(CA.AttemptedDownloadTime)<ADDDAY(-2,GETDATE()) -- Adjust how old backups that are ok
    ) AS A ON NCM.NodeID=A.Nodeid
    
    WHERE NCM.Status=1 

    Good luck!