I wrote about visibility via monitoring being the first step in successful IT change management. And as an IT Pro’s career progresses, they will encounter many breaks and failures in their IT infrastructure. The only guarantee in IT is that something will break and IT pros have to be able to fix it ASAP. Experience and a solid process framework, coupled with visibility are key to successfully troubleshooting IT issues.
Troubleshooting is a skill that consists of two parts: root-cause analysis and taking corrective measures. In the past, troubleshooting would include:
Fast forward to today, and troubleshooting is all about collaboration i.e. someone has probably already ran into this issue and has blogged about it or shared the knowledge on an IT community website like thwack. So troubleshooting becomes as simple as Google-ing it or Bing – winner, winner, chicken dinner.
But what if you are the first to encounter a problem? Then, you’ll need a framework to troubleshoot issues. If you don’t have one, here’s a template framework that you can leverage. And within that framework root-cause analysis begins with what is happening (a real-time dashboard) and what has happened (logs). Once the problem is identified and cause-effect is understood, the prescriptive measures can be determined, tested, verified as viable fixes, and deployed into production. Troubleshooting success consists of the efficiency and effectiveness of the resolution.
In closing, troubleshooting is a constantly evolving skill for an IT pro. When you think you’ve mastered your environment, new technology always intervenes. So learn the art of troubleshooting like your career depends on it.
Let me know what you think in the comment section below. Also feel free to share your troubleshooting process or tips below.