Phase 1: Investigation

 

 

 

Investigation is the first phase in the troubleshooting process. It involves the following steps:

 

  1. Write a problem statement.
  2. Write a problem description.
  3. Identify differences and changes.

Write a Problem Statement

The first part of the Investigation phase is to write a problem statement. The problem statement must be broad enough to describe the problem, but narrow enough to focus the investigation. It should not contain value judgements and should be a factual answer to the question "What is wrong?"1

 

Good Examples 

 

Why are these good examples? They do NOT include a value judgement, and they do NOT guess at possible causes. They do not list any observable symptoms beyond the precise statement of the problem.

 

Bad Examples 

 

Why are these bad examples? These examples include a guess at a possible cause, and they include a list of immediately observable symptom beyond simply stating the problem.

 


Write a Problem Description

The next part of the Investigation phase is to write a problem description. This is where you describe the problem in more detail by collecting as many symptoms as possible. Gather as much data as you can, such as any error messages, core dumps, service outages, and contrasting descriptions of what still works. As much as possible, identify the time of the incident and the circumstance.1 

 

When documenting the problem description, do not underestimate or overestimate the magnitude of the problem. Be sure to stick to the facts in the data that you gathered.

 

Good Example

 

This is a good problem description because it includes information about symptoms, and contrasting information about what still works on the car. It also identifies the time that the problem occurred.

 

Bad Example 

 

This is a bad example because it is too vague; it does not contain symptom information or information on what still works. It also contains a statement that overestimates the magnitude of the problem, with nothing to back the hypothesis.

 


Identify Differences and Changes

Identify differences between the faulted system and any similar working systems. Has this problem always existed? Has the system ever worked properly?

 

Also, identify any recent changes to the system.1 For example, was any preventive maintenance performed recently? Was any part of the system upgraded or modified recently? 

 

Examples

 


What's Next?

Phase 2: Analysis >

 


References

  1. Cromar, Scott. "Troubleshooting Methodology," Princeton University Enterprise Servers and Storage, 2007; available from http://www.princeton.edu/~unix/Solaris/troubleshoot/methodology.html; accessed on January 4, 2011.

 

 
Notes

Tim, I removed  the "about difficulties caused by the crash" text from the Problem Statement section. I agree that it was confusing. Good catch.

Note by Tom Sajbel on 01/30/2011 01:30 PM

Tim, I added information to the Identify Differences and Changes section on determining if the problem has always existed.

Note by Tom Sajbel on 01/30/2011 01:28 PM

Identify Differences and Changes

Herbst:  Along with identifying any recent changes, it is important to determine whether or not the problem has always existed (ie...has it ever worked?)

Note by Tim Herbst on 01/27/2011 02:49 PM

Why is this a good example? Note that these example problem statement do not include any value judgments about difficulties caused by the crash, they do not include a guess at a possible cause, and they do not include a list of immediately observable symptoms.

 

Herbst:  I would strike "about difficulties caused by the crash" as it only pertains to one of the examples above it and thus makes the statement confusing.

Note by Tim Herbst on 01/27/2011 02:40 PM


This page was generated by the BrainKeeper Enterprise Wiki, © 2018