HP 3000 Manuals

To Identify Problems [ NS3000/iX Operations & Maintenance Reference Manual ] MPE/iX 5.0 Documentation


NS3000/iX Operations & Maintenance Reference Manual

To Identify Problems 

The usual method of identifying problems is to characterize the situation
in which the problem occurs and then investigate which of the possible
causes are actually responsible for the problem.  Finding the cause is
often sufficient to suggest the resolution of the problem.  For example,
assume that the problem is characterized as "the user is unable to open a
line with the DSLINE command." A possible cause is that the user entered
a command using incorrect syntax.  You would resolve the problem by
correcting the command and reissuing it.  However, if the syntax was
correct, you would have to look for another possible cause, such as an
inactive link or a failure of the remote node.

Thus, in most cases you start with the characterization of the
problem and investigate the possible causes.  The difficult part of
troubleshooting is to identify the actual cause of the problem.  Once you
know the actual cause, you can take the appropriate action to resolve the
problem.

To Characterize the Problem 

It is important to ask questions when you are trying to characterize a
problem.  Start with global questions and gradually get more specific.
Depending on the response, you ask another series of question, until you
have enough information to understand exactly what happened.

Key questions to ask are as follows:

   1.  Was an error message generated?  Use the NS3000/iX Error Messages 
       Reference Manual to look up the cause of the error and take the
       action suggested.  If this does not resolve the problem, continue
       with the next question.

   2.  Is the problem isolated to one user or program?  If so, continue
       to the next question.  If more than one user is involved, proceed
       to question 6.

   3.  Did the user perform the operation correctly?  Was syntax correct?
       Does the user have the correct logon and authority to use the
       command or service?  Correct any problems found.  If the operation
       was correct, continue with the next question.

   4.  Did the problem occur while the user was running a program?  Were
       there program errors?  If so, investigate and correct the program
       errors.  Otherwise, continue with the next question.

   5.  Did the problem occur while attempting to open a line or transmit
       data?  If so, investigate the connection between this system and
       the remote system.

   6.  If more than one user is involved, does the problem affect all
       users?  The entire node?  If so, has anything changed recently?
       Some possibilities are:

          *   New software and hardware installation.

          *   Same hardware but changes to the software.  Has the
              configuration file been modified?  Has the MPE/iX
              configuration been changed?

          *   Same software but changes to the hardware.

   7.  Do you suspect hardware or software?

       It is often difficult to determine whether the problem is hardware
       or software related.  Symptoms that mean you should suspect the
       hardware are:

          *   Bad LANIC or PSI dumps.

          *   Link level errors, either returned to the user or logged to
              the console.  This includes CI errors, NMERR errors, power
              fails, and link shutdowns.

          *   Lost data - data is sent but not received at the link
              destination.  (This could also be caused by a software
              problem.)

       Symptoms that mean you should suspect the software are:

          *   Logging messages at the console.

          *   Network Services errors returned to users or programs.

          *   MPE/iX file system (FSERR) or command interface (CIERR)
              errors (except "Remote Not Responding" errors).

          *   Data corruption.

          *   Terminal hangs.

          *   Intermittent errors.

          *   Network-wide problems.

To Identify the Cause of Problems 

The type of investigation that you use to identify the possible causes of
a problem depends on whether the problem affects one user or an
individual situation, or if the problem is node-wide.  Once you have the
answers to the questions listed previously, use the flowchart in figure
4-1 as a guide and see chapter 5 for a problem resolution strategy.

[]
Figure 4-1. Characterizing the Problem


MPE/iX 5.0 Documentation