Troubleshooting Procedures [ RJE/XL Node Manager's Guide ] MPE/iX 5.0 Documentation
RJE/XL Node Manager's Guide
Troubleshooting Procedures
The node management troubleshooting procedures are designed to isolate
reproducible problems to either the HP 3000 or IBM host using a bottom-up
approach. This is done by examining the physical level first; then
examining the BSC link and if necessary, the system service and any
accompanying applications. The following graph and procedures display a
data flow structure and troubleshooting flowchart for both the remote
host and local system. By necessity, the procedures require the
gathering of information from both the HP 3000 and the IBM system which
may be in geographically separate locations.
Figure 6-1. Troubleshooting Diagram
The following procedures are general purpose procedures and are only
intended to assist you in troubleshooting problems evident when using
RJE/XL. They do not provide specific answers to particular problems.
First, verify the functionality of the physical connection between the
local system and the remote system. Once the physical connection between
the HP 3000 and the IBM host (including modems) is verified, you can
start troubleshooting both the HP 3000 and the IBM host.
If the physical connection cannot be verified as being functional or if
it is proven to be nonfunctional, your Hewlett-Packard support person
should attempt to determine the nonfunctional physical component and make
an appropriate fix if possible. Once this is accomplished, you may
continue with the troubleshooting procedures. The troubleshooting
procedures contain four basic steps:
* Isolating the problem.
* Identifying the problem.
* Determining the cause.
* Resolving the problem.
Isolating the Problem
The following steps help to isolate problems:
1. If there is more than one BSC link attached to the HP 3000,
attempt to isolate the problem to the respective PSI card. If the
problem appears in all the PSI cards, the RJE/XL system service
should then be examined. If there is only one BSC link, test the
link for adequate functionality; then examine RJE/XL.
2. If the problem can be isolated to a specific PSI card (Physical
Path ID), the problem should then be isolated to a specific link
configuration. The link configuration specifies the links
configured for the node by identifying the link names. If the
problem is evident in all the configured links within the
respective node, the PSI card should be examined. If the problem
can be isolated to a specific link name, go to the next step.
3. If the problem has been isolated to a link name, verify the link
data to ensure that the link has the appropriate parameters
configured specific to the BSC protocol.
4. If the problem could not be isolated to the HP side or could
potentially be related to job submittal, verify that the job is
being successfully transmitted to and received by the host.
Identifying the Problem
It helps clarify and identify a problem if the symptoms are described in
detail. This identification or problem description involves describing
what exactly happened and why it was a problem (regardless of how obvious
it might seem). This will expedite matters by eliminating fixes that
fail to address the true problem, creating a temporary solution.
A detailed problem description is always required by HP support prior to
any assistance provided. As a result, preparing a problem description as
soon as a problem is evident will accelerate problem identification if
the customer is forced to contact HP support. When the problem
description is prepared immediately following the error or problem, the
user's ability to accurately describe the problem is enhanced
dramatically, especially if the problem proves to be difficult to
reproduce. This may be done by addressing some of the following
questions:
1. What are the specific symptoms of the problem?
2. What is the frequency of the problem? Are there any trends
evident which seem to be related to the problem (that is, trends
conspicuous only after the problem was noticed)? Were there any
known changes made to the system prior to the time that the
problem was first noticed?
3. When was the problem first observed (day and time)?
4. Did the link function correctly prior to the problem or was it
just installed? If the link was functioning at one time, what
changed since it last functioned correctly? Identify any
differences in the system when fully operational and the current
state of the system.
5. Identify those components or subsystems (if any) that communicate
with the RJE/XL or the BSC link that were not affected by the
problem.
6. What is the extent of the problem? Is there limited capability to
execute specific system functions or is the system completely
inoperable? What is the impact and severity of the error on the
system (is it a recoverable error or a fatal error)?
Determining the Cause
At this point you should have an adequate description of the problem and
should attempt to identify the possible cause of the problem. This can
be initiated by identifying those areas that could reasonably be
eliminated as the cause of the problem. Once this is done, choose the
most plausible source of the problem first, using the evidence collected
during the earlier troubleshooting steps. The following is a list of
procedures that help you determine the cause of a problem:
1. Verify the hardware installation.
(Refer to the section titled "Hardware Troubleshooting Tools" in
this chapter for more detailed information.)
a. Check cable connections and verify the cable type.
b. Check modem indicators to check modem functionality.
c. Execute PSI self test.
d. Run PSIDAD diagnostic utility through SYSDIAG.
2. Verify the software installation.
(Refer to the section titled "Installation Problems" in this
chapter for more information.)
a. Check version stamps. (Version stamps appear in the product
banner when RJE/XL is first invoked.)
b. Use NMMAINT to verify subsystem versions.
3. Verify HP configuration.
(See the section titled "Configuration Problems" in this chapter
for more information.)
a. Print an NMMGR print summary.
b. Compare NMMGR values with IBM host generation values.
c. Check the MPE XL I/O configuration (SYSGEN).
4. Verify the IBM host configuration.
(Refer to the section titled "IBM (Host) Troubleshooting Tools.")
a. Check the NCP generation (EP).
b. Check the JES generation.
c. Obtain a copy of IBM jobs (including JCL and Data).
5. Check for common problems.
(Refer to the section titled "Common Problems" in this chapter.)
a. Verify that the output is routed correctly. (Refer to the
section titled "Output Routing Problems" in this chapter.)
Resolving the Problem
The resolution of a problem must be initiated by the customer.
Hewlett-Packard support personnel should not be contacted before the
preceding troubleshooting procedures have been performed. If the problem
still has not been adequately resolved, Hewlett-Packard support should
then be contacted. Refer to the section titled "Escalation Procedures"
in this chapter for additional information on problem escalation.
The next section provides a brief summary of some of the more common
problems experienced by the user.
MPE/iX 5.0 Documentation