Troubleshoot Hardware Faults Using Oracle ILOM Web Interface

Use this procedure to troubleshoot hardware faults using the Oracle ILOM web interface and, if necessary, prepare the server for service. This procedure uses the basic troubleshooting steps described in Diagnosing Server Component Hardware Faults.

Note:

This procedure provides one basic approach to troubleshooting hardware faults. It uses the Oracle ILOM web interface. However, you can perform the procedure using the Oracle ILOM command-line interface (CLI). For more information about the Oracle ILOM web interface and CLI, refer to Oracle ILOM Documentation.
  1. Log in to the server SP Oracle ILOM web interface.

    Open a browser and direct it using the IP address of the server SP. On the Login screen, enter a user name (with administrator privileges) and password. The Summary Information page appears. The Status section of the Summary Information page provides information about the server subsystems, including:

    • Processors
    • Memory
    • Power
    • Cooling
    • Storage
    • Networking
    • PCI_Devices
    • Firmware
  2. In the Status section of the Oracle ILOM Summary Information page, identify the server subsystem that requires service.
    An image showing Oracle ILOM web interface.

    For example, if a hardware component in the subsystem is in a fault state, the Status column notes the status as Service Required.

  3. To identify the faulty component, click the component in the Status section.

    The Oracle ILOM page showing the faulty component appears.

  4. To get more information, click the Open Problems link.

    The Open Problems page provides detailed information, such as the time the event occurred, the component and subsystem name, and a description of the issue. It also includes a link to an Oracle Knowledge Base article.

    Tip:

    The System Log provides a chronological list of all the system events and faults that occurred since the log was last reset and includes additional information, such as severity levels and error counts. The System Log also includes information on the devices not reported in the Status section. To access the System Log, in the left panel, click System Log.
  5. Before going to the server, review Product Information and Known Issues for any late-breaking information about the server and for information related to the issue or the component. Review up-to-date information about server hardware-related known issues.

    Refer to Oracle AMD-Based Cloud Servers Product Notes.

  6. Prepare the server for service.

    After servicing the component, you might need to clear the fault in Oracle ILOM. For more information, refer to the service procedure for the component. For details, refer to Oracle Integrated Lights Out Manager (ILOM) documentation at Oracle ILOM Documentation.

  7. Service the component.

    To service replaceable components, see the removal, installation, and replacement procedures in this document.

  8. Return the Server to Operation.