Operational Defect Database

BugZero found this defect 46 days ago.

Hewlett Packard Enterprise | a00134183en_us

Advisory: (Revision) HPE ProLiant Gen10 Plus V2 Servers and Apollo Gen10 Plus Servers- Uncorrectable PCIe Bus Errors May Occur On Systems Configured with an AMD EPYC 7xx3-Series Processor

Last update date:

4/4/2024

Affected products:

HPE Apollo 6500 Gen10 Plus System

HPE ProLiant DL325 Gen10 Plus v2 server

HPE ProLiant DL345 Gen10 Plus server

HPE ProLiant DL365 Gen10 Plus server

HPE ProLiant DL385 Gen10 Plus v2 server

Affected releases:

No affected releases provided.

Fixed releases:

No fixed releases provided.

Description:

Info

Document Version Release Date Details 2 April 4, 2024 Updated to indicate the permanent fix, System ROM v 3.00. 1 August 17, 2023 Original document release HPE ProLiant Gen10 Plus V2 Servers and Apollo Gen10 Plus servers may experience uncorrectable PCIe bus errors. These failures are more likely to occur when high-bandwidth PCIe option cards capable of data transfer speeds greater than 100Gb/s are installed in the server. Two failure scenarios are possible which are described below. This failure is due to the AMD EPYC 7xx3-series processor and is not unique to HPE servers. Scenario 1: The Integrated Management Log (IML) will contain one or more errors similar to the following example. "Unrecoverable I/O Error has occurred. System Firmware will log additional details in a separate IML message entry if possible. Uncorrectable PCI Express Error Detected. Slot 16 (Segment 0x0, Bus 0xC8, Device 0x0, Function 0x0). Uncorrectable Error Status: 0x4000 ACTION: Update the firmware of the failing device. If the issue persists, replace the device." Failures matching this symptom will have an Uncorrectable Error Status value of 0x4000. Scenario 2: The IML will contain one or more errors similar to the following example: "Unrecoverable I/O Error has occurred. System Firmware will log additional details in a separate IML message entry if possible. Uncorrectable PCI Express Error Detected. Slot 4 (Segment 0x0, Bus 0xA3, Device 0x0, Function 0x0). Uncorrectable Error Status: 0x40000 ACTION: Update the firmware of the failing device. If the issue persists, replace the device." Failures matching this symptom will have an Uncorrectable Error Status value of 0x40000.

Scope

This failure can occur on any HPE ProLiant Gen10 Plus V2 server or HPE Apollo Gen10 Plus server in the Affected Hardware list configured with an AMD EPYC 7xx3-series processor.

Resolution

These failures are resolved with System ROM release version 3.00. Update to this System ROM version at the earliest opportunity if servers with high-bandwidth option cards are experiencing this failure. Click the following link: Hewlett Packard Enterprise Support Center Enter a product name (e.g.,ProLiant DL325 Gen10 Plus v2 " in the text search field and wait for a list of Suggested Products to display. From the Suggested Products list displayed, identify the desired product and select it. Click the "DRIVERS AND SOFTWARE" tab near the top of the page for the selected product. From the "DRIVERS AND SOFTWARE" filter menus on the page: In the "Find in Drivers and Software' search box, enter ProLiant DL325 Gen10 v2 System ROM 3.00." Locate and select the applicable version of the System ROM (or later). Click the Download button. Note: To ensure that you have selected the latest version of the firmware/driver, click the Revision History tab to check if a new version of the firmware/driver is available. For more important information, review the Release Notes tab.

Additional Resources / Links

Share:

BugZero® Risk Score

What's this?

Coming soon

Status

Unavailable

Learn More

Search:

...