Operational Defect Database

BugZero found this defect 70 days ago.

Hewlett Packard Enterprise | a00138215en_us

Advisory: HPE ProLiant DL380/DL385/DL345 Gen10 Plus Servers or ProLiant DL385 Gen10 Plus v2 Server - Systems Configured With Multiple GPUs May Encounter Server Critical Fault on AUX/MAIN E-fuse When Powering On

Last update date:

3/14/2024

Affected products:

HPE ProLiant DL345 Gen10 Plus server

HPE ProLiant DL380 Gen10 Plus server

HPE ProLiant DL385 Gen10 Plus server

HPE ProLiant DL385 Gen10 Plus v2 server

Affected releases:

No affected releases provided.

Fixed releases:

No fixed releases provided.

Description:

Info

Any of the HPE ProLiant Gen10 Plus servers mentioned in the Scope section below may encounter a Server Critical Fault when powering on after conducting multiple "Soft Off" events. This is due to servers configured with multiple GPUs conducting several "Soft Off" events causing electrical components to enter a degraded state and preventing the server from powering on. The following error message may be registered in the IML: Server Critical Fault (Service Information: Power On Fault, System Board, AUX/Main EFUSE (10h))

Scope

Any of the following servers configured with 2*Dual Wide GPUs (300w or greater) or configured with 4*Single Wide GPUs (150w or greater): HPE ProLiant DL345 Gen10 Plus server HPE ProLiant DL380 Gen10 Plus server HPE ProLiant DL385 Gen10 Plus server HPE ProLiant DL385 Gen10 Plus v2 server

Resolution

This issue is solved by updating the PIC firmware to version 1.1.4, the System Programmable Logic Device (CPLD) firmware and changing the BIOS/Platform Configuration (RBSU) > Automatic Power-On settings to "Always Power On". The PIC - Advanced Power Capping Microcontroller Firmware for HPE Gen10 and Gen10 Plus Server firmware version 1.1.4 is available here . CPLD firmware - Contact HPE Support and reference Doc ID a00138183en_us, to obtain the firmware and installation instructions. ProLiant DL380 Gen10 Plus: CPLD version v1717 ProLiant DL385 Gen10 Plus and ProLiant DL385 Gen10 Plus v2: CPLD version v3131. ProLiant DL345 Gen10 Plus: CPLD version v1E1E. Flash sequence: Update first the CPLD firmware. VERY IMPORTANT: When installing the CPLD firmware, monitor the update status and make sure not to disrupt AC power before the CPLD flashing is complete. Once complete, CPLD will force a power cycle, then a reboot is needed, which can be done remotely. Conduct an AC power cycle. Flash the Power PIC firmware to version 1.1.4. Reboot the system. After the firmware update, change the Automatic Power-On setting to "Always Power ON" under System Configuration > BIOS/Platform Configuration (RBSU) > System Options > Server Availability > Automatic Power-On. If this is set to "Always Power Off" or "Restore Last Power State", the system may not power on by itself automatically. Note: When the firmware updates are completed, the following false error may be observed when the fix is executed to prevent an actual failure. This false error message can be ignored. Server Critical Fault (Service Information: Power On Fault, System Board, AUX/Main EFUSE (40h)) RECEIVE PROACTIVE UPDATES : Receive support alerts (such as Customer Advisories), as well as updates on drivers, software, firmware, and customer replaceable components, proactively in your e-mail through HPE Support Alerts. Sign up for Support Alerts at the following URL: HPE Email Preference Center. NAVIGATION TIP: For hints on navigating HPE.com to locate the latest drivers, patches and other support software downloads, refer to the Navigation Tips document. SEARCH TIP: For hints on locating similar documents on HPE.com, refer to the Search Tips documen

Additional Resources / Links

Share:

BugZero® Risk Score

What's this?

Coming soon

Status

Unavailable

Learn More

Search:

...