Hi All,
I do have support cases open with both VMware and HP, but was just wondering if anyone has experienced these PSOD and knows the permanent fix? I have seen two in the past 12 hours and since there are 34 identical hosts I'm concerned about operational stability.
VMware have confirmed that it's hardware related, so perhaps a BIOS / firmware update? There is nothing in any of the hardware logs to indicate any failures or issues.
Here is the PSOD error;
---------------------------------------------------------------------------------------------------------
# vmkernel-core.1_10MB_hash=221ac8fe6d68676711240c146f892ea9
#0 PanicSaveRegs (world=0x412240825000) at bora/vmkernel/main/panic.c:114
#1 0x000041803b66d2e6 in PanicvPanicInt (fullFrameIn=0x0, storedBacktrace=0x0, fmt=0x41803b97b048 "LINT1 motherboard interrupt. This is a hardware problem; please contact your hardware vendor.", argsIn=0x412240807b98, flags=<value optimized out>) at bora/vmkernel/main/panic.c:777
#2 0x000041803b66db0e in Panic_vPanicCustom (fmt=0x41803b97b048 "LINT1 motherboard interrupt. This is a hardware problem; please contact your hardware vendor.", args=0x412240807b98, flags=1) at bora/vmkernel/main/panic.c:508
-----------------------------------------------------------------------------------------------------------
The hosts are all HP DL380p Gen8 servers running ESXi 5.0 update 3;
~ # esxcli system version get
Product: VMware ESXi
Version: 5.0.0
Build: Releasebuild-1851670
Update: 3
Processor information;
My thoughts at the moment are perhaps a CPU microcode issue that is causing this, so I'm off to scour the HP site for customer advisories / firmware updates etc.
Any pointers in the right direction will be gladly received.
Cheers,
Jon
PS: see you in Barcelona for VMworld
Hp indeed published an Advisory c04327904 related to an Intel Microcode issue for the v2 CPUs. I can't tell you whether this is the solution to the issue you have, but definitely worth checking.
André
iLO4?
Have a look here:
http://h20564.www2.hp.com/portal/site/hpsc/public/kb/docDisplay/?docId=emr_na-c04332584
Thanks Andre, I've already applied that BIOS update (previously) so probably something else (or it's still broken).
Thanks FritzBrause - HP have come back to me with the following statement;
The LINT1 PSOD issue is known to us and a fix has been released. Kindly update the iLO firmware to 1.51 or later to resolve the issue. Please do power cycle to the server after the update. Kindly let me know if there are further queries.
Kindly down load cp023644.exe and extract (only extract, no install) it on a windows pc. Use the bin file to upload the firmware in the iLO 4 > administration > firmware option.
Intermittent Non-Maskable Interrupt (NMI) Events May Occur on HP ProLiant Gen8 Servers running HP Integrated Lights-Out 4 Firmware Versions 1.30, 1.32, 1.40 and 1.50.
I'm going to assume this is the fix, and will update this thread if anything changes.
Cheers,
Jon
Yes,
I'm also having this problem with my HP Blades 465c G8. So is there any specific steps that I need to do before applying the firmware updates to the iLO ?
or is there any other prerequistes before applying this firmware update to the effected HP Blades server ?