VMware Cloud Community
hopit
Contributor
Contributor

HP Proliant DL360 Gen9 + VMware ESXi, 6.5.0, 8294253 = Host hardware fan status error reading 0 RPM for empty fan bays

This issue does not show up on VMware ESXi, 6.5.0, 7967591 (U1). Once I upgraded to U2 the erroneous issue shows up.

iLO shows Fan 1 & 2 are NOT installed

2018-05-10_10-09-01.png

host (and vcenter) shows them as installed with 0 RPM reading

2018-05-10_10-13-48.png

2018-05-10_10-11-30.png

HP support keeps pointing me to the URL (Using HPE Custom ESXi Images to Install ESXi on HPE ProLiant Servers | HPE ) which I reassured them I used to update 6.5 U1 to 6.5 U2. Reset logs/sensors doesnt work as well.

I checked HCL and the server build is most definitely on it but i guess thats just a grey area until its verified.

Oh happy days.

Anyone having the same issue?

55 Replies
A13x
Hot Shot
Hot Shot

still here in VMware ESXi, 6.7.0, 10176752

Reply
0 Kudos
MarcHuppert
Enthusiast
Enthusiast

The problem is fixed with 6.7 Update 1.

I have already updated my systems. It runs now 10302608

VCDX #181, VSP, VTSP, VCA, VCP-DCV(2+3+4+5+6+6.5+6.7+2019), VCP-DT, VCP-NV, VCAP(DCA4+5+DCD4+5), VCIX-NV, VCIX-DCV, VCI, vExpert, vEpxert NSX, vExpert VSAN and VCDX
A13x
Hot Shot
Hot Shot

Lucky you, we are stuck waiting on backup solutions etc to support 6.7 U1

Reply
0 Kudos
vargarl
Contributor
Contributor

agree Smiley Happy

Reply
0 Kudos
PSo3G
Contributor
Contributor

Is this issue going to be resolved in 6.5U2 as well? we can't update our hosts to 6.7 as they are not supported.

Reply
0 Kudos
VSPH
Contributor
Contributor

We can't update ESXI to 6.7 either, because we still have old HP Gen8 servers in the cluster. I have now updated the vCenter to 6.7.0.20000 and the hosts to ESXi 6.5.0 10719125. The problem persists. This is not a satisfactory way to deal with customers...

Reply
0 Kudos
schurl850
Contributor
Contributor

i have updated one of our server, which had the problem, to the newest version of VMware ESXi, 6.5.0, 10884925, and the error is gone noe. So it seems Vmware has fixed the defect.

Rietumu
Contributor
Contributor

Installing latest patches solve Hardware Status alert on  HP Proliant DL360 Gen9 + VMware ESXi, 6.5.0, 10884925

Reply
0 Kudos
rschneiderman57
Contributor
Contributor

I updated my hosts as well, most of the sensors have recovered/cleared except for 3 (Running on Dell R630):

Add-in Card 3 SD2 0 --- 0.11.3.245: unknown

Add-in Card 3 SD1 0 --- 0.11.3.244: unknown

System Board 1 Power Optimized 0 --- 0.7.1.118: unknown

Reply
0 Kudos
GaelV
Enthusiast
Enthusiast

Still the error on HP ML350 Gen10 VMware ESXi, 6.5.0, 10884925 ...

Reply
0 Kudos
VSPH
Contributor
Contributor

Now, after 8 months, the error is finally gone with esxi 6.5.0, 10884925 (on HP DL380p Gen 8). I found this info in the release notes:

PR 2088424: You might see false hardware health alarms due to disabled or idle Intelligent Platform Management Interface (IPMI) sensors

This issue is resolved in this release. This fix filters out such alarms.

Thanks for this hard piece of work VMWare Smiley Wink

Reply
0 Kudos
Dorini
Contributor
Contributor

Hi,

I´ve got the same issue but with all our ML110 Gen10 with ESXi 6.7 latest Hotfix ( 6.7.0 13981272 ) installed with "HPE-ESXi-6.7.0-Update2-iso-Gen9plus-670.U2.10.4.1.8".

iLO says everything is fine but for all 12 Servers vCenter says Alarm: Host hardware fan status  and the Hardware Healt shows Warning for Cooling Unit 1 Fans with Event Log "Assert + Fan Redundancy Lost" ...

Is this the same issue?

Please help ...

Dorini

Reply
0 Kudos
M0n1t0r
Contributor
Contributor

I was just looking up the same issue. - I have exactly the same problem and have ML110 Gen10's running ESXi 6.7 - 13006603

Can't see any issue in ILO or OneView just on the Host & vCenter "Host hardware fan status"  "Cooling Unit 1 Fans - Warning - 0 "


Does anyone know a way to fix this? - ILO version 1.43 May 23 2019 (latest)

Regards

Reece

Reply
0 Kudos
GaelV
Enthusiast
Enthusiast

Hi

If you can, do the following things :

1) Update your server with the Service Pack For ProLiant from HPE (i used the 2019.03.1 and it works)

2) Update your ESXi 6.7 to the build 13981272 (aka  ESXi670-201906002)

Let me know if you still encounter issues Smiley Happy

Regards,

Gael

Reply
0 Kudos
Dorini
Contributor
Contributor

Hi GaelV ,


quick reply on this ... it did not help fixing my issue.
I´ve downloaded the ServicePack and did an auto apply to the server ( it only updated the system rom to the latest version, all other parts seemed to be up to date ), did a reset of the alarm in the vCenter but they are showing up again ... Alarm: Host hardware fan status ... It it Hardware Health, ID 0.30.1.49 - Cooling Unit 1 Fans - Warning - Assert + Fan Redundancy lost.

Any other idea how to get a good sleep at night again?

Dorini

Reply
0 Kudos
GaelV
Enthusiast
Enthusiast

Hi,

Weird, i did it with Interactive Mode, something there's differences with Automatic mode.

I've just find out the mail that vmware support send me to completely resolve the issue, it works for me but i don't know if it could works for everyone, so i won't be responsible if the following cli affects your server, software or anything.

As a reminder, i had the issue with HP ML350 Gen10 with ESXi 6.5 Update 2.

The support told me :

1) Update the server with the SPP [done]

2) Update the ESXi to Update 3

3) Finally, put this cli in ssh : "localcli hardware ipmi sdr list  |grep "Cooling Unit 1 Fans " &&  esxcfg-advcfg -s 49 /UserVars/HardwareHealthIgnoredSensors"

4) Reboot

5) Everything's work fine.

I repeat, it works for me because the vmware support told me to do this, maybe if the warning is about different devices, it won't work for you.

Let me know Smiley Happy