Re: Warning for Add-in Card 10 35-GOU2

Dimitri2000 · ‎05-12-2014

So for some reason i have warnings on my 4 ESXi hosts.
3 of which when i go alarms or warnings or hardware status i dont see anything. But they all have the yellow exclamation warning sign on the HOSTs. We have a 90 Day Evaluation license which has 60 days left. They are 3 Proliant Gen 8 and 1 Gen 7 blades that host ESXi. ESXi and vSphere are all 5.5.

But for right now the Gen 8 that has something in the Hardware Status monitor is driving me crazy. I try to go to the HP onboard Administrator and i do health checks and hardwarde tests and it shows everything is good and all lights are Green (showing perfecrt health).

Please see screenshot below and let me know if you could help me out.

Thanks.

magnussjodin · ‎05-21-2014

Hi Dimitri,

We are experiencing the same problem on our Gen8 with vSphere 5.5.

Did you find a solution to this problem?

Br,

Magnus

Fnilsen80 · ‎06-01-2014

Anyone found a solution to this?

I have the same warnings (after upgrading 5.0 to 5.5U1)

NitzanYemal · ‎06-01-2014

Same here - after upgrading from 5.0 to 5.5

ASOF · ‎06-03-2014

Exaclty the same problem here Gents.

HP Blades 460c Gen8 , with latest SPP (February 2014) and ESXI 5.5u1 .

Onboard administrator 4.21 not reporting any errors.

i have tried reseting the sensors but nothing happens

vmware pls advice

AS

Linjo · ‎06-11-2014

Try to restart the host, have seen this a few times and a reboot have solved it.

// Linjo

Best regards, Linjo Please follow me on twitter: @viewgeek If you find this information useful, please award points for "correct" or "helpful".

ASoff · ‎06-11-2014

Well I had HP support USA connect to my blades yesterday and restarted almost all the watchdogs , vpxa ,, hpsum and agents and the notification went away (that and one more blade with another sensor reporting issue).

I guess restarting the blade would probably have the same effect but we avoided having to vmotion the VMs to other blades and crowd them (especially avoided the ones using RDMs)

anyway looks ok for now , hope the notification never come back.

thanks

Asof

AndyH310 · ‎06-25-2014

Same problem here. Restarting the agents from the ESXi shell with this command clears the warnings after a few minutes:

services.sh restart

ragmon · ‎06-30-2014

Same problem here as well with ESXi 5.5 U1.

ASoff · ‎06-30-2014

suggest you try installing the latest ISO VMware-ESXi-5.5.0-Update1-1746018-HP-5.74.27-Jun2014.iso

MarcusFoelling · ‎07-07-2014

Hi Dimitri,

~ # /etc/init.d/sfcbd-watchdog restart

and then Host > Hardware Status > Update

solved the issue for us.

Regards,

Marcus

VCP - VCI CCA - CCEA - CCIA - CCI

vmatzeetcATdts · ‎07-15-2014

Hi guys,

same problem here, any updates beside restarting monitoring services or servers?

Cheers

Matt

narmonk · ‎07-28-2014

We are experiencing the same issues across multiple clusters and data centers. Have put in a call with VMware & HP support and so far have only been told to try the ~ # /etc/init.d/sfcbd-watchdog restart that Marcus mentioned by VMware. Although this took away the alert, it was only temporary. HP ran hardware tests and found that there were no issues with the hardware and had no other offerings by means of solution as they pointed the blame back at VMware. We are receiving not only the same temperature issues, but are also receiving issues about Storage, Logs and System Chassis 3 Enclsoure Asserts. We have reopened the case with VMware and will update if we get anywhere past this. Has anyone else had any success? This is pretty ridiculous.

jlambrichts · ‎08-11-2014

Dear all,

We are experiencing the same issue on our blade infrastructure with HP firmware 4.01, even after update ESXi, 5.5.0, 1892794.

This issue is not occurring on our “older” blades with HP firmware 3.71.

Kind regards,
Jeroen Lambrichts

amirhusainov · ‎09-11-2014

Try this #localcli hardware ipmi sel clear. my techdirt: false GPU overhearting warning on esxi 5.5

Or in the vSphere client you can choose host, open Configuration->Security Profile->click Properties... link in the Services section and restart CIM Server. You can try to Update information in the Hardware Status pane, but it will be much faster logout of the client and login again.

Or open ssh session and run command /etc/init.d/sfcbd-watchdog restart.

It worked for me.

Dipankar1985 · ‎09-26-2014

Hi All,

I am also facing same problem with multiple ESXi 5.5.0 update 1746018 and all system are installed HP Proliant BL460c Gen8 Blade. Clearing IPMI logs by localcli hardware ipmi sel clear or resetting the sensor is not helping here any more. I have found this might be a fales alert can be possible with above Hardware and OS combination:

my techdirt: false GPU overhearting warning on esxi 5.5

I understand restarting watchdog/management service or ESXi host might fixed it temporarily but this is not right approach for any production environment.

Can somebody puts some lights on it ??

---Dipankar Saha

Fnilsen80 · ‎09-29-2014

Hi guys.

I updated all my hosts to from 5.5(no update) to 5.5 - Update 2 (HP image) about a week ago and have not seen any error messages after that. - and I had a LOT of error messages.

/rank

DBoring · ‎12-11-2014

We are running HP BL460c Gen 8 with a fresh install of HP ESXi 5.5 U2 (build 2068190). A service restart of Watchdog (CIM Server) resolved the alarm.

SCC3 · ‎01-16-2015

Has anyone found a permenant fix for this one?

We are running ESXi 5.5 build 2302651 on BL460c BIOS I31. Seeing the same issue.