VMware Cloud Community
mikeyes
Enthusiast
Enthusiast

Dell DSET utility breaks hardware monitoring on Dell Poweredge R710 Servers

I have 6 Dell Poweredge R710 servers. Each server is running ESX 4 update 1. On each server I ran the Dell DSET utility to generate a report for Dell tech support (I did not install DSET I only used it to run the report). After DSET was run and the host was rebooted the ability to see the hardware status was broken.

If you access the host directly with the vClient the tab that should contain the hardware status is empty. If you access the vCenter server and click the Hardware Status tab you get the error "Hardware monitoring service on this host is not responding or not available"

Vmware support worked on the server for a few hours and Dell tech support is trying to re-create the situation in their lab.

Has anyone seen anything like this before. I have found other posts talking about the loss of hardware status after installing Dell openmanage agents.

(I have removed and re-added the hosts as suggested in http://communities.vmware.com/thread/219556)

Thanks.

0 Kudos
5 Replies
cstep
Contributor
Contributor

I have the exact same issue on one of my 5 hosts. I too ran (did NOT install) the DSET utility for a Dell Tech. Have you gotten any info on this?

0 Kudos
mikeyes
Enthusiast
Enthusiast

Dell finally admitted that this was their fault. Even though you don't install DSET Dell was nice enough to leave behind a few rpm packages that interfere with the hardware monitoring.

I was able to fix it by removing these two packages:

-sblim-sfcc-2.1.0Dell-6.1.el4

-sblim-sfcb-1.3.2Dell-9.1.el4

-and installing the Update 2 for ESX 4.

Dell told me they would have this fixed by the time DSET 2.0 was released

I also found these steps as a possible solution BUT CANNOT CONFIRM THEM as I did not try them myself

-


1) /etc/init.d/sfcbd-watchdog stop

2) rpm -e sblim-sfcb --nodeps

3) rpm -e sblim-sfcc --nodeps

4) rpm -qa | grep vmware-esx-cim - take note of the build number at the end since we'll be reinstalling this later

5) rpm -e vmware-esx-cim --nodeps

6) Good time to reboot here if allowed - but I was able to proceed w/o reboot

7) Grab vmware-esx-cim.xxxx.rpm from install CD - best to make sure build is exact or slightly newer (can be updated with an Update Manager scan later)

😎 /etc/init.d/sfcbd-watchdog start

-


0 Kudos
cstep
Contributor
Contributor

Would have been nice if they had stopped using the product if it caused these problems. My issue started last week. Looks like yours was in June.... I may try your alternate fix to see if it helps. Just for fun. Thanks for the info!

0 Kudos
Norcal_Sandman
Contributor
Contributor

Cudos to Mikeyes! Dell Dset 2.0 definetly breaks hardware monitoring by just running the utility. I burnt many hours researching causes for this issue and mine was more complicated due to recent replacement of mobo in Dell PET610. Due to many changes...mobo and firmwares upgrades, I did not immediately relate the issue to the Dset util that I had ran prior to mobo replacement.

To resolve for ESX4 U1 I did the following

service sfcbd-watchdog stop

rpm -e sblim-sfcb --nodeps

rpm -e sblim-sfcc --nodeps

rpm -e vmware-esx-cim --nodeps

Transfer vmware-esx-cim-4.1.0-0.0.260247.i386.rpm from ESX4 U1 iso

rpm -ivh vmware-esx-cim-4.1.0-0.0.260247.i386.rpm

service sfcbd-watchdog start

Update hardware monitoring. Good to go again! Thanks Dell 😕

0 Kudos
rreynol
Enthusiast
Enthusiast

We have used another way to deal with this issue that leaves the VMware RPMs and services alone.

First use rpm -qa to search for any Dell related RPM files:

root@esx08-in ~]# rpm -qa|grep -i dell
libcmpiCppImpl0-2.0.0Dell-1.1.el4
sblim-sfcb-1.3.2Dell-9.1.el4
sblim-sfcc-2.1.0Dell-6.1.el4

Use the rpm -e command to remove them

[root@esx08-in ~]# rpm -e sblim-sfcb-1.3.2Dell-9.1.el4
[root@esx08-in ~]# rpm -e sblim-sfcc-2.1.0Dell-6.1.el4
[root@esx08-in ~]# rpm -e libcmpiCppImpl0-2.0.0Dell-1.1.el4

Now the upgrade to the new version of ESX should work fine.  I had the bad experience of attempting the other steps while in the middle of an upgrade and ended up corrupting the initrd and had to rebuild ESX.

0 Kudos