VMware Cloud Community
CLowe
Contributor
Contributor

One ESX host keeps disconnecting...

I have one host in a 4 host cluster (HA snd DRS on identical hardware), all current patches, that this week suddenly started showing up as "not responding" in VC. It's gotten so bad that within 30-60 seconds of doing a disconnect/connect it goes back to a "not responding" state. All the vm's on it are fine. I have tried restarting the vc agent on the server and restarting the vc service but no go. I was going to try rebooting the host but it won't stay in vc long enough to be able to even vMotion anything.

Anyone seen this behavior before?

0 Kudos
11 Replies
kjb007
Immortal
Immortal

Have you changed IP addresses? When you registered the host, was it via IP or hostname? Restart the mgmt agent on the host, service mgmt-vmware restart.

-KjB

vExpert/VCP/VCAP vmwise.com / @vmwise -KjB
0 Kudos
CLowe
Contributor
Contributor

IP address has not changed. Host was registered 6 months ago via hostname. Have restarted agent multiple times. Only change lately was Update 2 applied to the cluster about 3 weeks ago and the date patch applied on the 14th.

One thing I just noticed though. On 2 of the 4 hosts (including the host that is giving me fits), if you go into the configuration tab there is something called HEALTH STATUS in the Hardware section. This does not exist in the other 2 hosts. They are supposed to be identical configurations.

0 Kudos
SimonHuizenga
Enthusiast
Enthusiast

is your DNS ok? only use lowercase names in DNS or hostfile.

0 Kudos
JonRoderick
Hot Shot
Hot Shot

One of my 3.5 U2 hosts did a similar thing (once) this morning. What ESX build version/VC build version are you using (I'm using 110268 - the latest I think).

Jon.

0 Kudos
CLowe
Contributor
Contributor

The hosts file is all lowercase and is identical to the other 3 hosts.

0 Kudos
CLowe
Contributor
Contributor

According to VC, the hosts are 3.5.0 110181 and VC and client are 2.5.0 104215

0 Kudos
espi3030
Expert
Expert

Sounds like a licensing issue to me. I would check the license status on this host first, then check your license server to make sure license is being properly read.

Hope this helps.

0 Kudos
kjb007
Immortal
Immortal

This should also be reflected in the vmkernel/hostd.log

-KjB

vExpert/VCP/VCAP vmwise.com / @vmwise -KjB
0 Kudos
mike_laspina
Champion
Champion

Hello,

Have a look at the vpxa.log or post it

cat /var/log/vmware/vpx/vpxa.log

http://blog.laspina.ca/ vExpert 2009
0 Kudos
JonRoderick
Hot Shot
Hot Shot

My vpxa log didn't seem to cover the time period of the problem - I've gone route one and raised a support call.

0 Kudos
CLowe
Contributor
Contributor

Final update:

After disconnecting/reconnecting about 20 times I was able to finally get all the vm's off the host. I then rebooted it and have not had a problem since late yesterday. At this point I don't know what the issue could have been.

>>> espi3030 <communities-emailer@vmware.com> 8/27/2008 8:10 AM >>>

Chris Lowe,

A new message was posted in the thread "One ESX host keeps disconnecting...":

http://communities.vmware.com/message/1035376

Author : espi3030

Profile : http://communities.vmware.com/people/espi3030

Message:

0 Kudos