meghas1234
Contributor
Contributor

HA agent showing failed on one esx host in a cluster and one VM is not responding. Unable to migrate.

Hi All,

We are receiving HA error on one of our esxi host in a cluster of 4 servers. Its showing error that HA is not working, conenction timed out.

All VM's hosted are showing in disconnected status but working fine, however there is one VM which is not responding. How can I make it working as migration is also not working on it.

0 Kudos
5 Replies
sparrowangelste
Virtuoso
Virtuoso

restart the management agents on the host.

do it after hours. if the restart fails, then the vms may also start responding.

no way to get the unresponsive vm started without being able to connect to the host.

--------------------- Sparrowangelstechnology : Vmware lover http://sparrowangelstechnology.blogspot.com
0 Kudos
meghas1234
Contributor
Contributor

we were unable to start the management services from esx. Even no commands were working except basic commands like date/time etc..

Rebooting the host fixed the issue. But we are still not sure why did it occur?

0 Kudos
depping
Leadership
Leadership

usually right clicking the host and selecting "reconfigure for HA" gets the job done. if you want to do a root cause analysis you will need to dive in to the logfiles though,

0 Kudos
meghas1234
Contributor
Contributor

We already tried by reconnecting it from vshere client. But it didn't work.

Which  log files yiu are talking about? esx host's?

0 Kudos
sparrowangelste
Virtuoso
Virtuoso

check syslog and hostd logs on your host.

you can browse the logs via https://<esxi ip address>/host  , and you should be able to get them via the VI   client, or DCUI console.

The 2 logs above should help you on your way but here are what some of the logs do:

http://sparrowangelstechnology.blogspot.com/2012/07/what-to-dcui-console-logs-show.html

--------------------- Sparrowangelstechnology : Vmware lover http://sparrowangelstechnology.blogspot.com
0 Kudos