We are receiving HA error on one of our esxi host in a cluster of 4 servers. Its showing error that HA is not working, conenction timed out.
All VM's hosted are showing in disconnected status but working fine, however there is one VM which is not responding. How can I make it working as migration is also not working on it.
restart the management agents on the host.
do it after hours. if the restart fails, then the vms may also start responding.
no way to get the unresponsive vm started without being able to connect to the host.
we were unable to start the management services from esx. Even no commands were working except basic commands like date/time etc..
Rebooting the host fixed the issue. But we are still not sure why did it occur?
check syslog and hostd logs on your host.
you can browse the logs via https://<esxi ip address>/host , and you should be able to get them via the VI client, or DCUI console.
The 2 logs above should help you on your way but here are what some of the logs do: