Hi all,
we have a strange error within our HA-enabled cluster.
One of the hosts became unresponsive (network is up, but SSH to the host no longer works and login at the console just hangs). The VMs were marked as not responding but are still active and running.
Instead of rebooting the host by powering it off and manually restarting the VMs afterwards, we thought we could simulate a host isolation by unplugging the cables for Service Console 1 and 2. OK, nifty idea, and the host was in fact recognized as isolated: vSphere started to move some VMs off and restart them elsewhere in the cluster.
All of a sudden HA just stopped moving the remaining VMs off the host! Now they just sit there as not responding, though they actually run fine. We have since replugged the cables to the NICs, but of course the host stays as not responding until I reboot it.
I still don't want to reboot the host before all VMs are moved off. Does anybody have an idea?
Thanks!