I received an error message that there was a possible host failure detected by HA on one of our Hosts in a two-node cluster. This was at 5:36:20pm.
The next messages are as follows:
Host is being isolated from the cluster at 5:36:33PM
Insufficient resources are available to satisfy HA failover level......at 5:36:33PM
Sufficient resources are available to satisfy HA failover level....at 5:36:33PM
IThe VIC shows these events two more times.
My question is, how can I find exactly what caused the HA to think the Host failed? Is there a process and place to find more detailed logs other than the events found using the VIC? I found the Host online with Guest OSs running this morning after the events of last evening. The only reason I noticed was that one Guest was powered off and I was investigating how/why it was shutdown. Thank you.
The default detection for isolation is to ping the default gateway. Is this pingable? Did it go offline for some reason?
You can change it by setting the das.isolationaddress custom setting.