Pratt
Contributor
Contributor

Detailed Log files after 'A possible host failure has been detected by HA on XXXX'

I received an error message that there was a possible host failure detected by HA on one of our Hosts in a two-node cluster. This was at 5:36:20pm.

The next messages are as follows:

  • Host is being isolated from the cluster at 5:36:33PM

  • Insufficient resources are available to satisfy HA failover level......at 5:36:33PM

  • Sufficient resources are available to satisfy HA failover level....at 5:36:33PM

IThe VIC shows these events two more times.

My question is, how can I find exactly what caused the HA to think the Host failed? Is there a process and place to find more detailed logs other than the events found using the VIC? I found the Host online with Guest OSs running this morning after the events of last evening. The only reason I noticed was that one Guest was powered off and I was investigating how/why it was shutdown. Thank you.

0 Kudos
1 Reply
bhadzik
Enthusiast
Enthusiast

The default detection for isolation is to ping the default gateway. Is this pingable? Did it go offline for some reason?

You can change it by setting the das.isolationaddress custom setting.

0 Kudos