I have the following error on my of my hosts:
HA agent on [host] in [cluster] in [datacenter] has an error:unknown HA error
How do I begin to troubleshoot this.
I have inherited this system so not all the settings were made by me.
Any assitance would be greatly appreciated.
mrc2011
Message was edited by: amaier650 Attachment removed.
Have you got some log files for us to look at?
It may be worth disabling HA on the cluster and re-enabling it and seeing what happens?
Also, you will want to make sure that all hosts in the cluster have identically PortGroups and all shared storage available.
You may want to remove that attachment - not good practice to publish attachments with company info included.
updated screengrab:
Have you got some log files for us to look at?
It may be worth disabling HA on the cluster and re-enabling it and seeing what happens?
Also, you will want to make sure that all hosts in the cluster have identically PortGroups and all shared storage available.
First thing I tend to focus on when HA fails is DNS. Make sure all of the vSphere hosts have updated and correct DNS entries.
From the vSphere Client -
check too hostname in DNS (FQDN)
Thanks bulletprooffo.... turning it off got rid of the host error.
I will investigate what needs to be done before turning it on.
Actually you should be looking in Events under Tasks & Events for more detailed and accurate info. Sometimes HA is configured correctly but the summary section still shows error and does not get updated unless HA is reconfigured. I have seen issues with HA failing on other nodes when one of the host in the cluster is put into maintenance mode or even failing on the host which is exiting maintenance mode.
A good place to start troubleshooting
Does the system log tell you what the error is?
Last time I check the log and it tells me that I don't have enough resource and it turns out that one of my host has only 2G of RAM and I need at least 3G of RAM for HA to be enabled on that host.
mrc2011 I've been through the same trouble
If you have followed bulletprooffool & chriswahl00 still not yet problem solved do as follows,
1. Disable the HA cluster.
2. Check wether the host in maintanence mode.
3. If you are running production VMs on the troubled host move them temporary to a another host.
4. Then restart the host. Bcos restarting management agent some times not solving the problem
5. Once the host restarted try Enable the HA cluster.