My Hosts are ESX 3.0.2 (61618) and VC is VirtualCenter 2.0.2 Update 2 (62327), I have NFS Netapp storage.
My Problem is when the hosts are in a cluster I can enable DRS but not HA, when I try I get an error
"Insufficient resources to satisfy HA failover level on cluster London in UK
Unable to contact primary HA agent in cluster London in UK"
Is HA not available if using NFS storage?
This isn't really something to do with the type of storage you have ?
HA agent is apparently not able to find the Primary host in the cluster.
Did you try Disabling HA on the entire cluster(this can be done thru the cluster settings) ? This will un-configure the HA agent on all the hosts in the cluster. You will see the progress in the tasks pane. Once this is done., go ahead and enable HA on the entire cluster.
Cheers....
Abhilash
Yes I have tried this many times but it still fails.
Worked this through?
The configuration of ESX Server host networking and name resolution, as well as the networking infrastructure external to ESX Server hosts (switches, routers, and firewalls) is critical to optimizing VMware HA setup. When you configure these components, use the following best practices to improve VMware HA performance.
If your switches support the PortFast (or an equivalent) setting, enable it on the physical network switches that connect servers. This helps avoid spanning tree isolation events. For more information on this option, see the documentation provided by your networking switch vendor.
Make sure that the following firewall ports are open for communication by the service console for all ESX Server 3 hosts:
Incoming port: TCP/UDP 8042-8045
Outgoing port: TCP/UDP 2050-2250
For better heartbeat reliability, configure end-to-end dual network paths between servers for service console networking. Configure shorter network paths between the servers in a cluster. Routes with too many hops can cause networking packet delays for heartbeats.
Disable VMware HA (using VirtualCenter, clear the Enable VMware HA checkbox in the Settings dialog box for the cluster) when you perform any networking maintenance that might disable all heartbeat paths between hosts.
Use DNS for name resolution rather than the error-prone method of manually editing the local /etc/hosts file on ESX Server hosts. If you do edit /etc/hosts, you must include both long and short names.
Use consistent port names on VLANs for public networks. Port names are used to reconfigure access to the network by virtual machines. If you use inconsistent names between the original server and the failover server, virtual machines are disconnected from their networks after failover.
Use valid virtual machine network labels on all servers in a VMware HA cluster. Virtual machines use these labels to reestablish network connectivity upon restart.
AWo
HA with NFS is working (configured it several times)
The question is what "insufficient resources" means in this case.
Are the cluster hosts in the same LAN?
Is the network between the hosts open or restriced by a firewall?
Make sure to check your FQDN if its larger than 30 character than its a problem, also try to edit your /etc/hosts file to FQDN and shortnames too to see if it work. You might also have problem with configuration of your HA Failover Capacity which allow how many hosts to be failover etc..
Example:
Current Failover Capacity: 0 hosts
Configured Failover Capacity: 1 hosts
See before for more details.
http://communities.vmware.com/thread/52601
BTW, were you able to remove and disconnect your host and added to new cluster to see if it work. I've done that before because some dummy process is hose and we have to restart several critical services and disconnect the host and readded to new one. Than we were able to add back to old cluster. People have different issues so its hard to tell but try to experience troubleshooting steps!
How many ESX hosts do you have? Did you get the error messages in all hosts?
Sorted, it was the VMkernal Gateway being incorrect