VMware Communities > VMTN > VMware vSphere™ > VMware ESX™ 4 > Discussions

This Question is Answered

1 "correct" answer available (10 pts) 2 "helpful" answers available (6 pts)
13 Replies Last post: Oct 14, 2009 12:43 AM by mightycjo
Reply

vSphere 4 and ESXi 4 HA errors

Jun 5, 2009 1:03 AM

Click to view adeelleo's profile Novice adeelleo 14 posts since
May 26, 2009
I am having trouble enabling HA on the new ESXi 4 and vSphere 4 environment on my IBM BaldeCenter S environment. Everything was working perfect untill i activated DPM on aggressive mode. DPM powered down all hosts except one ruuning all the VM's. Now since then the HA agent is not configuring properly.

I receive the following errors when i try to enable HA on the cluster.


HA agent has an error : cmd addnode failed for
secondary node: Internal AAM Error - agent could
not start. : Unknown HA error
error
5/27/2009 1:59:51 PM
Administrator

HA agent has an error : Cannot complete the HA
configuration
error
5/27/2009 1:59:51 PM
Administrator


I have tried everything from disabling and re-enabling HA, Creating a new cluste and DataCenter, entering hosts in maintainance mode and then exiting, disconecting the hosts and removing then re-adding, installing frest ESXi and vCenter servers, changing network parameters, reconfiguring for HA.


I have manually edited the host file and entered the entries for all esx hosts. All esx hosts and vCenter can ping each other through IP, FQDN and short names.


Any help would be highly appretiated. I am totally out of options.


Best regards,


Adeel Akram

Reply Re: vSphere 4 and ESXi 4 HA errors May 27, 2009 2:56 AM
Click to view AndreTheGiant's profile Guru AndreTheGiant 5,643 posts since
Aug 28, 2008
Have you fixed also /etc/opt/vmware/aam/FT_HOSTS?

Have you reinstalled the HA agent (using VIC)?

Andre
**if you found this or any other answer useful please consider allocating points for helpful or correct answers
Reply Re: vSphere 4 and ESXi 4 HA errors May 27, 2009 3:09 AM
in response to: AndreTheGiant
Click to view adeelleo's profile Novice adeelleo 14 posts since
May 26, 2009

Hi Andre,

Thanks for the quick response.

if i give the command "vi /etc/opt/vmware/aam/FT_HOSTS" on one of my 4 x ESXi 4.0 hosts, i receive an empty file with text input. Do i need to enter text in this file as well as i did for "/etc/hosts" file?

How do you reffer i install the HA agent through VIC. If you are refering to right clicking the host and clicking reconfigure for HA and un-checking and checking HA and DRS in the cluster settings. Then yes, i did bot that.

Best regards,

Adeel Akram

Reply Re: vSphere 4 and ESXi 4 HA errors May 27, 2009 3:25 AM
in response to: adeelleo
Click to view AndreTheGiant's profile Guru AndreTheGiant 5,643 posts since
Aug 28, 2008
Sorry, I forgot that you have ESXi.

Try:
/opt/vmware/aam/VMware-aam-ha-uninstall.sh

Then reinstall the HA agent.

Andre
**if you found this or any other answer useful please consider allocating points for helpful or correct answers
Reply Re: vSphere 4 and ESXi 4 HA errors May 27, 2009 3:35 AM
in response to: AndreTheGiant
Click to view adeelleo's profile Novice adeelleo 14 posts since
May 26, 2009
Hi Andre,

Thanks for the quick response.

if i give the command "vi /etc/opt/vmware/aam/FT_HOSTS" on one of my 4 x ESXi 4.0 hosts, i receive an empty file with text input. Do i need to enter text in this file as well as i did for "/etc/hosts" file?

How do you reffer i install the HA agent through VIC. If you are refering to right clicking the host and clicking reconfigure for HA and un-checking and checking HA and DRS in the cluster settings. Then yes, i did bot that.

Best regards,

Adeel Akram
Reply Re: vSphere 4 and ESXi 4 HA errors May 27, 2009 4:00 AM
in response to: AndreTheGiant
Click to view adeelleo's profile Novice adeelleo 14 posts since
May 26, 2009

Issue resolved.

In my case the problem was that i had manually entered the host file entries for all ESXi servers but i did not make an entry for the vCenter.

As soon as i made an entry for the vCenter on the host file of each ESXi server. Disconnected the hosts from the cluster and re-added with HA and DRS enabled. The issue was resolved.

Best regards,

Adeel Akram

Reply Re: vSphere 4 and ESXi 4 HA errors May 27, 2009 4:04 AM
in response to: AndreTheGiant
Click to view adeelleo's profile Novice adeelleo 14 posts since
May 26, 2009
Issue resolved.

In my case the problem was that i had manually entered the host file entries for all ESXi servers but i did not make an entry for the vCenter.

As soon as i made an entry for the vCenter on the host file of each ESXi server. Disconnected the hosts from the cluster and re-added with HA and DRS enabled. The issue was resolved.

Best regards,

Adeel Akram
Reply Re: vSphere 4 and ESXi 4 HA errors May 27, 2009 5:24 AM
in response to: adeelleo
Click to view AndreTheGiant's profile Guru AndreTheGiant 5,643 posts since
Aug 28, 2008
Good for you :)

Andre
**if you found this or any other answer useful please consider allocating points for helpful or correct answers
Reply Re: vSphere 4 and ESXi 4 HA errors Jun 2, 2009 10:02 PM
in response to: AndreTheGiant
Click to view adeelleo's profile Novice adeelleo 14 posts since
May 26, 2009

Hi Andre,

The nighmaire has returned once more.

I tested DPM with aggressive mode. All blades except one with the VM's were powered down by vCenter. Now when i powered them back up. I have the same HA error that i was having earlier.

HA agent has an error : cmd addnode failed for
secondary node: Internal AAM Error - agent could
not start. : Unknown HA error
error
6/3/2009 10:53:30 AM

HA agent has an error : Cannot complete the HA
configuration
error
6/3/2009 10:53:30 AM

Have tried everything but no luck so far. :-(

Best regards,

Adeel Akram

Reply Re: vSphere 4 and ESXi 4 HA errors Jun 5, 2009 7:44 AM
in response to: adeelleo
Click to view depping's profile Champion depping 2,992 posts since
Jan 17, 2005
VMware Moderator
All blades except for one? hmmmm, that's not a good thing in terms of High Availability. Are you sure?

Duncan
VMware Communities User Moderator | VCP | VCDX


Blogging: http://www.yellow-bricks.com
Twitter: http://www.twitter.com/depping

If you find this information useful, please award points for "correct" or "helpful".
Reply Re: vSphere 4 and ESXi 4 HA errors Jun 6, 2009 11:17 PM
in response to: adeelleo
Click to view AndreTheGiant's profile Guru AndreTheGiant 5,643 posts since
Aug 28, 2008
Have your tried to reinstall the HA agent?
The inventory in VC cluster is fine? Or there was also some old host names?

Andre
**if you found this or any other answer useful please consider allocating points for helpful or correct answers
Reply Re: vSphere 4 and ESXi 4 HA errors Jun 7, 2009 9:38 PM
in response to: AndreTheGiant
Click to view Texiwill's profile Guru Texiwill 10,056 posts since
Jan 13, 2004
Moderator
Hello,

Moved to the ESXi 4 forum.


Best regards, Edward L. Haletky VMware Communities User Moderator, VMware vExpert 2009
Now Available on Rough-Cuts: 'VMware vSphere(TM) and Virtual Infrastructure Security: Securing ESX and the Virtual Environment'
Also available 'VMWare ESX Server in the Enterprise'
SearchVMware Pro|Blue Gears|Top Virtualization Security Links|Virtualization Security Round Table Podcast
Reply Re: vSphere 4 and ESXi 4 HA errors Jun 8, 2009 10:34 PM
in response to: AndreTheGiant
Click to view adeelleo's profile Novice adeelleo 14 posts since
May 26, 2009

Well in my case this initially solved my problem. But this time i had
all entries in my host file correct and still could not resolve the
issue. So enabling HA and DRS one by one solved the issue for me.


Wierd things happen with HA in ESX. :-)
<!-- BEGIN attachments -->
<!-- END attachments -->
<!-- BEGIN content details -->

Reply Re: vSphere 4 and ESXi 4 HA errors Oct 14, 2009 12:43 AM
Click to view mightycjo's profile Novice mightycjo 9 posts since
Jul 23, 2009
solution: hi, had the same error on ESXi 4 but don´t want to use the "unsupported" mode for changing the /etc/hosts file. in my case the FQDN in the "search domains" was not the full one. after changing this and restart everything works fine. Also i had to check all dns entries (hosts, vcenter and so on).
Actions