I am currently migrating to vsphere 4.1. I have the vsphere server managing 3.5up2 hosts. Though I'm working to upgrade the hosts, it will be a few weeks before i can complete this task.
I am using vRanger to backup vms. The method I'm currently using puts a moderate load on the the service console and consumes almost 98 % of the cpu for certain vms. I'm concerned that the server may have heartbeat issues. How does the heartbeat work. Will a busy console possibley effect it?
It might impact the heartbeats indeed. However the isolation response will only be triggered if NO heartbeats are received for 14 seconds and the gateway can't be pinged. Chances of that happening due to very high load should be low. However you could indeed, as stated above, increase the das.failuredetectiontime to 30000 so that the chances of a false positive are decreased.
Duncan
VMware Communities User Moderator | VCDX
-
Now available: <a href="http://www.amazon.com/gp/product/1439263450?ie=UTF8&tag=yellowbricks-20&linkCode=as2&camp=1789&creative=9325&creativeASIN=1439263450">Paper - vSphere 4.0 Quick Start Guide (via amazon.com)</a> | <a href="http://www.lulu.com/product/download/vsphere-40-quick-start-guide/6169778">PDF (via lulu.com)</a>
Blogging: http://www.yellow-bricks.com | Twitter: http://www.twitter.com/DuncanYB
HA host monitoring will declare an ESX host down if no hearbeat is received in 15 seconds.
You can mitigate the risk by configuring redudant service console/Management networks.
Regards,
-Kyle
You can also change it from 15 seconds to say 30,45,60 depending on what works best, however don't change it to too great of a number as that could also cause problems if you actually do have an network isolation.
This site is a good reference for HA: http://www.yellow-bricks.com/vmware-high-availability-deepdiv/
It might impact the heartbeats indeed. However the isolation response will only be triggered if NO heartbeats are received for 14 seconds and the gateway can't be pinged. Chances of that happening due to very high load should be low. However you could indeed, as stated above, increase the das.failuredetectiontime to 30000 so that the chances of a false positive are decreased.
Duncan
VMware Communities User Moderator | VCDX
-
Now available: <a href="http://www.amazon.com/gp/product/1439263450?ie=UTF8&tag=yellowbricks-20&linkCode=as2&camp=1789&creative=9325&creativeASIN=1439263450">Paper - vSphere 4.0 Quick Start Guide (via amazon.com)</a> | <a href="http://www.lulu.com/product/download/vsphere-40-quick-start-guide/6169778">PDF (via lulu.com)</a>
Blogging: http://www.yellow-bricks.com | Twitter: http://www.twitter.com/DuncanYB