Hi everyone, we are utilizing vSAN 6.x for our production server environment. We have 5 host. We just had a major event where 2 disk needed to be replaced. We replaced those disk and then a resync of data kicked off. A few days later the resync was rebalancing 70TB worth of data. It brought the vSAN to a crawl while our host would become unresponsive as their disks became full. Got on the phone with support and they noticed we had our rebalance configuration set at 95%. They believe that configuration plus the replacing of two disk caused the problem. We have a plan of action to change the rebalance to their recommended 80% threshold when this is over with. We lost services to our mission critical servers for two days.
Now I need to plan for redundancy. What if this happened again? How redundant is vSAN? what if vSAN itself becomes corrupt? Thinking outside the box I realized we have no other storage array to move data to and that would have saved us in this event.
What do most people do for vSAN redundancy? what is recommended? What are the best options here? I am now planning for the future as I consider this a disaster situation. Any ideas and thoughts are appreciated!
You can use SRM or vSphere Replication in your VSAN infrastructure. You can replicate the virtual machines in your infrastructure to a different site. vSphere Replication is included in your current license. You also do not need to make a purchase.
A couple of ideas for your DR.
Hope this helps.
It must have been highly stressful time for you and your team. It is sad to hear and a bit difficult for me to imagine.
Few important points
If I were you I will divide this goal into three parts
1. Things which you can do immediately
2. Things which you can do in 6 months
3. Last, in one year.
1. Easy Wins
2. Short Term Fixes
3. Long term Plan
I hope it helps if you wish to discuss more please reach me out.