VMware Cloud Community
terrible_towel
Contributor
Contributor
Jump to solution

vDS upgrade with VSAN

I’ve got a single vDS that has the port groups for VSAN traffic (as well as management, vmotion, and all VM traffic).

It’s still at version 6.0.0.   The cluster is all at ESXi 6.7 (most recent).  The vCenter is also at most recent 6.7.

I was thinking it’s time to hit that “upgrades available” button on the vDS and upgrade it to the most recent.

Does anyone have any experience with upgrading the vDS while using VSAN through that vDS?

I’m worried about the stability of VSAN while the single vDS that everything goes through is upgraded.

Thanks!

Note:

I actually opened a support ticket with this.. but got very (very) little help. 

They just were not in the mood to help and claimed to never have heard of anyone using VSAN doing a vDS upgrade.

Said they searched for best practices with upgrading vDS with VSAN.. found nothing...

Reply
0 Kudos
1 Solution

Accepted Solutions
TheBobkin
Champion
Champion
Jump to solution

Hello terrible_towel

"Is this saying to reconfigure the VMs that loose network connectivity?"

No, actually it says to basically make any DVPortgroup change that then triggers the switchover part of the upgrade that didn't complete - from reading the associated PR that particular issue appears to have been resolved in 6.7 U1.

"I know the likelihood of this is low.. but I want to be ready just in case.  (600+ VMs on VSAN cluster)."

As the kb (and I) mentioned, it is recommended to perform this activity during a maintenance window - just because something can be done on the fly, doesn't mean it is advisable to do so.

Bob

View solution in original post

Reply
0 Kudos
5 Replies
TheBobkin
Champion
Champion
Jump to solution

Hello terrible_towel

Firstly, sorry to hear you had a seemingly negative support experience - if you could please PM me the SR number I would like to have a look from our side and loop in my management with regard to this case.

As a daily practitioner of worst-possible-scenarios (mostly user/admin/architect-caused) I do understand your concern over clicking the button that is in reality going to do a lot of things and potentially reacting unwanted states.

From my own research and testing of upgrading vDS from 6.0 to 6.6 (on HOL, nested-homelab and physical) I have never encountered an issue that caused vSAN cluster to become partitioned (as opposed to migration testing ripping out a used uplink and migrating to a vDS which recovers in ~30 seconds on a HOL cluster).

That is not to say there is no possibility of a negative impact, there are known issues (and apologies but I am going to have to get this kb remediated as it makes it sound like it is NSX-specific when it does not appear to be so):

https://kb.vmware.com/s/article/52621

As our documentation advises, please take a backup of the vDS configuration before upgrading:

Upgrade a vSphere Distributed Switch to a Later Version

When it comes to networking change guidance I am generally overly-cautious, but this is borne of seeing how bad it can go when someone doesn't understand and/or know their network layout and/or the potential impact of the changes they are making (and wants to make them on the fly just because it is possible), thus I generally advise *if possible* that planned downtime or changing this during non-production hours is the best course of action. That being said, upgrading vDS is relatively low on the scale of things that you can change that have the potential to really break things (e.g. MTU/VLAN/physical changes).

Bob

Reply
0 Kudos
terrible_towel
Contributor
Contributor
Jump to solution

Hello,

Thank you so much for the reply.  This is exactly the kind of response I was looking for.  In that KB:

VMs and vmknics lose network connectivity if LAG ports are connected to DVS after VMs are connected and the DVS is upgraded:

  • VM and vmknics dont have network connectivity.

To resolve this issue:

  • Trigger reconfiguration of the affected VM by changing any DVPortgroup setting in the VC UI or through AP.

That is the note that caused me to seek out help.

Is this saying to reconfigure the VMs that loose network connectivity?

i.e. .. Set-NetworkAdapter –NetworkName “new portgroup” and then Set-NetworkAdapter –NetworkName “original portgroup” for all the affected VMs?

OR is saying to go in to the settings for the portgroups themselves and change a setting?

I know the likelihood of this is low.. but I want to be ready just in case.  (600+ VMs on VSAN cluster).

Thanks!

Reply
0 Kudos
TheBobkin
Champion
Champion
Jump to solution

Hello terrible_towel

"Is this saying to reconfigure the VMs that loose network connectivity?"

No, actually it says to basically make any DVPortgroup change that then triggers the switchover part of the upgrade that didn't complete - from reading the associated PR that particular issue appears to have been resolved in 6.7 U1.

"I know the likelihood of this is low.. but I want to be ready just in case.  (600+ VMs on VSAN cluster)."

As the kb (and I) mentioned, it is recommended to perform this activity during a maintenance window - just because something can be done on the fly, doesn't mean it is advisable to do so.

Bob

Reply
0 Kudos
RDowling00
Contributor
Contributor
Jump to solution

Any update on how this went for you?

I see the KB has now been updated with a note:

NOTE: Upgrading a DVS affects all hosts on the DVS at once. To avoid possible issues to all hosts, make a new DVS at the appropriate version level, then move hosts and objects from your current DVS over to the new DVS after creation.

Is this something you have any feedback on?

 

Reply
0 Kudos
ripclawffb
Contributor
Contributor
Jump to solution

@terrible_towel what did you end up doing for the vDS upgrade on your VSAN cluster?

Reply
0 Kudos