Hi,
As of very recently we have been experiencing random network disconnections in our guests, without any visible reason or order, which are driving me up the wall...
Our setup
We have three ESX 3.5 servers, each connected to the LAN via three vSwitches:
- one vSwitch with a Service Console port group and 2 physical NICs
- one vSwitch with a VMkernel port group and 2 physical NICs
- one vSwitch with several VMnet port groups, one for each VLAN, with 6 physical NICs, one of which is designated the standby adapter
All physical NICs connect to two HP Procurve 5406zl core switches (the five active ports to one switch, the standby port to the second one), on which all the linked ports have the VLANs in use set in tagged mode.
An edited part of the switch config (ports A15 through A18 are connected to one ESX server):
vlan 10
name "PROD"
tagged A15-A18
exit
vlan 20
name "TEST"
tagged A15-A18
exit
vlan 30
name "UITW"
tagged A15-A18
exit
spanning-tree
spanning-tree B8 path-cost 50
spanning-tree Trk1 priority 4
spanning-tree config-name "SITE01"
spanning-tree legacy-path-cost
spanning-tree force-version STP-compatible
The vSwitches are set up as follows:
- Promiscuous Mode: Reject
- MAC Address Changes: Accept
- Forged Transmits: Accept
- Traffic shaping: disabled
- Load Balancing: Route based on the originating virtual port ID
- Network Failover Detection: Link status only
- Notify switches: Yes
- Failback: Yes
The guests have two virtual network adapters: one for our production LAN (VLAN 10) and one for iSCSI access to our NAS (VLAN 106).
Our problem
Since very recently VM's randomly lose network connections. Windows does not show the link as disconnected, but still cannot get traffic in or out to other systems, except to guests that are on the same ESX server (which soft of makes sense as this traffic never actually touches the physical adapter). The really weird bits are:
- A single VM on one ESX may suddenly have this problem at any time, while the other VM's on the same ESX still work fine
- A single VM may have this problem on one NIC but not on both, or sometimes on both cards at the same time
- Neither Windows or VMware report any issues/events/etc.
Does anyone have experience with issues like these? Is this a known issue (I could not find any info on this while search through the discussions here)?
Any help would be greatly appreciated!
Mark.
As of very recently we have been experiencing random network disconnections in our guests, without any visible reason or order, which are driving me up the wall...
Our setup
We have three ESX 3.5 servers, each connected to the LAN via three vSwitches:
- one vSwitch with a Service Console port group and 2 physical NICs
- one vSwitch with a VMkernel port group and 2 physical NICs
- one vSwitch with several VMnet port groups, one for each VLAN, with 6 physical NICs, one of which is designated the standby adapter
All physical NICs connect to two HP Procurve 5406zl core switches (the five active ports to one switch, the standby port to the second one), on which all the linked ports have the VLANs in use set in tagged mode.
An edited part of the switch config (ports A15 through A18 are connected to one ESX server):
vlan 10
name "PROD"
tagged A15-A18
exit
vlan 20
name "TEST"
tagged A15-A18
exit
vlan 30
name "UITW"
tagged A15-A18
exit
spanning-tree
spanning-tree B8 path-cost 50
spanning-tree Trk1 priority 4
spanning-tree config-name "SITE01"
spanning-tree legacy-path-cost
spanning-tree force-version STP-compatible
The vSwitches are set up as follows:
- Promiscuous Mode: Reject
- MAC Address Changes: Accept
- Forged Transmits: Accept
- Traffic shaping: disabled
- Load Balancing: Route based on the originating virtual port ID
- Network Failover Detection: Link status only
- Notify switches: Yes
- Failback: Yes
The guests have two virtual network adapters: one for our production LAN (VLAN 10) and one for iSCSI access to our NAS (VLAN 106).
Our problem
Since very recently VM's randomly lose network connections. Windows does not show the link as disconnected, but still cannot get traffic in or out to other systems, except to guests that are on the same ESX server (which soft of makes sense as this traffic never actually touches the physical adapter). The really weird bits are:
- A single VM on one ESX may suddenly have this problem at any time, while the other VM's on the same ESX still work fine
- A single VM may have this problem on one NIC but not on both, or sometimes on both cards at the same time
- Neither Windows or VMware report any issues/events/etc.
Does anyone have experience with issues like these? Is this a known issue (I could not find any info on this while search through the discussions here)?
Any help would be greatly appreciated!
Mark.
Tags:
esx,
3.5,
network,
connection,
guest