VMware Cloud Community
zaspam
Enthusiast
Enthusiast

DELL R810 issues post upgrade to vsphere 6.5

Hi all,

I have upgraded my DELL R810 servers to vSphere 6.5 (even though they are not listed as compatible in the HCL) with the Dell Customized ISO.

Now I have the issue that the server does not restart or shutdown poroperly. On reboot or shutdown it halts with an NMI fault and a PSOD.
Furthermore the following errors are logged in the system log of the IDRAC:

pastedImage_0.png

I have traced the components with lspci to be:

0000:00:00.0 Bridge: Intel Corporation 5520/5500/X58 I/O Hub to ESI Port [PCIe RP[0000:00:00.0]]

0000:00:01.0 Bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 1 [PCIe RP[0000:00:01.0]]

0000:00:02.0 Bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 2 [PCIe RP[0000:00:02.0]]

0000:00:03.0 Bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 3 [PCIe RP[0000:00:03.0]]

0000:00:05.0 Bridge: Intel Corporation 5520/X58 I/O Hub PCI Express Root Port 5 [PCIe RP[0000:00:05.0]]

0000:00:07.0 Bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 [PCIe RP[0000:00:07.0]]

0000:00:09.0 Bridge: Intel Corporation 7500/5520/5500/X58 I/O Hub PCI Express Root Port 9 [PCIe RP[0000:00:09.0]]

0000:00:14.0 Generic system peripheral: Intel Corporation 7500/5520/5500/X58 I/O Hub System Management Registers

0000:00:14.1 Generic system peripheral: Intel Corporation 7500/5520/5500/X58 I/O Hub GPIO and Scratch Pad Registers

0000:00:14.2 Generic system peripheral: Intel Corporation 7500/5520/5500/X58 I/O Hub Control Status and RAS Registers

0000:00:1a.0 Serial bus controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #4

0000:00:1a.7 Serial bus controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #2

0000:00:1d.0 Serial bus controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #1

0000:00:1d.1 Serial bus controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #2

0000:00:1d.2 Serial bus controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #3

0000:00:1d.7 Serial bus controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #1

0000:00:1e.0 Bridge: Intel Corporation 82801 PCI Bridge

0000:00:1f.0 Bridge: Intel Corporation 82801JIB (ICH10) LPC Interface Controller

0000:00:1f.2 Mass storage controller: Intel Corporation ICH10 4 port SATA IDE Controller [vmhba1]

0000:01:00.0 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic0]

0000:01:00.1 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic1]

0000:02:00.0 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic2]

0000:02:00.1 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic3]

0000:03:00.0 Bridge: PLX Technology, Inc. PEX 8624 24-lane, 6-Port PCI Express Gen 2 (5.0 GT/s) Switch [ExpressLane]

0000:04:00.0 Bridge: PLX Technology, Inc. PEX 8624 24-lane, 6-Port PCI Express Gen 2 (5.0 GT/s) Switch [ExpressLane]

0000:04:01.0 Bridge: PLX Technology, Inc. PEX 8624 24-lane, 6-Port PCI Express Gen 2 (5.0 GT/s) Switch [ExpressLane]

0000:04:04.0 Bridge: PLX Technology, Inc. PEX 8624 24-lane, 6-Port PCI Express Gen 2 (5.0 GT/s) Switch [ExpressLane]

0000:04:05.0 Bridge: PLX Technology, Inc. PEX 8624 24-lane, 6-Port PCI Express Gen 2 (5.0 GT/s) Switch [ExpressLane]

0000:05:00.0 Mass storage controller: LSI Logic / Symbios Logic Dell PERC H200 Integrated [vmhba0]

0000:06:00.0 Serial bus controller: QLogic Corp. SP232-based 4Gb Fibre Channel to PCI Express HBA

0000:08:00.0 Serial bus controller: QLogic Corp. SP232-based 4Gb Fibre Channel to PCI Express HBA

0000:09:00.0 Bridge: PLX Technology, Inc. PEX 8624 24-lane, 6-Port PCI Express Gen 2 (5.0 GT/s) Switch [ExpressLane]

0000:0a:04.0 Bridge: PLX Technology, Inc. PEX 8624 24-lane, 6-Port PCI Express Gen 2 (5.0 GT/s) Switch [ExpressLane]

0000:0a:05.0 Bridge: PLX Technology, Inc. PEX 8624 24-lane, 6-Port PCI Express Gen 2 (5.0 GT/s) Switch [ExpressLane]

0000:0a:08.0 Bridge: PLX Technology, Inc. PEX 8624 24-lane, 6-Port PCI Express Gen 2 (5.0 GT/s) Switch [ExpressLane]

0000:0c:00.0 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic8]

0000:0c:00.1 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic9]

0000:0d:00.0 Bridge: Integrated Device Technology, Inc. [IDT] PES12T3G2 PCI Express Gen2 Switch

0000:0e:02.0 Bridge: Integrated Device Technology, Inc. [IDT] PES12T3G2 PCI Express Gen2 Switch

0000:0e:04.0 Bridge: Integrated Device Technology, Inc. [IDT] PES12T3G2 PCI Express Gen2 Switch

0000:0f:00.0 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic4]

0000:0f:00.1 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic5]

0000:10:00.0 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic6]

0000:10:00.1 Network controller: QLogic Corporation QLogic NetXtreme II BCM5709 1000Base-T [vmnic7]

0000:11:00.0 Serial bus controller: QLogic Corp ISP2432-based 4Gb Fibre Channel to PCI Express HBA [vmhba3]

0000:12:00.0 Serial bus controller: QLogic Corp ISP2432-based 4Gb Fibre Channel to PCI Express HBA [vmhba2]

0000:13:03.0 Display controller: Matrox Electronics Systems Ltd. MGA G200eW WPCM450

The offending devices are built into the motherboard itself, so I cannot reseat them or otherwise manipulate them in any sort of manner.

As far as I can tell there is no device connected to the PCI Root Port and to the PCI Express Switch. Is there a way for them to be disabled from BIOS (or in any other way for that matter).

The server works properly as far as I can tell (apart from the hangs on reboot and shutdown), but I am concerned if there is a risk to continue running on these servers?

0 Kudos
2 Replies
virtualg_uk
Leadership
Leadership

Hello

I am not sure if you can disable in the BIOS, as it;s not supported on the HCL I would be wary about running production workloads on this host.

Perhaps you could try upgrading the BIOS to the latest version to see if this helps with the errors?

http://www.dell.com/support/home/us/en/04/product-support/product/poweredge-r810/drivers


Graham | User Moderator | https://virtualg.uk
0 Kudos
zaspam
Enthusiast
Enthusiast

I did already upgrade all the available firmware, BIOS, IDRAC,Lifecycle controller...

I also read that probably need to disable the C1 and C states in bios... it didn't make any difference.

0 Kudos