No,we use bl460c gen8 on on 2015 fw, and we have VR.
I have Dell M630 blade servers running current firmware and received this PSOD on the first upgraded machine two hours after installing ESXi 6.7.0 Build 8941472 (Dell specific ISO - VMware-VMvisor-Installer-6.7.0-8941472.x86_64-DellEMC_Customized-A02.iso). I stopped the upgrade of the rest of the ESXi cluster nodes. They run fine under 6.5.0 Build 7388607. The host in question was replicating a VM to our disaster recovery site during the crash. The VM in question was stuck and did not free up until the host was rebooted.
Based on the functions reported on the PSOD screen, looks like the issue is in the vSphere Replication stack. You may want to pause the replication in case it is happening too frequently. In the meantime, engage VMware support to get further assistance on this.
Please consider marking this answer as "correct" or "helpful" if you think your questions have been answered.
"Engineering is currently investigating PSODs with the exact same trace that is occuring due to a HBR module (OEM Provided - Qlogic, in this case)."
I've updated my NIC drivers as suggested by support and have continued testing a couple of my hosts. These hosts are now running ESXi 6.7.0 build 9214924.
what driver version do you have? we had same problem on 9214924. drv version 1.713.30.v60.6
In our case the upgrade left an older QLogic driver on our hosts - qfle3 version 18.104.22.168. I installed VMware ESXi 6.7 Driver CD for QLogic Network/iSCSI/FCoE Driver Set from the Drivers and Tools tab which contains qfle3 22.214.171.124. This is the second day of testing but so far no PSODs. Check your NICs against the VMware compatibility list:
SSH into your ESXi host
esxcli network nic list #List your NICs
esxcli network nic get -n vmnic0 #Get your software and firmware versions for each NIC
Firmware Version: FW: 126.96.36.199 BC: 7.10.39
This is a known issue with ESXi 6.7.
Symptom: An ESXi host might fail with the error PANIC bora/vmkernel/main/dlmalloc.c:4924 - Usage error in dlmalloc
When you replicate virtual machines using the VMware Site Replication Manager hbr_filter with vSphere Replication, the ESXi host might fail with purple screen immediately or within 24 hours and reports the error: PANIC bora/vmkernel/main/dlmalloc.c:4924 - Usage error in dlmalloc.
This issue is resolved in vSphere 6.7 EP5 - ESXi670-201811001
We published the following KB https://kb.vmware.com/s/article/55650.
Hmm this still isnt fixed....the issue i have is looking at the date this was first reported, and im on the phone to VMware now with no fix still....that is a long time to have a critical issue like this!
Your VMware not some back street company knocking our a hypervisor...you make us pay through the nose for the product but at the moment its not fit for purpose!
It seems that EP5 doesn't help me. I'm with ESXi 6.7 U2, build 13006603 and I'm still getting the same PSOD.
Am I missing something?
Thank you in advance!
I am having the same issue with 6.7 U2
in my case the issue was caused by the qfle3 driver and I see a very similar situation in your PSOD.
check for similar errors in the logs -
ql_fcoe:vmhba64:PortLoginCallback:1218:Info: PortLoginCallback: Sess 0x430e95197000 000000 PLOGI timeout or cancel
*The PLOGI timeout indicates that the HBA does not get a reply from the storage after sending a PLOGI
for me - the storage was OK and next step was to check the Driver / FW
- here again - everything was fine as the HCL
and our vendor told me for some known issues caused by the qfle3 driver and as workaround was to revert to bnx2x driver.
so far everything is stable.
hope that will help you!