VMware Cloud Community
ChristophHerdeg
Contributor
Contributor

Instable ESXi 5u1 on H8SGL with LSI SAS 9211-8i HBA

Valuable Colleagues,

On a ESXi whitebox I'm experiencing sever problems. At 1st the machine (equipped with an Opteron 6134 and 16GB unbuffered RAM) had stability problems, randomly freezing after 1 minute to 26 days of online time. So I today updated the H8SGL's BIOS from 1.00 to 2.00a and 2.00b (currently in testing). When disabling USB 2.0 ESXi 5 (and 5u1) installs and boots just fine.

I thought when already at updating, update the HBA's firmware also...from 7.00.00.00 to 13.00.57.00 - via esxcli and sas2flash (provided by LSI). Bad idea, as it seems:

After updating the firmware ESXi 5u1  (which boots from the raid-volume created off the controller-attached  disks just perfect) takes about 1-5 minutes to discover the raid-volume  as usable local storage (rescanning for devices in VIclient helps).   The same problem exists when trying to freshly install/update ESXi from  DVD: it takes about 5 minutes of again and again searching for usable  storage devices until the raid-volume is found.

The vmkernel.log says (pls check the bold lines at the bottom):

2012-05-12T15:30:09.744Z cpu4:2627)ScsiScan: 1098: Path 'vmhba1:C1:T0:L0': Vendor: 'LSI     '  Model: 'Logical Volume  '  Rev: '3000'
2012-05-12T15:30:09.744Z cpu4:2627)ScsiScan: 1101: Path 'vmhba1:C1:T0:L0': Type: 0x0, ANSI rev: 6, TPGS: 0 (none)
2012-05-12T15:30:09.748Z cpu4:2627)<6>Fusion MPT SAS Host:0:1:0:0 :: RAID10: handle(0x011e), wwid(0x018af2389964090a), pd_count(4), type(SATA)
2012-05-12T15:30:09.748Z cpu4:2627)<6>Fusion MPT SAS Host:0:1:0:0 :: qdepth(128), tagged(1), simple(1), ordered(0), scsi_level(7), cmd_que(1)
2012-05-12T15:30:09.748Z cpu4:2627)ScsiScan: 1582: Add path: vmhba1:C1:T0:L0
2012-05-12T15:30:09.749Z cpu4:2627)<6>mpt2sas0: Get SATA identify successfully for handle=0x9 with try_count=1
2012-05-12T15:30:09.750Z cpu4:2627)ScsiScan: 1098: Path 'vmhba1:C0:T0:L0': Vendor: 'ATA     '  Model: 'Hitachi HDS72101'  Rev: 'A5R0'
2012-05-12T15:30:09.750Z cpu4:2627)ScsiScan: 1101: Path 'vmhba1:C0:T0:L0': Type: 0x0, ANSI rev: 6, TPGS: 0 (none)
2012-05-12T15:30:09.752Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:0:0 :: SATA: handle(0x0009), sas_addr(0x4433221100000000), phy(0), device_name(0x5000cca37cc5d2f7)
2012-05-12T15:30:09.752Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:0:0 :: SATA: enclosure_logical_id(0x500605b0046ae6d0), slot(3)
2012-05-12T15:30:09.752Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:0:0 :: atapi(n), ncq(y), asyn_notify(n), smart(y), fua(y), sw_preserve(y)
2012-05-12T15:30:09.752Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:0:0 :: qdepth(32), tagged(1), simple(1), ordered(0), scsi_level(7), cmd_que(1)
2012-05-12T15:30:09.752Z cpu4:2627)WARNING: ScsiScan: 1485: Failed to add path vmhba1:C0:T0:L0 : Not found
2012-05-12T15:30:09.753Z cpu4:2627)<6>mpt2sas0: Get SATA identify successfully for handle=0xa with try_count=1
2012-05-12T15:30:09.754Z cpu4:2627)ScsiScan: 1098: Path 'vmhba1:C0:T1:L0': Vendor: 'ATA     '  Model: 'Hitachi HDS72101'  Rev: 'A5R0'
2012-05-12T15:30:09.754Z cpu4:2627)ScsiScan: 1101: Path 'vmhba1:C0:T1:L0': Type: 0x0, ANSI rev: 6, TPGS: 0 (none)
2012-05-12T15:30:09.756Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:1:0 :: SATA: handle(0x000a), sas_addr(0x4433221101000000), phy(1), device_name(0x5000cca37cc6e331)
2012-05-12T15:30:09.756Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:1:0 :: SATA: enclosure_logical_id(0x500605b0046ae6d0), slot(2)
2012-05-12T15:30:09.756Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:1:0 :: atapi(n), ncq(y), asyn_notify(n), smart(y), fua(y), sw_preserve(y)
2012-05-12T15:30:09.756Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:1:0 :: qdepth(32), tagged(1), simple(1), ordered(0), scsi_level(7), cmd_que(1)
2012-05-12T15:30:09.756Z cpu4:2627)WARNING: ScsiScan: 1485: Failed to add path vmhba1:C0:T1:L0 : Not found
2012-05-12T15:30:09.757Z cpu4:2627)<6>mpt2sas0: Get SATA identify successfully for handle=0xb with try_count=1
2012-05-12T15:30:09.758Z cpu4:2627)ScsiScan: 1098: Path 'vmhba1:C0:T2:L0': Vendor: 'ATA     '  Model: 'Hitachi HDS72101'  Rev: 'A5R0'
2012-05-12T15:30:09.758Z cpu4:2627)ScsiScan: 1101: Path 'vmhba1:C0:T2:L0': Type: 0x0, ANSI rev: 6, TPGS: 0 (none)
2012-05-12T15:30:09.761Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:2:0 :: SATA: handle(0x000b), sas_addr(0x4433221102000000), phy(2), device_name(0x5000cca37cc6c9a0)
2012-05-12T15:30:09.761Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:2:0 :: SATA: enclosure_logical_id(0x500605b0046ae6d0), slot(1)
2012-05-12T15:30:09.761Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:2:0 :: atapi(n), ncq(y), asyn_notify(n), smart(y), fua(y), sw_preserve(y)
2012-05-12T15:30:09.761Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:2:0 :: qdepth(32), tagged(1), simple(1), ordered(0), scsi_level(7), cmd_que(1)
2012-05-12T15:30:09.761Z cpu4:2627)WARNING: ScsiScan: 1485: Failed to add path vmhba1:C0:T2:L0 : Not found
2012-05-12T15:30:09.762Z cpu4:2627)<6>mpt2sas0: Get SATA identify successfully for handle=0xc with try_count=1
2012-05-12T15:30:09.763Z cpu4:2627)ScsiScan: 1098: Path 'vmhba1:C0:T3:L0': Vendor: 'ATA     '  Model: 'Hitachi HDS72101'  Rev: 'A5R0'
2012-05-12T15:30:09.763Z cpu4:2627)ScsiScan: 1101: Path 'vmhba1:C0:T3:L0': Type: 0x0, ANSI rev: 6, TPGS: 0 (none)
2012-05-12T15:30:09.765Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:3:0 :: SATA: handle(0x000c), sas_addr(0x4433221103000000), phy(3), device_name(0x5000cca37cc7b8ee)
2012-05-12T15:30:09.765Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:3:0 :: SATA: enclosure_logical_id(0x500605b0046ae6d0), slot(0)
2012-05-12T15:30:09.765Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:3:0 :: atapi(n), ncq(y), asyn_notify(n), smart(y), fua(y), sw_preserve(y)
2012-05-12T15:30:09.765Z cpu4:2627)<6>Fusion MPT SAS Host:0:0:3:0 :: qdepth(32), tagged(1), simple(1), ordered(0), scsi_level(7), cmd_que(1)
2012-05-12T15:30:09.765Z cpu4:2627)WARNING: ScsiScan: 1485: Failed to add path vmhba1:C0:T3:L0 : Not found
2012-05-12T15:30:09.765Z cpu4:2627)PCI: driver mpt2sas claimed device 0000:01:00.0
2012-05-12T15:30:09.765Z cpu4:2627)PCI: driver mpt2sas claimed 1 device
2012-05-12T15:30:09.765Z cpu4:2627)ScsiNpiv: 1525: GetInfo for adapter vmhba1, [0x4100080c9f40], max_vports=0, vports_inuse=0, linktype=0, state=0, failreason=0, rv=-1, sts=bad0020
2012-05-12T15:30:09.765Z cpu4:2627)Mod: 4015: Initialization of mpt2sas succeeded with module ID 31.
2012-05-12T15:30:09.765Z cpu4:2627)mpt2sas loaded successfully.
2012-05-12T15:30:09.965Z cpu2:2627)Loading module lvmdriver ...
2012-05-12T15:30:09.969Z cpu2:2627)Elf: 1862: module lvmdriver has license VMware
2012-05-12T15:30:09.970Z cpu5:2670)WARNING: LinuxSignal: 761: ignored unexpected signal flags 0x2 (sig 17)
2012-05-12T15:30:09.970Z cpu2:2627)LVM: 832: LVM max heap size: 43008KB
2012-05-12T15:30:09.970Z cpu2:2627)FDS: 386: lvm
2012-05-12T15:30:09.970Z cpu2:2627)Mod: 4015: Initialization of lvmdriver succeeded with module ID 32.
2012-05-12T15:30:09.970Z cpu2:2627)lvmdriver loaded successfully.
2012-05-12T15:30:09.995Z cpu2:2627)Loading module deltadisk ...
2012-05-12T15:30:09.998Z cpu2:2627)Elf: 1862: module deltadisk has license VMware
2012-05-12T15:30:10.003Z cpu2:2627)FDS: 386: deltadisks
2012-05-12T15:30:10.003Z cpu2:2627)Mod: 4015: Initialization of deltadisk succeeded with module ID 33.
2012-05-12T15:30:10.003Z cpu2:2627)deltadisk loaded successfully.
2012-05-12T15:30:10.021Z cpu2:2627)Loading module multiextent ...
2012-05-12T15:30:10.024Z cpu2:2627)Elf: 1862: module multiextent has license VMware
2012-05-12T15:30:10.025Z cpu2:2627)FDS: 386: multiextent
2012-05-12T15:30:10.025Z cpu2:2627)Mod: 4015: Initialization of multiextent succeeded with module ID 34.
2012-05-12T15:30:10.025Z cpu2:2627)multiextent loaded successfully.
2012-05-12T15:30:14.067Z cpu2:2627)ScsiClaimrule: 2352: Enabling claimrules for MP plugins.
2012-05-12T15:30:14.068Z cpu2:2627)ScsiPath: 4541: Plugin 'NMP' claimed path 'vmhba1:C1:T0:L0'
2012-05-12T15:30:14.068Z cpu2:2627)ScsiScan: 638: Path 'vmhba1:C1:T0:L0': Failed to read VPD Serial id page: Not supported
2012-05-12T15:30:14.068Z cpu2:2627)WARNING: ScsiScan: 645: Path 'vmhba1:C1:T0:L0': Possible LUN change?  changed from supporting to not supporting VPD Serial ID page
2012-05-12T15:30:14.068Z cpu2:2627)ALERT: NMP: vmk_NmpVerifyPathUID:1166:The physical media represented by device Unregistered Device (path vmhba1:C1:T0:L0) has changed. If this is a data LUN, this is a critical error. Detected UID  
2012-05-12T15:30:14.068Z cpu2:2627)WARNING: NMP: nmpPathClaimEnd:1195:Device, seen through path vmhba1:C1:T0:L0 is not registered (no active paths)

Before updating the controller, the raid-volume was discovered  instantly. What the heck could I do to render the system completely usable again? One consequence of this quite annoying problem is, that I can't AutoStart the VMs on the host...

Please help, I'm quite desparate by now...I'll provide anything you need...

Regards,

Chris

Reply
0 Kudos
2 Replies
ChristophHerdeg
Contributor
Contributor

Anybody?

Reply
0 Kudos
aakalan
Enthusiast
Enthusiast

Hi Chris,

Did you solve this ?

Reply
0 Kudos