VMware Cloud Community
zeroNorth
Contributor
Contributor

HP Smart Array P420i warning

Hi,

I got a brand new DL380p gen8 Server with 8 300GB SAS disks in 2 raid 5, and installed esxi from the branded image: VMware-ESXi-5.1.0-799733-HP-5.34.23.iso on the first raid.

Everything seems to be working fine but I get a warning for the Smart Array P420i controller in the Hardware Health Status.

I checked ILO and everything is ok there. I have uploaded the machines Active Health System log to HP and they say everything is working properly.

Has anyone else had a similar Problem?

rgds,

franz

0 Kudos
68 Replies
JonBelanger2011
Enthusiast
Enthusiast

I am also getting the error running ESXi 5.1.0 U1 (1065491) on a ProLiant DL360p Gen8 ..

vmarrayalerts.JPG

I checked and I believe I have the latest VIBs ...

hpvibs.JPG

I just installed 3 servers of the same model and they are all showing the same alerts... I have 4 more to install this week and I would like to get rid of this false warning... Any ideas before I open a call with HP?

0 Kudos
CPSWernerSt
Enthusiast
Enthusiast

Sehr geehrter Kunde/Geschäftspartner/Kollege!

Ich bin leider nicht erreichbar. Bitte wenden Sie sich in dringenden Fällen an unsere Supportline unter Tel: 02236 315325 800 mailto:ecall@cpservices.at oder für unsere Vertragskunden

an unseren Helpdesk unter Tel: 02236 315325 2525 mailto:msphelpdesk@cpservices.at

Mit freundlichen Grüßen Werner Steinegger

0 Kudos
abelliot
Contributor
Contributor

Je suis absent jusqu'au 19/08.

En cas d'urgence, merci de prendre contact avec notre équipe au 02.41.69.66.67

0 Kudos
MKguy
Virtuoso
Virtuoso

hp-smx-provider 500.03.02.10.4 is indeed the April version which fixed the issue. Seems like your problem is actually related to the RAID controller battery. In the issue caused by the older bundle, only the controller device showed a warning but not the battery. Maybe the battery on your new servers isn't fully charged yet or something like that (or are you using flash cache?)

Execute the following on the shell of an ESXi host and post the output:

/opt/hp/hpacucli/bin/hpacucli ctrl all show status

/opt/hp/hpacucli/bin/hpacucli ctrl all show config detail

You could also boot a server into the embedded HP array configuration utility or RAID controller BIOS to check for any warnings or info on the battery status.

Also dump the hardware status info into a file with this:

/usr/lib/vmware/vm-support/bin/swfw.sh > /tmp/swfw

And search the file for sections containing "HPSA" or "P420i"

-- http://alpacapowered.wordpress.com
0 Kudos
JonBelanger2011
Enthusiast
Enthusiast

Thanks for the quick reply!

Here's the output :

/opt/hp/hpacucli/bin/hpacucli ctrl all show status

Smart Array P420i in Slot 0 (Embedded)

   Controller Status: OK

   Cache Status: OK

   Battery/Capacitor Status: OK

/opt/hp/hpacucli/bin/hpacucli ctrl all show config detail

Smart Array P420i in Slot 0 (Embedded)

   Bus Interface: PCI

   Slot: 0

   RAID 6 (ADG) Status: Disabled

   Controller Status: OK

   Hardware Revision: B

   Firmware Version: 3.42

   Rebuild Priority: Medium

   Expand Priority: Medium

   Surface Scan Delay: 15 secs

   Surface Scan Mode: Idle

   Queue Depth: Automatic

   Monitor and Performance Delay: 60  min

   Elevator Sort: Enabled

   Degraded Performance Optimization: Disabled

   Inconsistency Repair Policy: Disabled

   Wait for Cache Room: Disabled

   Surface Analysis Inconsistency Notification: Disabled

   Post Prompt Timeout: 0 secs

   Cache Board Present: True

   Cache Status: OK

   Cache Ratio: 10% Read / 90% Write

   Drive Write Cache: Disabled

   Total Cache Size: 1024 MB

   Total Cache Memory Available: 816 MB

   No-Battery Write Cache: Disabled

   Cache Backup Power Source: Capacitors

   Battery/Capacitor Count: 1

   Battery/Capacitor Status: OK

   SATA NCQ Supported: True

   Spare Activation Mode: Activate on drive failure

   Controller Temperature (C): 64

   Cache Module Temperature (C): 31

   Capacitor Temperature  (C): 21

   Array: A

      Interface Type: SAS

      Unused Space: 0  MB

      Status: OK

      Array Type: Data

      Logical Drive: 1

         Size: 136.7 GB

         Fault Tolerance: 1

         Heads: 255

         Sectors Per Track: 32

         Cylinders: 35132

         Strip Size: 256 KB

         Full Stripe Size: 256 KB

         Status: OK

         Caching:  Enabled

         Unique Identifier: 600508B1001CD3F872C16AB5778A7BB9

         Disk Name: vmhba1:C0:T0:L1

         Mount Points: None

         Logical Drive Label: A12FF04050014380259E3E8053E2

         Mirror Group 0:

            physicaldrive 1I:1:1 (port 1I:box 1:bay 1, SAS, 146 GB, OK)

         Mirror Group 1:

            physicaldrive 1I:1:2 (port 1I:box 1:bay 2, SAS, 146 GB, OK)

         Drive Type: Data

         Caching Association: None

      physicaldrive 1I:1:1

         Port: 1I

         Box: 1

         Bay: 1

         Status: OK

         Drive Type: Data Drive

         Interface Type: SAS

         Size: 146 GB

         Rotational Speed: 15000

         Firmware Revision: HPD3

         Serial Number: 6XM27DED0000B330D6L4

         Model: HP      EH0146FBQDC    

         Current Temperature (C): 23

         Maximum Temperature (C): 27

         PHY Count: 2

         PHY Transfer Rate: 6.0Gbps, Unknown

         Drive Authentication Status: OK

         Carrier Application Version: 11

         Carrier Bootloader Version: 6

      physicaldrive 1I:1:2

         Port: 1I

         Box: 1

         Bay: 2

         Status: OK

         Drive Type: Data Drive

         Interface Type: SAS

         Size: 146 GB

         Rotational Speed: 15000

         Firmware Revision: HPD3

         Serial Number: 6XM27DLS0000B330DD59

         Model: HP      EH0146FBQDC    

         Current Temperature (C): 25

         Maximum Temperature (C): 30

         PHY Count: 2

         PHY Transfer Rate: 6.0Gbps, Unknown

         Drive Authentication Status: OK

         Carrier Application Version: 11

         Carrier Bootloader Version: 6

   Array: B

      Interface Type: SAS

      Unused Space: 0  MB

      Status: OK

      Array Type: Data

      Logical Drive: 2

         Size: 1.2 TB

         Fault Tolerance: 5

         Heads: 255

         Sectors Per Track: 32

         Cylinders: 65535

         Strip Size: 256 KB

         Full Stripe Size: 768 KB

         Status: OK

         Caching:  Enabled

         Parity Initialization Status: Initialization Completed

         Unique Identifier: 600508B1001C34511D9A303B90949B11

         Disk Name: vmhba1:C0:T0:L2

         Mount Points: None

         Logical Drive Label: A24115E350014380259E3E806AB1

         Drive Type: Data

         Caching Association: None

      physicaldrive 1I:1:3

         Port: 1I

         Box: 1

         Bay: 3

         Status: OK

         Drive Type: Data Drive

         Interface Type: SAS

         Size: 450 GB

         Rotational Speed: 10000

         Firmware Revision: HPD5

         Serial Number: 6XQ1CXFK0000B244AXCT

         Model: HP      EG0450FBLSF    

         Current Temperature (C): 23

         Maximum Temperature (C): 28

         PHY Count: 2

         PHY Transfer Rate: 6.0Gbps, Unknown

         Drive Authentication Status: OK

         Carrier Application Version: 11

         Carrier Bootloader Version: 6

      physicaldrive 1I:1:4

         Port: 1I

         Box: 1

         Bay: 4

         Status: OK

         Drive Type: Data Drive

         Interface Type: SAS

         Size: 450 GB

         Rotational Speed: 10000

         Firmware Revision: HPD5

         Serial Number: 6XQ1CX4G0000B244BAXX

         Model: HP      EG0450FBLSF    

         Current Temperature (C): 24

         Maximum Temperature (C): 28

         PHY Count: 2

         PHY Transfer Rate: 6.0Gbps, Unknown

         Drive Authentication Status: OK

         Carrier Application Version: 11

         Carrier Bootloader Version: 6

      physicaldrive 2I:1:5

         Port: 2I

         Box: 1

         Bay: 5

         Status: OK

         Drive Type: Data Drive

         Interface Type: SAS

         Size: 450 GB

         Rotational Speed: 10000

         Firmware Revision: HPD5

         Serial Number: 6XQ1CJN00000B243F1F5

         Model: HP      EG0450FBLSF    

         Current Temperature (C): 23

         Maximum Temperature (C): 28

         PHY Count: 2

         PHY Transfer Rate: 6.0Gbps, Unknown

         Drive Authentication Status: OK

         Carrier Application Version: 11

         Carrier Bootloader Version: 6

      physicaldrive 2I:1:6

         Port: 2I

         Box: 1

         Bay: 6

         Status: OK

         Drive Type: Data Drive

         Interface Type: SAS

         Size: 450 GB

         Rotational Speed: 10000

         Firmware Revision: HPD5

         Serial Number: 6XQ1CGK60000B242FBUZ

         Model: HP      EG0450FBLSF    

         Current Temperature (C): 24

         Maximum Temperature (C): 29

         PHY Count: 2

         PHY Transfer Rate: 6.0Gbps, Unknown

         Drive Authentication Status: OK

         Carrier Application Version: 11

         Carrier Bootloader Version: 6

   SEP (Vendor ID PMCSIERA, Model SRCv8x6G) 380

      Device Number: 380

      Firmware Version: RevB

      WWID: 50014380259E3E8F

      Vendor ID: PMCSIERA

      Model: SRCv8x6G

/usr/lib/vmware/vm-support/bin/swfw.sh > /tmp/swfw

/tmp # cat swfw | grep hpsa

VMware_ComponentSoftwareIdentity.InstanceID="VMware:Hewlett-Packard_bootbank_scsi-hpsa_5.0.0-40OEM.500.0.0.472560"

                    InstanceID = VMware:Hewlett-Packard_bootbank_scsi-hpsa_5.0.0-40OEM.500.0.0.472560

                       Caption = scsi-hpsa

                   ElementName = scsi-hpsa

0 Kudos
MKguy
Virtuoso
Virtuoso

I really don't see anything wrong here that would explains the status of your controller/battery. Did you install these servers with the 5.1 U1 HP ISO?

Have you tried resetting sensors and updating info in the Hardware status tab?

You can update the p420i firmware to 3.54 btw, but I doubt it matters in this case:

http://www.hp.com/swpublishing/MTX-73544a7eae7b47aebde3d43402

Parse the swfw output like this:

grep SMX_SAFirmware -A20 /tmp/swfw

-- http://alpacapowered.wordpress.com
0 Kudos
JonBelanger2011
Enthusiast
Enthusiast

The alarms on 2 of the affected hosts just cleared themselves... I am still having the alarm on the most recently installed host... I'll wait and see if it clears itself.  It says the controller battery is partially charged.. Perhaps it takes a while to charge?

0 Kudos
MKguy
Virtuoso
Virtuoso

Yeah, it can take a while until the RAID controller batteries are charged. Just give it some time then.

-- http://alpacapowered.wordpress.com
0 Kudos
JonBelanger2011
Enthusiast
Enthusiast

Thanks a lot!

0 Kudos