VMware Cloud Community
Dropnet
Contributor
Contributor

raw disks - smart/temp settings & alerts

hi,

with vsphere 5.5 ànd using RAW disks my concern is to follow up the smart status

there are some documents that redirect you to the syslog page where i could find smart details like :

2015-01-22T20:02:08Z smartd: [warn] naa.*********b: above TEMPERATURE threshold (176 > 0)

2015-01-22T21:02:09Z smartd: [warn] naa.*********b: above TEMPERATURE threshold (171 > 0)

2015-01-23T06:32:41Z smartd: [warn] naa.*********b: above TEMPERATURE threshold (253 > 0)

2015-01-23T09:32:42Z smartd: [warn] naa.*********b: above TEMPERATURE threshold (181 > 0)

i do not know if there are any better tools to report on current disk status..

first of all those values here are fahrenheit i believe but 253 i impossible ..... 
even it was a glitch value like 170 (=76°C) would be impossible

second i prefer to have to be alerted when certain values  exceed a certain threshold.

would this require me to run some vcenter or something like that or are there alternatives ?

0 Kudos
2 Replies
Dropnet
Contributor
Contributor

some further checking:

getting SMART from a regular disk on a server SATA port (my sata controller is defined by Vsphere as "Wellsburg raid controller") runs fine

but the value is still questionable.
i do wish to know if additional tools or plugins would be usefull.

# esxcli storage core device smart get -d t10.ATA_____Hitachi_HDS723030ALA640_______________________xxxxxxxxxx

Parameter                     Value  Threshold  Worst

----------------------------  -----  ---------  -----

Health Status                 OK     N/A        N/A

Media Wearout Indicator       N/A    N/A        N/A

Write Error Count             N/A    N/A        N/A

Read Error Count              100    16         100

Power-on Hours                99     0          99

Power Cycle Count             100    0          100

Reallocated Sector Count      100    5          100

Raw Read Error Rate           100    16         100

Drive Temperature             214    0          214   <------i assume this is 214°F = 101°C which is impossible...

...                                                                         or i may have to substract 100 from the value so : 214-100=114°F =45°C which is more acceptable

a device on my sas 2308 controller does not pass smart info

if something goes wrong i must assume my sas controller will warn me 'somehow' ...

# esxcli storage core device smart get -d naa.xxxxxxxxxx

Error getting Smart Parameters: CANNOT open device


if this question would fit better into another section by all means feel free to move it

any additional information how small is appreciated.

0 Kudos
benhadad
Contributor
Contributor

I noticed the same thing,  My ssd are reporting 37 & 44 but my regular HDD are saying 139,140.  The 37 & 44 for ssd seems reasonable as Celcius as the ambient temperature should be around 45. But 140 C seems high.

0 Kudos