peetz
Leadership
Leadership

Greetings all,

I'm late to the party, but I want to share with you an alternate method to detect errors reported by LSI MegaRAID adapters.

I also noticed that the LSI CIM provider is no longer working with ESXi 6.7 and looked in the ESXi host log files for other useful information.

In /var/log/vmkernel.log you can find messages like this one

2018-12-16T13:42:23.459Z cpu0:2097613)<6>megasas_hotplug_work[1]: event code 0x0023

You can look up the meaning of the event codes in the LSI User Guides, but I found a helpful list readily compiled here:

MegaRAID Event Messages – Thomas-Krenn-Wiki

According to this list code 23 means that the scheduled Patrol read has completed, so this is a pretty harmless and expected message.

However, if disk or controller errors etc. are detected then other events will be generated.

I'm forwarding my host logs to vRealize Log Insight (you could also use another free syslog server) and have set up a custom alert there that sends me an e-mail when this message appears with an unexpected event code.

This works well for my "LSI MegaRAID SAS 9260-4i" adapter and should also work for other controllers using the megaraid_sas driver, but your mileage may vary.

I hope this helps

Andreas

Twitter: @VFrontDe, @ESXiPatches | https://esxi-patches.v-front.de | https://vibsdepot.v-front.de
Reply
0 Kudos