We had a memory module die in one of our ESX servers a little while back. The memory was replaced and all is well. However, having cleared the error shown on the Alarms tab for the host, it comes back again within a few hours. Checking the Hardware Status tab, shows "1 alert" on the Memory heading but drilling down into the status for the individual memory modules is showing normal. The IMM (this is an IBM server) also shows that all memory is functioning normally.
I've tried doing Reset Sensors and even removed the host from the cluster and re-added it. Still, the memory status alarm recurs within hours...
Can anyone suggest how I can clear this permanently?
cheers
Dave
Did you ever find an answer on this as I have the same issue?
Wayne
Just to follow up on this, the problem kind of went away.... I acknowledge/cleared the error (probably for the 4th or 5th time) and it didn't come back.
It's not a nice answer, it would have been much better to have had a solution for it but I'm still glad the problem is gone!
Dave