I am running a whitebox and when I first got it up and running, it was running flawless for weeks. It eventually started locking up and becoming absolutely unresponsive. I can't ping the host IP, I can't log in with VClient, and I can't even get the console to respond to keyboard inputs.
I first thought that I had too many VM's running on it, so I looked at the historical performance graphs in VCenter Server (Eval Version) and it was all low usage of everything (RAM, CPU, Disk). I have upto 5 VM's running (4 2008 64bit and 1 XP 32bit). I started moving the VM's to a VMServer and had it down to 1 VM on the ESXi host. I dont know if I was impatiant but it seemed to run fine for 5+ days so I started moving VM's back to ESXi one by one (using the free converter). It seemed to run fine for a bit, and now it's back to locking up about once every day or two. It had at one time gotten where it was locking up multiple times per day.
My next step was I thought maybe having the two network ports plugged in, something was getting confused as to which interface was used for management without actually doing any additional configuration on the Vswitch. Although I setup the one NIC as the management interface, I figured I try unplugging the second NIC. It's still locking up
I found one other thread about the same type of symptoms, but that persons problems seem to be locking up every couple of hours, at a minimum. I'll go anywehre from 12 hours to 2-3/4 days. My VM's are not resource intensive by any means. I have an AD server with one user, two Exchange servers with one mailbox, and an XP machine doing some media sharing to my network
My hardware is as follows:
Supermicro C2SEA
Xeon X3360
8GB RAM (if someone wants to know specifically what, I can find out)
Intel Pro gigabit PCIx NIC
LSI 3Ware 9690 Raid card
***5 1TB HDD's in raid5 with two volumes configured on that raid to split it in half
***I had to use the update host utility to install the 3ware drivers from their website
***I have all VM's installed on one of the two volumes, the second volume is empty
74GB 10kRPM WD HDD - this is where ESXi is installed, nothing else is on this datastore
I have not done anything much different than default when setting up ESXi or the guests or the network. I have also not done anything for resource allocation because I have not taken the time to learn how to set that up correctly yet. I also have the host attached to an eval version of VCenter Server.
I have looked through the logs and nothing shows up. It doesnt even recognize the fact that it's become unresponsive. The only way to pull it out of this "state" is by doing a hard reset. I also lose all connectivity to the VM guests too.
If anyone has any ideas, maybe I missed something or there is a known issue with one of the pieces of hardware I am using, I'd greatly appreciate any guidance.