<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:clearspace="http://www.jivesoftware.com/xmlns/clearspace/rss" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>VMware Communities: Message List - Big Problems...Whole system down</title>
    <link>http://communities.vmware.com/community/vmtn/vi/esx3.5?view=discussions</link>
    <description>Most recent forum messages</description>
    <language>en</language>
    <pubDate>Mon, 14 Apr 2008 16:47:28 GMT</pubDate>
    <generator>Clearspace 1.10.12 (http://jivesoftware.com/products/clearspace/)</generator>
    <dc:date>2008-04-14T16:47:28Z</dc:date>
    <dc:language>en</dc:language>
    <item>
      <title>Re: Big Problems...Whole system down</title>
      <link>http://communities.vmware.com/message/914595?tstart=0#914595</link>
      <description>Looks like we may have configuration update issues.  When we went in to CHAP access to change the config, we got an "unable to update settings" message.  The engineer is puzzled.</description>
      <pubDate>Mon, 14 Apr 2008 16:47:28 GMT</pubDate>
      <author>TylerApache</author>
      <guid>http://communities.vmware.com/message/914595?tstart=0#914595</guid>
      <dc:date>2008-04-14T16:47:28Z</dc:date>
      <clearspace:dateToText>1 year, 7 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: Big Problems...Whole system down</title>
      <link>http://communities.vmware.com/message/914353?tstart=0#914353</link>
      <description>Having delt with VMWare so frequently on different issues, often times they suggest to me blowing the machines away from Virtual Center and reregistering.  Having reread your original problem, though, I would tend to agree with wondab that you may need to unpresent/re-present LUNS.</description>
      <pubDate>Mon, 14 Apr 2008 15:08:58 GMT</pubDate>
      <author>IB_IT</author>
      <guid>http://communities.vmware.com/message/914353?tstart=0#914353</guid>
      <dc:date>2008-04-14T15:08:58Z</dc:date>
      <clearspace:dateToText>1 year, 7 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: Big Problems...Whole system down</title>
      <link>http://communities.vmware.com/message/914320?tstart=0#914320</link>
      <description>In the queue at the moment...will ask when I get them online.</description>
      <pubDate>Mon, 14 Apr 2008 15:05:08 GMT</pubDate>
      <author>TylerApache</author>
      <guid>http://communities.vmware.com/message/914320?tstart=0#914320</guid>
      <dc:date>2008-04-14T15:05:08Z</dc:date>
      <clearspace:dateToText>1 year, 7 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
    </item>
    <item>
      <title>Re: Big Problems...Whole system down</title>
      <link>http://communities.vmware.com/message/914330?tstart=0#914330</link>
      <description>I did not unpresent/represent and rescan.</description>
      <pubDate>Mon, 14 Apr 2008 15:00:49 GMT</pubDate>
      <author>TylerApache</author>
      <guid>http://communities.vmware.com/message/914330?tstart=0#914330</guid>
      <dc:date>2008-04-14T15:00:49Z</dc:date>
      <clearspace:dateToText>1 year, 7 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: Big Problems...Whole system down</title>
      <link>http://communities.vmware.com/message/914319?tstart=0#914319</link>
      <description>We had a similar issue with a power outage.  Our hosts were brought up and online before the SAN had a chance to power on, although we did not experience the same thing you are...seems your issue is a bit more drastic.   In our case all the VM's were inaccessible or orphaned.  It sounds like your hosts are still a bit confused as to who is accessing what VM...I wonder if you could try to "remove from inventory" everything from your cluster and then add back into inventory...your hosts included.  If you have an open case with VMWare, see what they think about that.</description>
      <pubDate>Mon, 14 Apr 2008 14:59:41 GMT</pubDate>
      <author>IB_IT</author>
      <guid>http://communities.vmware.com/message/914319?tstart=0#914319</guid>
      <dc:date>2008-04-14T14:59:41Z</dc:date>
      <clearspace:dateToText>1 year, 7 months ago</clearspace:dateToText>
      <clearspace:replyCount>3</clearspace:replyCount>
    </item>
    <item>
      <title>Re: Big Problems...Whole system down</title>
      <link>http://communities.vmware.com/message/914323?tstart=0#914323</link>
      <description>&lt;br /&gt;
What was the fist thing you did after the EMC rebooted? Did you unpresent and represent the LUNs and rescan? &lt;br /&gt;
&lt;p /&gt;
There is a good thread explaining reservation conflicts due to the simultaneous altering of metadata. &lt;br /&gt;
&lt;p /&gt;
&lt;a class="jive-link-thread" href="http://communities.vmware.com/thread/80561"&gt;http://communities.vmware.com/thread/80561&lt;/a&gt; &lt;br /&gt;
&lt;p /&gt;
&lt;p /&gt;
&lt;p /&gt;
&lt;br /&gt;</description>
      <pubDate>Mon, 14 Apr 2008 14:50:44 GMT</pubDate>
      <author>wondab</author>
      <guid>http://communities.vmware.com/message/914323?tstart=0#914323</guid>
      <dc:date>2008-04-14T14:50:44Z</dc:date>
      <clearspace:dateToText>1 year, 7 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Big Problems...Whole system down</title>
      <link>http://communities.vmware.com/message/914262?tstart=0#914262</link>
      <description>Two Dell PE2950, 32GB RAM, ESX 3.0.2&lt;br /&gt;
&lt;br /&gt;
EMC NS40 SAN presenting iSCSI&lt;br /&gt;
&lt;p /&gt;
Last Friday we experienced a power failure that caused our SAN to reboot. When I began checking machines that morning things seemed fine but shortly after I began to notice some oddities. Suddenly one of my VMs wouldn't reboot. Another went BSOD. Before I knew it, all of my VMs were corrupt. Most would not make it to a lgoin propmt. Others would BSOD at login. Still others wouldn't boot at all. &lt;br /&gt;
&lt;p /&gt;
I spent the next 26 hrs on the phone with EMC and VMware trying to figure out what was wrong. VMware says everything is OK. EMC says the storage is fine. Ugh!! As a last ditch effort to save the VMs, we cloned one of the BSOD VMs to local storage and it booted perfectly. Ah HA!! I blew away one of the LUNS on the NS40, recreated it and re-presented it to the hosts. When I cloned machines over to it they booted fine. We only lost two machines due to drastic Windows repair efforts. Happy ending right...not so fast.&lt;br /&gt;
&lt;p /&gt;
The next morning after letting the machines settle over night, I came in to begin the process of restoring order. It didn't take long to realize that the machines were beginning to exhibit the same corruption as before. The only machines that have come back uncorrupted and have stayed that way are the ones that I moved to the local storage. Now EMC is saying that they are seeing reservation errors on the SAN. Apparently this happens when two hosts attempt to access the LUN at the same time. I'm waiting on a return call from VMware at the moment. Help!!</description>
      <pubDate>Mon, 14 Apr 2008 14:03:50 GMT</pubDate>
      <author>TylerApache</author>
      <guid>http://communities.vmware.com/message/914262?tstart=0#914262</guid>
      <dc:date>2008-04-14T14:03:50Z</dc:date>
      <clearspace:dateToText>1 year, 7 months ago</clearspace:dateToText>
      <clearspace:replyCount>6</clearspace:replyCount>
    </item>
  </channel>
</rss>

