<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:clearspace="http://www.jivesoftware.com/xmlns/clearspace/rss" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>VMware Communities: Message List - ESX 3.0.1 - Linux Guests go ReadOnly</title>
    <link>http://communities.vmware.com/community/vmtn/general/vm-guest?view=discussions</link>
    <description>Most recent forum messages</description>
    <language>en</language>
    <pubDate>Sun, 09 Mar 2008 06:19:34 GMT</pubDate>
    <generator>Clearspace 1.10.12 (http://jivesoftware.com/products/clearspace/)</generator>
    <dc:date>2008-03-09T06:19:34Z</dc:date>
    <dc:language>en</dc:language>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/881727?tstart=0#881727</link>
      <description>Well.. now I'm posting again to the thread that just will not die.&lt;br /&gt;
&lt;br /&gt;
I recently switched from running our production SAN on IET (Iscsi Enterprise Target) to the commercial Wasabi Systems iSCSI target. My production VMs are a handful of Centos 4.6 boxes, running the latest 2.6.9-67.0.4.EL kernels. It was my understanding that this contained patches that would prevent the ext3_fs aborts and resulting read-only issues.&lt;br /&gt;
&lt;br /&gt;
I reviewed the code in the centos repository and saw that the Fusion Drivers had been updated to version "3.02.99.00rh". This appears to include the changes to the SCSI mid-layer to support conditional requeue of the timeouts. This patch appears to have gone mainline here: &lt;a class="jive-link-external" href="http://www.mail-archive.com/linux-scsi@vger.kernel.org/msg05338.html"&gt;http://www.mail-archive.com/linux-scsi@vger.kernel.org/msg05338.html&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
After making the change to the new SAN, and updating the kernels on my Centos box, all looked good a while. Then we started getting 5-6 instances of boxes going ReadOnly a day. So I storage vmotioned everything off of the SAN and spent another week testing things to be absolutely sure that the SAN was "clean". I've not had any issues w/ Windows box, and I've checked every inch of the Network (switch ports, duplex settings, error levels, collisions etc..). I can get great performance from the SAN w/ very minimal latency. One caveat, is that I am using Vmware's Round Robin Load balancing w/ a pair of NICs on the servers, so I'm not sure if this plays into the issue.&lt;br /&gt;
&lt;br /&gt;
Basically, I'm stumped on this one. Any suggestions? It looks like a SCSI timeout, but I can't see anything in the Vmware logs or the kernel logs to indicate it. The network is CLEAN and SOLID.. I can achieve 150MB / second sustained reads/writes from the SAN using the 2 Gig-E nics. When I was running on IET, I had NO such issues. The difference here seems to the be Wasabi target. Is there something that I'm overlooking?&lt;br /&gt;
&lt;br /&gt;
P.S. And yes, this is ESX 3.5</description>
      <pubDate>Sun, 09 Mar 2008 06:19:34 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/881727?tstart=0#881727</guid>
      <dc:date>2008-03-09T06:19:34Z</dc:date>
      <clearspace:dateToText>1 year, 8 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/876399?tstart=0#876399</link>
      <description>I've taken to just running Tom's patched driver on all Centos 4 boxes. That seems to be the only thing that works for me. Still having a few issues w/ Centos 5 as well.</description>
      <pubDate>Mon, 03 Mar 2008 14:26:58 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/876399?tstart=0#876399</guid>
      <dc:date>2008-03-03T14:26:58Z</dc:date>
      <clearspace:dateToText>1 year, 8 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/850792?tstart=0#850792</link>
      <description>&lt;br /&gt;
I guess this is the thread that just won't die! &lt;img class="jive-emoticon" border="0" src="http://communities.vmware.com/images/emoticons/happy.gif" alt=":)" /&gt;&lt;br /&gt;
&lt;p /&gt;
I'm running a pre-production, test environment in my lab consisting of HCL SuperMicro boxes connected to a Wasabi Systems iSCSI SAN box. The purpose of this testing is to really put the Vmware multipath round-robin load balancing under the microscope and see how it performs.&lt;br /&gt;
&lt;p /&gt;
My test bed consists of 2 SuperMicro boxes w/ 2 x Quad Core Intel boxes each connected via Gig-E iSCSI to a dedicated target NIC on the Wasabi box. The Wasabi box has 16 1 TB Sata drives configured in RAID-10 (it's a screamer!).&lt;br /&gt;
&lt;p /&gt;
I am running 32 bit Centos 4 (latest release) VMs inside of the Vmware infrastructure. Kernel is the latest available from Centos (2.6.9-67.0.1.EL). Each VM is running w/ a single processor and 256 megs of memory, w/ a 4 Gig filesystem. I run 32 total VMs accross the two SuperMicro boxes.&lt;br /&gt;
&lt;p /&gt;
I was under the assumption that the released kernel for the later Centos versions had the fixes to prevent this from happening in the event of a SAN failure, but I am still able to cause the guests to go readonly w/ the stock kernel. So I downloaded and applied Tom's patch, and verified that the driver is being loaded (rhel-vmfix in version string). However, I'm STILL getting timeouts even w/ the patched driver.&lt;br /&gt;
&lt;p /&gt;
What I find particularly confusing, however, is that nowhere in the Vmware logs is there ANY indication of SAN timeouts. Everything looks clean as a whistle. Am I missing something really simple? Inquiring minds want to know..</description>
      <pubDate>Tue, 29 Jan 2008 15:49:25 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/850792?tstart=0#850792</guid>
      <dc:date>2008-01-29T15:49:25Z</dc:date>
      <clearspace:dateToText>1 year, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/813664?tstart=0#813664</link>
      <description>&lt;div class="jive-quote"&gt;&lt;span class="jive-quote-header"&gt;socius wrote:&lt;/span&gt;&lt;br /&gt;
&lt;p /&gt;
Yes, they are indeed installed with a single partition and then cloned from the same installation. What surprised me is that we got this problem with file system corruption on all of the ones that got mounted as read only. &lt;/div&gt;
&lt;br /&gt;
That is pretty unusual.  You mentioned RHEL4 U3 and there was a known bug that caused additional corruption on ext3 with some kernels from that release.  I don't remember the specifics, but it would cause a similar "ext3 journal abort" even on normal systems on the SAN.  These issues are fixed in more recent releases (U5 and above).&lt;br /&gt;
&lt;br /&gt;
Also, even though I've noticed many distro's have stopped suggesting using multiple filesystems for root, /var, and /usr I still think this is a good practice.  We continue to use separate filesystems for volumes that get significant rights as this significantly decreases the chance of the root or /usr filesystem becoming corrupted.&lt;br /&gt;
&lt;p /&gt;
&lt;div class="jive-quote"&gt;Unfortunately, rescue mode did not help fixing the file systems. We managed to get some of our data back, but the vms still won't boot. We're naturally reluctant to install any more linux vms until we know that this won't happen again, that's why I' was curious if anybody else had experienced this.  &lt;/div&gt;
&lt;br /&gt;
The error you are getting is critical, but shouldn't be unrecoverable except in the worst cases.  There are multiple copies of the group descriptors on disk and you should be able to locate them with dumpe2fs.  Worst case you should be able to get your data by using debugfs.  I've been doing this a long time, and I've never seen this be unrecoverable, but it is a touchy recovery and it's certainly possible that you have an unrecoverable situation that I've never run across.  Just because I haven't seen it doesn't mean it does exist.&lt;br /&gt;
&lt;br /&gt;
I can certainly understand being reluctant to reinstall linux VMs.  I'd be that way too if I were in your situation.  I will tell you that I personally run over 20 RHEL4 (and now a few RHEL5) VM's running critical production systems and have never seen more than minor corruption, and none since the problems with journal aborts were corrected.  Of course, backups are always important, and you'll want to test your setup thoroughly but I believe it is possible to have great success with Linux on VMware.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Fri, 07 Dec 2007 15:10:11 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/813664?tstart=0#813664</guid>
      <dc:date>2007-12-07T15:10:11Z</dc:date>
      <clearspace:dateToText>1 year, 11 months ago</clearspace:dateToText>
      <clearspace:replyCount>3</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/813634?tstart=0#813634</link>
      <description>&lt;br /&gt;
I'm not really an expert on SAN infrastructure so maybe these are not the answers you were looking for, but I know we are using HP EVA 8000. &lt;br /&gt;
&lt;p /&gt;
The ESX hosts in question are running on HP DL385 G1 hardware and I think the HBAs are Emulex LP1050DC.</description>
      <pubDate>Fri, 07 Dec 2007 14:49:19 GMT</pubDate>
      <author>socius</author>
      <guid>http://communities.vmware.com/message/813634?tstart=0#813634</guid>
      <dc:date>2007-12-07T14:49:19Z</dc:date>
      <clearspace:dateToText>1 year, 11 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/813601?tstart=0#813601</link>
      <description>&lt;div class="jive-quote"&gt;Yes, they are indeed installed with a single partition and then cloned&lt;br /&gt;
from the same installation. What surprised me is that we got this&lt;br /&gt;
problem with file system corruption on all of the ones that got mounted&lt;br /&gt;
as read only.&lt;br /&gt;
&lt;p /&gt;
&lt;p /&gt;
Unfortunately, rescue mode did not help fixing the file systems. We&lt;br /&gt;
managed to get some of our data back, but the vms still won't boot.&lt;br /&gt;
We're naturally reluctant to install any more linux vms until we know&lt;br /&gt;
that this won't happen again, that's why I' was curious if anybody else&lt;br /&gt;
had experienced this.&lt;/div&gt;
&lt;br /&gt;
Out of curiosity, what sort of SAN setup are you running?</description>
      <pubDate>Fri, 07 Dec 2007 14:24:04 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/813601?tstart=0#813601</guid>
      <dc:date>2007-12-07T14:24:04Z</dc:date>
      <clearspace:dateToText>1 year, 11 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/813376?tstart=0#813376</link>
      <description>&lt;br /&gt;
Yes, they are indeed installed with a single partition and then cloned from the same installation. What surprised me is that we got this problem with file system corruption on all of the ones that got mounted as read only. &lt;br /&gt;
&lt;p /&gt;
Unfortunately, rescue mode did not help fixing the file systems. We managed to get some of our data back, but the vms still won't boot. We're naturally reluctant to install any more linux vms until we know that this won't happen again, that's why I' was curious if anybody else had experienced this.</description>
      <pubDate>Fri, 07 Dec 2007 06:58:55 GMT</pubDate>
      <author>socius</author>
      <guid>http://communities.vmware.com/message/813376?tstart=0#813376</guid>
      <dc:date>2007-12-07T06:58:55Z</dc:date>
      <clearspace:dateToText>1 year, 11 months ago</clearspace:dateToText>
      <clearspace:replyCount>6</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/813140?tstart=0#813140</link>
      <description>&lt;div class="jive-quote"&gt;&lt;span class="jive-quote-header"&gt;socius wrote:&lt;/span&gt;&lt;br /&gt;
The errors look like the attached screenshots in this post:&lt;br /&gt;
&lt;p /&gt;
&lt;p /&gt;
The first picture is before attempted reboot and the second is after.&lt;/div&gt;
&lt;br /&gt;
I have never seen any of our filesystems corrupted significantly, but some did experience minor corruption and I would suspect it's possible that bad things could happen if the error hit at a critical time.  Do you happen to install your systems with only a single partition (or perhaps just root and boot partitions)?  That would certainly make it more susceptible to corrupting the root volume.&lt;br /&gt;
&lt;br /&gt;
You can probably boot a CD in rescue mode and run fsck on the root filesystem to get the system booting again.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Thu, 06 Dec 2007 23:13:51 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/813140?tstart=0#813140</guid>
      <dc:date>2007-12-06T23:13:51Z</dc:date>
      <clearspace:dateToText>1 year, 11 months ago</clearspace:dateToText>
      <clearspace:replyCount>7</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/811388?tstart=0#811388</link>
      <description>The errors look like the attached screenshots in this post:&lt;br /&gt;
&lt;p /&gt;
The first picture is before attempted reboot and the second is after.</description>
      <pubDate>Wed, 05 Dec 2007 13:39:44 GMT</pubDate>
      <author>socius</author>
      <guid>http://communities.vmware.com/message/811388?tstart=0#811388</guid>
      <dc:date>2007-12-05T13:39:44Z</dc:date>
      <clearspace:dateToText>1 year, 11 months ago</clearspace:dateToText>
      <clearspace:replyCount>8</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/811382?tstart=0#811382</link>
      <description>&lt;p /&gt;
&lt;p /&gt;
&lt;br /&gt;
We ran about 20-25 RHEL 3 and 4 VMs using Vmware ESX 3.1 and 3.2&lt;br /&gt;
&lt;p /&gt;
Never had a kernel panic. The read only issue is now fixed with RHEL 5 also.&lt;br /&gt;
&lt;p /&gt;
&lt;p /&gt;
&lt;p /&gt;
I have seem kernel panics when using the Redhat cluster services on Vmware (free server) but that may not be related to this issue. Those are mainly with GFS or CLVMD.&lt;br /&gt;
&lt;p /&gt;
&lt;p /&gt;
&lt;p /&gt;
Sriram</description>
      <pubDate>Wed, 05 Dec 2007 13:32:45 GMT</pubDate>
      <author>sriramrajan</author>
      <guid>http://communities.vmware.com/message/811382?tstart=0#811382</guid>
      <dc:date>2007-12-05T13:32:45Z</dc:date>
      <clearspace:dateToText>1 year, 11 months ago</clearspace:dateToText>
      <clearspace:replyCount>9</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/811336?tstart=0#811336</link>
      <description>&lt;br /&gt;
Has anybody else had this problem with the addition of corrupted file systems? Almost all our rhel4u3 vms encountered this at some point this autumn and became unusable afterwards.&lt;br /&gt;
&lt;p /&gt;
I only see people in this thread talking about the file systems being mounted as read only, do your vms also receive kernel panics at reboot after this happened?</description>
      <pubDate>Wed, 05 Dec 2007 12:17:01 GMT</pubDate>
      <author>socius</author>
      <guid>http://communities.vmware.com/message/811336?tstart=0#811336</guid>
      <dc:date>2007-12-05T12:17:01Z</dc:date>
      <clearspace:dateToText>1 year, 11 months ago</clearspace:dateToText>
      <clearspace:replyCount>10</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/803645?tstart=0#803645</link>
      <description>Hi All,&lt;br /&gt;
&lt;br /&gt;
For what it's worth, we ran into this issue also. However, in our case this symptom (ext3fs remounts read-only) occured even after applying the mptscsi hotfix to a 2.6.9-42.0.8 kernel (the one with another ext3 + san based storage and ext3fs fix). We also found another thread which talks about increasing vm.min_free_kbytes to 10240 (see links below). Even after making sure all these three things were set and done, we encountered the issue! &lt;br /&gt;
&lt;br /&gt;
However we did finally pinpoint another reason for this occurence. There is a recent update for ESX 3.0.2 (3.0.2 Update 1) and three updates for this release, detailed in ESX-1002424, ESX-1002425 and ESX-1002429. This update fixes a networking latency issue on various hardware that leads to high network latency (high ping response times) and extremely slow san/iscsi/nfs/smb speeds.&lt;br /&gt;
&lt;br /&gt;
To summarize, what we had to do to finally alleviate the issue is this:&lt;br /&gt;
&lt;br /&gt;
1: Set vm.min_free_kbytes = 10240 in /etc/sysctl.conf on guests:&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.noah.org/wiki/index.php/VMware_notes"&gt;http://www.noah.org/wiki/index.php/VMware_notes&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-message" href="http://communities.vmware.com/message/249823#249823"&gt;http://communities.vmware.com/message/249823&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
2: Update to at least kernel 2.6.9-42.0.8 which fixes an issue with ext3fs on san/iscsi/nfs turning read-only by interpreting certain scsi layer messages as severe:&lt;br /&gt;
&lt;a class="jive-link-external" href="http://kbase.redhat.com/faq/FAQ_85_9610.shtm"&gt;http://kbase.redhat.com/faq/FAQ_85_9610.shtm&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="https://bugzilla.redhat.com/show_bug.cgi?id=213921"&gt;https://bugzilla.redhat.com/show_bug.cgi?id=213921&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
3: Apply the fix mentioned on this page relating to the mptscsi driver and an upstream patch relating to failover/multipath functionality:&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.tuxyturvy.com/blog/index.php?/archives/31-VMware-ESX-and-ext3-journal-aborts.html"&gt;http://www.tuxyturvy.com/blog/index.php?/archives/31-VMware-ESX-and-ext3-journal-aborts.html&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&amp;#38;externalId=51306"&gt;http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&amp;#38;externalId=51306&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://communities.vmware.com/thread/58121?tstart=0&amp;#38;start=0"&gt;http://communities.vmware.com/thread/58121?tstart=0&amp;#38;start=0&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
4: Update ESX 3.0.2 to 3.0.2 Update 1 + three additional fixes: ESX-1002424, ESX-1002425 and ESX-1002429, fixing high networking latency and other issues:&lt;br /&gt;
&lt;a class="jive-link-external" href="http://communities.vmware.com/thread/97117"&gt;http://communities.vmware.com/thread/97117&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.vmware.com/download/vi/"&gt;http://www.vmware.com/download/vi/&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.vmware.com/download/vi/vi3_patches_302.html"&gt;http://www.vmware.com/download/vi/vi3_patches_302.html&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
We have pretty thoroughly stress-tested the above configuration with a very high load on io and cpu for about three days now and have survived stuff that would previously lead to remounting read-only pretty quickly (used to be less than an hour in most cases).&lt;br /&gt;
&lt;br /&gt;
Regards,&lt;br /&gt;
&lt;br /&gt;
Rubin.</description>
      <pubDate>Mon, 26 Nov 2007 13:04:12 GMT</pubDate>
      <author>rann</author>
      <guid>http://communities.vmware.com/message/803645?tstart=0#803645</guid>
      <dc:date>2007-11-26T13:04:12Z</dc:date>
      <clearspace:dateToText>1 year, 12 months ago</clearspace:dateToText>
      <clearspace:replyCount>11</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/753889?tstart=0#753889</link>
      <description>OK, good, that's what I thought.&lt;br /&gt;
&lt;br /&gt;
So just to clarify for future readers of the thread here is the current status as far as I know:&lt;br /&gt;
&lt;br /&gt;
RHEL4 U3 &amp;#38; U4 -- VMware provides a patch for these OS versions in VMware KB ID 51306.&lt;br /&gt;
You can also upgrade to RHEL4 U5 which includes the required fixes.&lt;br /&gt;
&lt;br /&gt;
SLES9 SP3, SLES10 -- VMware provides a patch for these OS versions in VMware KB ID 51306.&lt;br /&gt;
&lt;br /&gt;
RHEL5 FCS (Initial Release) -- VMware provides a patch for these OS versions in VMware KB ID 1001778&lt;br /&gt;
The RHEL5 U1 beta kernel includes a fix for this issue and thus it is expected that when RHEL5 U1 final is released (probably sometime in October) it will also include the fix.&lt;br /&gt;
&lt;br /&gt;
For other, unsupported distros (or even the distro's above if you don't want VMware's patches for some reason) you can also continue to use the patched drivers available on my site or follow the generic instructions to manually apply the source patch to the driver included with your distros kernel source package.  I have success reports from many Debian and Ubuntu users as well as several other distros.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Wed, 19 Sep 2007 02:42:38 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/753889?tstart=0#753889</guid>
      <dc:date>2007-09-19T02:42:38Z</dc:date>
      <clearspace:dateToText>2 years, 2 months ago</clearspace:dateToText>
      <clearspace:replyCount>12</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/752884?tstart=0#752884</link>
      <description>Unpatched RedHat 5 kernels (Centos) will break. I've patched several of the virtual machines w/ your Centos patches and so far so good. I guess I was under the mistaken impression that the mainlined redhat kernels in R5 had the patches.</description>
      <pubDate>Tue, 18 Sep 2007 05:11:14 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/752884?tstart=0#752884</guid>
      <dc:date>2007-09-18T05:11:14Z</dc:date>
      <clearspace:dateToText>2 years, 2 months ago</clearspace:dateToText>
      <clearspace:replyCount>13</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/752720?tstart=0#752720</link>
      <description>Damin,&lt;br /&gt;
&lt;br /&gt;
Are you saying you are seeing this even with the VMware provided RHEL5 LSILogic patches or the RHEL5 patches from my site?  I know that a completely unpatched RHEL5 system still has the problem, that's what I stated above (although perhaps not clearly).&lt;br /&gt;
&lt;br /&gt;
Also, if you have access you may want to test the RHEL5.1 beta kernels as they appear to include the same fixes that Redhat included in RHEL4 U5.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Tue, 18 Sep 2007 00:04:56 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/752720?tstart=0#752720</guid>
      <dc:date>2007-09-18T00:04:56Z</dc:date>
      <clearspace:dateToText>2 years, 2 months ago</clearspace:dateToText>
      <clearspace:replyCount>14</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/747064?tstart=0#747064</link>
      <description>Tom,&lt;br /&gt;
    I've been testing the waters on our ESX 3.0.1 (w/ all patches) cluster w/ a bunch of Centos 5 virtual machines, and all of them tend to go ReadOnly at some point. Same behavior as the last time, while the Centos 4 boxes happily crank along. I'm running stock Centos kernels, so I'll look at the patches to see what might need to be done and automate it. I've also been seeing this with Rpath based appliances running the later 2.6.18 kenels as well.</description>
      <pubDate>Tue, 11 Sep 2007 03:30:49 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/747064?tstart=0#747064</guid>
      <dc:date>2007-09-11T03:30:49Z</dc:date>
      <clearspace:dateToText>2 years, 2 months ago</clearspace:dateToText>
      <clearspace:replyCount>15</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/739207?tstart=0#739207</link>
      <description>The fix posted by VMware for RHEL5 is equivalent to my fix for RHEL5 which I have posted on my website and is basically just a patch to the LSIlogic SCSI driver which gives the required "retry forever" behavior.&lt;br /&gt;
&lt;br /&gt;
VMware is correct in stating that RHEL5 has another issue caused by code in the SCSI mid-layer that means that, even with the fix in the LSIlogic SCSI driver, it's still possible for RHEL5 to timeout during very long stalls in I/O.  This is much more difficult to patch because I believe that it requires changes to code that is compiled directly into the kernel rather than simply loaded as a module.  &lt;br /&gt;
&lt;br /&gt;
That being said, the SCSI mid-layer takes a pretty large stall before it times out, enough that I'm not really sure it's a major issue.  The old problem was that even fairly short stalls, on the order of 10-30 seconds, could cause the issue, but, so far in my testing, it seems to take minutes before the SCSI mid-layer in RHEL5 times out.  If your storage array is pausing for minutes, you've got serious problems anyway.&lt;br /&gt;
&lt;br /&gt;
I've been running RHEL5 through the paces pretty hard for about a month, first using my own patch, and now using VMware's patch, and I haven't been able to trigger this problem just using high loads.  I have been able to trigger a timeout by doing things like disconnecting all cables or restarting the storage array, but I would actually expect those things to fail.  In my opinion it was always a luxury that RHEL4 with the patch would actually survive a storage array reboot that took minutes.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Fri, 31 Aug 2007 21:56:54 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/739207?tstart=0#739207</guid>
      <dc:date>2007-08-31T21:56:54Z</dc:date>
      <clearspace:dateToText>2 years, 2 months ago</clearspace:dateToText>
      <clearspace:replyCount>16</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/732885?tstart=0#732885</link>
      <description>Has anybody tried this fix yet (for RH5)? Does it work or is tsightler's fix still required?&lt;br /&gt;
&lt;a class="jive-link-external" href="http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&amp;#38;externalId=1001778"&gt;http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&amp;#38;externalId=1001778&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
I'm a little confused by the statement: "it also requires a SCSI mid-layer patch"&lt;br /&gt;
&lt;br /&gt;
Where exactly is the SCSI mid-layer patch applied? OS, HBA firmware on my side or SAN side, Fiber Switch? Just a little confused if VMWare isn't providing it then are we waiting on Red Hat, LSI, other?</description>
      <pubDate>Fri, 24 Aug 2007 14:47:07 GMT</pubDate>
      <author>alvserversupportteam</author>
      <guid>http://communities.vmware.com/message/732885?tstart=0#732885</guid>
      <dc:date>2007-08-24T14:47:07Z</dc:date>
      <clearspace:dateToText>2 years, 3 months ago</clearspace:dateToText>
      <clearspace:replyCount>17</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/704296?tstart=0#704296</link>
      <description>Hi - U5?  I am running the 55.0.2 kernel from Red Hat following this link &lt;a class="jive-link-external" href="http://kbase.redhat.com/faq/FAQ_85_10846.shtm"&gt;http://kbase.redhat.com/faq/FAQ_85_10846.shtm&lt;/a&gt; but have not got any more U5 updates (running up2date), largely because my /usr is almost full on the guest machine.  So I am running RHEL 4 AS with a U5 kernel - but will this fix my issue?  &amp;#38; are there any other specific RPM's I should get hold hold of?&lt;br /&gt;
&lt;br /&gt;
Thanks&lt;br /&gt;
&lt;br /&gt;
Andy</description>
      <pubDate>Tue, 24 Jul 2007 16:59:58 GMT</pubDate>
      <author>kiano</author>
      <guid>http://communities.vmware.com/message/704296?tstart=0#704296</guid>
      <dc:date>2007-07-24T16:59:58Z</dc:date>
      <clearspace:dateToText>2 years, 4 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/703141?tstart=0#703141</link>
      <description>For those running SLES9 or SLES10, see Novell's TID 3584352:&lt;br /&gt;
&lt;br /&gt;
SUSE Linux Enterprise 10&lt;br /&gt;
This issue has been fixed as of Service Pack 1.&lt;br /&gt;
&lt;br /&gt;
SUSE Linux Enterprise Server 9 and Open Enterprise Server (Linux based)&lt;br /&gt;
This issue has been fixed as of kernel 2.6.5-7.286.</description>
      <pubDate>Mon, 23 Jul 2007 17:01:49 GMT</pubDate>
      <author>GvdWereld</author>
      <guid>http://communities.vmware.com/message/703141?tstart=0#703141</guid>
      <dc:date>2007-07-23T17:01:49Z</dc:date>
      <clearspace:dateToText>2 years, 4 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/696126?tstart=0#696126</link>
      <description>I've been happily running RHEL4 U5 on about a dozen VM's for weeks now, so far with no issues so, while RHEL4 U5 is not yet officially supported it seems to work fine.&lt;br /&gt;
&lt;br /&gt;
That being said I'm sure you can happily run the RHEL4 U4 kernel with all of the U5 updates if you want to.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Sat, 14 Jul 2007 01:42:18 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/696126?tstart=0#696126</guid>
      <dc:date>2007-07-14T01:42:18Z</dc:date>
      <clearspace:dateToText>2 years, 4 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/692373?tstart=0#692373</link>
      <description>Is anyone running the latest Update 4 kernel but all the other Update 5 patches from RHN?  Was wondering if that may be a good route to go until they fully support Update 5.&lt;br /&gt;
&lt;br /&gt;
Does anyone know a good thread that talks about RHEL 4 Update 5 support?  This one has good info, just cant find other ones.&lt;br /&gt;
&lt;br /&gt;
Thanks</description>
      <pubDate>Tue, 10 Jul 2007 17:09:18 GMT</pubDate>
      <author>Ops admin</author>
      <guid>http://communities.vmware.com/message/692373?tstart=0#692373</guid>
      <dc:date>2007-07-10T17:09:18Z</dc:date>
      <clearspace:dateToText>2 years, 4 months ago</clearspace:dateToText>
      <clearspace:replyCount>3</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/675991?tstart=0#675991</link>
      <description>I currently believe that the solution in RHEL4 U5 (kernel versions &amp;gt;= 2.6.9-55.EL) does fix this issue acceptably.  I still think it may be possible for a very long (say &amp;gt;5 minute) pause in IO to cause the error to occur but if your SAN is pausing/stopping IO for &amp;gt;5 minutes then you probably have other issues you should address.&lt;br /&gt;
&lt;br /&gt;
I've run every stress test I can throw at it, including rebooting my Equallogic array, and removing the zones on our CX700 and so far have not seen the issue with these kernels.  Based on my testing results I moved our development Oracle systems, and several other fairly high I/O systems to this kernel, without patches, and they have survived several weeks without issue.  These systems would typically fail in days, or even hours with the 2.6.9-42.x seriese of kernels.&lt;br /&gt;
&lt;br /&gt;
Based on this, I believe the issue is "fixed" for RHEL4 Update 5, however, that still leaves a lot of distros that either still need the VMware fix or the manual compiled driver with my small patch.&lt;br /&gt;
&lt;br /&gt;
RHEL5 still needs a "fix" as well, and since it's not yet officially supported from VMware for ESX my guess is it won't get a formal fix until it is certified.  I plan to post a patched driver for RHEL5 on my website in the next day or so.  I would suspect this newer driver would also work with other "newish" distros based on more recent kernels.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Wed, 20 Jun 2007 14:32:39 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/675991?tstart=0#675991</guid>
      <dc:date>2007-06-20T14:32:39Z</dc:date>
      <clearspace:dateToText>2 years, 5 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/673740?tstart=0#673740</link>
      <description>&lt;div class="jive-quote"&gt;Anyone try this with RHEL 5 ?&lt;br /&gt;
&lt;p /&gt;
My kernel version is  2.6.18-8.1.1.el5.&lt;br /&gt;
&lt;p /&gt;
I am trying compile the module from source and&lt;br /&gt;
getting the following errors.&lt;br /&gt;
&lt;p /&gt;
In file included from&lt;br /&gt;
/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptba&lt;br /&gt;
se.c:50:&lt;br /&gt;
include/linux/config.h:6:2: warning: #warning&lt;br /&gt;
Including config.h is deprecated.&lt;br /&gt;
/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptba&lt;br /&gt;
se.c: In function âmpt_suspendâ:&lt;br /&gt;
/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptba&lt;br /&gt;
se.c:1646: error: switch quantity not an integer&lt;br /&gt;
/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptba&lt;br /&gt;
se.c: In function âMakeIocReadyâ:&lt;br /&gt;
/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptba&lt;br /&gt;
se.c:2510: error: implicit declaration of function&lt;br /&gt;
âcrashdump_modeâ&lt;br /&gt;
make[2]: ***&lt;br /&gt;
[/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptb&lt;br /&gt;
ase.o] Error 1&lt;br /&gt;
make[1]: ***&lt;br /&gt;
[_module_/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62&lt;br /&gt;
.01] Error 2&lt;br /&gt;
make[1]: Leaving directory&lt;br /&gt;
`/usr/src/kernels/2.6.18-8.1.4.el5-i686'&lt;br /&gt;
&lt;p /&gt;
Sriram&lt;/div&gt;
&lt;br /&gt;
Same problem here. Also the version of the in-kernel driver is somewhat newer than the one provided from vmware.</description>
      <pubDate>Mon, 18 Jun 2007 13:25:47 GMT</pubDate>
      <author>stericho</author>
      <guid>http://communities.vmware.com/message/673740?tstart=0#673740</guid>
      <dc:date>2007-06-18T13:25:47Z</dc:date>
      <clearspace:dateToText>2 years, 5 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/664158?tstart=0#664158</link>
      <description>Ok, I've got the driver installed, and we haven't had the file system (ReiserFS 3) go read only yet.  However, I'm concerned.  I'm still seeing tons of mpt errors on the SLES 10 server in /var/log/messages. Such as:&lt;br /&gt;
&lt;br /&gt;
Jun  7 01:00:41 MoodleFinal kernel: mptbase: ioc0: IOCStatus(0x0002): Busy&lt;br /&gt;
Jun  7 01:01:02 MoodleFinal kernel: mptbase: ioc0: IOCStatus(0x0002): Busy&lt;br /&gt;
Jun  7 10:35:15 MoodleFinal kernel: mptscsih: ioc0: attempting task abort! (sc=c1df1980)&lt;br /&gt;
Jun  7 10:35:15 MoodleFinal kernel: mptbase: ioc0: IOCStatus(0x004b): SCSI IOC Terminated&lt;br /&gt;
Jun  7 10:35:15 MoodleFinal kernel: mptscsih: ioc0: task abort: SUCCESS (sc=c1df1980)&lt;br /&gt;
&lt;br /&gt;
I'm sincerely hoping that even in the presence of these errors that the file system driver doesn' continue to remount the volume as read only.  I already announced to our community that the problem has been fixed.  Thankfully it hasn't happened yet since the driver update.&lt;br /&gt;
&lt;br /&gt;
Some of you have asked so I'm posting the relevant error entries in ESX /var/log/vmkernel:&lt;br /&gt;
&lt;br /&gt;
Jun  7 20:14:11 esx1 vmkernel: 145:08:22:29.648 cpu3:1687)VSCSI: 1829: Reset request on handle 8335 (0 outstanding commands)&lt;br /&gt;
Jun  7 20:14:11 esx1 vmkernel: 145:08:22:29.648 cpu3:1687)VSCSI: 1829: Reset request on handle 8336 (0 outstanding commands)&lt;br /&gt;
Jun  7 20:14:11 esx1 vmkernel: 145:08:22:29.648 cpu3:1663)VSCSI: 2028: Resetting handle 8335 [0/0]&lt;br /&gt;
Jun  7 20:14:11 esx1 vmkernel: 145:08:22:29.648 cpu3:1663)SCSI: 3222: handle 306590 / orig 0x7d71bb0&lt;br /&gt;
Jun  7 20:14:11 esx1 vmkernel: 145:08:22:29.648 cpu3:1663)VSCSI: 1878: Completing reset on handle 8335 (0 outstanding commands)&lt;br /&gt;
Jun  7 20:14:11 esx1 vmkernel: 145:08:22:29.648 cpu3:1663)VSCSI: 2028: Resetting handle 8336 [0/0]&lt;br /&gt;
Jun  7 20:14:11 esx1 vmkernel: 145:08:22:29.648 cpu3:1663)SCSI: 3222: handle 735951 / orig 0x7d70e08&lt;br /&gt;
Jun  7 20:14:11 esx1 vmkernel: 145:08:22:29.648 cpu3:1663)&amp;lt;6&amp;gt;scsi(1:0:0:12): DEVICE RESET SUCCEEDED.&lt;br /&gt;
Jun  7 20:14:11 esx1 vmkernel: 145:08:22:29.648 cpu3:1663)VSCSI: 1878: Completing reset on handle 8336 (0 outstanding commands)&lt;br /&gt;
Jun  7 20:17:24 esx1 vmkernel: 145:08:25:42.730 cpu2:1040)WARNING: SCSI: 1726: Unexpected status returned: bad000a I/O error&lt;br /&gt;
&lt;br /&gt;
Where should I go from here to eliminate these errors?&lt;br /&gt;
&lt;br /&gt;
Also, we're not currently backing up our Linux VMs (gasp) but we're going to be implementing ESX ranger within a month or two.  So heavy disk I/O during backup isn't the cause.  And our "read only" issue I don't believe is related to storage load.  It seems to happen randomly.&lt;br /&gt;
&lt;br /&gt;
Our hardware:&lt;br /&gt;
&lt;br /&gt;
IBM bladecenter&lt;br /&gt;
2 LS20 blades running ESX 3.0.0&lt;br /&gt;
Qlogic SFF FC adapters &lt;br /&gt;
Bladecenter integrated 14x2 port FC switch module &lt;br /&gt;
Qlogic Sanbox 3050&lt;br /&gt;
IBM FasTt 600 and Nexsan Satablade SAN arrays&lt;br /&gt;
&lt;br /&gt;
The two FC switches run in tandem with 2 uplinks between them for bandwidth and failover between the switches.  The SAN arrays are connected to the 8 port Sanbox 3050.  We are not multipathing.  Both FC ports on Satablade are connected to the switch and I'm exposing LUNs from both ports.  I have mapped raw LUNs exposed out of both ports for bandwidth balancing but I'm only exposing a given LUN on one port since we're not multipathing.  The FasTt 600 only has one port connected to the switch.</description>
      <pubDate>Fri, 08 Jun 2007 01:50:24 GMT</pubDate>
      <author>stan-whitfield</author>
      <guid>http://communities.vmware.com/message/664158?tstart=0#664158</guid>
      <dc:date>2007-06-08T01:50:24Z</dc:date>
      <clearspace:dateToText>2 years, 5 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/662079?tstart=0#662079</link>
      <description>Turns out swapping to the Buslogic isn't as straightforward as I thought it may be.  I can't seem to select a new scsi module from yast if the hardware isn't "present' when I do a scan...&lt;br /&gt;
&lt;br /&gt;
So, I downloaded the LSI logic SLES 10 driver patch linked in this KB:&lt;br /&gt;
&lt;br /&gt;
&lt;a class="jive-link-external" href="http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&amp;#38;externalId=51306"&gt;http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&amp;#38;externalId=51306&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
However, the darn MD5 sum doesn't match.  All the other files linked on that KB page have MD5 sums that check out, with the exceptoin of the SLES 10 download.  I'm wary of using it because of this.  And darn it I've been up all night working on this and want to have it done before I go into the office today!!  I've got half a notion to just throw caution to the wind regarding the MD5 sum mismatch assuming it's a VMware documentation glitch.  Something like ... they uploaded a new file and forgot to change the MD5 sum in the text of the KB article.&lt;br /&gt;
&lt;br /&gt;
I've downloaded it twice to two different machines, one Linux one Windows, and the sum doesn't match what's listed in the KB.  Each time I downloaded the file the checksum is the same, so I doubt there's a problem with the download.  VMware please square this situation.  It's annoying.&lt;br /&gt;
&lt;br /&gt;
Update:  apparently someone was listening, as after a few more repeated downloads, the MD5 sums now match.  Weird...very weird.&lt;br /&gt;
&lt;br /&gt;
Message was edited by: &lt;br /&gt;
        stan-whitfield</description>
      <pubDate>Wed, 06 Jun 2007 09:47:29 GMT</pubDate>
      <author>stan-whitfield</author>
      <guid>http://communities.vmware.com/message/662079?tstart=0#662079</guid>
      <dc:date>2007-06-06T09:47:29Z</dc:date>
      <clearspace:dateToText>2 years, 5 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/662023?tstart=0#662023</link>
      <description>I am so glad I found this thread!  We've had this problem with SLES 10 VMs using the LSI Logic adapter on 3.0.0 (we'll be uprading to 3.0.1 soon).  After researching this issue from the Linux kernel side, I found that the mptbase LSI Logic SCSI driver code in the 2.6.x kernel is buggy on its own, unrelated to VMware.&lt;br /&gt;
&lt;br /&gt;
&lt;a class="jive-link-external" href="http://lkml.org/lkml/2005/10/11/224"&gt;http://lkml.org/lkml/2005/10/11/224&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
I have 2 SLES 10 and 1 SLES 9 VMs all on 2.6.x kernels.  The two SLES 10s were built with the LSI Logic adapter and driver and the SLES 9 VM was built with the Buslogic adapter.  Both of the SLES 10 VMs experience the massive scsi errors and the read only remounts.  The SLES 9 VM using Buslogic goes not.&lt;br /&gt;
&lt;br /&gt;
I'm going to convert to Buslogic on the SLES 10 VMs and see if that helps.  I think I saw a procedure for this documented in this thread.  If not I'll figure it out, implement it, and I'll post back with results.  &lt;br /&gt;
&lt;br /&gt;
Oh, btw, we're all FC SAN at my shop.  IBM and Nexsan arrays, Qlogic switches.  No multipathing.  We have a mix of VMFS volumes with multiple VMs stored on each VMFS.  We map RDMs to individual VMs that need lots of storage.  As a general rule, for any VM that will need over 10GB (i.e. database server), we create a 10GB virtual disk on a shared VMFS volume and install the base OS on it.  We then map an RDM to it, format it with Reiser3, and mount it into the filesystem.&lt;br /&gt;
&lt;br /&gt;
I do see some SCSI errors in our W2K3 VM event logs but we're not having huge issues as a result of them, not like with these Linux VMs.  However, if the Buslogic adapter/driver works better in Windows, I'll look into swapping those over as well.</description>
      <pubDate>Wed, 06 Jun 2007 07:42:11 GMT</pubDate>
      <author>stan-whitfield</author>
      <guid>http://communities.vmware.com/message/662023?tstart=0#662023</guid>
      <dc:date>2007-06-06T07:42:11Z</dc:date>
      <clearspace:dateToText>2 years, 5 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/660815?tstart=0#660815</link>
      <description>Anyone try this with RHEL 5 ?&lt;br /&gt;
&lt;br /&gt;
My kernel version is  2.6.18-8.1.1.el5.&lt;br /&gt;
&lt;br /&gt;
I am trying compile the module from source and getting the following errors.&lt;br /&gt;
&lt;br /&gt;
In file included from /home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptbase.c:50:&lt;br /&gt;
include/linux/config.h:6:2: warning: #warning Including config.h is deprecated.&lt;br /&gt;
/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptbase.c: In function âmpt_suspendâ:&lt;br /&gt;
/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptbase.c:1646: error: switch quantity not an integer&lt;br /&gt;
/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptbase.c: In function âMakeIocReadyâ:&lt;br /&gt;
/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptbase.c:2510: error: implicit declaration of function âcrashdump_modeâ&lt;br /&gt;
make[2]: *** [/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01/mptbase.o] Error 1&lt;br /&gt;
make[1]: *** [_module_/home/sri/mptscsi_vmware/mptscsi-rhel-3.02.62.01] Error 2&lt;br /&gt;
make[1]: Leaving directory `/usr/src/kernels/2.6.18-8.1.4.el5-i686'&lt;br /&gt;
&lt;br /&gt;
Sriram</description>
      <pubDate>Tue, 05 Jun 2007 00:15:08 GMT</pubDate>
      <author>sriramrajan</author>
      <guid>http://communities.vmware.com/message/660815?tstart=0#660815</guid>
      <dc:date>2007-06-05T00:15:08Z</dc:date>
      <clearspace:dateToText>2 years, 5 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/659470?tstart=0#659470</link>
      <description>I'm not sure I'd consider it fixed until Tom weighs in w/ more testing. I have about 15 production VMs running Centos that will soon be updated to the Centos 2.6.9-55 kernel. I'm going to do a yum update to a pair of these boxes in the near future, so I'll let you know if I see the problem.&lt;br /&gt;
&lt;br /&gt;
I would reccomend that someone who has a test farm (not production) load up a few VMs with the new kernel and do some insane disk access against the san.&lt;br /&gt;
&lt;br /&gt;
Another interesting way to test it is to reboot your SAN while the boxes are running! &lt;img class="jive-emoticon" border="0" src="http://communities.vmware.com/images/emoticons/wink.gif" alt=";)" /&gt; With Tom's patch, our Linux boxes just "pause" until the SAN comes back up and then continue on as normal (it's actually kind of cool to see).</description>
      <pubDate>Sat, 02 Jun 2007 12:58:36 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/659470?tstart=0#659470</guid>
      <dc:date>2007-06-02T12:58:36Z</dc:date>
      <clearspace:dateToText>2 years, 5 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/659222?tstart=0#659222</link>
      <description>So now, do we consider this fixed?  Just wondering if it would be wise to recompile the driver for the new kernel or if the update already included in the new kernel definitely fixes this issue.&lt;br /&gt;
&lt;br /&gt;
After updating to the newer kernel I have not seen this issue, but I only have 2 linux VM's.  Soon, I plan on helping someone roll out 20-30 so I'd like to get this right and not run into the issue down the road on heavy hit production machines.</description>
      <pubDate>Fri, 01 Jun 2007 20:02:39 GMT</pubDate>
      <author>jatwell</author>
      <guid>http://communities.vmware.com/message/659222?tstart=0#659222</guid>
      <dc:date>2007-06-01T20:02:39Z</dc:date>
      <clearspace:dateToText>2 years, 5 months ago</clearspace:dateToText>
      <clearspace:replyCount>4</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/651345?tstart=0#651345</link>
      <description>I also have the issue showing up on 2 of my RHEL5 systems. So we are all in the same boat...</description>
      <pubDate>Wed, 23 May 2007 00:49:59 GMT</pubDate>
      <author>ibm-sorcerer</author>
      <guid>http://communities.vmware.com/message/651345?tstart=0#651345</guid>
      <dc:date>2007-05-23T00:49:59Z</dc:date>
      <clearspace:dateToText>2 years, 6 months ago</clearspace:dateToText>
      <clearspace:replyCount>5</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/640565?tstart=0#640565</link>
      <description>Well, as stated above, RHEL4 U5 does include a patch which is supposed to correct the issue.  It's not the same patch that is on my website (which is really just a hack) but is supposed to be a more "correct" fix for the problem.  So far testing is looking very good.&lt;br /&gt;
&lt;br /&gt;
I've now upgraded two VM's to RHEL4 U5, one connected to an EMC AX150i (iSCSI) that was always easy to reproduce the issue with RHEL4 U3 and U4 in just a couple of hours (sometimes minutes), and another connected to an EMC CX700 (Fiber Channel) which I could usually make fail in 8-10 hours.&lt;br /&gt;
&lt;br /&gt;
I've now hammered on both of these for almost 48 hours with no signs of problems so it seems like the "fix" in RHEL4 U5 looks pretty good, but I'm conservative, I'll need a few weeks of production runtime without errors, or reports from others, before I'll consider it completely safe.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom&lt;br /&gt;
&lt;br /&gt;
Message was edited by: &lt;br /&gt;
        tsightler</description>
      <pubDate>Wed, 09 May 2007 03:53:35 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/640565?tstart=0#640565</guid>
      <dc:date>2007-05-09T03:53:35Z</dc:date>
      <clearspace:dateToText>2 years, 6 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/639403?tstart=0#639403</link>
      <description>I would be interested in knowing if the RHEL4 U5 kernel has the fix too.</description>
      <pubDate>Mon, 07 May 2007 19:43:57 GMT</pubDate>
      <author>jatwell</author>
      <guid>http://communities.vmware.com/message/639403?tstart=0#639403</guid>
      <dc:date>2007-05-07T19:43:57Z</dc:date>
      <clearspace:dateToText>2 years, 6 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/639354?tstart=0#639354</link>
      <description>U5 does include a patch that is supposed to correct this issue.  I've only got one VM running the new kernel so far but I hammered on it this weekend and it passed without any special patches so it looks promising.  I've got some systems on a much slower EMC AX150i that were always great and showing the problem.  I'll try to upgrade those and see how they behave in the next few days.&lt;br /&gt;
&lt;br /&gt;
Interestingly, the problem does seem to still be an issue for RHEL5 (admittedly not supported by VMware yet anyway, but we've got a few test systems running).  I'm hoping to update my page with a workaround for RHEL5 soon.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Mon, 07 May 2007 19:03:14 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/639354?tstart=0#639354</guid>
      <dc:date>2007-05-07T19:03:14Z</dc:date>
      <clearspace:dateToText>2 years, 6 months ago</clearspace:dateToText>
      <clearspace:replyCount>8</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/638481?tstart=0#638481</link>
      <description>update 5 is out now, anyone know if this solves the issue?</description>
      <pubDate>Sat, 05 May 2007 20:16:52 GMT</pubDate>
      <author>soleblazer</author>
      <guid>http://communities.vmware.com/message/638481?tstart=0#638481</guid>
      <dc:date>2007-05-05T20:16:52Z</dc:date>
      <clearspace:dateToText>2 years, 6 months ago</clearspace:dateToText>
      <clearspace:replyCount>9</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/612599?tstart=0#612599</link>
      <description>Has anyone tested RHEL4 U5 Beta?  Kernels newer than 2.6.9-45 are supposed to have a more "correct" fix for this issue built in but I haven't had time to test.  I'm going to try to find time this week, but I'd love to hear if anyone else has given them a spin.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Sun, 01 Apr 2007 16:29:07 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/612599?tstart=0#612599</guid>
      <dc:date>2007-04-01T16:29:07Z</dc:date>
      <clearspace:dateToText>2 years, 7 months ago</clearspace:dateToText>
      <clearspace:replyCount>10</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/592337?tstart=0#592337</link>
      <description>Hi,&lt;br /&gt;
&lt;br /&gt;
I am having problem on RHEL4U3, RHEL3U8 and SUSE9 SP3 VMs. This is a NAS / SAN system and I am running Connectathon test which is an NFS test for NAS side testing. When running the special test using NFS version 3 , using UDP protocol, it gets stuck at a point where it says "write/read 30 MB file". On RHEL3U8 and SUSE9 SP3, it gets stuck for 5-7 min, then continues fine. On RHEL4U3, it gets stuck forever. &lt;br /&gt;
&lt;br /&gt;
Is my problem and the problem you discussed can be related? Plesae let me know.</description>
      <pubDate>Tue, 06 Mar 2007 23:04:11 GMT</pubDate>
      <author>sjc_dogs</author>
      <guid>http://communities.vmware.com/message/592337?tstart=0#592337</guid>
      <dc:date>2007-03-06T23:04:11Z</dc:date>
      <clearspace:dateToText>2 years, 8 months ago</clearspace:dateToText>
      <clearspace:replyCount>11</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/588548?tstart=0#588548</link>
      <description>The "latest kernel" does not fix the problem, at least as of 2.6.9-42.0.8, so, yes, if you are going to update to the latest kernel then you also need either VMware's fix or my fix.&lt;br /&gt;
&lt;br /&gt;
On the other hand, if you are on kernel 2.6.9-22.0.2 or earlier, and you're not planning to upgrade your kernel, then you don't need to do anything as those kernels don't have the problem.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Thu, 01 Mar 2007 22:27:36 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/588548?tstart=0#588548</guid>
      <dc:date>2007-03-01T22:27:36Z</dc:date>
      <clearspace:dateToText>2 years, 8 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/587007?tstart=0#587007</link>
      <description>So, to summarize should we get the kernel updated to 2.6.9-42.0.8 AND apply the vmware patch via &lt;a class="jive-link-external" href="http://kb.vmware.com/vmtnkb/search.do?cmd=displayKC&amp;#38;docType=kc&amp;#38;externalId=51306&amp;#38;sliceId=SAL_Public"&gt;http://kb.vmware.com/vmtnkb/search.do?cmd=displayKC&amp;#38;docType=kc&amp;#38;externalId=51306&amp;#38;sliceId=SAL_Public&lt;/a&gt;&lt;br /&gt;
OR do we just apply the kernel patch and we should be good to go since the problem really was in rhel4?&lt;br /&gt;
&lt;br /&gt;
I was set on just going to the latest kernel, but then "Ops Admin" replied 2 posts before this one and said that he was on the latest kernel and still had problems..</description>
      <pubDate>Wed, 28 Feb 2007 15:46:30 GMT</pubDate>
      <author>cuzic4n</author>
      <guid>http://communities.vmware.com/message/587007?tstart=0#587007</guid>
      <dc:date>2007-02-28T15:46:30Z</dc:date>
      <clearspace:dateToText>2 years, 8 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/584988?tstart=0#584988</link>
      <description>&lt;div class="jive-quote"&gt;We have 20+ other systems that are running the stock&lt;br /&gt;
stuff "2.6.9-22.ELsmp #1 SMP" and "mptlinux-3.02.18".&lt;br /&gt;
 We need to update the kernel to apply the patch.  &lt;br /&gt;
&lt;p /&gt;
Does anyone know if we need to apply the patch to the&lt;br /&gt;
stock kernel systems?  I am just wondering if the&lt;br /&gt;
problem wont show itself with what we have on the&lt;br /&gt;
older systems.&lt;/div&gt;
&lt;br /&gt;
For RHEL4 the problem only occurs with U3 and above.  The 2.6.9-22.EL kernel is, I believe, the kernel from RHEL4 U2 so it doesn't actually have the problem.  I'm not 100% sure, but I believe that the first "released" kernel that included this problem is 2.6.9-34.EL.  Any non-development kernel older than that should be safe.&lt;br /&gt;
&lt;br /&gt;
I actually think the update that broke this was added in the U3 development chain around kernel 2.6.9-22.25.EL, but it's been a long time so I may not have the exact correct release.  For example, I have some 2.6.9-28.EL U3 beta kernels which definitely have the "broken" driver.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom&lt;br /&gt;
&lt;p /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Mon, 26 Feb 2007 14:41:28 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/584988?tstart=0#584988</guid>
      <dc:date>2007-02-26T14:41:28Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/584010?tstart=0#584010</link>
      <description>This came up on our farm.&lt;br /&gt;
&lt;br /&gt;
In fact one of the servers was rebooted, when it came back up all that was left of the "/" filesystem was "lost&amp;#38;found"  The entire rest was gone!!&lt;br /&gt;
&lt;br /&gt;
One thing we have noticed, the 3 systems that have the problem all have the updated Redhat kernel from RHN "2.6.9-42.0.8.ELsmp #1 SMP" and the following entry for the driver "mptlinux-3.02.62.01rh".  We can apply the patch to these servers.&lt;br /&gt;
&lt;br /&gt;
We have 20+ other systems that are running the stock stuff "2.6.9-22.ELsmp #1 SMP" and "mptlinux-3.02.18".  We need to update the kernel to apply the patch.  &lt;br /&gt;
&lt;br /&gt;
Does anyone know if we need to apply the patch to the stock kernel systems?  I am just wondering if the problem wont show itself with what we have on the older systems.</description>
      <pubDate>Sat, 24 Feb 2007 04:54:39 GMT</pubDate>
      <author>Ops admin</author>
      <guid>http://communities.vmware.com/message/584010?tstart=0#584010</guid>
      <dc:date>2007-02-24T04:54:39Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>3</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/576252?tstart=0#576252</link>
      <description>&lt;div class="jive-quote"&gt;Our company had a thorough discussion and meeting&lt;br /&gt;
sessions setup with redhat on this 'readonly issue'&lt;br /&gt;
not sure if its identical but we actually had a&lt;br /&gt;
hot-fix and new kernel created from our testing. If&lt;br /&gt;
you using RHEL your going to need to lay down&lt;br /&gt;
2.6.9-42.0.8 kernel. 42.0.3 is not sufficient enough.&lt;br /&gt;
Bugzilla 213921 is a good reference to the trouble we&lt;br /&gt;
had.&lt;/div&gt;
&lt;p /&gt;
The bug fixed in 2.6.9-42.0.8, and documented in Bugzilla 213921, is an issue with ext3 and is not really related to VMware, although I suspect that VMware may make the problem slightly more likely to occur.&lt;br /&gt;
&lt;br /&gt;
The "VMware" bug is an issue with the mptscsih driver, it's interaction with VMware server and how it reports timeouts to the SCSI midlayer in the guest.&lt;br /&gt;
&lt;br /&gt;
While the failure mode of both bugs does cause a nearly indetical end result (ext3 filesystem going read-only) but key differences can be noted in the log files in most cases.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Wed, 14 Feb 2007 20:43:15 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/576252?tstart=0#576252</guid>
      <dc:date>2007-02-14T20:43:15Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/575921?tstart=0#575921</link>
      <description>Do you have logs from guest &amp;#38; vmkernel from around time of failure?  You should hit this problem only if I/O takes longer than 30 seconds to finish (and you'll hit it on both real hardware and in the VM).  Perhaps your SAN decided to do something else in the middle of test?</description>
      <pubDate>Wed, 14 Feb 2007 16:52:10 GMT</pubDate>
      <author>petr</author>
      <guid>http://communities.vmware.com/message/575921?tstart=0#575921</guid>
      <dc:date>2007-02-14T16:52:10Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/575900?tstart=0#575900</link>
      <description>Our company had a thorough discussion and meeting sessions setup with redhat on this 'readonly issue' not sure if its identical but we actually had a hot-fix and new kernel created from our testing. If you using RHEL your going to need to lay down 2.6.9-42.0.8 kernel. 42.0.3 is not sufficient enough. Bugzilla 213921 is a good reference to the trouble we had.</description>
      <pubDate>Wed, 14 Feb 2007 16:35:09 GMT</pubDate>
      <author>ddecker</author>
      <guid>http://communities.vmware.com/message/575900?tstart=0#575900</guid>
      <dc:date>2007-02-14T16:35:09Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/575856?tstart=0#575856</link>
      <description>This is just what I need.&lt;br /&gt;
Thanks for the assist &lt;img class="jive-emoticon" border="0" src="http://communities.vmware.com/images/emoticons/happy.gif" alt=":)" /&gt;&lt;br /&gt;
&lt;br /&gt;
If I come up with any variations of:&lt;br /&gt;
'iozone -r 1m -s 1024m -t 8'&lt;br /&gt;
with iozone or any other tools, I'll post back here.</description>
      <pubDate>Wed, 14 Feb 2007 15:56:27 GMT</pubDate>
      <author>magusnet</author>
      <guid>http://communities.vmware.com/message/575856?tstart=0#575856</guid>
      <dc:date>2007-02-14T15:56:27Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/575364?tstart=0#575364</link>
      <description>&lt;div class="jive-quote"&gt;1. Can someone provide a clear set of steps to&lt;br /&gt;
reproduce this in a lab?&lt;/div&gt;
&lt;br /&gt;
You mean reproduce the error?  It's a little difficult to provide clear steps since it can be more difficult to trigger on some hardware, however, the basic formula in my case is, run a continuous loop of ' iozone -r 1m -s 1024m -t 8' in 2 VM's against a single VMFS LUN.  Given enough time this has failed on every one of my SAN's without doing much of anything else, usually within just a few hours.&lt;br /&gt;
&lt;br /&gt;
Damin seems to have a more formal test that he feels comfortable with on his setup.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Wed, 14 Feb 2007 00:28:47 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/575364?tstart=0#575364</guid>
      <dc:date>2007-02-14T00:28:47Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>4</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/574141?tstart=0#574141</link>
      <description>The VMware patch won't help us since we are on SLES9_SP3 kernel build -282.&lt;br /&gt;
&lt;br /&gt;
1. Can someone provide a clear set of steps to reproduce this in a lab?&lt;br /&gt;
2. Has anyone gotten any information from Novell for SLES9 on a fix like &lt;br /&gt;
     RedHat has done?</description>
      <pubDate>Mon, 12 Feb 2007 22:09:18 GMT</pubDate>
      <author>magusnet</author>
      <guid>http://communities.vmware.com/message/574141?tstart=0#574141</guid>
      <dc:date>2007-02-12T22:09:18Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>5</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/571998?tstart=0#571998</link>
      <description>Hi,&lt;br /&gt;
&lt;br /&gt;
VMware has updated the article:&lt;br /&gt;
&lt;a class="jive-link-external" href="http://kb.vmware.com/vmtnkb/search.do?cmd=displayKC&amp;#38;docType=kc&amp;#38;externalId=51306&amp;#38;sliceId=SAL_Public"&gt;http://kb.vmware.com/vmtnkb/search.do?cmd=displayKC&amp;#38;docType=kc&amp;#38;externalId=51306&amp;#38;sliceId=SAL_Public&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
Now a hotfix for SLES 9 SP3 is available.&lt;br /&gt;
Does anyone if this does work for SLES 10, too?&lt;br /&gt;
&lt;br /&gt;
@doomdevice : Thanks for your work!&lt;br /&gt;
But  ask that because we are looking for an offiicially supported solution.&lt;br /&gt;
&lt;br /&gt;
Thanks in advance.&lt;br /&gt;
/egr</description>
      <pubDate>Fri, 09 Feb 2007 14:13:54 GMT</pubDate>
      <author>egr</author>
      <guid>http://communities.vmware.com/message/571998?tstart=0#571998</guid>
      <dc:date>2007-02-09T14:13:54Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/569008?tstart=0#569008</link>
      <description>If you use Novell SLES, I´ve created an short guideline how to change the lsi driver based on Tom´s RedHat guide (Thanks Tom!).&lt;br /&gt;
You can find it in here:&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.vmachine.de/kb/index.php/Linux_Kernel_2.6_Problem_-_Read-Only_Filesystem_nach_Path_Failover"&gt;http://www.vmachine.de/kb/index.php/Linux_Kernel_2.6_Problem_-_Read-Only_Filesystem_nach_Path_Failover&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
It´s in German but the commands should be understandable by everyone.&lt;br /&gt;
&lt;br /&gt;
Dennis</description>
      <pubDate>Tue, 06 Feb 2007 16:01:29 GMT</pubDate>
      <author>doomdevice</author>
      <guid>http://communities.vmware.com/message/569008?tstart=0#569008</guid>
      <dc:date>2007-02-06T16:01:29Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/567947?tstart=0#567947</link>
      <description>So, as I suspected, the patch is question is simply not enough.  Of interest however is that Redhat has commited a patch to the U5 stream which does include a supposed fix (and one that looks "sufficient" to me).  Test kernels that include this fix are available at the following location:&lt;br /&gt;
&lt;br /&gt;
&lt;a class="jive-link-external" href="http://people.redhat.com/~jbaron/rhel4/RPMS.kernel/"&gt;http://people.redhat.com/~jbaron/rhel4/RPMS.kernel/&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
It would be interesting to see what you tests turn up with those kernels.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Mon, 05 Feb 2007 15:47:04 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/567947?tstart=0#567947</guid>
      <dc:date>2007-02-05T15:47:04Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/567588?tstart=0#567588</link>
      <description>After running several tests w/ the latest kernel, I've reverted back to using Tom's patched drivers. The .08 release will eventually fail under stress.&lt;br /&gt;
&lt;br /&gt;
Sigh..</description>
      <pubDate>Mon, 05 Feb 2007 04:17:45 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/567588?tstart=0#567588</guid>
      <dc:date>2007-02-05T04:17:45Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/567163?tstart=0#567163</link>
      <description>The patch in question appears to be as follows:&lt;br /&gt;
&lt;br /&gt;
--- linux-2.6.9/fs/ext3/dir.c~	2007-01-08 16:19:36.000000000 -0500&lt;br /&gt;
+++ linux-2.6.9/fs/ext3/dir.c	2007-01-08 16:22:10.000000000 -0500&lt;br /&gt;
@@ -155,7 +155,7 @@ static int ext3_readdir(struct file * fi&lt;br /&gt;
 					brelse (tmp);&lt;br /&gt;
 			}&lt;br /&gt;
 			if (num) {&lt;br /&gt;
-				ll_rw_block (READA, num, bha);&lt;br /&gt;
+				ll_rw_block (READ, num, bha);&lt;br /&gt;
 				for (i = 0; i &amp;lt; num; i++)&lt;br /&gt;
 					brelse (bha&lt;i&gt;);&lt;br /&gt;
 			}&lt;br /&gt;
&lt;br /&gt;
I'm not sure that this is enough to correct the issue in VMware because the Bugzilla seems to imply the issue was only seen in cases that were using PowerPath or dm-multipath.  When using READA request under heavy load the bio request can sometimes return EAGAIN which is basically saying "I had a temporary error, please try again" but the ext3 code did not actually expect to handle this condition.&lt;br /&gt;
&lt;br /&gt;
Redhat initially looked at backporting the code in more recent 2.6 kernels but this appeared to have more complex interactions and they opted for the simple, low-risk fix, which appears to basically forgoe readahead on directory lookups.  It's possible that this might have a minor impact on performance, but probably not noticable in most workloads.&lt;br /&gt;
&lt;br /&gt;
I'll be interested in seeing how your tests turn out so let us know.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom&lt;/i&gt;</description>
      <pubDate>Sat, 03 Feb 2007 21:55:14 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/567163?tstart=0#567163</guid>
      <dc:date>2007-02-03T21:55:14Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/564829?tstart=0#564829</link>
      <description>Alright people.. The new kernel update for RedHat claims to fix the issue:&lt;br /&gt;
&lt;br /&gt;
&lt;a class="jive-link-external" href="http://rhn.redhat.com/errata/RHSA-2007-0014.html"&gt;http://rhn.redhat.com/errata/RHSA-2007-0014.html&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
Specifically:&lt;br /&gt;
&lt;br /&gt;
213921 - SAN file systems becoming read-only&lt;br /&gt;
&lt;br /&gt;
I'm going to pop it on a few VMs and beat the crap out of them to see.&lt;br /&gt;
&lt;br /&gt;
Anyone care to look at the patches applied to this update to see how they fixed it?</description>
      <pubDate>Thu, 01 Feb 2007 05:07:12 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/564829?tstart=0#564829</guid>
      <dc:date>2007-02-01T05:07:12Z</dc:date>
      <clearspace:dateToText>2 years, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>3</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/556390?tstart=0#556390</link>
      <description>&lt;div class="jive-quote"&gt;Is this being asked because there is a possibility&lt;br /&gt;
that having a NIC bond going to the ISCSI target&lt;br /&gt;
could mitigate this issue? The reason I ask is I have&lt;br /&gt;
several ESX Hosts hitting an EQL ISCSI SAN, and I&lt;br /&gt;
have several RHEL4 UP4 guests with Sybase Databases&lt;br /&gt;
running in the on the guest and I've never seen this&lt;br /&gt;
issue. Even when I've spent 9 or 10 hours loading&lt;br /&gt;
1TB+ database.&lt;/div&gt;
&lt;br /&gt;
OK, I know you didn't ask me this question, but I thought my experiences might be interesting to you.  We have ESX servers running against several different storage arrays, a CX400 via FC, a CX700 via FC, a AX150i via iSCSI, and an Equalogic PS300E via iSCSI (and also a few systems running on local storage like IBM ServeRAID and Dell PERC controllers).&lt;br /&gt;
&lt;br /&gt;
We've found the problem to be fairly easy to reproduce on the AX150i, as well as the CX700.  For whatever reason it's actually more difficult to trigger it on the CX400, we theorize it has to do with the smaller write cache and thus lower latency during heavy writes but it may also be related to the fact that the CX400 simply has less contention because it services fewer hosts.&lt;br /&gt;
&lt;br /&gt;
The Equalogic PS300E is by far the most difficult for us to reproduce the issue.  We did mange to create the issue even when running against this array but it took a pretty crazy level of fake I/O load running in multiple VM's to do it.&lt;br /&gt;
&lt;br /&gt;
On the other hand, if you pull the plug on a network cable, it's likely that you'll see the issue on every Linux guest by the time the iSCSI connection fails over to another port.  My opinion is that this fix is an important proactive fix if you want your guest to continue running during any failure scenario, especially since that's usually the reason you invest in all of the redundancy.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Sat, 20 Jan 2007 19:40:23 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/556390?tstart=0#556390</guid>
      <dc:date>2007-01-20T19:40:23Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>4</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/556386?tstart=0#556386</link>
      <description>&lt;div class="jive-quote"&gt;I'm seeing this behavior (Linux guest goes read-only&lt;br /&gt;
suddenly) on a guest OS that is running on _local_&lt;br /&gt;
storage.  (I.e., not on a SAN, and not on iSCSI.)&lt;br /&gt;
 Has anyone seen this?&lt;/div&gt;
&lt;br /&gt;
Oh yeah, I can definitely reproduce this issue on local storage, basically, anything that causes a delay of greater than 5 seconds in reading data from the disks.  It's especially easy to trigger if you have a storage system with large amounts of cache that creates long pauses during forced flushes.  We've reproduced the issues on IBM ServeRAID 6i and Dell PERC 4/Di controllers using locally attached UltraSCSI 320 drives.</description>
      <pubDate>Sat, 20 Jan 2007 19:26:41 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/556386?tstart=0#556386</guid>
      <dc:date>2007-01-20T19:26:41Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/556281?tstart=0#556281</link>
      <description>&lt;div class="jive-quote"&gt;Do you use NIC teaming?. If yes, could you check if&lt;br /&gt;
the problem happen when there is only one uplink?.&lt;/div&gt;
&lt;br /&gt;
Is this being asked because there is a possibility that having a NIC bond going to the ISCSI target could mitigate this issue? The reason I ask is I have several ESX Hosts hitting an EQL ISCSI SAN, and I have several RHEL4 UP4 guests with Sybase Databases running in the on the guest and I've never seen this issue. Even when I've spent 9 or 10 hours loading 1TB+ database.&lt;br /&gt;
&lt;br /&gt;
Reading this thread makes me wonder if I should proactively apply the 'fix', but since I haven't ran into the issue, I'm reluctant to mess with something that is working.</description>
      <pubDate>Sat, 20 Jan 2007 13:31:31 GMT</pubDate>
      <author>letoatrads</author>
      <guid>http://communities.vmware.com/message/556281?tstart=0#556281</guid>
      <dc:date>2007-01-20T13:31:31Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>5</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/556207?tstart=0#556207</link>
      <description>I'm seeing this behavior (Linux guest goes read-only suddenly) on a guest OS that is running on _local_ storage.  (I.e., not on a SAN, and not on iSCSI.)  Has anyone seen this?&lt;br /&gt;
&lt;br /&gt;
-Rick</description>
      <pubDate>Sat, 20 Jan 2007 04:23:17 GMT</pubDate>
      <author>osterber</author>
      <guid>http://communities.vmware.com/message/556207?tstart=0#556207</guid>
      <dc:date>2007-01-20T04:23:17Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/554926?tstart=0#554926</link>
      <description>For those who come to this forum and need a VMWare fix, after much searching I've come up with this link: &lt;a class="jive-link-external" href="http://kb.vmware.com/vmtnkb/search.do?cmd=displayKC&amp;#38;docType=kc&amp;#38;externalId=51306&amp;#38;sliceId=SAL_Public"&gt;http://kb.vmware.com/vmtnkb/search.do?cmd=displayKC&amp;#38;docType=kc&amp;#38;externalId=51306&amp;#38;sliceId=SAL_Public&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
We've been running this patch on several of our production Oracle and web servers for a few days now, and no more of those messages.&lt;br /&gt;
&lt;br /&gt;
Hope this saves some trouble for people!</description>
      <pubDate>Thu, 18 Jan 2007 19:15:17 GMT</pubDate>
      <author>tonywieczorek</author>
      <guid>http://communities.vmware.com/message/554926?tstart=0#554926</guid>
      <dc:date>2007-01-18T19:15:17Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/554773?tstart=0#554773</link>
      <description>thanks, I really need your help 'cause it's a critical machine and I have just one more day to find a solution...</description>
      <pubDate>Thu, 18 Jan 2007 16:54:34 GMT</pubDate>
      <author>CMCC</author>
      <guid>http://communities.vmware.com/message/554773?tstart=0#554773</guid>
      <dc:date>2007-01-18T16:54:34Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>3</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/554751?tstart=0#554751</link>
      <description>No, probably not.  &lt;br /&gt;
&lt;br /&gt;
Your problem appears to be that you updated your system and the initrd image does not contain the proper scsi drivers.  You will need to boot your RHEL4 U4 CD into rescue mode and fix modprobe.conf file and rebuild your initrd.  I'll try to reply with more details in your thread.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Thu, 18 Jan 2007 16:41:41 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/554751?tstart=0#554751</guid>
      <dc:date>2007-01-18T16:41:41Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>4</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/554025?tstart=0#554025</link>
      <description>&lt;div class="jive-quote"&gt;VMware has now posted their "solution" in the KB&lt;br /&gt;
article, basically just an RPM with new mptscsih&lt;br /&gt;
modules and a script which makes installing the&lt;br /&gt;
drivers easier.  So far it is only for 32-bit RHEL4&lt;br /&gt;
U3 or U4 &lt;/div&gt;
&lt;br /&gt;
is this solution useful for my problem???&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.vmware.com/community/thread.jspa?threadID=67781&amp;#38;tstart=0"&gt;http://www.vmware.com/community/thread.jspa?threadID=67781&amp;#38;tstart=0&lt;/a&gt;</description>
      <pubDate>Wed, 17 Jan 2007 20:29:16 GMT</pubDate>
      <author>CMCC</author>
      <guid>http://communities.vmware.com/message/554025?tstart=0#554025</guid>
      <dc:date>2007-01-17T20:29:16Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>5</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/553320?tstart=0#553320</link>
      <description>VMware has now posted their "solution" in the KB article, basically just an RPM with new mptscsih modules and a script which makes installing the drivers easier.  So far it is only for 32-bit RHEL4 U3 or U4 (looks like that's all they've made binaries for).  Still, for people who only want to run "supported" solutions then at least you finally have an option.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Wed, 17 Jan 2007 01:42:16 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/553320?tstart=0#553320</guid>
      <dc:date>2007-01-17T01:42:16Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>6</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/552225?tstart=0#552225</link>
      <description>Hello, have the same problem. Thanks to your comments,  we found a solution.&lt;br /&gt;
&lt;br /&gt;
Many thanks to all of you who got involved in this thread, this has been really really helpful.&lt;br /&gt;
You rule!</description>
      <pubDate>Mon, 15 Jan 2007 17:39:28 GMT</pubDate>
      <author>Daniel Herrero</author>
      <guid>http://communities.vmware.com/message/552225?tstart=0#552225</guid>
      <dc:date>2007-01-15T17:39:28Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>7</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/551238?tstart=0#551238</link>
      <description>That is pretty funny.  I'm sure glad that lots of users reported it and thus allowed them to "discover" it.  I also like the note that was added to the install documentation telling you to look at the KB article which basically says, "if you use RHEL4 U3 or U4 then don't upgrade 3.x" but they still list RHEL4 U3 or U4 as supported for 3.x.&lt;br /&gt;
&lt;br /&gt;
So basically their saying, "We support it, but that doesn't mean we can make it work".  Gee thanks, I always wondered what I was paying for.&lt;br /&gt;
&lt;br /&gt;
Oh well, I guess we can look forward to a fix eventually.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Fri, 12 Jan 2007 22:12:23 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/551238?tstart=0#551238</guid>
      <dc:date>2007-01-12T22:12:23Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>8</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/551127?tstart=0#551127</link>
      <description>Vmware uncovered this? Hahahahaha.. that's spin! Vmware did not uncover this.. their users did!&lt;br /&gt;
&lt;br /&gt;
Makes you wonder just what goes on when they certify a specific OS distribution to work on Virtual Infrastructure....</description>
      <pubDate>Fri, 12 Jan 2007 20:15:56 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/551127?tstart=0#551127</guid>
      <dc:date>2007-01-12T20:15:56Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>9</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/551117?tstart=0#551117</link>
      <description>VMware has uncovered a problem with RHEL4 U3, RHEL4 U4 or SLES9 SP3 virtual machines running on SAN or iSCSI storage. File systems could become read-only in the event of busy I/O retry or path failover. VMware does not recommend ESX Server 3.x customers upgrade virtual machines to these operating systems until this issue is resolved.&lt;br /&gt;
&lt;br /&gt;
From: &lt;a class="jive-link-external" href="http://kb.vmware.com/vmtnkb/dynamickc.do?externalId=51306&amp;#38;sliceId=SAL_Public&amp;#38;command=show&amp;#38;forward=nonthreadedKC&amp;#38;kcId=51306"&gt;http://kb.vmware.com/vmtnkb/dynamickc.do?externalId=51306&amp;#38;sliceId=SAL_Public&amp;#38;command=show&amp;#38;forward=nonthreadedKC&amp;#38;kcId=51306&lt;/a&gt;</description>
      <pubDate>Fri, 12 Jan 2007 19:56:05 GMT</pubDate>
      <author>CMCC</author>
      <guid>http://communities.vmware.com/message/551117?tstart=0#551117</guid>
      <dc:date>2007-01-12T19:56:05Z</dc:date>
      <clearspace:dateToText>2 years, 10 months ago</clearspace:dateToText>
      <clearspace:replyCount>22</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/528914?tstart=0#528914</link>
      <description>You should be able to look at the source and make the appropriate change as the code in FC6 is not significantly different in this particular area but I think the file is called mptscsih.c in the newer kernels.&lt;br /&gt;
&lt;br /&gt;
I don't have the FC6 source handy, but basically, if you simply simply take the FC6 source and look in drivers/message/fusion there should be a file called mptscsih.c  Open this file and do a search for DID_BUS_BUSY.  Approximately the third hit from the top of the file should be a line that looks like this:&lt;br /&gt;
&lt;br /&gt;
sc-&amp;gt;result = (DID_BUS_BUSY &amp;lt;&amp;lt; 16) | scsi_status;&lt;br /&gt;
&lt;br /&gt;
Simply change the DID_BUS_BUSY to DID_OK so that the line looks like this:&lt;br /&gt;
&lt;br /&gt;
sc-&amp;gt;result = (DID_OK &amp;lt;&amp;lt; 16) | scsi_status;&lt;br /&gt;
&lt;br /&gt;
And then remcompile and you should be good.  If you can't get this to work I can try to put some more specific instructions together in another day or so, perhaps even a download, I just don't have time at the moment.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Sat, 09 Dec 2006 00:17:29 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/528914?tstart=0#528914</guid>
      <dc:date>2006-12-09T00:17:29Z</dc:date>
      <clearspace:dateToText>2 years, 11 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/528654?tstart=0#528654</link>
      <description>Not that I am aware of. I try to stay away from Fedora for production use, as it is a testing ground for new technology. RedHat EL and derivatives aren't quite so cutting edge, but their bugs are well known.</description>
      <pubDate>Fri, 08 Dec 2006 18:13:41 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/528654?tstart=0#528654</guid>
      <dc:date>2006-12-08T18:13:41Z</dc:date>
      <clearspace:dateToText>2 years, 11 months ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/528633?tstart=0#528633</link>
      <description>Hello:&lt;br /&gt;
&lt;br /&gt;
I am working on a Fedora Core 6 box, and have the same issue.&lt;br /&gt;
Currently running  2.6.18-1.2849.  When I try to compile the patch, of course, it breaks horribly:&lt;br /&gt;
&lt;br /&gt;
make -C /lib/modules/2.6.18-1.2849.fc6/build SUBDIRS=/usr/src/fusion clean&lt;br /&gt;
make[1]: Entering directory `/usr/src/kernels/2.6.18-1.2849.fc6-i686'&lt;br /&gt;
make[1]: Leaving directory `/usr/src/kernels/2.6.18-1.2849.fc6-i686'&lt;br /&gt;
make -C /lib/modules/2.6.18-1.2849.fc6/build SUBDIRS=/usr/src/fusion modules&lt;br /&gt;
make[1]: Entering directory `/usr/src/kernels/2.6.18-1.2849.fc6-i686'&lt;br /&gt;
  CC [M]  /usr/src/fusion/mptbase.o&lt;br /&gt;
/usr/src/fusion/mptbase.c:50:26: error: linux/config.h: No such file or directory&lt;br /&gt;
In file included from /usr/src/fusion/mptbase.c:71:&lt;br /&gt;
/usr/src/fusion/mptbase.h:1088:28: error: linux/diskdump.h: No such file or directory&lt;br /&gt;
/usr/src/fusion/mptbase.c: In function âmpt_suspendâ:&lt;br /&gt;
/usr/src/fusion/mptbase.c:1646: error: switch quantity not an integer&lt;br /&gt;
/usr/src/fusion/mptbase.c: In function âMakeIocReadyâ:&lt;br /&gt;
/usr/src/fusion/mptbase.c:2510: error: implicit declaration of function âcrashdump_modeâ&lt;br /&gt;
/usr/src/fusion/mptbase.c:2510: error: implicit declaration of function âdiskdump_mdelayâ&lt;br /&gt;
make[2]: *** [/usr/src/fusion/mptbase.o] Error 1&lt;br /&gt;
make[1]: *** [_module_/usr/src/fusion] Error 2&lt;br /&gt;
make[1]: Leaving directory `/usr/src/kernels/2.6.18-1.2849.fc6-i686'&lt;br /&gt;
make: *** [default] Error 2&lt;br /&gt;
&lt;br /&gt;
Any patches for this kernel? &lt;br /&gt;
&lt;br /&gt;
Thanks,&lt;br /&gt;
&lt;p /&gt;
-- Juan</description>
      <pubDate>Fri, 08 Dec 2006 17:52:22 GMT</pubDate>
      <author>jarjona</author>
      <guid>http://communities.vmware.com/message/528633?tstart=0#528633</guid>
      <dc:date>2006-12-08T17:52:22Z</dc:date>
      <clearspace:dateToText>2 years, 11 months ago</clearspace:dateToText>
      <clearspace:replyCount>4</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/519484?tstart=0#519484</link>
      <description>Just a followup to this thread. So far, I've had good luck with Tom's patch and haven't had any guests go read-only since. As a result, we put the system back into uber-aggresive DRS mode and it's done 798 migrations of virtual machines so far.&lt;br /&gt;
&lt;br /&gt;
Still no word from Vmware or RedHat on an "official" fix yet.</description>
      <pubDate>Fri, 24 Nov 2006 01:18:10 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/519484?tstart=0#519484</guid>
      <dc:date>2006-11-24T01:18:10Z</dc:date>
      <clearspace:dateToText>3 years, 1 hour ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/512881?tstart=0#512881</link>
      <description>&lt;div class="jive-quote"&gt;BTW, I'd be interested in hearing more about your&lt;br /&gt;
56/60 byte problem.  Sounds like you have a switch&lt;br /&gt;
that is not adding the padding if it strips the VLAN&lt;br /&gt;
tag, which is actually very common.  Do you have&lt;br /&gt;
another thread about it?  Did you come up with the&lt;br /&gt;
patch yourself or is it something from VMware?&lt;/div&gt;
&lt;br /&gt;
I don't know where the initial information came from. I suspect our Operations Manager found the information somewhere in these forums and/or out on the net.&lt;br /&gt;
&lt;br /&gt;
The specific "patch" is nothing more than changing&lt;br /&gt;
&lt;br /&gt;
net.h:#define ETH_MIN_FRAME_LEN     60&lt;br /&gt;
&lt;br /&gt;
to a value of  56 in the vmxnet driver source.&lt;br /&gt;
&lt;br /&gt;
And yes, this is related to a specific vendors equipment not properly padding VLAN tagged packets. We've had discussions with them, but they seemt to just not get it. &lt;img class="jive-emoticon" border="0" src="http://communities.vmware.com/images/emoticons/wink.gif" alt=";)" /&gt;</description>
      <pubDate>Tue, 14 Nov 2006 02:02:46 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/512881?tstart=0#512881</guid>
      <dc:date>2006-11-14T02:02:46Z</dc:date>
      <clearspace:dateToText>3 years, 1 week ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/512872?tstart=0#512872</link>
      <description>Excellent, thanks Damin.&lt;br /&gt;
&lt;br /&gt;
BTW, I'd be interested in hearing more about your 56/60 byte problem.  Sounds like you have a switch that is not adding the padding if it strips the VLAN tag, which is actually very common.  Do you have another thread about it?  Did you come up with the patch yourself or is it something from VMware?&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Tue, 14 Nov 2006 01:40:51 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/512872?tstart=0#512872</guid>
      <dc:date>2006-11-14T01:40:51Z</dc:date>
      <clearspace:dateToText>3 years, 1 week ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/511216?tstart=0#511216</link>
      <description>&lt;div class="jive-quote"&gt;Here are the links to the kernels:&lt;br /&gt;
&lt;p /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-2.6.9-42.0.3.VM.src.rpm"&gt;http://www.nacs.net/yum/kernel-2.6.9-42.0.3.VM.src.rpm&lt;/a&gt;&lt;br /&gt;
&lt;p /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-2.6.9-42.0.3.VM.x86_64"&gt;http://www.nacs.net/yum/kernel-2.6.9-42.0.3.VM.x86_64&lt;/a&gt;.&lt;br /&gt;
rpm&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-debuginfo-2.6.9-42.0.3"&gt;http://www.nacs.net/yum/kernel-debuginfo-2.6.9-42.0.3&lt;/a&gt;.&lt;br /&gt;
VM.x86_64.rpm&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-devel-2.6.9-42.0.3.VM.x"&gt;http://www.nacs.net/yum/kernel-devel-2.6.9-42.0.3.VM.x&lt;/a&gt;&lt;br /&gt;
86_64.rpm&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-largesmp-2.6.9-42.0.3.V"&gt;http://www.nacs.net/yum/kernel-largesmp-2.6.9-42.0.3.V&lt;/a&gt;&lt;br /&gt;
M.x86_64.rpm&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-largesmp-devel-2.6.9-42"&gt;http://www.nacs.net/yum/kernel-largesmp-devel-2.6.9-42&lt;/a&gt;&lt;br /&gt;
.0.3.VM.x86_64.rpm&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-smp-2.6.9-42.0.3.VM.x86"&gt;http://www.nacs.net/yum/kernel-smp-2.6.9-42.0.3.VM.x86&lt;/a&gt;&lt;br /&gt;
_64.rpm&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-smp-devel-2.6.9-42.0.3"&gt;http://www.nacs.net/yum/kernel-smp-devel-2.6.9-42.0.3&lt;/a&gt;.&lt;br /&gt;
VM.x86_64.rpm&lt;/div&gt;
&lt;br /&gt;
Did some testing, and the patch that I applied when I rolled up these kernels does NOT give the desired behaviour. Guests go read-only.&lt;br /&gt;
&lt;br /&gt;
However, on a separate note, I did follow the process outlined in Tom's article using his driver archive and I am happy to report that guests with the patched drivers have sustained 24 hours of straight Iozone abuse, with hundreds of AsyncIO timeouts on the Iscsi layer. We're hammering the living crap out of the Iscsi bus and it's working like a champ.&lt;br /&gt;
&lt;br /&gt;
I've created an automated script for building / patching our RedHat boxes. You can review the script here:&lt;br /&gt;
&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/n2net-vmfix.sh"&gt;http://www.nacs.net/n2net-vmfix.sh&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
This does two things;&lt;br /&gt;
1. Grabs, builds and installs the patched Fusion MPT drivers&lt;br /&gt;
2. Grabs, builds and installs a patched Vmxnet driver&lt;br /&gt;
&lt;br /&gt;
The Vmxnet driver deals with a specific issue that we have related to runt-packets on a Vlan tagged network, whereby if a piece of hardware does not properly pad a tagged packet to 60 bytes, it will be dropped. In our case, it lowers the threshold to 56 bytes. It's a simple, one line change, but it allows the networking to function while the vendor scratches their head and tries to figure out why their hardware is broken.&lt;br /&gt;
&lt;br /&gt;
Let me know if the script works for you. Simply run it, answer any questions it asks, then reboot. Do a "dmesg" and search for "vmfix" and you should see the driver version listed.</description>
      <pubDate>Fri, 10 Nov 2006 08:18:14 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/511216?tstart=0#511216</guid>
      <dc:date>2006-11-10T08:18:14Z</dc:date>
      <clearspace:dateToText>3 years, 1 week ago</clearspace:dateToText>
      <clearspace:replyCount>3</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/511061?tstart=0#511061</link>
      <description>I havn't done the fix, but I can confirm that I am suffering from this same issue.  Its not necessarily under load for me though, I have used iozone alot and no issues.  The problem creeps up for me when esx does hba failover.  Doesnt always happen, but in my tests today I had on AS3update6 vm, ES4update3 vm, and a ES4update4 vm.  I pulled the fiber out of one of the hba's, took about 7 times, but eventually when problems did happen, they only happed on the ES4 vm's.  THe AS3 vm stayed up throughout.  I duplicated this 4 seperate times on two different ESX servers.&lt;br /&gt;
&lt;br /&gt;
Anybody have any idea when any sort of fix is out for this?  I guess its redhat that needs to fix, but this sucks big time for me as I have to put a total halt on all vmware use.</description>
      <pubDate>Thu, 09 Nov 2006 23:17:48 GMT</pubDate>
      <author>soleblazer</author>
      <guid>http://communities.vmware.com/message/511061?tstart=0#511061</guid>
      <dc:date>2006-11-09T23:17:48Z</dc:date>
      <clearspace:dateToText>3 years, 2 weeks ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/508866?tstart=0#508866</link>
      <description>Alright.. Those of you that are feeling adventurous...&lt;br /&gt;
&lt;br /&gt;
Until Vmware, LSI Logic and RedHat figure out an "official" solution to this that makes it into the mainline kernel, (and no, running the BusLogic driver is not an acceptable solution), I guess it is incumbent on the community to work around the issue.&lt;br /&gt;
&lt;br /&gt;
To that end, I've taken the patch discussed in Tom's Ramblings here:&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.tuxyturvy.com/blog/index.php?/archives/31-VMware-ESX-and-ext3-journal-aborts.html"&gt;http://www.tuxyturvy.com/blog/index.php?/archives/31-VMware-ESX-and-ext3-journal-aborts.html&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
and I have rolled it into a custom kernel RPM for Centos 4.4. If the patch works, it should revert the behavior of the driver to pre U3 behavior and if there is an underlying storage timeout, the VM should pause indefinitely instead of timing out. It's not pretty, but it should solve the problem as others have noted earlier in this thread.&lt;br /&gt;
&lt;br /&gt;
I've only created x86_64 kernels currently, as I run on Opteron processors, but if things work, I'll roll up some i686 versions.&lt;br /&gt;
&lt;br /&gt;
Here are the links to the kernels:&lt;br /&gt;
&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-2.6.9-42.0.3.VM.src.rpm"&gt;http://www.nacs.net/yum/kernel-2.6.9-42.0.3.VM.src.rpm&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-2.6.9-42.0.3.VM.x86_64.rpm"&gt;http://www.nacs.net/yum/kernel-2.6.9-42.0.3.VM.x86_64.rpm&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-debuginfo-2.6.9-42.0.3.VM.x86_64.rpm"&gt;http://www.nacs.net/yum/kernel-debuginfo-2.6.9-42.0.3.VM.x86_64.rpm&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-devel-2.6.9-42.0.3.VM.x86_64.rpm"&gt;http://www.nacs.net/yum/kernel-devel-2.6.9-42.0.3.VM.x86_64.rpm&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-largesmp-2.6.9-42.0.3.VM.x86_64.rpm"&gt;http://www.nacs.net/yum/kernel-largesmp-2.6.9-42.0.3.VM.x86_64.rpm&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-largesmp-devel-2.6.9-42.0.3.VM.x86_64.rpm"&gt;http://www.nacs.net/yum/kernel-largesmp-devel-2.6.9-42.0.3.VM.x86_64.rpm&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-smp-2.6.9-42.0.3.VM.x86_64.rpm"&gt;http://www.nacs.net/yum/kernel-smp-2.6.9-42.0.3.VM.x86_64.rpm&lt;/a&gt;&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.nacs.net/yum/kernel-smp-devel-2.6.9-42.0.3.VM.x86_64.rpm"&gt;http://www.nacs.net/yum/kernel-smp-devel-2.6.9-42.0.3.VM.x86_64.rpm&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
NOTE: I didn't take the time to sign these RPMS, and while there is yum repodata that you can use, it will complain about the RPMS not being signed and refuse to install them. This is probably configurable, but this is only designed to be a temporary measure anyway, so just let me know if they work for you!</description>
      <pubDate>Mon, 06 Nov 2006 16:22:46 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/508866?tstart=0#508866</guid>
      <dc:date>2006-11-06T16:22:46Z</dc:date>
      <clearspace:dateToText>3 years, 2 weeks ago</clearspace:dateToText>
      <clearspace:replyCount>5</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/507381?tstart=0#507381</link>
      <description>Good, I was pretty sure I didn't miss this but couldn't be sure.&lt;br /&gt;
&lt;br /&gt;
I'm very curious if others are seeing success with the patch.  Both of my most troublesome systems just hit the 10 day mark with no issues.  Previous record was 7 days and typically only 2-3 days so it seems to be working great for me.&lt;br /&gt;
&lt;br /&gt;
I've also had two other users who have reported at least initial success but more datapoints and a little more history would be nice.&lt;br /&gt;
&lt;br /&gt;
Thanks,&lt;br /&gt;
Tom</description>
      <pubDate>Fri, 03 Nov 2006 03:55:12 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/507381?tstart=0#507381</guid>
      <dc:date>2006-11-03T03:55:12Z</dc:date>
      <clearspace:dateToText>3 years, 2 weeks ago</clearspace:dateToText>
      <clearspace:replyCount>6</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/507269?tstart=0#507269</link>
      <description>It was my mistake, my rpmbuild had failed to update the source. The problem seems to exist on U4 too.</description>
      <pubDate>Thu, 02 Nov 2006 22:23:24 GMT</pubDate>
      <author>paithal</author>
      <guid>http://communities.vmware.com/message/507269?tstart=0#507269</guid>
      <dc:date>2006-11-02T22:23:24Z</dc:date>
      <clearspace:dateToText>3 years, 3 weeks ago</clearspace:dateToText>
      <clearspace:replyCount>7</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/502636?tstart=0#502636</link>
      <description>Where did you decide that U4 rolled back this change?  The code I am providing is against the mptscsih driver in U4, specifically, I took the mptscsih driver from the 2.6.9-42.0.3 kernel, the most recent public release, and patched that.&lt;br /&gt;
&lt;br /&gt;
I'd be interested to know why you think this was rolled back in U4.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Thu, 26 Oct 2006 19:47:41 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/502636?tstart=0#502636</guid>
      <dc:date>2006-10-26T19:47:41Z</dc:date>
      <clearspace:dateToText>3 years, 4 weeks ago</clearspace:dateToText>
      <clearspace:replyCount>8</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/502077?tstart=0#502077</link>
      <description>Well done Tom. I looked the the mptscsi source too. Yes, U3 added the code that introduced this incorrect behavior. Looks like U4 rolledback the change.&lt;br /&gt;
&lt;br /&gt;
For others who do not want to upgrade to U4 or patch this driver module with the fix, you can use BusLogic emulation as the guest SCSI controller.</description>
      <pubDate>Thu, 26 Oct 2006 08:22:31 GMT</pubDate>
      <author>paithal</author>
      <guid>http://communities.vmware.com/message/502077?tstart=0#502077</guid>
      <dc:date>2006-10-26T08:22:31Z</dc:date>
      <clearspace:dateToText>3 years, 4 weeks ago</clearspace:dateToText>
      <clearspace:replyCount>9</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/500878?tstart=0#500878</link>
      <description>I am currently in Dallas at Astricon, however, David Coulson our Operations Manager has patched a virtual machine or two with the driver.  I will ask him to post his experience here.</description>
      <pubDate>Tue, 24 Oct 2006 22:38:32 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/500878?tstart=0#500878</guid>
      <dc:date>2006-10-24T22:38:32Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/500875?tstart=0#500875</link>
      <description>We have now survived 24 hours running our simple iozone benchmark which used to trigger the issue in only a few minutes so I'm very interested in your results.&lt;br /&gt;
&lt;br /&gt;
If you do have the problem, if you could possibly capture the output of dmesg that would be great.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Tue, 24 Oct 2006 22:36:15 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/500875?tstart=0#500875</guid>
      <dc:date>2006-10-24T22:36:15Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>10</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/500801?tstart=0#500801</link>
      <description>Thanks. I like to hear from Damin too regarding his configuration and how was the usage of iSCSI.</description>
      <pubDate>Tue, 24 Oct 2006 20:25:18 GMT</pubDate>
      <author>paithal</author>
      <guid>http://communities.vmware.com/message/500801?tstart=0#500801</guid>
      <dc:date>2006-10-24T20:25:18Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/500559?tstart=0#500559</link>
      <description>I have updated my development CentOS 4.4 VM with your proposed fix. Usually it takes me 24-48 hours to encounter a read-only file system. So far it's been working well since 9:00 AM EST.&lt;br /&gt;
&lt;br /&gt;
If the VM is stable after a week I'll be fairly certain the cause of this problem has been found.&lt;br /&gt;
&lt;br /&gt;
Thank you Tom for the patch and directions.</description>
      <pubDate>Tue, 24 Oct 2006 16:04:36 GMT</pubDate>
      <author>CTeague</author>
      <guid>http://communities.vmware.com/message/500559?tstart=0#500559</guid>
      <dc:date>2006-10-24T16:04:36Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>11</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/500403?tstart=0#500403</link>
      <description>Wow, you sure ask a lot of questions!  &lt;img class="jive-emoticon" border="0" src="http://communities.vmware.com/images/emoticons/happy.gif" alt=":)" /&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;What kind of test you are doing to get to the read-only situation?.&lt;br /&gt;
&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Basically, anything that eventually leads to a timeout on the ESX host.  For the most part this simply involves running disk intensive benchmarks on multiple guests.  We've been using iozone, but I would suspect you could use almost anything that generates a lot of I/O.  &lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;I assume the boot disk is on iSCSI volume.&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
No, boot disk is on locally attached RAID storage (Dell PERC 4)&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Do you use virtual disk in VMFS volume or RDM for guest's boot disk?. If using VMFS, do you perform any I/O's to VMFS volume apart from booting (and guest doing I/O's)?.&lt;br /&gt;
&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
We use a virtual disk in a VMFS volume.  Yes, of course there are I/O's other than booting since the entire virtual disk for the guest is on the VMFS volume.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Are you sharing the LUN with any other server?.&lt;br /&gt;
&lt;/b&gt;&lt;br /&gt;
I can't remember, but did you mention which target array you are using?.&lt;br /&gt;
&lt;br /&gt;
One other ESX server.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Do you use NIC teaming?. If yes, could you check if the problem happen when there is only one uplink?. &lt;br /&gt;
&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Yes, but the problem will happen even with a single uplink.&lt;br /&gt;
&lt;br /&gt;
You seem to be under the impression that the problem can only happen with iSCSI, however, we have reproduced the issue with locally attached RAID, and with fiber channel attached storage.  Basically, if you can genereate enough I/O that you get an Async I/O error, the chances of seeing your ext3 volumes go read only is high.&lt;br /&gt;
&lt;br /&gt;
If you see a message like this in your log:&lt;br /&gt;
&lt;br /&gt;
Oct 22 01:28:19 esxhost1 vmkernel: 13:20:19:56.753 cpu3:1028)SCSI: 3731: AsyncIO timeout (5000); aborting cmd w/ sn 399366, handle b78/0x6a028a8&lt;br /&gt;
&lt;br /&gt;
You are probably going to have a problem.  With the small change I made to the LSI Logic driver the linux systems seem to be able to survive these timeouts.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Tue, 24 Oct 2006 13:06:35 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/500403?tstart=0#500403</guid>
      <dc:date>2006-10-24T13:06:35Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/500252?tstart=0#500252</link>
      <description>What kind of test you are doing to get to the read-only situation?.&lt;br /&gt;
&lt;br /&gt;
I assume the boot disk is on iSCSI volume. Do you use virtual disk in VMFS volume or RDM for guest's boot disk?. If using VMFS, do you perform any I/O's to VMFS volume apart from booting (and guest doing I/O's)?.&lt;br /&gt;
&lt;br /&gt;
Are you sharing the LUN with any other server?. &lt;br /&gt;
&lt;br /&gt;
I can't remember, but did you mention which target array you are using?.&lt;br /&gt;
&lt;br /&gt;
Do you use NIC teaming?. If yes, could you check if the problem happen when there is only one uplink?.</description>
      <pubDate>Tue, 24 Oct 2006 07:38:13 GMT</pubDate>
      <author>paithal</author>
      <guid>http://communities.vmware.com/message/500252?tstart=0#500252</guid>
      <dc:date>2006-10-24T07:38:13Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>9</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/500102?tstart=0#500102</link>
      <description>OK, today I decided to try patching the mptscsi driver to revert it's behavior to RHEL4 U2 and prior.  So far, this seems to be working.  I've run significant tests on several systems today, including a set of my most troublesome boxes, connected to a lowly AX150i, that previously would fail in 5-10 minutes.  So far the boxes have survived the day without issue.&lt;br /&gt;
&lt;br /&gt;
There have been several iSCSI timeouts issues on the hosts (as noted in the vmkernel logs) which would normally have been propagated as SCSI timeouts in the guest but with the patched driver the Linux system seems to just pause, and the return to normal operation.&lt;br /&gt;
&lt;br /&gt;
I've posted the patched files and some crude instructions at &lt;a class="jive-link-external" href="http://www.tuxyturvy.com/blog/index.php?/archives/31-VMware-ESX-and-ext3-journal-aborts.html"&gt;http://www.tuxyturvy.com/blog/index.php?/archives/31-VMware-ESX-and-ext3-journal-aborts.html&lt;/a&gt; so others can give it a try if they want too.&lt;br /&gt;
&lt;br /&gt;
I'm not willing to say this is 100% yet, perhaps the systems are just behaving today, but so far it looks good.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Mon, 23 Oct 2006 22:31:03 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/500102?tstart=0#500102</guid>
      <dc:date>2006-10-23T22:31:03Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>12</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/499325?tstart=0#499325</link>
      <description>I've been battling this issue for weeks so it's good to know I'm not alone.&lt;br /&gt;
&lt;br /&gt;
In researching the issue tonigh I found the following bug which looks like it might be related:&lt;br /&gt;
&lt;br /&gt;
&lt;a class="jive-link-external" href="https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=197158"&gt;https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=197158&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
It looks like this problem may be caused by a fairly recent (in the last few months) change in the upstream LSI Logic driver.  For RHEL4 systems this code was changed between U2 and U3.  I wonder if we could simply recompile the mptscsih driver from U2 for use in U3 and newer as a temporary solution.  Has anybody tried that?&lt;br /&gt;
&lt;br /&gt;
Basically, the newer driver adds an extra DID_BUS_BUSY status to a SCSI command failure of , MPI_SCSI_STATUS_BUSY which causes the SCSI mid-layer only tries 5 times before reporting a timeout to the upper layers.  With previous versions without this extra return status the SCSI mid-layer would try indefinitely.&lt;br /&gt;
&lt;br /&gt;
If this is really the issue it would be pretty simple to patch and compile a new mptscsih driver to temporarily work around this issue.  Maybe I'll give that a try tomorrow.&lt;br /&gt;
&lt;br /&gt;
Later,&lt;br /&gt;
Tom</description>
      <pubDate>Mon, 23 Oct 2006 03:50:03 GMT</pubDate>
      <author>tsightler</author>
      <guid>http://communities.vmware.com/message/499325?tstart=0#499325</guid>
      <dc:date>2006-10-23T03:50:03Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/499293?tstart=0#499293</link>
      <description>&lt;div class="jive-quote"&gt;I'm wondering if this fact is a "known issue" as in&lt;br /&gt;
"we will/must do something to fix it", or if it's an&lt;br /&gt;
expected behaviour.&lt;/div&gt;
&lt;br /&gt;
From what I understand, when the SAN times out (Which SANS can/will do occasionally!) Vmware is sending a signal to the Guest. This signal can either be "Storage Unavailable "or "Storage Timeout". The guest storage drivers can react differently to these signals. Under Linux, the LSI driver freaks out and tells the OS that storage is unavailable, and Ext3 goes into read-only mode.&lt;br /&gt;
&lt;br /&gt;
According to the engineer that I spoke with, this was fixed for Windows guests, but not yet for Linux. I'm not sure how they plan on fixing it.</description>
      <pubDate>Sun, 22 Oct 2006 23:56:03 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/499293?tstart=0#499293</guid>
      <dc:date>2006-10-22T23:56:03Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>11</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/499116?tstart=0#499116</link>
      <description>I'm wondering if this fact is a "known issue" as in "we will/must do something to fix it", or if it's an expected behaviour.</description>
      <pubDate>Sun, 22 Oct 2006 07:21:41 GMT</pubDate>
      <author>manuel.wenger</author>
      <guid>http://communities.vmware.com/message/499116?tstart=0#499116</guid>
      <dc:date>2006-10-22T07:21:41Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>12</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/499065?tstart=0#499065</link>
      <description>#316658&lt;br /&gt;
&lt;br /&gt;
BTW.. this was closed by Vmware because the "iSCSI Target you are using is not supported".&lt;br /&gt;
&lt;br /&gt;
Never mind that the behavior occurs on HCL supported iSCSI Targets as well.&lt;br /&gt;
&lt;br /&gt;
The basic cause of this problem, as explained to me by Vmware engineering, is that if a SAN timeout happens, the Vmkernel will send a message up to the guest reflecting the timeout. I was told that this is a known issue.</description>
      <pubDate>Sun, 22 Oct 2006 00:20:31 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/499065?tstart=0#499065</guid>
      <dc:date>2006-10-22T00:20:31Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>13</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/498972?tstart=0#498972</link>
      <description>Can you post the SR number?. We will take a look at the vm-support logs.</description>
      <pubDate>Sat, 21 Oct 2006 17:42:25 GMT</pubDate>
      <author>paithal</author>
      <guid>http://communities.vmware.com/message/498972?tstart=0#498972</guid>
      <dc:date>2006-10-21T17:42:25Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>14</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/496404?tstart=0#496404</link>
      <description>1. Vmkernel log was provided to Vmware support.&lt;br /&gt;
2. Yes I have an SR#.</description>
      <pubDate>Wed, 18 Oct 2006 01:30:40 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/496404?tstart=0#496404</guid>
      <dc:date>2006-10-18T01:30:40Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>15</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/496289?tstart=0#496289</link>
      <description>Who did you provide the vmkernel log?. Did you file an SR?. Do you have SR number?.</description>
      <pubDate>Tue, 17 Oct 2006 21:05:08 GMT</pubDate>
      <author>paithal</author>
      <guid>http://communities.vmware.com/message/496289?tstart=0#496289</guid>
      <dc:date>2006-10-17T21:05:08Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>16</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/496130?tstart=0#496130</link>
      <description>&lt;div class="jive-quote"&gt;Can you see if there are any errors reported in ESX's&lt;br /&gt;
/var/log/vmkernel?.&lt;/div&gt;
&lt;br /&gt;
Yep. And those have been provided to Vmware. When using the Software iSCSI initiator, the problem happens much more rapidly than using an HBA. This makes a lot of sense, as the HBA is interrupt driven, while the Software is system-load dependent. On a highly utilized Host, the Software iSCSI initiator is NOT going to be able to service the data as quickly as an HBA will.&lt;br /&gt;
&lt;br /&gt;
So.. back to your question.. We see buffer overruns for reserve/release activity, and several iSCSI timeouts. The timeouts appear to indicate a networking issue such as dropped packets (none showd on either end, or on either port!!!), but we can account for every packet back and forth between the client and the target and there ARE NO DROPPED PACKETS!&lt;br /&gt;
&lt;br /&gt;
We've even removed the switch from the mix and gone to crossover cables between the host and the target. Same problem.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="jive-quote"&gt;Also, make sure network connectivity is healthy and&lt;br /&gt;
there are no intermittent failures. Path to storage&lt;br /&gt;
being down for extended amount of time (for RH it is&lt;br /&gt;
typically 1min, I guess) can take the FS read-only.&lt;br /&gt;
Which target are you using?.&lt;/div&gt;
&lt;br /&gt;
IETD, specifically patched w/ Reserve/Release and several other patches that have been submitted back to the project as a result of 9 months of testing w/ Vmware.&lt;br /&gt;
&lt;p /&gt;
&lt;div class="jive-quote"&gt;Is this a shared network or dedicated to iSCSI only?.&lt;/div&gt;
&lt;br /&gt;
Dedicated to iSCSI only.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="jive-quote"&gt;Are any VLAN configuration in the picture?.&lt;/div&gt;
&lt;br /&gt;
No.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="jive-quote"&gt;Any&lt;br /&gt;
traffic shaping or bandwidth allocation restrictions&lt;br /&gt;
placed on the physical switch that is connected to&lt;br /&gt;
iSCSI?.&lt;/div&gt;
&lt;br /&gt;
On a crossover cable? &lt;img class="jive-emoticon" border="0" src="http://communities.vmware.com/images/emoticons/happy.gif" alt=":)" /&gt; No.</description>
      <pubDate>Tue, 17 Oct 2006 18:15:10 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/496130?tstart=0#496130</guid>
      <dc:date>2006-10-17T18:15:10Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>17</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/496127?tstart=0#496127</link>
      <description>&lt;div class="jive-quote"&gt;It's similar to issues with redhat scsi drivers in a&lt;br /&gt;
multipath environment, where SAN switch reset confuse&lt;br /&gt;
the mutlipath driver and cause the filesystem to flag&lt;br /&gt;
a corrupt journal.  Is it possible that under heavy&lt;br /&gt;
load your iSCSI network is dropping packets or&lt;br /&gt;
performing badly in some way.&lt;/div&gt;
&lt;br /&gt;
No. This happens on a direct crossover cable (I.E. no switch) between the HBA and the iSCSI Target box. The ports indicate no errors, dropped packets etc.. so the underlying network layer (all Cat-5E 6 feet of it) is fine.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="jive-quote"&gt;This might expose some&lt;br /&gt;
issues with the underlying iSCSI stack which are in&lt;br /&gt;
turn getting exposed to the filesystem ?&lt;/div&gt;
&lt;br /&gt;
I suspect that it is related to how Vmware passes SAN timeouts to it's Guests. I was told by a Second Level support that Vmware is aware of the problem, that they have fixed it for Windows in 3.0.1, but that they have NOT fixed it yet for Linux. I strongly, strongly, strongly, STRONGLY disagree that this is a problem with the iSCSI target. In all of our testing over 12 months, we've never had this issue w/ any other Clients. Our iSCSI target is used by more than just Vmware here.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="jive-quote"&gt;Might be&lt;br /&gt;
worth trying to get an idea of the health of the&lt;br /&gt;
network which you are sending your iSCSI i/o to see&lt;br /&gt;
if something is underperforming/failling under load.&lt;/div&gt;
&lt;br /&gt;
The network is 100% clean. No dropped packets, no resends, no errors, no overruns. Nothing. It's clean.&lt;br /&gt;
&lt;p /&gt;
&lt;div class="jive-quote"&gt;I totally agree that this problem should not be&lt;br /&gt;
happening tho...&lt;/div&gt;</description>
      <pubDate>Tue, 17 Oct 2006 18:10:08 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/496127?tstart=0#496127</guid>
      <dc:date>2006-10-17T18:10:08Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/494633?tstart=0#494633</link>
      <description>There's another thread discussing the same problem, in case you missed it:&lt;br /&gt;
&lt;br /&gt;
&lt;a class="jive-link-external" href="http://www.vmware.com/community/thread.jspa?threadID=58081"&gt;http://www.vmware.com/community/thread.jspa?threadID=58081&lt;/a&gt;</description>
      <pubDate>Sun, 15 Oct 2006 20:25:39 GMT</pubDate>
      <author>manuel.wenger</author>
      <guid>http://communities.vmware.com/message/494633?tstart=0#494633</guid>
      <dc:date>2006-10-15T20:25:39Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/494617?tstart=0#494617</link>
      <description>Wow, having this same problem I didn't even have to search to find another with it!&lt;br /&gt;
&lt;br /&gt;
I'm running CentOS 4.3 (4x guests) on ESX 3.0 and 3.0.1 w/ VMWare Tools installed.&lt;br /&gt;
&lt;br /&gt;
I experience the same exact read-only problem....I have had this happen on three occasions now and a reboot temporarily fixed the issue. The problem is that I don't have any heavy loads going to any of these guests (Apache web servers / mySQL servers, currently with little web traffic / db traffic as it's still heavy in development).&lt;br /&gt;
&lt;br /&gt;
I'm using PE2850 (Dual Duo-Core 2.8ghz / 8gb Ram / 2x QLogic PCI-E HBA) on a Sun based SAN. The SAN is controlled by a higher technical department within my school so the details I can give right now are lacking.&lt;br /&gt;
&lt;br /&gt;
Any ideas what could be causing this?</description>
      <pubDate>Sun, 15 Oct 2006 18:44:37 GMT</pubDate>
      <author>CTeague</author>
      <guid>http://communities.vmware.com/message/494617?tstart=0#494617</guid>
      <dc:date>2006-10-15T18:44:37Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/493769?tstart=0#493769</link>
      <description>Can you see if there are any errors reported in ESX's /var/log/vmkernel?.&lt;br /&gt;
&lt;br /&gt;
Also, make sure network connectivity is healthy and there are no intermittent failures. Path to storage being down for extended amount of time (for RH it is typically 1min, I guess) can take the FS read-only. Which target are you using?.&lt;br /&gt;
&lt;br /&gt;
Is this a shared network or dedicated to iSCSI only?. Are any VLAN configuration in the picture?. Any traffic shaping or bandwidth allocation restrictions placed on the physical switch that is connected to iSCSI?.</description>
      <pubDate>Fri, 13 Oct 2006 16:29:18 GMT</pubDate>
      <author>paithal</author>
      <guid>http://communities.vmware.com/message/493769?tstart=0#493769</guid>
      <dc:date>2006-10-13T16:29:18Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>18</clearspace:replyCount>
    </item>
    <item>
      <title>Re: ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/493661?tstart=0#493661</link>
      <description>It's similar to issues with redhat scsi drivers in a multipath environment, where SAN switch reset confuse the mutlipath driver and cause the filesystem to flag a corrupt journal.  Is it possible that under heavy load your iSCSI network is dropping packets or performing badly in some way. This might expose some issues with the underlying iSCSI stack which are in turn getting exposed to the filesystem ? Might be worth trying to get an idea of the health of the network which you are sending your iSCSI i/o to see if something is underperforming/failling under load.  &lt;br /&gt;
&lt;p /&gt;
I totally agree that this problem should not be happening tho...&lt;br /&gt;
&lt;br /&gt;
Message was edited by: &lt;br /&gt;
        jonhutchings</description>
      <pubDate>Fri, 13 Oct 2006 14:26:25 GMT</pubDate>
      <author>jonhutchings</author>
      <guid>http://communities.vmware.com/message/493661?tstart=0#493661</guid>
      <dc:date>2006-10-13T14:26:25Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
    </item>
    <item>
      <title>ESX 3.0.1 - Linux Guests go ReadOnly</title>
      <link>http://communities.vmware.com/message/493283?tstart=0#493283</link>
      <description>Hello,&lt;br /&gt;
   My hopes for the 3.0.1 release solving this problem have been dashed. Per the suggestion of Vmware support, we ripped out our Qlogic HBAs and went w/ software iSCSI and upgraded our 3.0.0 machine to 3.0.1.&lt;br /&gt;
&lt;br /&gt;
Same problem. Under heavy load, the Linux guests go into Read Only mode on their filesystem.&lt;br /&gt;
&lt;br /&gt;
Consoles indicate the following: &lt;br /&gt;
SCSI Error : &amp;lt;0 0 0 0&amp;gt; return code = 0x20008 &lt;br /&gt;
end_request: I/O error, dev sda, sector 4928181 &lt;br /&gt;
Aborting journal on device dm-0 &lt;br /&gt;
ext3_abort called. &lt;br /&gt;
EXT3-fs error (device dm-0): ext3_journal_start_sb: Detected aborted journal &lt;br /&gt;
Remounting filesystem read-only. &lt;br /&gt;
&lt;br /&gt;
All guests are using the LSI-Logic driver and have the latest Vmware tools installed.&lt;br /&gt;
&lt;br /&gt;
This is very clearly a Vmware issue, and we've now proven that it IS NOT related to the Qlogic QLA4010 HBAS, as we migrated to Software iSCSI and are getting the same problem.</description>
      <pubDate>Fri, 13 Oct 2006 02:45:32 GMT</pubDate>
      <author>Damin</author>
      <guid>http://communities.vmware.com/message/493283?tstart=0#493283</guid>
      <dc:date>2006-10-13T02:45:32Z</dc:date>
      <clearspace:dateToText>3 years, 1 month ago</clearspace:dateToText>
      <clearspace:replyCount>99</clearspace:replyCount>
    </item>
  </channel>
</rss>

