VMware

This Question is Possibly Answered

1 "correct" answer available (10 pts) 1 "helpful" answer available (6 pts)
1 2 3 Previous Next 30 Replies Last post: Jun 3, 2009 8:13 AM by CWedge@Amsa  

Single Threaded Application on HP DL 360 G3 faster than on Netapp 6070 posted: Oct 20, 2008 5:07 AM

Click to view meistermn's profile Master 1,162 posts since
Dec 7, 2004

The application Gupta SQL 9.0.1 is now running in a VM. The Datastores are on a Netapp 6070 (Raid DP 41 FC Disks a 300 GB)

The VM uses 5 LUN's. C: Windows 2003 D:Data E: Gupta SQL : F: Gupta SQL Logs G: Pagefile

Before it run on a HP DL360 G3 (Smart 5i raid controller 64 MB Cache) with 6 x 36 GB and 3 Logical Volumes on raid 5 .

On the HP DL360 G3 the databse update statistics runs 1Hour 3 Minuten.

The VM on the Netapp needs to 2 Hours 14 Minutes.

Analyzed Application SQL Gupta with perfmon: Average transfer size /sec for SQL Log 64 KB Read and Write (E:) and for SQL log 4 KB Read and Write

Performance Tweaks:

1.) Datacore Uptempo: Only 5 Mintes Faster. I think the DB , which 30 GB , is to big.

2.) Partition Aligment for E: and F Partition : 1 Hour 45 Minutes. Improvement 29 Minutes .

3.) OS Filesystem Blocksize(Clustersize) change for E: to 64 KB (default 4 KB) no improvement

4.) OS Filesystem Blocksize changed for F: to 32 KB (default 4 KB) no improvement

5.) Separate LUN for Pagefile G: no improvment

6.) VMware VC changed Queue Depth to 128 : no improvement

Arrgh!!!! What the hell is the problem? Is it cache for random io?


Click to view king@it.ibm.com's profile Virtuoso 2,927 posts since
Jan 16, 2004

Interesting.

Is this an I/O bound workload? Or is it CPU bound? Or ...

If it's I/O bound ... what does the disk queue lenght look like from Perf Monitor ?

Massimo.

Click to view Craig Baltzer's profile Expert 401 posts since
Oct 3, 2005

In addition to Disk Queue Length as Massimo suggested, Avg. Disk sec/Read and Avg. Disk sec/Write would be interesting to see, esp if you still have the DL360 G3 around for comparison.

Craig

Click to view Craig Baltzer's profile Expert 401 posts since
Oct 3, 2005

The dropping IO is just a symptom, something is causing the IO rate to drop. The disk sec/read and disk sec/write counters will give you some insight into whether the storage response is getting slower at the same time (i.e. if disk sec/write goes from 20ms to 500ms then disk usage (IO rate) is going to drop and you know to look at the IO path as a possible source. If the disk sec/write stay constant then contention likely lies elsewhere...
Click to view Craig Baltzer's profile Expert 401 posts since
Oct 3, 2005

Certainly "feels" like there are some IO challenges. Have you had a look at http://www.vmware.com/files/pdf/perf_analysis_methods_tn.pdf? It would be interesting to see what ESX is seeing in terms of IO for the LUNs being used for the databases, with GAVG/cmd and DAVG/cmd for the VM being of particular interest. I'd assume that since "read" is the issue that you'd mainly want to look at the database LUN as the log LUN shouldn't be getting used on a read operation...


Click to view Craig Baltzer's profile Expert 401 posts since
Oct 3, 2005
Forgot to ask if this VM was P2V'd from the physical DL360 or was a fresh OS install? If it was a P2V have all the DL360/HP specific drivers and devices been cleaned out of the VM?
Click to view mcowger's profile Virtuoso 2,071 posts since
Aug 22, 2007
How does your Host connect to the NetApp? FC? NFS? iSCSI? Over what kind of link?

--Matt

Click to view Craig Baltzer's profile Expert 401 posts since
Oct 3, 2005

The "3699%" is definitely strange. It should be close to "Memory Active" / "Memory Granted". Do any of your other VMs show a strange % like this? Wonder if this is a bug in the localization of the UI...

Consumed memory looks reasonable. Most times when the OS and applications start they allocate a bunch of memory (a.k.a. "consume" it) by setting up buffers, caches, etc. They then may never actually write any data there (i.e. cache never fills, etc.). In this case the VM is 3.9GB allocated to it, the OS + apps have issued memory requests for 3.6GB, and if busily using 1.5GB of that so the proportions make sense i.e. (granted > consumed > active).

There may not be a direct relationship between "Available" in System Information and "Consumed" in VC. Page sharing is going on (ESX looking for pages that can be shared thus reducing the "consumed" value). When I add "Memory Shared" to my graphs it is almost a mirror image of "Memory Consumed" (i.e. if "Memory Shared" is going up then "Memory Consumed" tends to be going down at the same time). If there are no other VMs running on the ESX server then they should be very close.

From a performance perspective memory looks good. The counters you have show the "Memory Balloon" driver is not active, meaning that there is no memory overcommitment putting pressure on this VM to give up memory. You could also add "Memory Swapped" just to verify that ESX is not doing any swapping, however I'm almost 100% certain there is no swapping from looking at Memory Active and Memory Consumed.

VMware Developer

SDKs, APIs, Videos, Learn and much more in the Developer community.

Learn More

Developer Sample Code

Increase your developer productivity with VMware API sample code.

Learn More

VMworld Sessions & Labs

Online access to the latest VMworld Sessions & Labs and online services.

Learn more

Purchase PSO Credits Online

Purchase credits to redeem training and consulting services online.

Buy Now

Community Hardware Software

View reported configurations or report your own.

Learn More

VMware vSphere

Come witness the next giant leap in virtualization.

Register Today

Communities