VMware Communities > Blogs > Virtualization Frontier > Tags

Blog Posts

Virtualization Frontier

10 Posts tagged with the performance tag
0

I just posted a new whitepaper that I have been working on for the last month or two. It was a great opportunity to test out how well Hyper-V performs with a large number of virtual machines (VMs) on a richly configured R900. You'll have to check out the whitepaper for the complete details. Here I'm going to give you a little bit of the background into how the paper was done.

I found out in early summer that we would be announcing and shipping some new 6-core Intel Xeon processors in our PowerEdge R900 server (4-socket, 4U). I was being asked what I thought would be a good way to show how well it performed for virtualization and recommended that we use Hyper-V and the Dell DVD store in a test similar to some tests that I had done in the past with ESX. At the time, Hyper-V was still just a release candidate, and I was told that I wouldn't get the processors until early August. Both of these things changed-Hyper-V went officially final with an RTM build (which was great), and I ended up not getting any 6-core processors to test with until late August (which was bad because I had only one week to complete testing at that point).

I switched over to EqualLogic for the storage in this test. Previous tests had mostly been with Dell | EMC CX series storage, but I wanted to get a chance to run a heavy load against the EqualLogic iSCSI storage. It was fun to set up and test with the new arrays. I thought that setup was much easier, and I learned about setting up MPIO with the iSCSI initiator on Windows 2008.

Showing that the R900 with 24 cores was able to support 40 VMs, achieving a total of 74K DVD Store Orders per minute, wouldn't mean anything unless it was compared to something else. Using an R900 and an HP DL585 as quad-core comparison points, the testing showed an advantage in not only performance, but performance per watt as well.

So check out the full paper, and let me know if you have any questions.

0 Comments Permalink
0


I just finished up testing for a new whitepaper on Hyper-V performance and one of the initial things I had to figure out was how to measure the CPU utilization on the server. What is known as the parent partition (the initial Windows Server 2008 that is installed on the server and then used to enable the Hyper-V role) does not include the CPU utilization for its VMs in the main Processor Utilization performance counter. This was surprising to me at first, but makes sense when you consider the architecture of Hyper-V. In this architecture, the VMs do not go through the parent partition to access the processors. (Although I/O does to use the device drivers of the parent partition.)

In order to solve this problem Microsoft has created some new performance counters that are specific to the Hyper-V Hypervisor. Early on in the betas for Hyper-V it wasn't clear what these counters did, as I blogged about back in January, but it is now much clearer with some guides from Microsoft.

In order to measure the total CPU utilization on a Windows Server 2008 system running VMs under Hyper-V use the Hyper-V Hypervisor Logical Processor % Total Run Time counter in Performance Monitor (more affectionately known as perfmon). If you just look at the % CPU Utilization or the performance tab of Task Manager you will only see the CPU utilization of the parent partition and not the VMs.

Todd

0 Comments Permalink
0


In response to a few questions that came up, I put together a video that shows how to do a cool trick with Microsoft Excel to create graphs of MD3000i array level performance data. I posted an entry here last week and also put up a wiki page about how you can use the smCLI command line tool to get array level performance data (Individual storage processors, array totals, and individual virtual disks) from an MD3000i. The output of this command is a csv (comma separated values) text file. I included on the wiki page a nice graph of the performance based on the data from this output file. What I didn't go into was how to create such a graph.

So now you have this great little video that will show you one way that you can use Excel to create a graph based on the data in the file. I don't claim to be a know-it-all when it comes to Excel, but the way that I do it in the video works. I would love to hear from anybody who knows of other (possibly better) ways to do it.

Todd

0 Comments Permalink
0

Virtual Geek has a some great performance and power consumption numbers up on his blog for virtualizing Microsoft Sharepoint. I've gotten lots of questions over the past year or so about how SharePoint would perform in a VM and this is the best data that I have seen to answer those questions. He has info about how you can download the full report from EMC's Powerlink site, but the charts and info in his blog post seem to cover the main points. The amount of savings in terms of power are so big as to be unbelievable, but after looking at the details behind them it all seems to add up. An excellent read - I recommend that you check it out.

Todd

0 Comments Permalink
0

A few months ago Dan Hambrick from our System Performance Analysis team had completed and published a performance report of what I thought to be a very complete and thorough look at the PERC 6 (PowerEdge RAID Controller). It posted to our humble site and I did a blog entry to highlight how cool I thought it was.

About a month later Dan comes back to my cube and tells me that he is getting all kinds of requests to do more testing. Of course any new factor could grow his already very large matrix and double the amount of testing. He ends up figuring out how to do a round of testing with the new MD1120 array and get the data into a form that is understandable in a relatively small number of graphs.

The new paper was posted just a few days ago and the amount of data that is behind it is staggering. The numbers behind the graphs are all in the Appendix if you are brave enough to take a look, but I would highly recommend that you stick with the really cool graphs.

Todd

0 Comments Permalink
0

During the weekly DellTechCenter Tech Tuesday chat a couple of weeks ago a question came up about performance monitoring with the PowerVault MD3000i iSCSI array. The initial question was how to do performance monitoring from Linux and we addressed it in a followup discussion thread. The answer at this point was to use iostat for Linux or perfmon in windows to monitor performance on each host that was attached to the MD3000i array. This lead to a follow on question which was how to monitor the performance for the entire array.

The management tool for the MD3000i, PowerVault Modular Disk Storage Manager, does not include any performance stats beyond basic iSCSI port level stats. Some investigation into the command line interface for the MD3000i revealed that there is a command to capture the performance stats on the array. Using the smcli, which is installed as part of the MDSM, it is possible to collect the performance stats for the array, controllers, and virtual disks to a csv file.

This turns out to be pretty cool and not too hard to do. Just a simple command and a little bit of spreadsheet magic and you too can produce cool performance graphs of your MD3000i. The details are posted here on delltechcenter.com.

Todd

0 Comments Permalink
0

In the past I was somewhat biased towards 2-socket servers for virtualization - which was due to the results of some testing we did. I had of course realized over the past year or so that things had changed and the 4-socket servers were now more competitive. So when I ran the series of chats on Selecting a Server for Virtualization, I decided that it was a good opportunity to re-run some of those exact same tests with the brand new R900 that had arrived in the lab. I posted the results on a TechCenter page so you can see the full results, but the short summary is that the R900 stacks up as more efficient than the 2950 we tested in the previous paper. The R900 showed 10 to 23 percent better performance per watt than the older 2950. The question that remains is how would the R900 compare to a current generation 2950?

Todd

0 Comments Permalink
0

Massive IOPS on ESX

Posted by ToddMuirhead May 22, 2008

They performance guys over at VMware have announced some really impressive IOPS testing results. Using three CX3-80 storage arrays they were able to push over 100,000 IO Operations per second from a single 4-socket server running ESX 3.5. Additionally, they reported the latency numbers which look very good as well.

I find this very interesting because it is in line with results that Kong and I got in our Exchange 2007 on ESX testing we did last year. We found that you could size storage for Exchange VMs the same as you size storage for Exchange on physical servers (up to the 2000 users in a single VM that we tested). Exchange is I/O intensive and is sensitive to any increased latencies. So while we didn't test a workload of 100,000 IOPS, we did see good storage performance with an application that is very sensitive to these kinds of things.

The other thing that I like about this test is that it is cool. As evidence I point to the cool graphs they included and the 495 15K RPM disks that were used.

Todd

0 Comments Permalink
0

New VMmark benchmark results highlight that the best performance for a single server is a 4-socket (or 16 cores with Quad-Core processors). This means that the R900 and R905 are at the top of the list, with the Intel based R900 slightly ahead of the AMD based R905. So if you absolutely need the most performance possible from a single server - then this is the way to go.

A little bit of analysis could lead you to believe that the 2-socket (or 8 core) servers are actually better performing. The VMmark score for the 4-socket R900 is 14.23 with 10 tiles, but the VMark score for the 2-socket 2950 III is 8.47 with 6 tiles. So on a per socket basis, the two-socket 2950 III is actually providing more performance. The same holds true for the two-socket M600 blade and two-socket R805 2U server.

There is much more to choosing a server than the results of a single benchmark, but I think that these results are fair barometers of performance. In general performance does not scale in a linear fashion when moving from 2-socket servers to 4-socket servers, and this seems to hold true with virtualization.

Todd

0 Comments Permalink
0

Virtual NIC Performance

Posted by ToddMuirhead Apr 17, 2008

During our regular TechTuesday webchat, a question came up about virtual NIC performance. The topic this week was Running Enterprise Apps on VMware, and one aspect of that is performance. Hywel, who has a successful SQL Server deployment on ESX, asked - why is he only getting 250 mb/s of throughput? I found a VMware blog post and a whitepaper that indicate it is possible to get much higher. Of course in real-world use through-put will be lower, but I would think that you could top 250 mb/s. I believe that the key to getting the best performance form a virtual NIC in VMware is to install the VMtools and use the VMxnet NIC.

Todd

0 Comments Permalink