Skip navigation
VMware
Office of the CTO - Josh Simons

Office of the CTO - Josh Simons

Manage categories

Close

Create and manage categories in High Performance Computing. Removing a category will not remove content.

Categories in High Performance Computing
Add a new category (0 remaining)

Recent Blog Posts

Refresh this widget
0

ibrn.pngOur research note exploring the use of InfiniBand in Red Hat guests with VM DirectPath I/O is now available for your reading pleasure here. In it, we present the results of bandwidth and latency tests run using two hosts connected back-to-back with Mellanox quad data rate (QDR) InfiniBand. We show that bandwidths over a wide range of messages sizes are comparable to those achievable in the non-virtualized case and that low latencies (under 2us) are achievable as well.

 

This paper represents our first performance results using RDMA. We expect to publish further papers examining the performance of full MPI applications as well as results related to Bhavesh's work on a virtualized RDMA device that would support RDMA while also maintaining the ability to perform vMotion and Snapshot operations.

422 Views 0 Comments Permalink Tags: performance, linux, esxi, infiniband, latency, vsphere, rdma
0

In this video interview, Jeff Buell, one of VMware's senior performance engineers, shares his perspectives on the importance of Big Data for customers and for VMware. He also addresses the issue of why his Hadoop testing showed significantly better performance running Hadoop on vSphere when compared to native in some instances. The video (running time 6:20) is here.

 

The full white paper detailing Jeff's Hadoop performance testing done in partnership with AMAX and Mellanox is here.

469 Views 0 Comments Permalink Tags: performance, hadoop, bigdata
0

As promised, Bhavesh's talk at the OpenFabrics User and Developer Workshop is now available for your viewing pleasure. In it, I spend a few minutes talking about some of the findings from our upcoming InfiniBand paper, and then turn it over to Bhavesh who presents the bulk of the talk. The video is here.

 

Part of Bhavesh's talk covers our interest in providing RDMA capabiltiies to guest operating system instances while maintaining the ability to vMotion (live migrate) such virtual machines. This is crucial if RDMA is going to gain wider acceptance in the modern, virtualized datacenter where the value of live migration has been repeatedly demonstrated.

 

From a technologist's perspective, it was good to see that Oracle apparently shares our view -- first that RDMA is gaining importance in the Enterprise (InfiniBand, for example, is the interconnect used in Oracle's Exa* line of appliances) and that live migration will be required for Enterprise use.

372 Views 0 Comments Permalink Tags: performance, infiniband, latency, hpc, rdma
0

The video of the Big Data panel at the OpenFabrics Alliance User and Developer Workshop has now been posted at insideHPC.com -- it is available here.

 

DK Panda from Ohio State University spoke first and gave a compressed version of a much longer talk that illustrates the benefits of high bandwidth and low latency interconnect for distributed Enterprise services like memcached as well as benchmarks like Yahoo's Cloud Serving Benchmark (YCSB) -- interesting stuff, especially as an illustration of the converging requirements we are seeing between HPC and Enterprise.

chaff.png

 

My talk starts at about 12m30s in the video and, embarrassingly since it was supposed to be a 10-minute talk, ends at about 29m30s. I made some general comments about the larger context of Big Data, presented results of our performance tests running Hadoop virtualized on the vSphere platform and then ended with some comments about the role of interconnects for Big Data.

 

Milind Bhandaskar, EMC/Greenplum Chief Engineer, spoke next about how applications drive system development, specifically in the areas of machine learning, analytics and reporting as well as visualization for BigData. He also explained the phases seen in Data Sciences workloads: Obtain, Scrub, Explore, Model, Interpret.

The final panel speaker was Sumanta Chatterjee, VP Server Technologies at Oracle who spoke about Oracle's view of a BigData appliance built around an Acquire -> Organize -> Analyze workflow that uses Oracle Exa* products. He then spoke about specific issues with respect to BigData and bandwidth.

425 Views 0 Comments Permalink Tags: performance, vsphere, hpc, hadoop, rdma, bigdata
0

I sat down for an interview with William Wallace from insideHPC at the OpenFabrics Alliance User and Developer Workshop in Monterey yesterday in which we talked about low latency and RDMA for the Enterprise and for HPC and the importance of Big Data to VMware. The video (running time 10:13) is here.

319 Views 0 Comments Permalink Tags: networking, esx, latency, vsphere, hpc, rdma, bigdata
0

bhavesh.pngBhavesh and I gave our talk [PDF] this morning at the OpenFabrics Alliance Developer and User Workshop here in Monterey. Actually, to be fair, I did a short introduction and then Bhavesh spent the bulk of our thirty-minute slot covering two topics. First, he presented several possible approaches to enabling RDMA for latency-sensitive applications on vSphere -- including what he called "the holy grail" -- a virtualized RDMA device that provides low-latency, high-bandwidth RDMA while maintaining the ability to use vMotion and Snapshots. In his view, this vRDMA option (Option F in the slide deck) is the most compelling and interesting for VMware and is the one he will be prototyping within the Office of th CTO over the coming months.

 

The second half of Bhavesh's talk focused on the potential benefits of using RDMA within VMware's virtualization platform to accelerate some of our important hypervisor services -- for example, vMotion and FT. The slide deck includes experimental results that demonstrate vMotion operations can be completed much more quickly and with far lower CPU utilization by using RDMA.

 

In my introduction, I showed a few graphs from our upcoming paper, RDMA Performance in Virtual Machines using QDR InfiniBand on VMware vSphere 5 that showed we can use passthrough mode (VM DirectPath I/O) to deliver 1.75us half ping-pong latencies for Send and 3us for RDMA Read using polling completions and 7.6us using interrpt completions for RDMA Read.

 

insideHPC taped our session so I will post a link once it is available.

 

 

601 Views 0 Comments Permalink Tags: performance, vmotion, esx, infiniband, latency, ft, hpc, rdma, roce
0

Bhavesh and I are packing our bags and heading to Monterey this weekend for the kickoff of the OpenFabrics Alliance User and Developer Workshop, which to my mind is the venue for discussing all aspects of RDMA across multiple transport technologies and multiple OS and virtualization platforms. As I mentioned in an earlier post, Bhavesh and I will be presenting our experiences with RDMA in a talk titled RDMA on vSphere: Update and Future Directions.

 

ibperf.png

I plan to give a quick report on the InfiniBand bandwidth and latency numbers we measured last summer on vSphere (teaser above) and then hand it over to Bhavesh, who will talk in detail about his experiences with hypervisor-level RDMA and also what he's been looking at relative to guest-level, virtual RDMA.

 

I'm also particularly interested to hear several talks listed in the agenda. Specifically, Live Migration With SR-IOV Supported InfiniBand by Wei Lin from Oracle, Microsoft SMB 2.2 - Running Over RDMA in Windows 8, by Tom Talpey from Microsoft, iWARP Learnings and Best Practices by Michael Feen from Penn State, Hadoop/Big Data-based Clusters with FDR InfiniBand by Apurva Desai from EMC, and OFED Use in the Financial Industry by Christoph Lameter.

 

I hope to do at least one blog entry from the conference , so stay tuned.

1,205 Views 0 Comments Permalink Tags: performance, esxi, infiniband, latency, hpc, rdma, roce, iwarp, bigdata
0

In a recent blog post, I offered Tuning tips to run low-latency applications in virtual machines. The blog post and the Best Practices white paper highlighted that by following certain tuning tips, a broad array of I/O latency sensitive applications which were historically difficult to virtualize due to overheads and added latency can now be successfully deployed on VMware vSphere 5.0.

 

I'm excited to annouce that my colleage Shilpi Agarwal, who manages the vSphere Network Performance Engineering team, just published a companion white paper that examines Network I/O Latency on VMware vSphere 5.

 

The paper is an excellent and highly recommended read to learn about the exact sources of latency overhead in the vSphere network virtualization stack. It also includes specific ping and netperf (common networking benchmarks) numbers comparing latencies achievable on physical non-virtualized environments versus vSphere VMs on the same hardware.

 

The paper compares latencies between a VM on an ESXi host connected to a non-virtualized physical host, two VMs on the same ESXi host connected across a vSwitch, as well as two VMs on two ESXi hosts using DirectPath I/O. It also measures the impact of CPU and network resource contention on latency in vSphere 5.0.

 

This white paper, along with the previously published Best Practices for Performance Tuning white paper, and the soon to be published research note summarizing our QDR InfiniBand performance study using DirectPath I/O on vSphere 5, all collectively demonstrate how VMware vSphere 5.0 is finally a hypervisor platform that's viable for a significantly increased number of latency-sensitive applications in a virtualized environment.

275 Views 0 Comments Permalink Tags: performance, networking, esxi, virtualization, latency, vsphere
0

I'll be teaching a course titled Virtualization and High Performance Computing at the Eighth International Summer School on Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems (ACACES) the week of July 9th in Fiuggi, Italy. Full details on the summer school are here.

 

Here is the abstract for the class:

 

"System-level virtualization is widely used in commercial enterprise environments -- close to half of the workloads running in the world today run within virtual machines. And yet virtualization has not to date played any significant role in High Performance Computing despite the fact that virtualization technology can offer some unique advantages for HPC.   While the rise of cloud computing and its promise of computing on demand has increased the HPC community’s interest in virtualization (a key cloud enabler), there is considerable confusion about both the benefits and challenges of using virtualization for HPC.  In this course we will examine the essential characteristics of HPC workloads and then, using this as a lens, we will look in detail at system-level virtualization to understand both the benefits a modern virtual platform can offer for High Performance Computing as well as the challenges that must be overcome.  We will also examine which types of workloads virtualize well today from a performance perspective and which do not, and we will discuss the prospects for future performance improvements over the next few years."

193 Views 0 Comments Permalink Tags: virtualization, hpc
0

I'm looking forward to attending the OpenFabrics Alliance annual workshop again this year. It's being held in Monterey, California March 25-28, 2012. Details are here.

 

Last year was the first time I attended the workshop. I talked about VMware's interest and requirements for using RDMA in a virtualized environment and I found the atmosphere very conducive to good conversations with fellow attendees. I found everyone very welcoming and interested in what VMware wants to accomplish. To me, this is the venue that represents the locus of activity around low-latency, low-overhead communication.

 

This year my colleague Bhavesh Davda (also from the Office of the CTO) will be attending as well. We will present performance results for our initial InfiniBand work and also discuss issues related to building a virtualized RDMA device, a project that Bhavesh is running within VMware. We've made significant progress since last year and I'm eager to share the results.

 

In addition to presenting what we've been doing around RDMA, I will also be participating on a Big Data panel session with the working title "Big Data – Shared or Unified, Local or Remote Access".

168 Views 0 Comments Permalink Tags: performance, infiniband, latency, hpc, rdma, roce, iwarp
More

HTML

HTML

Actions

Notifications

HTML

Communities