May 182013
 

I had a recent conversation with my friend, Joe (@virtacit), and my ValCo Labs brother, Josh (@joshcoen), about the different access methods for EMC’s VPlex. Now, if you aren’t familiar with VPlex, EMC is doing some amazing things with storage virtualization. VPlex allows you to take one (or many) storage arrays (EMC or otherwise) and present the storage to VPlex, which in turn presents the storage to hosts as virtual volumes. This abstraction leads to some pretty neat storage availability configurations. One such configuration, and the basis for our discussion, was VPlex in a vSphere Stretch Metro Cluster.

There are many resources that discuss vMSC. A few can be found here:

http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2007545

http://www.emc.com/collateral/software/white-papers/h11065-vplex-with-vmware-ft-ha.pdf

The purpose of this article is to discuss Uniform and Non-Uniform storage access, not recreate the work they have done. After reviewing the documents linked above, I was left scratching my head.

Ok, the way VMware defines it (see VMware link about)

  • Uniform

Uniform Host Access (Cross-Connect) – This deployment involves establishing a front-end SAN across the two sites, so that the hosts at one site can see the storage cluster at the same site as well as the other site.

  • Non-Uniform

Non-uniform Host Access – This type of deployment involves the hosts at either site seeing the storage volumes through the same site storage cluster only.

Now, the way EMC defines it (see EMC link above)

  • Uniform

Uniform access is typically based on active/passive technology where all I/O is serviced by only 50% of the available storage controllers in the same physical location (i.e. 50% of the controllers are passive); therefore, all I/O is sent to or received from the same location where the active controller resides, hence the term “uniform”. Typically this involves “stretching” dual controller active/passive mid-range storage products, but can also be architected by using legacy active/passive replication. In both cases the use of an ISL is typically required so all hosts can access the active storage controllers at the remote location. These two types of uniform access are known as “split cluster” and “replication” uniform access respectively.

clip_image001

Figure 10 from the EMC Article located above.

  • Non-Uniform

I/O can be serviced by any available storage controller (100%) at any given location; therefore I/O can be sent to or received from any storage target location, hence the term “nonuniform”. This is derived from “distributing” multiple active controllers/directors in each location and does not require an ISL (although an ISL can be optionally deployed).

clip_image002

Figure 11 from the EMC Article located above.

To me it appeared that VMware and EMC were describing the technology the same, but that their definitions of Uniform and Non-Uniform were reversed. Not a huge deal, but confusing when trying to explain why there were different.

Moment of Clarity

After reading (and re-reading) Duncan Epping’s blog post on the subject, (http://www.yellow-bricks.com/2012/11/13/vsphere-metro-storage-cluster-uniform-vs-non-uniform/) it started to make sense. The confusion came from VMware’s article mentioning “Uniform (cross-connect)” and it’s reference to VPlex.

Although VPlex can be configured to provide uniform access, where you see the most benefit is from non-uniform access. In fact, using uniform access would remove many of its benefits in a vMSC. Uniform access implies that we have an active/passive site configuration or storage replication technology. With VPlex, however, all controllers are active at both sites, and when configured for non-uniformed (cross-connect), all host paths from a host in site A, to its local VPlex front-end ports, as well as the paths to the remote sites VPlex front-end ports are active.

Non-Uniform (VPlex IO access pattern)

VPlex is ideally configured in (2) modes, non-uniform, and non-uniform (cross-connect).

  • Non-Uniform
    • Hosts at SiteA are zoned ONLY to the SiteA VPlex cluster
    • Hosts at SiteB are zoned ONLY to the SiteB VPlex cluster
    • ALL write activity of a distributed volume (DVol) traverses the VPlex WANCOM
  • Non-Uniform (cross-connect)
    • Host at SiteA are zoned to SiteA AND SiteB VPlex clusters
    • Host at SiteB are zoned to SiteA AND SiteB VPlex clusters
    • Write activity COULD be serviced by either VPlex cluster (See note below)

NOTE: Although technically the write activity in a VPlex non-uniform cross-connected configuration shows all paths as active, VMware/EMC recommended a ‘FIXED’ multipathing policy for these devices to prevent writes to the non-local VPlex cluster. This makes sense if you think about the write process of the host.

Host Write Penalty Using Non-local Path

From the diagram below you can see that if all paths are active, and a write is sent from a host in SiteA to the remote VPlex cluster (non-preferred), the write has to be mirrored back to the local cluster. (2) writes must be sent across the inter-site link. (1) write across the host WAN connectivity, and (1) write across the VPlex WANCom. If writes are only sent to the local (preferred) cluster, only (1) write needs to be sent to the remote cluster.

clip_image003

clip_image004

What about Uniform and VPlex

As I mentioned previously, VPlex is typically configured in a non-uniform configuration. There are two times where Uniform may be seen.

  • Forced Uniform (Failure of WANCom/VPlex Cluster Partition)

During a WANCom partition, certain rules (or witness election) are enacted to determine which site is the authoritative source for write. This is necessary to avoid a split brain condition, and ultimately corruption. In this state, storage is only accessible at the surviving site, and hosts at both sites (assuming they are cross-connected) MUST access storage from a single site. If the hosts are NOT cross-connected, manual intervention may be required depending on the version of VMware ESXi. Check out the failure scenarios table on page 38 or the EMC article above.

clip_image005

Figure 13 from the EMC Article located above.

  • Intentional (Uniform Configuration)

While technically it is possible, and there may be a specific use case, you could create a VPlex virtual volume at a single site, and forgo things like AccessAnywhere to make the volume available at the remote location. In this configuration, hosts at both sites could only access the storage at a single site.

Where is Uniform used?

I am not as familiar with other storage virtualization platforms, but can see the need for this configuration if the arrays at SiteA and SiteB are not truly active-active. In this case, hosts at SiteA, and SiteB would have to be ‘Uniform (cross-connected)’ to access the site’s storage that is read/write, while the other site is read-only until the site’s roles are switched. Check out this link for a better example. (http://www.yellow-bricks.com/2012/11/13/vsphere-metro-storage-cluster-uniform-vs-non-uniform/)

Questions?

- What happens to the remaining paths when selecting ‘FIXED’ as the multi-pathing policy? The remaining paths are all ‘active’ (just not active I/O). If there is a failure of the preferred path (local VPlex cluster), what stops a path to the remote VPlex cluster from being selected?

May 102013
 
PernixData Flash Virtualization Platform (FVP): The best idea you never had

PernixData, who recently came out of stealth mode, has introduced a product that will change the way you think about SSDs and server-side caching. PernixData was founded by Poojan Kumar (Oracle Exadata) and Satyam Vaghani (Mr. VMFS!), so lets just say PernixData knows a thing or two about storage. What is Flash Virtualization Platform (FVP)? [...]

Apr 212013
 
Setting Perennial Reservations for MSCS with EMC CX/VNX Arrays

A few months ago I was putting together a technical solution to support a vSphere upgrade (physical hosts and vSphere version) and a SAN upgrade, As part of the upgrade, the customer wanted to virtualize their file services as well. The customer wanted the same level of availability and redundancy in the virtual environment that [...]

Apr 102013
 
Jumbo Frames and Multi-NIC vMotion Performance over 10Gbe - Part 2

A week ago I wrote a post as sort of a follow-on to Chris Wahl’s post on performance gains (or lack thereof) using jumbo frames for multi-NIC vMotion. A fellow blogger, Josh Odgers (blog / twitter), posted a comment stating more testing with the same workloads across tests would be interesting to see. In my [...]

Apr 032013
 
Jumbo Frames and Multi-NIC vMotion Performance over 10Gbe

Chris Wahl recently wrote a blog post titled Do Jumbo Frames Improve vMotion Performance? Chris was doing these tests using a 1Gbe network and the results indicated that you get slightly better performance by NOT using jumbo frames. There were a few comments and questions regarding what this would look like if you had a [...]

Mar 082013
 
[Book Review] VCDX Boot Camp: Preparing for the VCDX Panel Defense (Rough Cuts version)

Since I plan on submitting my VCDX application in time for the VMworld 2013 deadline, I thought it would be a good idea to take a look at the VCDX boot camp book. While the book has not been officially released, there is a “rough cuts” version of it available right now via Safari Books [...]

Feb 252013
 
My Mind…nowhere to be found

  This is an atypical post for me as normally my posts are purely technical. There are days where I start working in my lab and I’ll ask myself the question “why are you doing this?”…I’ll ask “why are you spending all this time in your lab constantly trying to learn new products"?”. Before I [...]