Virtualization is not ‘The Answer’ for Clouds

There is a myth going around about virtualization and cloud computing. It’s expressed in a variety of ways, but the takeaway is always the same: “Public clouds are big virtual server clouds.” Sounds good, but untrue once you look under the covers. For good reason, since virtualization isn’t a panacea.

Here’s the deal. Public clouds (IaaS, PaaS, or SaaS) are all multi-tenant. It’s a fundamental definition. Multi-tenancy is one of the core properties of any cloud. Whether it’s a cloud like ours (GoGrid), EC2, Google App Engine (GAE), or Salesforce.COM (SFDC). Multi-tenancy is a mechanism whereby public cloud providers give customers cost efficiencies by aggregating capex at scale and providing it as a subscription service.

    Virtualization is just a multi-tenancy strategy.

Virtualization as Multi-Tenant Solution
That’s right. It’s only a multi-tenancy strategy. Not all clouds will use virtualization. Clouds like GAE and SFDC use completely different technologies to create multi-tenancy, but even for strict compute clouds, folks like AppNexus surface physical hardware that customers then carve up themselves into virtual machines. While others, like NewServers, serve up completely physical clouds. For those folks their multi-tenant strategy is more coarse, based simply on a single piece of physical hardware.

Scaling Up Still Matters
Simply put, for the foreseeable future there are many pieces of software that do better scaling ‘up’ versus ‘out’. For example, your traditional RDBMS is much easier to scale by throwing physical iron (instead of virtual) at the problem.

A well known Web 2.0 company recently expressed to me that they are running with 100s of thousands of customers on big database servers with 128GB of RAM and lots of high speed disk spindles. This is one of the poster children of the Web 2.0 movement. If they can scale out their RDBMS by simply throwing iron at it, why would they re-architect into (for example) 10 extra large EC2 instances and deal with the engineering effort involved with a heavily sharded database?

To put this in perspective, you could do this:

  • 10 extra large EC2 instances
    • 16GB RAM each
    • ~8 EBS network-based storage devices
    • 2 cores each
    • ~$6000/month including storage
    • $ X to engineer for sharding at application level

OR:

  • 2 redundant big iron physical servers
    • 128GB RAM each
    • 16 high-speed spindles on local disk
    • 8-12 cores each
    • $40,000 in capex or ~7,500/month for servers+storage

Conclusion
It’s kind of a no brainer. For certain use cases it’s more economical to scale using bigger hardware. There are two key reasons why this won’t change in the near future. The first is that many folks are working hard to make database software scale better across more cores. The second is that we’ll be at 16 and 32 cores per 1U server in the not so distant future. Scaling up will continue to be a viable option for the future. Period. Clouds need to enable this in the same way they enable virtualized servers for scaling out. It’s not an either/or proposition.



Update: The ‘well known’ Web 2.0 company I mentioned has informed me that my estimate on dedicated hardware was far too high. Something around $5,000 for those servers is more accurate, meaning there is even less reason to consider scale-out as an option.

3 comments December 17th, 2008

Must Read on “The” Cloud

I don’t often post simple links or even lists of interesting links. I prefer to try and provide more infrequent, but original content. I’m going to make an exception today because I want to point you to a really fantastic article by MasterMark:

    The Enterprise Cloud

Fair warning, this is a very long blog entry, but it really touches on a number of different thoughts and memes that have been wandering through the cloud computing community’s thinking for a while now. In particular, I think there are some very cogent observations about what is ‘the cloud’ vs. ‘cloud computing’ and how the enterprise fits in. As an added bonus you get a pretty deep dive into VPN-Cubed from CohesiveFT.

I consider this one a must read folks!

1 comment December 17th, 2008

Cloud-Oriented Architectures Coming of Age?

Cloud-Oriented Architectures (COA) mentioned briefly before by me look to be coming of age with this latest announcement from Force.com and Google App Engine.

Hopefully this will help get the message across. Building your entire application stack is soooo 1999…

3 comments December 8th, 2008

ZFS to go GPL?

Word on the street is that Sun is exploring the option of going GPL at the CEO level. They have been chatting with customer council(s) about whether they should do this.

The answer is simple: Do it. Do it now.

ZFS has inspired many. There are a number of up and coming new filesystem and large scale storage solutions being developed under the GPL like Btrfs, Tux3, and Ceph that all have similar characteristics to ZFS.

If you don’t GPL it now, you’ll lose the opportunity to make the kind of difference you are trying to make.

Add comment December 6th, 2008

What Happened to Vertebra?

Earlier this year during the 2008 Velocity Conference EngineYard made some fairly big waves (Google search) pre-announcing their open source cloud management tool, Vertebra.

This was of particular interest to me because of the CloudScale Project. We spent a considerable amount of time building a sysadmin messaging bus, but were never 100% happy with it. Vertebra looked like just the ticket, it would be open, and hopefully widely embraced, perhaps making our lives easier.

At the time, the EngineYard (EY) folks promised ‘right after summer’. With December literally right around the corner, you’ve got to wonder whether this vaporware will ever really show up. Given the recent layoff of the Rubinius team from EngineYard it seems logical to conclude that their management realizes that too many over-funded R&D projects aren’t necessarily going to be in the companies long term best interests.

Merb made it out the gate thankfully, but will Vertebra? There hasn’t been a single public release since it was pre-announced almost 6 months ago so I’m going to hazard a guess that we’ll either never see it or at least not for a long time. Maybe Velocity Conference 2009??

Come on, EngineYard, just get it out there, warts and all. Either it will gather some more support and live or it won’t.



Update: EY just released Vertebra today (2008/12/19). Find out more here.

5 comments November 30th, 2008

Next Posts Previous Posts


Strategy without tactics is the slowest route to victory. Tactics without strategy is the noise before defeat. -- Sun Tzu

Categories

Of Interest

Feeds & Misc

    Posts by Month

    Posts by Category