Welcome!

SDN Journal Authors: Elad Yoran, Elizabeth White, David Honan, Liz McMillan, Pat Romanski

Related Topics: Cloud Expo, SOA & WOA, Open Source, Virtualization, Apache, Big Data Journal, SDN Journal

Cloud Expo: Article

Skytap Offers Ready-to-Go Cloudera Hadoop

Skytap expects the widgetry to be used for Hadoop experimentation

Skytap, the cloud platform that offers virtual lab automation as a service, now has pre-configured Cloudera Hadoop (CDH4) templates in its library that can be used to spin up and manage physical or virtual clusters of up to 50 Hadoop nodes.

With complexity removed, the company claims a 10-node system should take no more than 10 minutes to deploy. In Cloudera system one node is always dedicated to Cloudera Manager.

The templates eliminate the time required to manually download, install, configure and network all of the required software and hardware components together.

Skytap expects the widgetry to be used for Hadoop experimentation or to test and develop prototypes and proofs-of-concept for Big Data offerings. It says if users go production they should use their own infrastructure.

It requires a Skytap subscription. Users will be charged by the number of concurrent VMs deployed and the amount of time they spend on the Skytap infrastructure. The Cloudera edition on offer is the freebie one.

Skytap's templates let users create, spin up, suspend, save and tear down Hadoop clusters of various sizes on-demand. They're supposed to provide complete multi-machine, multi-networked environments, and enable users with fast, remote access and root-level control of all virtual machines.

Using its multi-VPN capability, Skytap customers can create a secure hybrid cloud and run Hadoop cluster nodes on-premise and on the Skytap Cloud. That way they can scale Hadoop clusters when large datasets need additional compute, memory, storage and network resources.

The hybrid Hadoop clusters can be managed as a single, unified cluster on-premise or from within the Skytap Cloud. Users should get increased speed and flexibility and avoid the cost of purchasing additional on-premise hardware to meet peak scaling needs.

Skytap is offering a base Cloudera CDH4 Hadoop multi-node cluster template plus an additional single-node template to add more nodes. It says the basic configuration is good for HDFS, MapReduce, Hue, Oozie and ZooKeeper.

Hadoop clusters within the Skytap Cloud benefit from Skytap's SmartClient, AutoNetworks, CloudControl and SmartShare technologies.

The hardware starts with a single processor and 1GB of memory and goes to eight processors and 32GB of memory. Two processors and 2GB of memory can also be had. The stuff runs on Ubuntu 12.0.4 and is already networked.

Skytap subscriptions can run $500 a month to $100,000 a month plus time on the system. The company says 10-50 nodes or 20-100 concurrent VMs should provide a "reasonable Hadoop system."

More Stories By Maureen O'Gara

Maureen O'Gara the most read technology reporter for the past 20 years, is the Cloud Computing and Virtualization News Desk editor of SYS-CON Media. She is the publisher of famous "Billygrams" and the editor-in-chief of "Client/Server News" for more than a decade. One of the most respected technology reporters in the business, Maureen can be reached by email at maureen(at)sys-con.com or paperboy(at)g2news.com, and by phone at 516 759-7025. Twitter: @MaureenOGara

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


Cloud Expo Breaking News
In an ideal developer/systems administrator’s world, most applications would deploy seamlessly to multiple platforms and scale elastically with minimal effort bringing the unprecedented agility of the cloud within immediate reach of developer teams and IT organizations. OpenStack, a RackSpace and NASA initiative, is now managed by an independent foundation and is supported by multiple vendors. It defines APIs for compute, storage, networking, services, monitoring, and additional infrastructure...
Companies around the world are moving into on-premise private cloud environments. Many connect their private cloud to their public cloud service providers. In his session at 12th Cloud Expo | Cloud Expo New York [June 10-13], Brian Patrick Donaghy will talk about examples of what worked, what failed and why we should think about this evolution.
Enterprise cloud adoption revolves around pushing the BYOD movement and focusing on data security. In his session at the 12th International Cloud Expo, Ross Brouse, COO and President of Solar VPS, will cover how cloud adoption is driven by consumerism, humanity’s need to socialize, our addiction to new gadgets and the ability of data to stay secure in a growing collaborative world. The cloud is a drug and we’re just getting hooked. Ross Brouse is the COO and President of Solar VPS. He is a tr...
Organizations across the world are increasingly starting to see the benefits of moving more and more services to the cloud. The focus on the cost-saving potential of cloud is rapidly shifting to completely transforming the business with cloud. As organizations are investing enormous sums on technology they are starting to realize that in order to maximize the return on investment and accelerate the business transformation process the first area of focus should be people. By ensuring the organiza...
A recent study by analyst firm IDC reports that in 2012, 1.7 million cloud computing-related roles across the globe could not be filled due to the lack of training, certification and experience in the applicant pool. As the global demand for cloud and big data expertise increases, employers are finding it difficult to recruit talent, which is slowing down the ability for organizations to adopt, implement, and realize benefits from innovative platforms like OpenStack. In this session join Clo...
Enterprises can't close their doors just because integration tools won't cope with the volume of information that their systems produce. As each day goes by, their information will become larger and more complicated, and enterprises must constantly struggle to manage the integration of dozens (or hundreds) of systems. Apache Hadoop has quickly become the technology of choice for enterprises that need to perform complex analysis of petabytes of data, but few are aware of its potential to hand...
Our more interconnected planet is accelerating the adoption and convergence of next-generation architectures, in the form of cloud, mobile and instrumented physical assets. Organizations that can effectively balance optimization and innovation, will be in a position to leverage new systems of engagement, out maneuver their peers and achieve desired outcomes. In the Opening Keynote at 12th Cloud Expo | Cloud Expo New York, IBM GM & Next Generation Platform CTO Dr Danny Sabbah will detail the crit...
The cloud-enabled data center sits at the center of IT transformation. It facilitates the interconnection and communities that come together, propelling growth for both buyers and sellers. In his session at the 12th International Cloud Expo, Gerry Fassig, CoreSite’s Vice President of Sales, will discuss how CoreSite is bringing together best-of-breed partners through the Open Cloud Exchange resulting in public, private, and hybrid cloud interconnection and management as well as connectivity to...
Companies around the world are collecting massive amounts of data everyday that’s sitting around and not being utilized. Take for example the fact that companies collect demographic and location-based data via mobile devices all the time, but have to figure out how to monetize that data. In this session, Joyent CTO and founder Jason Hoffman will examine the state of Big Data, taking a look at what we're doing now to discussing what's on the horizon, as companies prepare and realign their busines...
The massive computing and storage resources that are needed to support big data applications make cloud environments an ideal fit. In Nati Shalom's upcoming session at 12th Cloud Expo | Cloud Expo New York [June 10-13, 2013], you'll learn how to build your big data "database on-demand" using MongoDB, Cassandra, Solr, MySQL, or any other big data solution, as well as manage your big data application using a new open source framework called “Cloudify.” All this, on top of the OpenStack cloud.