Welcome!

SDN Journal Authors: John Walsh, Elizabeth White, Liz McMillan, Sven Olav Lund, Simon Hill

Related Topics: @CloudExpo, Microservices Expo, Containers Expo Blog, Cloud Security, @DXWorldExpo, SDN Journal

@CloudExpo: Article

Your Cloud Availability: 98%, 99.99% or 99.9999%?

How many companies are hosting critical production systems in the cloud?

I recently had a discussion with a few very interesting cloud professionals from CloudEndure about the changes that the cloud and AWS in particular, bring to the table in terms of business continuity and high availability. In one of my discussions with Ofer Gadish, the company's CEO, we exchanged information regarding the current state of the market, including tools, terms and relevant methodologies for real cloud users. We both agreed that in this market stage business continuity in the cloud is a very interesting subject since the market is still young and the traditional tools and perceptions don't apply.

How many companies are hosting critical production systems in the cloud? Where exactly is the fine line between the shared responsibilities of the cloud consumer and the cloud provider?

While we saw eye-to-eye on many topics, many questions arose in regards to specifics on the topic of cloud market maturity. In order to answer these questions and expand our cloud communities' knowledge, CloudEndure decided to run a survey to learn how the cloud has changed DR, Backup and High Availability.  And we'll share the results with you. Before continue on reading, let me invite you to take only three minutes of your time and support our research - Start here

What's Downtime?
We discussed my belief that the nines (99.99...%) are more marketing than the real deal today - What do you think? The market and tools used to measure your uptime immature or do not really exist. The concept of availability in the cloud is determined by the level of responsibility and liability that the vendors have for their customers. These notions include the ability to monitor, proactively fix and maintain continuous communication with users, giving them clear and genuine visibility into what exactly is going on and when the system is expected to return to normal. It is also necessary, here, to define the concept of downtime. At a very basic level, it is when the system is not available. However, the more precise answer depends on the criticality of specific features and components of an application or service.

On that note, we are interested in hearing your point of view. First and foremost, it is essential to ask, what do you consider "downtime"? How do you approach the matter of downtime with your customers? How do you compensate, if at all? Maybe even before discussing compensation, let us ask how or if you measure downtime? If so, are you able to calculate your own availability over the past week, month, or year?  With your help, we can gain even greater insights on the topic. Start here

RTO and RPO - Nothing Really Changes


Definitions first:

Recovery Time Objective (RTO) is the duration of time and a service level within which a business process must be restored after a disaster (or disruption) in order to avoid unacceptable consequences associated with a break in business continuity. Wikipedia

Recovery Point Objective (RPO) It is the maximum tolerable period in which data might be lost from an IT service due to a major incident Wikipedia


The cloud changes the ways in which you achieve optimal levels of RPO and RTO. There are much more creative ways now-a-days to deliver robust solutions at a very low cost.  Service degrades and downtimes can occur as a result of a bug, application scalability limitations, human error or even your cloud operator's availability. In any case you, as the service provider, have the responsibility to get it up and running again. Keeping the application, and its actual business values, in mind, you can address both RPO and RTO and make sure that you define and measure these metrics.

DR Cost Impacts and ROI
When you're an online retailer, every moment that your website-relevant functionalities are down, you lose money. Your business continuity should be prefect including accurate measuring capabilities for your site's uptime performance as well as ensuring there are no latencies or degrades that directly affect your business.

"...breaks down to about $120,000 per minute. Multiply that by the 40 minutes Amazon went down and you get around $4.8 million in lost dollars." Geekwire release following Amazon downtime on Aug-2013

A DR policy and rate is based on the required SLA; however good DR deployment on the cloud should be based on the central, pay-per-use cloud principle. You can actually create a spare cloud replica without the traditional upkeep costs if it is not utilized. Today, cloud users can avoid traditional costs by using an "On-demand DR" inline with changing their DR strategy and architecture.

Do you know your monthly DR costs? What's your estimated cost of downtime?

Final words
Over the past year, the "average enterprise" began its journey into the cloud, yet still views the cloud as an on-premise extension used as a means to generate replications and deal with less significant data. I believe that the sensible next step for companies that have already made this strategic move should be to move their production and more critical substance to the cloud. This would raise the bar and present the greater challenge of providing proven robust delivery of online services. Little by little, the cloud is making its presence and advantages known in a number of aspects of our daily lives. It's up to us to embrace it.

Calling All CIO/CTO/IT Leaders and team members: To see how your disaster recovery and backup strategy compares to your peers, participate in CloudEndure's  survey. Participants can win a new iPad! ;)

cross posted on iamondemand.com

More Stories By Ofir Nachmani

Ofir Nachmani is a Cloud Computing Evangelist, Blogger and Lecturer at IAmOnDemand.com. He has extensive experience helping ISV companies with cloud adoption and management. Today, Ofir is a Senior Vice President and Chief Evangelist at Newvem Analytics Ltd. Prior to these he led ClickSoftware’s On-Demand initiative and established the company cloud offering. On his previous adventure, he led ClickSoftware’s On-Demand (SaaS) initiative and also held several positions at Zarathustra SaaS development VP of product and company CEO. In 2009, ClickSoftware acquired the AST group and Zarathustra as part of it. Twitter: @iamondemand

@CloudExpo Stories
"Cloud Academy is an enterprise training platform for the cloud, specifically public clouds. We offer guided learning experiences on AWS, Azure, Google Cloud and all the surrounding methodologies and technologies that you need to know and your teams need to know in order to leverage the full benefits of the cloud," explained Alex Brower, VP of Marketing at Cloud Academy, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clar...
In his session at 21st Cloud Expo, Carl J. Levine, Senior Technical Evangelist for NS1, will objectively discuss how DNS is used to solve Digital Transformation challenges in large SaaS applications, CDNs, AdTech platforms, and other demanding use cases. Carl J. Levine is the Senior Technical Evangelist for NS1. A veteran of the Internet Infrastructure space, he has over a decade of experience with startups, networking protocols and Internet infrastructure, combined with the unique ability to it...
The question before companies today is not whether to become intelligent, it’s a question of how and how fast. The key is to adopt and deploy an intelligent application strategy while simultaneously preparing to scale that intelligence. In her session at 21st Cloud Expo, Sangeeta Chakraborty, Chief Customer Officer at Ayasdi, provided a tactical framework to become a truly intelligent enterprise, including how to identify the right applications for AI, how to build a Center of Excellence to oper...
"IBM is really all in on blockchain. We take a look at sort of the history of blockchain ledger technologies. It started out with bitcoin, Ethereum, and IBM evaluated these particular blockchain technologies and found they were anonymous and permissionless and that many companies were looking for permissioned blockchain," stated René Bostic, Technical VP of the IBM Cloud Unit in North America, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Conventi...
Gemini is Yahoo’s native and search advertising platform. To ensure the quality of a complex distributed system that spans multiple products and components and across various desktop websites and mobile app and web experiences – both Yahoo owned and operated and third-party syndication (supply), with complex interaction with more than a billion users and numerous advertisers globally (demand) – it becomes imperative to automate a set of end-to-end tests 24x7 to detect bugs and regression. In th...
In his session at 21st Cloud Expo, James Henry, Co-CEO/CTO of Calgary Scientific Inc., introduced you to the challenges, solutions and benefits of training AI systems to solve visual problems with an emphasis on improving AIs with continuous training in the field. He explored applications in several industries and discussed technologies that allow the deployment of advanced visualization solutions to the cloud.
Agile has finally jumped the technology shark, expanding outside the software world. Enterprises are now increasingly adopting Agile practices across their organizations in order to successfully navigate the disruptive waters that threaten to drown them. In our quest for establishing change as a core competency in our organizations, this business-centric notion of Agile is an essential component of Agile Digital Transformation. In the years since the publication of the Agile Manifesto, the conn...
"MobiDev is a software development company and we do complex, custom software development for everybody from entrepreneurs to large enterprises," explained Alan Winters, U.S. Head of Business Development at MobiDev, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Large industrial manufacturing organizations are adopting the agile principles of cloud software companies. The industrial manufacturing development process has not scaled over time. Now that design CAD teams are geographically distributed, centralizing their work is key. With large multi-gigabyte projects, outdated tools have stifled industrial team agility, time-to-market milestones, and impacted P&L stakeholders.
"ZeroStack is a startup in Silicon Valley. We're solving a very interesting problem around bringing public cloud convenience with private cloud control for enterprises and mid-size companies," explained Kamesh Pemmaraju, VP of Product Management at ZeroStack, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Enterprises are adopting Kubernetes to accelerate the development and the delivery of cloud-native applications. However, sharing a Kubernetes cluster between members of the same team can be challenging. And, sharing clusters across multiple teams is even harder. Kubernetes offers several constructs to help implement segmentation and isolation. However, these primitives can be complex to understand and apply. As a result, it’s becoming common for enterprises to end up with several clusters. Thi...
"Space Monkey by Vivent Smart Home is a product that is a distributed cloud-based edge storage network. Vivent Smart Home, our parent company, is a smart home provider that places a lot of hard drives across homes in North America," explained JT Olds, Director of Engineering, and Brandon Crowfeather, Product Manager, at Vivint Smart Home, in this SYS-CON.tv interview at @ThingsExpo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
"Codigm is based on the cloud and we are here to explore marketing opportunities in America. Our mission is to make an ecosystem of the SW environment that anyone can understand, learn, teach, and develop the SW on the cloud," explained Sung Tae Ryu, CEO of Codigm, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
"Infoblox does DNS, DHCP and IP address management for not only enterprise networks but cloud networks as well. Customers are looking for a single platform that can extend not only in their private enterprise environment but private cloud, public cloud, tracking all the IP space and everything that is going on in that environment," explained Steve Salo, Principal Systems Engineer at Infoblox, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Conventio...
High-velocity engineering teams are applying not only continuous delivery processes, but also lessons in experimentation from established leaders like Amazon, Netflix, and Facebook. These companies have made experimentation a foundation for their release processes, allowing them to try out major feature releases and redesigns within smaller groups before making them broadly available. In his session at 21st Cloud Expo, Brian Lucas, Senior Staff Engineer at Optimizely, discussed how by using ne...
Vulnerability management is vital for large companies that need to secure containers across thousands of hosts, but many struggle to understand how exposed they are when they discover a new high security vulnerability. In his session at 21st Cloud Expo, John Morello, CTO of Twistlock, addressed this pressing concern by introducing the concept of the “Vulnerability Risk Tree API,” which brings all the data together in a simple REST endpoint, allowing companies to easily grasp the severity of the ...
While some developers care passionately about how data centers and clouds are architected, for most, it is only the end result that matters. To the majority of companies, technology exists to solve a business problem, and only delivers value when it is solving that problem. 2017 brings the mainstream adoption of containers for production workloads. In his session at 21st Cloud Expo, Ben McCormack, VP of Operations at Evernote, discussed how data centers of the future will be managed, how the p...
"NetApp is known as a data management leader but we do a lot more than just data management on-prem with the data centers of our customers. We're also big in the hybrid cloud," explained Wes Talbert, Principal Architect at NetApp, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Coca-Cola’s Google powered digital signage system lays the groundwork for a more valuable connection between Coke and its customers. Digital signs pair software with high-resolution displays so that a message can be changed instantly based on what the operator wants to communicate or sell. In their Day 3 Keynote at 21st Cloud Expo, Greg Chambers, Global Group Director, Digital Innovation, Coca-Cola, and Vidya Nagarajan, a Senior Product Manager at Google, discussed how from store operations and ...
"We're focused on how to get some of the attributes that you would expect from an Amazon, Azure, Google, and doing that on-prem. We believe today that you can actually get those types of things done with certain architectures available in the market today," explained Steve Conner, VP of Sales at Cloudistics, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.