Welcome!

SDN Journal Authors: Destiny Bertucci, Liz McMillan, Pat Romanski, Elizabeth White, Amitabh Sinha

Blog Feed Post

SDN and the implications of a single system image

There are two major technology trends that have the potential to move the networking industry away from managing discrete devices and instead thinking of the network as a single distributed system. Software-defined networking (SDN) aspires to manage through centralized control, and DevOps aims to treat configuration more like code that is managed and deployed more as a collective than as individual device configuration.

The question is how to take devices explicitly designed to operate discretely and make them behave as a single cohesive unit.

What is a single system image?

There are a lot of technical requirements for the purists, but the basic idea behind a single system image is that a number of devices act in concert as part of a distributed system. In networking terms, this means that the network itself (or, more likely, domains within the network) can be provisioned, monitored, and troubleshot as a single cohesive entity.

In many regards, this is part of the objective behind solutions like Ethernet fabrics. If the datacenter network can be managed as a single logical switch, then behavior can be specified at a global level and then translated into device-specific behavior without needing to touch every switch in the datacenter.

The downside of a single system

The Internet is based on distributed, autonomous behavior. The entire design premise of networking is that devices acting locally can ultimately route around failures in the network. If a particular node fails, surrounding nodes can detect the failure, adjust their forwarding behavior, and distribute their updated information.

Of course, this is most useful in sprawling, heterogeneous networks whose domains span various control spaces. When the physical path traffic must take crosses between multiple networks, each owned by separate players, then the only way to make it all work is through this type of design. But the datacenter is a different beast entirely. It is typically contained within well-defined boundaries, and the network within is usually managed by not just a single corporate entity but also a single network team (forgetting for a moment that there are multiple device types that sometimes lead to a separation between network, security, and appliance teams).

Does the datacenter have a single system image today?

While the datacenter lends itself more easily to a single system image approach, datacenters do not operate this way today. Most datacenters are heterogeneous environments featuring kit from a variety of vendors. Even in single vendor deployments, the disparity between device types is usually prohibitive to those that would manage the network as a single entity.

Fabric-based solutions and virtual chassis implementations remove some of these barriers. But even in these cases, the boundary of the single system image can stretch only up to and not including the WAN. Whether the WAN is implemented using optical gear or MPLS-enabled routers, it creates a barrier beyond which control does not extend.

Can SDN help?

The entire premise of SDN is that control can be separated from forwarding. This separation is not so much about whether the control plane physically resides within the sheet metal that defines the box. Indeed, in virtually every modern networking device, the separation of control and forwarding already exists. But if that control plane is logically central and it encompasses a networking domain, then the set of boxes (or virtual devices) within that domain can be managed collectively.

But even here, SDN has limits. Most SDN efforts in the market today are incremental additions to devices designed under the traditional model of discrete management and behavior. So while you can centralize some aspects of management, not everything ends up being central. For example, you have to wait for OpenFlow 2.0 to get QOS support. The result is that deployments will end up being a hybrid with some centralized behavior and some distributed behavior.

While this is a useful step, it falls short of treating the network as a single system image. You get the benefits of centralized control to help make more intelligent control decisions, but you still carry the burden of discrete device management for the bulk of what makes a typical piece of networking equipment function.

What about adding DevOps?

DevOps (or NetOps for the more specific networking version) is all about treating configuration more like traditional software projects treat code. Conceptually, if you imagine that individual device configuration is analogous to a piece of code, the collection of all those configuration makes up the source code that drives network behavior. Using this model, configuration can be managed much like source code is managed. When provisioning changes are made, the configuration is compiled (built and tested), and then deployed to all the devices that are impacted.

In this model, you get a lot closer to treating the network as a single system image. The configuration of the network can be done as if the network is a single cohesive entity, and since DevOps is more about the configuration and less about the behavior, the issues of dealing with different device types are not as important (though you obviously need to account for different configuration syntaxes and device capabilities).

What is missing?

Even though you can centralize configuration management through DevOps, the devices themselves still act as autonomous entities. This creates some interesting failure scenarios. For example, if there are 100 devices in a network domain, you might use DevOps tools to deploy new configuration to those devices. But what determines a successful change?

If 99 out of the 100 devices successful commit the configuration change but one does not, what is the desired behavior? Imagine the case where configuration changes are pushed to these 100 devices, but one device times out and does not yield a positive response. The likely resolution is to back out all of the changes across the other 99 devices and try again. I don’t want to suggest this is not manageable, but the failure scenarios get interesting.

For a timeout-related failure, the problem is that some operators will deal with the behavior by adjusting timeout parameters. If you set the timeout to some arbitrarily high value, what happens in the interim between issuing a configuration change and flagging the timeout? During those seconds or minutes, the network has 99 entities acting under one set of rules and the other acting under the previous set. These types of situations can be difficult to identify, and depending on the changes can be service impacting for applications and users on the network.

What is needed?

Ultimately, if you want your network to behave as a single system image (not just from a configuration management perspective but also from a behavior perspective), we likely need to look at how we develop networking equipment. Making things that are not one appear as if they are gives us the illusion of cohesiveness. For many cases, this can be an extremely powerful thing. But when you start looking at end-to-end service deployments across multiple domains (a single Layer-2 domain stretching across two datacenters using an interconnect network in between, for instance), it’s the details that will matter. Even slight mismatches in policy across multiple domains can create resource islands in the datacenter, which defeats the entire purpose of moving to distributed architectures.

The bottom line

SDN and DevOps are absolutely critical additions to the networker’s arsenal. They help to alleviate some of the issues that stem from the distributed nature of networking. But simply adding layers on top of the network doesn’t change the fundamental architecture on top of which applications reside. If applications continue their move towards scale-out architectures (Big Data, clustered compute, clustered storage, and the list goes on), the underlying network infrastructure will have to undergo similar transformations. How tightly we cling to legacy constructs will determine how quickly we move to something more capable of operating in conjunction with the new breed of applications.

[Today’s fun fact: A bloodhound can track a man by smell for up to 100 miles. Seriously, how does anyone know this? Do they just sit down and rest at mile 101?]

The post SDN and the implications of a single system image appeared first on Plexxi.

Read the original blog entry...

More Stories By Michael Bushong

The best marketing efforts leverage deep technology understanding with a highly-approachable means of communicating. Plexxi's Vice President of Marketing Michael Bushong has acquired these skills having spent 12 years at Juniper Networks where he led product management, product strategy and product marketing organizations for Juniper's flagship operating system, Junos. Michael spent the last several years at Juniper leading their SDN efforts across both service provider and enterprise markets. Prior to Juniper, Michael spent time at database supplier Sybase, and ASIC design tool companies Synopsis and Magma Design Automation. Michael's undergraduate work at the University of California Berkeley in advanced fluid mechanics and heat transfer lend new meaning to the marketing phrase "This isn't rocket science."

@CloudExpo Stories
In his Opening Keynote at 21st Cloud Expo, John Considine, General Manager of IBM Cloud Infrastructure, led attendees through the exciting evolution of the cloud. He looked at this major disruption from the perspective of technology, business models, and what this means for enterprises of all sizes. John Considine is General Manager of Cloud Infrastructure Services at IBM. In that role he is responsible for leading IBM’s public cloud infrastructure including strategy, development, and offering m...
With tough new regulations coming to Europe on data privacy in May 2018, Calligo will explain why in reality the effect is global and transforms how you consider critical data. EU GDPR fundamentally rewrites the rules for cloud, Big Data and IoT. In his session at 21st Cloud Expo, Adam Ryan, Vice President and General Manager EMEA at Calligo, examined the regulations and provided insight on how it affects technology, challenges the established rules and will usher in new levels of diligence arou...
The past few years have brought a sea change in the way applications are architected, developed, and consumed—increasing both the complexity of testing and the business impact of software failures. How can software testing professionals keep pace with modern application delivery, given the trends that impact both architectures (cloud, microservices, and APIs) and processes (DevOps, agile, and continuous delivery)? This is where continuous testing comes in. D
Modern software design has fundamentally changed how we manage applications, causing many to turn to containers as the new virtual machine for resource management. As container adoption grows beyond stateless applications to stateful workloads, the need for persistent storage is foundational - something customers routinely cite as a top pain point. In his session at @DevOpsSummit at 21st Cloud Expo, Bill Borsari, Head of Systems Engineering at Datera, explored how organizations can reap the bene...
Digital transformation is about embracing digital technologies into a company's culture to better connect with its customers, automate processes, create better tools, enter new markets, etc. Such a transformation requires continuous orchestration across teams and an environment based on open collaboration and daily experiments. In his session at 21st Cloud Expo, Alex Casalboni, Technical (Cloud) Evangelist at Cloud Academy, explored and discussed the most urgent unsolved challenges to achieve f...
The dynamic nature of the cloud means that change is a constant when it comes to modern cloud-based infrastructure. Delivering modern applications to end users, therefore, is a constantly shifting challenge. Delivery automation helps IT Ops teams ensure that apps are providing an optimal end user experience over hybrid-cloud and multi-cloud environments, no matter what the current state of the infrastructure is. To employ a delivery automation strategy that reflects your business rules, making r...
The 22nd International Cloud Expo | 1st DXWorld Expo has announced that its Call for Papers is open. Cloud Expo | DXWorld Expo, to be held June 5-7, 2018, at the Javits Center in New York, NY, brings together Cloud Computing, Digital Transformation, Big Data, Internet of Things, DevOps, Machine Learning and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding busin...
In a recent survey, Sumo Logic surveyed 1,500 customers who employ cloud services such as Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP). According to the survey, a quarter of the respondents have already deployed Docker containers and nearly as many (23 percent) are employing the AWS Lambda serverless computing framework. It’s clear: serverless is here to stay. The adoption does come with some needed changes, within both application development and operations. Tha...
SYS-CON Events announced today that Synametrics Technologies will exhibit at SYS-CON's 22nd International Cloud Expo®, which will take place on June 5-7, 2018, at the Javits Center in New York, NY. Synametrics Technologies is a privately held company based in Plainsboro, New Jersey that has been providing solutions for the developer community since 1997. Based on the success of its initial product offerings such as WinSQL, Xeams, SynaMan and Syncrify, Synametrics continues to create and hone in...
Smart cities have the potential to change our lives at so many levels for citizens: less pollution, reduced parking obstacles, better health, education and more energy savings. Real-time data streaming and the Internet of Things (IoT) possess the power to turn this vision into a reality. However, most organizations today are building their data infrastructure to focus solely on addressing immediate business needs vs. a platform capable of quickly adapting emerging technologies to address future ...
In his general session at 21st Cloud Expo, Greg Dumas, Calligo’s Vice President and G.M. of US operations, discussed the new Global Data Protection Regulation and how Calligo can help business stay compliant in digitally globalized world. Greg Dumas is Calligo's Vice President and G.M. of US operations. Calligo is an established service provider that provides an innovative platform for trusted cloud solutions. Calligo’s customers are typically most concerned about GDPR compliance, application p...
Kubernetes is an open source system for automating deployment, scaling, and management of containerized applications. Kubernetes was originally built by Google, leveraging years of experience with managing container workloads, and is now a Cloud Native Compute Foundation (CNCF) project. Kubernetes has been widely adopted by the community, supported on all major public and private cloud providers, and is gaining rapid adoption in enterprises. However, Kubernetes may seem intimidating and complex ...
In his session at 21st Cloud Expo, Michael Burley, a Senior Business Development Executive in IT Services at NetApp, described how NetApp designed a three-year program of work to migrate 25PB of a major telco's enterprise data to a new STaaS platform, and then secured a long-term contract to manage and operate the platform. This significant program blended the best of NetApp’s solutions and services capabilities to enable this telco’s successful adoption of private cloud storage and launching ...
You know you need the cloud, but you’re hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You’re looking at private cloud solutions based on hyperconverged infrastructure, but you’re concerned with the limits inherent in those technologies.
Nordstrom is transforming the way that they do business and the cloud is the key to enabling speed and hyper personalized customer experiences. In his session at 21st Cloud Expo, Ken Schow, VP of Engineering at Nordstrom, discussed some of the key learnings and common pitfalls of large enterprises moving to the cloud. This includes strategies around choosing a cloud provider(s), architecture, and lessons learned. In addition, he covered some of the best practices for structured team migration an...
Most technology leaders, contemporary and from the hardware era, are reshaping their businesses to do software. They hope to capture value from emerging technologies such as IoT, SDN, and AI. Ultimately, irrespective of the vertical, it is about deriving value from independent software applications participating in an ecosystem as one comprehensive solution. In his session at @ThingsExpo, Kausik Sridhar, founder and CTO of Pulzze Systems, discussed how given the magnitude of today's application ...
The “Digital Era” is forcing us to engage with new methods to build, operate and maintain applications. This transformation also implies an evolution to more and more intelligent applications to better engage with the customers, while creating significant market differentiators. In both cases, the cloud has become a key enabler to embrace this digital revolution. So, moving to the cloud is no longer the question; the new questions are HOW and WHEN. To make this equation even more complex, most ...
As you move to the cloud, your network should be efficient, secure, and easy to manage. An enterprise adopting a hybrid or public cloud needs systems and tools that provide: Agility: ability to deliver applications and services faster, even in complex hybrid environments Easier manageability: enable reliable connectivity with complete oversight as the data center network evolves Greater efficiency: eliminate wasted effort while reducing errors and optimize asset utilization Security: imple...
Mobile device usage has increased exponentially during the past several years, as consumers rely on handhelds for everything from news and weather to banking and purchases. What can we expect in the next few years? The way in which we interact with our devices will fundamentally change, as businesses leverage Artificial Intelligence. We already see this taking shape as businesses leverage AI for cost savings and customer responsiveness. This trend will continue, as AI is used for more sophistica...
In his session at 21st Cloud Expo, Raju Shreewastava, founder of Big Data Trunk, provided a fun and simple way to introduce Machine Leaning to anyone and everyone. He solved a machine learning problem and demonstrated an easy way to be able to do machine learning without even coding. Raju Shreewastava is the founder of Big Data Trunk (www.BigDataTrunk.com), a Big Data Training and consulting firm with offices in the United States. He previously led the data warehouse/business intelligence and B...