Welcome!

SDN Journal Authors: Pat Romanski, Destiny Bertucci, Liz McMillan, Elizabeth White, Amitabh Sinha

Related Topics: SDN Journal, Containers Expo Blog, @CloudExpo

SDN Journal: Blog Post

Requirements for Data Center Networks By @MatMathews | @CloudExpo [#SDN]

It's important when understanding what is needed for a data center network, to think about all six of these functional areas

The Six Requirements for Data Center Networks

One way or another, all data center networks exhibit at least six different functional areas that their operators need to engineer, implement, and operate with a differing set of needs and requirements. Similarly, in one way or another, most of the available SDN and virtualized network solutions available today or in progress aim to deal with issues in one or more of these areas to improve their functional effectiveness, cost, automated-ness, or integrated-ness. Yet some areas receive an inordinate amount of focus/attention and those areas may not necessarily have the most opportunity for improvement. Let’s take a look at these six requirements in order of the opportunity value to bring new levels of effectiveness to data centers.

1. Edge Switching (inter-server or more generically, inter-end point)
Edge switching loosely covers the function of providing switching between end points, whether they be virtual servers, physical servers, storage devices, or terminating services devices (load balancers, firewalls, etc.) It is important to note that in a virtualized server environment, there is typically 2 layers of edge – a set of virtual switches that connect together VMs and a set of physical switches that connect the physical hosts.

Much of the attention and focus in the industry has revolved around edge switching, possibly because this is the area that is most visible to customers in terms of cost. Technologies like distributed virtual switches, network overlays, and trends like white box switching / disaggregated switching software all aim to lower either the capital or operational costs of the edge. Much of this effort is predicated on the decades of near-monopolistic control of major incumbents on the switching infrastructure – yet it is important to note that in the age of merchant silicon (which is not largely offered through a single vendor and is pretty much common to every edge switch on the market), open source (or semi open source) switching operating systems/stacks and virtual switches, much of that control has been mitigated and costs have rightly come down. It will be interesting to see if the industry starts to move its attention to some of the other areas that could offer more potential gain in overall cost savings.

2. Edge Policy
Edge policy refers to the implementation of configuration of those edge switches that allows some type of policy to be enacted on the endpoints that are connecting to the network. The term “policy” here can refer to basic port configuration, all the way up to SLA-level configuration that effects the behavior of the traffic emanating from the connecting end points.

Edge policy has long been a troublesome area for networks (or really any system) because enforcing a policy across a disparate set of systems creates consistency challenges. The key for most policy efforts are a simplified way to express the policy, and a highly efficient way to distribute the policy to many edges. The virtual switching layer (vSwitches) seem like a natural place to solve edge policy due to the ability to quickly iterate the software in the edge devices. The folks at VMWare seem to understand this and are busily working on providing policy enforcement capabilities in their NSX product and tying that to a policy expressing capability in OpenStack (the Congress project). The important point to consider there is if their approach can easily be stretched across both physical and virtual environments for a truly seamless edge policy approach.

3. Fabric Switching (inter-[rack | row | cage | pod | data center]
Fabric switching refers to the switching of traffic for non end-point connected devices or functional blocks. This could be the spine switches connecting multiple ToR leafs together, or could be a core switching layer that connects multiple pods, or even could be a switching capability that connects multiple data centers together. The basic differentiating attribute of core switching versus edge switching is that typically the core does not connect directly to end points, except for ones providing transit services (like firewalls or load balancers).

There has been a surprising lack of attention (and similar lack of concern?) regarding fabric switching in the industry. The default path seems to be higher density spine switches leveraging the commodity silicon cost/performance curves. While “brute forcing” it seems like the path of least resistance here, we can probably look at the history of the “scale up” approach in other IT contexts to notice that it usually works, until it breaks. I’ll have a future post on this in more detail, but suffice it to say that we ought to see more attention paid here to scale-out solutions that attempt to bring to the network the same capabilities that multi-core processors brought to servers/compute.

4. Fabric Policy
Similar to edge policy, fabric policy refers to the implementation of configuration of fabric devices that allows some type of policy to be enacted in the inter-* network. Since most connectivity policy (like access control, port/VLAN configuration) happens on the edge devices, much of fabric policy relates specifically to how the overall network behaves in accordance with specific business imperatives, such as service level agreements or treatment of regulated data in transit.

Fabric policy can be done implicitly (or basically deferred) by treating all paths through the network as equal, and load balancing all traffic equally across all paths (e.g. Equal Cost Multipathing is an example of this implicit or deferred fitting). Or it can be done explicitly with complex mechanisms such as call admission control (CAC) that are typically not found in most data oriented networks. Fabric policy can also be done algorithmically (see “fitting” below), and increasingly as networks become “Software Defined” with a central controller entity and the applications/users of the network require more heterogeneous treatment, this function not only becomes more easily accomplished but also more important, and in conjunction with the concept of fitting, can be a very powerful area to drive effective utilization of network resources.

Surprisingly, there has been almost no talk in the industry about fabric policy, which is very surprising. After all, the guts of the resources that the network has to offer reside in the fabric and controlling those resources via a policy seems to be an area ripe for efficiency and performance improvements. Fabric policy would allow a user to express how inter-rack, inter-pod, or even inter-data center capacity is allocated not in a pre-determined / engineered way, but in a just-in-time or even a predictive way that follows that actual usage patterns of the data center network.

5. Fitting
Fitting is not a term that is generally familiar to folks that think about networking, and may currently be something that is completely unique to Plexxi’s view on the world (although Cisco’s “declarative networking” concept is similar at least in philosophy). Yet fitting it is something that is almost always done even if it is done implicitly. Most networks are built today with a gross level understand of capacity needs, segmentation needs, etc. The network is then engineered to provide, in aggregate, these capabilities via a set of network resources. The concept of fitting is that we explicitly define what each user of the network (a user could be an application, a set of applications, a site, or really any arbitrary grouping) and based on its business-centric attributes, best fit the network resources to that user. The concept of fitting is hard to do manually, or in traditional legacy networks with traffic is looked at on a packet-by-packet or flow-by-flow basis. But in more evolved “software-defined” networks, it becomes much easier to build a higher level view of the users of the network, and allow the software entities (i.e. the controller) to algorithmically determine how best to dole out the resources based on the information it has about the users.

The concept of fitting is extremely power, especially in a software-defined world. As we want and need to be able to leverage networks for a variety of users with a broad spectrum of business criticality, fitting allows us to explicitly put resources where they can have the most benefit rather than the typically networking approach of “spray and pray”, and this presents enormous opportunities not only for cost savings, but for driving business differentiation.

6. Integration and Automation
Finally, all networks need to be integrated to the rest of the world, and increasingly are being automated by the rest of the world. Integration is typically thought of as the way to drive edge policy – e.g. things like leveraging OpenStack Neutron or ML2 plug-ins to automate VLAN provisioning on an edge port. However, integration can not only drive edge policy, but can also drive fabric policy and fitting if done correctly. Ultimately, most companies are moving toward a data center model that is completely “lights out” and the network need not be an exception. Ultimately the data center network provides a services to applications, and as long as those applications can express their needs across the edge and the fabric, a network should be able to provide those services with a minimal amount of hand holding.

A well-integrated network ought to have the ability to express its capabilities across the edge and the fabric in a set of abstract “primitives” that can be easily driven from external systems. The network also should be able to effect different behaviors via policy and have the ability to efficiently fit available network resources to the most critical business needs. And all of this needs to be done in ways that can easily automatable.

Summary
It's important when understanding what is needed for a data center network, to think about all six of these functional areas and the potential opportunity to drive cost savings or business differentiation in each of these areas. While much of the industry attention is pointed right now on reducing the costs of the first layer of switching (edge switching), there are many more areas that provide more drastic areas of both cost savings and differentiation opportunity for businesses looking for an IT advantage.

The post The 6 Requirements for Data Center Networks appeared first on Plexxi.

Read the original blog entry...

More Stories By Mat Mathews

Visionary solutions are built by visionary leaders. Plexxi co-founder and Vice President of Product Management Mat Mathews has spent 20 years in the networking industry observing, experimenting and ultimately honing his technology vision. The resulting product — a combination of traditional networking, software-defined networking and photonic switching — represents the best of Mat's career experiences. Prior to Plexxi, Mat held VP of Product Management roles at Arbor Networks and Crossbeam Systems. Mat began his career as a software engineer for Wellfleet Communications, building high speed Frame Relay Switches for the carrier market. Mat holds a Bachelors of Science in Computer Systems Engineering from the University of Massachusetts at Amherst.

@CloudExpo Stories
The “Digital Era” is forcing us to engage with new methods to build, operate and maintain applications. This transformation also implies an evolution to more and more intelligent applications to better engage with the customers, while creating significant market differentiators. In both cases, the cloud has become a key enabler to embrace this digital revolution. So, moving to the cloud is no longer the question; the new questions are HOW and WHEN. To make this equation even more complex, most ...
As you move to the cloud, your network should be efficient, secure, and easy to manage. An enterprise adopting a hybrid or public cloud needs systems and tools that provide: Agility: ability to deliver applications and services faster, even in complex hybrid environments Easier manageability: enable reliable connectivity with complete oversight as the data center network evolves Greater efficiency: eliminate wasted effort while reducing errors and optimize asset utilization Security: imple...
Mobile device usage has increased exponentially during the past several years, as consumers rely on handhelds for everything from news and weather to banking and purchases. What can we expect in the next few years? The way in which we interact with our devices will fundamentally change, as businesses leverage Artificial Intelligence. We already see this taking shape as businesses leverage AI for cost savings and customer responsiveness. This trend will continue, as AI is used for more sophistica...
In his Opening Keynote at 21st Cloud Expo, John Considine, General Manager of IBM Cloud Infrastructure, led attendees through the exciting evolution of the cloud. He looked at this major disruption from the perspective of technology, business models, and what this means for enterprises of all sizes. John Considine is General Manager of Cloud Infrastructure Services at IBM. In that role he is responsible for leading IBM’s public cloud infrastructure including strategy, development, and offering m...
Digital transformation is about embracing digital technologies into a company's culture to better connect with its customers, automate processes, create better tools, enter new markets, etc. Such a transformation requires continuous orchestration across teams and an environment based on open collaboration and daily experiments. In his session at 21st Cloud Expo, Alex Casalboni, Technical (Cloud) Evangelist at Cloud Academy, explored and discussed the most urgent unsolved challenges to achieve f...
In his session at 21st Cloud Expo, Raju Shreewastava, founder of Big Data Trunk, provided a fun and simple way to introduce Machine Leaning to anyone and everyone. He solved a machine learning problem and demonstrated an easy way to be able to do machine learning without even coding. Raju Shreewastava is the founder of Big Data Trunk (www.BigDataTrunk.com), a Big Data Training and consulting firm with offices in the United States. He previously led the data warehouse/business intelligence and B...
The past few years have brought a sea change in the way applications are architected, developed, and consumed—increasing both the complexity of testing and the business impact of software failures. How can software testing professionals keep pace with modern application delivery, given the trends that impact both architectures (cloud, microservices, and APIs) and processes (DevOps, agile, and continuous delivery)? This is where continuous testing comes in. D
Blockchain is a shared, secure record of exchange that establishes trust, accountability and transparency across business networks. Supported by the Linux Foundation's open source, open-standards based Hyperledger Project, Blockchain has the potential to improve regulatory compliance, reduce cost as well as advance trade. Are you curious about how Blockchain is built for business? In her session at 21st Cloud Expo, René Bostic, Technical VP of the IBM Cloud Unit in North America, discussed the b...
SYS-CON Events announced today that Synametrics Technologies will exhibit at SYS-CON's 22nd International Cloud Expo®, which will take place on June 5-7, 2018, at the Javits Center in New York, NY. Synametrics Technologies is a privately held company based in Plainsboro, New Jersey that has been providing solutions for the developer community since 1997. Based on the success of its initial product offerings such as WinSQL, Xeams, SynaMan and Syncrify, Synametrics continues to create and hone in...
With tough new regulations coming to Europe on data privacy in May 2018, Calligo will explain why in reality the effect is global and transforms how you consider critical data. EU GDPR fundamentally rewrites the rules for cloud, Big Data and IoT. In his session at 21st Cloud Expo, Adam Ryan, Vice President and General Manager EMEA at Calligo, examined the regulations and provided insight on how it affects technology, challenges the established rules and will usher in new levels of diligence arou...
Nordstrom is transforming the way that they do business and the cloud is the key to enabling speed and hyper personalized customer experiences. In his session at 21st Cloud Expo, Ken Schow, VP of Engineering at Nordstrom, discussed some of the key learnings and common pitfalls of large enterprises moving to the cloud. This includes strategies around choosing a cloud provider(s), architecture, and lessons learned. In addition, he covered some of the best practices for structured team migration an...
Companies are harnessing data in ways we once associated with science fiction. Analysts have access to a plethora of visualization and reporting tools, but considering the vast amount of data businesses collect and limitations of CPUs, end users are forced to design their structures and systems with limitations. Until now. As the cloud toolkit to analyze data has evolved, GPUs have stepped in to massively parallel SQL, visualization and machine learning.
The 22nd International Cloud Expo | 1st DXWorld Expo has announced that its Call for Papers is open. Cloud Expo | DXWorld Expo, to be held June 5-7, 2018, at the Javits Center in New York, NY, brings together Cloud Computing, Digital Transformation, Big Data, Internet of Things, DevOps, Machine Learning and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding busin...
Modern software design has fundamentally changed how we manage applications, causing many to turn to containers as the new virtual machine for resource management. As container adoption grows beyond stateless applications to stateful workloads, the need for persistent storage is foundational - something customers routinely cite as a top pain point. In his session at @DevOpsSummit at 21st Cloud Expo, Bill Borsari, Head of Systems Engineering at Datera, explored how organizations can reap the bene...
Kubernetes is an open source system for automating deployment, scaling, and management of containerized applications. Kubernetes was originally built by Google, leveraging years of experience with managing container workloads, and is now a Cloud Native Compute Foundation (CNCF) project. Kubernetes has been widely adopted by the community, supported on all major public and private cloud providers, and is gaining rapid adoption in enterprises. However, Kubernetes may seem intimidating and complex ...
In his session at 21st Cloud Expo, Michael Burley, a Senior Business Development Executive in IT Services at NetApp, described how NetApp designed a three-year program of work to migrate 25PB of a major telco's enterprise data to a new STaaS platform, and then secured a long-term contract to manage and operate the platform. This significant program blended the best of NetApp’s solutions and services capabilities to enable this telco’s successful adoption of private cloud storage and launching ...
In a recent survey, Sumo Logic surveyed 1,500 customers who employ cloud services such as Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP). According to the survey, a quarter of the respondents have already deployed Docker containers and nearly as many (23 percent) are employing the AWS Lambda serverless computing framework. It’s clear: serverless is here to stay. The adoption does come with some needed changes, within both application development and operations. Tha...
In his general session at 21st Cloud Expo, Greg Dumas, Calligo’s Vice President and G.M. of US operations, discussed the new Global Data Protection Regulation and how Calligo can help business stay compliant in digitally globalized world. Greg Dumas is Calligo's Vice President and G.M. of US operations. Calligo is an established service provider that provides an innovative platform for trusted cloud solutions. Calligo’s customers are typically most concerned about GDPR compliance, application p...
You know you need the cloud, but you’re hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You’re looking at private cloud solutions based on hyperconverged infrastructure, but you’re concerned with the limits inherent in those technologies.
Smart cities have the potential to change our lives at so many levels for citizens: less pollution, reduced parking obstacles, better health, education and more energy savings. Real-time data streaming and the Internet of Things (IoT) possess the power to turn this vision into a reality. However, most organizations today are building their data infrastructure to focus solely on addressing immediate business needs vs. a platform capable of quickly adapting emerging technologies to address future ...