Fujitsu Develops Cluster-Based Distributed Controller Technology to Implement Failure-Tolerant Wide-Area Software-Defined Networking

Enables uninterrupted operations of large-scale networks

Kawasaki, Japan, June 5, 2014 - (JCN Newswire) - Fujitsu Laboratories Ltd. today announced that it has developed cluster-based distributed controller technology for large-scale networks. The technology implements wide-area software-defined networking (SDN) and automatically handles controller failures and load fluctuations.

A cluster-based distributed controller runs on multiple physical controllers as a single logical controller to control multiple network switches. Compared to conventional centralized controllers, cluster-based distributed controllers offer better scalability and improved failure tolerance. Until now, however, such controllers have had difficulty handling sudden load fluctuations and maintaining coordinated control when a controller fails.

Now, Fujitsu Laboratories has developed three technologies: a distributed controller module for the coordinated control of multiple controllers; a load-balancing technology that, in a matter of seconds, transfers a switch from one controller to another when a controller comes under increasing load or fails; and an uninterrupted recovery technology. These technologies enable SDNs to work reliably when traffic rises beyond initially expected levels or when multiple controllers fail.

By deploying an SDN with these technologies to a wide-area network, infrastructure can recover quickly from disasters or other network failures while maintaining steady network operations.

These technologies are being presented at Interop Tokyo 2014, opening June 11 at Makuhari Messe in Chiba, Japan.

Background

Existing SDN technologies such as OpenFlow(1) are designed for centralized control. When a wide-area network whose switches transfer large volumes of packets is operated as an SDN, load concentrates heavily on the controller as the number of users increases. This is an obstacle to the smooth provision of service, and if the controller itself fails, the switches that it had been managing can no longer be controlled.

Fujitsu Laboratories solved these problems by treating multiple physical controllers as a single logical controller that can handle centralized control of thousands of switches. This is accomplished through a proprietary cluster-based distributed controller technology.

This technology consists of two parts: a module for control applications, which is an add-on to existing controller applications, and a distributed controller module, which connects multiple distributed controllers as components of an OpenFlow controller. Depending on load, application and controller components can be added along with server resources.

Issues

Cluster-based distributed controllers differ from centralized controllers in that multiple distributed controller modules need to run in a coordinated way so that they do not compete with each other. Another challenge is ensuring continuity of control. Processes need to keep running even if a module fails. Automatic switchover is difficult, however, when some controller components are heavily loaded or fail: processing by the controllers managing the switches slows down, or control becomes unsustainable.

About the Technology

Fujitsu Laboratories has developed a load-balancing technology that automatically redistributes control loads in a cluster-based distributed controller, and a recovery technology that automatically reassigns controllers without interruption when one fails.

Load-Balancing Technology

Fujitsu Laboratories has developed a load-checking function as a new addition to the distributed-controller coordination module. This function collects load information, such as CPU utilization rate and number of switches, from each controller component (step 1). The coordination system periodically checks this load information, using one distributed-controller coordination module chosen as the "leader" based on module control number or another criterion, to detect load imbalances (step 2). If the load-balancing logic judges that rebalancing is needed, the switch-reassignment logic decides which switches to reassign so that the load is balanced according to a policy for CPU utilization rates and number of switches (step 3). The changed correspondence between switches and controllers is then registered in the coordination system (step 4), and the load is balanced by reassigning the switches in accordance with the updated information from the distributed controller (step 5).
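Steps 1 through 3 of the load-balancing flow can be illustrated with a minimal Python sketch. It is not Fujitsu's implementation: the names Controller and plan_rebalance, the 80% CPU limit, and the fixed 5-point CPU cost per switch are all hypothetical, chosen only to show how a leader might turn collected load information into a list of switch moves.

```python
from dataclasses import dataclass, field


@dataclass
class Controller:
    name: str
    cpu: int                                   # CPU utilization in percent
    switches: list = field(default_factory=list)


def plan_rebalance(controllers, cpu_limit=80, cpu_per_switch=5):
    """Return (switch, source, destination) moves for overloaded controllers.

    cpu_limit and cpu_per_switch are illustrative assumptions, not values
    taken from the announcement.
    """
    moves = []
    for src in controllers:
        # Keep moving switches away while the controller is over the limit.
        while src.cpu > cpu_limit and len(src.switches) > 1:
            # Prefer the controller with the lowest CPU, then fewest switches.
            dst = min(
                (c for c in controllers if c is not src),
                key=lambda c: (c.cpu, len(c.switches)),
                default=None,
            )
            if dst is None or dst.cpu + cpu_per_switch > cpu_limit:
                break  # no destination can absorb the switch safely
            switch = src.switches.pop()
            dst.switches.append(switch)
            src.cpu -= cpu_per_switch
            dst.cpu += cpu_per_switch
            moves.append((switch, src.name, dst.name))
    return moves
```

In this sketch the returned list of moves stands in for the updated switch/controller correspondence that would be registered in the coordination system in step 4.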

Uninterrupted Recovery Technology

Fujitsu Laboratories has developed a new failure-checking function for the distributed-controller coordination module. The distributed-controller coordination module chosen as leader detects a failure in a controller component (steps 1, 2) and determines a new controller component to manage the switches connected to the failed controller (step 3). It then changes the controller/switch correspondence information so that loads are redistributed automatically based on controller-component load information (CPU utilization rates and number of switches) (step 4). The distributed-controller coordination modules that have not failed pick up the updated information and apply it to reassign the controllers managing the switches (step 5), so that operations continue without any interruption in service. Because the controllers that are the reassignment destinations are decided using the load-balancing technology, no controller should experience a sudden load spike that would cause it to shut down.
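The reassignment decision in steps 3 and 4 can be sketched as follows, under stated assumptions. The function name reassign_after_failure, the dictionary layout, and the fixed per-switch CPU cost are hypothetical; the announcement says only that destinations are chosen from CPU utilization rates and number of switches.

```python
def reassign_after_failure(assignment, cpu, failed, cpu_per_switch=5):
    """Spread the failed controller's switches across the survivors.

    assignment: dict mapping controller name -> list of switch names
    cpu:        dict mapping controller name -> CPU utilization in percent
    cpu_per_switch is an assumed cost used to estimate the load added
    by each reassigned switch.
    """
    survivors = sorted(c for c in assignment if c != failed)
    if not survivors:
        raise RuntimeError("no surviving controller to take over")

    new_assignment = {c: list(assignment[c]) for c in survivors}
    estimated_cpu = {c: cpu[c] for c in survivors}

    for switch in assignment[failed]:
        # Pick the survivor with the lowest estimated CPU, then fewest switches.
        target = min(
            survivors,
            key=lambda c: (estimated_cpu[c], len(new_assignment[c])),
        )
        new_assignment[target].append(switch)
        estimated_cpu[target] += cpu_per_switch
    return new_assignment
```

Updating the estimated load after each assignment is what keeps a single survivor from absorbing every orphaned switch, which mirrors the stated goal of avoiding a sudden load spike on any one controller.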

Furthermore, even if the leader module itself fails, the coordination system will detect the session interruption and select a new leader, and that leader module will again determine which controllers manage the switches.
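Leader re-selection can be sketched in a few lines. The rule used here, that the live module with the lowest module control number becomes leader, is an assumption for illustration; the announcement says only that the leader is chosen by module control number or another criterion. The name elect_leader is hypothetical.

```python
def elect_leader(modules):
    """Return the control number of the new leader.

    modules: dict mapping module control number -> True if the module's
    session with the coordination system is still alive.
    Assumed rule: the lowest live control number wins.
    """
    alive = [number for number, is_alive in modules.items() if is_alive]
    if not alive:
        raise RuntimeError("no live coordination module")
    return min(alive)
```

For example, if modules 1, 2 and 3 are alive, module 1 leads; once the session of module 1 is interrupted, calling the function again on the updated status selects module 2.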

Results

Using the cluster-based distributed controller makes it possible to handle sudden load fluctuations and to maintain continuity of network services even when controllers fail, enabling stable, highly reliable operations of wide-area networks.

For example, when conventional controllers are duplicated in hot standby mode (one active and one on standby) for a ten-domain network, 20 controllers are required in total, or two per domain. By contrast, with cluster-based distributed controllers, just one standby controller is added to the ten regularly running controllers, so only 11 controllers are needed, reducing the number of controllers by nearly half.
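The controller counts in this example follow from simple arithmetic, reproduced below as a small Python sketch. The function names are hypothetical, and the single shared standby controller is the figure given in the example above.

```python
def hot_standby_total(domains):
    # One active and one standby controller per domain.
    return 2 * domains


def cluster_total(domains, standby=1):
    # One running controller per domain plus a shared standby pool.
    return domains + standby


domains = 10
conventional = hot_standby_total(domains)      # 20 controllers
clustered = cluster_total(domains)             # 11 controllers
saved_percent = 100 * (conventional - clustered) / conventional
```

With ten domains, nine of the 20 controllers are saved, a 45% reduction, which is the "nearly half" cited above.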

Future Plans

This technology could be used in the networks of telecommunications carriers and other network infrastructure to achieve highly reliable, stable operations with lower deployment costs and lower operating costs.

Fujitsu Laboratories is continuing with research and development on control technology for cluster-based distributed controllers with the goal of a practical implementation in fiscal 2015.

Note:

(1) OpenFlow

A centralized control technology that isolates the control unit for routers and switches from data transfer.

About Fujitsu Limited

Fujitsu is the leading Japanese information and communication technology (ICT) company offering a full range of technology products, solutions and services. Approximately 170,000 Fujitsu people support customers in more than 100 countries. We use our experience and the power of ICT to shape the future of society with our customers. Fujitsu Limited (TSE: 6702) reported consolidated revenues of 4.4 trillion yen (US$47 billion) for the fiscal year ended March 31, 2013. For more information, please see www.fujitsu.com.



Source: Fujitsu Limited

Contact:
Fujitsu Limited
Public and Investor Relations
www.fujitsu.com/global/news/contacts/
+81-3-3215-5259


Copyright 2014 JCN Newswire. All rights reserved. www.japancorp.net
