Fourth International Workshop on the 
Design of Reliable Communication Networks
(DRCN 2003)
19-22 October 2003 - Banff, Alberta, Canada
 

 

 

 

 


TUTORIALS

DRCN 2003 delegates were invited to attend their choice of four free half-day tutorials held on Sunday, 19 October 2003. Descriptions of each tutorial follow:


TUTORIAL T1 - 9:00 AM to 12:30 PM, Sunday 19 October 2003

Title: "Reliability and Security of IP Data Networks" 

Presenter: Ross Callon, Juniper Networks

Summary: IP data networks are increasingly becoming a critical part of the infrastructure used for business, shopping, travel, and entertainment. This tutorial will outline the methods used to ensure a high degree of reliability and availability of IP data networks. We will describe techniques and features which are essential to improve the reliability of individual components in the networks, with emphasis on router reliability. We will then discuss protocol features which can be used to improve the availability of the overall IP network. We will also discuss issues related to ensuring the availability of the services offered over data networks.

In recent years there has been a rapid increase in malicious attacks against IP data networks. Networks are now under essentially continuous attack. Ensuring the security of IP data networks against malicious attacks has become critical in order to ensure the reliability and availability of the network. This talk will outline the urgency of increasing the security of data networks. We will describe how the routing protocols and the router infrastructure can be protected against a wide range of attacks. We will also describe how routers can help to improve the security of network servers and users.

Ross Callon is a distinguished engineer in the protocols group at Juniper Networks. He has extensive experience in Internet protocol standards, high speed router design, and multi-protocol coexistence and interoperability. He is co-chair of Network Reliability and Interoperability Council 6, Focus Group 2 advising the Federal Communications Commission (FCC) on network reliability. He also was a participant in a recent effort to advise the White House on security in communications networks. Mr Callon is a long-standing participant in multiple IETF working groups, and has previous experience in the ATM Forum, internet engineering steering group, IEEE, ANSI, and ISO. He has authored or contributed toward VPN, MPLS, PNNI, IPv6, IS-IS and CLNP networking standards. He is a former co-chair of the IETF IP Next Generation (IPv6) working group. Mr. Callon has published numerous articles and been awarded twelve patents. He has a Bachelor of Science in Mathematics from the Massachusetts Institute of Technology and a Master of Science in Operations Research from Stanford University.


TUTORIAL T2 - 9:00 AM to 12:30 PM, Sunday 19 October 2003

Title: "Protection and Restoration in Optical Ring and Mesh Networks"

Presenters: Sudipta Sengupta and Dr. Vijay P. Kumar, Bell Labs, Lucent Technologies

Summary: Recent advances in optical switching and transmission technologies have enabled optical networks to offer virtually unlimited bandwidth, a very important ingredient for sustaining the continued growth of the Internet. Using Dense Wavelength Division Multiplexing (DWDM), a single optical fiber today can carry several tens to few hundreds of 2.5G/10G wavelengths. To keep up with the growth in bandwidth capacity of transmission systems, optical switching equipment is also evolving to support wavelength matching port speeds and scale to several hundreds to thousands in port count. Today's communications networks are going through a transformation from legacy SONET/SDH ring backbones to next generation optical switch based mesh topology architectures that offer the benefits of significant cost reduction, ease of manageability, automated provisioning, increased flexibility, and enabling new revenue generating services and applications.

Given that a single wavelength can carry potentially several tens and a single fiber several hundreds to thousands of upper layer connections like ATM/FR, TDM, and IP/MPLS, the impact of an equipment failure or fiber cut in terms of service disruption can be catastrophic. For example, a fiber cut could result in loss of several terabits/sec of information for a period of hours to days. Fortunately, in keeping with the growth of raw bandwidth at the optical layer, there have also been similar advances in architectures and protocols for automatic survivability (protection/restoration) in optical networks. The two most important performance objectives for any optical layer survivability mechanism are to (i) minimize the service disruption time (also called restoration latency), and (ii) minimize protection/restoration capacity overhead in the network. The first objective attempts to mask optical layer failures from upper network layers while the second aims to increase network utilization. The terms protection and restoration differ in the way backup resources are allocated – for protection, a dedicated backup resource is deployed to recover the services provided by a failed resource; for restoration, backup resources are not dedicated to specific primary resources, but a "pool" of resources are kept available for overall recovery purposes, and the allocation of backup resources occurs dynamically after failure. Thus, restoration could result in longer disruption times as compared to protection, but achieve possible improvements in network utilization.

This tutorial will cover survivability mechanisms in optical ring and mesh networks. We begin with an introduction to optical transmission and switching technologies, existing and emerging transport network infrastructure, nature and impact of network failures, and performance objectives for designing protection/restoration schemes. We then discuss protection mechanisms in traditional SONET/SDH rings. Protection and restoration in emerging optical mesh networks, which has been a subject of recent research and development, is presented next. Coverage will include the network architecture context, protection/restoration types, protection/restoration protocols, algorithms for routing with protection/ restoration, and protection/restoration capacity performance comparisons. The status of ongoing standardization activities in IETF for restoration in optical mesh networks will also be reviewed.

Sudipta Sengupta is currently in the Optical Networks Research Department at Bell Laboratories, Lucent Technologies. In addition to pursuing research in network protocols and routing algorithms, he is also a lead architect for distributed control plane architecture for Lucent’s optical networking product portfolio. Prior to this, Sudipta was a senior architect at Tellium and worked on the dynamic provisioning and restoration architecture and routing algorithms for one of the industry’s earliest mesh optical core switch. Sudipta holds an M.S. degree from the Massachusetts Institute of Technology (MIT), Cambridge, USA and a B.Tech. degree from the Indian Institute of Technology (IIT), Kanpur, India, both in Computer Science. He received the President of India Gold Medal at IIT-Kanpur for academic excellence. Sudipta has authored numerous publications for conferences, journals and technical magazines, and has filed US patents in the area of computer networking. He has also taught advanced courses on optical networking at academic/research and industry conferences. 

Dr. Vijay P. Kumar is Director of Intelligent Optical Networks Research at Bell Laboratories, Lucent Technologies, where he leads research in architectures, algorithms, and protocols for optical networks. Prior to undertaking this position in 2001, he was Director of High Speed Networks Research in Bell Labs and led the teams that created the ATLANTA ATM chip set (the industry-leading silicon solution for ATM switches) and the PacketStar IP router (the industry's first gigabit router with per-flow QoS). Kumar received his B.Engg. (Electronics and Communication Engineering) from Osmania University and the M.S. and Ph.D. degrees (Electrical and Communication Engineering) from the University of Iowa. He is a recipient of the 1998 Ellersick Prize Paper Award, an IEEE Fellow, and a Fellow of Bell Laboratories.


TUTORIAL T3 - 1:30 PM to 5:00 PM, Sunday 19 October 2003

Title: "Introduction to Telecommunications Systems Survivability - A Practitioner's View"

Presenters: Dr. Veena Mendiratta and Chuck Witschorik, Bell Labs, Lucent Technologies

Summary: Public telecommunications systems have evolved to be highly reliable through hardware and software redundancy, fault recognition and containment, and fast recovery. There is now a new emphasis for telecommunications systems to be highly survivable also to limit the extent of service loss after a catastrophic loss of infrastructure. This tutorial will provide an introduction to the subject of telecommunications system survivability from a practitioner’s perspective. Our focus is exclusively on the voice network for wireline, cellular and voice over packet implementations.

The following areas are covered. The introduction discusses the motivation for the need for survivable telecommunications systems with examples and related work in this area. The background section covers definitions, requirements, and network architectures. The section on principles defines the architecture layers for survivability, introduces the hardware and software components in each layer, defines the automatic detection and recovery principles currently used in telecommunications systems to achieve reliability, and provides an overview of the processes and procedures required for effecting recovery after a loss; principles for subscriber database recovery and survivability are also presented. The section on application brings together the architecture principles described earlier to define alternative architectures for survivability for wireline, cellular and voice over packet networks. Transport and network routing are addressed, network elements and their functions are described. The alternative architectures are presented in the context of incremental changes to existing networks to improve survivability as well as in the context of new installations (greenfield) which can be closer to an ideal architecture with respect to survivability. The alternative architectures are evaluated against the failure examples presented in the introduction. The last section summarizes what is required for architecting a survivable voice telecommunications system.

Dr. Veena B. Mendiratta has been at Bell Labs, Lucent Technologies (formerly AT&T) since 1984. She is currently a Consulting Member of Technical Staff in the Advanced Technologies business unit. Dr Mendiratta’s interests are in the areas of architecture, reliability, fault tolerant computing and software reliability engineering. Most of Dr Mendiratta’s work has focused on the reliability and performance analysis of switching and access systems to guide system architecture solutions; recent work has focused on Voice over Packet solutions and architecting survivable switching solutions. Last year she was one of the lead architects for a contracted study to evaluate voice network survivability for a large government installation. Recommendations from the study have been approved for implementation. She has presented papers at several refereed conferences, most recently a tutorial on Software Reliability Engineering Techniques at IEEE RAMS (January 2003), and is a member of IEEE and INFORMS. Dr Mendiratta holds a PhD (1981) in Operations Research from Northwestern University, Evanston, Illinois and a B.Tech (1970) in Engineering from the Indian Institute of Technology, New Delhi, India. 

Chuck Witschorik is a Consulting Member of Technical Staff with Lucent Technologies, where he has worked on PSTN switching product architecture and development for 24 years. He is currently leading a team of architects responsible for product evolution of public voice network switches and access points. His recent work has included the role of lead architect in a contracted study to evaluate voice network survivability for a large government installation. Recommendations from the study have been approved for implementation. He holds a M.Sc degree in Computer Science from the Illinois Institute of Technology and has been granted seven USA patents in telecommunications.


TUTORIAL T4 - 1:30 PM to 5:00 PM, Sunday 19 October 2003

Title: "Availbility and Survivability Considerations for Storage Area Networks"

Presenter: Dr. Geoff Hayward, YottaYotta

Summary: Until recently, storage area networks (SANs) have had a maximum reach of about 100 km. However, new protocols enable storage traffic to be reliably transmitted over tens of thousands of kilometres using shared TCP/IP networks. Consequently, a new form of data network, the storage wide area network (SWAN), has emerged. A SWAN can provide the end user with a "local disk" that actually comprises storage in many widely dispersed data centres.

Storage networking over large distances presents some unusually stringent demands for high availability. Customers expect 99.999% service layer availability. Further, they expect that any failures that cause network partitions will never result in data corruption (e.g., survivability to the "split-brain" problem). Also, since the primary focus is on meeting service level agreements, it is not sufficient to focus exclusively on network availability. Multi-layer "availability engineering" strategies are required. Fail-over sequences initiated by the various software and hardware subsystems must integrate well together and be sensitive to geographic and quality of service constraints.

In this tutorial, we provide

  • A brief review on storage networking (e.g., SAN, NAS, and "next-generation" architectures).
  • An overview of several strategies for extending SANs over the WAN (e.g., iFCP, FCIP, iSCSI, and native FC over WDM).
  • Availability engineering with pre-configured multi-layer failure action hierarchies.
  • Network partitioning and "split brain".
  • Geographically and bandwidth sensitive fail-over routing and system recovery.
  • QoS and SLAs for data availability.
  • Disaster recovery strategies.
  • Availability considerations for peer-to-peer storage network architectures and "storage grids".

Dr. Geoff Hayward, Director of Advanced Technology at YottaYotta, has been a pioneer in geographically distributed storage system research. His team established the world's first storage wide area network running on shared, production links at Gigabit per second rates. In October of 2002, Dr. Hayward led an international research collaboration that achieved the current world record for disk-to-disk data transfer over distance.

Dr. Hayward graduated cum laude from Yale University in 1983. He went on to receive his Ph.D. in theoretical physics from the University of Alberta in 1990. In 1990 and 1992, he was awarded international fellowships from the Natural Sciences and Engineering Research Council and the Canadian Institute for Theoretical Astrophysics (respectively) for his research on quantum gravitational systems. He has gone on to lead teams dedicated to research in the fields of photonics, seismology, and storage networking. He holds two patents with an additional two patents pending.

 


For further information, contact:

Deep Medhi, DRCN 2003 Tutorials Chair
School of Interdisciplinary Computing and Engineering
University of Missouri-Kansas City
5100 Rockhill Road
Kansas City, MO 64110-2499 USA
(816) 235-2006 [Ph]
(816) 235-5159 [Fax]
dmedhi@umkc.edu