DATA CENTER NETWORKING TOPOLOGIES 1 OVERVIEW Data Center Physical Layout Data Center Network Topologies ToR vs. EoR Data Center Networking Issues Data Center Networking Requirements 2 DATA CENTER STANDARDS ANSI/TIA-942 Telecommunications Infrastructure Standard for Data Centers Published 2005 – available through TIA at www.tiaonline.org ANSI/NECA/BICSI-002 Data Center Design and Implementation Best Practices complements TIA-942 –2007 3 PURPOSE OF TIA-942 Encourage early participation of telecom designers in data center design process Fill a void by providing standards for planning of data centers, computer rooms, serverrooms, and similar spaces. The standard encompasses much more thanjust telecommunications infrastructure. Close to half of the technical content deals with facility specifications. 4 PURPOSE OF TIA-942 Define a standard telecommunications infrastructure for data centers Structured cabling system for data centers using standardized architecture and media Accommodates a wide range of applications (LAN, WAN, SAN, channels, consoles) Accommodates current and future protocols (e.g., 10+ GbE (Giga bit Ethernet) ) Replaces unstructured point-to-point cabling that uses different cabling for different applications 5 DESIGN ELEMENTS 1. 2. 3. Cabling Design Facility Design Network Design 6 CABLING AND FACILITY DESIGN Cabling Design: Copper and fiber cabling performance Connectors, cables, distribution hardware Cabling distances Space management Facility Design: Data center sizing Power distribution methodologies Pathways and spaces HVAC, security, operations, and administration. Flexibility, scalability, reliability and space management 7 NETWORK DESIGN: Support of legacy systems Enable rapid deployment of new and emerging technologies such as 10 GbE and 10+ GbE copper and fiber applications. 8 GOOGLE DATA CENTER Take a walk through a Google data center http://www.google.com/about/datacenters/inside/s treetview/ 9 COOLING PLANT 10 MODULAR DATA CENTERS Called Ice cube Small: < 1 MW, 4 racks per unit Medium: 1-4 MW, 10 racks per unit Large: > 4 MW, 20 racks per unit Built-in cooling, high PUE (power usage effectiveness) 1.02 PUE = Power In/Power Used. It means cooling use 2 percent of power. Not efficiency of 0.98! Rapid deployment 11 CONTAINERIZED DATA CENTER All companies like IBM and cisco make it like this 12 UNSTRUCTURED CABLING 13 STRUCTURED CABLING 14 EQUIPMENT CABINETS Three Layers: 1. Bottom: Signaling (Ethernet),2. Middle: Power and 3. Top: Fiber Minimize patching between cabinets and racks 15 DATA CENTER PHYSICAL LAYOUT 16 ANSI/TIA-942-2005 STANDARD 17 ANSI/TIA-942-2005 STANDARD Computer Room: Main servers Entrance Room: Data Center to external cabling Cross-Connect: Enables termination of cables Main Distribution Area (MDA): Main cross connect. Central Point of Structured Cabling. Core network devices Horizontal Distribution Area (HDA): Connections to active equipment. Equipment Distribution Area (EDA): Active Servers+Switches. Zone Distribution Area (ZDA): Optionally between HDA and EDA. Rack, cabinet, or under floor enclosure that houses a zone outlet (ZO) or consolidation point (CP) Backbone Cabling: Connections between MDA, HDA, and Entrance room 18 ZONE DISTRIBUTION AREA 19 DATA CENTER NETWORK TOPOLOGIES Three levels of switches: Core, Aggregation, Access 20 DATA CENTER NETWORKS 20-40 servers per rack Each server connected to 2 access switches with 1 Gbps (10 Gbps becoming common) Access switches connect to 2 aggregation switches Aggregation switches connect to 2 core routers Aggregation layer is the transition point between L2-switched access layer and l3-routed core layer Low Latency: In high-frequency trading market, a few microseconds make a big difference. Cut-through switching and low-latency specifications. 21 DATA CENTER NETWORKS (CONT) Edge routers manage traffic between aggregation routers and in/out of data center All switches below each pair of aggregation switches form a single layer-2 domain Each Layer 2 domain typically limited to a few hundred servers to limit broadcast Most traffic is internal to the data center. Network is the bottleneck. Uplinks utilization of 80% is common. Most of the flows are small. Mode = 100 MB. DFS (Distributed File System) uses 100 MB chunks. 22 SWITCH LOCATIONS 23 TOR VS EOR ToR: Advantages: Easier cabling If rack is not fully populated unused ToR ports DisAdvantages: If rack traffic demand is high, difficult to add more ports Upgrading (1G to 10G) requires complete Rack upgrade EoR: Disadvantages Longer cables Advantages: Severs can be place in any rack Ports can easily added, upgraded 24 HIERARCHICAL NETWORK DESIGN All servers require application delivery services for security (VPN, Intrusion detection, firewall), performance (load balancer), networking (DNS, DHCP(Dynamic Host Control Protocol), NTP (Network Time Protocol), FTP, RADIUS (Remote Access Dial-In User Service server to perform authentication ), Database services (SQL) Stateful devices (firewalls) on Aggregation layer 25 DATA CENTER ACCESS LAYER DESIGN 1. 2. 3. 4. 4 Possibilities: Looped Triangle Looped Square Loop Free U Looped Free Inverted U 26 ACCESS AGGREGATION CONNECTIONS 27 CISCO USES ICONS TO REPRESENT DIFFERENT NETWORKING DEVICES 28 DATA CENTER NETWORKING ISSUES 29 DATA CENTER NETWORKING ISSUES (CONT) Under-utilization. Even when multiple paths exist only one is used. ECMP (Equal Cost Multipath) is used by routers to spread traffic to next hops using a hash function. However, only 2 paths exist. 30 DCN REQUIREMENTS Needs to be Scalable, Secure, Shared, Standardized, and Simplified (5 S's) Converged Infrastructure: Servers, storage, and network have to work together Workload Mobility: Large L2 domains required for VM mobility East-West Traffic: Significant server-to-server traffic as compared to server to user. One Facebook request required 88 cache looks, 35 database lookups, 392 backend RPC calls. Intranet traffic 935X the http request/response Storage traffic on Ethernet: Congestion management on Ethernet 31 4-POST ARCHITECTURE AT FACEBOOK Each rack contains a rack switch (RSW) with up to forty-four 10G downlinks and four or eight 10G uplinks (typically 10:1 oversubscription) one to each cluster switch (CSW) A cluster is a group of four CSWs and the corresponding server racks and RSWs 32 4-POST ARCHITECTURE AT FACEBOOK (CONT) Each CSW has four 40G uplinks (10G×4), one to each of four “FatCat” aggregation switches (typically 4:1 oversubscription). The four CSWs in each cluster are connected in an 80G protection ring (10G×8) and the four FC switches are connected in a 160G protection ring (10G×16) network failures used to be one of the primary causes of service outages. The additional redundancy in 4-post has made such outages rare. traffic that needed to cross between clusters used to traverse expensive router links. The addition of the FC tier greatly reduced the traffic through such links. All tiers are switched and not routers Ref: N. Farringon and A. Andreyev, “Facebook’s Data Center Network Architecture,” 2013 IEEE Optical Interconnect Conference, 33 http://nathanfarrington.com/papers/facebook-oic13.pdf MAIN DISADVANTAGES OF 1. 2. 3. 4. 5. 4-POST A CSW failure reduces intra-cluster capacity to 75% The cluster size is dictated by the size of the CSW Large switches are produced in smaller volumes from fewer manufacturers Large switches are often oversubscribed internally, meaning that not all of the ports can be used simultaneously Large switches are often very proprietary. This can lead to months and years between bug fixes 34 CLOS NETWORKS 35 FAT-TREE Fat-Tree networks were proposed by Charles E. Leiserson in 1985. Such network is a tree, and processors are connected to the bottom layer. The distinctive feature of a fat-tree is that for any switch, the number of links going down to its siblings is equal to the number of links going up to its parent in the upper level Therefore, the links get “fatter” towards the top of the tree, and switch in the root of the tree has most links compared to any other switch below it: 36 FOR ENTERPRISE NETWORKS However, for enterprise networks that connect servers, commodity (off-the-shelf) switches are used, and they have a fixed number of ports. Hence, the design of fat-tree, where the number of ports varies from switch to switch, is not very usable. Therefore, alternative topologies were proposed that can efficiently utilize existing switches with their fixed number of ports. There is a controversy whether such topologies should be called fat-trees, or rather “(folded) Clos networks”, or anything else. However, the term fat-tree is widely used to describe such topologies. 37 TWO-LEVEL FAT-TREE NETWORK. 38 SUMMARY 1. 2. 3. 4. 5. Modular data centers can be used for easy assembly and scaling Three tiers: Access, Aggregation, Core Application delivery controllers between Aggregation and core Need large L2 domains Fat-tree topology is sometimes used to improve performance and reliability 39