SAIN Networking SAIN = Synchronized Adaptive INfrastructure Ray W Sanders Chairman, SAIN Networks, Inc. overcoming unintended consequences in voice and data networks What this talk is about A simple paradigm that can overcome the unintended consequences of today’s stochastic data network. The paradigm results in a simple underlayer that can ensure a deterministic data network. 2 Sanders Prediction Packets will be forever, but the global Internet will morph into something that looks a little like a late 1970’s telephone network but with far more capability and without the fatal flaws of carrying only connections that must last for at least a few seconds and support only voice conversations 3 Comparing SAIN with existing networks Existing Networks Route one-connection-at-a time SAIN Networks Route aggregations of connections Hop-by-hop routing for each connection Route aggregations for a one-hop channel Network uses multiple control planes Network uses a single control plane Networks are largely stochastic Networks are deterministic Wire speed latency can be 100 ns Latency inversely proportional to data rate Internet uses many overlay protocols NICs make use of single purpose utility Data sent in bursts Bursts forwarded or smoothed out Complicated Quality of Service required Guaranteed delivery—one metric: delay Head-of-line blocking complications Small packet wins by increasing data rate Tough privacy and security problems Disjoint objects conceal cellet relevance Bursty data can require overprovisioning Aggregated streams require less BW Many networking protocols must exist 2 simple algorithms manage BW and routes 4 A thought experiment Assumptions: 1. 1,000 people want to see 1,000 two-hour movies starting at 8 p.m. 2. Each movie contains 9 gigabytes of data 3. The network can use up to 10 Gbps to deliver a collection of movies 4. Suppose we use 10 Gbps for each movie It takes 7.2 seconds to send one movie How long does it take to send all 1,000 movies one after another? 1,000 × 7.2 seconds= 2 hours How long would the average customer need to wait to start seeing his movie? 1 hour Now, suppose that we send each movie at 10 Mbps (1/1000th of 10 Gbps) How long does each of 1,000 customers wait to start watching his movie? 0 hours There is a compelling requirement to control bandwidth, (and hence delivery time) to meet each customer’s need This result can obtain if a network is deterministic 5 Goals for a SAIN network Define and build elemental pieces of a network architecture that: 1.can support all existing voice and data network traffic 2.can support unknown future traffic types 3.can grow from data centers, to metropolitan networks, to a global interconnected network 4.is robust, efficient, and simple 5.is a circuit-based architecture that can endure and scale for decades 6 Constraining networks to really improve their efficiency A core principle of the SAIN architecture Partition a network into small disjoint pairs of active objects such as pairs of NICs and pairs of switches What does this do? • Enhances a network’s privacy and security • Prevents one object in a network from changing the state of another without using a Control Vector to send messages from a source object to a destination object • Prevents any entity outside a network from changing the state of an object inside the network • Simplifies object addressing Generic Aggregation Switch Interconnecting Elements Generic Disaggregation Switch Basic Aggregation / Disaggregation Switch Pairs 7 Constraining networks to really improve their efficiency Another core principle of a SAIN architecture Nodes in a SAIN network are synchronized to a common clock What does this do? • Enables very cheap high-performance switches that can scale well beyond current limits • Removes the need for complex Quality of Service facilities inside a network 8 Constraining networks to really improve their efficiency A third core principle of the SAIN architecture All user data protocols are separated from data transport and its control SAIN Protocol SAIN Protocol Translator Translato Ingress | rEgress NIC SAIN Protocol Translator Ingress | Egress NIC What does this do? • Defines an underlay network whose only job is to transfer bits from a 20 20 data source to a data sink 4 SAIN 4 Host, Host, • Enables Network Interface Terminal, Terminal, Controllers (NICs) to support Server, or Underlay Server, or Network Network devices with any protocol. Network • Demands that an Egress NIC’s protocols must match its paired Ingress NIC’s protocols • Lets a NIC match from only one other NIC to a large number of NICs in a network 9 Constraining networks to really improve their efficiency A fourth core principle of the SAIN architecture Build a lot of a network’s physical and logical connectivity a priori to its use What does this do? • Enables each port of a network to have a physical connection to every other port of the network with a matching NIC; the connections are set up when the network is built or modified • Enables every possible route to be computed when the network is built and need not be recomputed until new nodes are added to the network 10 Constraining networks to really improve their efficiency A fifth core principle of the SAIN architecture All connections are ‘virtual’ that consume network bandwidth only when there are data bits to transport What does this do? • Enables each connection to be set up prior to use • Assures that no bandwidth is used until data is to be sent • Assures that an amount of bandwidth allocated to a connection is just enough to meet a customer’s needs 11 Constraining networks to really improve their efficiency A sixth core principle of the SAIN architecture All connections from a source node to a destination node are aggregated into a single logical data flow What does this do? • Significantly reduces the number of objects to be routed through a network • Packets do not get routed independently ; they are combined into aggregations sent from a source to a destination node through preset routes • No computing needed at each tandem node • A route is a virtual connection between two nodes; if it approaches congestion, another route can be quickly added 12 Constraining networks to really improve their efficiency A seventh core principle of the SAIN architecture The amount of available bandwidth and delay must be known for each possible route through a network before a connection is made What does this do? • Prevents discarding packets because of network congestion • Dynamically provides the most cost-effective route with bandwidth to meet each a connection’s need How can this be accomplished? • Delay over a route is known when nodes are installed • Each node connected to a transport connection (trunk) sends the trunk’s bandwidth availability to each source node in the network periodically (e.g. 1,000 times per second) 13 What are data networking’s unintended consequences? Some examples of unintended consequences in today’s networks 1.Traffic congestion and discarded packets 2.Jitter (= delay variation); traffic shaping and policing 3.Overprovisioning and Quality of Service 4.Flow-based traffic and circuit emulation 5.Lack of privacy, security and survivability 14 Network Behavior Constraint 1 Eliminate Traffic Congestion Packets and packet buffers are not going away in a SAIN network For each end-to-end connection there is a packet buffer at its ingress node and one at its egress node Each connection that occurs at a source-node/destination-node pair within a given period (an ‘epoch’ for a group of connections) originates within a pair of switches Generic Aggregation Switch Interconnecting Elements Generic Disaggregation Switch Basic Source Aggregation / Destination Disaggregation Switch Pairs Interconnecting Elements include source/destination node switches in three aggregation tiers above the lowest tier The lowest tier aggregates customer data; the higher tiers forward aggregations Each higher tier aggregates the next lower tier’s data 15 Network Behavior Constraint 2 Eliminate jitter, traffic shaping and policing Jitter (also known as delay variation) is the aperiodic arrival of each packet. Aperiodic arrivals of packets in data flows can cause service disruptions Changing bandwidth of a connection can assure that either the start time of a received packet or the time required to receive an entire packet provides uninterrupted service SAIN network synchronization provides ‘traffic shaping’ and ‘policing’ without additional complexity 16 Network Behavior Constraint 3 Reduce overprovisioining Aggregating connections into channels can benefit from the Law of Large Numbers The law can result in the bandwidth of a large aggregation changing slowly compared to faster bandwidth changes of the lowest tier Node synchronization can result in a network not needing Quality of Service as currently defined A desirable metric is end-to-end delay of entire packets— not wire speed starting time of sending a single packet 17 Network Behavior Constraint 4 Flow-based traffic without Circuit Emulation Nodal clocks can provide physical circuits in a simple manner compared to the current complexity of circuit emulation The physical circuits operate at all levels of aggregation and can be virtual or real The necessity of providing circuits for flow-based traffic is a major reason to implement the SAIN architecture In addition to basic algorithms, a third ‘floating frame’ algorithm exists for plesiochronous operation where span lengths of trunks vary (e.g., for moving nodes and environment variations) 18 Network Behavior Constraint 5 Provide better privacy, security and survivability Overcome current core network privacy and security weaknesses A SAIN network can assure that all network objects used to forward packet data through the network are disjoint. Network data forwarding control can be massively distributed with centralized monitoring and fault management A network object cannot change the state of another object except by using a certified Control Vector connected from a source node to a destination node A destination node can authenticate certification of a connected Control Vector. Certification can use round-trip delay of destination and source nodes Bandwidth management algorithm results in ever-changing aggregation frames ‘Floating frames’ enhance security 19 More Network Behavior Architecture scales beyond current limits Instead of forwarding entire packets a SAIN network forward only one or a few bits of a packet at a time This results in using very simple switches that forward large aggregations without requiring expensive large routers Not only are costs reduced; energy needs are reduced as well 20 More Network Behavior A single metric defining application needs There is no need for traffic shaping or policing; there is no need for circuit emulation; there are no out-of-order packets; and the packet loss rate is zero Synchronized network nodes and implicit addressing achieves this goal Node synchronization can result in a single metric that defines required delays for application types The single metric defines end-to-end delay of entire packets— not just the wire speed starting time of sending a single packet 21 More Network Behavior Results from simulations of a model network 17 07 4 010 5 07 27 2.1 042 278.0 226 .3 357.9 041 02 8 044 051 214.5 11 063 068 295.8 043 033 226.2 069 28 1.1 052 4 05 062 064 161.8 12 6 06 5 58.2 6 0 1 067 003 367 .9 05 5 20 3. 2 7 6 04 05 6 0.1 33 049 163.3 061 246.2 6 9.2 22 5 04 10 7 04 8.5 23 05 8 050 14 3 05 29 4.7 8 030 2 03 4 02 05 7 034 80 02 7 015 247 .0 004 217. 4 036 4 03 8 8 04 8.5 19 059 .6 273 33 0.7 5.9 26 3 02 .5 215 3 03 7 029 31 014 060 026 025 .1 204 021 9 03 5 15 5. 2 41 269.2 022 The green circles are transit nodes (T-Nodes) The red rectangles are 9 entry/exit nodes [E-Nodes] Each [E-Node] (T-Node) contains source switches connecting to paired destination switches in all other [E-Nodes] (T-Nodes) in a network 008 165.8 006 233.8 20 0 02 007 .8 248 002 071 0 04 188.8 035 2 1 001 013 7. 9 31 01 1 217.5 9 01 20 0.4 072 172 .1 016 18 012 005 017 423.5 .4 291 19 077 16 009 185.5 018 79 19 078 07 3 6 07 070 7 9. 27 13 A Metropolitan Area Network Example with 20 T-Nodes & 80 Simplex Trunks 500 E-Nodes each able to support >4,000 ports each with multiple IP addresses 22 Sanders Suggestion We should not let ourselves make another management mistake that the future of networking will be based entirely on using packet switches for routing Our focus should morph into efforts that enhance IP* addressing and DNS* in a circuit-based world with advanced NIC applications * Internet Protocol addressing and * Domain Name System 23 24 25 How to support goals 1 & 2 (Support existing and future traffic types) • Transport of bits is independent of data type SAIN Underlay Network 20 4 SAIN Protocol SAIN Protocol Translator Translato Ingress | rEgress NIC SAIN Protocol Translator Ingress | Egress NIC Host, Terminal, Server, or Network 20 4 Host, Terminal, Server, or Network • Packets appear only at ingress and egress ports with connected NICs • Packet or circuit data appears at an ingress NIC and is transferred to an egress NIC • An ingress/egress pair of NICs can support any matched data type • NIC pairs can support secure topologies and methods • Packets are transferred bit-by-bit at a deterministic data rate • An Egress NIC delivers the protocol entering its paired Ingress NIC 26 How SAIN works #1 What a packet flow can look like: Packet Header H Input A Packet Data D H Input B H D B1 Output H D A1 H D A2 H D A3 D B2 H D B1 H D A1 H D A2 H D B2 H D A4 H This method of multiplexing uses ‘explicit addressing’ D A3 H D A4 What a SAIN flow can look like: The size of each cellet is fixed for a given link in which a frame occurs The duration of an Epoch can depend on the desired end-to-end network delay of all embedded packets This method of multiplexing uses ‘implicit addressing’ where the position of each cellet defines its connection or channel identity 27 How SAIN works #2 A SAIN network contains simple network switches with a very different approach that uses very simple parts Generic Aggregation Switch Interconnecting Elements Generic Disaggregation Switch Basic Aggregation / Disaggregation Switch Pairs The ‘Interconnecting Elements’ are primarily made up of Aggregation Switch / Disaggregation Switch pairs that exist in three levels of aggregation Each tier contains Aggregation / Disaggregation Switch Pairs The three aggregation levels pass data use three network tiers plus an exchange tier to other networks and a virtual distribution sub-tier shown in the next slide 28 How SAIN works #3 Connections exist in an Entry/Exit E-Node tier that includes a virtual VE-Node subnetwork uses for traffic distribution Each E-Node connects large aggregations of connections within large channels to and from a parent Transit T-Node tier Each T-Node routes the aggregations of E-Node traffic for delivery from a Source T-Node to a Destination T-Node In addition to its T-Node tier routing functionality, a T-Node can connect to an eXchange X-Node that can have a channel to other X-Node domains including those that make up a global domain eXchange X-Nodes Transfer T-Nodes Entry/Exit E-Nodes Virtual Entry/Exit Nodes VE-Nodes 29 More Network Behavior Routing in the model network Each E-Node connects to a parent T-Node Each T-Node has full period connections to every other T-Node Each Source T-Node can set up a loop-less route through T-Nodes to every other T-Node Each route can be computed at network instantiation The computation begins with a table of single hops among the T-Nodes A second hop for each entry can be added for each second hop that does not include the first hop Repeat this process recursively for a two-hop table to build a three-hop table and continue for tables with more hops The process results in finding all routes that do not contain loops A 10-hop table has over 500,000 entries for all source to destination routes in a 20 T-Node model network The average number of routes for each of the 380 paired connections is about 1300 30 Today’s traffic is mostly flow-based, not bursty Today’s networks are based on early 1970’s needs: using minicomputers to send messages and transfer files Queuing theory provided solutions for an asynchronous stochastic world Today’s needs are circuit-based to satisfy a burgeoning market for flow-based traffic What is needed now is a network with synchronized nodes that support dynamic data rate connections 31 Another experiment Assumptions: 1. A financial trading firm wants to minimize its network delay 2. The smallest Ethernet frame is 84 bytes including a 46-byte payload 3. A SAIN network frame can have 5 bytes plus the 46-byte payload 4. In either case, a 1 Gb/s channel is carrying the data A SAIN 408-bit (51-byte) packet could be guaranteed delivery in one microsecond or less This compares to 672-bit (84-byte) Ethernet needing nearly one microsecond if there is no other traffic using the channel. Its delay is not guaranteed. There is a compelling requirement to control bandwidth, (and hence delivery time) to meet a customer’s need 32 A brief look at a basic principle that really matters 100 msec 10 msec 1.0 msec 100 μsec 1 4 4 1 1 1b 2 bit bit by 2 by by 8 by 6 b 32 b 64 b 28 it te s tes tes tes yte yte yte by s te s s s 64 s =4 10 μsec 15 6 +1 8b yte 18 90 =1 50 0 +1 00 =8 8b yte 98 s 2 +1 8b yte s s 1.0 μsec 100 nsec 10 nsec Delay vs. Data Rate An 8 x 8 orders of magnitude look at a key fundamental of data networking 1.0 nsec 1.0 kb/s 10 kb/s 100 kb/s 1.0 Mb/s 33 10 Mb/s 100 Mb/s 1.0 Gb/s 10 Gb/s 100 Gb/s Can we cover the earth with a SAIN network? The earth’s land mass area totals ~148,940,000 sq km. The area of each square within a 2 millisecond radius circle is ~320,000 sq km. Area of the Square ≈ 320,000 km² Area of the Circle ≈ 502,654 km² ~565.7 km 2 m se c op ~4 tica 00 l f km ibe r ra di us The number of supermetro networks needed to cover the land mass: 466 In the real world, sizes will likely be based on number of users and/or number of ports and market to determine a diameter 34