MIT OpenCourseWare http://ocw.mit.edu 6.033 Computer System Engineering Spring 2009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. L13: Sharing in network systems Dina Katabi Some slides are from lectures by Nick Mckeown, Ion Stoica, Frans Kaashoek, Hari Balakrishnan, Sam Madden, and Robert Morris Where is sharing happening? client server presentation Layer stub stub End-to-end layer RPC D Data Header Data RPC Header Network Layer H D H D D H H D D H Link Layer H This Lecture • Problems: • Sharing server • Sharing network • Solution: • Set the window size carefully • Sharing server: flow control • Sharing the network: congestin control Sliding Window Window Size Packets ACK’d Outstanding Packets OK Un-ack’d pkts to send Packets not OK to send yet • The window advances/slides upon the arrival of an ack • The sender sends only packets in the window • Receiver usually sends cumulative acks • i.e., receiver acks the next expected in-order packet window = 1-5 Src 1 2 3 4 5 x Rcvr p1 window = 2-6 Src 1 2 3 4 5 6 a2 x Rcvr p1 window = 2-6 Src 1 2 3 4 5 6 a2 x Rcvr p1 p3 a2 window = 2-6 Src 1 2 3 4 5 6 a2 Timeout 2 a2 a2 a2 a2 a7 x Rcvr p1 p3 p4 p5 p6 p2 In this example, the receiver sent cumulative acks, but the same behavior happens if the receiver acks the received sequence number What is the right window size? • The window limits how fast the sender sends • Two mechanisms control the window: • Flow control • Congestion control Flow Control • The receiver may be slow in processing the packets Æ receiver is a bottleneck • To prevent the sender form overwhelming the receiver, the receiver tells the sender the maximum number of packets it can buffer fwnd • Sender sets W ≤ fwnd How to set fwnd? Multiple applications run on the same machine but use different ports • Fwnd = B x RTT B1 B2 App App • Size of queue substitute for B • Adapts to • RTT changes • B changes • “self-pacing” TCP D P1 D D D P2 P1 P2 H H network Sharing the network How do you manage the resources in a huge system like the Internet, where users with different interests share the same resources? Difficult because of: • Size • Millions of users, links, routers • Heterogeneity • bandwidth: 9.6Kb/s (then modem, now cellular), 10 Tb/s • latency: 50us (LAN), 133ms (wired), 1s (satellite), 260s (Mars) Congestion S1 10Mb/s R1 S2 Why a problem? S1 S2 Sources are unaware of current state of resource Sources are unaware of each other Manifestations: D 100Mb/s Sources share links, and buffer space 2Mb/s Lost packets (buffer overflow at routers) Long delays (queuing in router buffers) Long delays may lead to retransmissions, which lead to more packets…. Danger: Congestion Collapse Increase in input traffic leads to decrease in useful work Throughput knee cliff Latency Congestion Collapse Input traffic Causes of Congestion Collapse Retransmissions introduce duplicate packets Duplicate packets consume resources wasting link capacity Example: old TCP implementations Fw n d1 2 d n Fw Long haul network (i.e., large RTT) Router drops some of TCP 2’s fwnd packets At timeout TCP 2 resends complete window Each discard packet will result in timeout Cumulative ACK, timeouts fire off at “same” time Blizzard of retransmissions can result in congestion collapse Insufficiently adaptive timeout algorithm made things worse What can be done in general? • Avoid congestion: • Increase network resources • But demands will increase too! • Admission Control & Scheduling • Used in telephone networks • Hard in the Internet because can’t model traffic well • Perhaps combined with Pricing • senders pay more in times of congestion • Congestion control: • Ask the sources to slow down; But how? • How do the sources learn of congestion? • What is the correct window? • How to adapt the window as the level of congestion changes? How do senders learn of congestion? Potential options: • Router sends a Source Quench to the sender • Router flags the packets indicating congestion • Router drops packets when congestion occurs • Sender learns about the drop because it notices the lack of ack • Sender adjusts window Case study: current TCP • Define a congestion control window cwnd • Sender’s window is set to W = min (fwnd, cwnd) • Simple heuristic to find cwnd: • Sender increases its cwnd slowly until it sees a drop • Upon a drop, sender decreases its cwnd quickly to react to congestion • Sender increases again slowly • No changes to protocol necessary! TCP Increase/decrease algorithm • AIMD: • Additive Increase Multiplicative Decrease • Every RTT: No drop: cwnd = cwnd + 1 drop: cwnd = cwnd /2 Additive Increase Src cwnd = 1 D Rcvr cwnd += 1 cwnd = 2 A D D cwnd = 3 A A D D D A A cwnd = 4 A TCP AIMD Cwnd Timeout because of a packet loss desired cwnd Grab capacity again Time Halve Cwnd Need the queue to absorb these saw-tooth oscillations TCP “Slow Start” How to set the initial cwnd? At the beginning of a connection, increase exponentially Src Every RTT, double cwnd 1 D 2 A D D 4 A A D D 8 D D A A Rcvr A A A A A A Timeout Slow Start + AIMD Cwnd Additive increase Multiplicative decrease Time Slow start Fairness? • No! • Applications don’t have to use TCP • Use multiple TCP connections Summary • Controlling complexity in network systems • Layering • Interesting division of labors based on E2E principle • Case study: Internet • Interesting problems and techniques • Packets • Protocols • … • Client-server implementation • Next: Application-level reliability and security