15-441 Computer Networking Lecture 7 - Ethernet Peter Steenkiste Departments of Computer Science and Electrical and Computer Engineering 15-441 Networking, Spring 2008 http://www.cs.cmu.edu/~dga/15-441/S08 1 Problem: Sharing a Wire Learned how to connect hosts … But what if we want more hosts? Wires for everybody! Switches Expensive! How can we share a wire? 2 Listen and Talk yak yak… Natural scheme – listen before you talk… »Works well in practice 3 3 Listen and Talk yada yada… Natural scheme – listen before you talk… »Works well in practice 4 4 Listen and Talk yada yak yak… yada… Natural scheme – listen before you talk… »Works well in practice But sometimes this breaks down »Why? How do we fix/prevent this? 5 5 Problem: Who is this packet for? Need to put an address on the packet What should it look like? How do you determine your own address? How do you know what address you want to send it to? 6 6 Outline Aloha Ethernet MAC Collisions Ethernet Frames 7 7 Random Access Protocols When node has packet to send » Transmit at full channel data rate R » No a priori coordination among nodes Two or more transmitting nodes “collision” Random access MAC protocol specifies: » How to detect collisions » How to recover from collisions (e.g., via delayed retransmissions) Examples of random access MAC protocols: » Slotted ALOHA and ALOHA » CSMA and CSMA/CD 8 8 Aloha – Basic Technique First random MAC developed » For radio-based communication in Hawaii (1970) Basic idea: » When you are ready, transmit » Receivers send ACK for data » Detect collisions by timing out for ACK » Recover from collision by trying after random delay – Too short large number of collisions – Too long underutilization 9 9 Slotted Aloha Time is divided into equal size slots » Equal to packet transmission time Node (w/ packet) transmits at beginning of next slot If collision: retransmit pkt in future slots with probability p, until successful Success (S), Collision (C), Empty (E) slots 10 10 Pure (Unslotted) ALOHA Unslotted Aloha: simpler, no synchronization Pkt needs transmission: » Send without awaiting for beginning of slot Collision probability increases: » Pkt sent at t0 collide with other pkts sent in [t0-1, t0+1] 11 11 Outline Aloha Ethernet MAC Collisions Ethernet Frames 12 12 Ethernet First practical local area network, built at Xerox PARC in 70’s “Dominant” LAN technology: » Cheap » Kept up with speed race: 10, 100, 1000 Mbps Metcalfe’s Ethernet sketch 13 13 Ethernet MAC – Carrier Sense Basic idea: » Listen to wire before transmission » Avoid collision with active transmission Why didn’t ALOHA have this? » In wireless, relevant contention at the receiver, not sender Hidden Exposed St.Louis NY Chicago CMU CMU Chicago NY – Hidden terminal – Exposed terminal 14 14 Ethernet MAC – Collision Detection But: ALOHA has collision detection also? » That was very slow and inefficient Basic idea: » Listen while transmitting » If you notice interference assume collision Why didn’t ALOHA have this? » Very difficult for radios to listen and transmit » Signal strength is reduced by distance for radio – Much easier to hear “local, powerful” radio station than one in NY – You may not notice any “interference” 15 15 Ethernet MAC (CSMA/CD) Carrier Sense Multiple Access/Collision Detection Packet? No Sense Carrier Send Detect Collision Yes Discard Packet attempts < 16 Jam channel b=CalcBackoff(); wait(b); attempts++; attempts == 16 16 16 Ethernet CSMA/CD: Making it word Jam Signal: make sure all other transmitters are aware of collision; 48 bits; Exponential Backoff: If deterministic delay after collision, collision will occur again in lockstep Why not random delay with fixed mean? » Few senders needless waiting » Too many senders too many collisions Goal: adapt retransmission attempts to estimated current load » heavy load: random wait will be longer 17 17 Ethernet Backoff Calculation Exponentially increasing random delay »Infer senders from # of collisions »More senders increase wait time First collision: choose K from {0,1}; delay is K x 512 bit transmission times After second collision: choose K from {0,1,2,3}… After ten or more collisions, choose K from {0,1,2,3,4,…,1023} 18 18 Outline Aloha Ethernet MAC Collisions Ethernet Frames 19 19 Collisions B C Time A 20 20 Minimum Packet Size What if two people sent really small packets » How do you find collision? 21 21 Ethernet Collision Detect Min packet length > 2x max prop delay »If A, B are at opposite sides of link, and B starts one link prop delay after A Jam network for 32-48 bits after collision, then stop sending »Ensures that everyone notices collision 22 22 End to End Delay c in cable = 60% * c in vacuum = 1.8 x 10^8 m/s Modern 10Mb Ethernet { » 2.5km, 10Mbps » ~= 12.5us delay » +Introduced repeaters (max 5 segments) » Worst case – 51.2us round trip time! Slot time = 51.2us = 512bits in flight » After this amount, sender is guaranteed sole access to link » 51.2us = slot time for backoff 23 23 Packet Size What about scaling? 3Mbit, 100Mbit, 1Gbit... » Original 3Mbit Ethernet did not have minimum packet size bonus question! – Max length = 1Km and No repeaters » For higher speeds must make network smaller, minimum packet size larger or both What about a maximum packet size? » Needed to prevent node from hogging the network » 1500 bytes in Ethernet 24 24 10BaseT and 100BaseT 10/100 Mbps rate; latter called “fast ethernet” T stands for Twisted Pair (wiring) Minimum packet size requirement » Make network smaller solution for 100BaseT 25 25 Gbit Ethernet Minimum packet size requirement » Make network smaller? – 512bits @ 1Gbps = 512ns – 512ns * 1.8 * 10^8 = 92meters = too small !! » Make min pkt size larger! – Gigabit Ethernet uses collision extension for small pkts and backward compatibility Maximum packet size requirement » 1500 bytes is not really “hogging” the network » Defines “jumbo frames” (9000 bytes) for higher efficiency 26 26 Outline Aloha Ethernet MAC Collisions Ethernet Frames 27 27 Ethernet Frame Structure Sending adapter encapsulates IP datagram (or other network layer protocol packet) in Ethernet frame 28 28 Ethernet Frame Structure (cont.) Preamble: 8 bytes »101010…1011 »Used to synchronize receiver, sender clock rates CRC: 4 bytes »Checked at receiver, if error is detected, the frame is simply dropped 29 29 Ethernet Frame Structure (cont.) Each protocol layer needs to provide some hooks to upper layer protocols » Demultiplexing: identify which upper layer protocol packet belongs to » E.g., port numbers allow TCP/UDP to identify target application » Ethernet uses Type field Type: 2 bytes » Indicates the higher layer protocol, mostly IP but others may be supported such as Novell IPX and AppleTalk) 30 30 Addressing Alternatives Broadcast all nodes receive all packets » Addressing determines which packets are kept and which are packets are thrown away » Packets can be sent to: – Unicast – one destination – Multicast – group of nodes (e.g. “everyone playing Quake”) – Broadcast – everybody on wire Dynamic addresses (e.g. Appletalk) » Pick an address at random » Broadcast “is anyone using address XX?” » If yes, repeat Static address (e.g. Ethernet) 31 31 Ethernet Frame Structure (cont.) Addresses: 6 bytes » Each adapter is given a globally unique address at manufacturing time – Address space is allocated to manufacturers 24 bits identify manufacturer E.g., 0:0:15:* 3com adapter – Frame is received by all adapters on a LAN and dropped if address does not match » Special addresses – Broadcast – FF:FF:FF:FF:FF:FF is “everybody” – Range of addresses allocated to multicast Adapter maintains list of multicast groups node is interested in 32 32 Why Did Ethernet Win? Failure modes » Token rings – network unusable » Ethernet – node detached Good performance in common case » Deals well with bursty traffic » Usually used at low load Volume lower cost higher volume …. Adaptable » To higher bandwidths (vs. FDDI) » To switching (vs. ATM) Easy incremental deployment Cheap cabling, etc 33 33 And .. It is Easy to Manage You plug in the host and it basically works » No configuration at the datalink layer » Today: may need to deal with security Protocol is fully distributed Broadcast-based. » In part explains the easy management » Some of the LAN protocols (e.g. ARP) rely on broadcast – Networking would be harder without ARP » Not having natural broadcast capabilities adds complexity to a LAN – Example: ATM 34 34 Summary CSMA/CD carrier sense multiple access with collision detection » Why do we need exponential backoff? » Why does collision happen? » Why do we need a minimum packet size? – How does this scale with speed? Ethernet » What is the purpose of different header fields? » What do Ethernet addresses look like? What are some alternatives to Ethernet design? 37 37 Lecture 6 Datalink – Framing, Switching Peter Steenkiste Departments of Computer Science and Electrical and Computer Engineering Carnegie Mellon University 15-441 Networking, Spring 2008 http://www.cs.cmu.edu/~dga/15-441/S08 45 From Signals to Packets Analog Signal “Digital” Signal Bit Stream Packets 0 0 1 0 1 1 1 0 0 0 1 0100010101011100101010101011101110000001111010101110101010101101011010111001 Header/Body Packet Transmission Sender Header/Body Header/Body Receiver 46 Datalink Functions Framing: encapsulating a network layer datagram into a bit stream. » Add header, mark and detect frame boundaries, … Media access: controlling which frame should be sent over the link next. » Easy for point-to-point links; half versus full duplex » Harder for multi-access links: who gets to send? Error control: error detection and correction to deal with bit errors. » May also include other reliability support, e.g. retransmission Flow control: avoid that the sender outruns the receiver. 47 Datalink Lectures Framing and error coding. Datalink architectures. Switch-based networks. » Packet forwarding » Flow and error control Taking turn protocols. Contention-based networks: basic Ethernet. Ethernet bridging and switching. Connectivity to the home. Circuit-based communication 48 Framing A link layer function, defining which bits have which function. Minimal functionality: mark the beginning and end of packets (or frames). Some techniques: » » » » out of band delimiters (e.g. FDDI 4B/5B control symbols) frame delimiter characters with character stuffing frame delimiter codes with bit stuffing synchronous transmission (e.g. SONET) 49 Character and Bit Stuffing Mark frames with special character. » What happens when the user sends this character? » Use escape character when controls appear in data: *abc*def -> *abc\*def » Very common on serial lines, in editors, etc. Mark frames with special bit sequence » must ensure data containing this sequence can be transmitted » example: suppose 11111111 is a special sequence. » transmitter inserts a 0 when this appears in the data: » 11111111 -> 111111101 » must stuff a zero any time seven 1s appear: » 11111110 -> 111111100 » receiver unstuffs. 50 Example: Ethernet Framing preamble datagram length more stuff Preamble is 7 bytes of 10101010 (5 MHz square wave) followed by one byte of 10101011 Allows receivers to recognize start of transmission after idle channel 51 SONET SONET is the Synchronous Optical Network standard for data transport over optical fiber. One of the design goals was to be backwards compatible with many older telco standards. Beside minimal framing functionality, it provides many other functions: » operation, administration and maintenance (OAM) communications » synchronization » multiplexing of lower rate signals » multiplexing for higher rates 52 Standardization History Process was started by divestiture in 1984. » Multiple telephone companies building their own infrastructure SONET concepts originally developed by Bellcore. First standardized by ANSI T1X1 group for the US. Later picked up by CCITT and developed its own version. SONET/SDH standards approved in 1988. 53 A Word about Data Rates Bandwidth of telephone channel is under 4KHz, so when digitizing: 8000 samples/sec * 8 bits = 64Kbits/second Common data rates supported by telcos in North America: » Modem: rate improved over the years » T1/DS1: 24 voice channels plus 1 bit per sample (24 * 8 + 1) * 8000 = 1.544 Mbits/second » T3/DS3: 28 T1 channels: 7 * 4 * 1.544 = 44.736 Mbits/second 54 Synchronous Data Transfer Sender and receiver are always synchronized. » Frame boundaries are recognized based on the clock » No need to continuously look for special bit sequences SONET frames contain room for control and data. » Data frame multiplexes bytes from many users » Control provides information on data, management, … 3 cols transport overhead 87 cols payload capacity 9 rows 55 SONET Framing Base channel is STS-1 (Synchronous Transport System). » Takes 125 sec and corresponds to 51.84 Mbps » 1 byte/frame corresponds to a 64 Kbs channel (voice) » Transmitted on an OC-1 optical carrier (fiber link) Standard ways of supporting slower and faster channels. » Support both old standards and future (higher) data rates Actual payload frame “floats” in the synchronous frame. » Clocks on individual links do not have to be synchronized 3 cols transport overhead 87 cols payload capacity, including 1 col path overhead 9 rows 56 How Do We Support Lower Rates? 125 sec 1 Byte in every consecutive frame corresponds to a 64 Kbit/second channel. » 1 voice call. 125 sec Higher bandwidth channels hold more bytes per frame. » Multiples of 64 Kbit/second Channels have a “telecom” flavor. 125 sec » Fixed bandwidth » Just data – no headers » SONET multiplexers remember how bytes on one link should be mapped to bytes on the next link – Byte 33 on incoming link 1 is byte 97 on outgoing link 7 57 How Do We Support Higher Rates? 125 sec Send multiple frames in a 125 sec time slot. The properties of a channel using a single byte/ST-1 frame are maintained! 125 sec » Constant 64 Kbit/second rate » Nice spacing of the byte samples 125 sec Rates typically go up by a factor of 4. Two ways of doing interleaving. » Frame interleaving » Column interleaving – concatenated version, i.e. OC-3c 58 The SONET Signal Hierarchy STS-1 carries one DS-3 plus overhead Signal Type line rate # of DS0 DS0 (POTS) DS1 DS3 64 Kbs 1.544 Mbs 44.736 Mbs 1 24 672 OC-1 OC-3 OC-12 STS-48 STS-192 51.84 Mbs 155 Mbs 622 Mbs 2.49 Gbs 9.95 Gbs 672 2,016 8,064 32,256 129,024 STS-768 39.8 Gbs 516,096 59 Using SONET in Networks Add-drop capability allows soft configuration of networks, usually managed manually. OC-48 mux DS1 OC-3c mux OC-12c mux 60 Self-Healing SONET Rings OC-48 mux mux mux mux DS1 OC-3c OC-12c 61 SONET as Physical Layer OC3/12 Access OC3/12 Access CO OC12/48 Metro CO POP OC3/12 Access CO CO OC3/12 Access OC12/48 Metro CO WDM Backbone OC48/192 POP OC3/12 Access POP OC3/12 Access CO OC12/48 Metro CO OC3/12 Access 62 Error Coding Transmission process may introduce errors into a message. » Single bit errors versus burst errors Detection: » Requires a convention that some messages are invalid » Hence requires extra bits » An (n,k) code has codewords of n bits with k data bits and r = (n-k) redundant check bits Correction » Forward error correction: many related code words map to the same data word » Detect errors and retry transmission 63 Basic Concept: Hamming Distance Hamming distance of two bit strings = number of bit positions in which they differ. If the valid words of a code have minimum Hamming distance D, then D-1 bit errors can be detected. If the valid words of a code have minimum Hamming distance D, then [(D-1)/2] bit errors can be corrected. 1 0 1 1 0 1 1 0 1 0 HD=2 HD=3 64 Examples A (4,3) parity code has D=2: 0001 0010 0100 0111 1000 1011 1101 1110 (last bit is binary sum of previous 3, inverted - “odd parity”) A (7,4) code with D=3 (2ED, 1EC): 0000000 0100011 1000110 1100101 0001101 0101110 1001011 1101000 0010111 0110100 1010001 1110010 0011010 0111001 1011100 1111111 1001111 corrects to 1001011 Note the inherent risk in correction; consider a 2-bit error resulting in 1001011 -> 1111011. There are formulas to calculate the number of extra bits that are needed for a certain D. 65 Cyclic Redundancy Codes (CRC) Commonly used codes that have good error detection properties. » Can catch many error combinations with a small number or redundant bits Based on division of polynomials. » Errors can be viewed as adding terms to the polynomial » Should be unlikely that the division will still work Can be implemented very efficiently in hardware. Examples: » CRC-32: Ethernet » CRC-8, CRC-10, CRC-32: ATM 66 Datalink Architectures Packet forwarding. Error and flow control. Media access control. Scalability. 67 Media Access Control How do we transfer packets between two hosts connected to the same network? Switches connected by point-to-point links -store-and-forward. » Used in WAN, LAN, and for home connections » Conceptually similar to “routing” – But at the datalink layer instead of the network layer » Today Multiple access networks -- contention based. » » » » Multiple hosts are sharing the same transmission medium Used in LANs and wireless Need to control access to the medium Mostly Thursday lecture 68 A Switch-based Network Switches are connected by point-point links. Packets are forwarded hop-by-hop by the switches towards the destination. » Forwarding is based on the address How does a switch work? How do nodes exchange packets over a link? How is the destination addressed? Switch PCs at Work Point-Point link PC at Home 69 Switching Introduction Idea: forward units of data based on address in header. Many data-link technologies use switching. » Virtual circuits: Frame Relay, ATM, X.25, .. » Packets: Ethernet, MPLS, … “Switching” also happens at the network layer. » » » » Layer 3: Internet protocol In this case, address is an IP address IP over SONET, IP over ATM, .. Otherwise, operation is very similar Switching is different from SONET mux/demux. » SONET channels statically configured - no addresses 70 An Inter-network Host Host Host Host Ethernet Host Host Wireless Ethernet Framerelay Host Host Host Host Ethernet ATM Host IP/SONET Host 802.X Host Host Host 71 Internetworking Options 7 6 5 4 3 2 1 physical 1 7 6 5 4 3 2 1 repeater 7 6 5 4 3 2 1 network 3 2 2 1 1 router 7 6 5 4 3 2 1 data link 2 1 1 7 6 5 4 3 2 1 Switching/bridging (e.g. 802 MAC) 7 6 5 4 3 2 1 7 6 5 4 3 2 1 ... 3 2 1 3 2 1 7 6 5 4 3 2 1 gateway 72 Switch Architecture Takes in packets in one interface and has to forward them to an output interface based on the address. » A big intersection » Same idea for bridges, switches, routers: address look up differs Control processor manages the switch and executes higher level protocols. » E.g. routing, management, .. The switch fabric directs the traffic to the right output port. The input and output ports deal with transmission and reception of packets. Control Processor Input Port Output Port Output Port Input Port Switch Fabric Output Port Input Port Output Port Input Port 73 Packet Forwarding: Address Lookup Switch Address Next Hop B31123812508 38913C3C2137 A21023C90590 3 3 0 Info 13 - » Absolute address (e.g. Ethernet) » (IP address for routers) » (VC identifier, e.g. ATM)) 128.2.15.3 1 (2,34) Address from header. Next hop: output port for packet. Info: priority, VC id, .. Table is filled in by routing protocol. 74 Link Flow Control and Error Control Naïve protocol. Dealing with receiver overflow: flow control. Dealing with packet loss and corruption: error control. Meta-comment: these issues are relevant at many layers. » Link layer: sender and receiver attached to the same “wire” » End-to-end: transmission control protocol (TCP) - sender and receiver are the end points of a connection How can we implement flow control? » “You may send” (windows, stop-and-wait, etc.) » “Please shut up” (source quench, 802.3x pause frames, etc.) » Where are each of these appropriate? 75 A Naïve Protocol Sender simply sends to the receiver whenever it has packets. Potential problem: sender can outrun the receiver. » Receiver too slow, buffer overflow, .. Not always a problem: receiver might be fast enough. Sender Receiver 76 Adding Flow Control Stop and wait flow control: sender waits to send the next packet until the previous packet has been acknowledged by the receiver. » Receiver can pace the receiver Drawbacks: adds overheads, slowdown for long links. Sender Receiver 77 Window Flow Control Stop and wait flow control results in poor throughput for long-delay paths: packet size/ roundtrip-time. Solution: receiver provides sender with a window that it can fill with packets. » The window is backed up by buffer space on receiver » Receiver acknowledges the a packet every time a packet is consumed and a buffer is freed Sender Receiver 78 Bandwidth-Delay Product RTT Sender Receiver Time Max Throughput = Window Size Roundtrip Time 79 Dealing with Errors Stop and Wait Case Packets can get lost, corrupted, or duplicated. » Error detection or correction turns corrupted packet in lost or correct packet Duplicate packet: use sequence numbers. Lost packet: time outs and acknowledgements. » Positive versus negative acknowledgements » Sender side versus receiver side timeouts Window based flow control: more aggressive use of sequence numbers (see transport lectures). Sender Receiver 80 What is Used in Practice? No flow or error control. » E.g. regular Ethernet, just uses CRC for error detection Flow control only. » E.g. Gigabit Ethernet Flow and error control. » E.g. X.25 (older connection-based service at 64 Kbs that guarantees reliable in order delivery of data) 81 Datalink Layer Architectures Packet forwarding. Error and flow control. Media access control. Scalability. 82 Datalink Classification Datalink Switch-based Virtual Circuits Packet Switching ATM, framerelay Bridged LANs Multiple Access Scheduled Access Random Access Token ring, Ethernet, FDDI, 802.11 802.11, Aloha 83 83 Multiple Access Protocols Prevent two or more nodes from transmitting at the same time over a broadcast channel. » If they do, we have a collision, and receivers will not be able to interpret the signal Several classes of multiple access protocols. » Partitioning the channel, e.g. frequency-division or time division multiplexing – With fixed partitioning of bandwidth – – Not flexible; inefficient for bursty traffic » Taking turns, e.g. token-based, reservation-based protocols, polling based » Contention based protocols, e.g. Aloha, Ethernet – Next lecture 84 Fiber Distributed Data Interface (FDDI) One token holder may send, with a time limit » Provides known upper bound on delay. Optical version of 802.5 token ring, but multiple packets may travel in train: token released at end of frame 100 Mbps, 100km Optional dual ring for fault tolerance Concerns: » Token overhead » Latency » Single point of failure 85 Other “Taking Turn” Protocols Central entity polls stations, inviting them to transmit » Simple design – no conflicts » Not very efficient – overhead of polling operation » Example: the “Point Control Function” mode for 802.11 Stations reserve a slot for transmission. » For example, break up the transmission time in contention-based and reservation based slots – Contention based slots can be used for short messages or to reserve time slots – Communication in reservation based slots only allowed after a reservation is made » Issues: fairness, efficiency 86 MAC Protocols - Discussion Channel partitioning MAC protocols: » Share channel efficiently at high load » Inefficient at low load: delay in channel access, 1/N bandwidth allocated even if only 1 active node! “Taking turns” protocols » More flexible bandwidth allocation, but » Protocol can introduce unnecessary overhead and access delay at low load Random access MAC protocols (next lecture) » Efficient at low load: single node can fully utilize channel » High load: collision overhead Lecture 7: 9-19-06 87 87