Lecture 5 Group Communication Point to Multipoint Communication • Single source, Multiple Destinations • Broadcast – 1 2 All nodes in the network are destinations • Multicast – Some nodes in the network are destinations • Only one copy of the information travels along common edges Message replication along forking points only. • Normally, route of a session is a tree – A tree is a portion of the graph without a cycle (acylcic subgraph) • A spanning tree is a tree which has all vertices of the graph • There may be multiple spanning trees • We need to choose the minimum weight tree for broadcast. 4 5 5 4 Blue edge spanning tree is the minimum weight spanning tree Properties of a Tree • A tree has V-1 edges • There exists a unique path between any two vertices of a tree. • Adding any edge to a tree creates a unique cycle. Breaking any edge on this cycle restores a tree. Minimum Spanning Tree Construction • We maintain a set of edges A, which is initially empty. • Edges are added to A one at a time such that finally A becomes a minimum spanning tree. • Edges are never removed from A. • So ``safe’’ edges must be added to A, i.e., at any stage A must be a part of a spanning tree. Safe Edge Addition • Consider a cut in a graph (a cut consists of 2 sets which partition the vertex set). • A cut respects set A if no edge in A crosses the cut. • A minimum weight edge crossing a cut is denoted a light weight edge in the cut. • Let A be a subset of a MST (minimum weight spanning tree). • Let (S, V – S) be any cut that respects A • Let edge (u, v) be a light edge crossing the cut • Then A (u, v) is subset of a MST Assume that all the edge weights are distinct. Let no MST containing A contain edge (u, v). Consider a MST T containing A Add the edge (u, v) to T Since (u, v) is not in T, T (u, v) contains a cycle, and (u, v) is in the cycle. Edge (u, v) are in the opposite sides of the cut. Since any cycle must cross the cut even number of times, there exists at least one other edge (x, y) crossing the cut. Clearly, w(x, y) > w(u, v). The edge (x, y) is not in A because (x, y) crosses the cut, and the cut respects A. Removing (x, y) from the cycle, breaks the cycle and hence creates a spanning tree, T’, s.t. T’ = T (u, v) – (x, y) w(T’) = w(T) + w(u, v) – w(x, y) w(T) (as w(u, v) < w(x, y)) This contradicts the fact that T is a MST. • So we always find a cut that respects A, • And add a light edge across the cut to A. • Kruskals and Prims algorithms find the cut differently. Kruskals Algorithm • 1. 2. 3. A= For each vertex u in V, Create_Set(u) Sort E in increasing order by weight w For each edge (u,v) in the sorted list – If Set(u)= Set(v) • • • Add (u,v) to E Union Set(u) and Set(v) Return A; Complexity Analysis • The operations create set, testing whether set(u) == set(v), union operations can be done in log V operations, using Union find data structure. • Step 1 can be done in Vlog V • Step 2 can be done in Elog E • Step 3 can be done in Elog V • Overall complexity is O(Vlog V + Elog E + Elog V) or O((V + E)log V) 1 7 7 8 2 0 2 5 2 5 1 2 2 7 8 1 2 0 2 1 7 2 0 5 5 7 5 5 8 2 0 0 1 2 8 5 5 8 5 5 Prims Algorithm • Maintains a set of vertices S already in the spanning tree. • Initially, S consists of one vertex r, selected arbitrarily. • For every vertex u in V – S, maintain the weight of the lightest edge between u and any vertex in S. • If there is no edge between u and S, then this weight associated with u is infinity. • Add the vertex with the least weight to S. • This is in effect adding the light weight edge crossing the cut S and V-S. • Whenever a vertex is added to S, the weights of all its neighbors are reduced, if necessary. Pseudo-Code S= For each u in V, key[u] = Key[r] = 0 Pred[r] = NULL While V = S u = Extract_Min(V-S) For each (v in Adj(u)) if (v not in S) key(v) = min(w(u, v), key(v)) and pred(v) = u Add u in S For each v in Adj[u]…. can be done in E complexity Rest of the loop can be done in V2 complexity So, overall O(V2) Using heaps we can solve in O((V + E)logV) Example 1 7 s 8 3 0 2 5 3 1 s 0 7 2 5 2 5 s 3 0 2 5 2 s 5 8 3 1 5 3 1 7 1 3 8 7 1 7 3 0 2 2 5 8 3 5 2 2 5 5 1 3 1 1 3 8 5 5 5 8 5 s 0 7 8 5 5 s 7 0 2 2 Multicast • Destination for a session is a subset of the node set • Need to find a minimum weight tree spanning the source and the destinations • Such a minimum weight tree is a steiner tree • This is a NP-complete problem • We first consider the following decision problem. • Given a multicast group, find whether there is a tree spanning the group of weight less than a certain amount B • Clearly the problem is NP • We need to show that the problem is NPhard Reduction We will show that a known NP-hard problem can be solved in polynomial complexity if the steiner decision problem can be solved in polynomial complexity Exact cover by 3-sets is NP-hard X = {x1, x2,……, x3p}, C={C1, C2,….. Cq}, Ci X , | |Ci|=3, i=1,…..q Is it possible to select some mutually disjoint subsets from C such that their union is X? This decision problem is NP-hard We will show that if we have a subroutine to find out in polynomial complexity whether a graph G has a tree of weight less than 4p, then we can use the subroutine to answer whether there is an exact 3-cover for the above problem in polynomial complexity. Construct a graph with vertices v, C1, C2,….. Cq , x1, x2,……, x3p Multicast group has source v and destinations x1, x2,……, x3p There is an edge from v to each of C1, C2,….. Cq , and from Ci to xj if xj is in Ci Every edge has weight 1 C1 v C2 C3 C4 x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 C1 = {x1 x2 x3}, C2 = {x4 x5 x8}, C3 = {x6x9 x10 }, C4 = {x3x7x10} Let there be an exact 3-cover Clearly it uses p Ci s, C1, C2, …..Cp (wlog) The tree consisting of edges (v, C1), (v, C2), ….. (v, Cp), and the edges from Ci to its constituents is a tree spanning the multicast group. This tree has weight 4p So, if there is an exact 3-cover, then there is a steiner tree of weight 4p Let there be not be an exact 3-cover. Consider the minimum weight multicast tree connecting the source and the destinations. Since the tree must span x1, x2,……, x3p , the intermediate vertices Ci s in the tree must represent overlapping sets, else these sets will constitute an exact 3-cover. Also, none of these intermediate vertices have degree more than 3. It follows that there are at least p + 1 of them. Thus the weight of the tree is at least 3p + p + 1 or 4p + 1 So there is no tree of weight 4p or less. Thus the set has an exact 3-cover iff there is a tree of weight 4p or less. So, if we have a subroutine to find out in polynomial time whether any graph has steiner tree of weight B or less for any B, then we can use it to decide whether a set has an exact 3-cover or not in polynomial time. Hence, the steiner tree problem is NP-hard. Approximation Algorithms Approximation algorithms are used for providing approximate solutions to NP-hard problems in polynomial complexity. Suppose, we are trying to minimize a quantity. An approximation algorithm is one which guarantees a solution which is at most k times the minimum value, where k is a constant. This constant is the approximation ratio. An approximation algorithm for Steiner Tree Start with any vertex in the multicast group. Choose another vertex in the multicast group, which is closest to the first vertex and add the shortest path between the two to the tree (which is now empty). Choose that vertex of the multicast group which is closest to the tree (distance between a vertex and a tree is the minimum distance between the vertex and a node in the tree). Augment the tree by the shortest path between the vertex and the tree. Repeat this procedure till all vertices of the multicast group are in the tree. The resulting tree has weight at most (2 – 2/n) times the minimum weight tree, where n is the number of vertices in the tree. Intuitive Argument Consider a walk around the steiner tree travelling every edge twice. Weight of the walk is 2 times that of the steiner tree Remove the longest terminal terminal path z4 z2 z5 z7 z6 z1 z3 z8 Weight of the longest terminal to terminal path is at least 2/n the weight of the steiner tree, n is the number of members in the multicast group (source + destination) Weight of the remaining portion of the walk is at most (2 – 2/n) the weight of the steiner tree. Weight of the approximate tree is at most the weight of the remaining part of the tree It is possible to partition this portion into disjoint paths Lk, connecting terminals of index less than k to terminals of index greater than or equal to k Clearly, Lk is not less than the shortest path to index k. Given any , there exists a graph for which the algorithm will give a tree greater than 2 - times the weight of the steiner tree. Counter Example 1+ 1+ 1+ 2 1+ 2 2 Green vertices are in the multicast group The algorithm chooses the red tree and hence has n – 1 edges of weight 2. Total weight is 2n - 2 The optimal tree consists of only the 1 + edges, and hence has weight n(1 + ) As goes to 0, the ratio goes to 2-2/n, greater than 2 - for sufficiently small . There are approximation algorithms which give an approximation ratio of 11/6 or 16/9 However, the approximation ratio is a worst case ratio. Generally, the minimum path heuristic performs much better than the approximation ratio of 2. ``The Steiner Tree Problem’’, F. Hwang, D. Richards and P. Winter Internet Multicast Routing • Multicast groups – All multicast receivers constitute the multicast group • Every group has a unique IP address • Source sends packets to the group address • Multicast capable routers forward on appropriate links • Receivers join and leave groups dynamically • Source may not know the constitution of the group Multicast Routing Protocols • Source based Protocols – Multicast trees consist of shortest paths between source and the receivers • Distance vector multicast routing protocols(DVMRP) – Shortest paths are constructed based on unicast distance vector algorithms – S. Deering, C. Patridge, and D. Waitzman, ``Distance Vector Multicast Routing Protocol’’, RFC 1075, Nov. 1988 • Multicast open shortest path protocols (MOSPF) – Shortest paths are constructed based on link state protocols – J. Moy, ``OSFP version 2’’, Internet RFC 1247, p. 189, July 1991 Group Shared Trees • Source based trees suffer from scalability problem. • A multicast group may consist of several members. • Each member may potentially be a source and a receiver (video teleconferencing) • Intermediate routers need to maintain separate trees for every source (O(n) memory, where n is the number of members in the group) • Instead, there can be a single tree per group (group shared tree) • There is a core node in the tree • Every source sends its packet towards the core node. • The core node sends the packets towards all the receivers. • The core node uses shortest paths to all the receivers. • Memory scales with number of groups and not ``that much’’ with the number of group members. Group Shared Protocols • Core Based Trees – T. Ballardie, ``Core Based tree (CBT) multicast: Architectural Overview and specification,’’ Internet Draft RFC, July 1994 • Protocol Independent Multicast (PIM) – S. Deering, D. Estrin, D. Farinacci, V. Jacobson, C. Liu and L. Wei, ``Protocol Independent Multicast (PIM): Protocol Specification,’’ Internet Draft, RFC, Jan 11, 1995 • Group shared trees are scalable, but they suffer from congestion because the entire traffic is concentrated in the core node, and the protocol stops functioning if the core fails. • PIM switches between group shared and source based trees on an ``on demand’’ basis • PIM starts with group shared trees. • Particular receivers may request source specific trees • In the event of request, PIM switches to source based trees for the requesting receivers. • Both PIM and CBT are ``protocol independent’’ as they can acquire distance information from any underlying unicast protocol, e.g., OSPF, Distance vector protocols etc. Multicast specific Issues • Heterogeneity – Different receivers of the same session have different QoS requirements and different path bandwidths • Scalability – Routing – Reliable Multicast – Multicast Security • • Varying path capacities, e.g. Varying receiver processing capabilities u1 ISDN (128 kbps) T3 (45 Mbps) v Ethernet (100Mbps) T3 (45 Mbps) Modem (28.8 kbps) u2 u3 u4 A single rate of transmission for all members of the session is not sufficient Multirate Transmission • Normally used for loss tolerant real traffic like audio, video transmission. • Different receivers of the same session can receive different rates • Transmission rate of a session in a link is the maximum of the rates of the receivers downstream • Layered transmission of loss tolerant traffic in the internet – Source encodes signal into multiple layers. – Lowest layer gives basic information. – Reception of subsequent layers gives better reception quality. Multirate Multicast Transmission r = max (r3,, r4) r3 S r1 r4 = max (r1,, r2) r2 Fractional layer reception meaningful: Continuous bandwidth reception L4 L3 L2 L1 r3 r2 r1 Multirate Multicast Transmission r = max (r3,, r4) r3 S r1 r4 = max (r1,, r2) r2 Fractional layer reception not meaningful: Discrete bandwidth reception L4 L3 L2 L1 r3 r2 r1 NETWORK MODEL • N sessions • One predetermined route for each session • M virtual sessions – Every receiver of every session is a distinct virtual session u1 ISDN Ethernet (100Mbps) (128 kbps) T3 (45 Mbps) v T3 (45 Mbps) Modem (28.8 kbps) u2 u3 u4 Virtual sessions: (v, u1), (v,u2), (v,u3), (v,u4) • Minimum rate requirements of virtual sessions – QOS requirements • Maximum rate constraints of virtual sessions Feasible Set • rji is the rate allocated to the jth virtual session of the ith session • Rate allocation vector: (r11,….rt1,....,r1N,…ruN) 1,….r 1,....,r N,…r N): = {(r 1 t 1 u • Feasible Set: max i: session i traverses link l m r i i j j r ji Cl j: virtual session j of session i traverses link l p i j (Capacity Constraint) (minimum, maximum rate constraint) Good operating point in the feasible region Capacity Constraint For Unicast Case ri Cl i: session i traverses link l (Capacity Constraint)