Lirong_TAN

1. Give and verify a linear time algorithm that takes two sequences of events (say encoded as lists of binary integers), and determines whether the first sequence of events is a subsequence of the second. Problem formulation: Two sequences: 𝐴 and 𝐵, with length of 𝑚 and 𝑛, respectively. Test whether 𝐴 is a subsequence of 𝐵. Idea: Use two pointers, 𝑝1 for sequence 𝐴, 𝑝2 for sequence 𝐵. Compare 𝐴[𝑝1] with 𝐵[𝑝2]: If 𝐴[𝑝1] = 𝐵[𝑝2], move both pointers forward, which is 𝑝1 = 𝑝1 + 1, 𝑝2 = 𝑝2 + 1 If 𝐴[𝑝1] ≠ 𝐵[𝑝2], only move 𝑝2 forward, which is 𝑝2 = 𝑝2 + 1. In this way, 𝑝1 traverses 𝐴 once, and 𝑝2 traverses 𝐵 once. The time complexity is 𝑂(𝑚 + 𝑛) Algorithm: Verification: This is a greedy algorithm. We claim that greedy algorithm always finds the feasible solution if there is any. We prove this by contradiction. For the example below, greedy algorithm selects a set of jobs from sequence B to match sequence A, which is (g1, g2,..., gr, g(r+1), …). The solution is denoted as (s1, s2, …, sr, s(r+1) , …). g1=s1, g2=s2, …, gr=sr for largest possible value of r. Since g(r+1) and s(r+1) match the same job. Why do not replace s(r+1) with g(r+1)? After replacement, the solution is still feasible. We get, s(r+1) =g(r+1). There is another job in common, which contradicts with the largest possible value is r. Inductively, we can always replace the choice of solution with the choice of greedy algorithm. And such replacements will not jeopardize the feasibility of the solution. In another word, greedy algorithm always returns a feasible solution if there is any. 2. In lecture 3 we discussed the greedy cashier’s algorithm for making change of x units using the smallest number of coins. The cashier’s algorithm gives the customer one unit of the highest denomination coin of at most x units, say d units. Now repeat to make change of the remaining x-d units. For each of the following nation’s coinage, establish whether or not this greedy algorithm always minimizes the number of coins returned in change. If so, prove it, if not give a counter example. (a) MiddleEarth coinage, which includes coins for 1, 4, 5, 10, and 20. No. X=8 Optimal solution: 8=4+4, two coins Greedy Algorithm: 8=5+1+1+1, four coins So, greedy algorithm does not always minimize the number of coins. (b) English coinage before the decimalization, which consisted of half-crowns (30 pence), orins (24 pence), shillings (12 pence), sixpence (6 pence), threepence (3 pence), pennies (1 pence). No. X=48; Optimal solution: 48=24+24, two coins Greedy Algorithm: 48=30+12+6, three coins So, greedy algorithm does not always minimize the number of coins returned. (c) Martian coinage, where the available denominations are powers of some integer p>1, i.e., 1, p, p^2, p^3,…,p^k. Yes, greedy algorithm always minimizes the number of coins returned. Let arbitrary 𝑥 𝑎𝑛𝑑 𝑘 satisfy: 𝑐𝑘 ≤ 𝑥 < 𝑐𝑘+1 For greedy algorithm, the solution contains coin k. Suppose greedy algorithm is not an optimal solution. In another word, the optimal solution does not contain coin k. Thus, there exists a set of coefficients (𝑛0 , 𝑛1 , 𝑛2 , … , 𝑛𝑘−1 ) that satisfies that: 𝑥 = 𝑛0 × 𝑝0 + 𝑛1 × 𝑝1 + 𝑛2 × 𝑝2 + ⋯ + 𝑛𝑘−1 × 𝑝𝑘−1 This is impossible for any optimal solution, which can be seen from the table below. k Ck 0 𝑝0 All optimal solutions must satisfy 𝑛0 ≤ 𝑝 − 1 Max value of coins 0,1,….,k-1 in any OPT - 1 𝑝1 2 𝑝2 … … k 𝑝 𝑛1 ≤ 𝑝 − 1 𝑛2 ≤ 𝑝 − 1 … (𝑝 − 1) × 𝑝0 = 𝑝 − 1 (𝑝 − 1) × 𝑝1 + (𝑝 − 1) = (𝑝 − 1)(𝑝0 + 𝑝1 ) … 𝑘−1 𝑘 𝑛𝑘 ≤ 𝑝 − 1 ∑ (𝑝 − 1) × 𝑝𝑖 = 𝑝𝑘 − 1 𝑖=0 Therefore, we refuse the hypothesis that the optimal solution does not contain coin k. Then, problem reduces to coin-changing 𝑥 − 𝑐𝑘 , which, by induction, is also optimally solved by greedy algorithm. 3. The single-destination shortest path problem for a weighted directed graph is to find the shortest path from every vertex to a specified vertex v. Give and verify an efficient algorithm to solve the single-destination shortest paths problem. Idea: First, reverse all edges; second, use the Dijkstra’s Algorithm to find the shortest paths. Algorithm: Reverse the direction of all edges. Thus, the destination vertex 𝑣 becomes the source. Maintain a set of explored nodes 𝑆 for which we have determined the shortest path distance 𝑑(𝑢) from 𝑣 to vertex 𝑢. And a set of unexplored nodes 𝑃. Initialize 𝑆 = ∅, 𝑑(𝑣) = 0, 𝑑(𝑢) = ∞ 𝑓𝑜𝑟 𝑎𝑙𝑙 𝑜𝑡ℎ𝑒𝑟 𝑣𝑒𝑟𝑡𝑒𝑥 𝑢 while (𝑃 ≠ ∅) do { select the node 𝑥, which subjects to 𝑥 ∈ 𝑃, and 𝑥 has the smallest 𝑑(𝑥) among all nodes in 𝑃 add 𝑥 to S delete 𝑥 from P for node 𝑦, 𝑦 ∈ 𝑃 𝑎𝑛𝑑 𝑦 𝑖𝑠 𝑎 𝑛𝑒𝑖𝑔ℎ𝑏𝑜𝑟 𝑜𝑓 𝑥, do { 𝜋(𝑦) = 𝑑(𝑥) + 𝑙(𝑥, 𝑦), 𝑙(𝑥, 𝑦) is the weight of edge from 𝑥 to 𝑦 If 𝜋(𝑦) < 𝑑(𝑦) do 𝑑(𝑦) = 𝜋(𝑦) } } Verification: Claim: 𝑑(𝑢) is the shortest path from 𝑣 to 𝑢 We prove this by induction on |𝑆|. |𝑆| = 1, 𝑑(𝑢) is obviously the shortest path from 𝑣 to 𝑢, if 𝑢 is selected to be added to 𝑆 next. |𝑆| > 1, let 𝑢 be the next node to be added to 𝑆. And we need to prove 𝑑(𝑢) is still the shortest path from 𝑣 to 𝑢. There are two possibilities for the path from 𝑣 to 𝑢: 1, the path only contains the nodes in 𝑆. In this situation, 𝑑(𝑢) is the shortest path since we update 𝑑(𝑢) every time we add a new node to 𝑆 2, the path 𝑝 contains nodes in both 𝑆 and 𝑃, as shown in the figure below. Suppose there is a node 𝑦 on path 𝑝. Let 𝑥 − 𝑦 be the first edge in 𝑃 that leaves 𝑆, and let 𝑝’ be the subpath to 𝑥. Due to nonnegative property of weights, we ignore the weights from 𝑦 to 𝑢, thus 𝑙(𝑝) ≥ 𝑙(𝑝′ ) + 𝑙(𝑥, 𝑦) Due to the hypothesis that 𝑑(𝑥) is the shortest path from 𝑣 to 𝑥 for all 𝑥 in 𝑆 𝑙(𝑝) ≥ 𝑙(𝑝′ ) + 𝑙(𝑥, 𝑦) ≥ 𝑑(𝑥) + 𝑙(𝑥, 𝑦) From the definition of 𝜋(𝑦), we have 𝑙(𝑝) ≥ 𝑙(𝑝′ ) + 𝑙(𝑥, 𝑦) ≥ 𝑑(𝑥) + 𝑙(𝑥, 𝑦) ≥ 𝜋(𝑦) 𝑢 is the next node to be added to 𝑆, so 𝜋(𝑦) ≥ 𝜋(𝑥) 𝑙(𝑝) ≥ 𝑙(𝑝′ ) + 𝑙(𝑥, 𝑦) ≥ 𝑑(𝑥) + 𝑙(𝑥, 𝑦) ≥ 𝜋(𝑦) ≥ 𝜋(𝑢) To sum up, any path from 𝑣 to 𝑢 will have a greater weight than 𝑑(𝑢) Due to the reversibility of path, we conclude that 𝑑(𝑢) is the shortest path from 𝑢 to 𝑣. The time complexity for this algorithm is 𝑚𝑙𝑜𝑔𝑛, where 𝑚 is the maximal value of out-going degree among all vertices in the reversed graph, 𝑛 is the number of vertices. (Implemented with binary heap) 4. Let G = (V,E) be an undirected weighted graph, and let T be the shortest-path spanning tree rooted at a vertex v. (a) Consider the graph G* obtained by modifying all the edge weights in G by multiplying the weight by a constant factor c >0. Is T still the shortest-path spanning tree in G* from v? Justify your answer. Yes, T is still the shortest-path spanning tree in G* from 𝑣. First of all, we should clarify that shortest-path spanning tree is defined as a tree, in which the path weight from root node 𝑣 to any other node is minimized. In G, for an arbitrary vertex 𝑢, the path from 𝑣 to 𝑢 defined by 𝑇 is denoted as 𝑝, any other path from 𝑣 to 𝑢 is denoted as 𝑝’. We have 𝑙(𝑝′ ) ≥ 𝑙(𝑝) for any 𝑝’. In G*, all edges have been multiplied by a constant factor 𝑐. Hence, the path weight between any two vertices is 𝑐 times of the weight in the original graph 𝐺. Thus for G*, 𝑙 ∗ (𝑝′ ) = 𝑐 × 𝑙(𝑝′ ), 𝑙 ∗ (𝑝) = 𝑐 × 𝑙(𝑝). So, 𝑙 ∗ (𝑝′ ) ≥ 𝑙 ∗ (𝑝) 𝑖𝑓 𝑐 > 0, which means path 𝑝 still has the lowest weight in graph G*. Since vertex 𝑢 and path 𝑝’ are selected arbitrarily. We are safe to conclude that T is still the shortest-path spanning tree in G* from 𝑣. (b) Consider the graph G+ obtained by modifying all the edge weights in G by adding to the weight by a constant d >0. Is T still the shortest-path spanning tree in G+ from v? Justify your answer. No, T is not necessarily the shortest-path spanning tree in G+ from 𝑣. Similar to question (a), for an arbitrary vertex 𝑢, the path defined by 𝑇 is denoted as 𝑝, and another arbitrary path is denoted as 𝑝’. We have, 𝑙(𝑝′ ) ≥ 𝑙(𝑝) In G+, 𝑙 + (𝑝′ ) = 𝑙(𝑝′ ) + 𝑚 × 𝑑, 𝑙 + (𝑝) = 𝑙(𝑝) + 𝑛 × 𝑑, where 𝑚 and 𝑛 are the number of edges on path 𝑝’ and 𝑝, respectively. Since the relationship between 𝑚 and 𝑛 is not determined (I mean 𝑛 could be much larger than 𝑚), the relationship between 𝑙 + (𝑝′ ) and 𝑙 + (𝑝) can not be determined. It could be 𝑙 + (𝑝′) < 𝑙 + (𝑝). Following is a counter example. The shortest-path spanning tree is shown in red. We can see that the shortest-path spanning tree in two graphs are not the same. 5. Suppose that you run both depth-first search and breadth-first search on a connected graph G, and they both return the same tree T. Prove that G=T, i.e., there are no additional edges in the graph. Suppose 𝐺 ≠ 𝑇. First of all, T will not have edges that are not in G, since T has the minimal number of edges to construct a connected graph. Second, assume there is an edge in G but not in T. Then, there will be a cycle in G. Nodes in this cycle is denoted as: (𝑛1 , 𝑛2 , … , 𝑛𝑘 ). 𝑛1 is connected to 𝑛2 and 𝑛𝑘 . Let 𝑛𝑖 ∈ (𝑛1 , 𝑛2 , … , 𝑛𝑘 ) is the first accessed node in this cycle. Breadth-first search: both 𝑛𝑖−1 and 𝑛𝑖+1 are one depth deeper than node 𝑛𝑖 . Depth-first search: at most one of the two neighboring nodes (𝑛𝑖−1 and 𝑛𝑖+1 ) will be explored at the next level. This contradicts with “both depth-first search and breadth-first search return the same tree”. Therefore, there is no additional edges in G. Thus, G=T.

Lirong_TAN

Related documents

Products

Support

Lirong_TAN

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib