color test color test color test color test

COLOR TEST COLOR TEST COLOR TEST COLOR TEST Dueling Algorithms NICOLE IMMORLICA, NORTHWESTERN UNIVERSITY WITH A. TAUMAN KALAI, B. LUCIER, A. MOITRA, A. POSTLEWAITE, AND M. TENNENHOLTZ Social Contexts Normal-form games: Players choose strategies to maximize expected von Neumann-Morgenstern utility. Social context games [AKT’08]: Players choose strategies to achieve particular social status among peers. Social Contexts Ranking games [BFHS’08]: Players choose strategies to achieve particular payoff rank among peers. Two-Player Ranking Games Bob Alice and Bob play game: Alice G 1 Alice beats Bob in G RG payoff of Alice: ½ Alice ties Bob in G 0 Alice loses to Bob in G Implicit Representations Succinct games [FIKU’08]: Payoff matrix represented by boolean circuit. NE hard to solve or approximate. Blotto games [B’21, GW’50, R’06, H’08]: Distribute armies to battlefields. Implicit Representations Optimization duels [this work]: Underlying game is optimization problem. Goal is to optimize better than opponent. Ranking Duel A search engine is an algorithm that inputs • set Ω = {1, 2, …, n} of items • probabilities p1 + … + pn = 1 of each and outputs a permutation π of Ω. Monopolist objective: minimize Ei~p[π(i)]. Ranking Duel Competitive objective: Let the expected score of a ranking π versus a ranking π’ be Pri~p[ π(i) < π’(i) ] + (½) Pri~p[ π(i) = π’(i) ]. Then objective is to output a π that maximizes expected score given algorithm of opponent. Optimizing a Search Engine User searches for object drawn according to known probability dist. 0.19 0.16 0.27 0.07 Search: 0.22 pretty shape 1. (27%) 2. (22%) 3. (19%) 4. (16%) 5. (09%) 6. (07%) 0.09 Choosing a Search Engine 1. Search for “pretty shape”. 2. See which search engine ranks my favorite shape higher. 3. Thereafter, use that one. 0.19 0.16 Search: 0.27 pretty shape 0.07 Search: 0.22 0.09 pretty shape 1. (22%) 1. (27%) 2. (19%) 2. (22%) 3. (16%) 3. (19%) 4. (09%) 4. (16%) 5. (07%) 5. (09%) 6. (27%) 6. (07%) Questions Can we efficiently compute an equilibrium of a ranking duel? How poorly does greedy perform in a competitive setting? What consequences does the duel have for the searcher? Optimization Problems as Duels Ranking Binary Search Routing Finish ? ? ? ? Start Hiring Compression ? ? Parking ? Duel Framework Finite feasible set X of strategies. Prob. distribution p over states of nature Ω. Objective cost c: Ω × X R. Monopolist: choose x to minimize Eω~p[cω(x)]. Duel Framework 1 if cω(x) < cω(x’) v(x,x’) = Eω~p 0 if cω(x) > cω(x’) ½ if cω(x) = cω(x’) 1. Players select strategies x, x’ from X. 2. Nature selects state ω from Ω according to p. 3. Payoffs v(x,x’), (1-v(x,x’)) are realized. Results: Computation An LP-based technique to compute exact equilibria, A low-regret learning technique to compute approximate equilibria, … and a demonstration of these techniques in our sample settings Computing Exact Equilibria Formulate game as bilinear duel: 1. 2. 3. 4. Efficiently map strategies to points X in Rn. Define constraints describing K=convex-hull(X). Define payoff matrix M that computes values. Maps points in K back to strategies in original setting. Bilinear Duels If feasible strategies X are points in Rn, and payoff v(x, x’) is xtMx’ for some M in Rnxn, then maxv,x v s.t. xtMx’ ≥ v for all x’ in X x is in K (=convex-hull(X)) Exponential, but equivalent poly-sized LP. Ranking Duel Formulate game as bilinear duel: 1. Efficiently map strategies to points X in Rn. X = set of permutation matrices (entry xij indicates item i placed in position j) 2. Define constraints describing K=convex-hull(X). K = set of doubly stochastic matrices (entry yij = prob. item i placed in position j) Ranking Duel Formulate game as bilinear duel: 4. Design “rounding alg.” that maps points in K back to strategies in original setting. Birkhoff–von Neumann Theorem: Can efficiently construct permutation basis for doubly stochastic matrix (e.g., via matching). Ranking Duel Formulate game as bilinear duel: 3. Define payoff matrix M that computes values. Ep,y,y’[v(x,x’)] = ∑i p(i) ( ½ Pry,y’ [x i = x’i ] + Pry,y’ [x i > x’i ]) = ∑i p(i) (∑i yij ( ½ y’ij + ∑k>j y’ik )) which is bilinear in y,y’ and so can be written ytMy’. Ranking Duel Result: Can reduce computation time to poly(n) versus poly(n!) with standard LP approach. Technique also applies to hiring duel and binary search duel. Compression Duel data (each with prob. p(.)) Goal: smaller compression (i.e., lower depth in tree). Classical Algorithm : Repeatedly pair nodes with lowest probability. Compression Duel Formulate game as bilinear duel: 1. Efficiently map strategies to points X in Rn. X = subset of zero-one matrices* (entry xij indicates item i placed at depth j) 2. Define constraints describing K=convex-hull(X). K = subset of row-stochastic matrices* (entry yij = prob. item i placed at depth j) * Must correspond to depth profile of some binary tree! Compression Duel Formulate game as bilinear duel: 3. Define payoff matrix M that computes values. Ep,y,y’[v(x,x’)] = ∑i p(i) (∑i yij ( ½ y’ij + ∑k>j y’ik )) which is bilinear in y,y’ and so can be written ytMy’. Compression Duel Bilinear Form: maxv,x v s.t. xtMx’ ≥ v for all x’ in X x is in K (=convex-hull(X)) Problems: 1. How to round points in K back to a random binary tree with right depth profile? 2. How to succinctly express constraints describing K? Approximate Minimax Defn. For any ε > 0, an approximate minimax strategy guarantees payoff not worse than best possible value minus ε. Defn. For any ε > 0, an approximate best response has payoff not worse than payoff of best response minus ε. Best-Response Oracle Idea. Use approximate best-response oracle to get approximate minimax strategies. 1. Low-regret learning: if x1,…,xT and x’1,…,x’T have low regret, then ave. is approx minimax. 2. Follow expected leader: on round t+1, play best-response to x1,…,xt to get low-regret. Compression Best-Response Multiple-choice Knapsack: Given lists of items with values and weights, pick one from each list with max total value and total weight at most one. Compression Best-Response Depth: 1 2 3 4 Compression Best-Response (each with prob. p(.)) x’ in K For j from 1..n, list of depth j: v( ) = Pr[win at depth j | x’ ] w( ) = 2-j … Kraft inequality Other Duels 1. Hiring duel: constraints defining Euclidean subspace correspond to hiring probabilities. 2. Binary search duel: similar to hiring duel, but constraints defining Euclidean subspace more complex (must correspond to search trees). 3. Racing duel: seems computationally hard, even though single-player problem easy. Conclusion • Every optimization problem has a duel. • Classic solutions (and all deterministic algorithms) can usually be badly beaten. • Duel can be easier or harder to solve, and can lead to inefficiencies. OPEN QUESTION: effect of duel on the solution to the optimization problem?

color test color test color test color test

Related documents

Products

Support

color test color test color test color test

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib