Hidden Minimum-Norm Problems in Quadratic Programming by Robert M. Sloan W.P. #1768-86 Revised Freund April, 1986 Abstract: quadratic This note presents sufficient conditions program, or its dual, (Euclidean) norm problem, linear transformation conditions are shown i.e. to be transformed into a minimum a problem of minimizing of an element of a polytope. the norm of sufficient conditions, These sufficient As part of the we characterize when the two linear Ax > b and Az < b have simultaneous solutions. These results are used in conjunction with duality constructions obtain two equivalent reformulations of a given quadratic Key Words: Quadratic program, norm, minimum norm problem, program. Running Header: a to be necessary under a suitable restriction on the class of transformations that are allowed. inequality systems for a convex Hidden Norm Problems. to program. gauge A minimum (Euclidean) norm problem over a polytope is a program of the form: NP: UCx minimize + dl x subject to: where 11*11 denotes the Euclidean The purpose of implications Ex > f (12) this note is norm. to explore when can a convex quadratic of the following question: program or its dual be conveniently answers to and transformed into a minimum (Euclidean) norm program? The standard convex quadratic program is given as QP: 1/2 xtQx + qtx minimize x subject to: where Q is assumed to be standard dual QD: Ax > b symmetric and positive semi-definite. of QP, see Dorn [1], is given by -1/2 ytQy + btX maximize y,X subject The to: -Qy + AtX = q X > 0 The concern herein lies data (Q,q,A,b) in discovering properties of the problem that allow the objective function replaced by a norm, in QP or QD to be thus transforming QP or QD into an instance of NP. The rationale for exploring transformations of program to the minimum norm problem is threefold. a quadratic First, the minimum norm problem has an immediate geometric interpretation that may be useful program. in Second, the solution of a given quadratic the analysis and the minimum norm problem is 1 *1__1_______1______ __11 iI_____ a classical optimization problem, and has received extensive study, example, [5]. Luenberger Third, the author has see for recently investigated an alternate duality theory for the minimum norm problem [2], that is applicable to quadratic programs that can be so transformed. If the matrix Q is symmetric and positive semi-definite, then Q can be written as Q = MtM for some matrix M; efficient for constructing M are well-known, see e.g. Gill, procedures Murray, and Wright [3]. Proposition 1. At If the system of Qs (1.1) = -q (1.2) > 0 (1.3) + bt6 btT > 0, 6 > has a solution (N, inequalities q + Qs = At6 - linear O, 6, s), (1): (1.4) then the program QP is equivalent to the minimum norm problem NP1: {{Mx minimize x subject to: + Msl Ax > b, where M is any matrix for which MtM = Q. For any x satisfying Ax > b, Proof: 1/2 xtMtMx + qtx = 1/2 xtMtMx + - 1/2 and stMtMs tAx similarly, using 1/2 xtQx btR + + qtx > -bth, < 1/2 > 1/2 tAx IIMx + MsH 2 1/2 xtQx + qtx + stMtMx = - 1/2 1/2 stMtMs = UIMx + Msll2 + btr , (1.2), IIMx + MsI1 2 - 1/2 stMtMs equality is obtained throughout, 2 - bt6. But since and 1/2 xtQx + qtx = 1/2 IIMx + Ms112 increasing in - 1/2 IMx + Msl, solution, If q lies then (0, Remark 2. (, particular, any solves 6, s) > b will index i, (1). = (0, (n, 6, s) to (1), i < m, transformed into a minimum norm problem. tAx from Q if CtC = Q. for any feasible points x, (respectively, ) (respectively, >). Proposition 2. = 6tb. In i > 0 will for QP to be Before turning to the we first to be monotonically transformable and any > b. introduce some Given the matrix Q of the program QP, said to be derived -bt6, i > 0 or for which the system Ax question of necessary conditions, If Q is (1). = bt = ntb ttAxand satisfy 1 Q-lq) solves Proposition 1 provides sufficient conditions is function can Qs = q has a i.e., In particular, , be an always-active constraint of terminology. strictly [X] For any solution x that solves Ax is This expression in the column space of Q, 0, s) positive definite, + btn. and so the quadratic objective + MslI. be replaced by [IMx Remark 1. stMtMs A norm the norm Cx + d Cx + dl is said in the objective function of x of QP. [lCx + dl > if and only if 1/2 xtQx + qtx If QP is feasible, Cx QP if + dl > 1/2 xtQx + qtx then QP can be monotonically transformed to a minimum norm problem with objective function llCx + d (1) derived from Q only if the system of linear inequalities has a solution. Proof: Suppose that system of the alternative, scalar e (1) has no solution. there exist vectors x, that satisfies: 3 Then, by a theorem x and a nonnegative Ax > be Ax > be Qx = Qx qtx > qtx There are two cases to consider, depending on whether e is positive or zero. Case 1: 8 e > . Without loss of generality, we can assume that = 1, and so x and x satisfy: Ax > b Ax > b Qx = Qx qtx > qtx If there exists C and d such that IICx + dll is a strictly monotonic transformation of the objective function of QP and is derived from Q, then Cx = Cx, 11Cx + dl and so > Cx + dll. IICx + dl = But Qx = Qx and Q = CtC IlCx + dll, a contradiction. implies Thus no that such norm can be found. Case 2: e = 0. Because QP is feasible, there exists x' that satisfies Ax' > b. Therefore (x' + x) and (x' + x) satisfy: A(x' + x) A(x' + x) Q(x' + qt(x' > b > b ) = + x) Q(x + x) > qt(x' + x) and the proof follows that of case 1, with x replaced by (x' + x) and x replaced by (x' +x). [X] Turning to the dual quadratic program QD, our first result is: 4 Av > b (2.1) Az < b (2.2) Qv - Qz = o (2.3) qtv - qtz has a solution (v, = 0 , (2): (2.4) then the dual quadratic program QD is z), to the minimum norm problem equivalent NP2: inequalities If the system of linear Proposition 3. IlMy - Mz minimize y,X subject -Qy + AtX to: = q X > 0O where M is any matrix for which Q = MtM. Proof: - 1/2 ytQy + btX < - + vtQy = we that is feasible For any (y, X) obtain - 1/2 HMy - 2 with qtz = qtv, we have HMy - Mz, IlMy - Mz11. Remark 3. applied ztQZ. and 1/2 ytMtMy + qtv 1/2 My - 1/2 2 + qtz + 1/2 (2.2), ztQz. and combining this relation ytQy + btX This expression is Mzl Similarly, using = - 1/2 flMy - Mzll2 strictly decreasing in so we can replace the maximand by the minimand [X] Proposition 3 is to the dual QD in - that Mz = Mv, - - + qtv + 1/2 vtQv. 1/2 ytQy + btX < However, Qz = Qv implies + qtz + 1/2 + vtAtX = 1/2 ytQy Mv for QD, dual. structurally the same as Proposition 1, Proposition 3 is obtained by rewriting the the format of the primal and then applying Proposition In this sense, the two propositions are the same. 5 1. In order to prove a result about necessary conditions for QD to be transformed amended. into a minimum norm problem, Given the matrix Q of the dual quadratic program QD, norm {ICy + EX + dl E = O. our notation must be is said to Cy + EX + dl A norm transformable in to be monotonically the objective function of QD points (y, X), (y, X) (respectively, ) if CtC = Q and be derived from Q if is said of QD, Cy + EX + d and only if - the 1/2 > if for any feasible Cy + EX + dl ytQy + btX < 1/2 ytQy + btX <). (respectively, Analogous to Proposition 2, we have: Proposition 4. transformed inequalities derived from Q only if the system of linear (2) has a solution. The proof exactly parallels that of is feasible, vectors feasible, then QD can be monotonically to a minimum norm problem with objective function RCy + EX + d Proof: If QD is X, then if system (2) Propostion 2. has no solution, If QD there must exist X, y that satisfy - Qy + AtX = q - Qy + AtX = q X > X> 0 btX > Thus if Cy + EX + dl function of QD, < Cy + EX + d, and btX is strictly monotonic is derived from Q, in the objective then ilCy + EX + d{ which is clearly a contradiction, Thus no such norm can be found. [X] 6 because E = 0. Proposition 2 (and in the dual, Proposition 4) shows solvability of the system of linear inequalities (1) that the is necessary for QP to be transformed to the minimum norm program, provided that the transformation is restricted by the monotonicity condition and the condition that the norm be derived from Q. restrictions are relaxed, the solvability of the system (1) longer the necessary condition, Q 0 0 Then (1) b [31 0 have no solution. However, Cx + d, C this instance 1 where and 1 = d -8] this, note that for any x feasible in QP, qtx > 8, To see indeed, the optimal UCx + d is solution is x* = = (3 ,5 )t, 42(qtx -8)(qtx with qtx* and 8. strictly increasing in qtx for qtx solution. However, C is not 8. derived from Q in to the above transformation was Also, -8) Thus Cx + d monotonically transformable from QP, even though system (1) key Let is monotonically transformable to the minimum norm program with objective function which is no as the following example shows. I, A =1 the inequalities of QP When these has no this example. the judicious choice of is The d, based on a known lower bound on the optimal value of QP. If the montonicity condition is relaxed, of the set of optimal solutions to QP allows us to write any instance of QP as a minimum norm program. optimal solution x* of QP Is unique 7 II ____l______sll then prior knowledge For example, if the (which it can be even if Q=O, i.e. QP is just a linear program), then QP is equivalent to the minimum norm program: IIx-x*11 minimize x subject to: Ax > b This transformation appears somewhat pointless in that the quadratic program has been solved before the transformation is even made. Nevertheless, the transformation can be accomplished time, because linear and convex quadratic programming are solvable in polynomial time quadratic depends [4]. program to be on the class of in polynomial The question of necessary conditions for a transformed into a norm program thus clearly transformations Note that the pair of inequalities that are allowed. (2.1) and (2.2) of Proposition 3 are described by reflecting the halfspaces defined by the feasible region of QP. A curious Proposition 3 is the simultaneous and Az < b. aff(x) and This issue is rec(x) denote, recession cone of a set X, in the system Ax constraint Ax > b, issue raised in light of solvability of treated in the next proposition. respectively, the affine hull see Rockafellar [6]. b is said to be parallel is redundant and Ajx = d > bj i.e., the system Ax > b Let and the A constraint Ajx > bj redundant if the for every x that satisfies Ajx is constant for every x satisfying Ax > b. The following proposition characterizes when the system Ax > b and Az < both have a solution. Proposition 5. furthermore Let assume that Ax > b is parallel dim(rec(x)) X = {xERnAx > b} X redundant. and Z = {zERnlAz < b), and and that no constraint of the system Then Z = dim(x). 8 if and only if b Proof: and Az 0 E Suppose Z *. Then there Let k = dim(x), < b. rec(x), equality must hold, dim aff{x, and so xl,...,xk) {xl-z But A(xl-z) ... elements of X, for which ,xk-, Conversely, If index set M B, where If k > 0, > O, *, Z = rec(X) > k, x-z} and hence equal and assume that dim(rec(x)) such that Aax = b interior of k=O, X is a singleton (x) for every satisfies Ax > b X). Furthermore, Therefore, there (x is any element (A.) = n-k. = b, and so x = aff(rec(x)) for which Ajxi = < dim(x). = Aj. and Aaxi = 0, {x E RnIAax = b). = 0, combination of the tXA i=l,...,k, i=l,...,k, rows Thus X ba of A, bj, If k > 0, If there < dim(x). i.e. and aff(xl,...,xk,o) exists an index J 6 then Aj must be a nonnegative i.e. there exists Xa > 0 for which and Ajx > bj is a parallel redundant constraint, violating the hypothesis of the proposition. dim(rec(x)) If Z, there exists linearly independent vectors x 1 ,...,xk in rec(x), > 0, dim(X) B is a parallel Thus Ax dim(rec(X)) = sets a and x E X, and rank and every index j redundant constraint, whereby B=0. that satisfy Axi to k, then the constraint can be partitioned into disjoint x of X that contradicting Z=*. there exist < dim(x). = {l,...,m} the relative < dim(x), {xl-z,...,xk-z, i.e. the constraint matrix A has m rows, exists an element of Thus dim suppose u B = M, dim(rec(x)) has k linearly independent elements. > 0,...,A(xk-z) > 0, A(x-z) because dim(rec(x)) If k=O, then since Therefore dim aff{x, x,...,xk, z}) > k, -z) are elements of rec(X). = k > 0. However, since = dim(X). = k. z for which Ax > b 0. > dim(rec(x)) i.e., vectors xl,...,xk, all and note that k > 0 = k. dim(rec(x)) exists x and [X] 9 Thus B It characterized in Proposition 5 can be the dimension of terms of and rec(X), as opposed to a characterization in terms multipliers via a theorem of the alternative. Q = In this example linear inequalities satisfy the both programs will 0 q = , 6, (, minimum norm program. = s) (0, A = 0 ) 0, solves neither, or (1), (2). (1) and/or b 1 0 X of dual show, the primal, dual, As the following examples Example 1 b in that the solvability of Ax > b and Az is curious 3 = whereby QP is a However, Az < b has no solution, and so the transformation of the dual QD by Proposition 3 cannot be accomplished. Example 2 In this (1, Q = example, z = (-1, )t, transformable to Example 3 1 - = s = solves (2); (1, 1 )t solves (1), therefore this example, 1 and x = both QP and QD are norm programs. Q 0 0 00 11 = A = 10 1 0 In 1 1 0 0 A = - 0, 6 = = 0, -l)t I , q it is straightforward to 10 = 0 -1 show that neither (2) have a solution, and so the transformations and 3 cannot be applied. 01b -- I -1 - (1) nor of Propositions 1 in Gauge Duality An Application of Hidden Norms One motivation case. norm the author's recent programming is quadratic gauge for the study of hidden minimum norm programs [2], programs of which If QP can be converted problem NP1, the minimum norm problem is a specific (through Proposition 1) to the minimum standard then either the see dual of NP1, GNPI: is given [2], (Lagrange) dual to: Mth - (stAt by = AtX Replacing the objective (Lagrange) subject to: dual Af - 0O function by 1/2 hth and then taking the yields: 1/2 ftqf minimize f,e 0 + bt)X = 1 > QP: or the IhU 2 minimize h,X subject standard investigation of dual of NP1 can be constructed. gauge dual The gauge in + estQf + 1/2 stQse 2 - e be > 0. Because QP is obtained from QP by two consecutive duality constructions (first the gauge dual, should expect QP and QP to then the Lagrange be equivalent. The feasible dual), we region of QP is obtained by adding the extra variable e that scales the righthand-side b, system Af cone - and converts the be > O, constraints Ax > b to the homogeneous replacing the polytope of QP by a polyhedral in one higher dimension. The sense of equivalence of QP and QP is shown in the next proposition, which shows how optimal solutions of QP and QP transform one to the other. 11 Let x be an s) be a solution to (1). , Let (T, 6. Proposition solution to QP with optimal Karush-Kuhn-Tucker optimal , and define t = stQx + stQs multipliers if (i) t O, if t = O, = ( Ax + btn + btS. Then e) = (x/t, l/t) solves QP with optimal K-K-T (f, multipliers (ii) (K-K-T) > b, + )/t. Qx + Qs = 0 has a solution, and QP is unbounded. Let (f, 8) c. Then: (i) solution to QP with optimal K-K-T multipliers be an optimal if e O, # x = f/e solves QP with optimal K-K-T multipliers = C/e + S. (ii) The if e = O, proof of this then QP is infeasible. proposition follows [X] the K-K-T from an examination of conditions and from direct substitution of the indicated transformations. The program QP was followed by the the (Lagrange) obtained by taking the gauge dual of QP, standard (Lagrange) dual dual. If standard quadratic program dual QD, QP: feasible Aw > 0 to: region of QP recession cone of homogeneous (2), then Starting with the and converting QD to the minimum and then taking the gauge (_qt The solves dual of NP2 yields: 1/2 wtQw minimize w subject z) QD is transformable to a minimum norm problem, and the order of the dualization can be reversed. norm problem NP2, (v, ZtQ)w = 1 (X) (r) is composed of the intersection of the the feasible region of QP (described by the constraints Aw > 0) with a hyperplane 12 (described by - (_qt tQ) w = 1) that scales elements of the recession cone. Analagous to Proposition 6, we have: Proposition 7. Let (v, z) be a solution to (2). Let x be an optimal solution to QP with optimal K-K-T multipliers A, and define u = -qtx + qtz -_ (i) tQx + if u # O, w = multipliers (ii) if u = 0, X tQz. (x = n/u, Then )/u solves QP with optimal K-K-T r = 1/u. QP is infeasible. Let w be an optimal solution to QP with optimal K-K-T multipliers X, r. Then (i) if r # 0, x = w/r + v solves QP with optimal K-K-T multipliers (ii) if r = 0, = t/r. QP is unbounded. [X] It is hoped that the equivalences of QP to the programs QP and QP given herein will be useful in applications of quadratic programming. 13 References [1] W.S. Dorn, "Duality in quadratic 18, 155-162, 1960. [2] R.M. Freund, "Dual gauge programs, with applications to quadratic programming and the minimum-norm problem," to appear in Mathematical Programming. [3] P. Gill, W. Murray, and M. Wright, Academic Press (New York, 1981). [4] M.K. Kozlov, S. P. Tarasov, and L.G. Khachiyan, "Polynomial solvability of convex quadratic programming," Doklady Academii English translation, Soviet Mathematics Nauk SSR 248(5) (1979) Doklady 20(5) ( 1979)]. [5] D.G. Luenberger, Optimization by vector space methods, John Wiley & Sons (New York, 1969). [6] R.T. Rockafellar, Convex analysis, Princeton University Press (Princeton, New Jersey, 1970). 14 programming," Quart. Appl. Math. Practical Optimization,