Lecture 4. The Dual Cone and Dual Problem

advertisement
IE 8534
1
Lecture 4. The Dual Cone and Dual Problem
Shuzhong Zhang
IE 8534
2
For a convex cone K, its dual cone is defined as
K∗ = {y | ⟨x, y⟩ ≥ 0, ∀ x ∈ K}.
The inner-product can be replaced by ‘xT y’ if the coordinates of the
vectors are given. Clearly, K∗ is always a closed set, regardless if K is
closed or not. In fact, K∗ is also always convex regardless if K is
convex or not.
Shuzhong Zhang
IE 8534
3
In light of the dual cone, the problem of finding a lower bound on the
optimal value for the standard conic optimization problem can be
done as follows. Introduce the Lagrangian multiplier y for the equality
constraint b − Ax = 0, and s for the conic constraint x ∈ K. The
objective becomes
cT x + y T (b − Ax) − sT x = y T b + (c − AT y − s)T x.
This will be a lower bound on the optimal value whenever
c − AT y − s = 0 and s ∈ K∗ . The quest to find the best lower bound
leads to the dual problem for the standard conic optimization problem:
maximize
bT y
subject to
AT y + s = c
(1)
s ∈ K∗ .
Shuzhong Zhang
IE 8534
4
As an example, for the standard SDP problem (primal),
minimize
C •X
subject to
Ai • X = bi , i = 1, ..., m
X ≽ 0,
its dual problem is in the form of
maximize
subject to
∑m
i=1 bi yi
∑m
i=1 yi Ai
+S =C
S ≽ 0.
Shuzhong Zhang
IE 8534
5
The so-called weak duality relationship is immediate, by the
construction of the dual problem. This is shown in the next theorem.
Theorem 1 Let x be feasible for the primal problem, and (y, s) be
feasible to its dual. Then,
bT y ≤ cT x.
Moreover, if bT y = cT x then xT s = 0, and they are both optimal to
their own respective problems.
If a pair of primal-dual feasible solutions x and (y, s) both exist and
they satisfy xT s = 0 then we call them a pair of complementary
optimal solutions.
Shuzhong Zhang
IE 8534
6
A very useful theorem is the following.
Theorem 2 Let x be feasible for the primal problem, and (y, s) be
feasible to its dual, and x ∈ int K and s ∈ int K∗ . Then, the sets of
optimal solutions for both problems are nonempty and compact, and
any pair of primal-dual optimal solutions will be complementary to
each other.
Shuzhong Zhang
IE 8534
7
Let us spend some time now to study the structures of the dual cones.
Let us start with the topological structures. If rel (K) is denoted as the
relative interior of K, then (rel (K))∗ = K∗ . Let the Minkowski
summation of two sets A and B be
A + B = {z | z = x + y with x ∈ A, y ∈ B}.
It is clear that the sum of two convex sets remains convex. However,
the sum of two closed convex sets may not be closed any more.
Shuzhong Zhang
IE 8534
8

−1




Example 1 The set R+  0 
 + SOC(3) is convex but not closed.
1
For instance, the point (0, 1, 0)T is not in the set, but it is in the
closure of the set, since for any ϵ > 0, we may write
 
 


1
ϵ
−1
ϵ +ϵ
  1
 

 1  =  0  +  1 ,
 
 

ϵ
0
1
− 1ϵ
where the second term is in SOC(3).
Shuzhong Zhang
IE 8534
9
However, if A and B are both polyhedra, then A + B remains a
polyhedron, hence closed. It is interesting to note that the
set-summation and the set-intersection operations are a duality pair:
Proposition 1 Suppose that K1 and K2 are convex cones. It holds
that
(K1 + K2 )∗ = K1∗ ∩ K2∗ .
Proof. For y ∈ (K1 + K2 )∗ , it follows that 0 ≤ (x1 + x2 )T y for all
x1 ∈ K1 and x2 ∈ K2 . Since 0 ∈ K2 , it follows that 0 ≤ (x1 )T y for all
x1 ∈ K1 , thus y ∈ K1∗ , and similarly y ∈ K2∗ . Hence, y ∈ K1∗ ∩ K2∗ .
Now, take any y ∈ K1∗ ∩ K2∗ , and any x1 ∈ K1 and x2 ∈ K2 . We have
(x1 + x2 )T y = (x1 )T y + (x2 )T y ≥ 0, and so y ∈ (K1 + K2 )∗ . This
proves (K1 + K2 )∗ = K1∗ ∩ K2∗ as desired.
2
Shuzhong Zhang
IE 8534
10
The following bipolar theorem can be considered as a cornerstone for
convex analysis.
Theorem 3 Suppose that K is a convex cone. Then (K∗ )∗ = cl K.
Proof. For any x ∈ K, by the definition of the dual cone, xT y ≥ 0
for all y ∈ K∗ , and so K ⊆ (K∗ )∗ . Therefore cl K ⊆ (K∗ )∗ .
The other containing relationship (K∗ )∗ ⊆ cl K can be shown by
contradiction. Suppose such is not true then there exists
z ∈ (K∗ )∗ \ cl K. By the separation theorem for convex cones, there is
vector c such that
cT z < 0, and cT x ≥ 0 ∀x ∈ cl K.
The last relation suggests that c ∈ K∗ , which contradicts with the first
relation because, as z ∈ (K∗ )∗ , it should lead to cT z ≥ 0, but not the
opposite. This contradiction completes the proof.
2
Shuzhong Zhang
IE 8534
11
This proof suggests that the bipolar theorem for convex cone is
equivalent to the separation theorem itself. Suppose that K is a closed
convex cone, and v ̸∈ K. Then, the bipolar theorem asserts that
v ̸∈ (K∗ )∗ , and this implies that there is y ∈ K∗ such that v T y < 0,
and xT y ≥ 0 for any x ∈ K because y ∈ K∗ , hence a separation exists.
Shuzhong Zhang
IE 8534
12
In case K is closed, the bipolar theorem is what in folklore understood
as ‘the dual of the dual is the primal itself again’. Combining
Theorem 3 and Proposition 1, it follows that
cl (K1 + K2 ) = (K1 ∩ K2 )∗ .
It is also useful to compute the effect of linear transformation on a
convex cone in the context of dual operation. Let A be an invertible
matrix. Then, we have
(AT K)∗ = A−1 K∗
by observing that y ∈ (AT K)∗ ⇐⇒ (AT x)T y ≥ 0 ∀x ∈ K ⇐⇒ Ay ∈ K∗
⇐⇒ y ∈ A−1 K∗ .
Shuzhong Zhang
IE 8534
13
A convex cone is called solid if it is full dimensional. Mathematically,
K ⊆ Rn is solid ⇐⇒
span K = Rn , or K + (−K) = Rn .
A convex cone is called pointed if it does not contain any nontrivial
subspace; i.e. K ∩ (−K) = {0}. The following result proves to be useful:
Proposition 2 If K is solid then K∗ is pointed, and if K is pointed
then K∗ is solid.
Shuzhong Zhang
IE 8534
14
Proof. Suppose that K is solid, i.e. there is a ball B(x0 ; δ) ⊆ K with
radius δ > 0 and centered at x0 , and at the same time suppose that
there is a line Rc ∈ K∗ , then
0 ≤ s(x0 + t)T c, ∀∥t∥ ≤ δ and s ∈ R,
which is clearly impossible. This contradiction shows that if K is solid
then K∗ must be pointed. By the bipolar theorem, the second part
follows by symmetry.
2
Shuzhong Zhang
IE 8534
15
It is easy to compute the following dual cones
• (Rn+ )∗ = Rn+ .
• SOC(n)∗ = SOC(n).
n×n
n×n ∗
n×n
n×n ∗
.
) = H+
, and (H+
) = S+
• (S+
The above standard cones are so regular that their corresponding dual
cones coincide with the original cones themselves (the primal cones).
In general, they are not the same. As an example, consider the
following Lp cone, where p ≥ 1,



 t

n
 t ∈ R, x ∈ R , t ≥ ∥x∥p .
Lp (n + 1) = 
 x

Shuzhong Zhang
IE 8534
16
One can show that (Lp (n + 1))∗ = Lq (n + 1) where 1/p + 1/q = 1.
Now, the self-duality of SOC(n + 1) is due to the fact that
SOC(n + 1) = L2 (n + 1).
In convex analysis, the dual object for a convex function f (x) is known
as the conjugate of f (x), defined as
f ∗ (s) = sup{(−s)T x − f (x) | x ∈ dom f },
where ‘dom f ’ stands for the domain of the function f . The conjugate
function is also known as the Legendre-Fenchel transformation.
Shuzhong Zhang
IE 8534
17
Examples of some popular conjugate functions are:
1. For f (x) = 12 xT Qx + bT x where Q ≻ 0, the conjugate function is
f ∗ (s) = 12 (s + b)T Q−1 (s + b).
∑ n xi
2. For f (x) = i=1 e , the conjugate function is
∑n
∑n
∗
f (s) = − i=1 si log(−si ) + i=1 si .
∑n
3. For f (x) = i=1 xi log xi , the conjugate function is
∑n 1+si
∗
f (s) = i=1 e
.
4. For f (x) = cT x, the conjugate function is f ∗ (s) = 0 for s = −c
and f ∗ (s) = +∞ when s ̸= −c.
∑n
5. For f (x) = − i=1 log xi , the conjugate function is
∑n
∗
f (s) = −n − i=1 log si .
6. For f (x) = max1≤i≤n xi , the conjugate function is f ∗ (s) = 0 if
∑n
∗
s
≤
−1,
and
f
(s) = +∞ otherwise.
i
i=1
Shuzhong Zhang
IE 8534
18
If f (x) is strictly convex and differentiable, then f ∗ (s) is also strictly
convex and differentiable. The famous bi-conjugate theorem states
that f ∗∗ = cl f . There is certainly a relationship between the two dual
objects: the dual cone and the conjugate function. Let us now set out
to find the exact relationship. First of all, suppose that f is a convex
function, mapping from its domain in Rn to R. Its epigraphy is a set
in Rn+1 defined as




 t
 t ≥ f (x), t ∈ R, x ∈ dom (f ) .
epi (f ) := 

 x
Shuzhong Zhang
IE 8534
19
It is well known that f is a convex function if and only if epi (f ) is a
convex set. It turns out that H (epi (f ∗ )) is the dual of H (epi (f )),
after taking the closure and reorienting itself. To be precise, we have:
Theorem 4 Suppose that f is a convex function, and








  p   q  p > 0, q − pf (x/p) ≥ 0 .
K := cl








x
Then,







  u  
 v  v > 0, u − vf ∗ (s/v) ≥ 0 .
K∗ = cl








s
Shuzhong Zhang
IE 8534
20


Let L2 := 
0, 1
 and L := L2 ⊕ In = [L2 , 02,n ; 0n,2 , In ]. The effect
1, 0
of applying L to x is to swap the position of x1 and x2 . Equivalently,
Theorem 4 can be stated as
∗
(H (epi (f ))) = cl (L (H (epi (f ∗ )))) .
Shuzhong Zhang
IE 8534
21
Another related concept is the so-called polar set. Suppose S ⊂ Rn .
Then its polar set is defined as
S ◦ = {y | (−y)T x ≤ 1, ∀x ∈ S}.
Clearly, S ◦ is always closed and convex, and it always contains the
origin.
If S is a cone, then S ◦ = S ∗ .
One can verify that

 


 1
 ∈ H (S)∗ .
S ◦ = y 

 y
Shuzhong Zhang
IE 8534
22
Some basic properties regarding the polar set immediately follow:
• If A ⊆ B then A◦ ⊇ B ◦ ;
• (∪i∈I Si )◦ = ∩i∈I Si◦ ;
• If S = conv(v1 , · · · , vm ) then
S ◦ = {x | −viT x ≤ 1, i = 1, 2, ..., m}.
In other words, the polar of a polytope is a polyhedron.
• It always holds S ⊆ S ◦◦ .
Shuzhong Zhang
IE 8534
23
By Theorem 3, we also have
Theorem 5 If S is a convex set, then
S ◦◦ = cl S.
The above is evidently equivalent to Theorem 3. They can both be
referred to as the bipolar theorem.
Shuzhong Zhang
IE 8534
24
Let us return to primal-dual conic optimization models.
In linear programming, it is known that whenever the primal and the
dual problems are both feasible then complementary solutions must
exist. This important property, however, is lost, for a general conic
optimization model, as the following example shows.
Consider
minimize
x11
subject to
x + x21 = 1,
12

x , x12
 11
 ≽ 0.
x21 , x22
This problem is clearly feasible. However, there is no attainable
optimal solution.
Shuzhong Zhang
IE 8534
25
Its dual problem is
maximize
y
subject to
y
0, 1
+
s11 , s12

1, 0

s21 , s22

s11 , s12


s21 , s22



=

1, 0
,
0, 0
 ≽ 0,
or, more explicitly,
maximize
y

subject to

1,
−y
−y,
0

 ≽ 0.
The dual problem is feasible and has an optimal solution; however, a
pair of primal-dual complementary optimal solutions fails to exist,
although both primal and dual problems are feasible.
Shuzhong Zhang
IE 8534
26
One may guess that the unfortunate situation was caused by the
non-attainable optimal solution. Next example shows that even if both
primal and dual problems are nicely feasible with attainable optimal
solutions, there might still be a positive duality gap, meaning that the
complementarity condition does not hold.
minimize
−x12 − x21 + 2x33
subject to
x12 + x21 + x33 = 1,
x22 = 0,

x11 , x12 , x13

 x21 , x22 , x23

x31 , x32 , x33


 ≽ 0.

Shuzhong Zhang
IE 8534
27
The dual of this problem is
maximize
subject to
y1

0,

 −1 − y1 ,

0,
−1 − y1 ,
0
−y2 ,
0
0,
2 − y1


 ≽ 0.

The primal problem has an optimal solution, e.g.


1, 0, 0


∗

X =  0, 0, 0 
, with the optimal value equal to 2. The dual
0, 0, 1
problem also is feasible, and has an optimal solution e.g.
(y1∗ , y2∗ ) = (−1, −1), with optimal value equal to −1. Both problems
are feasible and have attainable optimal solutions; however, their
optimal solutions are not complementary to each other, and there is a
positive duality gap: cT x∗ − bT y ∗ = (x∗ )T s∗ > 0.
Shuzhong Zhang
IE 8534
28
Key References:
• Yu. Nesterov and A. Nemirovsky, Interior Point Polynomial
Methods in Convex Programming, Studies in Applied Mathematics
13, SIAM, 1994.
• J.F. Sturm, Theory and Algorithms for Semidefinite Programming.
In High Performance Optimization, eds. H. Frenk, K. Roos, T.
Terlaky, and S. Zhang, Kluwer Academic Publishers, 1999.
• S. Zhang, A New Self-Dual Embedding Method for Convex
Programming, Journal of Global Optimization, 29, 479 – 496, 2004.
Shuzhong Zhang
Download