Journal of Inequalities in Pure and Applied Mathematics

Journal of Inequalities in Pure and
Applied Mathematics
http://jipam.vu.edu.au/
Volume 6, Issue 3, Article 87, 2005
A MINKOWSKI-TYPE INEQUALITY FOR THE SCHATTEN NORM
MARKUS SIGG
D EPARTMENT OF M ATHEMATICS AND S TATISTICS
U NIVERSITY OF KONSTANZ
G ERMANY
mail@MarkusSigg.de
Received 16 July, 2004; accepted 29 June, 2005
Communicated by F. Hansen
A BSTRACT. Let F be a Schatten p-operator and R, S positive operators. We show that the
1 c
1 c
1 c
inequality |F (R + S) c |p ≤ |F R c |p +|F S c |p for the Schatten p-norm |·|p is true for p ≥ c = 1
and for p ≥ c = 2, conjecture it to be true for p ≥ c ∈ [1, 2], give counterexamples for the other
cases, and present a numerical study for 2 × 2 matrices. Furthermore, we have a look at a
generalisation of the inequality which involves an additional factor σ(c, p).
Key words and phrases: Schatten class, Schatten norm, Norm inequality, Minkowski inequality, Triangle inequality, Powers
of operators, Schatten-Minkowski constant.
2000 Mathematics Subject Classification. 47A30, 47B10.
1. I NTRODUCTION
Let H and K be complex Hilbert spaces and 0 < p ≤ ∞. Following [1], we denote by
cp (H, K) the space of Schatten p-operators T : H −→ K, equipped with the Schatten pnorm or quasi-norm | · |p . Note that [1] deals only with the spaces cp (H) := cp (H, H). The
generalisations cp (H, K) can be found in textbooks like [2] and [3] (there written as Bp (H, K)
and Sp (H, K) respectively).
By L(H) we denote the space of bounded linear operators on H, and by L(H)+ the subset
of positive operators. With |T | := (T ∗ T )1/2 ∈ L(H)+ for T ∈ L(H, K) we have for p < ∞
|T |pp = tr |T |p
for T ∈ cp (H, K), and consequently
|T |pp
for T ∈ cp (H)+ := cp (H) ∩ L(H)+ .
= tr T
p
Applying |T |p = |T ∗ |p for T ∈ cp (H, K), this shows in case of p < ∞
1 2 2
1 2
2 ∗
∗ p2 p
2
F
U
=
F
=
tr
(F
U
F
)
= |F U F ∗ | p
U
p
p
ISSN (electronic): 1443-5756
c 2005 Victoria University. All rights reserved.
This work was supported by the Dr. Helmut Manfred Riedl Foundation.
135-04
2
2
M ARKUS S IGG
for F ∈ cp (H, K) and U ∈ L(H)+ . Because | · |∞ is the usual operator norm,
1 2
2
F U = |F U F ∗ | p
2
p
is also true for p = ∞, with the common convention ∞
:= ∞.
2
Our question, which arose while studying the integration of Schatten operator valued functions in [4], is: For what values of p ∈ (0, ∞] and c ∈ (0, ∞) is the Minkowski-like inequality
1 c
1 c
1 c
c
c
c
F (R + S) ≤ F R + F S (MS)
p
p
p
true for all F ∈ cp (H, K) and R, S ∈ L(H)+ ?
2. T HE C ONJECTURE
Let H, K, p, c, F, R, S be as above.
Theorem 2.1. Inequality (MS) is true for p ≥ c = 1 and for p ≥ c = 2.
Proof. For p ≥ c = 1, the triangle inequality for | · |p shows
|F (R + S)|p = |F R + F S|p ≤ |F R|p + |F S|p .
For p ≥ c = 2, the triangle inequality for | · | p shows
2
2
1 2
1
1 2
F (R + S) 2 = |F (R + S)F ∗ | p ≤ |F RF ∗ | p + |F SF ∗ | p = F R 2 + F S 2 .
2
p
2
2
p
p
Theorem 2.1 suggests the following conjecture.
Conjecture 2.2. Inequality (MS) is true for p ≥ c ∈ [1, 2].
For c ∈ (1, 2) we have at the present time no proof of this conjecture for other than trivial
situations, not even for the special case of 2 × 2 matrices. However, some justification will be
given in Section 4.
3. T HE C ASE p < c AND
THE
C ASE c 6∈ [1, 2]
In this section we will demonstrate, by providing counterexamples, that inequality (MS) is
not necessarily true for other values of (c, p) than those stated in Conjecture 2.2. We will offer
one example for 0 < p < c < ∞, and one for arbitrary p when c < 1 or c > 2, both examples
using 2 × 2 matrices. The power U t for t > 0 of a non-negative matrix U can be calculated
easily with help of the spectral decomposition of U .
Example 3.1. Inequality (MS) is violated for 0 < p < c < ∞ by
1 0
1 0
0 0
F :=
, R :=
, S :=
.
0 1
0 0
0 1
Proof. From U t = U for U ∈ {R, S, R + S} and t ∈ (0, ∞) we get
1
1
1
c
F
U
= |U |p = (tr U p ) p = (tr U ) p ,
p
yielding
1
F R c = 1,
p
J. Inequal. Pure and Appl. Math., 6(3) Art. 87, 2005
1
F S c = 1,
p
1
1
F (R + S) c = 2 p ,
p
http://jipam.vu.edu.au/
A M INKOWSKI -T YPE I NEQUALITY FOR THE S CHATTEN N ORM
and using p < c,
3
c
1 c
1 c
1 c
p
c
c
c
F R + F S = 2 < 2 = F (R + S) .
p
p
p
The second example makes use of an inequality which is interesting in its own right. Seeming
simple, it is surprisingly fiddly to prove:
Lemma 3.1. For x ∈ (0, 1) ∪ (2, ∞) we have
√ !x √ !x
1
1
3+ 5
3− 5
1+ √
+ 1− √
< 1 + 3x .
2
2
5
5
√
Proof. Setting r := 5, α1 := 1 + 1r , α2 := 1 − 1r , and ω := 3+r
, we have to show
2
α1 ω x + α2 ω −x < 1 + 3x .
The case x ∈ (2, ∞): Set f (x) := α1 ω x , g(x) := α2 ω −x , h(x) := 1 + 3x for x ∈ (0, ∞).
Because α2 > 0 and ω > 1, g is strictly decreasing, thus f (x) + g(x) < f (x) + g(2) for x > 2.
We will show f (x) + g(2) < h(x) for x > 2. Because f (2) + g(2) = h(2), this is done if we
prove f 0 (x) < h0 (x) for x > 2, which is equivalent to α1 ( ω3 )x ln ω < ln 3. This inequality is
true for x = 2. All factors of its left side are positive, and ω < 3, so the left side is strictly
decreasing for x ≥ 2. Hence the inequality is true for x > 2 as well.
ln 3
The case x ∈ (0, 1): After substituting s := ω x and setting δ := ln
, we have to prove the
ω
equivalent inequality
1 1
1
s+ +
s−
< 1 + sδ
s r
s
for s ∈ (1, ω), which can be done by building a sandwich with a suitable polynomial function
inside: Set
1 1
1
s−1
s−
, p(s) := 2 1 +
,
ϕ(s) := s + +
s r
s
ω−1
(s − 1)(s − ω)
(ϕ(3) − p(3))
q(s) :=
(3 − 1)(3 − ω)
for s > 0. The claim is
ϕ(s) < p(s) + q(s) < 1 + sδ
for s ∈ (1, ω). The left inequality is verified by the fact that s · (p(s) + q(s) − ϕ(s)) defines a
polynomial of degree 3 with three zeros {1, ω, 3}, where 1 < ω < 3, and with positive leading
coefficient λ := 21 (ϕ(3) − p(3))/(3 − ω). To prove the second inequality, we inspect
ψ(s) := 1 + sδ − p(s) − q(s)
for s > 0 and get ψ 00 (s) = δ(δ − 1)sδ−2 − 2λ. Because 1 < δ < 2, ψ 00 has a unique zero
1
δ(δ − 1) 2−δ
s0 :=
, 1 < s0 < ω,
2λ
with ψ 00 (s) > 0 for s ∈ (0, s0 ) and ψ 00 (s) < 0 for s ∈ (s0 , ∞). Now ψ(1) = 0, ψ 0 (1) > 0,
and ψ 00 (s) > 0 for s ∈ (1, s0 ) show ψ(s) > 0 for s ∈ (1, s0 ], while ψ(s0 ) > 0, ψ(ω) = 0,
ψ 0 (ω) < 0, and ψ 00 (s) < 0 for s ∈ (s0 , ω) show ψ(s) > 0 for s ∈ [s0 , ω).
Example 3.2. Inequality (MS) is violated for 0 < p ≤ ∞ and c < 1 as well as c > 2 by
0 0
0 0
2 −1
F :=
, R :=
, S :=
.
1 0
0 1
−1 1
J. Inequal. Pure and Appl. Math., 6(3) Art. 87, 2005
http://jipam.vu.edu.au/
4
M ARKUS S IGG
Proof. Evaluation of the matrix powers for t ∈ (0, ∞) gives
Rt = R, S t =
(R
with r :=
p<∞
√
1
(α1 ω t + α2 ω −t )
2
1
(ω −t − ω t )
r
t
1 1+3
t
+ S) =
t
2
1−3
5, α1 := 1 + 1r , α2 := 1 − 1r , ω :=
3+r
.
2
!
1
(ω −t − ω t )
r
1
(α2 ω t + α1 ω −t )
2
1 − 3t
,
!
1 + 3t
For U ∈ {R, S, R + S} we get in case of
p1 √
p
= ut
|F U t |p = tr (F U 2t F ∗ ) 2
2
with ut being the top left entry of U 2t . Using |F U t |∞ = |F U 2t F ∗ |∞ , the case p = ∞ yields the
same result, thus for all p:
r
1
1
1
(α1 ω 2/c + α2 ω −2/c ),
F R c = 0, F S c =
2
p
p
r
1
1
(1 + 32/c ).
F (R + S) c =
2
p
Substituting 2c by x, we have to prove α1 ω x +α2 ω −x < 1+3x for x ∈ (2, ∞) and for x ∈ (0, 1),
which is the statement of Lemma 3.1.
4. S OME N UMERICAL E VIDENCE
To justify Conjecture 2.2, we present the results of a numerical study performed with 2 × 2
matrices.
From functional calculus it is known: For an operator T ≥ 0 on a complex Hilbert space the
powers T α , T β for α, β ∈ (0, ∞) obey the rule T α T β = T α+β . If T is invertible, then T α can
be defined for α ≤ 0 as well, and T α T β = T α+β is true for all α, β ∈ R.
Before turning to the matrix case, we note the following general lemma.
Lemma 4.1. Let H, K, F be as above and α ∈ (0, ∞).
(a) Let T ∈ L(H)+ . Then F T α = 0 if and only if F T = 0.
(b) Let R, S ∈ L(H)+ . Then F (R + S)α = 0 if and only if F Rα = 0 and F S α = 0.
2
Proof. (a) Suppose F T α = 0. Then |F T α/2 | = |F T α F ∗ | = 0, hence F T α/2 = 0. Repeated
application yields β ∈ (0, 1) with F T β = 0, thus F T = F T β T 1−β = 0.
Now suppose F T = 0. There is nothing to prove in the case of α = 1, so assume α 6= 1.
If T is invertible, then F T α = F T T α−1 = 0. If T is not invertible, then we have 0 ∈ σ(T ),
the spectrum of T . Choose polynomials fn ∈ R[t] for n ∈ N such that fn (x) → xα for
n → ∞ uniformly for x ∈ σ(T ). Then fn (T ) → T α and F fn (T ) → F T α for n → ∞, hence
F fn (T ) = fn (0)F → 0 for n → ∞, thus F T α = 0.
(b) Part (a) shows:
F Rα = 0 ∧ F S α = 0
⇐⇒
FR = 0 ∧ FS = 0
=⇒
F (R + S) = 0
⇐⇒
F (R + S)α = 0.
To prove the missing implication, suppose F (R + S) = 0. Then F RF ∗ + F SF ∗ = 0. Because
F RF ∗ ≥ 0 and F SF ∗ ≥ 0, we get F RF ∗ = 0, thus |F R1/2 |2 = |F RF ∗ | = 0 and F R1/2 = 0.
Applying (a) again gives F R = 0. Symmetry shows F S = 0.
J. Inequal. Pure and Appl. Math., 6(3) Art. 87, 2005
http://jipam.vu.edu.au/
A M INKOWSKI -T YPE I NEQUALITY FOR THE S CHATTEN N ORM
5
We will also use the following well-known property of 2 × 2 matrices:
Lemma 4.2. A complex 2 × 2 matrix M is positive semidefinite if and only if there exist a, b ∈
[0, ∞) and γ ∈ C with |γ|2 ≤ ab such that
a γ
M=
.
γ b
Lemma 4.1(b) shows that, when checking Conjecture 2.2, one may assume the denominator
to be non-zero, or setting 00 := 0, in
1 c
F (R + S) c p
.
qc,p (F, R, S) := 1 c
1 c
c
c
F R + F S p
p
We are searching for the supremum of qc,p (F, R, S) over all complex 2 × 2 matrices F, R, S
with R, S ≥ 0. For r ∈ [0, ∞) and x ∈ C define r ∧ x := x if |x| ≤ r and r ∧ x := (r/|x|) x
otherwise. Lemma 4.2 shows that R has the structure
α2
|α β| ∧ γ
R=
=: P (α, β, γ)
|α β| ∧ γ
β2
with α, β ∈ R and γ ∈ C, and a corresponding representation is valid for the matrix S.
This means that we have to deal with six complex and four real variables, resulting in a 16dimensional real optimisation problem: For λ = (λ1 , . . . , λ16 ) ∈ R16 we set
λ1 + λ 2 i λ 3 + λ 4 i
Fλ :=
,
λ5 + λ 6 i λ 7 + λ 8 i
Rλ := P (λ9 , λ10 , λ11 + λ12 i),
Sλ := P (λ13 , λ14 , λ15 + λ16 i)
and are asking for
σ(c, p) := sup qc,p (Fλ , Rλ , Sλ ).
λ∈R16
To attack this problem, GNU Octave [5], version 2.1.57, was utilised. It offers a function
for determining the singular values of a matrix, which can be employed for calculating the
Schatten norms. For the optimisation task the implementation [6], version 2002/05/09, with
standard parameters of the Downhill Simplex Method of Nelder and Mead ([7], 10.4) was used.
The results are in perfect agreement with Conjecture 2.2. For visualisation, approximations for
σ(c, p) for c ∈ {1.2, 1.4, 1.6, 1.8, 2.0} have been calculated and plotted with a step size of 0.01
for p, see Figure 4.1.
The apparently smooth shape of p 7→ σ(c, p) for p ≤ c, together with the fact that for each p
a new random starting point λ was used for the Nelder-Mead algorihm, gives some confidence
in the validity of the data.
A closer inspection of some of the calculated numerical values suggests
1
1
σ(2, 1) = 2, σ 32 , 1 = σ 95 , 65 = 2 2 , σ 85 , 65 = σ 2, 32 = 2 3 ,
1
1
σ 45 , 1 = σ 32 , 54 = σ 74 , 75 = σ 2, 85 = 2 4 , σ 65 , 1 = σ 95 , 32 = 2 5 ,
which leads to the idea to look at log2 σ(c, p). It seems there is a linear dependency of log2 σ(c, p)
from c if c ≥ p. This observation will be made precise in the next section.
J. Inequal. Pure and Appl. Math., 6(3) Art. 87, 2005
http://jipam.vu.edu.au/
6
M ARKUS S IGG
Figure 4.1: Experimental approximations of σ(c, p).
2
1.9
1.8
1.7
1.6
1.5
1.4
1.3
1.2
1.1
1
0.9
c = 1.2
c = 1.4
c = 1.6
c = 1.8
c = 2.0
1
1.2
1.4
1.6
1.8
2
p
5. G ENERALISATION OF (MS)
It is natural to generalise (MS) and to ask for the smallest σ(c, p) ∈ [0, ∞] for c ∈ (0, ∞) and
p ∈ (0, ∞] such that
1 c
1 c
1 c
c
c
c
F (R + S) ≤ σ(c, p) F R + F S p
p
p
for all F ∈ cp (H, K) and R, S ∈ L(H)+ (and for all complex Hilbert spaces H and K). It
is tempting to call σ(c, p) the Schatten-Minkowski constant for (c, p). By choosing F 6= 0 and
setting R to be the identity and S := 0 it can be seen that σ(c, p) ≥ 1. Now Conjecture 2.2 can
be re-phrased using σ(c, p), and, motivated by the numerical results, we add another conjecture:
Conjecture 5.1. (a) For 1 ≤ c ≤ 2 and p ≥ c we have σ(c, p) = 1.
c
(b) For 0 ≤ c ≤ 2 and p ≤ c we have σ(c, p) = 2 p −1 .
Again, the cases c = 1 and c = 2 are not too difficult to prove:
(
(
1
1
for p ≥ 1
(b) σ(2, p) =
Theorem 5.2. (a) σ(1, p) =
1
2
−1
for p ≤ 1
2 p −1
2p
for p ≥ 2
.
for p ≤ 2
Proof. σ(1, p) ≤ 1 for p ≥ 1 and σ(2, p) ≤ 1 for p ≥ 2 is the subject of Theorem 2.1, while
σ(c, p) ≥ 1 is noted above. Example 3.1 tells us that σ(c, p) ≥ 2c/p−1 for 0 < p ≤ c < ∞,
yielding
1
2
and
σ(2, p) ≥ 2 p −1 for p ≤ 2.
σ(1, p) ≥ 2 p −1 for p ≤ 1
Now for the missing ‘≤’ inequalities. For the case c = 1, recall the inequality between the
power means of degrees p ≤ 1 and 1, see e.g. [8], 8.12, which reads
p
1
α + βp p
α+β
≤
or equivalently αp + β p ≤ 21−p (α + β)p
2
2
for α, β ∈ [0, ∞). Together with the quasi-norm inequality of | · |p this gives
|F (R + S)|pp ≤ |F R|pp + |F S|pp ≤ 21−p (|F R|p + |F S|p )p
J. Inequal. Pure and Appl. Math., 6(3) Art. 87, 2005
http://jipam.vu.edu.au/
A M INKOWSKI -T YPE I NEQUALITY FOR THE S CHATTEN N ORM
7
1
and thus |F (R + S)|p ≤ 2 p −1 (|F R|p + |F S|p ).
For the case c = 2, start with the power means inequality for the degrees p ≤ 2 and 2,
p
1 2
1
p
p
α + β2 2
α + βp p
≤
or equivalently αp + β p ≤ 21− 2 (α2 + β 2 ) 2
2
2
for α, β ∈ [0, ∞). Together with the quasi-norm inequality of | · | p this gives
2
p
1 p
F (R + S) 2 = |F (R + S)F ∗ | p2
p
2
p
2
p
2
p
≤ |F RF ∗ | + |F SF ∗ | p2
2
p2
p
1 p
1 p
1 2
1 2
1−
= F R 2 + F S 2 ≤ 2 2 F R 2 + F S 2 p
p
p
p
and consequently
2
1 2
1 2
1 2
−1
F (R + S) 2 ≤ 2 p
F R 2 + F S 2 .
p
p
p
6. C ONCLUSION
Starting with Conjecture 2.2, which we proved for the cases c = 1 and c = 2 in Theorem 2.1,
a numerical study of 2 × 2 matrices led to the generalised Conjecture 5.1, which we also proved
for c = 1 and c = 2 in Theorem 5.2.
The given proofs make use of the (quasi-) triangle inequality of the Schatten (quasi-) norm.
Another ingredient to Theorem 5.2 is the power means inequality. Presumably, a combination
of these inequalities shall also be central when dealing with the case c 6= 1, 2. However, it is
unclear how to apply the triangle inequality in this situation, because there is no obvious way
to get from F (R + S)1/c to an expression where R and S can be separated.
R EFERENCES
[1] C.A. MCCARTHY, cp , Israel J. Math., 5 (1967), 249–271.
[2] J. WEIDMANN, Linear Operators in Hilbert Spaces, Springer, New York 1980.
[3] R. MEISE AND D. VOGT, Introduction to Functional Analysis, Clarendon Press, Oxford 1997.
[4] M. SIGG, Zur sesquilinearen Integration Schatten-operatorwertiger Funktionen, Konstanzer
Schriften in Mathematik und Informatik, 200 (2004), http://www.inf.uni-konstanz.de/
Preprints/.
[5] J.W. EATON et al, http://www.octave.org.
[6] E. GROSSMANN,
min.html.
http://octave.sourceforge.net/index/f/nelder_mead_
[7] W.H. PRESS, B.P. FLANNERY, S.A. TEUKOLSKY AND W.T. VETTERLING, Numerical Recipes
in C: The Art of Scientific Computing, Cambridge University Press 1992, http://www.nr.com.
[8] J. HERMAN, R. KUČERA AND J. ŠIMŠA, Equations and Inequalities, Springer, New York (2000).
J. Inequal. Pure and Appl. Math., 6(3) Art. 87, 2005
http://jipam.vu.edu.au/