Minimization Property of the Median m & |µ − m| ≤ σ

advertisement
Minimization Property of the Median m & |µ − m| ≤ σ
Fritz Scholz
Fact 1: |E(Y )| ≤ E(|Y |) by generalization of the triangle inequality.
Also from: |Y | ≥ Y ⇒ E(|Y |) ≥ E(Y ) and |Y | ≥ −Y ⇒ E(|Y |) ≥ E(−Y ) = −E(Y ),
hence |E(Y )| ≤ E(|Y |).
Fact 2: E(Y 2 ) ≥ (E(|Y |))2 since var(|Y |) = E(Y 2 ) − (E(|Y |))2 ≥ 0.
Fact 3 (Minimization Property of the Median): If m denotes the median of Y then
E(|Y − m|) ≤ E(|Y − a|) for any constant a, in particular for a = µ = E(Y ).
Thus the median m minimizes E(|Y − a|) over a.
This important property of the median parallels that of a = µ minimizing E[(Y − a)2 ].
Proof: We use the following notation: IA (x) = 1 when x ∈ A and IA (x) = 0 otherwise. Without
loss of generality we assume a < m.
We have the following identity (verified by checking for x ≤ a, x ≥ m and a < x < m):
|x − a| − |x − m| = (a − m)I(−∞,a] (x) + (m − a)I[m,∞) (x) + (2x − a − m)I(a,m) (x) .
Simple manipulation turns that into
|x − a| − |x − m| =
=
=
=
2(x − a)I(a,m) (x) + (a − m)I(a,m) (x) + (a − m)I(−∞,a] (x) + (m − a)I[m,∞) (x)
2(x − a)I(a,m) (x) + (a − m)I(−∞,m) (x) + (m − a)I[m,∞) (x)
2(x − a)I(a,m) (x) + (a − m)I(−∞,m) (x) + (m − a)(1 − I(−∞,m) (x))
2(x − a)I(a,m) (x) + (m − a) + 2(a − m)I(∞,m) (x) .
Using the random variable Y for x and taking expectations on both sides we get
E(|Y − a|) − E(|Y − m|) = E 2(Y − a)I(a,m) (Y ) + (m − a) − 2(m − a)P (Y < m) .
By definition of the median we have P (Y < m) ≤ 1/2 and P (Y > m) ≤ 1/2 (or equivalently
P (Y ≥ m) ≥ 1/2 and P (Y ≤ m) ≥1/2) and thus (m − a) − 2(m − a)P (Y < m) ≥ 0.
Furthermore, E 2(Y − a)I(a,m) (Y ) ≥ 0 since the random variable within E( ) is nonnegative.
Thus E(|Y − a|) − E(|Y − m|) ≥ 0, q.e.d.
The proof for |µ − m| ≤ σ now becomes a 1-liner:
Fact1
Fact3
Fact2
|µ − m| = |E(Y ) − m| = |E(Y − m)| ≤ E(|Y − m|) ≤ E(|Y − µ|) ≤
q
E(|Y − µ|2 ) = σ
q.e.d.
Download