Lecture IV

$ ' EE 5407 Part II: Spatial Based Wireless Communications Instructor: Prof. Rui Zhang E-mail: rzhang@i2r.a-star.edu.sg Website: http://www.ece.nus.edu.sg/stfpage/elezhang/ Lecture IV: MIMO Systems March 21, 2011 & 1 % ' $ Introduction to MIMO Systems • So far, channel fading is treated as a “disadvantageous” factor for reliable transmission. • From now on, we will look at how to make use of channel fading, rather than “compensating” for it. • Consider a MIMO system with t > 1 transmit antennas and r > 1 receive antennas: there are in total t × r physical links (very likely over different channels due to independent fading) • Intuitively, if each transmit antenna sends one (possibly independent) data stream, the total transmission rate can be increased dramatically over the SISO system: a technique so-called spatial multiplexing. • However, each receive antenna receives the combinations of transmitted signals from all transmit antennas. Also, each modulated symbol can be & 2 % $ ' transmitted over all or part of transmit antennas. Thus there may exist cross-talk among different data streams at the receiver. • One major challenge for MIMO spatial multiplexing is the transceiver design, whereby the transmitted symbols from multiple data streams can be recovered at the receiver with reasonable complexity and yet reasonably good performance. • We will see that whether the CSI is known at the transmitter can lead to very different transceiver designs for MIMO spatial multiplexing. & 3 % $ ' Outline • Review of capacity results for SISO AWGN (additive white Gaussian noise) and fading channels • MIMO AWGN channel – CSIT-known case: capacity; transceiver design (eigenmode transmission) – CSIT-unknown case: capacity; transceiver design (horizontal encoding, linear vs. nonlinear receivers) • MIMO fading channel – Ergodic capacity with or without CSIT – Outage capacity with or without CSIT & 4 % ' $ Channel Capacity Analysis • Channel capacity describes the maximum information rate that can be reliably transmitted over a channel subject to a given transmit power constraint. • The capacity for AWGN channels was derived by Claude Shannon in 1948. • The capacity of SISO fading channels has been studied by Goldsmith and Varaiyaa , Caire and Shamaib , et al. a A. J. Goldsmith and P. P. Varaiya, “Capacity of fading channels with channel side information,” IEEE Transactions on Information Theory, vol.43, No.6, pp.1986-1992, November 1997. b G. Caire and S. Shamai (Shitz), “On the capacity of some channels with channel state information,” IEEE Transactions on Information Theory, vol.45, No.6, pp.2007-2019, September 1999. & 5 % $ ' • The capacity of MIMO fading channels has been studied by Telatarc , Foshini and Gansd , et al. c I. E. Telatar, “Capacity of multi-antenna Gaussian channels,” European Transactions on Telecommunications, vol.10, No.6, pp.585-595, November 1999. d G. J. Foshini and M. J. Gans, “On limits of wireless communications in a fading environment when using multiple antennas,” Wireless Personal Communications, vol.6, No.3, pp.311-335, March 1998. & 6 % ' $ Capacity of SISO AWGN Channel • Consider the following SISO AWGN channel: y(n) = hx(n) + z(n), n = 1, . . . , N (1) – h ∈ C denotes the channel, which is constant and assumed known at the receiver – E[|x(n)|2 ] ≤ P , where P is the transmit power constraint – z(n) ∼ CN (0, σz2 ), is independent over n (AWGN) • Channel capacity measures the maximum information rate that can be reliably transmitted over the channel with any arbitrarily small probability of decoding error as the code length N goes to infinity. • Claude Shannon proved that the capacity of the SISO AWGN channel is & 7 % $ ' equal to the mutual information between x and y, which is given by I(x; y) = h(y) − h(y|x) = h(y) − h(z) (2) where h(X) is the differential entropy of a RV X (measuring the amount of uncertainty for X), and h(Y |X) denotes the conditional differential entropy of a pair of RVs X and Y (measuring the amount of uncertainty for Y conditional on knowing X). 2 2 • For a CSCG RV X ∼ CN (0, σX ), h(X) = log2 πeσX bits. • Since z(n) ∼ CN (0, σz2 ), we have h(z) = log2 πeσz2 . • The mutual information is then written as I(x; y) = h(y) − log2 πeσz2 (3) • The capacity is defined as the maximum mutual information over all possible distributions of x subject to E[|x(n)|2 ] ≤ P . & 8 % $ ' • Note that E[|y(n)|2 ] ≤ |h|2 P + σz2 . 2 • For a RV X with zero-mean and variance σX , h(X) is maximized when X is Gaussian distributed. • To maximize h(y), we need that y(n) is Gaussian distributed such that h(y) = log2 πe(|h|2 P + σz2 ). • This suggests that x also needs to be Gaussian distributed with E(|x(n)|2 ) = P . • Thus it follows that the capacity of the SISO AWGN channel is achieved when x(n) ∼ CN (0, P ) (i.e., x(n) is drawn from a Gaussian codebook), and is given by 2 |h| P (4) C = log2 πe(|h|2 P + σz2 ) − log2 πeσz2 = log2 1 + 2 σz where the capacity unit is bits/second/Hz (bps/Hz). & 9 % $ ' • Note that the capacity is equal to log2 (1 + γ), where γ = receiver SNR. |h|2 P σz2 is the • At the asymptotically low-SNR regime, i.e., γ → 0, since log(1 + x) ≈ x as x → 0, we have γ (5) C(γ → 0) ≈ log 2 Thus the capacity doubles for every 3dB increase in SNR. • At the asymptotically high-SNR regime, i.e., γ → ∞, since log(1 + x) ≈ log x as x → ∞, we have C(γ → ∞) ≈ log2 γ (6) Thus the capacity increases by 1 bps/Hz for every 3dB increase in SNR. & 10 % $ ' Capacity of SISO Fading Channel • Consider the following SISO fading channel similarly as defined for the SISO AWGN channel (with the symbol index n dropped): y = hx + z (7) whereas the channel h ∈ C is now a RV, which is constant during each transmission block, but can change from one block to the other (i.e., block-fading); it is assumed that the instantaneous channel h is known at the receiver. • Each transmission block consists of N symbols. • The instantaneous mutual information (IMI) between x and y & 11 % $ ' conditional on channel state h is given by 2 |h| P I(x; y|h) = log2 1 + σz2 (8) • Note that the IMI is a RV, the PDF of which is determined by the distribution of |h|2 . • Two types of fading channel capacities are defined as follows: – Ergodic capacity is defined as the statistical average of the IMI: 2 |h| P (9) Cerg = E log2 1 + σz2 where the expectation is taken over |h|2 . – Outage capacity is defined as the information rate, below which the & 12 % ' $ IMI falls with a prescribed probability q%: 2 |h| P Pr log2 1 + < Cout,q% = q% σz2 (10) where Cout,q% is called q% outage capacity. • Ergodic capacity has two important applications: – In the case where the instantaneous CSI on |h| is unknown at the transmitter, the transmitter can employ a long code that spans over different fading states of the channel to achieve reliable communication, provided that the information rate of the code is less than Cerg of the fading channel. This technique is known as coded diversity. In this case, N needs not to be large. – In the CSIT-known case, the transmitter can use different codes for different channel fading states. As long as the selected code for each fading state has a rate smaller than the IMI, reliable communication & 13 % ' is ensured. The maximum achievable average rate over all different fading states is then given by Cerg of the fading channel. This technique is known as adaptive coding. In this case, N needs to be sufficiently large. • According to Jensen’s inequalitya we have 2 2 |h| P E[|h| ]P ≤ log 1 + E log2 1 + 2 σz2 σz2 $ (11) Thus, the ergodic capacity of a fading channel is no larger than that of an AWGN channel with a constant channel gain, which is equal to the average channel gain of the fading channel. • Outage capacity is usually applicable to data traffic with stringent delay a Jensen’s inequality says that for a concave function f (x) where x is a RV, it holds that E[f (x)] ≤ f (E[x]). A function f (x) is concave iff for any two arbitrary values of x, x1 and x2 , and positive number λ between 0 and 1, we have λf (x1 )+(1−λ)f (x2 ) ≤ f (λx1 +(1−λ)x2 ). & 14 % ' requirements (e.g., voice, real-time video), where the information of each transmission block has a constant rate, and needs to be decoded at the receiver with a block decoding error probability less than the prescribed outage probability target. In this case, N needs to be sufficiently large to achieve reliable transmission for each block. $ • For a given outage probability target q%, the effective capacity is defined as the maximum average rate of the transmitted information successfully decoded at the receiver Ceff,q% = (1 − q%)Cout,q% (12) • The outage probability has different meanings for the CSIT-known and CSIT-unknown cases – In the CSIT-known case, the transmitter knows the instantaneous channel and thus the IMI. Thus, if an outage event will occur at the receiver, this is known at the transmitter, and is thus called & 15 % ' $ transmitter-aware outage. – In the CSIT-unknown case, an outage event is only known at the receiver but not at the transmitter, and is thus called receiver-aware outage. 2 • Let γ = |h|σ2P denote the instantaneous receiver SNR. Suppose that the z transmitter employs an optimal Gaussian codebook with information rate equal to Cout,q% . Then from (10) it follows that an outage event occurs iff γ < 2Cout,q% − 1 , γmin (13) • However, practical MCS cannot perform as well as the optimal Gaussian code. As a result, there is usually a positive SNR “gap” between the theoretical minimum SNR γmin and the actual required operating SNR γ̄, where γ̄ > γmin . Thus the outage probability is generally defined as & pout = Pr(γ < γ̄) (14) 16 % ' $ SISO Fading Channel: Ergodic Capacity Assume iid Rayleigh fading with σh2 = E[|h|2 ] = 1, and σz2 = 1. 7 6 Capacity (bps/Hz) 5 AWGN Channel Capacity Fading Channel Ergodic Capacity 4 3 2 1 0 −10 & −5 0 5 SNR (dB) 10 15 20 17 % ' $ SISO Fading Channel: Outage Capacity Assume iid Rayleigh fading with σh2 = E[|h|2 ] = 1, σz2 = 1, and P = 5. Outage Probability 1 0.8 0.6 0.4 0.2 0 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 3.5 4 4.5 5 Effective Capacity (bps/Hz) Outage Capacity (bps/Hz) & 1.5 1 0.5 0 0 0.5 1 1.5 2 2.5 3 Outage Capacity (bps/Hz) 18 % ' $ Capacity of MIMO AWGN Channel • Consider the following r × t MIMO channel (with symbol index n dropped): y = Hx + z (15) – H ∈ Cr×t is a constant matrix, which is known at the receiver – Assume Gaussian signals, i.e., x ∼ CN (0, S x ), where S x , E[xxH ] is the covariance matrix for the transmitted signal vector – If Rank(S x ) = 1, it is called “beamforming mode”; if Rank(S x ) > 1, it is called “spatial multiplexing mode” – Tr(S x ) = E[kxk2 ] ≤ P , where P is the sum-power constraint at the transmitter – z ∼ CN (0, σz2 I r ): spatially white Gaussian noise • The capacity of the MIMO AWGN channel for a given S x is equal to the & 19 % $ ' mutual information between x and y, which is given by C = I(x; y) = h(y) − h(y|x) = h(y) − h(z) (16) • For a CSCG random vector x ∼ CN (0, S x ), h(x) = log2 πedet(S x ) bits. • Note that y ∼ CN (0, HS x H H + σz2 I r ) and z ∼ CN (0, σz2 I r ). • For square matrices A and B, det(A)/det(B) = det(AB −1 ). Thus we have C = log2 πedet(HS x H H + σz2 I r ) − log2 πedet(σz2 I r ) 1 = log2 det I r + 2 HS x H H σz (17) where the unit is bits/second/Hz (bps/Hz). • Next, we study the design of S x to maximize the MIMO AWGN channel capacity for the CSIT-known case and CSIT-unknown case, respectively, as well as their respective MIMO transceiver designs. & 20 % ' $ MIMO AWGN Channel With CSIT • Assume that H is perfectly known at the transmitter. • We want to find the optimal S x to maximize the MIMO channel capacity subject to Tr(S x ) ≤ P . • First, let the truncated SVD of H denoted by H = Ũ Λ̃Ṽ H (18) H where Ũ ∈ Cr×m with m = Rank(H), and Ũ Ũ = I m ; Ṽ ∈ Ct×m with H Ṽ Ṽ = I m ; and Λ̃ is a m × m strictly positive diagonal matrix with the diagonal elements (singular values) given by λ1 ≥ . . . ≥ λm > 0. • Then the MIMO AWGN channel capacity given in (17) is written as 1 H H C = log2 det I r + 2 Ũ Λ̃Ṽ S x Ṽ Λ̃Ũ (19) σz & 21 % $ ' H • Let S̃ x = Ṽ S x Ṽ . Using the fact that det(I + AB) = det(I + BA), we have 1 2 (20) C = log2 det I m + 2 Λ̃ S̃ x σz • The Hadamard’s inequality states that the determinant of any positive semi-definite matrix A is less than the product of all its diagonal elements, i.e., Y [A]i,i (21) det(A) ≤ i with equality iff A is a diagonal matrix. • Thus we have C≤ & m X i=1 log2 λ2i [S̃ x ]i,i 1+ σz2 ! (22) 22 % $ ' with equality iff S̃ x is a diagonal matrix, i.e., H S̃ x = Ṽ S x Ṽ = Σ (23) where Σ is a m × m non-negative diagonal matrix. • Then we conclude that the optimal transmit covariance to maximize the MIMO channel capacity is in the form of S opt x = Ṽ ΣṼ H (24) • Let the diagonal elements of Σ be denoted by p1 , . . . , pm . Then the resulted MIMO channel capacity is given by m X λ2i pi log2 1 + 2 Copt = (25) σ z i=1 • Consider now the power constraint: & Tr(S opt x ) ≤ P (26) 23 % $ ' Since Tr(AB) = Tr(BA), we have H H Tr(S opt x ) = Tr(Ṽ ΣṼ ) = Tr(ΣṼ Ṽ ) = Tr(Σ) = m X i=1 pi ≤ P (27) • One method to set pi ’s is the “equal-power (EP)” allocation: pi = P , i = 1, . . . , m m (28) And the resultant capacity is CEP = m X i=1 & 2 λP log2 1 + 2i σz m (29) 24 % $ ' Eigenmode Transmission • According to S opt x , we can design the transmitted signal vector as 1 x = Ṽ Σ 2 s (30) where – Ṽ is called “precoding” matrix – Σ is called “power allocation” matrix – s ∈ Cm×1 is the information signal vector, s ∼ CN (0, I m ) – Check power constraint: P 1 1 H E[kxk2 ] = E[xH x] = E[sH Σ 2 Ṽ Ṽ Σ 2 s] = E[sH Σs] = m i=1 pi & 25 % $ ' • Substituting this form of x into the MIMO channel (15) yields 1 2 y = H Ṽ Σ s + z H 1 2 = Ũ Λ̃Ṽ Ṽ Σ s + z 1 = Ũ Λ̃Σ 2 s + z (31) • Suppose that the receiver multiplies the received signal vector by the H “decoding” matrix Ũ . We thus have H ỹ = Ũ y H 1 2 = Ũ Ũ Λ̃Σ s + z̃ 1 = Λ̃Σ 2 s + z̃ (32) H where z̃ = Ũ z ∼ CN (0, σz2 I m ) • Let ỹ = [ỹ1 , . . . , ỹm ]T , s = [s1 , . . . , sm ]T , and z̃ = [z̃1 , . . . , z̃m ]T . 1 2 • Since Λ̃Σ is a diagonal matrix, we see from (32) that the MIMO & 26 % ' channel has been decomposed into a set of m parallel SISO channels represented by √ (33) ỹi = λi pi si + z̃i , i = 1, . . . , m $ • The receiver SNR for the ith decomposed SISO channel is given by λ2i pi γi = 2 σz (34) and the channel capacity is given by 2 λ pi Ci = log2 (1 + γi ) = log2 1 + i 2 σz (35) • Then the sum capacity over m SISO channels is given by m m X X λ2i pi Ci = log2 1 + 2 = Copt σ z i=1 i=1 (36) • Thus, the joint deployment of linear precoders Ṽ and linear decoders Ũ & 27 H % $ ' achieves the MIMO AWGN channel capacity in the case of known CSIT. • The transceiver design based on the SVD of the MIMO channel matrix is usually called “eigenmode transmission”. & 28 % $ ' Water-Filling Power Allocation • We can further optimize pi ’s subject to given in (25). Pm i=1 pi ≤ P to maximize Copt • This yields the following optimization problem: m X λ2i pi Maximize log2 1 + 2 {pi } σz i=1 Subject to m X i=1 pi ≤ P pi ≥ 0, i = 1, . . . , m (37) • Applying the Lagrange multiplier method to solve this problem, the cost & 29 % $ ' function is written as J(p1 , . . . , pm ) = m X i=1 log2 1 + λ2i pi σz2 −υ m X pi (38) i=1 where υ ≥ 0 is the Lagrange multiplier associated with the sum-power constraint. • The cost function is maximized when the derivatives of J(·) over pi , i = 1, . . . , m, are all equal to zero. Thus we have 1 • Let µ = & 1 . υ log 2 λ2i σz2 λ2i pi + σ2 z = υ log 2, i = 1, . . . , m (39) We have σz2 pi = µ − 2 , i = 1, . . . , m λi (40) 30 % $ ' • Furthermore, using the fact that pi ≥ 0 yields the optimal power allocation as 2 + σ pi = µ − z2 , i = 1, . . . , m λi (41) where (x)+ , max(0, x). • The constant µ can be determined from the equality m 2 + X σ =P µ − z2 λi i=1 (42) • The above power allocation rule is called “water-filling (WF)” where – µ: fixed water-level for all sub-channels – σz2 : λ2i normalized (to sub-channel power gain) noise power of the ith sub-channel – pi : water poured (power allocated) to the ith sub-channel & 31 % $ ' • The capacity of the MIMO AWGN channel with WF power allocation is  +  2 σz 2 + 2 m m µ − λ X X 2 i λi µλi   CWF = log2 1 + (43) log2 = 2 2 σ σ z z i=1 i=1 & 32 % $ ' Optimality of Beamforming • First, consider the r × 1 SIMO AWGN channel: y = hx + z (44) • The (truncated) SVD of h is h= h khk1 khk (45) • Thus the optimal precoder is trivially 1, and the optimal decoder is hH u = khk H (46) • This is same as MRC receive beamforming. & 33 % $ ' • The receiver SNR after applying MRC beamforming is γSIMO khk2 P = σz2 (47) • The SIMO AWGN channel capacity is P H 1 H CSIMO = log2 det I r + 2 hP h = log2 1 + 2 h h σz σz 2 khk P = log2 (1 + γSIMO ) = log2 1 + 2 σz (48) • Thus, MRC receive beamforming achieves the SIMO AWGN channel capacity. • Next, consider the following 1 × t MISO channel: & y = hT x + z (49) 34 % $ ' • The (truncated) SVD of hT is hT h = 1khk khk T (50) • Thus the optimal decoder is trivially 1, and the optimal precoder is h∗ v= khk (51) • This is same as P-MRC transmit beamforming (without power gain √ P ). • The optimal transmit covariance matrix is H S opt = vP v = x P ∗ T h h 2 khk (52) • Note that Rank(S opt x ) = 1. Thus “beamforming mode” is optimal. • The SNR of the MISO channel after applying P-MRC transmit & 35 % $ ' beamforming is γMISO khk2 P = σz2 (53) • The MISO AWGN channel capacity is 4 1 P khk ∗ = log 1 + CMISO = log2 1 + 2 hT S opt h 2 x σz σz2 khk2 khk2 P = log2 1 + = log2 (1 + γMISO ) 2 σz (54) • Thus, P-MRC transmit beamforming achieves the MISO AWGN channel capacity. • Last, consider the strongest eigenmode beamforming (SEB) scheme for the MIMO AWGN channel: & y = Hx + z (55) 36 % $ ' • Recall that the precoder for this scheme is given by √ opt wt = P v 1 (56) where v 1 is the first column of V (Recall that the SVD of H is U ΛV H ). • Thus the transmit covariance matrix for SEB is S SEB = P v1vH x 1 (57) ) = 1, thus beamforming mode is used. Note that Rank(S SEB x • With this transmit covariance, the maximum achievable rate is P H RSEB = log2 det I r + 2 Hv 1 v H H 1 σz P P = log2 1 + 2 kHv 1 k2 = log2 1 + 2 ku1 λ1 k2 σz σz 2 λP = log2 1 + 1 2 σz & (58) 37 % $ ' where u1 is the first column of U . = u1 yields the receiver SNR as • Applying the decoder wopt t γSEB • Then it follows that 2 2 Hv | P |uH λ 1 1 1P = = 2 2 σz |uH z| 1 RSEB = log2 (1 + γSEB ) = log2 1 + (59) λ21 P σz2 (60) Thus, the SEB achieves the maximum rate over the AWGN MIMO (i.e., channel with rank-one transmit covariance matrix S SEB x beamforming mode is used). • However, in general the capacity of the MIMO AWGN channel is & 38 % $ ' achieved by spatial multiplexing mode with Rank(S opt x ) ≥ 1:  +  2 σz 2 m µ − λ X i λ2i   log2 1 + CWF =  2 σ z i=1 (61) • Clearly, RSEB is equal to CWF iff σz2 µ− 2 = P λ1 σz2 µ − 2 ≤ 0, i = 2, . . . , m λi (62) (63) • Since λ2 ≥ λ3 ≥ . . . ≥ λm , the conditions in (62) and (63) are satisfied iff 2 σz2 σz σz2 σz2 P+ 2 ≤ 2 ⇒P ≤ 2 − 2 (64) λ1 λ2 λ2 λ1 • Thus the SEB is capacity optimal only when P is sufficiently small. & 39 % $ ' Asymptotic Capacity Analysis • We have shown that for the asymptotically low-power regime, i.e., P → 0, the MIMO AWGN channel capacity is achieved by beamforming mode and behaves as λ21 P λ21 P (65) ≈ CMIMO (P → 0) = log2 1 + 2 σz (log 2)σz2 • Recall that the SISO AWGN channel capacity at the asymptotically low-power regime behaves as |h|2 P CSISO (P → 0) ≈ (log 2)σz2 (66) • Thus the capacity gain of the MIMO system over the SISO system for & 40 % $ ' the asymptotically low-power regime is given by λ21 CMIMO (P → 0) = 2 CSISO (P → 0) |h| (67) which is the achievable by the strongest eigenmode beamforming (SEB). • On the other hand, consider the asymptotically high-power regime, i.e., P → ∞, the MIMO AWGN channel capacity is achieved by spatial multiplexing mode and behaves as (lower-bounded by equal-power allocation) m 2 X λi P CMIMO (P → ∞) ≥ ≈ c + m log2 P log2 1 + 2 (68) σ m z i=1 where c = λ2i i=1 log2 σz2 m Pm is a constant independent of P . • Notice that the MIMO AWGN channel capacity with the optimal WF power allocation is upper-bounded by (considering each sub-channel is & 41 % $ ' allocated with full-power P ) CMIMO (P → ∞) ≤ where c̃ = λ2i i=1 log2 σz2 Pm m X i=1 log2 1 + λ2i P σz2 ≈ c̃ + m log2 P (69) is still a constant independent of P . • With upper and lower capacity bounds, the asymptotic ratio between the MIMO channel capacity and the logarithm (base-2) of the transmission power as the power goes to infinity, so-called “spatial multiplexing gain”, is given by CMIMO =m P →∞ log2 P (70) lim • Note that for very large values of P , the MIMO channel capacity (using either EP or WF power allocation) increases by m bps/Hz for every 3dB increase in P . & 42 % ' • Since for the SISO system, the channel capacity with P → ∞ behaves as where ĉ = CSISO (P → ∞) ≈ ĉ + log2 P (71) CMIMO (P → ∞) =m CSISO (P → ∞) (72) |h|2 log2 σ2 . z • Thus it follows that • Note that the spatial multiplexing gain is equal to m, the rank of the MIMO channel matrix, and is not necessarily equal to min(t, r) unless the channel matrix is full-rank, which occurs with probability one for the case of iid Rayleigh fading MIMO channel. Thus, independent channel fading is an advantageous factor for maximizing the spatial multiplexing gain of MIMO channel. & $ 43 % ' $ MIMO AWGN Channel Capacity With CSIT Assume t = r = 2, H = [1 ρ; ρ 1] with ρ = 0.5, and σz = 1. 10 9 Achievable Rate (bps/Hz) 8 7 SISO Capacity MISO/SIMO Capacity MIMO: Strongest Eigenmode Beamforming (SEB) MIMO: Eigenmode Transmission with EP MIMO Capacity: Eigenmode Transmission with WF 6 5 4 3 2 1 0 −10 & −5 0 5 10 15 20 SNR (dB) 44 % ' $ MIMO AWGN Channel Without CSIT • For the CSIT-known case, we have shown that the linear precoders and decoders based on the MIMO channel SVD (eigenmode transmission) together with WF power allocation achieve the MIMO channel capacity. Furthermore, the MIMO channel is decomposed into parallel SISO AWGN channels, which greatly simplifies the transceiver design. • Now consider the case where the channel H is only known at the receiver, but is unknown at the transmitter. • Assume that the transmitter employs an “isotropic transmission” with the “white” signal covariance matrix given by S (w) x = P It t (73) • Note that Tr(S (w) x ) = P. & 45 % ' • With the white transmit covariance, the capacity of the MIMO AWGN channel is given by 1 H C (w) = log2 det I r + 2 HS (w) x H σz P = log2 det I r + 2 HH H σz t P H H = log2 det I r + 2 Ũ Λ̃Ṽ Ṽ Λ̃Ũ σz t P 2 = log2 det I m + 2 Λ̃ σz t m X λ2i P = log2 1 + 2 (74) σ t z i=1 • Note that C (w) is identical to CEP in the case of known CSIT with equal-power allocation if t = m, i.e., t ≤ r and H is full-rank. & 46 $ % ' • It follows from (74) that the spatial multiplexing gain for the MIMO AWGN channel with the white transmit covariance is m, which is the rank of the MIMO channel matrix, like in the CSIT-known case. $ • However, we will see soon that the transceiver design in the CSIT-unknown case to achieve C (w) is in general far more complex as compared with the eigenmode transmission in the CSIT-known case. • Before that, let’s take a look at the special case with t = 2 and r = 1. Recall that in Lecture III we have introduced the Alamouti code, for P which the transmit covariance matrix can be shown to be S (w) = I . x 2 2 • In this case, from (74) we see that the MISO AWGN channel capacity with the white transmit covariance is given by 2 khk P (75) = log2 (1 + γAC ) C (w) = log2 1 + 2 2σz & 47 % ' where γAC = 2 khk P 2σz2 $ is the SNR after decoding the Alamouti code. • Thus Alamouti code achieves the MISO AWGN channel capacity with t = 2. • In Lecture III, we have also introduced a heuristic scheme (which achieves the same diversity order of 2 as the Alamouti code) using alternate transmission and repetition coding over two transmit antennas. The maximum achievable rate for this scheme is given by 2 1 khk P R = log2 1 + (76) 2 2 σz where the factor 12 in front of the log function is due to repetition coding. 1 x • Since log 1 + 2 ≥ 2 log(1 + x), ∀x ≥ 0, we can show that for any P ≥ 0 & R ≤ C (w) (77) 48 % $ ' • As P → ∞, we have khk 1 log2 (P ) + 12 log2 σ2 R 2 z lim (w) = lim 2 P →∞ C P →∞ khk log2 (P ) + log2 2σ2 2 = 1 2 (78) z Thus, the achievable rate of the heuristic scheme is half of that of the Alamouti code (the channel capacity) at the high-power regime, due to repetition coding. • On the other hand, as P → 0, we have khk 2 P (2 log 2)σz2 R = lim =1 lim 2 P →0 C (w) P →0 khk P (79) (2 log 2)σz2 Thus, the achievable rate of the heuristic scheme is equal to that of the Alamouti code (the channel capacity) at the low-power regime. & 49 % $ ' • Next, consider the case of t = 2 and r ≥ 2. In Lecture III, we have introduced the joint transmit diversity (using Alamouti code) and receiver beamforming (using MRC) scheme to achieve the full diversity order tr for the iid Rayleigh fading MIMO channel. • Assume that H is full-rank and thus m = min(t, r) = 2. The maximum achievable rate for this scheme is given by Ωsum P (λ21 + λ22 )P R = log2 1 + = log2 1 + (80) 2 2 2σz 2σz P = I is given by • The capacity of the MIMO channel with S (w) x 2 2 2 2 λ1 P λ2 P (w) C = log2 1 + + log2 1 + 2σz2 2σz2 & (81) 50 % $ ' • As P → ∞, we have λ21 +λ22 log2 2σ2 z log2 (P ) + R = lim lim P2 P →∞ C (w) P →∞ 2 log (P ) + 2 λ2i i=1 log2 2σz2 1 = 2 (82) Thus, the achievable rate of this scheme is half of the channel capacity at the high-power regime, although having the same S (w) x . • This is because the Alamouti code on the average transmits only one data stream with two transmit antennas. • As a counterpart, as P → 0, we have R = lim P →0 C (w) (λ21 +λ22 )P (2 log 2)σz2 lim 2 2 P →0 (λ1 +λ2 )P (2 log 2)σz2 =1 (83) Thus, the achievable rate of this scheme is equal to the channel capacity at the low-power regime. & 51 % ' $ Horizontal Encoding • For general MIMO system configurations, how to achieve/approach the channel capacity C (w) with the white transmit covariance by practical transceivers is a challenging task in the CSIT-unknown case. • A practical transmitter design for spatial multiplexing is “Horizontal Encoding”, where the transmitted signal vector is given by r P x= [s1 , . . . , st ]T (84) t where s1 , . . . , st correspond to independently encoded data streams with information rates R1 , . . . , Rt ; and E[|si |2 ] = 1, i = 1, . . . , t. Pt • Then the transmission sum-rate is Rsum = i=1 Ri . • The above spatial multiplexing scheme is also popularly known as “Bell Labs Layered Space Time (BLAST)” transmission scheme. & 52 % ' • For horizontal encoding, each receive antenna receives the combined signal of all transmitted data streams. Thus spatial signal processing over all receive antennas is needed to recover each data stream by suppressing the interference from other data streams. $ • There are in general two classes of receivers for horizontal encoding – Linear receiver: zero-forcing (ZF) receiver; minimum-mean-squared-error (MMSE) receiver – Nonlinear receiver with successive interference cancelation (SIC): ZF-SIC receiver; MMSE-SIC receiver • Note that another possible scheme for MIMO spatial multiplexing is “Vertical Encoding”, where s1 , . . . , st are jointly encoded rather than independently encoded as for horizontal encoding. At the receiver, iterative (soft) MIMO detection and channel decoding based on the maximum likelihood (ML) principle are applied to decode the information from all data streams. & 53 % ' $ Linear Receiver • Consider the MIMO AWGN channel: y = Hx + z with S (w) x = P I, t t (85) and z ∼ CN (0, σz2 I r ). Assume that H is full-rank. • Linear receiver first applies a decoding matrix T ∈ Ct×r to extract the signals from different data streams and then decodes them separately. ỹ = T y = T Hx + T z (86) • Two design criteria are commonly adopted for linear receivers – ZF: T is designed to satisfy T H = I t , i.e., the interference in each received data stream due to all other data streams is removed. – MMSE: T is designed to minimize the MSE between ỹ and x: E[kỹ − xk2 ]. & 54 % ' $ ZF Receiver • For ZF receiver, in order to have a feasible T such that T H = I t , the channel matrix H needs to be square or tall, i.e., r ≥ t. • If r = t and H is a square matrix, the only candidate for T to make T H = I t is T = H −1 (87) • However, if r > t and H is a tall matrix, there are more than one candidates for T to make T H = I t . • In this case, one commonly adopted choice for T is the pseudo-inverse of H given by −1 H H H , H† (88) T ZF = H H Note that T ZF becomes H −1 if r = t. & 55 % $ ' • The reason for choosing H † as T ZF in the case of r > t is given next. • Suppose that there is no noise at the receiver, i.e., z = 0. Thus, the MIMO channel becomes y = Hx (89) • The above MIMO system consists of r linear equations with t < r unknowns, and is thus over-determined. • Let x̂ be an estimate of x. It is desirable to minimize the squared error between H x̂ and y, denoted by J(x̂) = kH x̂ − yk2 . • Then we have J(x̂) = (H x̂ − y)H (H x̂ − y) & = x̂H H H H x̂ − y H H x̂ − x̂H H H y + y H y (90) 56 % $ ' • For any complex vector x, we have dxH Ax dxH y dy H x = Ax, = y, =0 ∗ ∗ ∗ dx dx dx (91) • In order to find x̂ to minimize the squared error, we have dJ(x̂) H H ∗ = 0 ⇒ H H x̂ = H y dx̂ (92) • Thereby we obtain the so-called “least-square (LS)” estimate of x as −1 H H x̂LS = H H H y = T ZF y (93) • Thus the LS estimation matrix is identical to the ZF decoding matrix. • For r ≥ t and H is full-rank, the truncated SVD of H is given by H H = Ũ Λ̃V H , where Ũ ∈ Cr×t with Ũ Ũ = I t ; V ∈ Ct×t with V H V = V V H = I t ; and Λ̃ is a t × t diagonal matrix with the diagonal elements (singular values) given by λ1 ≥ . . . ≥ λt > 0. & 57 % $ ' • Then it can be verified that −1 H H −1 H H = V Λ̃Ũ Ũ Λ̃V H V Λ̃Ũ T ZF = H H H −2 = (V Λ̃ V H )V Λ̃Ũ H −1 = V Λ̃ Ũ H (94) • Substituting the above alternative expression for T ZF into (86) yields ỹ = x + z̃ −1 H where z̃ = V Λ̃ Ũ z ∼ CN (0, σz2 V −2 (95) H −2 Λ̃ V ), V Λ̃ V H H = H H −1 . • For the above equivalent channel, it is observed that the total signal Pt σz2 2 2 power is E[kxk ] = P , and the total noise power is E[kz̃k ] = i=1 λ2 . i Thus, the average SNR is given by ZF γavg & E[kxk2 ] P = = Pt E[kz̃k2 ] i=1 σz2 λ2i λ2t P ≤ 2 σz (96) 58 % ' where the upper bound is determined by the smallest singular value λt . $ • This disadvantageous phenomenon for ZF receiver is so-called “noise enhancement” due to the fact that the LS estimator in fact ignores the noise effect at the receiver. • Let x = [x1 , . . . , xt ]T , z̃ = [z̃1 , . . . , z̃t ]T , and vij = [V ]i,j , i = 1, . . . , t, j = 1, . . . , t. • The SNR of the ith data stream with ZF receiver is γiZF P E[kxi k2 ] t = = P E[kz̃i k2 ] σz2 tj=1 |vij |2 λ2j = σz2 [(H P t H H)−1 ]i,i , i = 1, . . . , t (97) • Thus, the achievable sum-rate for ZF receiver is ZF = Rsum & t X log2 (1 + γiZF ) (98) i=1 59 % ' $ MMSE Receiver • MMSE receiver is the optimal linear receiver that maximizes the receiver SNR for each data stream, where the noise includes both the additive noise and the interference from all other data streams. • In general, MMSE receiver works for any t and r. However, in order to achieve spatial multiplexing gain equal to t, we need that r ≥ t. • Let the ith row of the MMSE decoding matrix T MMSE be tH i , i = 1, . . . , t, where ti ∈ Cr×1 . • The total MSE to be minimized by T MMSE is expressed as t X H 2 2 E |ti y − xi | E kT MMSE y − xk = (99) i=1 • Thus the total MSE can be minimized by independently optimizing ti ’s. & 60 % ' 2 • Let Ji (ti ) = E[|tH i y − xi | ], i = 1, . . . , t, be the MSE for the ith data stream, which can be further expressed as $ H ∗ Ji (ti ) = E[(tH y − x )(y t − x i i i i )] H H 2 = tH i Ryy ti − r xy ti − ti r xy + E[|xi | ] (100) where Ryy r xy P = E[yy ] = E[(Hx + z)(Hx + z) ] = HH H + σz2 I r t P = E[yx∗i ] = E[(Hx + z)x∗i ] = hi (101) t H H • Note that H = [h1 , . . . , ht ]. • The MMSE estimator ti is then obtained as & dJi (ti ) = 0 ⇒ Ryy ti = r xy ∗ dti (102) 61 % $ ' • Thus we have ti = R−1 yy r xy = where Γ = HH H + 1 Ir Γ −1 hi , i = 1, . . . , t (103) P . tσz2 • Substituting ti = R−1 yy r xy into (100) yields the minimum MSE for the ith data stream as −1 R Jimin = E[|xi |2 ] − r H xy yy r xy • Furthermore, we have T MMSE −1 1 H H HH + I r =H Γ (104) (105) • As Γ → ∞, i.e., the noise effect can be safely ignored, it can be shown that T MMSE → T ZF . & 62 % $ ' • Let H {−i} be the matrix H with the ith column being deleted, i = 1, . . . , t. • Using the matrix inversion lemma, (A − BD −1 C)−1 = A−1 + A−1 B(D − CA−1 B)−1 CA−1 (106) H + (1/Γ)I , B = h , C = h with A = H {−i} H H r i {−i} i , and D = −1, we have −1 −1 1 A−1 hi hH i A H −1 HH + I r =A − (107) H −1 Γ 1 + hi A hi • Thus, from (103), we obtain an alternative expression for ti as ti = 1 A−1 hi 1 + βi (108) −1 where βi = hH i A hi . • Using this new expression for ti , from (104) it follows that the minimum & 63 % $ ' MSE is given by 1 P P −1 − hH A hi i t t 1 + βi P βi P P 1 = − = t t 1 + βi t 1 + βi Jimin = E[|xi |2 ] − r H xy ti = (109) • Now consider the equivalent SIMO channel for decoding the ith data stream given by y = hi xi + z i (110) P where z i = j6=i hj xj + z denotes the effective noise for the ith data stream including both the interference from all other data streams and the additive noise (SIMO channel with correlated noise studied in Lecture II). • Note that z i ∼ CN (0, (P/t)A). & 64 % $ ' • Let ỹ = [ỹ1 , . . . , ỹt ]T . Applying ti given in (108) to y yields ỹi = tH i y = tH i hi xi 2 where z̃i = tH z ∼ CN (0, σ i z̃ ). i H • Note that σz̃2 = tH i E[z i z i ]ti = + 1 1+βi tH i zi 2 βi = xi + z̃i 1 + βi −1 P H −1 h A AA hi t i = • Thus the MMSE receiver SNR for the ith data stream is 2 2 βi βi P 2 E[|x | ] i 1+βi 1+βi t MMSE = = = βi γi 2 σz̃2 1 P β 1+βi t i (111) 1 1+βi 2 P β. t i (112) • Notice that for the SIMO channel given in (110) with correlated noise, we have shown in Lecture II that the optimal receive beamforming −1 P hi , which is identical to the MMSE decoding vector is wopt = t A vector ti in (108) if ignoring the multiplication constants. & 65 % $ ' • Furthermore, wopt maximizes the receiver SNR as (see (51) of Lecture II) −1 P P H −1 hi = hH A hi = βi (113) A γmax = hi i t t which is same as that achieved by the MMSE decoder. • The achievable sum-rate for MMSE receiver is MMSE Rsum = t X log2 (1 + γiMMSE ) (114) i=1 where the achievable rate for the ith data stream is RiMMSE = log2 (1 + γiMMSE ) = log2 (1 + βi ). • It can be shown that RiMMSE is indeed the capacity of the SIMO channel given in (110) for the ith data stream. & 66 % ' $ Nonlinear Receiver • Nonlinear receiver applies successive interference cancelation (SIC) to decode different data streams. • Assume that the decoding order for the SIC is given by the reverse of the transmit antenna index, i.e., the data stream from the 1st transmit antenna is decoded last, the data stream from the 2nd transmit antenna is decoded second last, ...., the data stream from the tth transmit antenna is decoded first. • Then the SIC is described as follows. – Step 1: Apply a linear decoder tt to the received signal y, and extract the data stream from the tth transmit antenna & ỹt = tH t y (115) 67 % $ ' – Step 2: Decode from ỹt the tth data stream information st , and reconstruct xt – Step 3: Subtract ht xt from y, and yield y {−t} = y − ht xt = H {−t} x{−t} + z (116) where H {−t} is H with the tth column deleted, and x{−t} is x with the tth element deleted. – Step 4: Return to Step 1, extract and decode the (t − 1)-th data stream from y {−t} , and then subtract the corresponding signal component from y {−t} , until all t data streams are decoded. • For each iteration of Step 1, if ti , i = t, t − 1, . . . , 2, is designed using the ZF criterion, the nonlinear receiver is called ZF-SIC; while if the MMSE criterion is used, it is called MMSE-SIC. Note that t1 is given by the receive MRC for both cases of ZF-SIC and MMSE-SIC. & 68 % ' $ ZF-SIC • For each iteration of SIC, the ZF decoding vector tH i , i = 1, . . . , t, can be obtained as the last row of the matrix H †[i] , where H [i] = [h1 , . . . , hi ] (117) • Alternatively, the decoding vectors for ZF-SIC can be derived from the QR decomposition of the MIMO channel matrix. • Assume that r ≥ t and H is full-rank, the truncated QR decomposition of H is given by H = QR (118) where Q ∈ Cr×t satisfies that QH Q = I t , and R ∈ Ct×t is an upper-triangular matrix with φij = [R]i,j = 0, ∀i > j. • The QR decomposition of H can be obtained using the “Gram-Schmidt” & 69 % $ ' procedure as presented next. • Let Q = [e1 , . . . , et ], where kei k = 1, i = 1, . . . , t and eH i ej = 0, ∀i 6= j. • First, since h1 = φ11 e1 , we obtain immediately e1 = h1 , φ11 = kh1 k kh1 k (119) • Second, since h2 = φ12 e1 + φ22 e2 , we multiply both left- and right-hand sides by eH 1 to obtain φ12 = eH 1 h2 (120) Then by letting u2 = h2 − φ12 e1 , we obtain φ22 = ku2 k, e2 = u2 ku2 k • Thus in general for any i ∈ {1, . . . , t} with hi = & (121) P j<i φji ej + φii ei , we 70 % $ ' can obtain first φji = eH j hi , ∀j < i and then ui = hi − P j<i (122) φji ej , and finally ui φii = kui k, ei = kui k (123) • The decoding matrix for ZF-SIC is given by T ZF−SIC = QH (124) H • Note that the ith row of T ZF−SIC is tH i = ei , i = 1, . . . , t. • Applying T ZF−SIC to y yields ỹ = T ZF−SIC y = QH QRx + QH z = Rx + z̃ (125) where z̃ = QH z ∼ CN (0, σz2 I t ). & 71 % $ ' • Thus we have the equivalent channel for decoding the ith data stream as X ỹi = φii xi + φij xj + z̃i , i = 1, . . . , t (126) j>i • Since for decoding the ith data stream, all data streams with index j > i have already been decoded and thus can be subtracted from ỹi . Thus, the effective channel for the ith data stream becomes X ŷi = ỹi − φij xj = φii xi + z̃i , i = 1, . . . , t (127) j>i • Thus the SNR for the ith data stream by ZF-SIC is given by γiZF−SIC & φ2ii Pt = 2 σz (128) 72 % $ ' • The achievable sum-rate for ZF-SIC is given by ZF−SIC = Rsum t X log2 (1 + γiZF−SIC ) (129) i=1 • Next, we show that the precoders obtained by the channel QR decomposition are identical to those obtained by the last rows of H †[i] ’s. • Without loss of generality, consider i = t and thus H [t] = H. • We thus have H † = (H H H)−1 H H = (RH QH QR)−1 RH QH = R† QH = R−1 QH • The last row of H † is thus given by [R −1 ]t,t eH t 1 H = e φtt t (130) where we have used the fact that for an upper-triangular matrix R, R−1 & 73 % ' is also upper-triangular, and [R]ii and [R−1 ]ii are inverses of each other. $ H • This row vector is thus identical to tH t = et by ignoring the multiplication constant 1/φtt . • Furthermore, we obtain an alternative expression for the SNR of the tth data stream by the linear ZF receiver as γtZF = γtZF−SIC φ2tt Pt = 2 σz (131) • Comparing this with that given in (97), we have 1 2 = φ tt H [(H H)−1 ]t,t (132) • In general, it can be shown that & γiZF < γiZF−SIC , 1 ≤ i < t (133) 74 % $ ' • Thus we have ZF ZF−SIC < Rsum Rsum (134) • So far, we have assumed that the decoding order for ZF-SIC is fixed as the inverse of the antenna index. • Notice that in total there are t! different decoding orders for ZF-SIC, each of which corresponds to a different channel QR decomposition and ZF−SIC thus different φii ’s, γiZF−SIC ’s, as well as Rsum . • Let P be a t × t permutation matrix (each row/column has one element equal to one and zeros elsewhere), which specifies the decoding order. • Then consider the following QR decomposition: HP = Qp Rp (135) • Note that H = Qp Rp P −1 = Qp Rp P since P −1 = P . & 75 % $ ' • By letting T ZF−SIC = QH p , we obtain H Q R P x + Q ỹ = T ZF−SIC y = QH p p p p z = Rp xp + z̃ (136) 2 z ∼ CN (0, σ where xp = P x is a permuted version of x and z̃ = QH p z I t ). • In the case r = t and for the asymptotically high-power region, i.e., P → ∞, we have ! t t P 2 X Y φ P ZF−SIC Rsum = log2 1 + ii2t ≈ t log2 2 + log2 φ2ii σz σz t i=1 i=1 P = t log2 2 + log2 |det(H)|2 σz t 2 H 2 (137) 2 since |det(H)| = det(H H) = det(R ) = (det(R)) = Qt i=1 φ2ii . • Thus the achievable rate of ZF-SIC is independent of the decoding order for the high-power regime. & 76 % $ ' MMSE-SIC • For MMSE-SIC, the effective SIMO channel for decoding the ith data stream with data streams of index j > i already decoded and subtracted from y is given by ŷ i = hi xi + z i (138) P where z i = j<i hj xj + z denotes the total noise including both the interference from data streams with index j < i and the additive noise. • From our previous study, we know that the optimal MMSE decoding vector for the channel given in (138) is ti = & 1 A−1 i hi 1 + αi (139) 77 % $ ' where Ai = X hj hH j j<i −1 αi = hH i Ai hi X 1 tσz2 H + Ir = hj hj + Ir Γ P j<i (140) (141) • The resultant SNR for the ith data stream is given by γiMMSE−SIC = αi (142) • The achievable sum-rate for MMSE-SIC is given by MMSE−SIC Rsum = t X log2 (1 + γiMMSE−SIC ) (143) i=1 MMSE−SIC is indeed equal to the MIMO channel • Next, we show that Rsum capacity C (w) with the white transmit covariance as given in (74), i.e., the MMSE-SIC receiver is capacity-optimal. & 78 % ' • First, we express the achievable rate for the ith data stream, i > 1, as $ RiMMSE−SIC = log2 (1 + αi )  = log2 1 + hH i  (144) X j<i P H  = log2 1 + 2 hi tσz  = log2 det I r + &  hj hH j tσz2 Ir + P !−1 P X H h h j j + Ir tσz2 j<i P X H h h j j + Ir 2 tσz j<i X P H h h = log2 det  j j + Ir tσz2 j<i !−1  hi  !−1 !−1 (145)  hi  (146)  P H h h i i tσz2 ! P X H  h h j j + Ir 2 tσz j≤i (147) (148) 79 % ' • Since det(B −1 A) = det(A) , det(B ) $ we have RiMMSE−SIC = log2 det ,Ii − Ii−1 • Thus we have MMSE−SIC = Rsum P X H h h j j + Ir 2 tσz j≤i t X ! RiMMSE−SIC = I1 + i=1 = log2 det − log2 det t P X H h h i i + Ir tσz2 i=1 t X i=2 ! P X H h h j j + Ir 2 tσz j<i ! (149) (Ii − Ii−1 ) = It = log2 det P H HH + Ir tσz2 = C (w) • Since the above proof holds regardless of the decoding order, the achievable sum-rate for MMSE-SIC is independent of the decoding order. & 80 % $ ' • However, different decoding orders will result in different transmission MMSE−SIC rates allocated over the t data streams, to make Rsum = C (w) . • Question: Do we need r ≥ t in the above proof for MMSE-SIC? • In general, for ZF-based receivers (linear ZF or ZF-SIC), it needs that r ≥ t; for MMSE-based receivers (linear MMSE or MMSE-SIC), r can be smaller than t. & 81 % ' MIMO AWGN Channel Capacity Without CSIT $ Assume t = r = 2, H = [1 ρ; ρ 1] with ρ = 0.5, and σz = 1. 18 16 Achievable Rate (bps/Hz) 14 12 Capacity/MMSE−SIC ZF−SIC MMSE ZF 10 8 6 4 2 0 0 & 5 10 15 20 25 30 SNR (dB) 82 % ' MIMO AWGN Channel Capacity Without CSIT $ Assume t = r = 2, H = [1 ρ; ρ 1] with ρ = 0.9, and σz = 1. 14 Achievable Rate (bps/Hz) 12 10 Capacity/MMSE−SIC ZF−SIC MMSE ZF 8 6 4 2 0 0 & 5 10 15 20 25 30 SNR (dB) 83 % ' $ Capacity of MIMO Fading Channel • Consider the following r × t MIMO fading channel: y = Hx + z (150) – H is constant during each transmission block, but can change from one block to the other (i.e., block-fading); assume that the instantaneous channel H is known at the receiver. – x ∼ CN (0, S x ), where the transmit covariance matrix S x is constant over all transmission blocks if the channel is unknown at the transmitter, but may change over transmission blocks according to the instantaneous channel if known at the transmitter. – Tr(S x ) ≤ P for any transmission block. – z ∼ CN (0, σz2 I r ). & 84 % $ ' • The IMI of the MIMO fading channel for a given pair of H and S x is given by 1 I(x; y|H) = log2 det I r + 2 HS x H H (151) σz • The ergodic capacity of the MIMO fading channel is then defined as Cerg = E [I(x; y|H)] (152) where the expectation is taken over H. • The q% outage capacity of the MIMO fading channel is denoted by Cout,q% , which is defined as Pr (I(x; y|H) < Cout,q% ) = q% & (153) 85 % ' $ MIMO Fading Channel With CSIT • In the CSIT-known case, S x for each transmission block can be optimized based upon the instantaneous channel H. • In the ergodic capacity case, S x for each transmission block should be chosen to maximize the IMI, which is equivalent to the MIMO AWGN channel capacity for the given H. Thus, the optimal S x is designed using eigenmode transmission and WF power allocation. • In the outage capacity case, if we can find the minimum outage probability q% for a constant transmission rate R such that Pr (I(x; y|H) < R) ≤ q% (154) Then we can claim that R is the q% (transmitter-aware) outage capacity. • Thus maximizing the outage capacity for a given outage probability & 86 % ' target is equivalent to minimizing the outage probability for a given constant rate target, i.e., Cout,q% and q% have one-to-one correspondence. $ • In order to minimize the outage probability for a given constant transmission rate, it is desirable to maximize the IMI for each transmission block based on the instantaneous channel H given P . • Thus the optimal S x for each transmission block is given by the eigenmode transmission and WF power allocation, same as that for the ergodic capacity case. • However, for transmission blocks with superior channel conditions, S x that satisfies the power constraint and results in an IMI larger than the given rate target R may not be unique. For such cases, it is usually desirable to find the optimal S x to minimize the transmit power given that the resultant IMI is equal to R. • For example, consider a SISO fading channel with constant transmission & 87 % ' rate R. A transmission outage will not occur if log2 1 + Thus if |h|2 > (2R −1)σz2 , P any transmit power between P |h|2 σz2 (2R −1)σz2 |h|2 ≥ R. $ and P can make a non-outage transmission, while the minimum power is (2R −1)σz2 . |h|2 • For MIMO fading channels, the transmit power to support a given rate R is minimized by eigenmode transmission with WF power allocation: 2 + σz , i = 1, . . . , m (155) pi = µ − 2 λi where the water-level µ in this case should be chosen such that the rate constraint is satisfied with equality, i.e., 2 + m X µλi log2 =R (156) 2 σz i=1 Pm • If i=1 pi > P , transmitter-aware outage occurs; otherwise, no outage occurs. & 88 % ' $ MIMO Fading Channel Without CSIT • In the CSIT-unknown case, a constant transmit covariance matrix S x is used for all transmission blocks. • In the ergodic capacity case, the optimal S x should maximize the statistical average of the IMI given by 1 (157) E log2 det I r + 2 HS x H H σz if the distribution of H is known at the transmitter. • In the case of H ∼ H w , it can be shown that the optimal transmit covariance is the white covariance given by & S (w) x = P It t (158) 89 % $ ' • From (74), it follows that the resultant ergodic capacity is given by m 2 X λi P (159) Cerg = E log2 1 + 2 σ t z i=1 • Thus the ergodic capacity is determined by the distribution of the squared singular values of the random MIMO channel matrix. • Note that λ2i ’s are the eigenvalues of the matrix H H H if r ≥ t = m, since with the truncated SVD of H, we have H 2 H H H = V Λ̃Ũ Ũ Λ̃V H = V Λ̃ V H (160) • Similarly, it can be shown that λ2i ’s are the eigenvalues of HH H if t > r = m. & 90 % $ ' • Define the following matrix   HHH r ≥ t W =  HH H r < t (161) • The distribution law of W when H ∼ H w is called “Wishart distribution”, for which the joint distribution of the eigenvalues λ2i ’s are known (details are omitted here). • With the Wishart distribution for W , it can be shown that the spatial multiplexing gain for the ergodic capacity of MIMO fading channel is given by Cerg = min(t, r) P →∞ log2 P lim (162) • This is consistent with the spatial multiplexing gain we have obtained for the MIMO AWGN channel case, which is shown equal to the rank of the & 91 % $ ' given MIMO channel matrix, since for H ∼ H w , the rank of the random matrix H is min(t, r) with probability one. • In the CSIT-unknown case, Cerg can be achieved by horizontal encoding at the transmitter and MMSE-SIC decoding at the receiver, similarly as for the MIMO AWGN channel case. However, it is worth noting that in the MIMO fading channel case, each data stream of horizontal encoding needs to span over all different fading states (i.e., to achieve coded diversity), and the transmission rate of each data stream needs to be set appropriately according to the channel distribution as well as the decoding order at the receiver. • Next, consider the outage capacity of the MIMO fading channel without CSIT. In this case, it is desirable to find a constant transmit covariance S x that minimizes the outage probability for a given constant & 92 % $ ' transmission rate R: 1 Pr log2 det I r + 2 HS x H H < R σz (163) by assuming that the channel distribution is known at the transmitter. • For the case of H ∼ H w , it has been conjectured that the optimal S x is in the following form Sx = P diag(1, . . . , 1, 0 . . . , 0 ) | {z } | {z } k k ones (164) t−k zeros i.e., the transmitter selects (arbitrarily) a subset of k out of t transmit antennas for transmission with the white covariance. • For example, consider the MISO fading channel with t = 2, r = 1, and h ∼ hw . We know that if k = 1 and thus S x = P diag(1, 0), i.e., only the first transmit antenna is used and the MISO fading channel becomes a SISO fading channel, the resultant outage probability for a constant & 93 % $ ' transmission rate R is 2 P |h1 | <R Pr log2 1 + σz2 (165) Since |h1 |2 is exponentially distributed with mean σh2 , we obtain the outage probability as (1) pout = 1 − e−γ where γ = (2R −1)σz2 . 2P σh (166) On the other hand, if k = t = 2 and thus S x = (P/2)I 2 (e.g., using Alamouti code), the outage probability for the MISO fading channel with the same rate R is given by 2 2 P (|h1 | + |h2 | ) Pr log2 1 + <R (167) 2 2σz Since |h1 |2 + |h2 |2 is chi-square distributed with 4 degrees of freedom, we & 94 % $ ' obtain the outage probability as (2) pout = 1 − e−2γ (1 + 2γ) (1) (168) (2) Thus, we conclude that if eγ > (1 + 2γ), then pout < pout , i.e., using single transmit antenna results in a lower outage probability than using both antennas, and vice versa. Alamouti code can be capacity-suboptimal! • In general, with other parameters fixed, the smaller the rate target R is, the larger is the optimal number of active antennas, k. This is because smaller R corresponds to smaller outage probability, which is achieved by more diversity or more transmit antennas. • For general MIMO systems without CSIT, the outage capacity of the MIMO fading channel with the (truncated) white transmit covariance is achieved by vertical encoding (e.g., space-time code) and iterative receiver. & 95 % $ ' • If horizontal encoding is used along with linear/nonlinear receivers, a practical design is to assign all data streams/transmit antennas the same transmission rate R/t. • However, horizontal encoding with equal-rate allocation in general does not achieve the outage capacity even with the MMSE-SIC receiver and r ≥ t. & 96 % ' $ Summary • MIMO AWGN channel – CSIT-known case: eigenmode transmission and WF power allocation achieve the capacity – CSIT-unknown case: horizontal encoding and nonlinear MMSE-SIC receiver achieve the capacity – For both cases, spatial multiplexing gain is equal to the rank of MIMO channel matrix • MIMO fading channel – Ergodic capacity ∗ CSIT-known case: eigenmode transmission and WF power allocation based on instantaneous CSI achieve the capacity ∗ CSIT-unknown case: horizontal encoding and nonlinear MMSE-SIC & 97 % $ ' receiver across different fading states achieve the capacity – Outage capacity ∗ CSIT-known case: eigenmode transmission and WF power allocation based on instantaneous CSI achieve the capacity ∗ CSIT-unknown case: vertical encoding (with transmit antenna selection) and iterative receiver achieve the capacity & 98 %

Lecture IV

Related documents

Products

Support

Lecture IV

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib