schlegelnotes

Upload: saeedfirstman66

Post on 06-Apr-2018

223 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/2/2019 SchlegelNotes

    1/68

    MIMO Channels

    and Space-Time Coding

    Presenters: Christian Schlegel and Zachary [email protected]; [email protected]

    WOC 2002, Tutorial PresenationBanff, AB, CANADA, July, 2002

    Outline:

    Capacity of MIMO Channels: We discuss the information theo-retic bases for the capacity arguments of MIMO channels and presentfundamental results and methods.

    Channel Modeling and Realizable Capacity: We discuss sim-ple ray-tracing channel models and study the capacity of these ar-tificially generated channels. Basic conclusions on the behavior of

    real-world MIMO channels are drawn. Space-Time Coding: We introduce the basics of space-time codingand modulation methods, such as orthogonal designs, unitary space-time codes, group space-time codes. We discuss error performancemeasures, and optimal and sub-optimal decoding algorithms.

    Space-Time Communications Systems: We discuss basic layer-ing methods and the addition of error control coding to space-timesystems. We show how complex decoders work and how they per-form.

  • 8/2/2019 SchlegelNotes

    2/68

    MIMO CHANNELS AND CAPACITY

  • 8/2/2019 SchlegelNotes

    3/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 2

    Foundation: Channel Capacity

    Shannon derived the following capacity formula (1948) for an additivewhite Gaussian noise channel (AWGN):

    C = W log2 (1 + S/N) [bits/second]

    W is the bandwidth of the channel in Hz S is the signal power in watts N is the total noise power of the channel watts

    Channel Coding Theorem (CCT):

    1. Its direct part says that for rate R < C there exists a coding systemwith arbitrarily low error rates as we let the codelength N.

    2. The converse part states that for R C the bit and block errorrates are strictly bounded away from zero for any coding system

    Bandwidth Efficiency characterizes how efficiently a system uses itsallotted bandwidth and is defined as

    =Transmission Rate

    Channel Bandwidth W[bits/s/Hz].

    From it we calculate the Shannon limit as

    max = log21 + SN [bits/s/Hz].

    Note: In order to calculate , we must suitably define the channel bandwidth W. One com-monly used definition is the 99% bandwidth definition, i.e., W is defined such that 99%of the transmitted signal power falls within the band of width W.

  • 8/2/2019 SchlegelNotes

    4/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 3

    The Shannon Bound

    Average Signal Power S can be expressed as

    S =kEb

    T= REb,

    Eb is the energy per bit k is the number of bits transmitted per symbol T is the duration of a symbol

    R = k/T is the transmission rate of the system in bits/s.

    S/N is called the signal-to-noise ratio N = N0W is the total noise power N0 is the one-sided noise power spectral density

    max = log2

    1 +

    REbN0W

    .

    This can be solved to obtain the minimum bit energy required for reliabletransmission, called the Shannon bound:

    EbN0

    2max 1

    max,

    Fundamental limit: For infinite amounts of bandwidth max 0

    Eb

    N0 lim

    max0

    2max 1max

    = ln(2) =

    1.59dB

    This is the absolute minimum signal energy to noise power spectral den-sity ratio required to reliably transmit one bit of information.

  • 8/2/2019 SchlegelNotes

    5/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 4

    Normalized Capacity

    Normalize our formulas per signal dimension as given by [WoJ65]. This isuseful when the question of waveforms and pulse shaping is not a centralissue, since it allows one to eliminate these considerations by treatingsignal dimensions [Schl97].

    Cd =1

    2log2

    1 + 2RdEb

    N0 [bits/dimension]

    Cc = log2

    1 +

    REbN0

    [bits/complex dimension]

    Shannon bound normalized per dimension

    Eb

    N0 22Cd

    1

    2Cd ;

    Eb

    N0 2Cc

    1

    Cc .

    System Performance Measure In order to compare different commu-nications systems we need a parameter expressing the performance level.It is the information bit error probability Pb and typically falls into therange 103 Pb 106.

    References:[WoJ65] J.M. Wozencraft and I.M. Jacobs, Principles of Communication Engineering, John

    Wiley & Sons, Inc., New York, 1965, reprinted by Waveland Press, 1993.

    [Schl97] C. Schlegel, Trellis Coding, IEEE Press, Piscataway, NJ, 1997.

  • 8/2/2019 SchlegelNotes

    6/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 5

    Spectral Efficiencies of Popular Systems

    Spectral Efficiencies versus power efficiencies of various coded and un-coded digital transmission systems, plotted against the theoretical limitsimposed by the discrete constellations.

    Unachi

    evable

    Region

    QPSK

    BPSK

    8PSK

    16QAMBTCM32QAM

    Turbo65536

    TCM16QAM

    ConvCodes

    TCM8PSK

    214Seqn.

    214Seqn.

    (256)

    (256)

    (64)

    (64)

    (16)

    (16)

    (4)

    (4)

    32QAM

    16PSK

    BTCM64QAM

    BTCM16QAM

    TTCM

    BTCBPSK

    (2,1,14)CC

    (4,1,14)CCTurbo65536

    Cc [bits/complex dimension]

    -1.59 0 2 4 6 8 10 12 14 150.1

    0.2

    0.3

    0.4

    0.5

    1

    2

    5

    10

    EbN0

    [dB]

    BPSK

    QPSK

    8PSK

    16QAM

    16PSK

    Shannon

    Bound

  • 8/2/2019 SchlegelNotes

    7/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 6

    Discrete Capacities

    For discrete constellations, Shannons formula needs to be altered.

    C = maxq

    [H(Y) H(Y|X)] (classic definition)

    = maxq

    y

    ak

    q(ak)p(y|ak)logq(ak)

    ak

    p(y|ak)

    yak q(ak)p(y|ak)log(p(y|ak))= max

    q

    ak

    q(ak)

    y

    log

    p(y|ak)

    akq(ak)p(y|ak)

    where {ak} are the K discrete signal points, q(ak) is the probability withwhich ak is selected, and

    p(y|ak) =1

    22 exp(y ak)2

    22

    in the one-dimensional case, and

    p(y|ak) = 122

    exp

    (y ak)

    2

    22

    in the complex case.

    Symmetrical Capacity: When q(ak) = 1/K, then

    C = log(K) 1K

    ak

    n

    log

    ak

    exp

    (n a

    k + ak)

    2 n222

  • 8/2/2019 SchlegelNotes

    8/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 7

    Code Efficiency

    Shannon et. al. [SGB67] proved the following lower boundon the codeworderror probability PB:

    PB > 2N(Esp(R)+o(N)); Esp(R) = maxq max>1

    (E0(q, ) R))

    E0(, q) log2y

    x

    q(x)p(y|x)1/(1+)1+

    dy

    The bound is plotted for rate R = 1/2 for BPSK modulation [ScP99],together with selected Turbo and classic concatenated coding methods:

    N=44816-state

    N=36016-state

    N=13344-state

    N=1334

    16-state

    N=4484-state

    N=204816-state

    N=2040 concatenated (2,1,8) CCRS (255,223) code

    N=2040 concatenated(2,1,6) CC +RS (255,223) code

    N=1020016-state

    N=1638416-state N=65536

    16-state

    N=4599

    concate

    nate

    d (2,1,8) CCRS (511,479) code

    N=1024blockTurbo codeusing (32,26,4) BCH codes

    UnachievableRegion

    Shannon

    Capacity

    10 100 1000 104 105 1061

    0

    1

    2

    3

    4

    EbN0

    [dB]

    N

    References:[SGB67] C.E. Shannon, R.G. Gallager, and E.R. Berlekamp, Lower bounds to error proba-

    bilities for coding on discrete memoryless channels, Inform. Contr., vol. 10, pt. I, pp.65103, 1967, Also, Inform. Contr., vol. 10, pt. II, pp. 522-552, 1967.

    [ScP99] C. Schlegel and L.C. Perez, On error bounds and turbo codes,, IEEE Communi-

    cations Letters, Vol. 3, No. 7, July 1999.

  • 8/2/2019 SchlegelNotes

    9/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 8

    Parallel Additive Gaussian Channels

    Let us assume that we have N parallel one-dimensional channels disturbedby noise sources with variances 21, , 2N.

    +

    +

    x1

    xN

    y1

    yN

    N(0, 21)

    N(0, 2N)

    Energy Constraint: The total input energy is constrained on averageto E, the total average energy per channel use:

    N

    n=1x2n =

    N

    n=1En = E

    The capacity of these parallel channels is achieved by

    2n + En = ; 2n <

    En = 0; 2n

    where is the Lagrange multiplier chosen such that

    n En = E.

    Theorem: The capacity of this set of parallel channels is given by

    C =N

    n=1

    1

    2log

    1 +

    En2n

    =

    Nn=1

    1

    2log

    2n

    References:

    [Gal68] R.G. Gallager, Information Theory and Reliable Communication, John Wiley & Sons,Inc., New-York, 1968, Section 7.5, pp. 343 ff.

  • 8/2/2019 SchlegelNotes

    10/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 9

    Waterfilling Theorem

    Proof: Let x = [x1, , xN] and y = [y1, , yN] and consider

    I(x; y)(1)

    N

    n=1

    I(xn; yn) (1) independent xn

    (2)

    n=1

    1

    2log

    1 +

    En2n

    f(E)(2) Gaussian distributed xn

    Since equality can be achieved the next step is to find the maxi-mizing energy distribution E = [E1, , EN].

    f(E)

    En

    1

    2(2n + En)

    2

    n + En 1

    2 =

    This theorem is called the Waterfilling Theorem. Its operation can bevisualized in the following figure, where the power levels are in black:

    Power level:Difference 2n

    Channels 1 through N

    level: 2N

  • 8/2/2019 SchlegelNotes

    11/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 10

    Correlated Parallel Channels (MIMO)

    Correlated channels arise from e.g., multiple antenna channels:

    +

    +

    +xNt

    x1

    x2

    h11

    hNtNr

    hij

    yNr

    y2

    y1

    nNr

    n2

    n1Transmit

    Antenna

    Array

    Receive Array

    This channel is a multiple-input multiple-output (MIMO) channeldescribed by the matrix equation:

    y = Hx + n

    The transmitted signals xn are complex signals, as are the channelgains hij and the received signals yn.

    The noise is complex additive Gaussian noise with variance N0 (thatis N0/2 in each dimension).

    The path gains hij are complex gain coefficients modeling a randomphase shift and a channel gain. Often these are modeled as Rayleighrandom variables modeling a scattering-rich or mobile radio trans-mission environment.

    MIMO Rayleigh Channel: The hij are modeled as i.i.d. (or correlated)complex Gaussian random variables with variance 1/2 in each dimension.

  • 8/2/2019 SchlegelNotes

    12/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 11

    Channel Decomposition

    The correlated MIMO channel can be decomposed via the singular valuedecomposition SVD:

    H = U DV+

    (if r < t)

    where U and V are unitary matrices, i.e., U U+ = I, and V V+ = I.The matrix D contains the singular values ofH, which are the (positive)square roots of the eigenvalues of HH+ and H+H.

    The channel equation can now be written in an equivalent form:

    y = Hx + n

    = U DV+x + n

    U+y = y = Dx + n

    IfNt > Nr only the first Nr signals of

    x will be received.If Nr > Nt the Nr Nt bottomchannels will carry no signal.

    This leads to parallel Gaussian channels yn = dnxn + nn

    +

    +

    +

    +

    x1

    xN

    y1

    yN

    N(0, 22)

    N(0, 22)

    d1

    dN N = min(Nt, Nr)

  • 8/2/2019 SchlegelNotes

    13/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 12

    MIMO Capacity

    The multiplicative factors dn can be eliminated by multiplying the re-ceived signal y with D1. This leads back exactly to the parallel chan-nels:

    C =N

    n=1

    log

    1 +

    d2nEn22

    =

    Nn=1

    log

    d2n

    22

    Note: The channels are complex, and hence there is no factor 1/2 and the

    variance is 22.This capacity is achieved with the waterfilling power allocation:

    22

    d2n+ En = ;

    2

    d2n<

    En = 0;2

    d2n

    Optimal System: This leads to the optimal signalling strategy:

    1. Perform the SVD of the channel H U, V, D.2. Multiply the input signal vector x with V. This is matrix processing.

    3. Multiply the output signal y with the matrix processor U+.

    4. Use each channel with signal-to-noise ratio d2nEn/(22) independently.

    x x y yV U+

    Matrix Processor Matrix ProcessorMIMO Channel

  • 8/2/2019 SchlegelNotes

    14/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 13

    Symmetric MIMO Capacity

    Drawback: the channel H needs to be known at both the transmitterand the receiver so the SVD can be computed.

    Fact: Channel knowledge is not typically available at the transmitter,and the only choice we have is to distribute the energy uniformly over allcomponent channels. This leads to the Symmetric Capacity:

    C =N

    n=1

    log

    1 + d2nE

    2Nt2

    = logN

    n=1

    1 + d

    2nE2Nt2

    Noting that the d2n are the eigenvalues ofHH+, the above formula can be

    written in terms of matrix eigenvalues, using the fact det(M) =

    (M),and det(I + M) =

    (1 + (M)):

    C = logN

    n=1

    1 +

    d2nE

    2Nt2

    = log det

    INr +

    E

    2Nt2HH+

    = log det

    INr +

    E

    2Nt2H+H

    Discussion:

    The capacity of a MIMO channel is goverend by the singular valuesofH, or in the symmetrical case by its eigenvalues. These determinethe channel gains of the independent parallel channels.

    Channel H needs to be known at the receiver Channel Estimation This implies a study of the behavior of channel eigenvalues.

  • 8/2/2019 SchlegelNotes

    15/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 14

    MIMO Fading Channels

    Assumed that the channel H is known at the receiver

    I(x; (y, H)) = I(x; H) + I(x; y|H)= I(x; y|H)= EH [I(x; y|H = H0)]

    We need to average the mutual information over all channel realizations.

    I(x; y|H) is maximized if x iscircularly symmetric complex Gaussian with covariance Q, and

    I(x; (y, H)) = EH

    log det

    Ir +

    E

    2Nt2HQH+

    I(x; (y, H)) = EH

    log det

    Ir +

    E

    2Nt2(HU)D(U+H+)

    The spectral decomposition ofQ = U DU+ produces an equivalent

    channel H = HU, hence the maximizing Q is diagonal. Furthermore, concavity of the function log det() shows that Q = I,

    hence the maximizing Q is a multiple of the identity

    Capacity of the MIMO Rayleigh Channel:

    C = EH

    log det

    INr +

    E

    2Nt2HH+

    By the law of large numbers:

    HH+Nt NtIr and C = Nr log

    1 +

    E

    22

    H+H

    Nr NrIt and C = Nt log

    1 +NrNt

    E

    22

  • 8/2/2019 SchlegelNotes

    16/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 15

    Evaluation of the Capacity Formula

    Following Telatar [Tel99] define the random matrix

    W =

    HH+ if Nr < NtH+H if Nr Nt

    W is an mm; m = min(r, t) non-negative definite matrix with real,non-negative eigenvalues n = d2nThe capacity can be written in terms of these eigenvalues:

    C = E{n} N

    n=1

    log

    1 +

    E

    2Nt2n

    for r = t symmetric chan-nels, let {n} be the eigen-values of H, then n =

    2n

    The (ordered) eigenvalues of W follow a Wishart distribution:

    p(1, , N) = N

    i=1 e

    i

    M

    N

    i i

  • 8/2/2019 SchlegelNotes

    17/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 16

    Large Systems

    As the number of antennas Nt , Nr , the number of eigenvaluesN() , and the capacity formula

    C = E{i}

    Ni=1

    log

    1 +

    E

    2Nt2

    0

    Nlog

    1 +

    N E

    2Nt2

    dF()

    F() is the cumulative distribution (CDF) of the eigenvalues of W

    For random matrices like W, a general result states

    dF()

    d

    12

    + 1

    1

    ; for [, +]

    0 otherwise

    2 4 6 8

    0.1

    0.2

    0.3

    0.4

    0.5

    MN 1, and = ( 1)2.

    = 1

    = 2

    = 4

    In the limit, the capacity of the Rayleigh MIMO is given by

    C

    N=

    1

    2

    d+d

    log

    1 +

    N E

    2Nt2

    + 1

    1

    d

    Reference:[Tel99] I.E. Telatar, Capacity of mulit-antenna Gaussian channels, Eur. Trans. Telecom.,

    Vol. 10, pp. 585595, Nov. 1999.

  • 8/2/2019 SchlegelNotes

    18/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 17

    Physical Channel Modeling

    Objective: Model realistic correlation among the statistical parametersof the channel parameters based upon ray-tracing models.

    Method: Emulate the correlated complex path gains represented by theelements of the channel matrix H using basic ray tracing techniques with:

    Symbol time Ts set large enough such that flat fading occurs.

    Randomly placed scattering objects inside a ring of radius R meters.

    Elements spaced d carrier wavelengths apart. Transmitter and receiver arrays separated by L meters.

    The sampled path gainbetween the ith receiver element and the jth trans-mitter element at time nTs is the complex superposition:

    hij(n) = AK

    k=1

    gkej(k+2fdknTs)

    The random variables gk, k, and fdk determine hij(n) as the superpositionof K electromagnetic waves as follows:

    gk is the amplitude of the kth wave at time nTs k is the kth phase term fdk is the Doppler error frequency determined by angle of arrival hij(n) is normalized to unit power

  • 8/2/2019 SchlegelNotes

    19/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 18

    2 2 and 8 8 Examples

    Channel gains for 2 or 8 transmit and 2 or 8 receive antennas with 15scattering objects located near the mobile array.

    vmobile = 50 kph mobile receiver speed Ts = 1s sampling time R = 50m radius of circle containing the scatterers L = 2km separation of arrays

    d = 5 separation of antenna elements

    fc = 2.4GHz carrier frequency Es/N0 = 10dB symbol signal to noise ratio

    0 1 2 3 4 5 6 7x 10

    4

    10-3

    10-2

    10-1

    1

    10

    Symbols

    |

    h|

    (dB)

  • 8/2/2019 SchlegelNotes

    20/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 19

    Capacities for the 2 2 and 8 8 Channels

    Capacities of correlated and uncorrelated channels:

    8 8 and 2 2 antenna arrays ideally uncorrleated (red lines) 8 8 and 2 2 antenna arrays corrleated by the scattering process

    (blue lines)

    Note: The relative variance decreases with the number of antennas.

    0 1 2 3 4 5 6 7 8 9 10 x10 40

    5

    10

    15

    20

    25

    30

    35

    Symbols

    Capacity

    (bits/channeluse)

    Reference:

    [Scb02] C. Schlegel and Z. Bagley, Efficient processing for high-capacity MIMO channels,submitted to IEEE J. Select. Areas Commun., Special Issue: MIMO Systems, May 2002.

    [BaS01] Z. Bagley and C. Schlegel, Classification of correlated flat fading MIMO channels(multiple antenna channels), CIT 2001, June 3-6, Vancouver, BC.

  • 8/2/2019 SchlegelNotes

    21/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 20

    Characteristics of the MIMO Channel

    Rank ClassificationMIMO channels can be classified as high-rank or low-rank channels.

    This classification is made based upon correlation properties of the re-ceiver array response vector, or the singular values of the channel responsematrix H.

    Orthogonal channel path gains present the upper limiting case for the

    MIMO channel capacity. In this case, the non-zero squared singular val-ues of the channel response matrix H are given by

    d2n = NrE{h+nnhnn}, i = 1, 2, , Nt

    Statistically indepedent channel path gains hij are usually modelledas uncorrelated complex Gaussian random variables. In this case, theeigenvalues of HH+ for Nt Nr and H+H for Nt Nr are given bythe previously described Wishart distribution.

    Correlatedchannel path gains occur in real-world cases. The completelycorrelated case such as in long-distance scatter-free wireless links identifiesthe lower capacity limit for MIMO channels. In this case, the single non-zero squared singular value of the channel response matrix H is

    d2 = Nr

    Ntn=1

    E{h+nnhnn} = NrNt,

    References: (for linear algebraic concepts)

    [HrJc99] R.A. Horn and C.R. Johnson, Matrix Analysis, Cambridge University Press 1999.

  • 8/2/2019 SchlegelNotes

    22/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 21

    High-Rank MIMO Channels

    High-rank MIMO channels occur when there is a rich scattering envi-ronment and when the Tx and Rx arrays are relatively near one another.

    High rank MIMO channels occur when there is little correlationamong the channel path gains.

    MIMO channels have a diversity gain defined by the rank of HH+.

    The maximum achievable diversity gain is

    rank(HH+) = min(Nt, Nr)

    The orthogonal channel gain case represents the upper limit for thecapacity of MIMO channels, and the maximum diversity gain.

    If the Nt columns ofH are orthogonal and the entries ofH are normalizedto unit power, the squared non-zero singular values of H are:

    d2n = Nr; n = 1, 2, , NtCapacity:The capacity of the high-rank MIMO channel can now be written as

    Chigh =

    min(Nr,Nt)i=1

    log1 + d2n

    Nt ; = E22

    min(Nt, Nr) array capacity

    advantage

    log

    1 +

    NrNt

    receiver antennaSNR advantage

  • 8/2/2019 SchlegelNotes

    23/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 22

    Low-Rank MIMO Channels

    Low rank MIMO channels occur under scatter-free or long-distancelinks. The low rank MIMO channel is equivalent to a single antennachannel with the same total power.

    Low rank MIMO channels occur when there is strong correlationbetween the channel path gains.

    The correlation characteristics determine the rank of HH+, whichin turn determines the diversity advantage.

    A completely correlated H matrix is a scaled version of the all onesmatrix with with dimensions NrNt and provides no diversity gainover the single antenna case.

    If the paths are highly correlated, all gains hij are roughly equal, and H,a multiple of the all-one matrix, has a single non-zero singular value

    d2 = Nr

    Ntn=1

    E{h+nnhnn} NrNt,

    Capacity:The capacity of the low-rank MIMO channel can now be written as

    Clow =

    min(Nr,Nt)

    n=1log

    1 +

    Ntd2n

    log

    1 +

    Ntd2

    log (1 + Nr) .Note: The channel behaves like a point-to-point channel with Nr times

    the received signal power due to the antenna array, achieved bysimple maximum-ratio combining at the receiver

  • 8/2/2019 SchlegelNotes

    24/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 23

    Low-SNR MIMO Channels

    If the available is low, a Taylor Series approximation of

    log(1 + x) xfor small values of x lets us develop both Chigh and Clow as

    Chigh

    min(Nr, N

    t)

    Nr

    Nt; C

    low N

    r.

    Furthermore, these formulas present overbounds to the actual capacities

    Conclusions:

    Correlation in the channel has little effect on capacity for low SNR.

    Correlation in the channel has a pronounced effect where the linearapproximation to log(1 + x) does not apply. The MIMO resources must be allocated differently based upon the

    classification of the MIMO channel.

    Easily estimated correlation statistics can be used to help classifythe operating conditions as high-rank or low-rank.

  • 8/2/2019 SchlegelNotes

    25/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 24

    High and Low-Rank MIMO Channel Capacities

    The following figures the general capacity behavior for high and low rankMIMO channels values of 0 dB and 20 dB.

    Bits/channel use

    Bits

    2 4 6 8 10 12 14 16 18 200

    0.5

    1

    1.5

    2

    2.5

    3

    High correlation (case H)Low correlation (case L)

    Capacity as a function of

    the number of antennas inthe arrays

    2 4 6 8 10 12 14 16 18 20

    0

    20

    40

    60

    80

    100

    120

    High correlation (case H)UncorrelatedFading

    Low correlation (case L)

    Standard deviation of thecapacity as a function ofthe number of antennas

    References:

    [Tel99] I.E. Telatar, Capacity of mulit-antenna Gaussian channels, Eur. Trans. Telecom.,Vol. 10, pp. 585595, Nov. 1999.

    [BaS01] Z. Bagley and C. Schlegel, Classification of correlated flat fading MIMO channels(multiple antenna channels), Canadian Information Theory Workshop, CITW2001,Vancouver, BC, June 36, 2001.

  • 8/2/2019 SchlegelNotes

    26/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 25

    Effects of the Array Geometry

    The condition for orthogonality between the columns ofH in a scatterfreeenvironment is given by [GBGP01]

    dtdrL

    =

    Nr.

    dt and dr are the Tx and Rx antenna array element seperations

    L is the distance between arrays Nr is the number of receiver elements is the carrier wavelength

    The following figure illustrates the transition from a high rank channelto a low rank channel for high and low SNR values and for two valuesof the element separation.

    100

    101

    102

    103

    104

    105

    0

    5

    10

    15

    20

    25

    L / r

    Capacity(bits/

    use)

    Capacityversus LinkSeperationtoScatteringRadiusRatiofor Nt=N

    r=4

    SNR=20dB

    SNR=0dB

    d=5wavelengths(BLUE)

    d=1wavelength(RED)

  • 8/2/2019 SchlegelNotes

    27/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 26

    Capacity Advantage Factor

    From the previous figure, it is easy to conclude that scatter-free linksdegrade to equivalent single-antenna channels very quickly, leading tothe pin-hole channel described by the correlated channel capacity

    Clow log(Nr) (1)

    Under high SNR conditions, the capacity formula can be approximated

    as

    C =N

    n=1

    log

    1 +

    Ntd2n

    N

    n=1

    log

    Ntd2n

    = log

    Ni=1

    Ntd2n

    . (2)

    N = rank(HH+) The total instantaneous capacity is a function of the rank ofHH

    +

    ,or equivalently the condition number of HH+.

    Link geometry determines correlation and fading parameters. The H matrix entries become highly correlated rather quickly rela-

    tive to the antenna element separation parameters [GBGP01].

    Comparing (1) and (2), we can define a capacity benefit factor repre-senting the capacity gain (at high SNR values) over the pin-hole channel

    since

    log

    N

    n=1

    Ntd2n

    log(Nr)

    which after collecting terms becomes

    log

    NtN1

    Nn=1 d

    2n

    NtNr

    0

  • 8/2/2019 SchlegelNotes

    28/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 27

    Capacity Advantage Factor

    Noting that

    C Clow log

    Nt

    N1 Nn=1 d

    2n

    NtNr

    = log() 0,

    we can write the capacity under high SNR conditions in terms of thecapacity advantage over the pin-hole channel:

    C

    log + Clow = log( Nr), = NtN1

    Nn=1 d

    2n

    NtNr

    10 5 0 5 10 15 20 25

    0

    10

    20

    30

    40

    50

    60

    70

    / N0(dB)

    Capacity(bits

    /use)

    Csym, Cwaterfilling (orthogonal)

    Csym, Cwaterfilling (i.i.d)

    Clow + log()

    Chigh

    Clow

    References:[GBGP01] D. Gesbert, H. Bolcskei, D. A. Gore, A. J. Paulraj, Outdoor MIMO Wireless Chan-

    nels: Models and Performance Prediction, submitted to IEEE Trans. Communications,July 2000.

    [ScB02] C. Schlegel and Z. Bagley, Efficient Processing for High-Capacity MIMO Channels,submitted to IEEE J. Select. Areas Commun., Special Issue: MIMO Systems, May 2002.

    [Bag02] Z. Bagley, Serially Concantenated and Layered Space-Time Coding, Ph.D. thesis,

    work in progress.

  • 8/2/2019 SchlegelNotes

    29/68

    SPACE-TIME MODULATION

  • 8/2/2019 SchlegelNotes

    30/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 29

    Space-Time Coding

    Space-time coding (STC) systems make use of the MIMO channel gener-ated by a multiple-antenna transmit/receive setup:

    +

    +

    +xNtr

    x1r

    x2r

    h11

    hNtNr

    hij

    yNrr

    y2r

    y1r

    nNrr

    n2r

    n1rTransmit

    AntennaArray

    Receive Array

    Each antenna transmits a DSB-SC signal:

    yj(t) =

    Nt

    i=1

    EsNt

    hijxi(t) + ni(t)

    and y(t) =Es

    NtHx(t) + n(t),

    where y = (y1, , yNr) and x = (x1, , xNt).Channel Gains: modeled as independent complex coefficients, i.e.,

    p(h) =1

    2exp

    |h|

    2

    2

    ; E[hihj] = 0

    which leads to a Rayleigh distributed amplitude

    p(a = |h|) = a expa

    2

    2

    p(a = |h|2) = 1

    2aexp(a/2)

  • 8/2/2019 SchlegelNotes

    31/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 30

    MIMO Channels Transmit Diversity

    The Alamouti scheme [Ala98] uses two transmit antennas:

    +

    [x1, x0]

    [x0,x1]

    h1

    h0

    [x0,x1][x0, x1]

    [h0, h1]

    Transmit

    Antenna

    Array

    Combiner

    Estimate

    The transmitted 2 2 STC codeword is X, and thesymbols xi can be any quadrature modulated symbols.

    X =

    x0 x1x1 x

    0

    The received signal for a single receive antenna is

    r = [r0, r1] = [h0x0 + h1x1,h0x1 + h1x0] + [n0, n1]= [h0, h1]X + n

    The demodulator calculates

    [x0, x1] =

    h0 h1h1 h0

    r0r1

    =

    (|h0|2 + |h1|2)x0 + h0n0 + h1n1 n0

    , (|h0|2 + |h1|2)x1 + h0n1 + h1n0 n1

    If the channel path h0 and h1 are uncorrelated, the noise sources ni

    have twice the variance of the original noise sources.

    The system provides dual diversity due to the factor (|h0|2 + |h1|2)which exhibits a -square distribution of fourth order.

    Reference:

    [Ala98] S.M. Alamouti, A simple transmit diversity technique for wireless communications,IEEE J. Select. Areas Commun., Vol. 16, No. 8, October 1998.

  • 8/2/2019 SchlegelNotes

    32/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 31

    Multiple Receive Antennas

    The Alamouti scheme can be extended to multiple receive antennas:

    R=

    r0 r1r2 r3

    =

    h0 h1h2 h3

    x0 x1x1 x

    0

    + N

    Multiplying the received signals Rwith the channel estimate H we obtain

    [x0, x1] =

    h0 h1h1 h0

    r0r1

    +

    h2 h3h3 h2

    r2r3

    = (|h0|2 + |h1|2 + |h2|2 + |h3|2)x0 + h0n0 + h1n1 + h2n2 + h3n3 n0

    ,

    + (|h0|2 + |h1|2 + |h2|2 + |h3|2)x1 + h0n1 + h1n0 + h2n2 + h3n3 n1

    ]

    This system provides 4-fold diversity as expressed by the amplitude

    A = (|h0|2 + |h1|2 + |h2|2 + |h3|2)This is possible due to the fact that the rows of X are orthogonal. TheAlamouti scheme is the most basic representative of what are known as

    Orthogonal DesignsReal: The following is an example of a 4 4 (real) orthogonal design

    X = x1 x2 x3 x4

    x2 x1

    x4 x3

    x3 x4 x1 x2x4 x3 x2 x1

    Complex: A rate R = 0.5 complex design is

    X =

    x1 x2 x3 x4 x1 x2 x3 x4x2 x1 x4 x3 x2 x1 x4 x3

    x3 x4 x1 x2 x3 x4 x1 x2

  • 8/2/2019 SchlegelNotes

    33/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 32

    Space-Time Orthogonal Block Codes (STOB)

    STOBs are based on the theory of orthogonal designs [TJC99].As 3 4 real orthogonal design is

    X = D4,3(x) =

    x1 x2 x3x2 x1 x4x3 x4 x1x4 x3 x2

    XTX = KsI; Kx = L

    i=1

    x2i

    This STOB is used to drive 3 transmit antennas:

    [x1,x2,x3,x4]

    [x2, x1, x4,x3]

    [x3,x4, x1, x2] h3

    h2

    h1

    h = [h1, h2, h3]T

    GTrx3

    x2

    x1

    x4

    The received signal at (each) receive antenna is

    r = Xh + n =

    h1 h2 h3 0h2 h1 0 h3h3 0 h1 h20 h3 h2 h1

    x1x2x3x4

    + n = Gx + n

    G = D4,4([h1, h2, h3, h4 = 0]) is also an orthogonal design: GTG = KhI.

    Optimal reception is ackomplished with a matched filter receiver

    x =1

    Kh= GTr = x + n; Kh =

    Nti=1

    |hi|2

    Reference:

    [TJC99] V. Tarokh, H. Jafarkhani, and A.R. Calderbank, Space-time block codes from or-thogonal designs, IEEE Trans. Inform. Theory, vol. 45, no. 5, July 1999.

  • 8/2/2019 SchlegelNotes

    34/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 33

    STOB: Diversity and SNR Gains

    Processing at each receive antenna is identical, and the Nr channels foreach symbol xj are added together (Maximum-ratio combining):

    +

    +

    +

    BasebandSignals

    r1

    rNr

    gTNr,1

    gT11

    gT12

    gT1,Nc

    x1

    x2

    xNc

    DespreadingMR Combining

    Diversity and SNR

    It is straightforward to see that xj = a2jxj + nj, where a2j is 2-distributed with degree NtNr, and 2NtNr for complex designs.

    The noise nj has power NrN0/2, and NrN0 for complex designs.

    Diversity Gain : d =NtNr

    2

    SNR Gain : d = NtNri=1 |hi|2

    NtN0/2

    Complex orthogonal designs exist only for rates R 1/2, i.e., thereare at most L/2 orthogonal vectors.

    On the other hand, real orthogonal designs do exist for R 1, i.e.,there exist up to L real orthogonal vectors.

    Orthogonal designs can be used with single-sideband modulation

  • 8/2/2019 SchlegelNotes

    35/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 34

    STOB: Capacity and Spectral Efficiency

    Space-time orthogonal codes completely separate the L Nt (L Nt/2 in the case of complex signals) channels and have a very simpleoptimal receiver structure.

    Each of the separated channels has a 2 distributed power level withdegree NtNr, which is close to constant.

    Nevertheless the restriction to STOBs causes a loss in capacity

    Capacity of STOB

    Transmit Power: Pt = NcEs/Nt; (Nc symbols transmitted).

    Received Power/Antenna: Pr = (NtNr)2Es/Nt, each path contributes.

    Noise Power/Antenna: Pn = NtNrN0/2, each receive antennacontributes a noise source.

    STOB Capacity is then given by the applying Shannons formula:

    CSTOB =NcL

    log

    1 +

    2NrEsN0

    ; (bits/channel use)

    Notes:

    CSTOB is directly proportional to the rate R = NcL of the orthogonaldesign

    The SNR gain is proportional to the number of receive antennas

    nothing else The large diversity ensures that the amplitude fluctuations are min-imal Capacity identical to that of an AWGN.

  • 8/2/2019 SchlegelNotes

    36/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 35

    STOB Capacity: Example

    Capacities of 8 8 Systems: Comparison between the MIMO sym-metric capacity and the STOB capacity for an 88 multiantenna system.

    MIMO Limit

    STOBCapacityAWGNCapacity

    -10 -5 0 5 10 15 200.1

    1

    10

    102Bits/Channel Use

    40 bits

    9dB

    one sideband

    both sidebands

    Es/N0

    Observation: STOB codes provide adiversity advantage but do not provideany capacity advantage over a point-to-point channel with the same resources.

  • 8/2/2019 SchlegelNotes

    37/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 36

    Space-Time Codes: Error Computation

    In general a space-time codeword is an NtNr matrix (array) of complexsignals:

    X = [x1, , xNc] = x11 x12 x1Nc... ...

    xNt1 xNt2 xNtNc

    Each row of X is a space-time symbol, and the received STC word is

    Y =

    EsNt

    HX + N

    This means the conditional probability density function of Y given Hand the transmitted STC codeword is multi-variant Gaussian:

    p(Y|H, X) = 1NrNc

    exp

    trYEsNt HXYEsNt HX+

    2N0

    Note: The exponent term tr

    M M+

    is merely the squared sum of all

    the entries in M.

    N is a matrix of complex noise samples with variance N0, that isvariance

    2

    = N0/2 in each of the two dimensions. The signal energy per space-time symbol is Es and

    EsNt

    per con-

    stellation point Each constellation point xij is energy normalized to unity.

  • 8/2/2019 SchlegelNotes

    38/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 37

    Space-Time Codes II: Optimal Decoding

    ML reception: The ML-receiver requires channel knowledge and andmaximises the squared metric

    X = arg minX

    tr

    Y

    EsNt

    HX

    Y

    EsNt

    HX

    +

    = arg minX

    trEs

    NtHXX+H+

    2trEsNt

    HXY+

    Fixed Channels

    At any given time-instant, the channel is fixed and the impairment isGaussian noise.Pairwise Error Probability

    P(X X) = Q

    EsNt

    d2(X, X)2N0

    Squared Euclidean Distance

    d2(X, X) =Es

    Nt

    Nc

    n=1

    Nr

    j=1

    Nt

    i=1

    hij(xin

    xin

    )2

  • 8/2/2019 SchlegelNotes

    39/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 38

    Squared Euclidean Distance (SED)

    d2(X, X) can be expressed in terms of H, X and X as

    d2(X, X) =EsNt

    Nrj=1

    Nti=1

    Nti=1

    hijhij

    Ncn=1

    (xin xin)(xin xin) Kii

    The matrix K is a kernel matrix with entries Kii

    .

    SED as a Quadratic FormDefine hj = [h1j, , hNtj]T as the signature vector of receive antenna j.

    d2(X, X) =EsNt

    Nrj=1

    hjKhj

    Since K is hermitian (K = K+), we can spectrally decompose d2(X, X)

    d2(X, X) =EsNt

    Nrj=1

    hjV DV+hj

    =EsNt

    Nrj=1

    vjDvj

    The components of these equations are:

    V is a unitary matrix D is a diagonal matrix with the eigenvalues of K. It can be shown

    that all these eigenvalues are nonnegative real

    h is a vector of complex Gaussian gains v = V+h is a vector of rotated complex channel gains

  • 8/2/2019 SchlegelNotes

    40/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 39

    STC Error Probability

    The SED can be expressed in terms of the eigenvalues of K

    d2(X, X) =EsNt

    Nrj=1

    Nti=1

    di|vij|2; vj = V+hj

    and, given a fixed channel H

    P(X X) = QEsNrj=1 Nti=1 di|vij|2

    2N0Nt

    Error Probability depends on the eigenvalues of the matrix

    K = (X X) (X X)+

    in particular on the rank and the product of the eigenvalues.

    Chernoff Bound is easier to manipulate:

    P(X X) exp Es

    4N0Nt

    Nrj=1

    Nti=1

    di|vij|2

    =Nrj=1

    exp Es4N0Nt

    Nti=1

    di|vij|2 Dj

  • 8/2/2019 SchlegelNotes

    41/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 40

    Fading Error Analysis

    Fading channels: If independent fading is assumed then

    h is a vector of independent complex Gaussian random variables v = V+h is also a vector of independent complex Gaussian random

    variables, because

    E

    vv+

    = V+E

    hh+

    V = I

    The components vij are unit-variance complex Gaussian random vari-ables, |vij|2 is -square distributed with two degrees of freedom:

    p(a = |v|2) = exp(a); p(da = d|v|2) = 1d

    exp(a/d)

    d2(X, X) is the weighted sum of -square random variables.

    PDF of SED The PDF is found via a partial fraction expansion of thecharacteristic function:

    p(x = d2) =

    Nti=1

    pi

    Nrj=1

    xm1

    dmi (m 1)exp

    x

    di

    This PDF can be integrated in closed form to

    Pe =1

    2

    Nti=1

    pi

    Nrj=1

    1 1

    1 + 4N0NtEsdi

    m1n=0

    2n

    22nn21

    1+Esdi4N0Nt

    n

    References:

    [TNSC98] V. Tarokh, A.F. Naguib, N. Seshadri, and A.R. Calderbank, Space-time codes forhigh data rate wireless communications: Performance criterion and code construction,IEEE Trans. Inform. Theory, pp. 744765, March 1998.

    [BaS02] Z. Bagley and C. Schlegel, Pair-wise error probability for space-time codes undercoherent and differentially coherent decoding, submitted to IEEE Trans. Commun.,January 2002.

    [Sch96] C. Schlegel, Error probability calculation for multibeam Rayleigh channels, IEEETrans. Commun., Vol. 44, No. 3, March 1996.

  • 8/2/2019 SchlegelNotes

    42/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 41

    Chernoff Error Bounds

    The Chernoff error bounding technique avoids many algebraic technical-ities. The bound is calculated from the characteristic function as:

    P(X X) E

    exp

    Es

    4N0Nt

    Nrj=1

    Nti=1

    di|vij|2

    =

    Nrj=1

    Nti=1

    1

    1 + Esdi4N0Nt

    The final form of the bound is:

    P(X X) 1Nt

    i=1(1 +Esdi

    4N0Nt)

    Nr

    Nti=1

    di

    NrEs

    4N0Nt

    NrNt

    Design Criteria:

    The Rank Criterion: To achieve maximum diversity gainthe ker-nel matrix K = (X X)(XX)+ has to have full rank.

    The Determinant Criterion: The product Nti=1 di = det(K)needs to be maximized to give maximum coding gain.

    Orthogonal Designs revisited: Orthogonal designs provide full diver-sity. This condition is equivalent to requiring that XX is non -singularfor any X= X.Proof: The determinant of X is

    det(X) =

    det(XXT) =

    det diag

    i

    x2i , ,i

    x2i

    =

    i

    x2i

    Nt/2

    and therefore

    det(X X) =

    i

    |xi xi|2Nt/2

    = 0

    Therefore the maximum diversity NtNr is achieved.

  • 8/2/2019 SchlegelNotes

    43/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 42

    Code Construction

    Following these criteria Tarokh et. al. [TNSC98] have constructed trelliscodes which provide maximal diversity. For example:

    00 01 02 03

    10 11 12 13

    20 21 22 23

    30 31 32 33

    22 23 20 21

    32 33 30 31

    02 03 00 01

    12 13 10 11m(1), m(2) m(3), m(4) m(5), m(6) m(7), m(8)

    These codes achieve full diversity on two-antenna systems. The transmis-sion rate is 2 bits per symbol using two QPSK signals over two antennas,because there are four choices at each state.

    Decoding:Decoding follows the trellis using a sequence metric calculator Viterbi.Branch Metrics:The Viterbi algorithm works by using metrics m(r) along their branchesand accumulates them to find the global minimum, where

    m(r) =

    Nrj=1

    yjr Nti=1

    hijxir

    2

    = need channel estimates

    m(r) = yr Hxr2

    The metric is simply the squared Euclidean distance between hypothesisand received signal.

  • 8/2/2019 SchlegelNotes

    44/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 43

    Error Performance of STCs

    4 states

    8 states

    16states

    32states

    64 states

    4 5 6 7 8 9 10 11 12 13 14104

    103

    102

    101

    1

    Frame Error Probability (BER)

    SNR

  • 8/2/2019 SchlegelNotes

    45/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 44

    No Channel Information and Noncoherent Detection

    If the channel H is not known at the receiver the worst case then Yconditioned only on X is Gaussian distributed since both N and H(t)are Gaussian distributed.If we decompose the channel equation

    Y =

    EsNt

    HX + N =

    yT1...

    yT

    Nr

    =

    EsNt

    hT1 X + nT1

    ...

    hT

    NrX + nT

    Nr

    into signal sequences of different antennas, and consider the cross-correlationof the j-th sequence:

    E

    yjy+j

    = E

    (X+hj + nj)(h

    +j X + nj)

    = X+E

    hjh

    +j

    X + N0I

    If each receive antenna statistically sees the same channel, then this cross-

    correlation is independent of j and thus, for independent subchannels:

    E

    yjy+j

    =

    EsNt

    X+X + N0I = X

    In this case the conditional PDF of the received space-time codeword canbe written as:

    p(Y|X) = exptrYX1Y+

    NrdetX

  • 8/2/2019 SchlegelNotes

    46/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 45

    Noncoherent Detection and Unitary Space-Time Codes

    Since the trace operator is invariant to a rotation of its arguments:

    tr

    Y

    EsNt

    X+X + N0I

    1Y+

    = tr

    EsNt

    X+X + N0I

    1Y+Y

    the following conclusions can be drawn:

    (1) The Nc

    Nc matrix Y

    +Y is a sufficient statistic

    for optimal detection.

    (2) p(Y|X) depends on X only through X+X.(3) Unitarily rotated signals U X have the same

    PDF p(Y|X) = p(Y|U X).

    Unitary Space-Time Codes: The transmitted signals over each an-

    tenna (rows of X) are orthogonal:

    XX+ = NtI; detX = det

    EsNt

    XX+ + N0I

    = Es + N0

    Note: Unitary STC codewords X are not necessarily unitary matrices.For a matrix to be unitary it must be square and U U+ = U+U= I.

    Using (A + BC D)1 = A1 A1B(C1 + DA1B)1DA1

    1X

    = I EsNtEs + 1

    X+X

    Optimal Detection: X = arg maxX

    tr

    Y X+XY+

  • 8/2/2019 SchlegelNotes

    47/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 46

    Unitary Space-Time Codes: Basic Properties

    The non-coherentreceiver and unitary STCs have the following properties

    the receiver is quadratic and requires no channel estimates. Unitary STCs send orthogonal signals over the Nt transmit antennas. Orthogonal designs are unitary, but not necessarily visa versa. Orthogonal designs can be decoded non-coherently (without channel

    state information)

    Using orthogonal spreading sequences for each antenna is one wayof achieving unitary STCs.

    Capacity Result: It is shown in [MaH99] that for the block-wise con-stant channel unitary signals can achieve the capacity for non-coherentdetection.

    Block-wise constant channels are a model for, e.g., frequency hopping

    systems, or interleaved fast fading channels.

    Further results are:

    There is no point in using more transmit antennas than the coherenceduration of the channel, i.e., Nt Nc. The capacity is unchangedfor Nt Nc.

    Reference:

    [MaH99] T.L. Marzetta and B.M. Hochwald, Capacity of a mobile multiple-antenna com-munication link in Rayleigh flat fading, IEEE Trans. Inform. Theory., Vol. 45, No. 1,January 1999.

  • 8/2/2019 SchlegelNotes

    48/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 47

    Unitary Group Codes

    Hughes [Hug00] proposed the use of unitary group codes G, whose mem-bers G are unitary matrices. The transmitted codewords are:

    X = T G

    where T is a NtNc transmission matrix adapting G to the transmissionsystem with Nt antennas.

    Quaternion CodeA basic such unitary STC with Nc = 2, with elements in the quaternaryphase shift keyed (QPSK) constellation is

    Q =

    1 00 1

    ,

    j 00 j

    ,

    0 11 0

    ,

    0 jj 0

    This code can be used on a Nt = 2 antenna system with T = 1 1

    1 1 The group code Q is isomorphic to Hamiltons quaternion group.Differential Encoding of STCsThese codes are transmitted differentially as shown below:

    +G(t) X(t)

    X(t1)

    X(t) = X(t 1)G(t); X(0) = T

    Reference:

    [Hugh00] B.L. Hughes, Differential space-time modulation, IEEE Trans. Inform. Theory,Vol. 46, No. 7, pp. 25672578, Nov. 2000.

  • 8/2/2019 SchlegelNotes

    49/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 48

    Differential Reception

    Unitary space-time codes can be differentiallydetected. This also requiresno channel state information, which is gleaned from a previously receivedSTC symbol.Considering a composite STC word consisting of two unitary STC words:

    X2(t) = [X(t 1) : X(t 1)G]we see that is fullfills the unitary condition for non-coherent detection:

    X2(t)X2(t)+ = [X(t 1) : X(t 1)G] [X(t 1) : X(t 1)G]+= X(t 1)X(t 1)+ + X(t 1)GG+X(t 1)+ = 2NtI

    The optimal detector for this 2-block code selects

    X = arg maxX

    tr

    Y2X+2 X2Y

    +2

    = arg maxX trY(t

    1)

    Y(t 1)T

    NtI NtG(t)NtG(t)+ NtI Y(t 1)+

    Y(t)+ X = arg max

    X

    RtrY(t 1)GY(t)+

    +

    z1

    ()+Y(t)+Y(t 1)

    Rtr{G1}

    Rtr{G2}

    Rtr{GM}

  • 8/2/2019 SchlegelNotes

    50/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 49

    Algebra of Unitary STCs

    Cyclic Codes: A cyclic group code for Nt = 2 antennas is defined by

    G =

    M 00 kM

    M = exp(2j/M)

    For example for 4 = j we obtain the code

    G =

    j 00 j

    ,

    1 00 1

    ,

    j 00 j

    ,

    1 00 1

    ,

    These codes are all of the form I, G, G2, , GM1.

    8-PSK cyclic code:

    1 0

    0 1

    ej/4 0

    0 ej/4

    j 00 j

    ej3/4 0

    0 ej3/4

    1 0

    0 1

    ej5/4 00 ej5/4

    j 0

    0 j

    ej7/4 0

    0 ej7/4

    Di-Cyclic Codes: For M 8 there exist also the dicyclic group codes:

    G =

    M/2 00 M/2

    ,

    0 11 0

    Quaternion code:

    1 00 1

    0 1

    1 0

    0 j

    j 0

    j 00

    j 1 0

    0

    1

    0 1

    1 0

    0 j

    j 0

    j 0

    0 j

  • 8/2/2019 SchlegelNotes

    51/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 50

    Equivalent Codes

    Unitary space-time group codes have a number of desirable properties:

    Uniform Error ProbabilityEach codeword of a unitary space-time code has the same error perfor-mance. Consider the STC X = T GG1

    Y = HX + N = HTGG1 + N

    Since G is unitary, the rotated received signal

    Y = Y G+1 = HT G + N G+1 = H T G

    X

    +N

    where N and N have the same noise statistics. Therefore G and G =GG1 have the same error performance.

    Equivalent CodesEquivalent codes are codes whose codewords are related by

    X = U CV; U, Vunitary

    Proof: Again consider

    Y = HX + N = HU XV + N= HX + N

    Y = Y V+ = HX + N

    Since H and H have the same statistics, and the noises N andN have the same statistics the first and last line express equivalentequations and X and X have the same error performance.

  • 8/2/2019 SchlegelNotes

    52/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 51

    Optimal STCs

    Space-time codes are defined to be optimalif rank ((X X)(XX)+) =Nt and if det ((XX)(X X)+) is maximal.

    Theorem:Optimal space-time group codes are unitary

    Proof: We decompose the transmission matrix T using the SVD into

    T = U DV+ = U D[I : 0]V+

    and calculate the product the determinant using the two STCsX = T I, X = T G:

    det

    (X X)(X X)+ = minG

    T(I G)(I G+)T+= min

    G U D[I : 0]V+(I G)(IG+)V[I : 0]+D+U+

    = |DD+|minG

    [I : 0]V+(I G)(IG+)[I : 0]+Making |DD+| non-singular and as large as possible is a necessarycondition, and

    |DD+|

    1

    Nttr(DD+)

    Nt NNtc

    with equality if and only if D =

    NcI. Therefore

    T =

    NcU[I : 0]V+ T T+ = NcI

    And hence optimal STC group codes are unitary, i.e., they have orthogo-nal rows. The codewords which drive each of the antennas are orthogonal.

  • 8/2/2019 SchlegelNotes

    53/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 52

    Optimal STC Group Codes

    Again appealing to the SVD we see that maximizing

    p =[I : 0]V+(IG)(I G+)V[I : 0]+

    is quite independent and can be done using T0 =

    Nc[I : 0].

    Note: The initial matrix T0 is quite uninteresting since it simply amountsto antenna switching.

    Theorem:Hughes [Hug02] shows that the only STC group codes that areoptimal are equivalent to two families, (i)

    Cyclic Group Codes:

    G =

    exp(2jk1/M) 0 00 exp (2jk2/M) 0...

    ..

    .

    . ..

    ..

    .0 0 exp(2jkt/M)

    where 0 < k1 kt < M are odd integers, and (ii)Di-Cyclic Group Codes:

    G =

    exp(4jk1/M) 0 0

    0 exp (4jk2/M) 0...

    ... . . ....

    0 0 exp(4jkt/M)

    , 0 It/2

    It/2 0

    where 0 < k1 kt < M/2 are odd integers.

  • 8/2/2019 SchlegelNotes

    54/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 53

    Practical Codes

    If the transmission matrix T is chosen to be Hadammard, e.g,

    1 1 1 11 1 1 11 1 1 11 1 1 1

    then the actual transmitted symbol on each of the antennas are PSKsymbols.

    Lower Cardinality Codes: Often codes exist which have a lower car-dinality of the signal constellation, which is emminently important forimplementation. For example, the following code is equivalent to theNt = 4 16-PSK cyclic code, but takes symbol only from a QPSK constel-lation!

    I, G0, G

    20, , G150

    ; G0 =

    0 0 0 j

    1 0 0 0

    0 1 0 0

    0 0 1 0

    Reference:

    [Hug02] B.L. Hughes, Optimal space-time constellations from groups, IEEE Trans. Inform.Theory, submitted, March 2000.

  • 8/2/2019 SchlegelNotes

    55/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 54

    Space-Time Communications Systems - Channel Layering

    MIMO Channel ProcessingFrom previous results, we note that

    MIMO channels are classified into low and high-rank channels. If Es/N0 = is large, the capacity advantage of a high-rank MIMO

    channel over a single antenna (low-rank) channel is

    log Nt

    N1 Nn=1 d2nNtNr

    bits/use. If Es/N0 is small, the system is power limited and the high-rank

    MIMO channel capacity advantage cannot be exploited.

    Complexity of the optimal maximum-likelihood decoding of high-rank MIMO channels grows exponentially with additional antennas.

    from which we can conclude

    1. Optimal processing for low SNR or low-rank MIMOchannels consists of using a single transmit antennawaveform and performing maximal ratio combining ofthe multiple receive antennas.

    2. High-rank MIMO channels with a large number of an-tennas require methods to reduce the complexity, such

    asspace-time layering

    .

    References:

    [BaS01] Z. Bagley and C. Schlegel, Classification of correlated flat fading MIMO channels(multiple antenna channels), Canadian Information Theory Workshop, CITW2001,Vancouver, BC, June 36, 2001.

    [BaS02a] Z. Bagley and C. Schlegel, Pair-Wise Error Probability for Space-Time Codes andDierential Detection, submitted to IEEE Trans. Commun., January 2002.

    [BaS02b] Z. Bagley and C. Schlegel, Efficient Processing for High-Capacity MIMO Channels,

    submitted to JSAC, MIMO Systems Special Issue, April, 2002.

  • 8/2/2019 SchlegelNotes

    56/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 55

    Processing High-Rank MIMO Channels

    A number of methods known as space-time layeringexist for reducing thecomplexity by processing only a small portion of the total data stream atonce.

    One such method, discussed in [Fos96], is a combination of the two lay-ering methods discussed here:

    1. Successive information processing:

    Optimal method decomposes the channel via successive cancellation. Derived from the chain rule of mutual information. Compared to the parallel channels obtained by SVD.

    2. Parallel information processing:

    A subspace signal projection and successive cancellation structureis shown to be an asymptotically optimal (at high SNR) processingmethod.

    A simplified method employing only sub-space projection filtering isexamined.

    References:

    [Fos96] G.J. Foschini, Layered space-time architecture for wireless communication in a fad-ing environment when using multi-element antennas, Bell Labs Technical Journal, vol.1 (2), August 1996, pp. 4159.

  • 8/2/2019 SchlegelNotes

    57/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 56

    Channel Layering and Successive Information Processing

    MIMO Capacity is results based on successive interference cancellationand compare the achievable rates at the different cancellation stages withthose of the parallel channels obtained by SVD.

    The chain rule of mutual information states

    I(c; y|H) =Nt

    k=1I(ck; y|H, c0, , ck1) .

    which implies a successive interference cancellation structure since

    I(ck; y|H, c0, , ck1) is the maximum information rate oflayer ck given that signals c0, , ck1 are known exactly.

    Notes:

    For an additive channel knowledge of some users signals c0, , ck1implies cancelling these signals

    The successive interfernce cancellation receiver and an optimal re-ceiver will NOT necessarily arrive at the same decoded signals.

    Good, that is capacity-achieving receivers will be need at each level.

  • 8/2/2019 SchlegelNotes

    58/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 57

    Interfernce Cancellation

    The received signal at level k after cancellation is

    yk = y k1r=1

    hrcr = hkck +

    Ntk+1

    hrcr + n,

    Since the capacity achieving distributions at each level are Gaussian, the

    residual interference plus noise

    n =Ntk+1

    hrcr + n

    is also Gaussian, and so the mean and co-variance statistics of n com-pletely characterize the noise and interference.

    Enn+ =

    EsNt

    [hk+1, , hNt][hk+1, , hNt]+ + N0I

    = N0

    EsN0Nt

    AkA+k + I

    = N0Qk,

    Hence, the sub-layer capacity of a vector channel embedded in correlatedGaussian noise is given by [CoT91]:

    I(Ck; y|H, c0, , ck1) = log detI +Es

    N0NtQ1k hkh

    +k

    References:

    [CoT91] T. Cover and J. Thomas, Elements of Information Theory, Wiley, 1991.

  • 8/2/2019 SchlegelNotes

    59/68

  • 8/2/2019 SchlegelNotes

    60/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 59

    Channel Layering via Subspace Projection

    Objective: Show that subspace signal projection provides an asymptot-ically optimal preprocessing methodology for high signal-to-noise ratiovalues which eliminates all residual interference at each stage k.

    The second matrix term from the subchannel capacity formula

    NtQ1k hkh

    +k , with

    EsN0

    can be thought of as an effective signal-to-noise ratio at the k-th layer.

    The matrix inversion lemma1 allows us to manipulate the term into

    NtQ1k hkh

    +k =

    1

    N0

    AkA

    +k +

    Nt

    I

    1 EsNt

    hkh+k

    =

    Nt I Ak A+k Ak +

    Nt

    I1

    A+khkh+k .

    For large signal-to-noise ratios, Nt/ 0, and

    NtQ1k hkh

    +k

    Nt

    I Ak

    A+k Ak

    1A+k

    hkh

    +k =

    NtMkhkh

    +k

    However the matrix Mk =

    I Ak

    A+k Ak1

    A+k

    is the projection

    matrix [HoJ90] onto the subspace which is orthogonal to Ak.

    References:

    [HoJ90] R.A. Horn and C.J. Johnson, Matrix Analysis, Cambridge University Press, NewYork, 1990.

    1(A + BCD)1 = A1 A1B(DA1B + C1)1DA1

  • 8/2/2019 SchlegelNotes

    61/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 60

    Channel Layering and Parallel Information Processing

    With the following linear algebraic results

    Mk = MkM+k , i.e. the projection matrix Mk is idempotent. det(I + AB) = det(I + BA)

    the term in question is equivalent to

    NtQ1k hkh+k Nth+k M+k Mkhk = NtMkhk2

    which then suggests the asymptotically optimal information processingvia successive cancellation and interference projection:

    +

    +

    + +

    +

    +

    y1

    y2

    yk

    n1

    n2

    nk

    FEC Dec.

    FEC Dec.

    FEC Dec.

    M1

    M2

    Mk

    CancellationFront-End

    This is the methodology of the original BLAST system.

  • 8/2/2019 SchlegelNotes

    62/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 61

    Channel Layering and Parallel Information Processing

    Properties of sub-space projection filtering:

    The projection matrix Mk completely eliminates the interferencefrom the as yet un-cancelled component channels ck+1, , cNt.

    The price is a loss in signal-to-noise ratio, i.e., Mkh2k hk2.For the case of uncorrelated random equal-energy array responses, the

    average loss has been calculated by a number of authors [1, 2, 3] in thecontext of random CDMA communications. Using these results the lossfactor, k, at cancellation stage k can easily be calculated as

    k =k + (Nr Nt)

    Nr; Nr Nt

    Remarks:

    According to k, the first state in an 8 antenna system loses18 9dB.

    This loss is with respect to the optimal symmetric capacity. This is nicely evident in the 8x8 example channel figure, even though

    that is only a single sample channel.

    The technique proposed in [Fos96] essentially uses this projection/cancellationapproach. An additional cyclic rotation over the antennas has no effect onthe capacity, but it does even out the data rates on the different channels.

    References:

    [ARS97] P.D. Alexander, L. Rasmussen, and C. Schlegel, A class of linear receivers for codedCDMA, IEEE Trans. Commun., Vol. 45, No. 5, pp. 605610, May 1997.

    [TsH99] D.N.C. Tse and S.V. Hanly, Linear multiuser receivers: effective interference, eectivebandwidth and user capacity, IEEE Trans. Inform. Theory, Vol. 45, No. 2, March 1999.

    [VeS99] S. Verdu and S. Shamai, Spectral efficiency of CDMA with random spreading,IEEE Trans. Inform. Theory, Vol. 45, No. 2, March 1999.

  • 8/2/2019 SchlegelNotes

    63/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 62

    Parallel Cancellation: A Simplified Decoding Strategy

    Clearly the long decoding delays involved with the serial cancellationmethod may not be desirable. A lower complexity, sub-optimalmethod omits the cancellation stage:

    The preprocessing filters are set to suppress all interfering signaldimensions, i.e.,

    rank(M1) = rank(M2) =

    = rank(MK).

    This method suffers a significant performance loss w.r.t. optimalprocessing. The SNR loss is given by k with k = 1.

    For a Nt = Nr = 8 antenna system, for example, single channelprojection will incur an SNR loss of 1/8, which is 9dB, w.r.t. or-thogonal channels. Note, however, that adding just a single extrareceive antenna reduces this loss to 2/9, which is about 6.5dB.

    A loss of 3dB corresponds to a capacity loss of about 1 bit per com-plex dimenstion for large SNR values.

    Comparison of Layering Methods

    The next figure shows the waterfilling capacity, the successive cancella-tion/projection capacity and the capacity of 8 single projected subchan-nels for a channel with i.i.d. path gains.

    Notes on the figure:

    1. Single channel projection processing loses 9dB as predicted.

    2. Optimal successive cancellation processing loses 8.6 bits w.r.t. or-thogonal channels.

    3. This is also close to the inherent loss of the random MIMO channelw.r.t. orthogonal channels.

    4. The addition of a few extra receive antennas or processing channelsin pairs can significantly relieve this loss.

  • 8/2/2019 SchlegelNotes

    64/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 63

    Channel Layering and Parallel Information Processing

    0 2 4 6 8 10 12 14 16 18 200

    5

    10

    15

    20

    25

    30

    35

    40

    45

    Capacity

    bits/channelus

    e

    /Nt dB

    WaterfillingCapacity

    OrthogonalChannels

    SymmetricCapacity

    SuccessiveProjection

    Capacity ParallelProjection

    Capacity

    Capacity

    Capacitiesof differentProjectionLayers

    9dB loss

    8.6 bits

    Conclusions:

    1. For high-rank channels, optimal reception strategies consist of succes-sive cancellation and projection operations to generate interference-free component channels used by a smaller number of antennas. Fornearly i.i.d. channels, single antenna layers are sufficient.

    2. Optimal transmission on low-rank channels is the same as optimaltransmission on channels with low signal-to-noise ratios and consistsof pooling all transmission energy into a single transmit antenna,

    possibly using the transmit array to beamform.

  • 8/2/2019 SchlegelNotes

    65/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 64

    Iterative Information Processing

    In conjunction with forward error control (FEC) coding, the MIMOchannel can be viewed as a serially concatenated communications system:

    urvr v

    r

    InterleaverBinary

    Encoder

    TrellisEncoder/Mapper

    Precoder

    xr

    Iterative Information Processing operates according to the well-knownTurbo Principle with component decoders exchanging soft information:

    DeinterleaveMarginalize

    InterleaveCombineMIMO

    Soft-OutputDecoder

    APPBinarySoft-OutputDecoder

    y

    {a(ur)}

    {e(ur)}{Pra(xr)}

    {Pre(xr)

    }

    DIV

    +-

    Notes:

    Simple binary encoders can be used and LLR processing The MIMO soft decoder can be a

    1. Canceller

    2. APP approximation

  • 8/2/2019 SchlegelNotes

    66/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 65

    Example: Differential Space-Time Coding

    Simple differential space-time codes such as the Quaternion Code canbe used inner codes. They have simple trellis representations which canbe used by an APP decoder.

    EXIT chart The figure below shows the extrinsic information transferchart of this serially concatenated systems for several binary convolutionalcodes of rate R = 2/3 bits/symbol. This chart can be used to determinethe onset of the Turbo Cliff of the system.

    0

    0.5

    1

    1.5

    2

    2.5

    3

    0 0.5 1 1.5 2 2.5 3

    -1dB

    -1.6dB

    Parity

    4-state

    16-state

    64-state

    Ip

    Ie

    Dashed:Quaternion code

    References:

    [GrS01] A. Grant and C. Schlegel, Differential turbo space-time coding, Proc. IEEE Infor-mation Theory Workshop, 2001, pp. 120122, 2001.

    [ScG01] C. Schlegel and A. Grant, Concatenated space-time coding, IEEE Trans. Inform.

    Theory, to appear.

  • 8/2/2019 SchlegelNotes

    67/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 66

    Differential Space-Time Coding: Error Performance

    The error performance of this system is shown for several binary codes.It is counter-intuitive that the simple [3,2,2] parity check code shouldoutperform all stronger codes.These results agree with the predictions from the EXIT chart

    Pinch-o

    ffSNR

    Pinch-o

    ffSNR

    Pinch-o

    ffSNR

    -1.5 -1.4 -1.3 -1.2 -1.1 -1 -0.9 -0.8 -0.7 -0.6107

    106

    105

    104

    103

    102

    101

    1Bit Error Probability (BER)

    EbN0

    Parity

    4-state

    16-state

    64-state

  • 8/2/2019 SchlegelNotes

    68/68

    cChristian Schlegel, Zachary Bagley MIMO Channel Tutorial, WOC02, Banff 67

    Iterative Detectors: Large Systems

    This principle can be applied to larger systems, but their performanceis not that near-capacity anylonger. The results below are taken from[HtB01]:

    2 4 6 8 10 12010-6

    10-5

    10-4

    10-3

    10-2

    10-1

    100

    C

    apacityQPSK

    1

    .6dB

    C

    apacity

    1

    6QAM3.8

    dB

    C

    apacity64QAM6.4

    dB

    QPSK16QAM

    64QAM

    Eb/N0 [dB]

    BER 8 8 antenna system