chapter 1 financial statistics

Upload: domagoj-dresar

Post on 14-Apr-2018

219 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/30/2019 Chapter 1 Financial statistics

    1/44

    Financial Statistics

    Master in FinanceUniversidad Carlos III de Madrid

    Esther Ruiz and Diego Friesoli

    2011-2012

  • 7/30/2019 Chapter 1 Financial statistics

    2/44

    Chapter 1. Introduction: Basic concepts

    The use of quantitative analysis to make better investment decisions

    Mark J.P. Anson

    Defusco et al. (2004)

    Objective: Introduce some statistical tools useful for analyzing financial timeseries.

    Outline:

    1. Why quantitative tools are important for financial professionals

    2. Differences between cross-sectional and time series data

    3. Covariance and strict stationarity

    4. Correlations and independence: differences between differencemartingala, white noise, strict white noise and Gaussian white noise

    5. Describing variables: Unconditional and conditional moments

    6. Linear and non-linear models

  • 7/30/2019 Chapter 1 Financial statistics

    3/44

    1.1 Why quantitative tools are important for financial

    professionals

    Financial economics is a highly empirical discipline. Despite itsempirical nature, like other social sciences, financial

    economics is almost entirely nonexperimental. Therefore,the primary method of inference for the financialeconomist is model-based statistical inference: Financialeconometrics.

    The main distinction between econometrics in other areasand financial econometrics is the central role thatuncertainty plays in both financial theory and its empiricalimplementation: The substance of every financial modelinvolves the impact of uncertainty on the behaviour ofinvestors and, ultimately, on the the market prices.

    Campbell, Lo and McKinlay (1997)

  • 7/30/2019 Chapter 1 Financial statistics

    4/44

    Financial markets are very complicated places. There are many

    interwoven variables that can affect the price of securities inan investment portfolio:

    Macroeconomic factors: level of interest rates, currentaccount deficits, government spending and economic cycles.

    Factors peculiar to the company: cash flow, working capital,book-to-market value, earning growth rates, dividend policy,debt-to-equity ratios.

    Financial market variables: beta (measure of systematic risk).

    , ,to earnings announcements, momentum trading.

    Only quantitative techniques can help to understand the largenumber of plausible variables that can impact the price of a

    security.

  • 7/30/2019 Chapter 1 Financial statistics

    5/44

    1.2 Differences between cross-sectional and time

    series data: stochastic processes

    Economic and financial data can take one of three forms:

    a) Cross-sectional data. At a given moment of time we observe oneor several variables corresponding to different economic orfinancial units. Usually microeconomic data.

    b) Time series data. We observe one or several variables over time.These are often macroeconomic and financial variables.

    c) Panel data. One or several variables corresponding to differenteconomic entities are observed over time.

    In any case, the data can be univariate (only one variable is observed)or multivariate (several variables are observed).

  • 7/30/2019 Chapter 1 Financial statistics

    6/44

    Cross-sectional observations are usually

    assumed to be independent and identically

    distributed (iid):

    Independence: The order of the observations is

    not important

    Identically distributed: All observations are

    generated by the same (univariate or multivariate)

    random variable.

  • 7/30/2019 Chapter 1 Financial statistics

    7/44

    Example of multivariate (bivariate) cross-sectional data:

    i) Event study: Information content of quarterly earnings

    announcements for firms in the Dow Jones Industrial Index

    ii) Relation between expected returns and market betas.

    2000

    2400

    Y vs. X

    0

    400

    800

    1200

    1600

    -400 400 8001200 2000 2800

    X

    Y

  • 7/30/2019 Chapter 1 Financial statistics

    8/44

    A time series is a succession of observations of

    one or several variables taken over time.

    When just one variable is observed, we talk

    about a univariate time series:Tyyy ,...,, 21

    However, when several variables are observed

    over time, we have a multivariate time series:

    T

    T

    T

    zzz

    xxx

    yyy

    ...

    ...

    ...

    21

    21

    21

  • 7/30/2019 Chapter 1 Financial statistics

    9/44

    Time series data are characterized by:a) Dependent. We can no longer consider that

    the observation at time t is independent from

    what we observe at time t-1. As aconsequence, and unlike what happens in thecross-sectional data, the order in which the

    .b) The observations cannot be considered as

    identically distributed: They are observed in a

    context that evolves over time. We cannotconsider that we have T observations of thesame random variable .

  • 7/30/2019 Chapter 1 Financial statistics

    10/44

    Examples of (univariate ) time series data

    We may analyse historical earnings per share (EPS) to

    forecast future EPS.

    We may use a companys past returns to infer its risk:Prices and returns of SP500 observed daily from

    4/1/1993 until 20/9/2011.

    -.10

    -.05

    .00

    .05

    .10

    .15

    400

    600

    800

    1,000

    1,200

    1,400

    ,

    500 1000 1500 2000 2500 3000 3500 4000 4500

    S&P500 RETURN_DAILY

  • 7/30/2019 Chapter 1 Financial statistics

    11/44

    Example of multivariate time series: Monthly interest rates observed

    from Sept. 1987 to Sept. 2006 for different maturities: i) 1-monthLondon interbank bid (LIBID) rate for US Dollars; ii) 3-month US

    treasury-bill rates; iii) 6- 3-month US treasury-bill rates; iv) US

    government bond rates for 1, 2, 3, 5, 7 and 10 years.

  • 7/30/2019 Chapter 1 Financial statistics

    12/44

    In order to describe the mechanism generating atime series, we need to assume that we have arandom variable at each moment of time (astochastic process: succession of variables

    ordered in time)

    Then, each random variable generates oneobservation, obtaining our time series.

  • 7/30/2019 Chapter 1 Financial statistics

    13/44

    Cross-sectional Time series

    Univariate z1, z2,,zn

    n independent observations from

    the (unidimensional) variable Z

    with (identical) distribution d(Z)

    y1, y2,,yT

    1 observation from the (univariate)

    random process (Y(1), Y(2),,Y(T))

    with joint distribution d(Y(1),

    Y(2),,Y(T))

    Multivariate z11, z12,,z1n

    z21, z22,,z2n

    y11, y12,,y1T

    y21, y22,,y2T.

    zk1, zk2,,zkn

    n independent observations from

    the (multidimensional) variable

    (Z1, Z2,,Zk)

    with (identical) (joint)distribution d(Z1, Z2,,Zk)

    .

    yk1, yk2,,ykT

    1 observation from the

    (multivariate) random process

    (Y1(1), Y2(1),, YK(1),Y1(2),

    Y2(2),Yk(2),, Y1(T), Y2(2),Yk(T))with joint distribution d(Y(1),

    Y(2),,Y(T))

  • 7/30/2019 Chapter 1 Financial statistics

    14/44

    1.3 Covariance stationarity and strict stationarity

    A stochastic process can, in

    principle, generate aninfinite number of

    ...

    ...

    ...

    )(...)2()1(

    )2()2(2

    )2(1

    )1()1(2

    )1(1

    T

    T

    yyy

    yyy

    TYYY

    period t=1,,T............

    ...)3()3(

    2)3(

    1 Tyyy

  • 7/30/2019 Chapter 1 Financial statistics

    15/44

    If we had several realizations of the process, we

    could compute the mean of each randomvariable that constitutes the process according

    m

    ym

    j

    jt

    t

    ==1

    )(

  • 7/30/2019 Chapter 1 Financial statistics

    16/44

    Unfortunately, in practice,

    we only have a singlerealization, and Tyyy

    TYYY

    ...

    ...

    )(...)2()1(

    21

    compute the moments

    of the random variables

    that constitute the

    process.

  • 7/30/2019 Chapter 1 Financial statistics

    17/44

    In order to estimate the moments (mean,

    variance etc) of every variable in time it isnecessary to restrict the properties of the

    .

    The restrictions that are usually imposed are

    called stationarity.

  • 7/30/2019 Chapter 1 Financial statistics

    18/44

    A process is (covariance, weakly) stationary if

    i)

    ii)

    ttYE = ,))((

    ttYVar

  • 7/30/2019 Chapter 1 Financial statistics

    19/44

    The first condition allows us to estimate the mean (which is

    common to all variables) using the sample mean:

    yT

    i

    The same can be said about the other conditions.

    Tyi

    ===1

  • 7/30/2019 Chapter 1 Financial statistics

    20/44

    In practice, the conditions of stationarity allowus to estimate the autocorrelations (the mean

    of the linear dependence between the

    observations that are h periods apart) with

    the sample autocorrelations

    T

    yyyy

    hch

    T

    hthtt

    == +=

    1

    ))((

    )()(

  • 7/30/2019 Chapter 1 Financial statistics

    21/44

    Alternatively, we can define a strictly stationaryprocess as follows:

    A stochastic process {Yt} is strictly stationary if themultivariate distribution function of {Yi,Yi+1,,Yi+k-1}

    and {Xj,Xj+1,,Xj+k-1} are identical, for all integers i,j andfor all k>0.

    A special example of a strictly stationary process is

    Normal variables. All multivariate distributions arethen determined by the mean and varianceparameters.

    However, a sequence of independent Normalvariables whose variance depend on the day of theweek is not a strictly stationary process.

  • 7/30/2019 Chapter 1 Financial statistics

    22/44

    Weak stationarity does not imply strict

    stationarity as only there is guarantee that thefirst two order moments are constant overtime.

    Under Normality, weak stationarity implies.

    Strict stationarity does not imply weak

    stationarity unless the second order momentis finite.

  • 7/30/2019 Chapter 1 Financial statistics

    23/44

    1.4 Correlation and independence: differences

    between martingale difference, white noise, strictwhite noise and Gaussian white noise

    The most popular measure of the dependence

    between two random variables is thecorrelation which is given by

    ( )( )[ ]YX YXE

    The correlation only measures lineardependence between the variables.

    In the context of (stationary) time series, theautocorrelation is given by

    YX

    ,

    ( )( )[ ] ( )( )2

    ][),(

    =

    =

    htt

    htt

    hthttthtt

    yyEyyEyyCorr

  • 7/30/2019 Chapter 1 Financial statistics

    24/44

    In general, if the autocorrelations are zero we can only

    conclude that there are is not a linear dependencebetween sucessive observations, i.e. we cannot predictthe expected future observations by looking at the pastevolution of the series.

    However, zero autocorrelations do not implyindependence. It is possible that there are nonlinear

    .

    In a Gaussian process, if there is dependence betweensuccesive variables, this dependence can only be linear.Therefore, if the autocorrelations are zero (i.e. there isnot linear dependence), we can conclude that thevariables in the process are independent.

  • 7/30/2019 Chapter 1 Financial statistics

    25/44

    Consider first an example of a nonlinear relation

    between two random variables in a cross-sectionalexample

    4

    6

    8

    10

    12y=x

    2+a

    The sample correlation between x and y is 0.1521 so,

    there is not linear dependence. However, it is obious

    that the variables are not independent.

    -4 -3 -2 -1 0 1 2 3 4-4

    -2

    0

    2

  • 7/30/2019 Chapter 1 Financial statistics

    26/44

    In this case, x is a NID(0,1) variable but y is

    clearly non-Normal

    6

    8

    10

    12

    -3 -2 -1 0 1 2 3

    -2

    0

    2

    4

    x

    y

  • 7/30/2019 Chapter 1 Financial statistics

    27/44

    Consider, now a time series example: the SP500 returns

    described above. The marginal distribution (samplemoments) is given by

    800

    1,200

    1,600

    2,000

    Series: RETURN_DAILYSample 1 4716Observations 4715

    Mean 0.000215Median 0.000594Maximum 0.109572Minimum -0.094695Std. Dev. 0.012214Skewness -0.244155

    The marginal distribution of daily returns is clearlyleptokurtic. Therefore, the marginal distribution is non-Normal and returns are not Gaussian.

    0

    400

    -0.10 -0.05 0.00 0.05 0.10

    .

    Jarque-Bera 14503.73Probability 0.000000

  • 7/30/2019 Chapter 1 Financial statistics

    28/44

    The sample autocorrelations of returns and of

    squared returns are given by

  • 7/30/2019 Chapter 1 Financial statistics

    29/44

    Therefore, daily returns are serially uncorrelated

    but they are not independent. Their squares(volatilities) are correlated.

    By looking to past evolution of daily returns, we

    can predict the future evolution of daily

    volatilities.

  • 7/30/2019 Chapter 1 Financial statistics

    30/44

    Three categories of uncorrelated processes are of

    particular importance in financial econometrics:White noise, strict white noise and martingaledifferences. These all are zero mean processes.

    A process is white noise if i) it is stationary, ii)uncorrelated, and iii) has zero mean.

    The absence of correlation from a white noisedoes not im l inde endence. The stron er

    assumption that the variables are independentand identically distributed (iid) with zero means,defines strict white noise.

    A martingale difference process has the followingfair playproperty

    E[Yt|Y1,,Yt-1]=0

  • 7/30/2019 Chapter 1 Financial statistics

    31/44

    Gaussian WN is SWN because uncorrelated

    variables are independent when theirmultivariate joint distribution is Normal.

    The distinction between WN and SWN isimportant when considering non-Gaussianprocesses as those required to model returns.

    ,

    returns might have zero mean, be stationary ,be non-autocorrelated, (WN) and possesvolatility clustering. Then, the process is not

    SWN because information about recentvolatility influences the variance ossubsequent returns.

  • 7/30/2019 Chapter 1 Financial statistics

    32/44

    A MD is a zero mean, uncorrelated process.

    Therefore, if it is further stationary, then it willbe WN.

    WN may not be MD because the conditionalexpectation of an uncorrelated process can be

    a nonlinear function

    Bilinear process of Granger and Newbold (1986)which is a WN when 0

  • 7/30/2019 Chapter 1 Financial statistics

    33/44

    1.5 Describing variables: Unconditional and conditional

    distributionsThe complete description of the stochastic process is

    given by thejoint distribution

    When the joint distribution is multivariate Normal, the

    rocess is said to be a Gaussian rocess.

    ( )Tyyd ,...,1

    A Gaussian process is described by the mean andcovariance matrix given by

    =

    TTy

    yy

    E

    2

    1

    2

    1

    [ ]

    =

    221

    2

    2

    212

    11221

    21

    2

    1

    TTT

    T

    T

    T

    T

    yyy

    y

    y

    y

    E

  • 7/30/2019 Chapter 1 Financial statistics

    34/44

    In practice, the dimension T is often very large

    (in financial time series often over 2000).

    Consequently, we look for alternative ways of

    describing the distribution of the stochasticprocess.

    There are two different univariate distributionsof interest: marginal and conditional

    distributions.

  • 7/30/2019 Chapter 1 Financial statistics

    35/44

    The marginal distribution is the scalar distribution of each of the variables in theprocess.

    When the joint distribution is Normal (the process is Gaussian), the marginaldistribution of aech of the variables in the process is Normal. However, the Normality ofthe marginal distributions does not guarantee the joint Normality.

    When dealing with univariate marginal distributions we lose information about thedependence.

    The stationarity conditions refer to the moments of the marginal distribution.

  • 7/30/2019 Chapter 1 Financial statistics

    36/44

    However, we know that there is dependence

    between succesive observations. Therefore, itcould be of interest to analyse the distribution

    of yt

    conditional on y1

    ,,yt-1

    .

    If the rocess is Gaussian the conditional

    distribution is Normal for all t. However, theconditional distribution can be Normal and

    neither the marginal nor the joint being

    Normal.

  • 7/30/2019 Chapter 1 Financial statistics

    37/44

    Example 1

    i) Consider the following model for a time series:

    where is a strict white noise sequence with variance

    In this case, the conditional mean is given by

    ttt yy ++= 18.05

    11

    11 8.05][],...,|[

    +== ttt

    tt yyEyyyE

    t2

    The marginal mean is constant (stationarity condition)while the conditional mean evolves over time.

    )8.01(5][8.05][ 1

    =+= tt yEyE

  • 7/30/2019 Chapter 1 Financial statistics

    38/44

    Consider now the conditional variance

    The marginal variance is given by

    ( ) 221

    2

    1111 ))((),...,|( ===

    t

    tt

    tt

    ttt EyEyEyyyVar

    2

    2 ==

    EVar

    Note that the marginal variance is larger than

    the conditional: using the information in the

    past, we reduce the uncertainty about thefuture.

    8.01

  • 7/30/2019 Chapter 1 Financial statistics

    39/44

    If we further assume that is Gaussian, then

    the conditional distribution of is also

    Normal. Furthermore, given that is linear,

    the marginal distribution is also Normal.

    t

    ty

    ty

  • 7/30/2019 Chapter 1 Financial statistics

    40/44

    Example 2

    Consider the following process

    Conditional mean

    2

    14.06.0 += ttt yy

    0)(4.06.0)(2

    1 =+=

    ttt EyyE

    Marginal mean

    Both the conditional and the marginal mean areconstant over time and equal to zero.

    0))(()(1

    ==

    tt

    t yEEyE

  • 7/30/2019 Chapter 1 Financial statistics

    41/44

    Conditional variance

    Marginal variance

    2

    1

    2

    1

    2

    1

    2

    114.06.0)()4.06.0()()(

    +=+== tt

    ttt

    tt

    tyEyyEyVar

    ]4.06.0[)]()4.06.0[()()(2

    1

    2

    1

    2

    1

    2 =+=+==

    ttt

    ttt yEEyEyEyVar

    The marginal variance is constant (stationarity

    condition) but the conditional varianceevolves over time.

    1)4.01/(6.0=

  • 7/30/2019 Chapter 1 Financial statistics

    42/44

    If we further assume that is Gaussian, thenthe conditional distribution of is also

    Normal. However, given that is non-linear,

    the marginal distribution is not Normal.

    t

    ty

    ty

  • 7/30/2019 Chapter 1 Financial statistics

    43/44

    1.6 Linear and non-linear models

    Consider again the relationship between one variable andits own past evolution

    yt=f(y1,,yt-1)+atwhere at is a white noise.

    between yt and its own past evolution. If at is a strictwhite noise (possibily non-Gaussian), then there is notany further dependencies between yt and its past. Inthis case, we say that the model is lineal.

    However, if at is an uncorrelated white noise with a non-Gaussian distribution, then it is possible that yt mayalso have non-linear dependencies with its past.

  • 7/30/2019 Chapter 1 Financial statistics

    44/44

    In a linear model, if the conditional

    distribution is Normal, the marginal is Normal. However, in nonlinear models, the conditional

    can be Normal and the marginal being non-

    Normal. Maravall (1983) shows that in a linear

    ,

    autocorrelations of sqaures are equal to thesquared autocorrelations.

    Note that a linear process can be also written

    as a linear combination of past realizations ofthe innovations, .

    t