discriminant ana

Upload: niknish

Post on 03-Apr-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/29/2019 Discriminant Ana

    1/8

    Discriminant Analysis

    Discriminant analysis is a statistical techniquethat is used to classify the dependent variable

    between two or more categories.

    Discriminant analysis also has a regression

    technique, which is used for predicting the value

    of the dependent categorical variable.

    In discriminant analysis, we predict the value of

    two categories.

    When the category of a dependent variable is

    more than two, it will simply be an extension of

    the simple discriminant analysis called the

    multiple discriminant analysis.

  • 7/29/2019 Discriminant Ana

    2/8

    Multiple discriminant analysis and MANOVA

    are considered to be similar because both tests

    share many similar assumptions and tests.

    The F test (Wilks' lambda) is used to test

    whether or not the discriminant model is

    significant as a whole.

    If the F test shows the overall significance of the

    model, then the individual variables are accessed

    to see which variable will move the significance

    from the group mean.

    Discriminant analysis also assumes several

    assumptions, such as multiple linear regressions,

    linear relationships, homoscedastic relationships,

    untruncated interval data, etc.

    http://www.statisticssolutions.com/manovahttp://www.statisticssolutions.com/multiple-regressionhttp://www.statisticssolutions.com/manovahttp://www.statisticssolutions.com/multiple-regression
  • 7/29/2019 Discriminant Ana

    3/8

    Logistic regression is the alternative technique

    and it is frequently used in place of discriminant

    analysis when data does not meet the

    assumptions.

    Key Terms and Concepts:

    Discriminating variables: Discriminating

    variables are independent variables that are used

    to predict the dependent variable. These

    variables are also called the predictors.

    The criterion variable: Dependent variables are

    also called the criterion variables.

    Discriminant function: The Linear

    combination of the discriminating (independent)

    variable is called the discriminant function. For

    example,

  • 7/29/2019 Discriminant Ana

    4/8

    L = b1x1 + b2x2 + ... + bnxn + c

    L= discriminant function

    b1= discriminant coefficients

    X= independents variables

    C = constants

    Number of discriminant functions: For the

    two groups, there is one discriminant analysis

    function. For multivariate discriminant analysis

    there will be g-1 discriminant function.

    The Eigenvalues: This is also called

    characteristic root, which tells us the variance

    explained by each discriminant function.

    The discriminant score: By applying

    discriminant formulas, the value that comes is

  • 7/29/2019 Discriminant Ana

    5/8

    called the discriminant score. This discriminant

    score helps us to classify the group category.

    Cutoff: This is the value which divides the

    group value into two parts. When the value of

    the discriminant score is at the negative side of

    the cutoff point, then the group will fall into a

    lower category, and when it is at the positive

    side, the group will be at a higher category.

    Unstandardized discriminant coefficients:

    Unstandardized discriminant coefficients are

    simply like the regression beta, which is used to

    predict the discriminate score. Standardized

    discriminant coefficients are used to compare

    the relative importance of the independent

    variables.

    Tests of significance:

    Wilks' lambda: The overall model significance

    of the discriminant function is tested by the

  • 7/29/2019 Discriminant Ana

    6/8

    walks lambda test. If the overall model is

    significant, than the F test is used to test whether

    or not the individual variable means differ from

    the group mean function.

    Assumptions in Discriminant analysis:

    1. Independence: Each case should be

    independent of each other. Correlated data

    cannot be used in discriminant analysis.

    2. Adequate sample size: There must be at least

    two cases for each category of the dependent

    variable. However, it is recommended that there

    should be at least four or five times as many

    cases as independent variables.

    http://www.statisticssolutions.com/sample-size-calculation-justificationhttp://www.statisticssolutions.com/sample-size-calculation-justification
  • 7/29/2019 Discriminant Ana

    7/8

    3. Interval data: In discriminant analysis, there

    should be an interval data for independent

    variable.

    4. Variance: No independents have a zero

    standard deviation in one or more of the groups

    formed by the dependent.

    5. Random error: Error terms are assumed to be

    randomly distributed.

    6. Homogeneity of variances: Variance with

    each group of independent variables should be

    equal.

    7. Absence of perfect multicollinearity: There

    should be no perfect multicollinearity between

    the independent variables.

    http://www.statisticssolutions.com/multicollinearityhttp://www.statisticssolutions.com/multicollinearity
  • 7/29/2019 Discriminant Ana

    8/8

    8. Assumes linearity: The discriminant functions

    should be linear and related to each other.

    9. Normally distributed: The predictor variable

    should be normally distributed.