fall 2013 lecture 5: chapter 5 statistical analysis of data …yes the “s” word

42
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Upload: morris-webb

Post on 26-Dec-2015

221 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Fall 2013 Lecture 5: Chapter 5

Statistical Analysis of Data

…yes the “S” word

Page 2: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

What is a Statistic???? What is a Statistic????

Population

Sample

SampleSample

Sample

Parameter: value that describes a population

Statistic: a value that describes a sample PSYCH always using samples!!!

Page 3: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Descriptive & Inferential Descriptive & Inferential StatisticsStatisticsDescriptive StatisticsDescriptive Statistics

• OrganizeOrganize

• SummarizeSummarize

• SimplifySimplify

• Presentation of dataPresentation of data

Inferential StatisticsInferential Statistics •Generalize from Generalize from samples to popssamples to pops•Hypothesis testingHypothesis testing•Relationships Relationships among variables among variables

Describing data Make predictionsMake predictions

Page 4: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Descriptive Descriptive StatisticsStatistics

3 Types

1. Frequency Distributions 3. Summary Stats

2. Graphical Representations

# of Ss that fallin a particular category

Describe data in just one number

Graphs & Tables

Page 5: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

1. Frequency Distributions

# of Ss that fallin a particular category

How many males and how many females are in our class?

Frequency(%)

? ?

?/tot x 100 ?/tot x 100

-----% ------%

total

scale of measurement?

nominal

Page 6: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

1. Frequency Distributions

# of Ss that fallin a particular category

Categorize on the basis of more that one variable at same timeCROSS-TABULATION

Democrats

Republican

total

24 1 25

19 6 25

Total 43 7 50

Page 7: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

1. Frequency Distributions

How many brothers & sisters do you have?

# of bros & sis Frequency

7 ?6 ?5 ?4 ?3 ?2 ?1 ?0 ?

Page 8: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

2. Graphical Representations

Graphs & Tables

Bar graph (ratio data - quantitative)

Page 9: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Histogram of the categorical variables

2. Graphical Representations

Page 10: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Polygon - Line Graph

 

2. Graphical Representations

Page 11: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

2. Graphical Representations

Graphs & TablesGraphs & Tables

How many brothers & sisters do you have? Lets plot class data: HISTOGRAM

# of bros & sis Frequency

7 ?6 ?5 ?4 ?3 ?2 ?1 ?0 ?

Page 12: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Altman, D. G et al. BMJ 1995;310:298

Central Limit Theorem: the larger the sample size, the closer a distribution will approximate the normal distribution or

A distribution of scores taken at random from any distribution will tend to form a normal curve

jagged

smooth

Page 13: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

2.5% 2.5%

5% region of rejection of null hypothesisNon directional

Two Tail

body temperature, shoe sizes, diameters of trees,Wt, height etc…

IQ

68%

95%

13.5%13.5%

Normal Distribution: half the scores above mean…half below(symmetrical)

Page 14: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word
Page 15: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Summary Statisticsdescribe data in just 2 numbers

Measures of central tendency• typical average score

Measures of variability• typical average variation

Page 16: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Measures of Central Tendency

• Quantitative data:– Mode – the most frequently occurring

observation– Median – the middle value in the data (50

50 ) – Mean – arithmetic average

• Qualitative data:– Mode – always appropriate– Mean – never appropriate

Page 17: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Mean

• The most common and most useful average

• Mean = sum of all observations number of all observations

• Observations can be added in any order.

• Sample vs population• Sample mean = X

• Population mean =• Summation sign = • Sample size = n• Population size = N

Notation

Page 18: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Special Property of the MeanBalance Point

• The sum of all observations expressed as positive and negative deviations from the mean always equals zero!!!!– The mean is the single point of equilibrium

(balance) in a data set

• The mean is affected by all values in the data set– If you change a single value, the mean

changes.

Page 19: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

The mean is the single point of equilibrium (balance) in a data set

SEE FOR YOURSELF!!! Lets do the Math

Page 20: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Summary Statisticsdescribe data in just 2 numbers

Measures of central tendency• typical average score

Measures of variability• typical average variation 1. range: distance from the

lowest to the highest (use 2 data points)

2. Variance: (use all data points)3. Standard Deviation4. Standard Error of the Mean

Page 21: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Descriptive & Inferential Descriptive & Inferential StatisticsStatisticsDescriptive StatisticsDescriptive Statistics

• OrganizeOrganize

• SummarizeSummarize

• SimplifySimplify

• Presentation of dataPresentation of data

Inferential StatisticsInferential Statistics •Generalize from Generalize from samples to popssamples to pops•Hypothesis testingHypothesis testing•Relationships Relationships among variables among variables

Describing data Make predictionsMake predictions

Page 22: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Measures of Variability

2. Variance: (use all data points):

average of the distance that each score is from the mean (Squared deviation from the mean)

otation for variances2

3. Standard Deviation= SD= s2

4. Standard Error of the mean = SEM = SD/ n

Page 23: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Inferential Statistics

Population

Sample

Draw inferences about the larger group

Sample

Sample

Sample

Page 24: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Sampling Error: variability among samples due to chance vs population

Or true differences? Are just due to

sampling error?Probability…..

Error…misleading…not a mistake

Page 25: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Probability•Numerical indication of how likely it is that a given event will occur (General Defi nition)“hum…what’s the probability it will rain?”

•Statistical probability:

the odds that what we observed in the sample did not occur because of error (random and/ or systematic)“hum…what’s the probability that my results are not just due to chance”

•I n other words, the probability associated with a statistic is the level of confi dence we have that the sample group that we measured actually represents the total population

Page 26: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Chain of Reasoning forInferential Statistics

PopulationSample

Inference

Selection

Measure

Probability

data

Are our inferences valid?…Best we can do is to calculate probability about inferences

Page 27: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Inferential Statistics: uses sample data to evaluate the credibility of a hypothesis about a population

NULL Hypothesis:

NULL (nullus - latin): “not any” no differences between means

H0 : 1 = 2

“H- Naught”Always testing the null hypothesis

Page 28: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Inferential statistics: uses sample data to evaluate the credibility of a hypothesis about a population

Hypothesis: Scientific or alternativehypothesis

Predicts that there are differences between the groups

H1 : 1 = 2

Page 29: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

HypothesisA statement about what findings are expected

null hypothesis

"the two groups will not differ“

alternative hypothesis

"group A will do better than group B""group A and B will not perform the same"

Page 30: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Inferential Statistics

When making comparisons btw 2 sample means there are 2

possibilities

Null hypothesis is true

Null hypothesis is false

Not reject the Null HypothesisReject the Null

hypothesis

Page 31: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Possible Outcomes inHypothesis Testing (Decision)

Null is True Null is False

Accept

Reject

CorrectDecision

CorrectDecisionError

Error

Type I Error

Type II Error

Type I Error: Rejecting a True HypothesisType II Error: Accepting a False Hypothesis

Page 32: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Hypothesis Testing - Decision

Decision Right or Wrong?

But we can know the probability of being right or wrong

Can specify and control the probability of making TYPE I of TYPE II Error

Try to keep it small…

Page 33: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

ALPHA

the probability of making a type I error depends on the criterion you use to accept or reject the null hypothesis = significance level (smaller you make alpha, the less likely you are to commit error) 0.05 (5 chances in 100 that the difference observed was really due to sampling error – 5% of the time a type I error will occur)

Possible Outcomes inHypothesis Testing

Null is True Null is False

Accept

Reject

CorrectDecision

CorrectDecisionError

Error

Type I Error

Type II Error

Alpha (

Difference observed is really just sampling error

The prob. of type one error

Page 34: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

When we do statistical analysis… if alpha (p value- significance level) greater than 0.05

WE ACCEPT THE NULL HYPOTHESIS

is equal to or less that 0.05 we

REJECT THE NULL (difference btw means)

Page 35: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

2.5% 2.5%

5% region of rejection of null hypothesisNon directional

Two Tail

Page 36: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

5%

5% region of rejection of null hypothesisDirectional

One Tail

Page 37: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

BETAProbability of making type II error occurs when we fail to reject the Null when we should have

Possible Outcomes inHypothesis Testing

Null is True Null is False

Accept

Reject

CorrectDecision

CorrectDecisionError

Error

Type I Error

Type II Error

Beta (

Difference observed is realFailed to reject the Null

POWER: ability to reduce type II error

Page 38: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

POWER: ability to reduce type II error(1-Beta) – Power Analysis

The power to find an effect if an effect is present

1. Increase our n

2. Decrease variability

3. More precise measurements

Effect Size: measure of the size of the difference between means attributed to the treatment

Page 39: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Inferential statistics

Significance testing:

Practical vs statistical significance

Page 40: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Inferential statisticsUsed for Testing for Mean Differences

T-test: when experiments include only 2 groups

a. Independent b. Correlated i. Within-subjects

ii. Matched

Based on the t statistic (critical values) based on

df & alpha level

Page 41: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Inferential statisticsUsed for Testing for Mean Differences

Analysis of Variance (ANOVA): used when comparing more than 2 groups

1. Between Subjects 2. Within Subjects – repeated measures

Based on the f statistic (critical values) based on

df & alpha level

More than one IV = factorial (iv=factors)Only one IV=one-way anova

Page 42: Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word

Inferential statistics

Meta-Analysis:

Allows for statistical averaging of results

From independent studies of the samephenomenon