biostatistics iii
TRANSCRIPT
Descriptive Methods for Descriptive Methods for Categorical DataCategorical Data
Mahmoud Alhussmi, D.Sc., PhD
April 13, 2023 2
TopicsTopics
• ProportionsProportions
• RatesRates– Change ratesChange rates– Measures of morbidity and mortalityMeasures of morbidity and mortality– Standardization of ratesStandardization of rates
• RatiosRatios– Relative riskRelative risk– Odds RatioOdds Ratio
April 13, 2023 3
Categorical DataCategorical Data
Data that can be classified as belonging to a Data that can be classified as belonging to a distinct number of categories.distinct number of categories.
– BinaryBinary – data can be classified into one of 2 – data can be classified into one of 2 possible categories (yes/no, positive/negative)possible categories (yes/no, positive/negative)
– OrdinalOrdinal – data that can be classified into – data that can be classified into categories that have a natural ordering (i.e.. categories that have a natural ordering (i.e.. Levels of pain: none, moderate, intense)Levels of pain: none, moderate, intense)
– NominalNominal- data can be classified into >2 - data can be classified into >2 categories (i.e.. Race: Arab, African, and categories (i.e.. Race: Arab, African, and other) other)
April 13, 2023 4
Data examplesData examples
• What type of data would result from these What type of data would result from these questions?questions?– How old are you? ____________How old are you? ____________– How old are you?How old are you?
• A. under 18A. under 18• B. 19-35B. 19-35• C. 36-49C. 36-49• D. over 50D. over 50
April 13, 2023 5
ProportionsProportions
• Numbers by themselves may be misleading: they are on Numbers by themselves may be misleading: they are on different scales and need to be reduced to a standard different scales and need to be reduced to a standard basis in order to compare them.basis in order to compare them.
• We most frequently use proportions: that is, the fraction We most frequently use proportions: that is, the fraction of items that satisfy some property, such as having a of items that satisfy some property, such as having a disease or being exposed to a dangerous chemical.disease or being exposed to a dangerous chemical.
• "Proportions" are the same thing as fractions or "Proportions" are the same thing as fractions or percentages. In every case you need to know what you percentages. In every case you need to know what you are taking a proportion of: that is, what is the are taking a proportion of: that is, what is the DENOMINATOR in the proportion.DENOMINATOR in the proportion.
n
xp )100()100(
n
xpercent
April 13, 2023 6
Proportions and ProbabilitiesProportions and Probabilities
• We often interpret proportions as We often interpret proportions as probabilities. If the probabilities. If the proportionproportion with a with a disease is 1/10 then we also say that thedisease is 1/10 then we also say that the probabilityprobability of getting the disease is 1/10, or of getting the disease is 1/10, or 1 in 10. 1 in 10.
• Proportions are usually quoted for Proportions are usually quoted for samplessamples - - probabilities are almost always quoted for probabilities are almost always quoted for populations.populations.
April 13, 2023 7
Workers ExampleWorkers Example
• For the cases:For the cases:– Proportion of exposure=84/397=0.212 or 21.2%Proportion of exposure=84/397=0.212 or 21.2%
• For the controls: For the controls: – Proportion of exposure=45/315=0.143 or 14.3%Proportion of exposure=45/315=0.143 or 14.3%
SmokingSmokingWorkersWorkersCasesCasesControlsControls
NoNoYesYes11113535
NoNo5050203203
YesYesYesYes 84844545
NoNo313313270270
April 13, 2023 8
PrevalencePrevalence
Disease Prevalence = the proportion of people with a given Disease Prevalence = the proportion of people with a given disease at a given time.disease at a given time.
disease prevalence =disease prevalence =
Prevalence is usually quoted as per 100,000 Prevalence is usually quoted as per 100,000 people so the above proportion should be people so the above proportion should be multiplied by 100,000.multiplied by 100,000.
Number of diseased persons at a given time
Total number of persons examined at that time
InterpretationInterpretation
April 13, 2023 9
Total
newoldCasesevalence
)(Pr
Problem of exposure, consequentlyNot comparable measurementOld = duration of the diseaseNew = speed of the disease
At time t
April 13, 2023 10
Screening TestsScreening Tests
• Through screening tests people are Through screening tests people are classified as healthy or as falling into one classified as healthy or as falling into one or more disease categories.or more disease categories.
• These tests are not 100% accurate and These tests are not 100% accurate and therefore misclassification is unavoidable.therefore misclassification is unavoidable.
• There are 2 proportions that are used to There are 2 proportions that are used to evaluate these types of diagnostic evaluate these types of diagnostic procedures. procedures.
April 13, 2023 11
Screening TestsScreening Tests
General Population
Diseased
Positive Test Results
April 13, 2023 12
Sensitivity and SpecificitySensitivity and Specificity
• Sensitivity and specificity are terms used to Sensitivity and specificity are terms used to describe the effectiveness of screening tests. They describe the effectiveness of screening tests. They describe how good a test is in two ways - finding describe how good a test is in two ways - finding false positives and finding false negativesfalse positives and finding false negatives
• SensitivitySensitivity is the Proportion of diseased who is the Proportion of diseased who screen positive for the disease screen positive for the disease
• SpecificitySpecificity is the Proportion of healthy who screen is the Proportion of healthy who screen healthyhealthy
Sensitivity and SpecificitySensitivity and Specificity
Condition Present Condition Absent
……………………………………………………………………………………………
Test Positive True Positive (TP) False Positive (FP)
Test Negative False Negative (FN) True Negative (TN)
…………………………………………………………………………………………… Test Sensitivity (Sn) is defined as the probability that the test is positive
when given to a group of patients who have the disease. Sn= (TP/(TP+FN))x100. It can be viewed as, 1-the false negative rate.
The Specificity (Sp) of a screening test is defined as the probability that the test will be negative among patients who do not have the disease. Sp = (TN/(TN+FP))X100. It can be understood as 1-the false positive rate.
Positive & Negative Predictive Values
• The positive predictive value (PPV) of a test is the probability that a patient who tested positive for the disease actually has the disease. PPV = (TP/(TP+FP))X 100.
• The negative predictive value (NPV) of a test is the probability that a patent who tested negative for a disease will not have the disease. NPV = (TN/(TN+FN))X100.
The Efficiency
• The efficiency (EFF) of a test is the probability that the test result and the diagnosis agree.
• It is calculated as:
EFF = ((TP+TN)/(TP+TN+FP+FN)) X 100
April 13, 2023 16
ExampleExample• A cytological test was undertaken to screen A cytological test was undertaken to screen
women for cervical cancer.women for cervical cancer.
• Sensitivity =?Sensitivity =?
• Specificity = Specificity = ??
Test PositiveTest PositiveTest NegativeTest NegativeTotalTotal
Actually PositiveActually Positive154154) ) TPTP((225225) ) FPFP((379379
Actually NegativeActually Negative362362) ) FNFN((
516516) ) TP+FNTP+FN((
23,36223,362) ) TNTN((
2358723587))FP+TNFP+TN((
23,72423,724
April 13, 2023 17
Displaying ProportionsDisplaying Proportions
• Types of charts that can be used:Types of charts that can be used:– HistogramHistogram– Pie ChartPie Chart– Line GraphLine Graph
• BEWARE of the type of display you use – BEWARE of the type of display you use – some charts are better at displaying some charts are better at displaying certain types of data than others.certain types of data than others.
April 13, 2023 18
Displaying ProportionsDisplaying Proportions
Percent of Children Living in Crack/cocaine Households
0%
10%
20%
30%
40%
50%
60%
70%
80%
Black White AmericanIndian
Other
Percent of Children Living inCrack/cocaine Households
AfricanAfrican70%70%
ArabArab18%18%
MixedMixed8%8%
OtherOther4%4%
Distribution of Race in Sudan
0
10
20
30
40
50
60
70
80
African Arab foreigners others
April 13, 2023 19
Displaying ProportionsDisplaying ProportionsDistribution of Race in Sudan
0 20 40 60 80
African
Arab
foreigners
others
Distribution of Race in Sudan
African
Arab
foreigners
others
April 13, 2023 20
Displaying ProportionsDisplaying ProportionsCause of DeathCause of Death##of Deathsof DeathsProportion of DeathsProportion of Deaths
Heart DiseaseHeart Disease12,27812,2780.380.38
CancerCancer6,4486,4480.200.20
Cerebrovascular DiseaseCerebrovascular Disease3,9583,9580.120.12
AccidentsAccidents1,8141,8140.060.06
OtherOther8,0888,0880.250.25
Causes of Death
Heart Disease
Cancer
Cerebrovascular disease
Accidents
Other
April 13, 2023 21
RatesRates
• The term The term rate rate is often used interchangeably is often used interchangeably with the term proportion although with the term proportion although sometimes it refers to a quantity of a very sometimes it refers to a quantity of a very different nature.different nature.
• Types of rates we will cover:Types of rates we will cover:– Incidence rateIncidence rate– Change ratesChange rates– Death rateDeath rate– Follow-up death rate Follow-up death rate
April 13, 2023 22
Calculation of Incidence Calculation of Incidence RateRate
April 13, 2023 23
DefinitionDefinition
The incidence rate is the production of new cases in a population. It measure the number of cases per unit of time, i.e. It measure the average of the speed of the apparition of new cases in given population.
There are three measures of incidence:
1. Incidence rate (=news cases/time of participation)
2. Instantaneous incidence
3. Accumulate incidence
April 13, 2023 24
In incidence study we are interested on the occurrence of event (disease) over the time,. Such study deals with follow up of each subject and the moment (ti) that each event (disease) occurs.The problem of incidence data is that the existence of observation incomplete, subjects are still not affected at the moment of analysis.
April 13, 2023 25
To study incidence rate we should know for each subject the following information:
Date of origin
Is the date that an individual enter in the study.
Date of last information
Is the recent date that we receive information about the status of the subject. If the subject is affected, so the date of last information is date of getting the disease.
April 13, 2023 26
Duration of follow up
Is the delay between the date of origin and the date of last information
Date of point
Is the date that we decide to stop collecting information about subjects.
Loss of view
A subject that we do not know his status at the date of point is called loss of view.
April 13, 2023 27
Time of participation ti
Time of participation for loss of view
t1, t2, t3, ….,
time 0
Date of last information
Date of point
Time of participation for affected subject
?
Time of participation
Time not consider
Time of participation for loss of view
April 13, 2023 28
ExampleThe following example follow 30 individuals between 1982 to 1988 for the disease D.
# #of individualof individualDate of originDate of originDate of diseaseDate of diseaseDate of last Date of last informationinformation
Time of Time of participationparticipation
1111-8211-821-841-8412-8812-881414
2211-8211-8210-8710-8712-8812-885959
336-826-822-862-866-876-874444
4411-8211-8211-8611-8612-8812-883636
5511-8211-823-853-8512-8812-882828
6611-8211-821-881-8812-8812-886262
776-836-833-843-846-856-8599
881-831-8312-8612-8612-8712-874747
991-841-841-871-8712-8812-883636
101011-8211-8210-8510-8512-8812-883535
111112-8212-828-868-8612-8812-884444
12121-831-836-836-837-857-8555
13136-836-8311-8711-877-887-885353
141411-8211-828-848-842-872-872121
April 13, 2023 29
15151-831-833-863-8612-8812-883838
16166-836-835-875-8712-8812-884747
17175-835-831-841-8412-8812-8888
181811-8311-836-886-8812-8812-885555
19196-836-835-865-8612-8712-873535
20203-833-832-882-8812-8812-885959
21214-834-8312-8812-886868
222211-8211-8211-8511-853636
23231-831-8312-8812-887171
24246-836-8312-8812-886666
252511-8211-8211-8611-864848
26266-826-8212-8812-887878
272712-8312-8312-8812-886060
282811-8211-8212-8812-887373
29296-846-8412-8812-885454
30301-831-831-841-841212
Time of participation = 3001
Number of cases = 20
Rate of incidence = 20/3001 = 0.015
Or 1.5 case per 100 individuals.
April 13, 2023 30
Sometimes it is difficult to know the exact date of origin of the case or even the duration of follow up, and this is always take place when the population under study is open . In this case what we know very well :1.Number of new cases, but not the time exact of participation = m2.Total number of population from the begin to end of the study = NThe calculation using the same method as before will be impossible, because many information was missed. The assumption over which we are going to build our hypothesis is that all the cases enter or quiet the study are distributed uniformly during the period of follow up, i.e. the date of follow up for each of these subject will be the half average of the period of the follow up of the study.The calculation will be as follow: Item Number time of participationNot diseases N N*ΔtEnter along the study Ne Ne*Δt/2Quiet along the study Ns Ns*Δt/2Disease m m*Δt/2
2)2(t
mNNN
mIR
se
April 13, 2023 31
Change RatesChange Rates
• These types of rates are used to describe These types of rates are used to describe changes after a certain period of time.changes after a certain period of time.
• Example: A total of 35,238 new AIDS cases Example: A total of 35,238 new AIDS cases were reported in 1989 compared to 32,196 were reported in 1989 compared to 32,196 reported during 1988.reported during 1988.– The change rate for new AIDS cases:The change rate for new AIDS cases:
100 Xvalue old
value old-valuenew (%) rate change
%4.910019632
1963223835
,
,,
April 13, 2023 32
Measures of Morbidity and MortalityMeasures of Morbidity and Mortality
years-person total
deaths ofnumber ratedeath up-follow
year) a (ie. timeof period defined for the
followed wererisk whoat people of #
year) a (ie. timeof period defined aover
disease thedeveloped that people of #
rate incidence
yearon that population the
yearcalendar ain deaths #ratedeath crude
April 13, 2023 33
Crude Death RateCrude Death Rate
• Example: The 1980 population in Example: The 1980 population in California was 23,000,000 (as estimated California was 23,000,000 (as estimated on 1 July) and there were 190,237 deaths on 1 July) and there were 190,237 deaths during that year.during that year.
– Crude death rate Crude death rate =(190,237/23,000,000)*1,000=(190,237/23,000,000)*1,000
= 8.3 deaths per 1,000 per year= 8.3 deaths per 1,000 per year
April 13, 2023 34
Displaying Proportions over TimeDisplaying Proportions over Time
Female Death Rates (1984-1987)
780
785
790
795
800
805
810
815
1984 1985 1986 1987
Death Rate per 100,000
Death Rate per Death Rate per 100,000100,000
19841984793793
19851985807807
19861986809809
19871987813813
April 13, 2023 35
Standardization of RatesStandardization of Rates
• Crude rates are used to describe a population but Crude rates are used to describe a population but comparisons of crude rates are often invalid because comparisons of crude rates are often invalid because the populations may be different w.r.t important the populations may be different w.r.t important characteristics (ie. age, gender, race).characteristics (ie. age, gender, race).
• To account for these differences adjusted rates are To account for these differences adjusted rates are used in the comparison.used in the comparison.
X100,000population standard in #
expected deaths #rate adjusted
April 13, 2023 36
Group AGroup AGroup BGroup B
Age groupAge groupno. no.
deathsdeathspersonspersonsdeaths/deaths/100000100000
no. no. deathsdeathspersonspersons
deaths/deaths/100000100000
0-40-416216240,00040,000405.0405.02,0492,049546,000546,000375.3375.3
5-195-19107107128,000128,00083.683.61,1951,1951,982,0001,982,00060.360.3
20-4420-44449449172,000172,000261.0261.05,0975,0972,676,0002,676,000190.5190.5
45-6445-6445145158,00058,000777.6777.619,90419,9041,807,0001,807,0001101.51101.5
6565++4444449,0009,0004933.34933.363,50563,5051,444,0001,444,0004397.94397.9
TotalsTotals16131613407000407000396.3396.39175091750845500084550001085.21085.2
April 13, 2023 37
Using the X population for 1970 as a standard we getUsing the X population for 1970 as a standard we get::
Group AGroup AGroup BGroup B
Age Age groupgroupStandardStandard
Age spec. Age spec. raterate
Exp Exp deathsdeaths
Age spec. Age spec. raterate
Exp Exp deathsdeaths
0-40-484,41684,416405.0405.0342342375.3375.3317317
5-195-19294,353294,35383.683.624624660.360.3177177
20-4420-44316,744316,744261.0261.0827827190.5190.5603603
45-6445-64205,745205,745777.6777.6160016001101.51101.522662266
6565++98,74298,7424933.34933.3487148714397.94397.943434343
TotalsTotals1,000,0001,000,0007886788677067706
Expected deaths for Group A for age group 65+ = Expected deaths for Group A for age group 65+ = (98,742)(4933.3)/100,000 = 4871(98,742)(4933.3)/100,000 = 4871
Age adjusted rate for Age adjusted rate for Group AGroup A = = 788.6788.6)=)=7886/1,000,0007886/1,000,000*(*(100,000100,000
Age adjusted rate for Age adjusted rate for Group BGroup B = = 770.6770.6)=)=7706/1,000,0007706/1,000,000*(*(100,000100,000
April 13, 2023 38
Relative RiskRelative Risk
• Relative risks Relative risks are the ratio of risks for two different are the ratio of risks for two different populations (ratio=a/b). populations (ratio=a/b).
• If the risk (or proportion) of having the outcome is 1/10 If the risk (or proportion) of having the outcome is 1/10 in one population and 2/10 in a second population, then in one population and 2/10 in a second population, then the relative risk is: (2/10) / (1/10) = 2.0the relative risk is: (2/10) / (1/10) = 2.0
• A relative risk >1 indicates increased risk for the group A relative risk >1 indicates increased risk for the group in the numerator and a relative risk <1 indicates in the numerator and a relative risk <1 indicates decreased risk for the group in the numerator.decreased risk for the group in the numerator.
2 group in incidence disease
1 group in incidence diseaseRisk Relative
April 13, 2023 39
Odd’s Ratio and Relative RiskOdd’s Ratio and Relative Risk
• Odds ratiosOdds ratios are better to use in case- are better to use in case-control studies (cases and controls are control studies (cases and controls are selected and level of exposure is selected and level of exposure is determined retrospectively)determined retrospectively)
• Relative risksRelative risks are better for cohort studies are better for cohort studies (exposed and unexposed subjects are (exposed and unexposed subjects are chosen and are followed to determine chosen and are followed to determine disease status - prospective)disease status - prospective)
April 13, 2023 40
Odd’s Ratio and Relative RiskOdd’s Ratio and Relative Risk
• When we have a two-way classification of When we have a two-way classification of exposure and disease we can approximate the exposure and disease we can approximate the relative risk by the odds ratiorelative risk by the odds ratio
• Relative Risk=Relative Risk=A/(A+B) divided by C/(C+D)A/(A+B) divided by C/(C+D)• Odd’s Ratio= Odd’s Ratio= A/B divided by C/D = AD/BCA/B divided by C/D = AD/BC
ExposureExposure
DiseaseDisease
YesYesNoNo
YesYesAABBA+BA+B
NoNoCCDDC+DC+D
April 13, 2023 41
Relationship Between the Two Relationship Between the Two MeasuresMeasures
DCBA
C*B
D*ARR
:by edapproximat becan risk relative theTherefore
B BA
D DC
: thennegative, disease as classified those tocompared small is
positive disease as classified subjects ofnumber theif
B)C(A
D)A(C
DC
C
BA
ARR
April 13, 2023 42
Case Control Study ExampleCase Control Study Example• Disease: Pancreatic CancerDisease: Pancreatic Cancer
• Exposure: Cigarette SmokingExposure: Cigarette Smoking
ExposureExposure
DiseaseDisease
YesYesNoNo
YesYes38388181119119
NoNo2256565858
April 13, 2023 43
Example ContinuedExample Continued
• Relative risk for exposed vs. non-exposedRelative risk for exposed vs. non-exposed– Numerator- proportion of exposed people that Numerator- proportion of exposed people that
have the diseasehave the disease– Denominator-proportion of non-exposed that Denominator-proportion of non-exposed that
have the diseasehave the disease
– Relative Risk= Relative Risk= (38/119)/(2/58)=9.26(38/119)/(2/58)=9.26
April 13, 2023 44
Example ContinuedExample Continued
• Odd’s Ratio for exposed vs. non-exposedOdd’s Ratio for exposed vs. non-exposed– Numerator- ratio of diseased vs. non-Numerator- ratio of diseased vs. non-
diseased in the exposed groupdiseased in the exposed group– Denominator- ratio of diseased vs. non-Denominator- ratio of diseased vs. non-
diseased in the non-exposed groupdiseased in the non-exposed group
– Odd’s Ratio= Odd’s Ratio= (38/81)/(2/56)=(38*56)/(2*81) (38/81)/(2/56)=(38*56)/(2*81) =13.14=13.14
April 13, 2023 45
Relative RiskRelative Risk
• Relative risk – the chance that a member of a group Relative risk – the chance that a member of a group receiving some exposure will develop a disease relative to receiving some exposure will develop a disease relative to the chance that a member of an unexposed group will the chance that a member of an unexposed group will develop the same disease.develop the same disease.
• Recall: a RR of 1.0 indicates that the probabilities of Recall: a RR of 1.0 indicates that the probabilities of disease in the exposed and unexposed groups are disease in the exposed and unexposed groups are identical – an association between exposure and disease identical – an association between exposure and disease does not exist.does not exist.
unexposed) |P(disease
exposed)|P(diseaseRR
April 13, 2023 46
Relative RiskRelative Risk
• When we have a two-way classification of When we have a two-way classification of exposure and disease we can calculate exposure and disease we can calculate the relative risk the relative risk
ExposureExposure
DiseaseDisease
YesYesNoNo
YesYesAABBA+BA+B
NoNoCCDDC+DC+D
April 13, 2023 47
Case Control Study ExampleCase Control Study Example• Disease: Pancreatic CancerDisease: Pancreatic Cancer
• Exposure: Cigarette SmokingExposure: Cigarette Smoking
ExposureExposure
DiseaseDisease
YesYesNoNo
YesYes38388181119119
NoNo2256565858
Data Interpretation • Consideration:
1. Accuracy1. critical view of the data
2. investigating evidence of the results
3. consider other studies’ results
4. peripheral data analysis
5. conduct power analysis: type I & type II
Correct Type-II
Type -ICorrect
True
False
True False
Types of Errors
If You……When the Null Hypothesis is…
Then You Have…….
Reject the null hypothesis
True (there really
are no difference) Made a Type I Error
Reject the null hypothesis
False (there really
are difference) ☻
Accept the null hypothesis
False (there really are difference)
Made Type II Error
Accept the null hypothesis
True (there really are no difference)
☻
• alpha : the level of significance used for establishing type-I error
• β : the probability of type-II error
• 1 – β : is the probability of obtaining
significance results ( power)
• Effect size: how much we can say that the intervention made a significance difference
2. Meaning of the results - translation of the results and make it
understandable3. Importance: - translation of the significant findings into
practical findings 4. Generalizability: - how can we make the findings useful for all
the population 5. Implication: - what have we learned related to what has
been used during study
POWER--Uses and Misuses
• Sources– Cohen Statistical Power Analysis for the
Behavioral Sciences (gold standard for power)– Kraemer & Thieman How Many Subjects?
(also a good review)
Needed Parameters
• Alpha--chance of a Type I error
• Beta--chance of a Type II error
• Power = 1 - beta
• Effect size--difference between groups or amount of variance explained or how much relationship there is between the DV and the IVs
Remember this in English?
• Type I error is when you say there is a difference or relationship and there is not
• Type II error is when you say there is no difference or relationship and there really is
What Affects Power?
• Size of the difference in means or amount of variance explained (ES)
• alpha
• Unexplained variance
• N
Which is more important?
• Type I error more important if possibility of harm or lethal effect
• Type II error more important in relatively unexplored areas of research
• In some studies, Type I and Type II errors may be equally important
How to Increase Power1. Increase the n2. Decrease the unexplained variance--control by design or statistics
(e.g. ANCOVA)3. Increase alpha (controversial)4. Use a one tailed test (directional hypothesis)--puts the zone of
rejection all in one tail; same effect as increasing alpha5. Use parametric statistics as long as you meet the assumptions. If
not, parametric statistics are LESS powerful6. Decrease measurement error (decrease unexplained variance)--use
more reliable instruments, standardize measurement protocol, frequent calibration of physiologic instruments, improve inter-rater reliability
What is good power?
By tradition, “good” power is 80%
The correct answer is it depends on the nature of the phenomenon and which kind of error is most important in your study. This is a theoretical argument that you have to make.
Using convention (alpha = .05 and power = .80, beta = .20) you are saying that Type I error is _________ as serious as a Type II error
Effect Size
How large an effect do I expect exists in the population if the null is false?
OR
How much of a difference do I want to be able to detect?
The larger the effect, the fewer the cases needed to see it. (The difference is so big you can trip on it.)
The World According to PowerKraemer & Thiemann
• The more stringent the significance level, the greater the necessary sample size. More subjects are needed for a 1% level than a 5% level
• Two tailed tests require larger sample sizes than one tailed tests. Assessing two directions at the same time requires a greater investment.
• The smaller the effect size, the larger the necessary sample size. Subtle effects require greater efforts.
• The larger the power required, the larger the necessary sample size. Greater protection from failure requires greater effort.
• The smaller the sample size, the smaller the power, ie the greater the chance of failure
The World According to PowerKraemer & Thiemann
• If one proposed to go with a sample size of 20 or fewer, you have to be willing to have a high risk of failure or a huge effect size
• To achieve 99% power for a effect size of .01, you need > 150,000 subjects
Test Yourself
Keeping the other parameters the same:
• As ES decreases, needed n ____
• As alpha decreases, needed n ____
• Higher power requires _____ n
Power for each test
• You do a power analysis for each statistic you are going to use.
• Choose the sample size based on the highest number of subjects from the power analysis.
• Use the most conservative power analysis--guarantees you the most subjects
What about multiple time points?
• More time points requires fewer subjects since more is known about the subjects from prior time points as compared to a cross sectional study
• In other words, less variance is unexplained since you have baseline information
• How many fewer? It depends
Power analysis and secondary analysis
If you have a set sample size, your power analysis then works backward. You set the n, alpha and ES and determine the power given the first three parameters.
Determining ES
If you want to determine effect size from a completed study, you have the n, alpha and power and can work backwards to determine the ES.
Especially important in relatively unexplored areas
Power and MR
• ES is the amount of explained variance expected since there may not be group differences, based on past research
• Increasing the number of independent variables _______ sample size needed to achieve adequate power.
Sampling Distribution
• A sample statistic is often unequal to the value of the corresponding population parameter because of sampling error.
• Sampling error reflects the tendency for statistics to fluctuate from one sample to another.
• The amount of sampling error is the difference between the obtained sample value and the population parameter.
• Inferential statistics allow researchers to estimate how close to the population value the calculated statistics is likely to be.
• The concept of sampling, which are actually probability distributions, is central to estimates of sampling error.
Characteristics of Sampling Distribution
• Sampling error= sample mean-population mean.• Every sample size has a different sampling distribution of
the mean.• Sampling distributions are theoretical, because in
practice, no one draws an infinite number of samples from a population.
• Their characteristics can be modeled mathematically and have determined by a formulation known as the central limit theorem.
• This theorem stipulates that the mean of the sampling distribution is identical to the population mean.
• The average sampling error-the mean of the (mean-μ)s-would always equal zero.
Standard Error of the Mean
• The standard deviation of a sampling distribution of the mean has a special name: the standard error of the mean (SEM).
• The smaller the SEM, the more accurate are the sample means as estimates of the population value.
• Estimation
• Hypothesis Testing
Both activities use sample statistics (for example, X[ ) to make inferences about a population parameter (μ).
71
• Why don’t we just use a single number (a point estimate) like, say, X[ to estimate a population parameter, μ?
• The problem with using a single point (or value) is that it will very probably be wrong. In fact, with a continuous random variable, the probability that the variable is equal to a particular value is zero. So, P(X[ =μ) = 0.
• This is why we use an interval estimator. • We can examine the probability that the
interval includes the population parameter.
72
Types of Statistical Inference
• Parameter estimation:– It is used to estimate a population value, such as a
mean, relative risk index or a mean difference between two groups.
– Estimation can take two forms:• Point estimation: involves calculating a single statistic to
estimate the parameter. E.g. mean and median. – Disadvantages: they offer no context for interpreting their
accuracy and a point estimate gives no information regarding the probability that it is correct or close to the population value.
• Interval estimation: is to estimate a range of values that has a high probability of containing the population value .
• How wide should the interval be? That depends upon how much confidence you want in the estimate.
• For instance, say you wanted a confidence interval estimator for the mean income of a college graduate:
• The wider the interval, the greater the confidence you
will have in it as containing the true population parameter μ.
74
You might haveThat the mean income is between
100% confidence
$0 and∞$
95% confidence
$35,000 and $41,000
90% confidence
$36,000 and $40,000
80% confidence
$37,500 and $38,500
……0% confidence$38,000) a point estimate(
Interval Estimation
• For example, it is more likely the population height mean lies between 165-175cm.
• Interval estimation involves constructing a confidence interval (CI) around the point estimate.
• The upper and lower limits of the CI are called confidence limits.
• A CI around a sample mean communicates a range of values for the population value, and the probability of being right. That is, the estimate is made with a certain degree of confidence of capturing the parameter.
Confidence Intervals around a Mean
• 95% CI = (mean + (1.96 x SEM)• This statement indicates that we can be 95% confident that the
population mean lies between the confident limits , and that these limits are equal to 1.96 times the true standard error, above and below the sample mean.
• E.g. if the mean = 61 inches, and SEM = 1, What is 95% CI.– Solution: 95% CI = (61 + (1.96 X 1))
95% CI = (61 + 1.96)95% CI = 59.04 < μ < 62.96
• E.g. if the mean = 61 inches, and SEM = 1, What is 99% CI.– Solution: 99% CI = (61 + (2.58 X 1))
99% CI = (61 + 2.58)99% CI = 58.42 < μ < 63.58
Confidence Intervals and the t distribution
• When sample size is small then we cannot use confidence intervals around the mean, instead, we measure confidence intervals by the t-distribution.
• t-distribution is similar to a normal distribution in a standard form.
• The exact shape of the t-distribution is influenced by the number cases in the sample.
• Statisticians have developed tables for the area under the t-distribution for different sample size and probability levels.
• To use this table, we must enter at the appropriate row based on the number of degrees of freedom.
Confidence Intervals and the t distribution
• 95% CI = (mean + (t x SEM) – Where mean = the sample mean
T = tables t value at 95% CI for df = N-1SEM = the calculated SEM for the sample data
• E.g. SEM = 1, mean = 61, N = 25, df = 25-1, t for the 95% CI with 24 df is 2.06– Solution:
95% CI = (61 + (2.06 X 1))95% CI = (61 + 2.06)95% CI = 58.95 < μ < 63.06
To compute CIs around a mean with SPSS:Analyze------descriptive stat----explore then click on the statistics
pushbutton.
Types of Statistical Inference
• Hypothesis testing:– Hypothesis testing is a second approach to inferential statistics.– Hypothesis testing involves using sampling distributions and the
laws of probability to make an objective decision about whether to accept or reject the null hypothesis.
– The sample may deviate from the defined population’s true nature by certain amount.
– This deviation is called sampling error.– Drawing the wrong conclusion is called an error of inference.– There are two types of errors of inference defined in terms of the
null hypothesis:• Type I error• Type II error
• Testing a Claim: Companies often make claims about products. For example, a frozen yogurt company may claim that its product has no more than 90 calories per cup. This claim is about a parameter – i.e., the population mean number of calories per cup (μ).
• The claim is tested is by taking a sample - say, 100 cups - and determining the sample mean. If the sample mean is 90 calories or less we have no evidence that the company has lied. Even if the sample mean is greater than 90 calories, it is possible the company is still telling the truth (sampling error). However, at some point – perhaps, say, a sample average of 500 calories per cup – it will be clear that the company has not been completely truthful about its product.
80
• A hypothesis is made about the value of a parameter, but the only facts available to estimate the true parameter are those provided by the sample. If the statistic differs (and of course it will) from the hypothesis stated about the parameter, a decision must be made as to whether or not this difference is significant. If it is, the hypothesis is rejected. If not, it cannot be rejected.
• H0: The null hypothesis. This contains the hypothesized parameter value which will be compared with the sample value.
• H1: The alternative hypothesis. This will be “accepted” only if H0 is rejected.
Technically speaking, we never accept H0 What we actually say is that we do not have the evidence to reject it.
81
• Two types of errors may occur: α (alpha) and β (beta). The α error is often referred to as a Type I error and β error as a Type II error.– You are guilty of an alpha error if you reject H0 when it
really is true.
– You commit a beta error if you “accept” H0 when it is false.
82
• This alpha error is related to the (1- α) we just learned about when constructing confidence intervals. We will soon see that an error of .05 in testing a hypothesis (two-tail test) is equivalent to a confidence of 95% in constructing a two-sided interval estimator.
83
-Z/2 Z/2
/2 /2
TRADEOFF!
•There is a tradeoff between the alpha and beta errors. We cannot simply reduce both types of error. As one goes down, the other rises.•As we lower the error, the β error goes up: reducing the error of rejecting H0 (the error of rejection) increases the error of “Accepting” H0 when it is false (the error of acceptance).
•This is similar (in fact exactly the same) to the problem we had earlier with confidence intervals. Ideally, we would love a very narrow interval, with a lot of confidence. But, practically, we can never have both: there is a tradeoff.
84
• Our legal system understands this tradeoff very well. – If we make it extremely difficult to convict criminals
because we do not want to incarcerate any innocent people we will probably have a legal system in which no one gets convicted.
– On the other hand, if we make it very easy to convict, then we will have a legal system in which many innocent people end up behind bars.
– This is why our legal system does not require a guilty verdict to be “beyond a shadow of a doubt” (i.e., complete certainty) but “beyond reasonable doubt.”
85
• Quality Control.– A company purchases chips for its smart phones, in
batches of 50,000. The company is willing to live with a few defects per 50,000 chips. How many defects?
– If the firm randomly samples 100 chips from each batch of 50,000 and rejects the entire shipment if there are ANY defects, it may end up rejecting too many shipments (error of rejection). If the firm is too liberal in what it accepts and assumes everything is “sampling error,” it is likely to make the error of acceptance.
– This is why government and industry generally work with an alpha error of .05
86
1.Formulate H0 and H1. H0 is the null hypothesis, a hypothesis about the value of a parameter, and H1 is an alternative hypothesis.
– e.g., H0: µ=12.7 years; H1: µ≠12.7 years
2.Specify the level of significance (α) to be used. This level of significance tells you the probability of rejecting H0 when it is, in fact, true. (Normally, significance level of 0.05 or 0.01 are used)
3.Select the test statistic: e.g., Z, t, F, etc. So far, we have been using the Z distribution. We will be learning about the t-distribution (used for small samples) later on.
4.Establish the critical value or values of the test statistic needed to reject H0. DRAW A PICTURE!
5.Determine the actual value (computed value) of the test statistic.
6.Make a decision: Reject H0 or Do Not Reject H0.
87
•When we Formulate H0 and H1, we have to decide whether to use a one-tail or two-tail test. •With a “two-tail” hypothesis test, α is split into two and put in both tails. H1 then includes two possibilities: μ = # OR μ ≠ #. This is why the region of rejection is divided into two tails. Note that the region of rejection always corresponds to H1.
• With a “one-tail” hypothesis test, the α is entirely in one of the tails.
Hypothesis Testing 88
•For example, if the company claims that a certain product has exactly 1 mg of aspirin, that would result in a two-tail test. Note words like “exactly” suggest two tail tests. There are problems with too much aspirin and too little aspirin in a drug.•On the other hand, if a firm claims that a box of its raisin bran cereal contains at least 100 raisins, a one-tail test has to be used. If the sample mean is more than 100, everything is ok. The problems arise only if the sample mean is less than 100. The question will be whether we are looking at sampling error or perhaps the company is lying and the true (population) mean is less than 100 raisins.
89
•A company claims that its soda vending machines deliver exactly 8 ounces of soda. Clearly, You do not want the vending machines to deliver too much or too little soda. How would you formulate this?
Answer: H0: µ = 8 ouncesH1: µ ≠ 8 ounces
If you are testing at α=.01, The .01 is split into two: .005 in the left tail and .005 in the right tail The critical values are ±2.575
90
-2.575 2.575
.005 .005
•A company claims that its bolts have a circumference of exactly 12.50 inches. (If the bolts are too wide or narrow, they will not fit properly):
Answer: H0: µ = 12.50 inchesH1: µ ≠ 12.50 inches
•A company claims that a slice of its bread has exactly 2 grams of fiber. Formulate this:
Answer: H0: µ = 2 gramsH1: µ ≠ 2 grams
91
•A company claims that its batteries have an average life of at least 500 hours. How would you formulate this?
Answer: H0: µ 500 hours≧H1: µ < 500 hours
If you are testing at an α = .05, The entire .05 is in the left tail (hint: H1 points to where the rejection region should be.) The critical value is -1.645.
92
A company claims that its overpriced, bottled spring water has no more than 1 mcg of benzene (poison). How would you formulate this:
Answer: H0: µ 1 mcg. benzene≦H1: µ > 1 mcg. benzene
If you are testing at an α = .05, The entire .05 is in the right tail (hint: H1 points to where the rejection region should be.) The critical value is +1.645.
93
.05
1.645
A pharmaceutical company claims that each of its pills contains exactly 20.00 milligrams of Cumidin (a blood thinner). You sample 64 pills and find that the sample mean X[ =20.50 mg and s = .80 mg. Should the company’s claim be rejected? Test at α = 0.05. •Formulate the hypotheses
H0: µ =20.00 mg H1: µ 20.00 mg
•Choose the test statistic and find the critical values; draw region of rejection
Test statistic: Z
At α = 0.05, the critical values are ±1.96.
•Use the data to get the calculated value of the test statistic
Z = = = 5 [ .80/√.64 = .10 This is the standard error of the mean. ]
•Come to a Conclusion: Reject H0 or Do Not Reject H0
The computed Z value of 5 is deep in the region of rejection. Thus, Reject H0 at p < .05
94
• Suppose we took the above data, ignored the hypothesis, and constructed a 95% confidence interval estimator.
20.50 1.96(.10)
95%, CIE: 20.304 mg 20.696 mg• We note that 20.00 mg is not in this interval. • As you can see, hypothesis testing and CIE are virtually the same
exercise; they are merely two sides of the same coin. Both rely on the sample evidence.
• If a claim is made about a parameter, do a hypothesis test. If no claim is made and a company wants to use sample evidence to estimate a parameter (perhaps to determine what claims may be made in the future about a parameter), construct a confidence interval estimator.
95
• A company claims that its LED bulbs will last at least 8,000 hours. You sample 100 bulbs and find that X[ =7,800 hours and s=800 hours. Should the company’s claim be rejected? Test at α = 0.05.
• H0: µ 8,000 hours≧H1: µ < 8,000 hours
• Z = 7,800 – 8,000 / (800/√100) = -200/80 = -2.50 • [800/√100 = 80, the standard error of the mean]
• The computed Z value of -2.50 is in the region of rejection. Thus, reject H0 at p < .05– Note: When testing a hypothesis, we often have to perform a one-tail test if the claim
requires it. However, we will always use only two-sided confidence interval estimators when using sample statistics to estimate population parameters.
96
5%
-1.645
• In estimating µ based on sample statistics, how large a sample do we need for the level of precision we want? –To determine the sample size we need, we must
know the (1) desired precision and (2) σ.
• e, the half-width of the confidence interval estimator is the precision with which we are estimating. e is also called sampling error.
97
ecisione
nZX
Pr
/
…continued
We use e to solve for n:
If then
and so
98
n
Ze
e
Zn
2
22
e
Zn
99
2
22
10
2096.1n
• Similarly, taking e (precision) from formula for the half-width of a confidence interval estimator for P:
• Q: If we are trying to estimate the population proportion, P, what do we use for P in this formula?
100
2
2 )1(
e
PPZ
Suppose a pollster wants a maximum error of e = .01 with 95% confidence. We assume that variance is the highest possible, so we use P=.5. This is the way we ensure that sampling error will be within ±.01 of the true population Proportion.Then,n = = 9,604
That is a VERY large sample.
101
2
2
01.
)5.1(5.96.1
…continued
Let’s try that again with e = .03.
n = = 1,067
This is the sample size that most pollsters work with.
102
2
2
03.
)5.1(5.96.1