Statistical analyses

Upload: antonia-pierce

Post on 24-Dec-2015

TRANSCRIPT

Page 1: Statistical analyses. SPSS  Statistical analysis program  It is an analytical software recognized by the scientific world (e.g.: the Microsoft Excel

Statistical analyses

SPSS

Statistical analysis program

It is analytical software recognized by the scientific community (by contrast, the Microsoft Excel program, for example, is not recognized by the scientific community).

SPSS

Let’s start the SPSS software!

Paste the data into the DATA VIEW window!

SPSS has two windows: one contains the data (DATA VIEW), and the types of the variables must be defined in the other (VARIABLE VIEW).

Exact coding of variables is the basis of successful SPSS use.

Basics of computer-based analysis

Types of data

Measurable data (interval scale)
Differences between adjacent values are equal.
E.g.: How old are you? How much do you weigh?

Ordinal data
Data originating from ranking.
Special type: tied (related) rank positions.

Nominal scale
The data are replaced by numbers. E.g.: Gender? 1. Male 2. Female
The data do not signal order.
The data cannot be added.

Statistical procedures

Descriptive statistics
Used when we analyze only the actual persons surveyed, that is, population = sample.

Statistical indicators: frequencies, central tendency, dispersion, correlation

Statistical procedures

Mathematical statistics
It tells us whether we may draw conclusions about the population from a representative sample.

Definitions
Population: the group to which the conclusions refer. E.g.: university students; German people; teachers
Sample: those actually involved in the survey
Representative sample: the composition of the sample mirrors the composition of the population. E.g.: the case of Gallup's public opinion institute around the time of the 1936 presidential election

Mathematical statistics

Analysis of differences
The aim: to show the criteria in which the elements differ from each other.

Test to use, by number of samples (rows) and type of data (columns):

Number of samples   Scale                                Ordinal               Nominal
One                 One-sample t-test                    Wilcoxon test         Crosstabs analysis, Chi-square test
Two                 Independent-samples t-test, F-test   Mann-Whitney test     Crosstabs analysis, Chi-square test
Three or more       ANOVA                                Kruskal-Wallis test   Crosstabs analysis, Chi-square test

Mathematical statistics

Analyzing correlations

Procedure to use, by number of samples (rows) and type of data (columns):

Number of samples   Scale                                                    Ordinal                Nominal
Two                 Correlation                                              Spearman correlation   Crosstabs analysis, Chi-square test
Two or more         Regression
More than two       Partial correlation, Factor analysis, Cluster analysis

Descriptive statistics

Central Tendency

Mean

Mode (the most frequent value)

Median
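The three indicators can be computed with Python's standard `statistics` module. This is only a small sketch: the `scores` list is hypothetical data invented for illustration, not taken from the slides.

```python
import statistics

# Hypothetical test scores of 9 respondents (illustrative data only).
scores = [2, 3, 3, 4, 4, 4, 5, 5, 9]

mean_score = statistics.mean(scores)      # arithmetic average
median_score = statistics.median(scores)  # middle value of the ordered data
mode_score = statistics.mode(scores)      # most frequent value

print(mean_score, median_score, mode_score)
```

Note how the single large value (9) pulls the mean above the median and the mode.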

Frequency

1. Determining the number of categories

An odd number between 10 and 20.

If the sample is small (e.g. 50 respondents), there can be fewer categories (e.g. 7).

2. Determining the intervals

The interval width is 1, 2, 3, 5 or 10, depending on the number of categories.

Disjunction: each item in the sample must fall into exactly one category, so the groups may not overlap.

E.g.: Bad example: age groups: below 20, 20-30, 30-40, …

E.g.: Good example: age groups: below 20, 20-29, 30-39, …
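The non-overlapping grouping can be sketched in a few lines of Python. The function name `age_group` and its parameters are hypothetical, and the sketch assumes whole-number ages; it only illustrates that each age maps to exactly one category.

```python
def age_group(age, width=10, first_upper=20):
    """Return the label of the single category that contains `age`.

    Assumes integer ages; `width` and `first_upper` are illustrative defaults.
    """
    if age < first_upper:
        return f"below {first_upper}"
    lower = (age // width) * width  # e.g. 29 -> 20, 30 -> 30
    return f"{lower}-{lower + width - 1}"

ages = [18, 20, 29, 30, 39, 41]
labels = [age_group(a) for a in ages]
print(labels)
```

With the overlapping groups (20-30, 30-40) a 30-year-old would fit two categories; here 30 can only land in "30-39".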

Absolute frequency

Def.: The number of items belonging to a particular category is the absolute frequency of that category.

Together, the frequencies of the categories form the absolute frequency distribution of the sample.

Further frequency indicators

Relative frequency is the quotient of the absolute frequency and the sample size. It gives the percentage of respondents in a particular category relative to the whole sample.

Cumulative frequency is the number of items in the sample that lie below the upper limit of the category.

Cumulative percent is the quotient of the cumulative frequency and the sample size.

It shows what percent of the sample lies below the upper limit of the category.
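The four frequency indicators can be tabulated together. The following is a minimal sketch with made-up data; the names `groups`, `order` and `frequency_table` are hypothetical and the result is not meant to reproduce the layout of the SPSS output.

```python
from collections import Counter

# Hypothetical category labels of 20 respondents (illustrative data only).
groups = ["below 20"] * 4 + ["20-29"] * 8 + ["30-39"] * 6 + ["40-49"] * 2
order = ["below 20", "20-29", "30-39", "40-49"]

def frequency_table(values, categories):
    """Absolute, relative, cumulative frequency and cumulative percent per category."""
    counts = Counter(values)
    n = len(values)
    rows, cumulative = [], 0
    for cat in categories:
        absolute = counts[cat]
        cumulative += absolute  # items up to and including this category
        rows.append({
            "category": cat,
            "absolute": absolute,
            "relative_pct": 100 * absolute / n,
            "cumulative": cumulative,
            "cumulative_pct": 100 * cumulative / n,
        })
    return rows

table = frequency_table(groups, order)
for row in table:
    print(row)
```

The last row always reaches the full sample size and a cumulative percent of 100.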

Dispersion indicators

Range: the difference between the highest and the lowest item of the sample. R = Xmax - Xmin

Mean absolute deviation: the average distance (absolute deviation) of the items from the mean.

Sum of squares: the sum of the squared deviations from the mean.

Variance

Variance: the sum of squares divided by the degrees of freedom of the sample.

Degrees of freedom: the number of independent elements of the sample. For the variance of a sample of n respondents this is n - 1, because the deviations from the mean must sum to zero.

Standard deviation

Standard deviation is the positive square root of the variance.
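The dispersion indicators build on one another, which a short calculation makes visible. This is a sketch on hypothetical data; it assumes the sample variance with n - 1 degrees of freedom, which is the form statistical packages such as SPSS report.

```python
import math

# Hypothetical sample of 8 values (illustrative data only).
data = [2, 4, 4, 4, 5, 5, 7, 9]
n = len(data)
mean = sum(data) / n

value_range = max(data) - min(data)                  # R = Xmax - Xmin
mean_abs_dev = sum(abs(x - mean) for x in data) / n  # average distance from the mean
sum_sq = sum((x - mean) ** 2 for x in data)          # sum of squared deviations
variance = sum_sq / (n - 1)                          # assumption: n - 1 degrees of freedom
std_dev = math.sqrt(variance)                        # positive square root of the variance

print(value_range, mean_abs_dev, sum_sq, variance, std_dev)
```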

Theorem

For normally distributed data:

More than 2/3 of the data lie within 1 standard deviation of the mean, in the positive and negative directions.

About 95% of the data lie within 2 standard deviations of the mean.

More than 99% of the data lie within 3 standard deviations of the mean.
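For a normal distribution, the share of data within k standard deviations of the mean equals erf(k / sqrt(2)). The sketch below evaluates this with the standard library; the helper name `share_within` is hypothetical, and the result applies only under the normality assumption.

```python
import math

def share_within(k):
    """Share of a normal distribution lying within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))

for k in (1, 2, 3):
    print(k, round(100 * share_within(k), 1), "%")
```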

Relative standard deviation

The relative standard deviation is an indicator that shows what percent of the mean the standard deviation is.

Relative deviation = standard deviation / mean
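As a brief illustration, the ratio can be expressed as a percentage. The function name and the `weights_kg` values are hypothetical, and the sketch assumes the sample standard deviation (n - 1 in the denominator).

```python
import statistics

def relative_deviation_pct(values):
    """Standard deviation expressed as a percentage of the mean."""
    return 100 * statistics.stdev(values) / statistics.mean(values)

# Hypothetical body weights in kg (illustrative data only).
weights_kg = [60, 70, 80]
print(relative_deviation_pct(weights_kg))
```

Because it is a ratio, the indicator lets samples measured in different units be compared.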

Quartiles

The quartiles are the points that divide the ordered sample into four equal parts.

Interquartile range (the spread of the middle half of the sample): the difference between the third and the first quartile, Q3 - Q1
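A short sketch with `statistics.quantiles` shows the calculation. The data are hypothetical, and the default "exclusive" interpolation of the Python function is an assumption: other programs may place the quartiles slightly differently on small samples.

```python
import statistics

# Hypothetical ordered sample of 11 values (illustrative data only).
values = [3, 5, 7, 8, 9, 11, 13, 15, 16, 18, 20]

# n=4 asks for the three cut points that split the data into four parts.
q1, q2, q3 = statistics.quantiles(values, n=4)
iqr = q3 - q1  # Q3 - Q1

print(q1, q2, q3, iqr)
```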

Interrelations

Interrelations between frequency and mean indicator

Left side tendency: Mode > Median > Mean

Right side tendency: Mode < Median < Mean

Interrelations between frequency and mean indicator

Normal distribution (bell curve): all three indicators coincide: Mode = Median = Mean
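The ordering of the three indicators can be checked mechanically. The function `tendency` below is a hypothetical helper written for illustration, and the three samples are invented; real data often fall into the "undetermined" branch because the orderings are rules of thumb.

```python
import statistics

def tendency(values):
    """Classify a sample by the order of its mode, median and mean."""
    mean = statistics.mean(values)
    median = statistics.median(values)
    mode = statistics.mode(values)
    if mode < median < mean:
        return "right side tendency"
    if mode > median > mean:
        return "left side tendency"
    if mode == median == mean:
        return "symmetrical"
    return "undetermined"

print(tendency([1, 2, 2, 2, 3, 3, 4, 9]))     # long tail towards high values
print(tendency([1, 6, 7, 7, 8, 8, 8, 9]))     # long tail towards low values
print(tendency([1, 2, 2, 3, 3, 3, 4, 4, 5]))  # balanced around 3
```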

Mathematical statistics

Examining relationships

Correlation

The correlation coefficient is an indicator that shows the direction and strength of the relationship between two data series.
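A minimal sketch of the Pearson correlation coefficient follows, assuming two scale variables of equal length. The function name `pearson_r` and the data are hypothetical; the slides do not state which formula they use.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long data series."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    co_dev = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return co_dev / math.sqrt(ss_x * ss_y)

# Hypothetical data: weekly hours of sport and body weight in kg.
hours_sport = [1, 2, 3, 4, 5]
weight_kg = [80, 78, 75, 71, 70]
r = pearson_r(hours_sport, weight_kg)
print(r)
```

The sign of `r` gives the direction and its absolute value gives the strength.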

Correlation

$r_{xy} > r_{\text{table}}$: there is correlation between the two samples

$r_{xy} < r_{\text{table}}$: there is no correlation between the two samples

($r_{\text{table}}$: the critical value read from the table)

Correlation coefficient

The interpretation of the correlation coefficient

0.9 - 1: extremely strong correlation between the two data series
0.75 - 0.9: strong
0.5 - 0.75: detectable
0.25 - 0.5: weak
0.0 - 0.25: no relationship

Direction

If the correlation coefficient is negative: inverse relationship (one variable rises as the other falls)

E.g.: the number of hours spent doing sports and your weight

If the correlation coefficient is positive: the data change in the same direction

E.g.: the size of your home library and how much you love reading
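The interpretation bands can be written as a small lookup. The function name `describe_correlation` is hypothetical, and because adjacent bands share a boundary value, the sketch assumes a boundary value belongs to the stronger band.

```python
def describe_correlation(r):
    """Return (strength label, direction) for a correlation coefficient r."""
    strength = abs(r)  # the strength does not depend on the sign
    if strength >= 0.9:
        label = "extremely strong"
    elif strength >= 0.75:
        label = "strong"
    elif strength >= 0.5:
        label = "detectable"
    elif strength >= 0.25:
        label = "weak"
    else:
        label = "no relationship"
    direction = "positive" if r > 0 else "negative" if r < 0 else "none"
    return label, direction

print(describe_correlation(-0.8))
print(describe_correlation(0.3))
```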

Relationship between/among variables: Crosstabs

Crosstabs: illustrating the joint distribution of two nominal or ordinal variables in the same table.

Crosstabs: Chi-square

It is an indicator which shows whether the relationships in the crosstabs are valid only for the sample or for the population as well.

It cannot be used reliably if the expected count is less than 5 in more than 20% of the cells.
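The chi-square statistic and the share of cells with a small expected count can both be derived from the observed table. This is a sketch under stated assumptions: the function name `chi_square` and the 2 x 2 table are hypothetical, the 5-count rule is applied to expected counts, and no p value is computed.

```python
def chi_square(observed):
    """Return (chi-square statistic, expected counts, share of cells with expected count < 5)."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    # Expected count of a cell = row total * column total / sample size.
    expected = [[r * c / n for c in col_totals] for r in row_totals]
    statistic = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )
    cells = len(row_totals) * len(col_totals)
    small = sum(1 for row in expected for e in row if e < 5)
    return statistic, expected, small / cells

# Hypothetical crosstab: gender (rows) by yes/no answer (columns).
observed = [[20, 30], [30, 20]]
statistic, expected, share_small = chi_square(observed)
print(statistic, expected, share_small)
```

For a 2 x 2 table (1 degree of freedom) the statistic is compared with the critical value 3.841 at the 5% level; the test is considered usable here because no expected count falls below 5.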

Hypothesis analyses

It is a method to decide whether the differences in data are significant or random.

Paired-samples T-test

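The paired-samples T-test is used when the same people are asked or tested twice (e.g. in a one-group experiment).

$$t' = \frac{\bar{z}}{s_z}\,\sqrt{n}$$

Where:

$\bar{z}$ - mean of the paired differences $z$
$s_z$ - standard deviation of the paired differences
$n$ - number of pairs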
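The paired-samples statistic can be sketched as the mean of the pairwise differences divided by its standard error. The function name `paired_t` and the scores are hypothetical, and the sketch assumes the sample standard deviation of the differences (n - 1 in the denominator).

```python
import math
import statistics

def paired_t(before, after):
    """t statistic for two measurements taken on the same respondents."""
    diffs = [a - b for b, a in zip(before, after)]  # one difference per person
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # assumption: sample standard deviation
    return mean_diff / (sd_diff / math.sqrt(n))

# Hypothetical scores of three people tested twice (illustrative data only).
before = [5, 5, 5]
after = [6, 7, 8]
t_value = paired_t(before, after)
print(t_value)
```

The result is then compared with the critical t value for n - 1 degrees of freedom.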

Paired-samples T-test

Compare the computed t' value with the value in the "Critical values of the t-distribution" table.

If t' > t_table, the difference is significant.

If t' < t_table, the difference is random.

T-test with computer

It is not necessary to use the "Critical values of the t-distribution" table, because most software provides the "p" value (Signif of t, Sig. Level).

The "p" value shows the probability of error: how likely it is that a difference this large arises by chance alone.

If p < 0.05 (5%), the difference is significant.

Independent t-test

H0: the two independent samples are taken from the same population.

(Definition of H0: the null hypothesis is that the difference is random.)

This type of test can only be conducted if the variances of the two groups are not too different.

The F-test can give the answer.

F-test

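The F value is the quotient of the two variances (the squared standard deviations), with the larger variance in the numerator:

$$F = \frac{s_1^2}{s_2^2}$$

If F_number < F_table, there is no significant difference between the variances.

If F_number > F_table, there is a great difference between the variances: the T-test cannot be done, but you can try the Welch test.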
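The variance ratio is simple to compute by hand or in code. The sketch below assumes sample variances (n - 1 in the denominator) and puts the larger variance in the numerator, a common convention for table lookups; the function name `f_ratio` and the two samples are hypothetical.

```python
import statistics

def f_ratio(sample_a, sample_b):
    """Ratio of the two sample variances, larger variance in the numerator."""
    var_a = statistics.variance(sample_a)
    var_b = statistics.variance(sample_b)
    return max(var_a, var_b) / min(var_a, var_b)

# Hypothetical groups (illustrative data only).
group_1 = [1, 2, 3, 4, 5]
group_2 = [2, 4, 6, 8, 10]
print(f_ratio(group_1, group_2))
```

The value is then compared with the critical F value for the two groups' degrees of freedom.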

Independent t-test

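$$t = \frac{\bar{x} - \bar{y}}{\sqrt{\sum_{i=1}^{n}(x_i - \bar{x})^2 + \sum_{i=1}^{m}(y_i - \bar{y})^2}} \cdot \sqrt{\frac{nm(n+m-2)}{n+m}}$$

where $n$ and $m$ are the sizes of the two samples $x$ and $y$.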

The degree of freedom = n+m-2.
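The independent-samples statistic can be sketched in its pooled-variance form, which assumes the two variances are similar. The function name `independent_t` and the data are hypothetical; this is an illustration, not the SPSS procedure itself.

```python
import math

def independent_t(xs, ys):
    """Pooled-variance t statistic and degrees of freedom for two independent samples."""
    n, m = len(xs), len(ys)
    mean_x, mean_y = sum(xs) / n, sum(ys) / m
    # Combined sum of squared deviations of both samples from their own means.
    ss = sum((x - mean_x) ** 2 for x in xs) + sum((y - mean_y) ** 2 for y in ys)
    t = (mean_x - mean_y) / math.sqrt(ss) * math.sqrt(n * m * (n + m - 2) / (n + m))
    return t, n + m - 2

# Hypothetical groups (illustrative data only).
t_value, df = independent_t([1, 2, 3], [3, 4, 5])
print(t_value, df)
```

The absolute value of t is compared with the critical value for n + m - 2 degrees of freedom.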

Illustration of result

[Figure: histogram of REL (X axis: REL, 0 to 30; Y axis: Frequency). Mean = 12.9, Std. Dev. = 5.515, N = 20]

[Figure: chart of REL with the categories 9, 12, 15, 18, 24 and Missing, titled "Egyéni eredmény" (individual result)]

Aim: to make the result clear and visual

Frequency polygon

Illustrating frequency data with a line diagram.

Histogram

Illustrating frequency data with a bar diagram.

The X axis shows the intervals.

Histogram shapes

Symmetrical, peaked

Symmetrical, normal

Histogram shapes

bimodal

Histogram shapes

Right side tendency

Histogram shapes

Left side tendency

Interrelations between frequency and mean indicator

Normal distribution: Mean = Median = Mode

Skewness = 0

Interrelations between frequency and mean indicator

Symmetric with two modes

Bimodal

Skewness = 0

Interrelations between frequency and mean indicator

Right side tendency

Mode < Median < Mean

Skewness = (+)

Interrelations between frequency and mean indicator

Left side tendency

Mean < Median < Mode

Skewness = (-)
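The sign of the skewness can be checked with the moment coefficient of skewness. This is a sketch under assumptions: statistical packages such as SPSS apply a small-sample correction, so their values differ slightly, but the sign is the same. In this convention a long tail towards high values (Mode < Median < Mean) gives a positive value and a long tail towards low values (Mean < Median < Mode) gives a negative one. The function name and the data are hypothetical.

```python
def skewness(values):
    """Moment coefficient of skewness: third central moment / (second central moment) ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

print(skewness([1, 2, 2, 2, 3, 3, 4, 9]))  # long tail towards high values
print(skewness([1, 6, 7, 7, 8, 8, 8, 9]))  # long tail towards low values
print(skewness([1, 2, 3, 4, 5]))           # symmetrical sample
```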

Normal distribution with different standard deviation

Kurtosis = 0: normal (as reported by SPSS)

If the kurtosis value is larger, the polygon is more peaked; if it is smaller (negative), the polygon is flatter