the arithmetic mean the arithmetic mean is the statisticians term for what the layman knows as the...

45
THE ARITHMETIC MEAN The arithmetic mean is the statistician’s term for what the layman knows as the average. It can be thought of as that value of the variable series which is numerically MOST representative of the whole series. “The arithmetic mean or simply the mean is a value obtained by dividing the sum of all the observations by their number.” Sum ofallthe observations X = N um ber ofthe observations

Upload: sebastian-harrison

Post on 26-Mar-2015

225 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

THE ARITHMETIC MEAN

The arithmetic mean is the statistician’s term for what the layman knows as the average. It can be thought of as that value of the variable series which is numerically MOST representative of the whole series.

“The arithmetic mean or simply the mean is a value obtained by dividing the sum of all the observations by their number.”

Sum of all the observations X = Number of the observations

Page 2: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

n

XX

n

1ii

where n represents the number of observations in the sample that has been the ith observation in the sample (i = 1, 2, 3, …, n), and represents the mean of the sample.

For simplicity, the above formula can be written as

n

XX

(In other words, it is not necessary to insert the subscript ‘i’.)

Page 3: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Day Receipt of News Agent

Monday £ 9.90Tuesday £ 7.75Wednesday £ 19.50Thursday £ 32.75Friday £ 63.75Saturday £ 75.50Sunday £ 50.70Week Total £ 259.85

EXAMPLE

Information regarding the receipts of a news agent for seven days of a particular week are given below:

Mean sales per day in this week :

= £ 259.85/7 = £ 37.12

(to the nearest penny).

X

Page 4: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Interpretation:

The mean, £ 37.12, represents the amount (in pounds sterling) that would have been obtained on each day if the same amount were to be obtained on each day.

To calculate the approximate value of the mean, the observations in each class are assumed to be identical with the class midpoint Xi.

As was just mentioned, the observations in each class are assumed to be identical with the midpoint i.e. the class-mark. , (This is based on the assumption that the observations in the group are evenly scattered between the two extremes of the class interval).

The mid-point of every class is known as its class-mark.In other words, the midpoint of a class ‘marks’ that class.

Page 5: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Mid PointX

Frequencyf

X1 f1

X2 f2

X3 f3

:::

:::

Xk fk

FREQUENCY DISTRIBUTION

In case of a frequency distribution, the arithmetic mean is defined as:

n

Xf

f

XfX

k

1iii

k

1ii

k

1iii

Page 6: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

For simplicity, the above formula can be written as

n

fX

f

fXX

(The subscript ‘i’ can be

dropped.)

Class (Mileage Rating)

Frequency (No. of Cars)

30.0 – 32.9 2 33.0 – 35.9 4 36.0 – 38.9 14 39.0 – 41.9 8 42.0 – 44.9 2

Total 30

EPA MILEAGE RATINGS OF 30 CARS OF A CERTAIN MODEL

Page 7: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

CLASS-MARK

(MID-POINT):

The mid-point of each class is obtained by adding the

sum of the two limits of the class and dividing by 2. Hence, in this example, our mid-points are computed in this manner:

30.0 plus 32.9 divided by 2 is equal to 31.45, 33.0 plus 35.9 divided by 2 is equal to 34.45,

and so on.

Class(Mileage Rating)

Class-mark(Midpoint)

X30.0 – 32.9 31.4533.0 – 35.9 34.4536.0 – 38.9 37.4539.0 – 41.9 40.4542.0 – 44.9 43.45

Page 8: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Class-mark(Midpoint)

X

Frequencyf

fX

31.45 2 62.934.45 4 137.837.45 14 524.340.45 8 323.643.45 2 86.9

30 1135.5

Applying the formula: ,f

fXX

we obtain

85.3730

5.1135X

Page 9: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

GROUPING ERROR

“Grouping error” refers to the error that is introduced by the assumption that all the values falling in a class are equal to the mid-point of the class interval.

In reality, it is highly improbable to have a class for which all the values lying in that class are equal to the mid-point of that class.

This is why the mean that we calculate from a frequency distribution does not give exactly the same answer as what we would get by computing the mean of our raw data.This grouping error arises in the computation of many descriptive measures such as the geometric mean, harmonic mean, mean deviation and standard deviation.

Page 10: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

But, experience has shown that in the calculation of the arithmetic mean, this error is usually small and never serious.

Only a slight difference occurs between the true answer that we would get from the raw data, and the answer that we get from the data that has been grouped in the form of a frequency distribution.

In this example, if we calculate the arithmetic mean directly from the 30 EPA mileage ratings, we obtain:

30

8.399.33.....1.303.36 X

82.3730

7.1134

Page 11: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

The arithmetic mean is predominantly used as a measure of central tendency. The question is, “Why is it that the arithmetic mean is known as a measure of central tendency?”

The answer to this question is that we have just obtained i.e. 37.85 falls more or less in the centre of our frequency distribution.

02468

10121416

Miles per gallon

Nu

mb

er

of

Cars

X

Y

Mean = 37.85

Page 12: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

DESIRABLE PROPERTIES OF THE ARITHMETIC MEAN

•Best understood average in statistics. •Relatively easy to calculate •Takes into account every value in the series.But there is one limitation to the use of the arithmetic mean: As we are aware, every value in a data-set is included in the calculation of the mean, whether the value be high or low.

Where there are a few very high or very low values in the series, their effect can be to drag the arithmetic mean towards them. this may make the mean unrepresentative.Let us consider an example:

Page 13: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Example of the Case Where the Arithmetic Mean Is Not a Proper Representative of the Data:

Suppose one walks down the main street of a large city centre and counts the number of floors in each building.

Suppose, the following answers are obtained:

5, 4, 3, 4, 5, 4, 3, 4, 5, 20, 5, 6, 32, 8, 27

The mean number of floors is 9 even though 12 out

of 15 of the buildings have 6 floors or less.

The three skyscraper blocks are having a disproportionate effect on the arithmetic mean.

Page 14: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

EXAMPLESuppose that in a particular high school, there are:-100 – freshmen80 – sophomores 70 – juniors50 – seniors

And suppose that on a given day, 15% of freshmen, 5% of sophomores, 10% of juniors, 2% of seniors are absent.

The problem is that: What percentage of students is absent for the school as a whole on that particular day?

Now a student is likely to attempt to find the answer by adding the percentages and dividing by 4 i.e.

84

32

4

210515

Page 15: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

As we have already noted, 15% of the freshmen are absent on this particular day. Since, in all, there are 100 freshmen in the school, hence the total number of freshmen who are absent is also 15.

But as far as the sophomores are concerned, the total number of them in the school is 80, and if 5% of them are absent on this particular day, this means that the total number of sophomores who are absent is only 4.

Categoryof Student

Number ofStudents inthe school

Number ofStudents who

are absentFreshman 100 15Sophomore 80 4Junior 70 7Senior 50 1

TOTAL 300 27

Page 16: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Dividing the total number of students who are absent by the total number of students enrolled in the school, and multiplying by 100, we obtain:

9100300

27

Page 17: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Category of Student

Percentage of Students

who are absent

Xi

Number of students

enrolled in the school (Weights)

Wi

WiXi (Weighted Xi)

Freshman 15 100 100 15 = 1500

Sophomore 5 80 80 5 = 400 Junior 10 70 70 10 = 700 Senior 2 50 50 2 = 100

Total Wi = 300 WiXi = 2700

In this example, the number of students enrolled in each category acts as the weight for the number of absences pertaining to that category i.e.

Page 18: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

i

iiw W

XWX

WEIGHTED MEAN

And, in this example, the weighted mean is equal to:

i

iiw W

XWX

9300

2700

An important point to note here is the criterion for assigning weights. Weights can be assigned in a number of ways depending on the situation and the problem domain.

In the example that we have just considered, greater weights are assigned to larger groups.

Page 19: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

MEDIAN

The median is the middle value of the series when the variable values are placed in order of magnitude.

The median is defined as a value which divides a set of data into two halves, one half comprising of observations greater than and the other half smaller than it. More precisely, the median is a value at or below which 50% of the data lie.

The median value can be ascertained by inspection in many series. For instance, in this very example, the data that we obtained was:

Page 20: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

EXAMPLE:

The average number of floors in the buildings at the centre of a city:

5, 4, 3, 4, 5, 4, 3, 4, 5, 20, 5, 6, 32, 8, 27Arranging these values in ascending order, we

obtain3, 3, 4, 4, 4, 4, 5, 5, 5, 5, 6, 8, 20, 27, 32Picking up the middle value, we obtain the

medianequal to 5.X

~

Page 21: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Interpretation:

The median number of floors is 5. Out of those 15 buildings, 7 have up to 5 floors and 7 have 5 floors or more.We noticed earlier that the arithmetic mean was distorted toward the few extremely high values in the series and hence became unrepresentative. The median = 5 is much more representative of this series.

Page 22: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

EXAMPLEHeight of buildings (number of floors)3344 7 lower4555 = median height5568 7 higher202732

Page 23: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

EXAMPLERetail price of motor-car (£) (several makes and sizes) 415 480 525

4 above

608

719 = median price

1,090 2,059 4,000

4 above

6,000

Page 24: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Number of passengers travelling on abus at six Different times during the day491418

= median value

2347

Median = 2

1814= 16 passengers

EXAMPLE

Page 25: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Example of Discrete a Frequency Example of Discrete a Frequency DistributionDistribution

Comprehensive SchoolComprehensive SchoolNo. of Pupils per classNo. of Pupils per class No. of classNo. of class

2323

2424

2525

2626

2727

2828

2929

3030

3131

11

00

11

33

66

99

88

1010

77

Page 26: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

EXAMPLE OF A DISCRETE FREQUENCY DISTRIBUTION

Comprehensive School:

Number of pupils per class Number of Classes23 124 025 126 327 628 929 830 1031 7

45

Page 27: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

In this school, there are 45 classes in all, so that we require as the median that class-size below which there are 22 classes and above which also there are 22 classes. In other words, we must find the 23rd class in an ordered list. We could simply count down noticing that there is 1 class of 23 children, 2 classes with up to 25 children, 5 classes with up to 26 children.

Proceeding in this manner, we find that 20 classes contain up to 28 children whereas 28 classes contain up to 29 children. This means that the 23rd class --- the one that we are looking for --- is the one which contains exactly 29 children.

Page 28: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Raw DataRaw Data

23, 25, 26, 26, 26, 27, 27, 27, 27, 2723, 25, 26, 26, 26, 27, 27, 27, 27, 27

27, 28, 28, 28, 28, 28, 28, 28, 28, 2827, 28, 28, 28, 28, 28, 28, 28, 28, 28

29, 29, 29, 29, 29, 29, 29, 29, 30, 3029, 29, 29, 29, 29, 29, 29, 29, 30, 30

30, 30, 30, 30, 30, 30, 30, 30, 31, 3130, 30, 30, 30, 30, 30, 30, 30, 31, 31

31, 31, 31, 31, 31 31, 31, 31, 31, 31

Median = 23rd Value

Page 29: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Comprehensive School:

Number ofpupils per class

X

Number ofClasses

f

CumulativeFrequency

cf23 1 124 0 125 1 226 3 527 6 1128 9 2029 8 2830 10 3831 7 45

45

median

Page 30: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Median number of pupils per class:

29~ X

This means that 29 is the middle size of the class. In other words, 22 classes are such which contain 29 or less than 29 children, and 22 classes are such which contain 29 or more than 29 children.

Page 31: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

ExampleExample

Displayed in the following table are the Displayed in the following table are the annual attendance figures in millions of annual attendance figures in millions of visitors of 32 U.S public zoological parks:visitors of 32 U.S public zoological parks:

Page 32: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Attendance figures of 32 zoos (in millions)Attendance figures of 32 zoos (in millions)

0.60.6

1.41.4

1.31.3

0.60.6

0.90.9

1.01.0

1.21.2

0.90.9

0.20.2

1.41.4

0.30.3

2.72.7

0.50.5

0.40.4

6.06.0

0.10.1

2.02.0

1.61.6

1.11.1

0.30.3

1.31.3

0.60.6

1.31.3

1.51.5

1.41.4

0.70.7

1.01.0

0.60.6

0.40.4

0.80.8

0.30.3

0.90.9

Source: The World Almanac and Book of Source: The World Almanac and Book of Facts, Funk & Wagnalls, 1995. Facts, Funk & Wagnalls, 1995.

Page 33: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

For these data, measures of locationFor these data, measures of location

can yield such information as thecan yield such information as the

average attendance of zoos, theaverage attendance of zoos, the

middle attendance figure and the mostmiddle attendance figure and the most

frequently occurring figure.frequently occurring figure.

Page 34: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Compute the mean, median and theCompute the mean, median and the

mode for the attendance figure listedmode for the attendance figure listed

in the above table. in the above table.

Page 35: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

SolutionSolution

1.1. Computation of Mean:Computation of Mean:

For these data we haveFor these data we have

Hence, the mean is:Hence, the mean is:41.6, 32X n

41.61.3 million

32X

Page 36: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

2.2. Computation of the MedianComputation of the Median::

Step-1Step-1

Arrange the data in an ordered arrayArrange the data in an ordered array

0.3 0.4 0.5 0.6 0.6 0.6 0.6 0.6 0.7 0.80.3 0.4 0.5 0.6 0.6 0.6 0.6 0.6 0.7 0.8

0.9 0.9 0.9 1.0 1.0 1.0 1.1 1.2 1.3 1.30.9 0.9 0.9 1.0 1.0 1.0 1.1 1.2 1.3 1.3

1.3 1.4 1.4 1.4 1.5 1.6 2.0 2.0 2.7 3.01.3 1.4 1.4 1.4 1.5 1.6 2.0 2.0 2.7 3.0

3.0 4.0 3.0 4.0

Page 37: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Step-2Step-2

In order to compute the median, the In order to compute the median, the

first point to be noted is that, in thisfirst point to be noted is that, in this

example, we are dealing with an evenexample, we are dealing with an even

number of values i.e. 32 number of values i.e. 32

Page 38: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

We compute the average of the We compute the average of the

values of the ordered data-set.values of the ordered data-set.

Here, we discuss another way to solve this Here, we discuss another way to solve this

problem. problem.

2&

2 2

n nth th

Page 39: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

As, there are n = 32 values, we can say that As, there are n = 32 values, we can say that the median is located atthe median is located at

1 32 1 3316.5th value

2 2 2

n

Page 40: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

X

15th

val

ue

16th

val

ue

17th

val

ue

18th

val

ue

“16.5th value”

Page 41: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

Hence, we haveHence, we have

Median =

1.0 1.1

1.05 million. 2

Page 42: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

3.3. Computation of the ModeComputation of the Mode

By inspecting the attendance figures, we By inspecting the attendance figures, we find that 0.6 is occurring five times find that 0.6 is occurring five times whereas all the other figures are whereas all the other figures are occurring less often.occurring less often.

Hence, Hence,

Mode = 0.6 million Mode = 0.6 million

Page 43: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

ConclusionConclusion

The mean or average, attendance at these The mean or average, attendance at these

32 zoological parks is 1.3 million. 32 zoological parks is 1.3 million.

The median or middle attendance figure isThe median or middle attendance figure is

1.05 million1.05 million

The mode i.e. the most frequently occurringThe mode i.e. the most frequently occurring

attendance figure is 0.6 million attendance figure is 0.6 million

Page 44: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

IN TODAY’S LECTURE, YOU LEARNT

•The importance of the mode

•The non-modal and the bi-modal situation

•The (simple) arithmetic mean

•The weighted arithmetic mean

•The median (in case of raw data and in case of the frequency distribution of a discrete variable).

Page 45: THE ARITHMETIC MEAN The arithmetic mean is the statisticians term for what the layman knows as the average. It can be thought of as that value of the variable

IN THE NEXT LECTURE, YOU WILL LEARN

•Computation of the median in the case of the frequency distribution of a continuous variable.

•Empirical relation between the mean, median and the mode.