it colleges introduction to statistical computer packages lecture 3 eng. heba hamad week 3 - 2008

31
IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Upload: august-long

Post on 02-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

IT Colleges

Introduction to Statistical Computer Packages

Lecture 3

Eng. Heba Hamad

week 3 - 2008

Page 2: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Chapter 2 (part 2)

Statistics for Describing Data

Introduction to Statistics

Page 3: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Statistics for Describing, Exploring, and Comparing Data

Measures of Center

Measures of Variation

Introduction to Statistics

Page 4: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Measures of Center

Introduction to Statistics

Page 5: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Key ConceptWhen describing, exploring, and comparing data sets, these characteristics are usually extremely important:

center,

variation,

distribution,

Outliers.

Introduction to Statistics

Page 6: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Definition

Measure of Center (Measures of location)

The value at the center or middle of a data set.

Measures of centers are often referred to as averages. The purpose of a measure of center is to pinpoint the center of a set of values.

Introduction to Statistics

Page 7: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Definition

Arithmetic Mean (Mean)

The measure of center obtained by adding the values and dividing the total by the number of values

Introduction to Statistics

Page 8: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Notation

denotes the sum of a set of values.

x is the variable usually used to represent the individual data values.

n represents the number of values in a sample.

N represents the number of values in a population.

Introduction to Statistics

Page 9: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Population Mean

The population mean is the sum of all values in the population divided by the number of the values in the population.

is pronounced ‘mu’ and denotes the mean of all values in a population

N

x

Introduction to Statistics

Page 10: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Population Mean ExampleThere are 12 automobile manufacturing companies in the United States. Listed below is the no. of patents granted by U.S. government to each company in a recent year

CompanyNo. of PatentsCompanyNo. of Patents

General Motors511Mazda210

Nissan385Chrysler97

DaimlerChrysler275Porsche50

Toyota257Mitsubishi36

Honda249Volvo23

Ford234BMW13

Introduction to Statistics

Page 11: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Population Mean Example

This is an example of a population mean because we are considering all the automobiles manufacturing companies obtaining patents.

=195

Introduction to Statistics

Page 12: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Sample Mean

The sample mean is the sum of all the sampled values divided by the total number of sampled values.

is pronounced ‘x-bar’ and denotes the mean of a set of sample valuesx

n

xx

Introduction to Statistics

Page 13: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Sample Mean Example

SunCom is studying the number of minutes used monthly by clients in a particular cell phone rate plan. A random sample of 12 clients showed the following number of minutes used last month

90 77 94 89 119 112 91 110 100 92 113 83

What is the arithmetic mean number of minutes used?

Sample mean = 97.5 minutesx

Introduction to Statistics

Page 14: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Properties of the Arithmetic Mean (Mean)

• All the values are included in the computing mean

• A set of data has only one mean. The mean is unique.

• The sum of the deviations of each value from the mean will be zero.

• The mean is affected by outliers

Introduction to Statistics

Page 15: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Exercise

There are 10 salespeople employed by midtown ford. The no. of new cars sold last month by respective salespeople were:

15,23,4,19,18,10,10,8,28,19

Compute the mean and indicate whether it is a statistic or a parameter.

Introduction to Statistics

Page 16: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Definitions

Median

the middle value when the original data values are arranged in order of increasing (or decreasing) magnitude

is not affected by an

extreme value

Introduction to Statistics

Page 17: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Finding the Median

If the number of values is odd, the median is the number located in the exact middle of the list.

If the number of values is even, the median is found by computing the mean of the two middle numbers.

Introduction to Statistics

Page 18: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

5.40 1.10 0.42 0.73 0.48 1.10

0.42 0.48 0.73 1.10 1.10 5.40

2

(in order - even number of values – no exact middleshared by two numbers)

MEDIAN is 0.915

Introduction to Statistics

Page 19: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

5.40 1.10 0.42 0.73 0.48 1.10 0.66

0.42 0.48 0.66 0.73 1.10 1.10 5.40

(in order - odd number of values)

exact middle MEDIAN is 0.73

Introduction to Statistics

Page 20: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Properties of the Median

• A set of data has only one median. The mean is unique.

• The mean is not affected by extremely large or smale values.

Introduction to Statistics

Page 21: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Definitions

Mode

the value that occurs most frequently

Mode is not always unique

A data set may be:Bimodal

MultimodalNo Mode

Introduction to Statistics

Page 22: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

a. 5.40 1.10 0.42 0.73 0.48 1.10

b. 27 27 27 55 55 55 88 88 99

c. 1 2 3 6 7 8 9 10

Mode - Examples

Mode is 1.10

Bimodal - 27 & 55

No Mode

Introduction to Statistics

Page 23: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Midrange

The value midway between the maximum and minimum values in the original data set

Definition

Midrange = maximum value + minimum value

2

Introduction to Statistics

Page 24: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Example 1

Table 3.1

Table 3.2

Data Set IData Set II

For each data set determine: Mean Median Mode

Introduction to Statistics

Page 25: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Example 1

Solution

Introduction to Statistics

Page 26: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

The weighted meanThe weighted mean is a special case of the arithmetic mean. It occurs when there are several observations of the same value. To explain, suppose we want to compute the mean for the following values: 9, 9, 9, 12.5, 12.5, 12.5, 12.5, 15, 15, 15 ,

Weighted mean = (3*9 +4*12.5 + 3*15) / 10

= 12.2

Introduction to Statistics

Page 27: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

use class midpoint of classes for variable x

Mean from a Frequency Distribution

Class Mid point

Introduction to Statistics

Page 28: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Example 2

Class limitsClass Mid point xFrequency ff . x21 - 3025.52871431 - 4035.530106541 - 5045.51254651 - 6055.5211161 - 7065.5213171 - 8075.52151

Sum762718

76.3576

2718.

f

xfx

Introduction to Statistics

Page 29: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Best Measure of Center

Introduction to Statistics

Page 30: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Symmetricdistribution of data is symmetric if

the left half of its histogram is roughly a mirror image of its right half

Skeweddistribution of data is skewed if it is

not symmetric and if it extends more to one side than the other

Definitions

Introduction to Statistics

Page 31: IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week 3 - 2008

Skewness

Introduction to Statistics