fcm2063 - printed notes
DESCRIPTION
statistics/mathematicsTRANSCRIPT
-
5/27/2018 FCM2063 - Printed Notes
1/126
1
Probability and Statistics
For Scientist and Technologist
A Course in Probability and Statistics
By
Radzuan Razali
Afza Shafie
-
5/27/2018 FCM2063 - Printed Notes
2/126
2
Contents:
Part One: Probabili ty and Distri bution
Chapter 1
1. The concept of probability
1.1Introduction1.2Sample Space, probability of events, counting rule1.3Conditional probability1.4Multiplication rule1.5Bayes theorem
Chapter 2:
2. Discrete random variable and probability distribution
2.1Introduction2.2Discrete random variable2.3Discrete probability distribution2.4Special functions for discrete probability distribution
Chapter 3:
3. Continuous random variable and probability distribution
3.1Introduction3.2Continuous random variable3.3Continuous probability distribution3.4Special functions for continuous probability distribution
Part Two: Descriptive Statistics
Chapter 4:
4. Data display and summary of data
4.1 Introduction4.2 The definition and the difference between sample and population4.3 Graphical display of data: Stem and leaf, and Box-plot4.4 The mean, variance and standard deviation of the data
Chapter 5:
5. Statistical process control: X-bar and R-charts.
-
5/27/2018 FCM2063 - Printed Notes
3/126
3
5.1 Introduction5.2 Statistical process control: X-bar and R-charts
Part Three: I nferential Statistics
Chapter 6:
6. Hypothesis testing for single population
6.1Introduction6.2Test about a sample mean for large sample, population variance is known6.3TheP-valuefor the test and confidence interval for mean6.4Test about sample mean for small sample, population variance is unknown6.5TheP-valuefor the test and confidence interval for mean6.6Test about proportion
6.7TheP-valuefor the test and confidence interval for proportion6.8Confidence interval for proportion
6.9Test about variance6.10 TheP-valuefor the test and confidence interval for variance
Chapter 7:
7. Simple linear regression model
8.1 Introduction8.2 Least squares estimator
8.3 The coefficient of determination8.4 Confidence intervals and significance tests
Part Four: Design of Experiments
Chapter 8:
8. The design and analysis of experiments
10.1 Introduction10.2 Two-Factorial design experiment
Appendix
Table 1: The normal - Z distributionTable 2: The Students t-distribution
Table 3: The chi-squared, distributionTable 4: The F-distribution
-
5/27/2018 FCM2063 - Printed Notes
4/126
4
Preface
This book provides an introduction to probability and statistics, with particular emphasis on
applications in applied sciences, technology and engineering. Typically introductory texts onengineering statistics spend a great deal of time on basic probability ideas for the first severalchapters. In fact, basic probabilities can easily fill up a standard introductory course. Becauseengineering students often have only one probability/statistics course, the material needs to bereorganized in order to allow for coverage of statistical methodology.
This book will be divided to four parts; part 1 is related to the basic concept of probability andthe distributions as in the Chapter 1 till Chapter 4. Chapter 1, we give a brief introduction to thebasic concept of the probability. In Chapter 2, we introduce the definition of discrete randomvariables and the probability distributions. The continuous and their probability distributions willbe discussed in Chapter 3.
Part 2 is covering on descriptive statistics as in the Chapter 4 and Chapter 5. In Chapter 4, weintroduce the types of how to display data and summary of the data. Meanwhile, the randomsample, central limit theorem, normal approximation and statistical process control will bediscussed in the Chapter 5.
The engineering students also must be given some experience on how to do a basic data analysis.The inferential statistics are important as a part of statistical methods to do the data analysis. PartIII is covering the inferential statistics such as in the Chapter 6 till Chapter 9. In Chapter 6, weintroduce the hypothesis testing for single population and the hypothesis testing for twopopulations will be discussed in Chapter 7. While in Chapter 8 and Chapter 9, we introduce the
simple and multiple linear regressions, respectively.
Finally, in part IV, we also want the students have some experience on the real application inengineering. The related topics such as a factorial design and design of experiment will bediscussed in the Chapter 10.
Afza Shafie
Radzuan Razali
April 2010.
-
5/27/2018 FCM2063 - Printed Notes
5/126
5
Chapter 1
1. Basic concept of probabilityLearning objectives:
At the end of this chapter, student should be able to:
Define and construct sample space of an experiment.
Define random events, identify types of events, apply Venn Diagram and laws Tofind event set including intersection, union and complement.
Identify mutually exclusive and exhaustive events. To apply Bayes theorem to find the conditional probability of an event when the
event is partitioned into several mutually exclusive and exhaustive subsets.
1.1 Introduction
Probability theory refers to the study of randomness and uncertainty. Probability formsthe basis knowledge which we can make inferences about a population based on thedistribution and its providemethods for quantifying the chances or likelihood associatedwith various outcomes. Probability helps to explain a lot of everyday occurrences and weactually discuss it frequently.
Probability also has been used everyday in engineering and technology. For example: theprobability of a good part being produce, the reliability of a new machine (reliabilities areactually probabilities) etc.
An engineer wants to be fairly certain that the percentage of good rods is at least 90%;otherwise he will shut down the process for recalibration. How certain that he has at least90% of the 1000 rods are good?
What is the different between probability and inferential statistics? Probability isinvolving properties of the population under study which are assumed known and
questions regarding a sample taken from the population are posed and answered. While,inferential statistics is involved a characteristics of a sample which are available to theexperimenter and this information enables experimenter to draw conclusions about thepopulations.
-
5/27/2018 FCM2063 - Printed Notes
6/126
6
1.1.1 Definition:
Some definitions or terms in basic probability must be known and well understand.Among the definitions are:
Random Process is a situation in which possible results are known but actual resultscannot be predicted with certainty in advance.
Outcome is related toeach possible result for a random process
Experiment is a process by which an observation or measurement is obtained (yieldoutcomes)
1,2 Sample Space, probability of events, counting rule
In the process of collecting data before analysis and interpretation being done, the methodof how to model the random experiment is crucial. The terms related to it such as samplespace and an event are important.
1.2.1 Sample Space:
Sample space denoted by S, is the set of all possible outcomes of an experiment.Event is any collection (subset) of outcomes contained in the sample space S.
An event is called simple if it consists of exactly one outcome and called compoundevent if it consists of more than one outcome. Mean while the null event is an event withno outcomes. This is actually impossible event or empty set.
Example 1.1:
Experiment of roll a die:
The sample space is: S = {1, 2, 3, 4, 5, 6}
The simple events (or outcomes) are:
E1: observe No. 1 = {1} E2= {2} E3 = {3}
E4= {4} E5= {5} E6= {6}
The compound events are:A : observe an odd number = {1, 3, 5}B: observe a number greater than or equal to 4 = {4, 5, 6}
-
5/27/2018 FCM2063 - Printed Notes
7/126
7
Example 1.2:
Toss a coin for three times and observed the number of heads. The sample space is,
S = {0, 1, 2, 3}
The sample space for the lifetime of a machine (in days) is,
S = {t | t 0 }= [0, )
The sample space for the number of calls at a telephone exchange during a specific timeinterval is,
S = {0, 1,.}
The knowledge in set theory is important to understand the basic of probability. Theunion of eventsA andBdenoted byA U Band read A orB is the event consisting of alloutcomes that are either inAor inBor in both events.
The intersection ofAandBdenoted byA Band read A andB, is the event consistingof all outcomes that are in bothAandB.
The complement of event A, denoted by AC, is the event of all outcomes in the sample
space Sthat are not contained in eventA.
If two events A and B have no outcomes in common they are said to be mutuallyexclusive or disjoint events. This means that if one of the events occurs the other cannot.
All these events can be visualized in term of Venn diagram:
1.2.2 Probability of Events
An event is a subset of all of the possible outcomes of an experiment. The probability of
event is to assign for each event, sayE, a number,P(E),called the probability ofEwhichwill give a precise measure of the chance that Ewill occur. The probability of an eventE,is defined as the ratio of the number of outcome favorable to the event, ndivided by thetotal number of all possible outcomes,N. That isP(E) = n/N.
For example, in the experiment tossing a die repeatedly, in the long run, what would weexpect that the probability of even number will occurs,P(E=2 or 4 or 6)?
-
5/27/2018 FCM2063 - Printed Notes
8/126
8
In this experiment, an event is even number will occur three times, so n=3.The totalpossible outcomes is six, so N=6. Hence the probability of even number will occur is,P(E=2 or 4 or 6)=3/6=0.5
Condition of Probability
A probability denoted byPis a rule (or function) which assigns a number between 0 and1 to each event and must satisfies:
0 P(E) 1 for any event E
P( ) = 0 , P(S) = 1,
If A1, A2, is an infinite collection of mutuall y exclusiveevents, then
The probability of the complementof any event A is given as
For example, ifP(rain tomorrow) = 0.6 then P(no rain tomorrow) = 0.4
Other notations for complement forA isAcor
Example 1.3:
An oil-prospecting firm plans to drill two exploratory wells. Past evidence is used toassess the possible outcomes listed in the following table:
Event Description Probability
A
BC
Neither well produces oil nor gas
Exactly one well produces oil or gas.Both wells produce oil or gas
0.85
0.120.03
Find and give description.
1 2 1 2( ...) ( ) ( ) ...P A A P A P A
( ') 1 ( )P A P A
)()(),( CBPandCBPBAP
-
5/27/2018 FCM2063 - Printed Notes
9/126
9
Solution:
EventsA, Band C are mutually exclusive because the occurrence of one eventprecludes the occurrence of either of the other two.
P(A or B) = P(A) + P(B) = 0.97 (probability at most one well produces oil orgas)
P(B or C) = P(B) + P(C)= 0.15(probability at least one well produces gas or oil
P(B) = 1 P(B) = 0.88(probability both wells not produce or both produce oil or gas)
1.2.3 General Addition Law
LetA andBbe two events defined in a sample space S.
If two eventsA andBare mutually exclusive, then
Thus
This can be expanded to consider more than twomutually exclusive events.
Example 1.4
One of the residential in Ipoh, 45% of all households subscribe to the Sinar Hariannewspaper published in a nearby city, 75% subscribe to the Utusan Malaysia, and 30% ofall households subscribe to both papers. Draw a Venn diagram for this problem.If a household is selected at random, what is the probability that it subscribes toa) At least one of the two newspapersb) Exactly one of the two newspapers
Solution:
a)A= event subscribe to Sinar Harian,B= event subscribe to Utusan Malaysia
P(A U B) = [ P(A) + P(B)P(A B)] = 0.45 + 0.750.30 = 0.9
b)P (exactly one) =P (A B) +P (A B) = 0.15 + 0.45 = 0.6
P(A B) P(A) P(B) P(A B)
P(A B) 0
P(A B) P(A) P(B)
-
5/27/2018 FCM2063 - Printed Notes
10/126
10
The probability of an event Aequals the number of outcomes (sample points) containedinAdivided by the total number of possible outcomes. That is:
P(A) = n(A) / n(S)
Important condition: all outcomes are equally likely to occur. Inefficient when n(S) islarge.
1.2.4 Counting Rule:
Eliminates the need for listing each simple event and help to easily assigned probabilitiesto various events when the outcomes are equally likely. Especially helpful if the samplespace is quite large.
Product (Multiplication) Rule
If there are kelements ( or things) to choose and there are n1choices for the firstelement, n2for the second element, and so on to nkchoices for the k
th element,
then the number of possible ways of selecting them is only applies when elements aredifferent or the order of elements matters.
Example 1.5:
A chemical engineer wishes to conduct an experiment to determine how these fourfactors affect the quality of the coating. She is interested in comparing two charge levels,three density levels, four temperature levels, and five speed levels. How many
experimental conditions are possible?
Solution:
The possible experiment conditions are 2x3x4x5=120
Permutations and Combinations
Permutation is an orderedarrangement of kobjects taken from a set of ndistinct objects (k n ).
The number of waysof permutation of kobjects from ndistinct objects will be denotedby the symbolPk,n
)!(
!PP
kn
nnknk
,
-
5/27/2018 FCM2063 - Printed Notes
11/126
11
Example 1.6:
8 teaching assistants are available to grade an exam of four questions. Wish to select adifferent assistant to grade each question (only one assistant per question). How manypossible ways can the assistant are chosen for grading?
Solution:
The number of possible ways is
Combination
Combination: an unorderedsubset of kobjects taken from a set of ndistinct objects.
The number of waysof combination of kobjects from ndistinct objects is denoted by thesymbol Ck,n
Permutation vs. Combination
Permutations are larger in number than combinations: e.g., the three numbers (1,2, 3), (1,3,2) (2,3,1) , (3,1,2), (3,2,1) are all different permutations of the numbers 1, 2 and 3.However, they all represent the same combination of numbers.
Example 1.7:
Fifteen players compete in a tournament. In how many ways cana) rankings be assigned to the top five competitors?b) the best five competitors be randomly chosen?
Solution
The number of rankings that can be assigned to the top five competitors is
The number of ways that five competitors can be chosen is
168084 PPn
k
!!
!,
)( knk
n
k
nCC
k
nnk
,!!( )! !
k nk
nn PnCk k n k k
360,360155 PPn
k
003,3!10!5
!15
5
15
nkC
-
5/27/2018 FCM2063 - Printed Notes
12/126
12
1.3 Conditional Probability; Independent Events;
Sometimes it is useful to know the probability that an event will occur given that anotherevent occurred. Given two possible events, if we know that one event occurred then this
information can be applied in calculating the other events probability.
1.3.1 Conditional Probability
The conditional probability ofA, given that Bhas already occurred, is denoted as P (A |B)and defined as:
0)(provided,)(
)()(
Bp
Bp
BApBAp
The conditional probability of B, given thatAhas already occurred, is denoted as P ( B |
A)and defined as:
0)(provided,)(
)()(
Ap
Ap
BApABp
Example 1.8:
The Information Resource Center(IRC), UTP displays three types of books entitledScience (S), Engineering (E), and Technology (T). Reading habits of randomlyselected reader with respect to these types of books are
Read regularly S E T SE ST ET SETProbability 0.14 0.23 0.37 0.08 0.09 0.13 0.05
Find the following probabilities and interpret
a) P( S | E )b) P( S |E U T )c) P( S | reads at least one )d) P( S U E | T)Solution:
-
5/27/2018 FCM2063 - Printed Notes
13/126
13
3478.023.0
08.0
)(
)(
)(
Ep
ESp
ESp
2553.047.0
12.0
)(
)()(
TEp
TESpTESp
2857.049.0
14.0
)(
)(
)(
)(
)()oneleastatreads(
TESp
Sp
TESp
TESSp
TESSpSp
5946.037.022.0
)()()(
TpTESpTESp
1.3.2 Independent Events
The probability of both events occurring can be calculated by rearranging the terms in theexpression of conditional probability.
Two eventsAandBare called independent if the probability of eventAis not affected bythe occurrence of eventB, so and
Example 1.9:
In rolling a fair die, let eventA = {1, 3, 5}and eventB = {4, 5, 6}.Are eventsAandBindependent?
0.07
0.20
0.02
0.04
0.03
0.080.05
S E
T39.3x
)()|( APBAP
)()()( BPBAPBAP
)()()( BPAPBAP
-
5/27/2018 FCM2063 - Printed Notes
14/126
14
Solution:
P(A) = , P(B)=1/2 and
Since , soA andB are not independent events.
Whats the difference between mutually exclusiveand independentevents?Two events mutually exclusive (disjoint): both cannot happen when the experiment isperformed, soP( A| B) = 0, or vice versa
Two mutually exclusive events: P(A B) = 0and P( A U B) = P(A) + P(B)Mutually exclusive events must be dependent.Two events are independent:P( A U B) = P(A) + P(B)P(A B)
Example 1.10:
Toss a single die and observe the eventsA: a number less than 4
B: a number less than or equal to 2C: a number greater than 3
Are eventsAandBindependent? Are eventsAandBmutually exclusive?Are eventsAand Cindependent? Are eventsAand Cmutually exclusive?
Solution:
P(A) = , P(B) = 1/3, P(C ) = P( A | B) P(A),AandBdependent but not mutually exclusive.Aand Care dependent but mutually exclusive.
1.4 Bayes theorem
1.4.1 Multiplicative Law of Probability and Independence
For two events A and B,
Events A and B are independentif and only if
If eventsA1, .., Akare independentthen,
( ) ( | ). ( )P A B P A B P B
( ) ( ). ( )P A B P A P B
1 2 1 2( ... ) ( ) ( ) ( )k kP A A A P A P A P A
)()()( BPAPBAP
6/1)( BAP
-
5/27/2018 FCM2063 - Printed Notes
15/126
15
Multiplication rule is most useful when the experiment consists of several stages insuccession. The conditioning event, B, describes the outcome of the first stage andAtheoutcome of the second, so that P( A| B)conditioning on what occurs first will often be
known.
Example 1.11:
During a space shot, the primary computer system is backed up by two secondarysystems. They operate independently of one another, and each is 95% reliable.What is the probability that all three systems will be operable at the time of the launch?
SolutionLet,
A1: event main system is operableA2: event first backup is operableA3: event second backup is operable
GivenP(A1) = P(A2) = P(A3) = 0.95Since they operate independentlyP(A1 A2A3) = P(A1)P(A2) P(A3) = 0.857
1.4.2 The Law of Total Probability
Suppose B1, B2,, Bn are mutually exclusiveand exhaustivein S, then for any eventA
1.4.3 Bayes Theorem
Suppose B1, B2,, Bn are mutually exclusive and exhaustive (whose union is S). LetAbe an event such thatP(A)> 0. Then for any event Bj, j =1, 2, , n,
Example 1.12:
A store stocks bulbs for LCD projector from three suppliers. Suppliers A, B, and Csupply 10%, 20%, and 70% of the bulbs respectively. It has been determined thatcompanyAs bulbs are 1% defective while company Bs are 3% defective and company
1
( ) ( | ) ( )( | )
( ) ( | ) ( )
k k k
k n
i i
i
P A B P A B P BP B A
P A P A B P B
1 1
( ) ( ) ( | ) ( )n n
i i i
i i
P A P A B P A B P B
-
5/27/2018 FCM2063 - Printed Notes
16/126
16
Cs are 4% defective. If a bulb is selected at random and found to be defective, what isthe probability that it came from supplierB?
Solution:
LetDis a defective, then the probability that it came from supplierBis
||
| | |
P B P D BP B D
P A P D A P B P D B P C P D C
0.2 0.03
0.1 0.01 0.2 0.03 0.7 0.04
0.1714
-
5/27/2018 FCM2063 - Printed Notes
17/126
17
Exercise Chapter 1:
1. Each message in a digital communication system is classified as to whether it is receivedwithin the time specified by the system design. If 3 messages are classified, what is an
appropriate sample space for this experiment?
2. A digital scale is used that provide weights to the nearest gram. Let event A: a weightexceeds 11 grams, B: a weight is less than or equal to 15 grams, C: a weight is greater than orequal to 8 grams and less than 12 grams. What is the sample space for this experiment? andfind
(a) A UB (b) A (c) A B
(d) (A UC) (e) A B C (f) B C
3. Samples of building materials from three suppliers are classified for conformance to air-quality specifications. The results from 100 samples are summarized as follows:
Conforms
Yes No
Supplier
R 30 10
S 22 8
T 25 5
Let A denote the event that a sample is from supplier R, and B denote the event that a sample
conforms to the specifications. If sample is selected at random, determine the followingprobabilities:
(a) P(A) (b) P(B) (c) P(B)(d) P(AUB) (e) P(AB) (f) P(AUB)
(g) )( BAP (h) )( ABP
4. The compact discs from a certain supplier are analyzed for scratch and shock resistance. Theresults from 100 discs tested are summarized as follows:
ScratchResistance
High Low
ShockResistance
High 30 10
Medium 22 8
Low 25 5
-
5/27/2018 FCM2063 - Printed Notes
18/126
18
Let A denote the event that a disc has high shock resistance, and B denote the event that adisc has high scratch resistance. If sample is selected at random, determine the followingprobabilities:
(a) P(A) (b) P(B) (c) P(B)
(d) P(AUB) (e) P(AB) (f) P(AUB)(g) )( BAP (h) )( ABP
5. The reaction times ( in minutes) of a reactor for two batches are measured in an experiment.
(a) Define the sample space of the experiment.(b) Define event A where the reaction time of the first batch is less than 45 minutes and event
B is the reaction time of the second batch is greater than 75 minutes.(c) FindA UB, A B and A(d) Verify whether events A and B are mutually exclusive.
6. When a die is rolled and a coin is tossed, use a tree diagram to describe the set of possibleoutcomes and find the probability that the die shows an odd number and the coin shows ahead.
7. A bag contains 3 black and 4 while balls. Two balls are drawn at random one at a timewithout replacement.
(i) What is the probability that a second ball drawn is black?(ii) What is the conditional probability that first ball drawn is black if the second ball is
known to be black?
8. An oil-prospecting firm plans to drill two exploratory wells. Past evidence is used to assessthe possible outcomes listed in the following table:
Find and give description for
9. In a residential suburb, 60% of all households subscribe to the metro newspaper published ina nearby city, 80% subscribe to the local paper, and 50% of all households subscribe to bothpapers. Draw a Venn diagram for this problem. If a household is selected at random, what isthe probability that it subscribes to(a) at least one of the two newspapers(b) exactly one of the two newspapers
Event Description Probability
ABC
Neither well produces oil or gasExactly one well produces oil or gas
Both wells produce oil or gas
0.800.180.02
)'()(),( BPandCBPBAP
-
5/27/2018 FCM2063 - Printed Notes
19/126
19
10.In a student organization election, we want to elect one president from five candidates, onevice president from six candidates, and one secretary from three candidates. How manypossible outcomes?
11.Suppose each student is assigned a 5 digit number. How many different numbers can becreated?
12.A chemical engineer wishes to conduct an experiment to determine how these four factorsaffect the quality of the coating. She is interested in comparing three charge levels, fivedensity levels, four temperature levels, and three speed levels. How many experimentalconditions are possible?
13.A menu has five appetizers, three soup, seven main course, six salad dressings and eightdesserts. In how many ways can(a)a full meal be chosen?
(b)a meal be chosen if either and appetizer or a soup is ordered, but not both?
14.Ten teaching assistants are available to grade a test of four questions. Wish to select adifferent assistant to grade each question (only one assistant per question). How manypossible ways can the assistant be chosen for grading?
15.Participant samples 8 products and is asked to pick the best, the second best, and the thirdbest. How many possible ways?
16.Suppose that in the taste test, each participant samples eight products and is asked to selectthe three best products. What is the number of possible outcomes?
17.A contractor has 8 suppliers from which to purchase electrical supplies. He will select 3 ofthese at random and ask each supplier to submit a project bid. In how many ways can theselection of bidders be made?
18.Twenty players compete in a tournament. In how many ways can(a) rankings be assigned to the top five competitors?(b)the best five competitors be randomly chosen?
19.Three balls are selected at random without replacement from the jar below. Find theprobability that one ball is red and two are black.
20.A university warehouse has received shipment of 25 printers, of which 10 are laser printersand 15 are inkjet models. If 6 of these 25 are selected at random by a technician, what is theprobability that exactly 3 of those selected are laser printers?
21.There are 17 broken light bulbs in a box of 100 light bulbs. A random sample of 3 light bulbsis chosen without replacement.(a)How many ways are there to choose the sample?
-
5/27/2018 FCM2063 - Printed Notes
20/126
20
(b)How many samples contain no broken light bulbs?(c)What is the probability that the sample contains no broken light bulbs?(d)How many ways to choose a sample that contains exactly 1 broken light bulb?(e)What is the probability that the sample contains no more than 1 broken light bulb?
22.An agricultural research establishment grows vegetables and grades each one as either goodor bad for taste, good or bad for its size, and good or bad for its appearance. Overall, 78% ofthe vegetables have a good taste. However, only 69% of the vegetables have both a goodtaste and a good size. Also, 5% of the vegetables have a good taste and a good appearance,but a bad size. Finally, 84% of the vegetables have either a good size or a good appearance.(a) if a vegetable has a good taste, what is the probability that it also has a good size?(b) if a vegetable has a bad size and a bad appearance, what is the probability that it has a
good taste?
23.A local library displays three types of books entitled Science (S), Arts (A), andNovels (N). Reading habits of randomly selected reader with respect to these types of
books are
Read regularly S A N SA SN AN SANProbability 0.14 0.23 0.37 0.08 0.09 0.13 0.05
Find the following probabilities and interpret(a)P( S | A )(b)P( S | A U N )(c)P( S | reads at least one )(d)P( S U A | N)
24.A batch of 500 containers for frozen orange juice contains 5 that are defective. Two areselected at random, without replacement, from the batch. Let A and B denote that the firstand second selected is defective respective(a)Are A and B independent events?(b)If the sampling were done with replacement, would A and B be independent?
25.Everyday (Mon to Fri) a batch of components sent by a first supplier arrives at certaininspection facility. Two days a week, a batch also arrives from a second supplier. Eightypercent of all batches from supplier 1 pass inspection, and 90% batches of supplier 2 passinspection. On a randomly selected day, what is the probability that two batches passinspection?
26.The probability is 1% that an electrical connector that is kept dry fails during the warrantyperiod of a portable computer. If the connector is ever wet, the probability of a failure duringthe warranty period is 5%. If 90% of the connectors are kept dry and 10% are wet, whatproportion of connectors fail during the warranty period?
27.Computer keyboard failures are due to faulty electrical connects (12%) or mechanical defects(88%). Mechanical defects are related to loose keys (27%) or improper assembly (73%).
-
5/27/2018 FCM2063 - Printed Notes
21/126
21
Electrical connect defects are caused by defective wires (35%), improper connections (13%)or poorly welded wires (52%). Find the probability that a failure is due to(a) loose keys(b)improperly connected or poorly welded wires.
28.During a space shot, the primary computer system is backed up by two secondary systems.They operate independently of one another, and each is 90% reliable. What is the probabilitythat all three systems will be operable at the time of the launch?
29.A store stocks light bulbs from three suppliers. SuppliersA,B, and Csupply 10%, 20%, and70% of the bulbs respectively. It has been determined that company As bulbs are 1%defective while companyBs are 3% defective and company Cs are 4% defective. If a bulbis selected at random and found to be defective, what is the probability that it came fromsupplierB?
30.A particular city has three airports. Airport A handles 50% of all airline traffic, while airports
B and C handle 30% and 20%, respectively. The rates of losing a baggage in airport A, B andC are 0.3, 0.15 and 0.14 respectively. If a passenger arrives in the city and losses a baggage,what is the probability that the passenger arrives at airport A?
31.A company rated 75% of its employees as satisfactory and 25% unsatisfactory. Of thesatisfactory ones 80% had experience, of the unsatisfactory only 40%. If a person withexperience is hired, what is the probability that (s)he will be satisfactory?
32.In a certain assembly plant, three machines, B1, B2, B3, make 30%, 45% and 25%,respectively, of the products. It is known from past experience that 2%,3% and 2% of theproducts made by each machine, respectively, are defective. Now, suppose that a finished
product is randomly selected.(a)What is the probability that it is defective?(b)If a product was chosen randomly and found to be defective, what is the probability that
it was produced by machine B3?
33.Three machines A, B and C produce identical items of their respective output 5%, 4% and3% of the items are faulty. On a certain day A has produced 25%, B has produced 30% andC has produced 45% of the total output. An item selected at random is found to be faulty.What are the chances that it was produced by C?
34.Suppose that a test for Influenza A, H1N1 disease has a very high success rate: if a tested
patient has the disease, the test accurately reports this, a positive, 99% of the time, and if atested patient does not have the disease, the test accurately reports that, a negative, 95% of
the time. Suppose also, however, that only 0.1% of the population have that disease.(a)What is the probability that the test returns a positive result?(b)If the patient has a positive, what is the probability that he has the disease?(c)What is the probability of a false positive?
-
5/27/2018 FCM2063 - Printed Notes
22/126
22
35.An insurance company charges younger drivers a higher premium than it does older driversbecause younger drivers as a group tend to have more accidents. The company has 3 agegroups: Group A includes those less than 25 years old, have a 22% of all its policyholders.Group B includes those 25-39 years old, have a 43% of all its policyholders, Group Cincludes those 40 years old and older, have 35% of all its policyholders. Company records
show that in any given one-year period, 11% of its Group A policyholders have an accident.The percentages for groups B and C are 3% and 2%, respectively.(a)What is the probability that the companys policyholders are expected to have an accident
during the next 12 months?(b)Suppose Mr. Chong has just had a car accident. If he is one of the companys
policyholders, what is the probability that he is under 25?
-
5/27/2018 FCM2063 - Printed Notes
23/126
23
Chapter 2
2. Discrete random variable and discrete probability distributionsLearning objectives:
At the end of this chapter, student should be able to:
Define the random variables
Differentiate between discrete and continuous random variables
Define the discrete probability distributions
Know the special functions for discrete probability distribution
2.1 Introduction
A random variable is a rule that assigns a number to each outcome of an experiment.These numbers are called the measured valuesof the random variable. The capital letterslikeX, YandZis used to denote a random variable and the small letters like x, y and z todenote the measured values.
Example 2.1:
Select a soccer player; the random variable Yis the number of goals the player has
scored during the season.
The measured values of Yare 0, 1, 2, 3,
The test marks for 100 engineering students; the random variableZis the average numberof goals scored by the students.
The values ofZare 65.4, 67.8, 70.5, 77.3,
There are two types of random variables called a discrete random variable and acontinuous random variable.
2.2 Discrete random variableThe measured values for a discrete random variable are finite or countable. The valuesare in terms of integer value. The number of students in this class is the example of adiscrete random variable.
-
5/27/2018 FCM2063 - Printed Notes
24/126
24
2.3 Continuous random variableThe measured values for continuous random variables are in terms of real number in therange. It can be any values within the range. The weight of students in this class is theexample of a continuous random variable.
Example 2.2:
Identify whether the random variable below is discrete or continuous random variable.
(i) The number of female students in the class.(ii) The number of telephone calls.(iii) The time between two accidents.(iv) The number of cracks in a certain length of road.(v) The height of the athletes participated in the Asian Game.(vi) The volume of water in the tank.
(vii) A score on the statistics final examination.(viii) The number of cars on the road at a certain period of time.
Solution:
(i) discrete(ii) discrete(iii) continuous(iv) discrete(v) continuous(vi) continuous
(vii) continuous(viii) discrete
2.4Discrete probability distribution
If arandom variableis a discrete variable, itsprobability distributionis called adiscrete probability distribution or probability mass function, pmf.
Suppose the experiment is flipping a coin two times. This simple experimentcan have
four possible outcomes or sample space: HH, HT, TH, and TT. Now, let the random
variableX
represent the number of Heads that result from this experiment. The random
variableXcan only take on the values 0, 1,or 2, so it is a discrete random variable.
http://stattrek.com/Help/Glossary.aspx?Target=Random_variablehttp://stattrek.com/Help/Glossary.aspx?Target=Random_variablehttp://stattrek.com/Help/Glossary.aspx?Target=Random_variablehttp://stattrek.com/Help/Glossary.aspx?Target=Probability_distributionhttp://stattrek.com/Help/Glossary.aspx?Target=Probability_distributionhttp://stattrek.com/Help/Glossary.aspx?Target=Statistical_experimenthttp://stattrek.com/Help/Glossary.aspx?Target=Statistical_experimenthttp://stattrek.com/Help/Glossary.aspx?Target=Statistical_experimenthttp://stattrek.com/Help/Glossary.aspx?Target=Probability_distributionhttp://stattrek.com/Help/Glossary.aspx?Target=Random_variable -
5/27/2018 FCM2063 - Printed Notes
25/126
25
The probability distribution for this experiment appears as below.
Number ofheads, X
Probability function, P(X)
0 1/4
1 1/2
2 1/4
The above table represents a discrete probability distribution and the probability function,
P(X=xi)is called probability mass function (pmf) of Xbecause it relates each value of a
discrete random variable with its probability of occurrence.
2.4.1 The properties of the probability mass function (pmf)The pmf, P(X=xi)of a discrete random variable Xmust satisfied two conditions;
(i) 1)(0 ixXP
(ii) 1)( ix
ixXP
Given pmf, the probability of Xoccurs can be calculated. For example the probability atmost one occurs is )1()0()1( XPXPXP
Example 2.3:
Two balls are drawn at random in succession without replacement from an urn containing4 red balls and 6 black balls. Find the probabilities of all the possible outcomes.
Solution:
Let Xdenote the number of red balls in the outcome.
Possibleoutcomes
RR RB BR BB
X 2 1 1 0
Here,x1= 2,x2= 1,x3= 1,x4= 0
-
5/27/2018 FCM2063 - Printed Notes
26/126
26
Now, the probability of getting 2 red balls when we draw out the balls one at a time is:
Probability of first ball being red = 4/10
Probability of second ball being red = 3/9 (because there are 3 red balls left in the urn, out
of a total of 9 balls left.) So:
Likewise, for the probability of red first is 4/10 followed by black is 6/9 (because thereare 6 black balls still in the urn and 9 balls all together). So:
Similarly for black then red:
Finally, for 2 black balls:
So the probability distribution is:
X 2 1 0P(X=x) 2/15 8/15 5/15
Example 2.4:
Given the probability distribution,
X 0 1 2 3 4 5
P(X=x) 1/10 1/5 k 1/5 3/10 1/10
Find the value of kthat makes P(X=x)a valid pmf of X.
Solution:
For P(X=x) is truly pmf of X, its must satisfy 1)( ix
ixXP . Hence,
-
5/27/2018 FCM2063 - Printed Notes
27/126
27
10/1110/110/35/15/110/1)( kkxXPix
i.
2.4.2 The cumulative distribution function (cdf)
The cdf of a discrete random variable Xis defined by,
)()()( xx
ii
xXPxXPxF
Example 2.5:
Given pmf,
X 2 1 0
P(X=x) 2/15 8/15 5/15
Find the cdf of X.
Solution:
For x< 0, 0)()( xXPxF
For 15/5)0()0()(,10 xPXPxFx
For 15/1315/815/5)1()0()1()(,21 xPxPXPxFx
For
115/214/815/5
)2()1()0()2()(,2
xPxPxPXPxFx
So the cdf of Xis:
2,1
21,15/13
10,15/5
0,0
)(x
x
x
x
xF
-
5/27/2018 FCM2063 - Printed Notes
28/126
28
2.4.3 The mean and the variance of X
Given the pmf of X, P(X=x), all the parameters of X such as the mean, the variance andthe standard deviation can be determined by using the expectation definition.
The mean of Xis defined by,
)()(1
n
iii xXPxXE
The variance of Xis defined by,
2
1
2222 )())(()()(
n
iii xXPxXEXEXVar
The standard deviation, is a square root of the variance.
Example 2.6:
Let Xis a random variable with pmf,
X 2 1 0
P(X=x) 2/15 8/15 5/15
Find the mean, the variance and the standard deviation of X.
Solution:
The mean of X is:
8.015/12)15/5(0)15/8(1)15/2(2)()(1
n
iii xXPxXE
The variance of X is:
0.4270.64-16/15
)8.0()15/5(0)15/8(1)15/2(2
)())(()()(
2222
2
1
2222
n
iii xXPxXEXEXVar
-
5/27/2018 FCM2063 - Printed Notes
29/126
29
Standard deviation is = 0.653
2.5Special functions for discrete probability distribution
There are many special discrete probability distributions such as Bernoulli distribution,Binomial distribution and Poisson distribution.
2.5.1 Bernoulli distribution
The experiment conducted with only two possible outcomes. In an experiment of tossinga fair coin for 1 time and Xis the number of head. There are only two possible outcomes,X =0 or X=1with probability distribution:
Possibleoutcomes
Head Tail
X 1 0P(X=x) 1/2 1/2
2.5.2 Binomial distribution
If the Bernoulli experiment conducted for n times, and the random variable X is thenumber of success, then the probability distribution of Xis called Binomial distributionwith pmf,
,.......3,2,1,0,)(
xqpx
n
xXP
xnx
where pis the probability of success and q=1-p.
By using the definition, it can be shown that, if X is a Binomial distribution, then themean of Xis E(X) = npand the variance of X, is Var(X) = npq.
Example 2.7:
In the experiment of tossing a fair coin for 10 times, and Xis the number of head.(i) What is the pmf of X?.(ii) Find the probability the head will appear exactly 5 times.(iii) What is the probability no head?(iv) Find the mean and the variance ofX.
Solution:
(i) The probability mass function of Xis given by:
-
5/27/2018 FCM2063 - Printed Notes
30/126
30
,.......3,2,1,0,)5.0()5.0(10
)( 10
x
xxXP xx
(ii) 246.0)5.0()5.0(!5!5
!10)5.0()5.0(
5
10)5( 5555
XP
(iii) 00097.0)5.0()5.0(!10!0
!10)5.0()5.0(
0
10)0(
100100
XP
(i i i) The mean of Xis np = 10(0.5)=5The variance of Xis npq=10(0.5)(0.5)=2.5
2.5.3 Poisson distribution
Another important discrete distribution is a Poisson distribution. The random variable Xis the number of occurrences in the interval of interest. The example of Poissondistribution is the number of accidents within a certain period of times. If Xis a randomvariable with a Poisson distribution then the pmf of Xis given by,
,.......3,2,1,0,!
)(
xx
exXP
x
where is the mean of Xfor the interval of interest.
By using the definition, it can be shown that, if X is a Binomial distribution, then the
mean of Xis E(X) = and the variance of X, is Var(X) = .Example 2.8:
Anne's answering machine receives about 6 telephone calls between 8 a.m. and 10 a.m.What is the probability that Anne receives more than 1 call in the next 15 minutes?
Solution:
Let X= the number of calls Anne receives in 15 minutes. (The interval of interest is 15minutes. The random variable X takes on the values 0, 1, 2,.. If Anne receives, on theaverage, 6 telephone calls in 2 hours, then Anne will receives 1/8 = 0.75 calls in 15
minutes, on the average. So, it means that is 0.75.
Hence, the probability that Anne receives more than 1 call in the next 15 minutes is,
-
5/27/2018 FCM2063 - Printed Notes
31/126
31
174.0)354.0472.0(1!1
)75.0(
!0
)75.0(1
)]1()0([1)1(1)1(
175.0075.0
ee
XPxPXPXP
Exercise 2
1. Identify each of the random variables as continuous or discrete random variable.
(a) The number of atoms(b) The number of fish in a pond(c) The home team score in a football game(d) The voltage on a power line(e) A score on the mathematic final exam
(f) The volume of gas in the tank(g) The number of cars at the petrol station(h) The number of accidents in Ipoh(i) The number of cakes left in the pantry(j) The height of civil engineering students in UTP
2. Let,
x 0 1 2 3
0.15 0.25 k 0.35
(i) Find the value of k that result in a valid probability distribution.(ii)Find the expected number ofXand the standard deviation ofX.(iii)What is the probability thatXgreater than or equal to 1?
3. At UTP, the business students run an investment club. Each semester they create investmentportfolios in multiples of RM1,000 each. Records from the past several years show thefollowing probabilities of profits (rounded to the nearest RM50). In the table below, x =profit per RM1, 000 andP(x) is the probability of earning that profit.
x 0 50 100 150 200
0.15 0.35 k 0.2 0.05
(a)Determine the value of kthat results in a valid probability distribution.(b)The profit per RM1, 000 is a random variable. Is it discrete or continuous? Explain.(c)Find the expected value of the profit in a $1,000 portfolio.(d)Find the standard deviation of the profit.(e)What is the probability of a profit of $150 or more in a RM1, 000 portfolios?
-
5/27/2018 FCM2063 - Printed Notes
32/126
-
5/27/2018 FCM2063 - Printed Notes
33/126
33
(b)What is the probability that none of the student pilots successfully lands the plane usingthe simulator?
(c)What is the probability that exactly eight of the student pilots successfully land the planeusing the simulator?
8. SupposeXhas a Poisson distribution a mean of 7Determine the following.(a)P(X = 0);(b)P(X = 5);(c)P(X < 3); and(d) )4( XP .
9. At the Mc Donald drive-thru window of food establishment, it was found that during slowerperiods of the day, vehicles visited at the rate of 15 per hour. Determine the probability that(a)no vehicles visiting the drive-thru within a ten-minute interval during one of these slow
periods;(b)only 3 vehicles visiting the drive-thru within a ten-minute interval during one of these
slow periods; and(c)at least three vehicles visiting the drive-thru within a ten-minute interval during one of
these slow periods.
10.The number of cracks in a section of PLUS highway that are significant enough to requirerepair is assumed to follow a Poisson distribution with a mean of two cracks per kilometer.Determine the probability that(a) there are no cracks at all in 2km of highway;(b) at least one crack in 500meter of highway; and(c) there are exactly 3 cracks in 0.5km of highway.
-
5/27/2018 FCM2063 - Printed Notes
34/126
34
Chapter 3
3. Continuous probability distributionsLearning objectives:
At the end of this chapter, student should be able to:
Define the continuous probability distributions
Know the special functions for continuous probability distribution
3.1 Introduction
If the outcomes of the experiment conducted are continuous random variables, itsprobability distribution is called a continuous probability distribution or probability
density function, pdf.
3.1.1 The properties of the probability density function (pdf)
The pdf, f(x)of a continuous random variable Xmust satisfied two conditions;
(j) 1)(0 xf
(ii)1)(
dxxf
Given pdf, the probability of Xoccurs can be calculated. For example the probability atmost one occurs is
1)()1( dxxfXP
Example 3.1:
Let Xbe continuous random variable with pdf given by,
http://stattrek.com/Help/Glossary.aspx?Target=Random_variablehttp://stattrek.com/Help/Glossary.aspx?Target=Probability_distributionhttp://stattrek.com/Help/Glossary.aspx?Target=Probability_distributionhttp://stattrek.com/Help/Glossary.aspx?Target=Probability_distributionhttp://stattrek.com/Help/Glossary.aspx?Target=Random_variable -
5/27/2018 FCM2063 - Printed Notes
35/126
35
elsewhere,0
20,)(
2 xkxxf
Find the value of kthat makes f(x)a valid pdf of X.
Solution:
To be a valid pdf, f(x)must satistify,
1)(
dxxf
So,
8
31
3
8
0
2
3)(
32
0
2
kkx
kdxkxdxxf
3.1.2 The cumulative distribution function (cdf)
The cdf of a continuous random variable Xis defined by,
xdxxfxXPxF x
,)()()(
Example 3.2:
Let Xbe continuous random variable with pdf given by,
elsewhere,0
10,3)(
2 xxxf
Find,
(i ) P(X < 0.5), P( 0.5 < X < 0.75)(ii) The cdf of X.Solution:
(i)
8
1)5.0(
0
5.03)()5.0( 33
5.0
0
25.0 xdxxdxxfXP
-
5/27/2018 FCM2063 - Printed Notes
36/126
36
64
19)5.0()75.0(
5.0
75.03)()75.05.0(
33
375.0
5.0
275.0
5.0
xdxxdxxfXP
-
5/27/2018 FCM2063 - Printed Notes
37/126
37
(ii) The cdf of X,
x
dxxfxXPxF 0)()()(0,For x
x x
xdxxdxdxxfxXPxFx0
0
3230)()()(,10For
xx
dxdxxdxdxxfxXPxFx1
0 1
0
2 1030)()()(,1For
So the cdf of Xis:
1,1
10,
0,0
)( 3
x
xx
x
xF
3.1.3 The mean and the variance of X
Given the pdf of X, f(x), all the parameters of Xsuch as the mean, the varianceand the standard deviation can be determined by using the expectation definition.
The mean of Xis defined by,
dxxxfXE )()(
The variance of Xis defined by,
2222 )())(()()(
dxxxfXEXEXVar
The standard deviation, is a square root of the variance.
Example 3.3:
Let Xbe continuous random variable with pdf given by,
-
5/27/2018 FCM2063 - Printed Notes
38/126
38
elsewhere,0
20,8
3
)(
2 xxxf
Find the mean and the variance of X.
Solution:
The mean of Xis
5.12
3
0
2
32
3
8
3)()(
43
2
0
xdxxdxxxfXE
The variance of Xis,
15.020
3
4
9
5
12
4
9
0
2
40
3
4
9
8
3
)5.1(8
3)()(
52
0
4
22
0
22222
xdxx
dxxxdxxfxXVar
3.2Special functions for continuous probability distribution
There are many special continuous probability distributions such as Uniformdistribution, Exponential distribution, Gamma distribution and Normaldistribution.
3.2.1 Uniform distribution
The random variable Xis a uniform distribution, and then the pdf of X is givenby,
elsewhere,0
,1
)(bxa
abxf
By using the definition, it can be shown that, if Xis a uniform distribution, then
the mean of Xis E(X) = and the variance of X, is Var(X) = .
-
5/27/2018 FCM2063 - Printed Notes
39/126
39
3.2.2 Exponential distribution
The random variable X is an exponential distribution, and then the pdf of X isgiven by,
elsewhere,0
0,)(
xexf
x
By using the definition, it can be shown that, if Xis a uniform distribution, then
the mean of Xis E(X) = and the variance of X, is Var(X) =1/.Example 3.4:
Let Xis the number of individuals failing in a large group and has a exponentialdistribution. If we assume that the mean of X under a certain situation is 10, whatis the probability that more than 20 will fail at the same time?
Solution:
The random variable Xhas pdf,
elsewhere,0
0,)(
xexf
x
where is the 1/mean of X. So, = 1/10=0.1
Hence,
135.020
)1.0()20(21.0
20
1.0
eedxeXP xx
3.2.3 Normal distribution
The random variable Xis a Normal distribution, and then the pdf of Xis given by,
xexfx
,2
1)(
2
2
)(
By using the definition, it can be shown that, if X is a random variable with
normal distribution, then the mean of X is E(X) = and the variance of X, isVar(X) = . If Xis a random variable with normal distribution, then Xis alwaysbe written as,
-
5/27/2018 FCM2063 - Printed Notes
40/126
-
5/27/2018 FCM2063 - Printed Notes
41/126
41
Exercise 3
1. Suppose that X is a continuous random variable having the probability density
function
(a)Find the value of constant k(b)Find P(-0.5 1)(c)P(0 < X < 2)
(d)the mean ofX(e) the variance ofX.
4. LetXbe a continuous random variable with pdf given by
elsewhere
forkf
,0
1x1x)x(
2
-
5/27/2018 FCM2063 - Printed Notes
42/126
42
elsewhere,0
30,)(
2 xkxxf
Find(a) the value of constant k(b)the cdf, F(x)(c)P(X >1)(d)the mean ofX(e) the variance ofX.
5. Find the cumulative probability distribution ofX given that the density function is
Find(a) the value of constant k(b)the cdf, F(x)(c)P(0.25 < X < 0.5)(d)the mean ofX(e) the variance ofX.
6. Suppose a random variable,X has a uniform distribution with a = 5 and b = 9. Find(a)P(5.5 60)(c)P(50
-
5/27/2018 FCM2063 - Printed Notes
43/126
43
(b)will be between 50 to 100 seconds?; and(c)at most 150 seconds?
10.The mean weight of 500 UTP students is 68kg and the variance is 72.25kg. Find theprobability of students who weight
(a)between 65kg and 72kg(b)more than 70kg
11.An average LCD Projector bulb manufactured by the ABC Corporation lasts 300 dayswith variance of 2500days. By assuming that the bulb life is normally distributed,what is the probability that the bulb will last(a)at most 365 days?(b)between 250days and 350days?(c)at least 400days?
12.The line width of a tool used for semiconductor manufacturing is assumed to be
normally distributed with a mean of 0.5 micrometer and a standard deviation of 0.05micrometer.(a)What is the probability that a line width is greater than 0.62 micrometer?(b)What is the probability that a line width is between 0.47 and 0.63 micrometer?(c)The line width of 90% of samples is below what value?
oooOOOooo
-
5/27/2018 FCM2063 - Printed Notes
44/126
44
Chapter 4
4. Data display and summary of dataLearning objectives:
At the end of this chapter, student should be able to:
Explain the different between population and sample
Find the sample mean, sample variance and sample standard deviation
Plot data using stem and leaf display
Construct the Box-Plot
4.1 Introduction
The major use of inferential statistics is to use information from a sampleto infer
something about a population. A population is a collection of data whoseproperties are analyzed. The population is the complete collection to be studied; itcontains all subjects of interest. A sample is a part of the population of interest, asub-collection selected from a population. A parameter is a numericalmeasurement that describes a characteristic of a population, while a statistic is anumerical measurement that describes a characteristic of a sample. In general, wewill use a statistic to infer something about a parameter.
4.2Mean and variance
The mean is the sum of all numbers in the list divided by the total numbers in the
list. If the given list is Statistical Population then the mean is called Population
Mean and the given list is a Statistical Sample, then the mean is called Sample
mean. The mean has an expected value of , known as the population mean. The
sample mean makes a good estimator of the population mean, as its expected value
which is as the same as the population mean.
-
5/27/2018 FCM2063 - Printed Notes
45/126
45
Often, since the population variance is an unknown parameter, it is estimated by
the mean sum of squares, which changes the distribution of the sample mean from
a normal distribution to a Student's t distribution with n1 degrees of freedom.
The mean and the variance of population and sample mean and sample variance
can be expressed as follows. By using the following equations we can identify thedifference.
Population Mean and Variance are defined as:
2
1
21 )(1
VarianceMean
N
ii
N
ii
xN
N
x
where Nis the size of the Population.
Sample Mean and sample variance are defined as:
2
1
21 )(1
1VarianceMean xx
ns
n
x
xN
ii
n
ii
where nis the sample size
Example 4.1:
Given the sample data as 55, 68, 90, 42, 89, 70. Find the sample mean and thesample variance of this data.
Solution:
-
5/27/2018 FCM2063 - Printed Notes
46/126
46
6.353]6
)414(30334[
5
1
)(
1
1)(
1
1
is,varianceThe
696
414
6
708942906855
is,MeanThe
2
1
2
1
22
1
2
1
n
x
xn
xxn
s
n
x
x
n
iin
ii
n
ii
n
ii
4.3 A Stem and Leaf plot
Data can be shown in a variety of ways including graphs, charts and tables. AStem and Leaf plot is a type of graph that is similar to a histogram but showsmore information. The Stem-and-Leaf plot summarizes the shape of a set of data(the distribution) and provides extra detail regarding individual values.
The data is arranged byplace value.The digits in the largest place are referred toas the stem and the digits in the smallest place are referred to as the leaf (leaves).The leaves are always displayed to the left of the stem. Stem and Leaf plots aregreat organizers for large amounts of information. It provides an at a glance toolfor specific information in large sets of data, otherwise one would have a long ofmarks to sift through and analyze. The totals of data, median and mode are alsocan be determined by Stem and Leaf plots. They are usually used when there arelarge amounts of numbers or data to analyze. Series of scores on sports teams,series of temperatures or rainfall over a period of time, series of classroom testscores are examples of when Stem and Leaf plots could be used.
Example 4.2:
The following data is the temperatures for August in Malaysia.
77 80 82 68 65 59 6155 50 62 61 70 69 6465 70 62 65 65 75 76
http://math.about.com/cs/baseten/index.htmhttp://math.about.com/cs/baseten/index.htmhttp://math.about.com/cs/baseten/index.htm -
5/27/2018 FCM2063 - Printed Notes
47/126
47
85 80 82 83 79 79 7180 77 89
Use the Stem and Leaf plot to determine the mode and the median for thetemperatures.
Solution:
First step should be to place the numbers in order from smallest to the largest.
The mode is 65 and the median is 70.
4.4 A Box plot
Indescriptive statistics,a box plot or (also known as a box-and-whisker diagram)is an excellent visual summary of many important aspects of a data distributionthrough their five-number summaries: the smallest observation (sampleminimum), lower quartile (Q1), median (Q2), upper quartile (Q3), and largest
observation (sample maximum). A box plot may also indicate which observations,if any, might be consideredoutliers.Box plot can be drawn either horizontally orvertically.
4.4.1 Construct a box plot
Step 1: Place the numbers in order from smallest to the largest.
Step 2: Find the median,Q2, thelower quartile, Q2and the upper quartile, Q3of agiven set of data.
Step 3: Find the interquartile range(IQR). The IQR is the difference between theupper quartile and the lower quartile.
Step 4: Start to draw the Box-plot either horizontally or vertically.
Temperatures
Tens Ones
5 0 5 9
6 1 1 2 2 4 5 5 5 5 8 9
7 0 0 1 5 6 7 7 9 9
8 0 0 0 2 2 3 5 9
http://en.wikipedia.org/wiki/Descriptive_statisticshttp://en.wikipedia.org/wiki/Five-number_summaryhttp://en.wikipedia.org/wiki/Sample_minimumhttp://en.wikipedia.org/wiki/Sample_minimumhttp://en.wikipedia.org/wiki/Quartilehttp://en.wikipedia.org/wiki/Medianhttp://en.wikipedia.org/wiki/Quartilehttp://en.wikipedia.org/wiki/Sample_maximumhttp://en.wikipedia.org/wiki/Outlierhttp://en.wikipedia.org/wiki/Outlierhttp://en.wikipedia.org/wiki/Sample_maximumhttp://en.wikipedia.org/wiki/Quartilehttp://en.wikipedia.org/wiki/Medianhttp://en.wikipedia.org/wiki/Quartilehttp://en.wikipedia.org/wiki/Sample_minimumhttp://en.wikipedia.org/wiki/Sample_minimumhttp://en.wikipedia.org/wiki/Five-number_summaryhttp://en.wikipedia.org/wiki/Descriptive_statistics -
5/27/2018 FCM2063 - Printed Notes
48/126
48
Step 5: Calculate the 1.5IQR and determine the range of 1.5IQR from upperquartile and the lower quartile. The value(s) that place outside of the1.5IQR range called the outlier(s). The value(s) that place outside of the3IQR range called the extreme outlier(s).
Example 4.3
Suppose that thirty UTP students live in Village 2. These are the following ages:
18, 20, 21, 26, 24, 19, 25, 20, 22, 21,
19, 24, 25, 28, 24, 20, 26, 20, 35, 17,
18, 24, 20, 21, 22, 27, 25, 28, 27, 24.
Step 1: Place the numbers in order from smallest to the largest.
17, 18, 18, 19, 19, 20, 20, 20, 20, 20,
21, 21, 21, 22, 22, 24, 24, 24, 24, 24,
25, 25, 25, 25, 26, 26, 27, 27, 28, 35.
Step 2: Find the median,Q2, the lower quartile, Q2and the upper quartile, Q3of agiven set of data.
The median, Q2= (X15+ X16)/2 = (22+24)/2=23
The position of Q1= (0.25) (n+1) = 0.25(31) = 7.75
So the lower quartile, Q1is X7+ 075(X8-X7) =20 + 0.75(20-20)=20
The position of Q3= (0.75) (n+1) = 0.75(31) = 23.25
So the upper quartile, Q3is X23+ 0.25(X24-X23) =25+0.25(25-25) = 25
Step 3: The interquartile range(IQR) = Q3- Q1= 2520 = 5
-
5/27/2018 FCM2063 - Printed Notes
49/126
49
The 1.5IQR = 7.5 and 3IQR = 15
Step 4: Start to draw the Box-plot either horizontally or vertically.
outlier
17 28 o35
Q1=20 Q2=23 Q3=25
12.5.< ------------..IQR=5 ---------------->.32.5
Exercise 4:
1. Find the mean, median and mode for the following observations:
6.5 7.8 4.6 3.7 6.5 9.2 12.1 6.5 3.7 10.8
2. Find the mean, median and mode for the following observations:
2.3 3.6 2.6 2.8 3.2 3.6 4.3 5.2 6.9 2.8 3.6
3. Seven oxide thickness measurements of wafers are studied to assess quality in asemiconductor manufacturing process. The data (in angstroms) are: 1264, 1280, 1301,
1300, 1292, 1307, and 1275. Calculate the sample average, variance and standarddeviation.
4. The following data are direct solar intensity measurements (watts/m2) on different
days at a location in southern Spain: 562, 869, 708, 775, 775, 704, 809, 856, 655, 806,878, 909, 918, 558, 768, 870, 918, 940, 946, 661, 820, 898, 935, 952, 957, 693, 835,905, 939, 955, 960, 498, 653, 730, 753. Calculate the sample mean, variance andsample standard deviation.
-
5/27/2018 FCM2063 - Printed Notes
50/126
50
5. Find the mean, variance and standard deviation of the following samples of marks forthe probability and statistics final examination.
84.981.9 80.8 79.4 78.2 76.5
75.073.8 72.7 72.6 71.4 70.969.368.6 67.5 66.8 65.2 64.4
59.558.3 58.5 57.6 56.9 55.248.248.0 47.8 46.5 45.9 44.638.337.4 36.8 36.5 35.6 34.938.4
6. Find the mean, variance and standard deviation of the following samples of marks forthe engineering drawing course.
98.4 98.1 98.0 97.8 96.4 95.2 94.3 92.6 91.8 90.589.6 88.7 87.3 86.8 85.7 84.2. 83.7 82.8 80.5 80.8
79.7 78.2 77.4 77.4 76.8 75.9 74.2 73.9 72.6 71.469.8 68.6 67.5 66.8 65.2 64.4 63.7 62.8 61.4 60.759.2 58.3 58.5 57.6 56.9 55.2 54.7 53.9 52.9 51.259.6 48.0 47.8 46.5 45.9 44.6 43.8 42.7 41.8 40.639.8 37.4 36.8 36.5 35.6 34.9 33.2 33.8 32.7 31.6
7. The shear strengths of 100 spot welds in a titanium alloy follow. Construct a stem-and-leaf diagram for the weld strength data and comment on any important featuresthat you notice.
5408 5431 5475 5442 5376 5388 5459 5422 5416 5435
5420 5429 5401 5446 5487 5416 5382 5357 5388 5457
5407 5469 5416 5377 5454 5375 5409 5459 5445 5429
5463 5408 5481 5453 5422 5354 5421 5406 5444 5466
5399 5391 5477 5447 5329 5473 5423 5441 5412 5384
5445 5436 5454 5453 5428 5418 5465 5427 5421 5396
5381 5425 5388 5388 5378 5481 5387 5440 5482 5406
5401 5411 5399 5431 5440 5413 5406 5342 5452 5420
5458 5485 5431 5416 5431 5390 5399 5435 5387 5462
5383 5401 5407 5385 5440 5422 5448 5366 5430 5418
(a)Construct a stem-and-leaf display for these data.(b)Find the median, the quartiles, and the 5th and 95th percentiles.
8. The data that follow represent the yield on 90 consecutive batches of ceramicsubstrate to which a metal coating has been applied by a vapor-deposition process.
-
5/27/2018 FCM2063 - Printed Notes
51/126
51
94.1 87.3 94.1 92.4 84.6 85.4
93.2 84.1 92.1 90.6 83.6 86.6
90.6 90.1 96.4 89.1 85.4 91.7
91.4 95.2 88.2 88.8 89.7 87.5
88.2 86.1 86.4 86.4 87.6 84.2
86.1 94.3 85.0 85.1 85.1 85.1
95.1 93.2 84.9 84.0 89.6 90.5
90.0 86.7 78.3 93.7 90.0 95.6
92.4 83.0 89.6 87.7 90.1 88.3
87.3 95.3 90.3 90.6 94.3 84.1
86.6 94.1 93.1 89.4 97.3 83.7
91.2 97.8 94.6 88.6 96.8 82.9
86.1 93.1 96.3 84.1 94.4 87.3
90.4 86.4 94.7 82.6 96.1 86.4
89.1 87.6 91.1 83.1 98.0 84.5
(a)Construct a cumulative frequency plot and histogram for the yield(b)Construct a stem-and-leaf display for these data.(c)Find the median, the quartiles, and the 5th and 95th percentiles for the yield
9. The average age of the football players on each team of the premier league as follows.
29.4 29.8 29.4 31.8 32.7 34.0
28.5 27.9 30.9 29.3 28.8 28.6
29.1 31.0 30.7 30.3 29.7 31.0
28.4 28.9 27.7 28.7 30.5 29.8
26.6 27.9 27.9 29.9 29.3 28.1
(a)Construct a cumulative frequency plot and histogram for the yield(b) Construct a stem-and-leaf display for these data.(c) Find the median, the quartiles, and the 5th and 95th percentiles for the yield
10.The following cold start ignition time of an automobile engine obtained for a testvehicle are as follows:
1.75 1.92 2.62 2.35 3.09 3.15 2.53 1.91
(a)Calculate the sample median, the quartiles and the IQR(b)Construct a box plot of the data.
-
5/27/2018 FCM2063 - Printed Notes
52/126
52
11.The following data are the joint temperatures of the O-rings (F) for each test firing oractual launch of the space shuttle rocket motor (fromPresidential Commission on theSpace Shuttle Challenger Accident,Vol. 1, pp. 129131): 84, 49, 61, 40, 83, 67, 45,66, 70, 69, 80, 58, 68, 60, 67, 72, 73, 70, 57, 63, 70, 78, 52, 67, 53, 67, 75, 61, 70, 81,76, 79, 75, 76, 58, 31.
(a)Compute the sample mean and sample standard deviation;(b)Calculate the median, the quartiles and the IQR;(c)Construct a box plot of the data and comment on the possible presence of outliers.
12.Ipoh Pantai Hospital compiles data on the length of stay by patients in short-termhospitals. A random sample of 28 patients yielded the following data on length ofstay, in days.
3 6 15 7 3 55 14 4 12 18 9 6 12
5
10 13 7 1 23 96 8 11 9 4 21 10
(a)Compute the sample mean and sample standard deviation;(b)Calculate the median, the quartiles and the IQR;(c)Construct a box plot of the data and comment on the possible presence of outliers.
oooOOOooo
Chapter 5
5. Random sample, central limit theorem and Normal Approximation;Statistical process control
Learning objectives:
At the end of this chapter, student should be able to:
Define the random sample and sample mean
Use the Central Limit Theorem to define the sample mean distribution
Define the Normal Approximation to Binomial and Poisson distribution
Construct the X-bar chart and R chart in statistical process control
5.1Random sample and sample mean
-
5/27/2018 FCM2063 - Printed Notes
53/126
53
In statistical terms, a random sample is a set of independent random variables X1,X2, , Xn that have been drawn from a population in such a way that eachrandom variable was selected has the same distribution and has the same chanceof being selected.
Sample mean is the average of the sample. If we have n observation in onesample, the sample mean is the total of the observation divide by the number ofsample size, n.
5.2Central Limit Theorem and sample mean distribution
Central limit theorem says that if the sample size is large, and a random sample isa set of independent random variables X1, X2, , Xn has a normal distribution
with mean, and variance, then the sample mean, X is also normallydistributed with mean, and variance, n. That is
)/,(~ 2 nNX
Example 5.1:
At chemical engineering department, Universiti Teknologi PETRONAS, the meanage of the students is 20.6 years old, and the variance is 20 years. A randomsample of 80 students is drawn from 250 students. What is the probability that theaverage age of these students is greater than 22 years old?
Solution:
0808.09192.01)4.1(1
)4.1(1)4.1()
25.0
6.2022()22(So,
)25.0,6.20(~Hence,
25.080
20)(and6.20)(ofmeanthe,80For
20)(ofvariancetheand6.20)(ofmeanThe
2
ZPZPZPXP
NX
nXVXEXn
XVXXEX
5.3Normal Approximation
The binomial and Poisson distributions are discrete random variables, whereas thenormal distribution is continuous. We need to take this into account when we are
-
5/27/2018 FCM2063 - Printed Notes
54/126
54
using the normal distribution to approximate a binomial or Poisson using acontinuity correction.
The continuity correction, 5.0 for probability of X is depend on the inequalitysign, ,,, . For example P(X < a) = P(X - 0.5 < a - 0.5) and for
)5.05.0()( aXPaXP
5.3.1 Normal approximation to Binomial
The Central Limit Theorem says that as n increases, the binomial distributionwith n trials and probability p of success gets closer and closer to a normaldistribution. That is, the binomial probability of any event gets closer and closerto the normal probability of the same event.
The normal distribution is a good approximation to Binomial when n issufficiency large and pis not too close to0or 1. How largenneeds to be depends
on the value of p. It is better to be conservative and limit the use of the normaldistribution as an approximation to the binomial when np > 5and n(1 - p) > 5.
That is, if we have a random variable X ~ Bin(n , p) and nis large and pis smallsuch that np > 5, than X can be calculated approximately using the Normaldistribution. It means that the random variable Xwill be normally distributed with
mean = np and variance, )1(2 pnp i.e X ~ N( ).
Example 5.2:
Suppose in experiment of tossing a fair coin for 50 times. What is the probabilityof getting between 9 and 11 heads?
Solution:
Let X be the random variable representing the number of heads thrown.
X ~ Bin (50, 0.5)
Since nis large and np > 5, then we can use normal approximation to find theprobability. It mean that now, X is normally distributed with mean np =25andvariance 12.5. i.e X ~ N (25, 12.5). Hence,
-
5/27/2018 FCM2063 - Printed Notes
55/126
55
7665.00934.08599.0)32.1()08.1(
)8.0.132.1(5.12
255.11
5.12
255.8
5.12
)25)5.011(
5.12
)25)5.09()119(
ZPZP
ZPXP
5.3.2 Normal approximation to Poisson
Thenormal distribution can also be used to approximate thePoisson distribution
for large values of (the mean of the Poisson distribution).That is, if we have a random variable X ~ Poisson () and is large than Xcan becalculated approximately using the Normal distribution. It means that the random
variable Xwill be normally distributed with mean = and variance, 2 i.eX ~ N ( )Example 5.3:
A car hire firm has 20 cars to hire. The number of demands for a car is hired per
day is a Poisson distribution with mean of 3. Calculate the probability that at most
ten cars will be hired in one day.
Solution:
Let a random variable Xdenotes the number of demands for a car.
The given mean value is 3. By the Poisson distribution
20.....,....,4,3,2,1,0,!
)(
xx
exXP
x
Since is large, then the probability can be calculated using a normalapproximation with mean and variance is also i.eX ~ N( Hence,
994.0)5.2()5.2(
33)5.010(
33)5.0()10(
ZP
XPXP
5.4Statistical process control
http://www.mathsrevision.net/alevel/pages.php?page=74http://www.mathsrevision.net/alevel/pages.php?page=75http://www.mathsrevision.net/alevel/pages.php?page=75http://www.mathsrevision.net/alevel/pages.php?page=74 -
5/27/2018 FCM2063 - Printed Notes
56/126
56
Statistical process control (SPC), is a powerful tools that implement the conceptof prevention as a shift from the traditional quality by inspection/correction. SPCis a technique that employs statistical tools for controlling and improvingprocesses. It is an important ingredient in continuous process improvement (CPI)strategies. It uses simple statistical means to control, monitor, and improve
processes.
Among the most commonly used tools of SPC:
histograms cause-and-effect diagrams Pareto diagrams control charts scatter or correlation diagrams run charts process flow diagrams
The most important SPC tool is called control charts. That is a graphicalrepresentations of process performance over time concerned with how (orwhether) processes vary at different intervals and identifying nonrandom orassignable causes of variation. The control charts are also providing a powerfulanalytical tool for monitoring process variability and other changes in process
mean. There are two common charts use in the SPC. The X - chart and R-chart.
The X and Range, R Charts are a set of control charts for variables data (data thatis both quantitative and continuous in measurement, such as a measured
dimension or time). The X - chart monitors the process location over time, based
on the average of a series of observations, called a subgroup. While the R-chartmonitors the variation between observations in the subgroup over time.
The X - chart or R- chart are used when you can rationally collect measurementsin groups (subgroups) of between two and ten observations. The charts' x-axes aretime based, so that the charts show a history of the process. The data is time-ordered; that is, entered in the sequence from which it was generated.
5.4.1 How to construct X - chart and R-chart
In order to construct the chart, the sample mean, the average of the sub-group andthe limits must be calculated.
The sample mean is calculated from a set of ndata values as
n
iix
nx
1
1.
The average of the subgroups data is calculated as
m
j
n
iijx
mnx
1 1
1
-
5/27/2018 FCM2063 - Printed Notes
57/126
57
where n is the subgroup size and mis the total number of subgroups included inthe analysis.
Thisx is a centre line of the chart and is called the estimate process mean.
The average range is calculated as
m
iir
mr
1
1 , whereris range between the largest
and the smallest value in each subgroup.
The upper and lower limits for the X - chart are calculated by using the formula
rAx 2 where A2can be find from the process control chart table.
While the upper for the R-chart is calculated by using the formula rD4 and for
lower limit using the formula rD3 where D4and D3can be find from the processcontrol chart table.
After the centre line and limits are calculated, and then the chart can be
constructed by plotting the observations of sample number versus x for X -chart
and the sample number versus rfor R-chart.
Example 5.4:
A component part for a jet aircraft engine is manufactured by an investmentcasting process. The vane opening on this casting is an important functionalparameter of the part.
We will illustrate the use of X and R control charts to assess the statisticalstability of this process. The table presents 20 samples of five parts each. Thevalues given in the table have been coded by using the last three digits of thedimension; that is, 31.6 should be 0.50316 inch.
Sample Number x1 x2 x3 x4 x5 X r1 33 29 31 32 33 31.6 4
2 33 31 35 37 31 33.4 6
3 35 37 33 34 36 35.0 44 30 31 33 34 33 32.2 4
5 33 34 35 33 34 33.8 2
6 38 37 39 40 38 38.4 3
7 30 31 32 34 31 31.6 4
8 29 39 38 39 39 36.8 10
9 28 33 35 36 43 35.0 15
10 38 33 32 35 32 34.0 6
-
5/27/2018 FCM2063 - Printed Notes
58/126
58
11 28 30 28 32 31 29.8 4
12 31 35 35 35 34 34.0 4
13 27 32 34 35 37 33.0 10
14 33 33 35 37 36 34.8 4
15 35 37 32 35 39 35.6 7
16 33 33 27 31 30 30.8 617 35 34 34 30 32 33.0 5
18 32 33 30 30 33 31.6 3
19 25 27 34 27 28 28.2 9
20 35 35 36 33 30 33.8 6
(a) Construct X andR control charts.(b) After the process is in control, estimate the process mean and standard
deviation.
Exercise 5
1. Suppose X1, X2, , X20 is a sample from normal distribution N (2) with = 5, 2 = 4. Find(a)Expectation and Variance of(b)Distribution of
2. Given that Xis normally distributed with mean 50 and standard deviation 4, computethe following for n=25.
(a)Mean and variance of X
(b) )49( XP
(c) )52( XP
(d) )5.5149( XP
3. Given that Xis normally distributed with mean 20 and standard deviation 2, computethe following for n=40.
X
X
-
5/27/2018 FCM2063 - Printed Notes
59/126
59
(a)Mean and variance of X
(b) )19( XP
(c) )22( XP
(d) )5.2119( XP
4. LetXdenote the number of flaws in a 1 in length of copper wire. The pmf ofXisgiven in the following table
X=x 0 1 2 3
P(X=x) 0.48 0.39 0.12 0.01
100 wires are sampled from this population. What is the probability that the averagenumber of flaws per wire in this sample is less than 0.5?
5. At a large university, the mean age of the students is 22.3 years, and the standard
deviation is 4 years. A random sample of 64 students is drawn. What is theprobability that the average age of these students is greater than 23 years?
6. Assuming an equal chance of a new baby being a boy or a girl, what is the probabilitythat 60 or more out of the next 100 births at Pantai Hospital will be girls?
7. If 10% of UTP students are international students, what is the probability that fewerthan 100 in a random sample of 818 students are coming from overseas?
8. Suppose that a sample of n = 1,600 tires of the same type are obtained at random froman ongoing production process in which 8% of all such tires produced are defective.
What is the probability that in such a sample 150 or fewer tires will be defective?
9. For overseas flights, an airline has three different choices on its dessert menuicecream, apple pie, and chocolate cake. Based on past experience the airline feels thateach dessert is equally likely to be chosen.(a) If a random sample of four passengers is selected, what is the probability that at
least two will choose ice cream for dessert?(b)If a random sample of 21 passengers is selected, what is the approximate
probability that at least two will choose ice cream for dessert?
10.Suppose that at a certain automobile plant, the number of work stoppage is a Poisson
distribution with an average per day due to equipment problems during the productionprocess is 12.0.What is the approximate probability of having 15 or fewer workstoppages due to equipment problems on any given day?
11.The number of cars arriving per minute at a toll booth on a particular bridge isPoisson distributed with a mean of 2.5.What is the probability that in any givenminute
-
5/27/2018 FCM2063 - Printed Notes
60/126
60
(a)no cars arrive?(b)not more than two cars arrive?
If the expected number of cars arriving at the toll booth per ten-minute interval is25.0, what is the approximateprobability that in any given ten-minute period
(c)not more than 20 cars arrive?(d)between 20 and 30 cars arrive?
12.A component part for a jet aircraft engine is manufactured by an investment castingprocess. The vane opening on this casting is an important functional parameter of the
part. We will illustrate the use of X and Rcontrol charts to assess the statisticalstability of this process. The table presents 20 samples of five parts each. The valuesgiven in the table have been coded by using the last three digits of the dimension; that
is, 31.6 should be 0.50316 inch.
Sample Number x1 x2 x3 x4 x5 X r1 33 29 31 32 33 31.6 4
2 33 31 35 37 31 33.4 6
3 35 37 33 34 36 35.0 4
4 30 31 33 34 33 32.2 4
5 33 34 35 33 34 33.8 2
6 38 37 39 40 38 38.4 3
7 30 31 32 34 31 31.6 4
8 29 39 38 39 39 36.8 10
9 28 33 35 36 43 35.0 15
10 38 33 32 35 32 34.0 611 28 30 28 32 31 29.8 4
12 31 35 35 35 34 34.0 4
13 27 32 34 35 37 33.0 10
14 33 33 35 37 36 34.8 4
15 35 37 32 35 39 35.6 7
16 33 33 27 31 30 30.8 6
17 35 34 34 30 32 33.0 5
-
5/27/2018 FCM2063 - Printed Notes
61/126
61
18 32 33 30 30 33 31.6 3
19 25 27 34 27 28 28.2 9
20 35 35 36 33 30 33.8 6
(a)Construct X andR control charts.
(b)After the process is in control, estimate the process mean and standard deviation.
13.The overall length of a skew used in a knee replacement device is monitored usingand R charts. The following table gives the length for 20 samples of size 4.(Measurements are coded from 2.00 mm; that is, 15 is 2.15 mm.)
Observation Observation
Sample 1 2 3 4 Sample 1 2 3 4
1 16 18 15 13 11 14 14 15 13
2 16 15 17 16 12 15 13 15 16
3 15 16 20 16 13 13 17 16 15
4 14 16 14 12 14 11 14 14 21
5 14 15 13 16 15 14 15 14 13
6 16 14 16 15 16 18 15 16 14
7 16 16 14 15 17 14 16 19 16
8 17 13 17 16 18 16 14 13 19
9 15 11 13 16 19 17 19 17 13
10 15 18 14 13 20 12 15 12 17
(a)Using all the data, find trial control limits for andRcharts, construct the chart,and plot the data.
(b)Use the trial control limits from part (a) to identify out-of-control points. Ifnecessary, revise your control limits, assuming that any samples that plot outsidethe control limits can be eliminated.
(c)Assuming that the process is in control, estimate the process mean and processstandard deviation.
14.The thickness of a printed circuit board (PCB) is an important quality parameter. Dataon board thickness (in cm) are given below for 25 samples of three boards each.
Sample 1 2 3 Sample 1 2 3
1 0.0629 0.0636 0.0640 14 0.0645 0.0640 0.0631
2 0.0630 0.0631 0.0622 15 0.0619 0.0644 0.0632
3 0.0628 0.0631 0.0633 16 0.0631 0.0627 0.0630
-
5/27/2018 FCM2063 - Printed Notes
62/126
62
Sample 1 2 3 Sample 1 2 3
4 0.0634 0.0630 0.0631 17 0.0616 0.0623 0.0631
5 0.0619 0.0628 0.0630 18 0.0630 0.0630 0.0626
6 0.0613 0.0629 0.0634 19 0.0636 0.0631 0.0629
7 0.0630 0.0639 0.0625 20 0.0640 0.0635 0.0629
8 0.0628 0.0627 0.0622 21 0.0628 0.0625 0.0616
9 0.0623 0.0626 0.0633 22 0.0615 0.0625 0.0619
10 0.0631 0.0631 0.0633 23 0.0630 0.0632 0.0630
11 0.0635 0.0630 0.0638 24 0.0635 0.0629 0.0635
12 0.0623 0.0630 0.0630 25 0.0623 0.0629 0.0630
13 0.0635 0.0631 0.0630
(a)Using all the data, find trial control limits for andRcharts, construct the chart,
and plot the data.(b)Use the trial control limits from part (a) to identify out-of-control points. If
necessary, revise your control limits, assuming that any samples that plot outsidethe control limits can be eliminated.
(c) Assuming that the process is in control, estimate the process mean and processstandard deviation.
oooOOOooo
Chapter 6
6. Hypothesis TestingOne populationLearning objectives:
At the end of this chapter, student should be able to:
Explain the concept of hypothesis testing
Understand the procedure or steps to perform the test
-
5/27/2018 FCM2063 - Printed Notes
63/126
63
Do a testing about the mean when the population variance is known and isunknown
Do a testing about the proportion
Perform the testing about the variance
6.1 Introduction
There are two types of statistical inferences: estimation of population parametersand hypothesis testing. Hypothesis testing is one of the most important tools ofapplication of statistics to real life problems. Most often, decisions are required tobe made concerning populations on the basis of sample information. Statisticaltests are used in arriving at these decisions.
Statistical hypotheses are based on the concept of proof by contradiction. For
example, say, we test the mean () of a population to see if an experiment has
caused an increase or decrease in . We do this by proof of contradiction by
formulating a null hypothesis against alternative hypothesis.
6.1.1 Null Hypothesis:
It is a hypothesis which states that there is no difference between the proceduresand is denoted by H0. For the above example the corresponding H0would be thatthere has been no increase or decrease in the mean. Always the null hypothesis istested, i.e., we want to either accept or reject the null hypothesis because we haveinformation only for the null hypothesis.
6.1.2 Alternative Hypothesis:
It is a hypothesis which states that there is a difference between the proceduresand is denoted by H1.
In hypothesis testing there will be a correct decision or false decision would bemade on the null hypothesis as summarize in this table.
Suppose Accept H0as true Reject H0as false
H0is trueCorrect decision. Probability:1 -
Type I error. Probability:
H0is false Type II error. Probability: Correct decision.Probability: 1
-
5/27/2018 FCM2063 - Printed Notes
64/126
64
We will make a correct decision if we accept H0when H0is true or we will rejectH0when actually H0is false.
The risk of rejecting the null hypothesis when we should not reject it is calledtype I error with probability .It means that we make a false decision because we
reject H0when actually H0is true with probability .
While when we accept H0 but actually H0 is not true, then we make a wrongdecision and this decision is called type II error is. The probability of type II error
is . We cannot determine (beta) with the statistical tools you learn in thiscourse.
The probability type I error, is called the level of significance and (1- )100%is called the confidence level of the test and (1 ) is called the "power" of thetest.
6.1.3 Types of test
In hypothesis testing there are three types of test on any parameters of interestcalled two tailed (sided) test, upper tailed test and lower tailed test such as in thetable below.
Type NullHypothesis,