international baccalaureate math sl investigation_the correlation between lung cancer incidents and...
DESCRIPTION
TRANSCRIPT
![Page 1: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/1.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
Investigation
Math Studies
Working Title:
Candidate Name: Felix Dyrek
Candidate number: 001528-031
School name: Kolegium Europejskie
School number: 001528
Assignment supervisor: Katarzyna Nosalska
1
The correlation between lung cancer incidents and the mean amount of
smoked cigarettes
![Page 2: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/2.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
Introduction:
The aim of my investigation is to find out if there is a correlation between lung cancer incidents
in total and tobacco consumption of men and women in 6 countries.
Hypothesis:
My hypothesis assumes that the rate of lung cancer incidents is bigger in the countries with
higher tobacco consumption then in the countries with smaller tobacco consumption.
Method:
In order to be able to investigate the correlation between lung cancer incidents and tobacco
consumption I needed to collect data from the tobacco industry and various (lung) cancer
constitutions. The next step is to verify the collected data and form statistics. In order to calculate
the correlation the following mathematical methods are used:
The Pearson’s Correlation Coefficient
The X2 Test
2
![Page 3: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/3.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
Raw Data:
Table 1 : Lung Cancer incident rates per 100000 people
Country Lung Cancer
incidents
Female Lung Cancer
incidents
Male Lung Cancer
incidents
China 93 27 66
Japan 60 13 47
Thailand 87 37 50
Sweden 35 13 22
Poland 85 21 64
UK 73 22 51
Chart 1: Lung Cancer incident rates per 100000 people
China Japan Thailand Sweden Poland UK0
10
20
30
40
50
60
70
80
90
100
TotalFemaleMale
3
![Page 4: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/4.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
Table 2: Adult smokers in total
Country Adult smokers in
total
Adult smokers (per
100000 people)
Adult smokers (%)
China 462.800.000 35600 35.6
Japan 42.132.466 33100 33.1
UK 12.721.380 21000 21
Sweden 1.939.183 19000 19
Poland 13.149.988 34500 34.5
Thailand 15.019.407 23400 23.4
Chart 2: Comparison between percentage of adult smokers in total and countries in %
Adult Smokers
ChinaJapanUKSwedenPolandThailand
4
![Page 5: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/5.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
Table 3: Average amount of cigarettes smoked per year
Country Total amount of cigarettes
smoked per person
Total amount of cigarettes
smoked per 100000 people
China 1791 179.100.000
Japan 3023 302.300.000
UK 2232 223.200.000
Sweden 1202 120.200.000
Poland 2061 206.100.000
Thailand 1067 106.700.000
Chart 3: Comparison between the average amounts of cigarettes smoked per person / year and
countries in %
Average amount of smoked cigarettes per person / year
ChinaJapanUKSwedenPolandThailand
5
![Page 6: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/6.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
Calculations
The Pearson Correlation coefficient
The Pearson Correlation coefficient is used to identify if there is a correlation between the lung
cancer incidents and the average amount of smoked cigarettes per person. Table 4 is divided into
6 countries in order to be able to compare them. First of all it is to chart the researched data for
lung cancer incidents (x) and the amount of smoked cigarettes per person / year (y). The next
step is to multiply these data in order to obtain XY. Data in x and y have to be raised by 2 to
obtain the results for the last two columns. The following step is to sum up the obtained data up.
Table 4
Table 4
X Y XY X2 Y2
Country Lung Cancer Incidents
Amount of smoked cigarettes per person / year
1 China 93 179100000 16656300000
8649 32076810000000000
2 Japan 60 302300000 18138000000
3600 91385290000000000
3 UK 73 223200000 16293600000
5329 49818240000000000
4 Sweden 35 120200000 4207000000 1225 14448040000000000 5 Poland 85 206100000 1751850000
07225 42477210000000000
6
![Page 7: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/7.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
6 Thailand 87 106700000 9282900000 7569 11384890000000000Total 6 433 1137600000 8209630000
033597 241590480000000000
The last step is to insert the collected data into the Pearson Correlation Coefficient formula and
solve the equation.
There is no correlation.
7
r= 82096300000−6×72.16×189600000_______________ √33597−6×72.162 √241590480000000000−6×1896000002
r= 82089216000____________ √2354.60√25901520000000000
r= -1.54x1011
![Page 8: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/8.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
The X 2 Test
χcalc2 =∑
( f o−f e)2
f e
Where:
f o is an observed frequency
f e is an expected frequency
Observed Value Table (fo): Taken from Table 1, average of the European and Asian countries
within the female and male lung cancer incidents.
Female lung cancer
incidents
Male lung cancer
incidents
Sum
Europe 18.67 45.67 64.34
Asia 25.67 54.33 80
Sum 44.34 100 144.34
Calculation Table: The calculation table will be used to change the observed values into expected
values to have the possibility to calculate the x2 test.
S1 S2 Sum
R1 wy÷n wz÷n w
R2 xy÷n xz÷n x
8
![Page 9: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/9.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
Sum Y Z n
Expected Value Table (fe): This table represents the lung cancer incidents between Europe and
Asia. The data is based on my previous data on the 6 countries – China, Japan, Thailand,
Sweden, Poland and the United Kingdom divided into their representing continents. It is also
divided between male and female groups.
Female lung cancer
incdents
Male lung cancer
incidents
Sum
Europe 19.76 44.58 64.34
Asia 24.58 55.42 80
Sum 44.34 100 144.34
Now I am going to calculate the x2 test in order to observe if there exists a correlation between observed and expected values extracted from the tables concerning male and female lung cancer incidents within Europe and Asia.
χ2 Calculations:
fo fe fo−fe (fo−fe)2 (fo−fe)2÷fe
18.67 19.76 -1.09 1.1881 0.0601
45.67 44.58 1.09 1.1881 0.0267
25.67 24.58 1.09 1.1881 0.0483
54.33 55.42 -1.09 1.1881 0.0214
Total 0.1565
9
![Page 10: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/10.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
So, χ2
= 0.1565
The χ2
is small enough to observe that there is a correlation between observed and expected
Degrees of freedom
df = (r – 1)(c – 1)
The next step is find df and using a table to find the meaning of x2 which I just have obtained.
The x2 distribution depends on the number of degrees of freedom (df) where df = (r – 1)(c – 1)
My table equals:
df=(r-1)(c-1)df=(2-1)(2-1)df=1x1=1
10
![Page 11: International baccalaureate math sl investigation_the correlation between lung cancer incidents and the mean amount of smoked cigarettes by felix dyrek](https://reader036.vdocuments.net/reader036/viewer/2022082603/548a990ab47959090d8b45fb/html5/thumbnails/11.jpg)
Candidate name: Felix Dyrek Candidate number: 001528-031
Conclusion and Evaluation
Due to the results which I have obtained during my research it can be concluded that there
doesn’t exist a direct correlation between the amount of smoked cigarettes and the lung cancer
incidents. So my hypothesis is proven to be wrong. There can be various factors resulting in lung
cancer such as second hand smoke, car exhaust, multiple alpha, beta and gamma rays. As these
facots can oncrease the chance of lung cancer my data is not 100% accurate as there are external
factors which can increase the lung cancer incidents. Thus lung cancer incidents are not purely
based on the amount of consuming cigarettes even though it is a known fact that excessive
cigarette consumption may cause lung cancer. As Due to the explanation above the investigation
could be improved by including more external factors such as the one previously mentioned.
11