Thursday, March 12, 2020
Investigate if there is any correlation between the GDP per capita ($) of a country and the life expectancy at birth (years) Essay Example
Investigate if there is any correlation between the GDP per capita ($) of a country and the life expectancy at birth (years) Essay Example Investigate if there is any correlation between the GDP per capita ($) of a country and the life expectancy at birth (years) Essay Investigate if there is any correlation between the GDP per capita ($) of a country and the life expectancy at birth (years) Essay Essay Topic: Life Of Pi My aim is to investigate if there is any correlation between the GDP per capita ($) of a country and the life expectancy at birth (years). The GDP is the gross domestic product or value of all final goods and services produced within a nation in a given year. GDP dollar ($) estimates are derived from purchasing power parity (PPP) calculations. The GDP per capita ($) shows GDP on a purchasing power parity basis divided by population. The life expectancy at birth shows the average number of years to be lived by a group of people born in the same year, if mortality at each age remains constant in the future. It shows the life expectancy on average for the total population for male and females. Life expectancy at birth is also a measure of overall quality of life in a country and summarizes the mortality at all ages. The reason for doing this investigation is that I have seen a lot of documentaries and read a lot of articles in the newspaper which have talked about how the gap between rich and poor has increased. This has led to a poorer quality of life in developing countries. So I wanted to see if there was any link between how rich a country is per person and what on average is the life expectancy for a person is in that country. This will help me get a better understanding of how rich a country is how much it affects the quality of life. This is the reason why I think the investigation is worth doing. Data collection: The data I collected was the GDP per capita using the purchasing power parity ($) and the life expectancy at birth (years). I have collected data for these two variables from the whole world. So my population is defined as the whole world. I obtained the data from the www.CIA.gov and clicked on the world fact book. I got 239 pieces of data originally for both then I had to reject 11 pieces of data for both because some countries did not have any data for the GDP. So from the 228 I used a sampling method of choosing every 4th country on the list until I narrowed my sample to 50 countries. I chose every 4th number because when you divide 228 by 50 and choose the integer number you get 4 this ensures this is a random sampled number which provides the most representative sample from the population. I used a systematic sampling method. The list was in alphabetical order and not in rank order for both variables so by using this method Im not creating any bias. Since the data is from the CI A website I must presume that the data is accurate and reliable. Here is a table of my data which has been systematically sampled to show 50 pairs of data: Country GDP per capita, Purchasing Power Parity ($) Life expectancy at birth (years) American Samoa 8000 75.75 Anguilla 8600 76.7 Armenia 3600 66.68 Bahamas, The 15300 65.71 Barbados 15000 71.84 Benin 1100 51.08 Bolivia 2500 64.78 British Virgin Islands 16000 76.06 Burma 1700 55.79 Cameroon 1700 48.05 Central African Republic 1200 41.71 China 4700 72.22 Congo, Democratic Republic of the 600 48.93 Cote dIvoire 1400 42.65 Djibouti 1300 43.13 East Timor 500 65.2 El Salvador 4600 70.62 Ethiopia 700 41.24 French Guiana 14400 76.69 Gambia, The 1800 54.38 Ghana 2000 56.53 Grenada 5000 64.52 Guatemala 3900 65.23 Guinea-Bissau 700 46.97 Honduras 2500 66.65 India 2600 63.62 Iraq 2400 67.81 Jersey 24800 78.93 Kenya 1100 45.22 Korea, South 19600 75.36 Laos 1800 54.3 Liberia 1000 48.15 Macau 18500 81.87 Malaysia 8800 71.67 Malta 17200 78.43 Martinique 10700 78.72 Mayotte 600 60.6 Monaco 27000 79.27 Morocco 3900 70.04 Nauru 5000 61.95 New Caledonia 14000 73.52 Nigeria 900 51.01 Pakistan 2000 62.2 Papua New Guinea 2100 64.19 Philippines 4600 69.29 Reunion 5600 73.43 Saint Helena 2500 77.38 Saint Pierre and Miquelon 11000 78.11 San Marino 34600 81.43 Saudi Arabia 11400 68.73 Modelling procedures: I am going to do a scatter diagram of GDP per capita against life expectancy at birth for my 50 pairs of data to see if there is any correlation. A scatter diagram is an appropriate modeling procedure as it shows a clear relationship between two random variables. As you can see from the scatter diagram the points form a relationship which appears to be a curve so to try to establish a more linear relationship. I am going to do this by first logging my data for the GDP per capita and not logging the life expectancy data and then do a scatter diagram of this data. I am then going to log the life expectancy data but not the GDP per capita data and do a scatter diagram of this data. Then finally I am going to log both my data for GDP per capita and the life expectancy at birth and do a scatter diagram. I am going to check which scatter diagram gives the strongest linear correlation and thats the data Im going to chose. Country Life expectancy at birth (years) Log of GDP per capita, Purchasing Power Parity ($) American Samoa 75.75 3.903089987 Anguilla 76.7 3.934498451 Armenia 66.68 3.556302501 Bahamas, The 65.71 4.184691431 Barbados 71.84 4.176091259 Benin 51.08 3.041392685 Bolivia 64.78 3.397940009 British Virgin Islands 76.06 4.204119983 Burma 55.79 3.230448921 Cameroon 48.05 3.230448921 Central African Republic 41.71 3.079181246 China 72.22 3.672097858 Congo, Democratic Republic of the 48.93 2.77815125 Cote dIvoire 42.65 3.146128036 Djibouti 43.13 3.113943352 East Timor 65.2 2.698970004 El Salvador 70.62 3.662757832 Ethiopia 41.24 2.84509804 French Guiana 76.69 4.158362492 Gambia, The 54.38 3.255272505 Ghana 56.53 3.301029996 Grenada 64.52 3.698970004 Guatemala 65.23 3.591064607 Guinea-Bissau 46.97 2.84509804 Honduras 66.65 3.397940009 India 63.62 3.414973348 Iraq 67.81 3.380211242 Jersey 78.93 4.394451681 Kenya 45.22 3.041392685 Korea, South 75.36 4.292256071 Laos 54.3 3.255272505 Liberia 48.15 3 Macau 81.87 4.267171728 Malaysia 71.67 3.944482672 Malta 78.43 4.235528447 Martinique 78.72 4.029383778 Mayotte 60.6 2.77815125 Monaco 79.27 4.431363764 Morocco 70.04 3.591064607 Nauru 61.95 3.698970004 New Caledonia 73.52 4.146128036 Nigeria 51.01 2.954242509 Pakistan 62.2 3.301029996 Papua New Guinea 64.19 3.322219295 Philippines 69.29 3.662757832 Reunion 73.43 3.748188027 Saint Helena 77.38 3.397940009 Saint Pierre and Miquelon 78.11 4.041392685 San Marino 81.43 4.539076099 Saudi Arabia 68.73 4.056904851 Country GDP per capita, Purchasing Power Parity ($) Log of Life expectancy at birth Log (years) American Samoa 8000 1.879382637 Anguilla 8600 1.884795364 Armenia 3600 1.823995591 Bahamas, The 15300 1.817631467 Barbados 15000 1.856366324 Benin 1100 1.708250889 Bolivia 2500 1.811440944 British Virgin Islands 16000 1.881156321 Burma 1700 1.746556361 Cameroon 1700 1.681693392 Central African Republic 1200 1.62024019 China 4700 1.858657484 Congo, Democratic Republic of the 600 1.689575216 Cote dIvoire 1400 1.629919036 Djibouti 1300 1.634779458 East Timor 500 1.814247596 El Salvador 4600 1.848927713 Ethiopia 700 1.615318657 French Guiana 14400 1.884738738 Gambia, The 1800 1.735439203 Ghana 2000 1.752278985 Grenada 5000 1.809694359 Guatemala 3900 1.814447379 Guinea-Bissau 700 1.67182056 Honduras 2500 1.823800154 India 2600 1.803593665 Iraq 2400 1.831293744 Jersey 24800 1.897242103 Kenya 1100 1.655330558 Korea, South 19600 1.87714089 Laos 1800 1.73479983 Liberia 1000 1.682596291 Macau 18500 1.91312479 Malaysia 8800 1.855337404 Malta 17200 1.894482215 Martinique 10700 1.896085085 Mayotte 600 1.782472624 Monaco 27000 1.899108858 Morocco 3900 1.845346137 Nauru 5000 1.792041311 New Caledonia 14000 1.866405498 Nigeria 900 1.707655324 Pakistan 2000 1.793790385 Papua New Guinea 2100 1.807467376 Philippines 4600 1.840670561 Reunion 5600 1.865873528 Saint Helena 2500 1.888628725 Saint Pierre and Miquelon 11000 1.892706638 San Marino 34600 1.910784435 Saudi Arabia 11400 1.837146344 Country Log of GDP per capita, Purchasing Power Parity Log ($) Log of Life expectancy at birth Log (years) American Samoa 3.903089987 1.879382637 Anguilla 3.934498451 1.884795364 Armenia 3.556302501 1.823995591 Bahamas, The 4.184691431 1.817631467 Barbados 4.176091259 1.856366324 Benin 3.041392685 1.708250889 Bolivia 3.397940009 1.811440944 British Virgin Islands 4.204119983 1.881156321 Burma 3.230448921 1.746556361 Cameroon 3.230448921 1.681693392 Central African Republic 3.079181246 1.62024019 China 3.672097858 1.858657484 Congo, Democratic Republic of the 2.77815125 1.689575216 Cote dIvoire 3.146128036 1.629919036 Djibouti 3.113943352 1.634779458 East Timor 2.698970004 1.814247596 El Salvador 3.662757832 1.848927713 Ethiopia 2.84509804 1.615318657 French Guiana 4.158362492 1.884738738 Gambia, The 3.255272505 1.735439203 Ghana 3.301029996 1.752278985 Grenada 3.698970004 1.809694359 Guatemala 3.591064607 1.814447379 Guinea-Bissau 2.84509804 1.67182056 Honduras 3.397940009 1.823800154 India 3.414973348 1.803593665 Iraq 3.380211242 1.831293744 Jersey 4.394451681 1.897242103 Kenya 3.041392685 1.655330558 Korea, South 4.292256071 1.87714089 Laos 3.255272505 1.73479983 Liberia 3 1.682596291 Macau 4.267171728 1.91312479 Malaysia 3.944482672 1.855337404 Malta 4.235528447 1.894482215 Martinique 4.029383778 1.896085085 Mayotte 2.77815125 1.782472624 Monaco 4.431363764 1.899108858 Morocco 3.591064607 1.845346137 Nauru 3.698970004 1.792041311 New Caledonia 4.146128036 1.866405498 Nigeria 2.954242509 1.707655324 Pakistan 3.301029996 1.793790385 Papua New Guinea 3.322219295 1.807467376 Philippines 3.662757832 1.840670561 Reunion 3.748188027 1.865873528 Saint Helena 3.397940009 1.888628725 Saint Pierre and Miquelon 4.041392685 1.892706638 San Marino 4.539076099 1.910784435 Saudi Arabia 4.056904851 1.837146344 You can see from the scatter diagrams that the log of GDP per capita against the life expectancy shows the strongest linear correlation so that is the one I am going to choose. Therefore this means that I am going to use the data for log of GDP per capita and the life expectancy at birth. From the scatter diagram I can see that there is a positive correlation between the two variables. From looking at the scatter diagram I can see that the data takes an elliptical shape. Since the ellipse appears to be quite narrow it implies that there is a good positive correlation i.e. as one variable increases, so does the other. Therefore the data shows a clear linear relationship. Another technique that I am going to use is a histogram because you are able to see the distribution clearly and able to determine whether I can use Pearsons product moment correlation (PMCC) or Spearmans coefficient of rank order. I am going to draw a histogram for each variable and if the distribution is not normally distributed I shall use Spearmans and if it is I shall use PMCC. As the histograms roughly show a normal distribution I am going to use PMCC method. Analysis: Now I am going to calculate the PMCC with the help of Microsoft Excel. x y x2 y2 XY 75.75 3.903089987 5738.063 15.23411 295.6591 76.7 3.934498451 5882.89 15.48028 301.776 66.68 3.556302501 4446.222 12.64729 237.1343 65.71 4.184691431 4317.804 17.51164 274.9761 71.84 4.176091259 5160.986 17.43974 300.0104 51.08 3.041392685 2609.166 9.250069 155.3543 64.78 3.397940009 4196.448 11.546 220.1186 76.06 4.204119983 5785.124 17.67462 319.7654 55.79 3.230448921 3112.524 10.4358 180.2267 48.05 3.230448921 2308.803 10.4358 155.2231 41.71 3.079181246 1739.724 9.481357 128.4326 72.22 3.672097858 5215.728 13.4843 265.1989 48.93 2.77815125 2394.145 7.718124 135.9349 42.65 3.146128036 1819.023 9.898122 134.1824 43.13 3.113943352 1860.197 9.696643 134.3044 65.2 2.698970004 4251.04 7.284439 175.9728 70.62 3.662757832 4987.184 13.41579 258.664 41.24 2.84509804 1700.738 8.094583 117.3318 76.69 4.158362492 5881.356 17.29198 318.9048 54.38 3.255272505 2957.184 10.5968 177.0217 56.53 3.301029996 3195.641 10.8968 186.6072 64.52 3.698970004 4162.83 13.68238 238.6575 65.23 3.591064607 4254.953 12.89575 234.2451 46.97 2.84509804 2206.181 8.094583 133.6343 66.65 3.397940009 4442.223 11.546 226.4727 63.62 3.414973348 4047.504 11.66204 217.2606 67.81 3.380211242 4598.196 11.42583 229.2121 78.93 4.394451681 6229.945 19.31121 346.8541 45.22 3.041392685 2044.848 9.250069 137.5318 75.36 4.292256071 5679.13 18.42346 323.4644 54.3 3.255272505 2948.49 10.5968 176.7613 48.15 3 2318.423 9 144.45 81.87 4.267171728 6702.697 18.20875 349.3533 71.67 3.944482672 5136.589 15.55894 282.7011 78.43 4.235528447 6151.265 17.9397 332.1925 78.72 4.029383778 6196.838 16.23593 317.1931 60.6 2.77815125 3672.36 7.718124 168.356 79.27 4.431363764 6283.733 19.63698 351.2742 70.04 3.591064607 4905.602 12.89575 251.5182 61.95 3.698970004 3837.803 13.68238 229.1512 73.52 4.146128036 5405.19 17.19038 304.8233 51.01 2.954242509 2602.02 8.727549 150.6959 62.2 3.301029996 3868.84 10.8968 205.3241 64.19 3.322219295 4120.356 11.03714 213.2533 69.29 3.662757832 4801.104 13.41579 253.7925 73.43 3.748188027 5391.965 14.04891 275.2294 77.38 3.397940009 5987.664 11.546 262.9326 78.11 4.041392685 6101.172 16.33285 315.6732 81.43 4.539076099 6630.845 20.60321 369.617 68.73 4.056904851 4723.813 16.45848 278.8311 Totals 3224.34 179.0276425 215012.6 653.5361 11793.26 This shows that my variables have a good positive correlation. I am now going to carry out a hypothesis test on the correlation coefficient to see if there is enough evidence from my sample to conclude that there is correlation in the whole population. : ? = 0 (There is no correlation between the two variables in all the countries in the world) : ? 0 (Positive Correlation) N= 50 I will be doing a one tail test at the 5% significant level So the critical value = 0.2353 So 0.833872644 0.2353 Therefore I can conclude that there is enough evidence from the sample to say that I accept that there is a positive correlation. Regression line The equation of the regression line is: As you can see on the page here is my scatter diagram with the regression line drawn on it which was all done in excel. This is Y upon X regression line. Interpretation: From the investigation that I have carried out I have discovered that that there is a positive correlation between my two sets of data which is shown on my graph and regression line. The aim of my investigation was to see if there is any correlation between the GDP per capita ($) of a country and the life expectancy at birth (years). I can now confidently say that I have achieved my aim as there is a positive correlation as predicted. The sample that I took is of the whole world and is a good representation of the whole population. By using the correlation results I can predict if there was a country with a low GDP then it is expected that they have a low average life expectancy. This trend would be expected for every country in a similar position but some countries may incur lower life expectancies than normal due to some external factor e.g. war, outbreak of a new disease or some sort of natural disaster. But regardless of these exceptions they shall not affect the overall correlation. I think that this data was worth investigating and collecting because I now realise how important the GDP per capita of a country is in affecting how long a person lives and how the higher the GDP the better the quality of life is for a person. This investigation has shown that people living in developing countries are more likely to die at a young age and will not have such a high quality of life as we enjoy in a country like the UK. I also think this investigation will act as very good evidence to try and convince richer nations to help poorer ones. This data should be given to an organisation like the United Nations to try an act as a catalyst to convince them to do something about this before it is too late. Accuracy and refinements: One possible source of error was that the data may have been displayed incorrectly on the website or I may have copied it incorrectly. I would improve this by comparing data from a number of different sources to ensure accurate and reliable results. The sampling method that I used could have been a possible source of error. This is because my systematic sample only included every 4th so for example every 3rd did not have a chance to be chosen. I could have improved my sampling method by using simple random sampling instead of systematic sampling. Simple random sampling ensures that every item of data has an equal chance of being chosen. This is a very important factor in ensuring the reliability of my work. Even though the data is very reliable there are some improvements that could be made. First of all the data was only collected for a given year in my case it was for 2003. For more accurate data I could have used data over five years to see if there is actually a difference and to see if for example at that given years there may have been a low life expectancy due to an external factor like war or disease. Also the sample was only from 228 countries and there are more countries in the world so a more fair representation would be to random sample from every country in the world. This was not possible because my source did not include some of these countries due to political reasons and from lack of information for those countries. In my investigation I had to reject 11 statistics for 11 countries this reduced the randomness of my sample. I would improve this by making sure that data was available for every item in the parent population. Overall I am very happy with the accuracy and reliability of my data because I got it from a very reliable source which was www.CIA.gov. Having a reliable source for my data enables me to achieve my aim of a positive correlation.
Subscribe to:
Posts (Atom)