Presentation is loading. Please wait.

Presentation is loading. Please wait.

Uses of Biostatistics in Epidemiology (1) Amornrath Podhipak, Ph.D. Department of Epidemiology Faculty of Public Health Mahidol University 2006.

Similar presentations


Presentation on theme: "Uses of Biostatistics in Epidemiology (1) Amornrath Podhipak, Ph.D. Department of Epidemiology Faculty of Public Health Mahidol University 2006."— Presentation transcript:

1 Uses of Biostatistics in Epidemiology (1) Amornrath Podhipak, Ph.D. Department of Epidemiology Faculty of Public Health Mahidol University 2006

2 Why Statistics ?? Why Computers ?? Why Software ?? Medical doctors and public health personnel A tools for calculation

3 Why do we need “statistics” in medicine and public health? (particularly, epidemiology??) *Medicine is becoming increasingly quantitative in describing a condition. Most of malaria patients are infected with P.falciparum. 82.5% got P.falciparum. Those patients looks pale.Haemoglobin level was 9.89 mg%, on average. Epidemiology concerns with describing disease pattern in a group of people. Descriptive statistics give a clearer picture of what we want to describe. * The answer to a research question need to be more definite. Is the new treatment better: how much better?, in what aspect?, any evidence? could it be a real difference? Inferential statistics give an answer in the world of uncertainty.

4 Measurement of characteristics (Variables vs Constant) 4 scales of measurement Qualitative variables - Nominal scale (group classification only) - Ordinal scale (classification with ordering / ranking) Quantitative variables - Interval (magnitude + constant distance between points) - Ratio (magnitude + constant distance between points + true zero) Before using statistics, we need some kinds of measurements, in order to get more detailed information.

5 Weght? 80 kg Height? 160 cm Handsom e? Intelligent? Income? 100,000 Married? BP? 140/90 HIV?

6 Female Male 1 2 Nominal scale Values have no meaning. Ordinal scale 1 23 Equal distance between points does not reflect equal interval value.

7 Interval scale i.e. degree celcius Ratio scale i.e. weight 0 Freezing point was supposed to be zero degree celcius Not the true ZERO temperature (no heat ) 102030 0 True ZERO (nothing here) 102030 Equal distance between points means equal interval value.

8 Questionnaire (TB and Passive smoking) Sex [ ] Male [ ] Female Education [ ] 1-6 yr [ ] 7-9 yr [ ] 9+ yr Family income ……………………. Baht/m Passive Smoking ……... Result from tuberculin test ……………………. mm X-ray [ ] +ve [ ] -ve Weight …………. kg,Height ………………….. cm Record form

9 Variable (characteristic being measured)Result of measurementType Marital statussingle/married/divorcednominal gendermale/femalenominal smokingyes/nonominal smokingnonsmoker/ light smoker/ ordinal moderate smoker/ heavy smoker smokingnumber of cig/dayratio feeling of painyes/nonominal feeling of painnone/light/moderate/highordinal feeling of pain0 ---------> 10ordinal attitude toward strongly agree/ agree/ordinal selective abortion not sure/ disagree/ strongly disagree blood pressuremmHgratio temperaturedegree celciusinterval weight gramratio tumor stageI, II, III, IVordinal

10 Quantitative (numeric, metric) variables are classified as continuousIt can take all values in an interval e.g. weight, temperature, etc. discreteIt can take only certain values (often integer value) e.g. parity, number of sex partners, etc. Continuous data can be categorised into groups, which one needs to define “upper boundary” and “lower boundary” of a value (or a class) 120121122123124125126127 boundaries: 120.5, 121.5, 122.5, 123.5, 124.5 … 120.1120.2120.3120.4120.5120.6120.7120.8 boundaries: 120.15, 120.25, 120.35, 120.45, 120.55 … 120.11120.12120.13120.14120.15120.16120.17120.18 boundaries: 120.115, 120.125, 120.135, 120.145, 120.155 …

11 Descriptive statistics - a way to summarize a dataset (a group of measurement) Example:Height of 100 children, 10-12 years of age. 140140140136141123125134125129 123161142155129130139129134130 140132138142155125136129136153 151141138125123134135135135130 155130134146135139134142139149 147155158135141136136147139132 134140141153142127147142146127 151140151140141147139134140149 132140141142165153146134151151 134141138130141132140138127129 What are values that best describe the height of these 100 persons?

12 1)Rearrange the data: 123123124125125125125127127127 129129129129129130130130130130 132132132132134134134134134134 134134134135135135135135136136 136136136138138138138139139139 139139140140140140140140140140 140140141141141141141141141141 142142142142142142146146146147 147147147149149151151151151151 153153153155155155155158161165 Minimum, Maximum, Range, Median, Mode 123, 165, 42, 139, 140 Max-Min, Value in the middle, Most repeated value

13 3) Present in a graph (Histogram) Frequency Height (cm)

14 Methods of data presentation 1. Table 2. Graph - line graph - bar chart - pie chart

15 - scatter plot - area graph - error bar - histogram

16 Another set of value for describing a dataset is the MEAN and STANDARD DEVIATION. Mean indicates the location. Standard deviation indicates the scatterness of data (roughly). Example: Dataset 1: Age of 6 children 444444 Mean = 4.0 years sd = 0 y (no variation) Example: Dataset 2: Age of 6 children 224466 Mean = 4.0 years sd = 1.79 y(with variation) or, another example:  The average body height of these children was 138.9 cm. with standard deviation of 8.9 cm.  The average body height of these children was 138.9 cm. with standard deviation of 0.2 cm.

17 If we categorize the data into qualitative (tall/short) the proportion would then be calculated. Descriptive statistics (proportion and/or percentage)  Most of the children were less than 150 cm. tall.  85% of them had height less than 152 cm.

18 A final note on defining a variable and a measurement: Important things to consider before making any measurement: 1.Do we measure the right thing? Fatty food and CVD 2.What is the tool that can actually measure what we want to measure? Morphology (measure) indicators % standard weight body mass index (wt/ht 2 ) tricep skinfold thickness Wt for age Wt for height etc. Food intake (ask)Protein calorie intake (ask & calculate) 3.How valid the instrument? Does the questionnaire actually get the fatty food intake information? (scope of questions, recall of subjects, certainty of reported amount of food, variability of ingredients, etc.) Does the information obtained actually reflect fatty food intake? 4.How precise the instrument? Does the information precisely estimate the amount of fatty food intake for each individual?

19 In summary: Statistics (and epidemiology) deals with a group (the bigger the group, the better the result) of persons (not one individual patient). We look for the characteristics which are most common in the group. Descriptive statistics is used for explaining our sample (or findings) i.e.  Most of the patients were anemic.  80% of them had haemoglobin level less than 10 mg%.  The average haemoglobin level was 9.5 mg% with standard deviation of 1.5 mg%. Inferential statistics (Infer to general population of interest)


Download ppt "Uses of Biostatistics in Epidemiology (1) Amornrath Podhipak, Ph.D. Department of Epidemiology Faculty of Public Health Mahidol University 2006."

Similar presentations


Ads by Google