Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 3 Descriptive Statistics for Qualitative Data.

Similar presentations


Presentation on theme: "Chapter 3 Descriptive Statistics for Qualitative Data."— Presentation transcript:

1 Chapter 3 Descriptive Statistics for Qualitative Data

2 Categorical data Binary Multiple categories Gender 1,2 (m,f) Blood Type of ABO A,B,AB,O Degree of protein found in urine - +, +, ++,+++ Type of blood pressure Low, normal, high Ordinal Nominal review

3

4

5 Statistical Description for enumeration data

6 Absolute measure: The numbers counted for each category (frequencies) The absolute measure can hardly be used for comparison between different populations.

7 Relative measure Three kinds of relative measures: Proportion Intensity (Rate) Ratio

8 Proportion ( 构成比 ) : A part considered in relation to the whole. Eg, proportion of sex proportion of age proportion of mortality of diseases

9 DiseaseMortalityProportion (%) Malignant tumor5033.33 Circulation system4026.67 Respiration system3020.00 Digestive system2013.33 Infectious disease10 6.67 Total150100.00 Table 3.1 proportions of 5 disease death in 2001

10 Example 1 Question: Which grade has the most serious condition of myopias?

11 (2) Intensity Example A smoking population had followed up for 562833 person-years, 346 lung cancer cases were found. The incidence rate of lung cancer in the smoking population is : The incidence rate of lung cancer in the smoking population is : Incidence rate =346/562833 Incidence rate =346/562833 =61.47 per 100,000 person-year =61.47 per 100,000 person-year

12 In general, Denominator: Sum of the person-years observed in the period Numerator: Total number of the event appearing in the period Unit: person/person year, or 1/Year Nature: the relative frequency per unit of time.

13 Example The mortality rate of liver cancer in Guangzhou is 32 per 100,000 per year.

14 (3) Ratio Ratio is a number divided by another related number Examples Sex ratio of students in this class: No. of males : No. of females = 52% Coefficient of variation: CV=SD/mean Ratio of time spent per clinic visit: Large hospital : Community health station = 81.9 min. : 18.6 min. = 4.40

15 2. Caution in use of relative measures a.The denominator should be big enough! Otherwise the absolute measure should be used. Example: Out of 5 cases, 3 were cured– 60% ? b. Attention to the population where the relative measure comes from. Prevalence rate: Population is the students in the same grade Constitutes: Population is all the patients

16 The above two frequency distributions reflect two populations of all patients; To describe the prevalence rate, one has to look at the general population;

17 c. Pooled estimate of the frequency Pooled estimate =  numerators /  denominators Example: The prevalence of myopia among 3 grades ≠ (15.16+15.89+18.37)/3 The prevalence of myopia among 3 grades = (67+68+56)/(442+428+305) = 192/1175 = 16.34 d. Comparability between frequencies or between frequency distributions – Notice the balance of other conditions

18 e. If the distributions of other variables are different, to improve the comparability, “Standardization” is needed. f. To compare two samples, hypothesis test is needed. (See Chi square test) The following will emphasize the above two points: Standardization Hypothesis test

19 3. Standardization for crude frequency or crude intensity 3. Standardization for crude frequency or crude intensity Crude incidence rate of city A=28.96; Crude incidence rate of city B=35.03 -- Strange!? They are not comparable ! -- Because the constitute are quite different Table 10-3 Incidence rates of infectious diseases, children of two cities

20 Standardized incidence rate of city A = 793/24767 = 32.02 ‰ Standardized incidence rate of city B = 3523/24767 = 21.12 ‰ Two steps: Select a standard population– taking as “weight” Weighted average of the actual incidence rates–direct standardization rate

21 Known: Age specific populations N i1, N i2 ; Total no.of deaths D i1 =432, D i2 =210 Select a set of standard mortality rates Standard mortality ratio: SMR 1 = D i1 / N i1 P i = 432/100.67 = 4.2912 (smoker) SMR 2 = D i2 / N i2 P i = 210/243.61 = 0.8620 (non-smoker) Standardized mortality rate P ’ 1 =34.60 SMR 1 =148.48 (1/10 5 ), P ’ 2 =34.60 SMR 2 =29.83 (1/10 5 )

22 Table The total number of patients between 1991-1999

23 a 0 indicator for base year a n indicator for n st year Average speed of growth=average speed of development  1

24

25


Download ppt "Chapter 3 Descriptive Statistics for Qualitative Data."

Similar presentations


Ads by Google