2014.3.3 1 Medical Statistics Medical Statistics Tao Yuchun Tao Yuchun 2

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Descriptive Measures MARE 250 Dr. Jason Turner.
Class Session #2 Numerically Summarizing Data
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Measures of Dispersion
Introduction to Summary Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Introductory Mathematics & Statistics
Basic Practice of Statistics - 3rd Edition
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data: Numerical
Programming in R Describing Univariate and Multivariate data.
Numerical Descriptive Measures
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
(c) 2007 IUPUI SPEA K300 (4392) Outline: Numerical Methods Measures of Central Tendency Representative value Mean Median, mode, midrange Measures of Dispersion.
Objectives 1.2 Describing distributions with numbers
Chapter-2 Statistical description of quantitative variable.
7.7 Statistics & Statistical Graphs p.445. What are measures of central tendency? How do you tell measures of central tendency apart? What is standard.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Practice 1 Tao Yuchun Medical Statistics
Chapter 2 Describing Data.
Interpreting Performance Data
Descriptive Statistics1 LSSG Green Belt Training Descriptive Statistics.
Lecture 3 Describing Data Using Numerical Measures.
Skewness & Kurtosis: Reference
Medical Statistics Medical Statistics Tao Yuchun Tao Yuchun 3
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Describing and Displaying Quantitative data. Summarizing continuous data Displaying continuous data Within-subject variability Presentation.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
7.7 Statistics & Statistical Graphs p.445. An intro to Statistics Statistics – numerical values used to summarize & compare sets of data (such as ERA.
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
Chapter 5 Describing Distributions Numerically.
Statistics 1: Introduction to Probability and Statistics Section 3-2.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
LIS 570 Summarising and presenting data - Univariate analysis.
Cumulative frequency Cumulative frequency graph
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
By Tatre Jantarakolica1 Fundamental Statistics and Economics for Evaluating Survey Data of Price Indices.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
PRESENTATION OF DATA.
Chapter 2 Summarizing and Graphing Data
Chapter 3 Describing Data Using Numerical Measures
Introduction to Summary Statistics
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
Drill {A, B, B, C, C, E, C, C, C, B, A, A, E, E, D, D, A, B, B, C}
Statistics 1: Introduction to Probability and Statistics
Continuous Statistical Distributions: A Practical Guide for Detection, Description and Sense Making Unit 3.
Measures of Central Tendency
Numerical Descriptive Measures
Numerical Descriptive Statistics
Prepared by: C.Cichanowicz, March 2011
11.1 Find Measures of Central Tendency & Dispersion
Ch. 12 Vocabulary 9.) measure of central tendency 10.) outlier
Presentation transcript:

Medical Statistics Medical Statistics Tao Yuchun Tao Yuchun 2

Descriptive Statistics Descriptive Statistics 1. Statistical Description for Numerical Data

Frequency Distribution Example 1-1 There are 132 observations on blood sugar ( mmol/L ) of 55~58 years old normal adults (see below)

Look at the raw data, what feeling? vague and confuse! How to sort the data? Frequency table !

(1) Steps of building a frequency table a. Find Minimum, maximum and calculate range. Range=R=Max-Min= =1.99 b. Decide the interval of group. R/10=1.99/10=0.199≈0.2 i=0.2 ( i : length of sub-intervals) c. Work out the list of sub-intervals. sub-intervals: [lower limit , upper limit )

★ the first sub-interval must include the Min. ★ the last sub-interval must include the Max. ★ the number of sub-interval may be 8~15. 1st sub-interval: [3.60, 3.80) 2st sub-interval: [3.80, 4.00) … last sub-interval: [5.40, 5.60) (see column 1 of Table 1-1)Table 1-1

d. Read, mark and count for whole data. Mark methods: “ 正 ” or “ ” Tip Tip: one by one. (see column 2 of Table 1-1)Table 1-1 c. Calculate the frequencies. The total strokes for each sub-interval equals the frequency. (see column 3 of Table 1-1)

9 (2) Frequency plot ---histogram

(3) The use of frequency table  Show the profile of data distribution. Symmetric? Skew ? Central position? Variability? Outliers? ★ Frequency table is a good way to summarize the data figures are reduced to 22 figures and are much easier to present.

Numerical Description Central position (central tendency)

Variation (measure of dispersion)

Measures for Average (1) Arithmetic Mean Based on observed data Example: Blood sugar: 6.2, 5.4, 5.7, 5.3, 6.1, 6.0, 5.8, 5.9 (1.1)

Based on frequency tableBased on frequency table

(1.2) where f i and X k are the frequency and mid-value of the k-th sub-interval, n is the total sample size. Weighted mean For raw data

(2) Geometric mean where lg -1 is the anti-logarithm of lg, i.e. lg -1 =10 x. Example: titer values: 2, 4, 8, 32, 32, 64, 64.  (1.3) For titer 100×2 << 1×1024 !

(3) Median Ranking the values of observation from the smallest to the largest, Median = the value in the middle Based on raw dataBased on raw data Example 1: latent period (day): 6, 5, 4, 7, 12, 4, 5, 7, 9. (9 values) rank : 4 、 4 、 5 、 5 、 6 、 7 、 7 、 9 、 12 Median = 6 Tip Tip: odd number.

Example 2: latent period (day): 6, 5, 4, 7, 12, 4, 5, 7, 9, 20. (10 values) rank : 4 、 4 、 5 、 5 、 6 、 7 、 7 、 9 、 12 、 20 Median = (6+7)/2=6.5 Tip Tip: even number.

Based on frequency tableBased on frequency table Example 1-3 The same as Example 1-1. (see table 1-3)

formulaformula (1.4) where L is lower limit of the interval that included M, i is interval for each sub-interval, f M is frequency of the interval that included M, n is the total sample size, Σf L is cumulated frequency of the whole sub-intervals below L. How to confirm which interval the Median is in?

Method a. Calculate the cumulated frequency or the cumulated frequency (%). see Table 1-3.Table 1-3 b. Who is the 1st number that included n/2 or 50%? n/2=132/2=66, 86 is ! (50%, 65.15% is ) c. The median is in the sub-interval that this number is in. M is in [4.60, 4.80).

So: L = 4.60 , i = 0.2 , f M = 25 , Σf L =61. take in (1.4): Do you remember how to calculate mean of the raw data? ★ This M is the approximate value of the mean, a better approximated value.

Add: Add: Percentile --- P x  Ranking the values of observation from the smallest to the largest, and divide the range into 100 equal parts, (Percentile) P x = the x-th value. P 50 = the 50-th value, i.e. the middle value. So, P 50 = Median. MinMax P 50 = M PxPx x%x%100-x%

formulaformula (1.5) where L is lower limit of the interval that included P x, i is interval for each sub-interval, f M is frequency of the interval that included P x, n is the total sample size, Σf L is cumulated frequency of the whole sub-intervals below L. How to confirm which interval the P x is in? See an example : calculate P 25 of Example 1-3.

Method a. Calculate the cumulated frequency or the cumulated frequency (%). see Table 1-3.Table 1-3 b. Who is the 1st number that included P 25 or 25%? P 25 =132×25%=33, 37 is ! (25%, 28.03% is ) c. The P 25 is in the sub-interval that this number is in. P 25 is in [4.20, 4.40).

So: L = 4.20 , i = 0.2 , f 25 = 23 , Σf L =14. take in (1.5): also:

Symmetric RBC (10 12 /L)of 130 normal male adults in a place RBC(10 /L) Frequency

Positive skew Hair Mercury (μg/g) of 238 normal adults Hg(ug/g) Frequency

Summary 1.Mean 1.Mean: Suitable to symmetric distribution. Geometric mean 2. Geometric mean: G Suitable to positive skew distribution. Median 3. Median: M Suitable to all kinds of data, but poor attribute for further analysis.

C ★ Remember there are three averages after learning statistics, not only one!