SPREAD MEASURES Prof. Dr. Hamit ACEMOĞLU


The Aim
By the end of this lecture, students will be familiar with spread measures and able to calculate the extent of spread using SPSS.

The Goals
By the end of this lecture, students should be able to:
- list the spread measures;
- use SPSS to calculate the range, percentiles, variance, and standard deviation;
- state the formulas for the variance and the standard deviation;
- explain variation within individuals and between individuals.

Spread measures
- Interval (range)
- Percentile range
- Variance
- Standard deviation
- Variations within and between individuals

If we specify two properties of a numerical dataset, its central tendency and its spread, we have summarized most of the structure of the data. Measures of central tendency were covered in a previous lecture. This lecture covers measures of spread.

Republic of Turkey, Governorship of Istanbul, Istanbul Public Health Directorate
Patient laboratory results (page 1 / 2)
Format of each entry: test name: result unit (reference range)
WBC: 5.8 K/µ (3.0 - 12.0)
NE%: 51.7 % (35 - 80)
NE#: 3 (1.1 - 9.6)
LYM%: 38.9 (15 - 50)
LYM#: 2.3 (0.5 - 6.0)
MONO%: 7.4 (2 - 12)
MONO#: 0.4 (0.1 - 1.4)
EOS%: 1.3 (0 - 6)
EOS#: 0.1 (0 - 0.4)
BASO%: 0.7 (0 - 2)
BASO#: 0.0 - 0.3
RBC: 3.8 M/uL (3.2 - 6.0)
HGB: 11.9 g/dL (10 - 18)
HCT: 35 (30 - 55)
MCV: 91.9 fL (78 - 105)
MCH: 31.3 pg (25 - 33)
MCHC: 34 (30 - 36)
RDW: 13 (9 - 18)
PLT: 316 (150 - 500)
MPV: 7.8 (0 - 15)
Iron (serum): 89 ug/dL (37 - 145)
Iron-binding capacity: 233 (155 - 355)
Ferritin: 28.9 ng/mL (11.0 - 306.8)
Vitamin B12: 240 pg/mL (126.5 - 505)
LDL cholesterol: ▲178 mg/dL (0 - 130)
Cholesterol: ▲256 (0 - 200)
HDL cholesterol: 53 (35 - 70)
Triglyceride: 125 (0 - 150)
Alanine aminotransferase (ALT): 14 U/L (0 - 35)
Aspartate transaminase (AST): 19 IU/L
Bilirubin (total), indirect bilirubin: 0.6 (0.1 - 1.0)
Bilirubin (direct), direct bilirubin: 0.0 - 0.2
Bilirubin (total), total bilirubin: 0.3 - 1.2
Gamma-glutamyl transferase (GGT): 20 (0 - 38)
Lactate dehydrogenase (LDH): 193 (0 - 247)
Glucose: 87 (74 - 106)
Glycosylated hemoglobin (Hb A1C): 5.5 (4.0 - 6.0)
Free T3: 3.1 (2.1 - 3.8)

Interval (range)
The difference between the largest and the smallest value in the data is called the range: R = Xmax - Xmin. Usually the smallest (min.) and largest (max.) values are reported instead of the range itself. Note that the range is not a sufficiently reliable measure of spread when the data contain outliers.
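As a minimal sketch in Python, the range formula can be applied to the eight height values used in the quartile example of this lecture. The variable names are illustrative and are not part of the original slides.

```python
# Heights (No 1 to 8) taken from the quartile example in this lecture.
heights = [145, 148, 154, 160, 166, 170, 176, 182]

smallest = min(heights)
largest = max(heights)
data_range = largest - smallest  # R = Xmax - Xmin

# Report min and max alongside the range, as the slide recommends.
print(f"min = {smallest}, max = {largest}, range = {data_range}")
```

For these data the range is 182 - 145 = 37. A single extreme height would change this result directly, which is why the range is sensitive to outliers.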

Percentile ranges
When the data are sorted from smallest to largest, the value below which 1% of the data lie is called the 1st percentile, and the value below which 50% of the data lie is called the 50th percentile.
Position of the 1st quartile: (n+1)/4
Position of the 3rd quartile: 3 x (position of the 1st quartile) = 3(n+1)/4

No   Height
1    145
2    148
3    154
4    160
5    166
6    170
7    176
8    182

- Position of the 1st quartile in this data set: (8+1)/4 = 2.25
- 1st quartile = 148 + (154 - 148) x 0.25 = 149.5
- Position of the 3rd quartile = 3 x (position of the 1st quartile) = 3 x 2.25 = 6.75
- 3rd quartile = 170 + (176 - 170) x 0.75 = 174.5
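The quartile calculation for the height data can be sketched in Python as follows. The helper `value_at_position` is a hypothetical name introduced here for illustration. It implements the (n+1)-based position rule from the slides with linear interpolation between neighbouring observations. Other software may use different quantile conventions and give slightly different values.

```python
def value_at_position(sorted_values, position):
    """Return the value at a 1-based, possibly fractional position,
    interpolating linearly between the two neighbouring observations."""
    n = len(sorted_values)
    if position <= 1:
        return float(sorted_values[0])
    if position >= n:
        return float(sorted_values[-1])
    lower = int(position)        # rank of the observation just below
    fraction = position - lower  # how far to move toward the next one
    below = sorted_values[lower - 1]
    above = sorted_values[lower]
    return below + fraction * (above - below)


heights = sorted([145, 148, 154, 160, 166, 170, 176, 182])
n = len(heights)

q1_position = (n + 1) / 4      # 2.25
q3_position = 3 * q1_position  # 6.75

q1 = value_at_position(heights, q1_position)
q3 = value_at_position(heights, q3_position)

print(f"Q1 at position {q1_position}: {q1}")
print(f"Q3 at position {q3_position}: {q3}")
```

With these data the sketch gives 149.5 for the 1st quartile and 170 + 0.75 x 6 = 174.5 for the 3rd quartile.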

The value at exactly the 50th percentile is called the "median". The interval between the 25th and 75th percentiles is called the interquartile range. When the data are sorted, the interquartile range contains the middle 50% of the observations.
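A short sketch of the median and the interquartile range for the same height data, using Python's standard `statistics` module. The choice of `method="exclusive"` is an assumption made here because that method places quartiles at (n+1)-based positions, which agrees with the position rule used in this lecture. SPSS or other tools may be configured differently.

```python
import statistics

heights = [145, 148, 154, 160, 166, 170, 176, 182]

# Median: the 50th percentile (mean of the two middle values for even n).
median = statistics.median(heights)

# Quartiles with (n+1)-based positions; returns [Q1, Q2, Q3].
q1, q2, q3 = statistics.quantiles(heights, n=4, method="exclusive")

# Interquartile range: spans the middle 50% of the sorted data.
iqr = q3 - q1

print(f"median = {median}, Q1 = {q1}, Q3 = {q3}, IQR = {iqr}")
```

Here the median is 163 and the interquartile range runs from 149.5 to 174.5, a width of 25.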

When the data come from a sample large enough to represent the population, the interval that remains after excluding the 2.5% of values at each end is called the reference interval, reference range, or normal range. It therefore contains the central 95% of the values. For measurements such as laboratory results, we compare an individual value with this range to decide whether it is normal or not.
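The comparison against a reference range can be sketched as below. The results and ranges are copied from the laboratory report shown earlier in these slides. The function name `classify` and the dictionary layout are hypothetical choices for illustration only, and the sketch is not a clinical decision rule.

```python
def classify(result, low, high):
    """Compare one result with its reference range."""
    if result < low:
        return "low"
    if result > high:
        return "high"
    return "normal"


# (result, lower limit, upper limit), as printed on the laboratory report.
lab_results = {
    "LDL cholesterol": (178, 0, 130),
    "Cholesterol": (256, 0, 200),
    "HDL cholesterol": (53, 35, 70),
    "Triglyceride": (125, 0, 150),
    "Glucose": (87, 74, 106),
}

status = {name: classify(*values) for name, values in lab_results.items()}
outside_range = [name for name, s in status.items() if s != "normal"]

for name, s in status.items():
    print(f"{name}: {s}")
```

The two results that fall outside their ranges, LDL cholesterol and cholesterol, are the same ones marked with ▲ on the report.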

Variance
One way of measuring the spread of the data is to look at how far each observation deviates from the arithmetic mean. We cannot simply average these deviations, because the positive and negative deviations cancel each other out. Instead, we square the distance of each value from the arithmetic mean, sum these squares, and divide by the sample size minus one (n-1). The result is called the variance and is denoted s².

Unlike the arithmetic mean, the variance is calculated by dividing by (n-1) rather than n. The reason is that we work with a particular sample, not the entire population. It can be shown theoretically that dividing by (n-1) gives a value closer to the population variance.
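The verbal description above can be written out as formulas. This is a standard derivation added for reference. The symbols x_i for the observations, x-bar for the arithmetic mean, and n for the sample size are notation assumed here. The last expression is the computational form used in the worked example that follows.

```latex
\begin{align*}
\bar{x} &= \frac{1}{n}\sum_{i=1}^{n} x_i \\
\sum_{i=1}^{n} (x_i - \bar{x}) &= \sum_{i=1}^{n} x_i - n\bar{x} = 0
  && \text{(deviations cancel, so they cannot be averaged)} \\
s^2 &= \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1}
     = \frac{\sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2 / n}{n-1} \\
s &= \sqrt{s^2}
\end{align*}
```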

Example
S² = (9726 - (300)²/10)/9 = 80.67
S = 8.98

No    Age (X)   X²
1     14        196
2     25        625
3     38        1444
4     41        1681
5     22        484
6
7     26        676
8
9     33        1089
10    35        1225
Sum   300       9726

No    Age (Xi)   Mean   Xi - Mean   (Xi - Mean)²
1     14         30     -16         256
2     25         30     -5          25
3     38         30     8           64
4     41         30     11          121
5     22         30     -8          64
6
7     26         30     -4          16
8
9     33         30     3           9
10    35         30     5           25
Sum   300                           726

n - 1 = 10 - 1 = 9
Variance = 726/9 = 80.67
Standard deviation = 8.98
The variance and standard deviation can be calculated as shown in the table.
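The worked example can be reproduced with a short Python sketch. One assumption is needed: the extracted table shows only eight of the ten ages, and the two missing values are taken here to be 25 and 41, the only pair consistent with the stated totals (sum 300, sum of squares 9726). Their placement in rows 6 and 8 is arbitrary and does not affect the result.

```python
import math
import statistics

# ASSUMPTION: the 6th and 8th values (41 and 25) are inferred from the
# totals on the slide; their order is a guess and does not matter here.
ages = [14, 25, 38, 41, 22, 41, 26, 25, 33, 35]

n = len(ages)
mean = sum(ages) / n  # 300 / 10 = 30

# Definition: sum of squared deviations from the mean, divided by n - 1.
sum_sq_dev = sum((x - mean) ** 2 for x in ages)
variance_definition = sum_sq_dev / (n - 1)

# Computational form: (sum of x^2 - (sum of x)^2 / n) / (n - 1).
sum_x = sum(ages)
sum_x2 = sum(x * x for x in ages)
variance_shortcut = (sum_x2 - sum_x ** 2 / n) / (n - 1)

sd = math.sqrt(variance_definition)

# Cross-check with the standard library's sample variance (n - 1 divisor).
library_variance = statistics.variance(ages)

print(f"variance = {variance_definition:.2f}, sd = {sd:.2f}")
```

Both forms give a variance of about 80.67 and a standard deviation of about 8.98, in agreement with the slide.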

Standard deviation
The standard deviation is the square root of the variance. Dividing the standard deviation by the arithmetic mean and expressing the result as a percentage gives the coefficient of variation. The advantage of the coefficient of variation is that it does not depend on the unit of the variable (it is expressed as %). However, it is not preferred because of its theoretical disadvantages.
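A small sketch of the coefficient of variation, using the summary values from the age example in this lecture (sum 300, n = 10, sum of squared deviations 726). The conversion of years to months is a hypothetical illustration, added only to show that the coefficient does not depend on the unit.

```python
import math

# Summary values from the age example in this lecture.
n = 10
mean_years = 300 / n            # arithmetic mean = 30
sd_years = math.sqrt(726 / 9)   # standard deviation, about 8.98

# Coefficient of variation: standard deviation as a percentage of the mean.
cv_years = sd_years / mean_years * 100

# Hypothetical unit change: the same ages expressed in months.
# Both the mean and the standard deviation scale by 12, so the ratio is unchanged.
mean_months = mean_years * 12
sd_months = sd_years * 12
cv_months = sd_months / mean_months * 100

print(f"CV (years)  = {cv_years:.1f}%")
print(f"CV (months) = {cv_months:.1f}%")
```

For the age data the coefficient of variation is roughly 29.9% in either unit.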

Variations within and between individuals
We can get different results if we make multiple measurements on the same individual (intra-individual differences). These differences may arise because the individual does not give the same response every time, or because of measurement error. However, intra-individual differences are smaller than inter-individual differences. These differences are important to consider when designing a study.

Spread criteria: positive and negative properties

Interval (range)
Positive properties:
- Easily determined
Negative properties:
- Uses only two observations
- Affected by outliers
- Tends to increase as the sample size increases

Interval based on percentiles
Positive properties:
- Generally not affected by outliers
- Independent of the sample size
- Suitable for skewed data
Negative properties:
- Calculation is cumbersome
- Cannot be calculated for small samples
- Not defined algebraically

Spread criteria: positive and negative properties (continued)

Variance
Positive properties:
- Considers every observation
- Defined algebraically
Negative properties:
- The measurement unit is the square of the unit of the raw data
- Affected by outliers
- Not suitable for skewed data

Standard deviation
Positive properties:
- Has the same advantages as the variance
- The measurement unit is the same as that of the raw data
- Easily interpreted
Negative properties:
- Not suitable for skewed data

Exercises
Sample data: 3, 5, 8, 9, 11, 13, 23
- What is the range of the sample data?
- What are the 1st and 3rd quartiles?
- What is the variance?
- What is the standard deviation?

Spread measures: Summary
- Interval (range)
- Percentile range
- Variance
- Standard deviation
- Variations within and between individuals