Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.

Slides:



Advertisements
Similar presentations
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
Advertisements

Describing Quantitative Variables
The Normal distributions BPS chapter 3 © 2006 W.H. Freeman and Company.
Measures of Dispersion
Statistics for the Social Sciences
Descriptive Statistics Chapter 3 Numerical Scales Nominal scale-Uses numbers for identification (student ID numbers) Ordinal scale- Uses numbers for.
PSY 307 – Statistics for the Behavioral Sciences
Measures of Variability or Dispersion
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Standard Deviation A measure of variability
Data Transformation Data conversion Changing the original form of the data to a new format More appropriate data analysis New.
As with averages, researchers need to transform data into a form conducive to interpretation, comparisons, and statistical analysis measures of dispersion.
X = =2.67.
Learning Objectives In this chapter you will learn about the importance of variation how to measure variation range variance standard deviation.
Chapter 5: Variability and Standard (z) Scores How do we quantify the variability of the scores in a sample?
Measures of Variability: Range, Variance, and Standard Deviation
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
The Normal distributions PSLS chapter 11 © 2009 W.H. Freeman and Company.
Objectives (BPS 3) The Normal distributions Density curves
Today: Central Tendency & Dispersion
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
BIOSTATISTICS II. RECAP ROLE OF BIOSATTISTICS IN PUBLIC HEALTH SOURCES AND FUNCTIONS OF VITAL STATISTICS RATES/ RATIOS/PROPORTIONS TYPES OF DATA CATEGORICAL.
3.3 Density Curves and Normal Distributions
Looking at Data - Distributions Density Curves and Normal Distributions IPS Chapter 1.3 © 2009 W.H. Freeman and Company.
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
The Normal distributions BPS chapter 3 © 2006 W.H. Freeman and Company.
BUS250 Seminar 4. Mean: the arithmetic average of a set of data or sum of the values divided by the number of values. Median: the middle value of a data.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Measures of Variability Variability. Measure of Variability (Dispersion, Spread) Variance, standard deviation Range Inter-Quartile Range Pseudo-standard.
Chapter 5 The Normal Curve. In This Presentation  This presentation will introduce The Normal Curve Z scores The use of the Normal Curve table (Appendix.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Transformations, Z-scores, and Sampling September 21, 2011.
1 Univariate Descriptive Statistics Heibatollah Baghi, and Mastee Badii George Mason University.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
The Normal distributions BPS chapter 3 © 2006 W.H. Freeman and Company.
3 common measures of dispersion or variability Range Range Variance Variance Standard Deviation Standard Deviation.
Chapter 3: Averages and Variation Section 2: Measures of Dispersion.
IPS Chapter 1 © 2012 W.H. Freeman and Company  1.1: Displaying distributions with graphs  1.2: Describing distributions with numbers  1.3: Density Curves.
1 Psych 5500/6500 Measures of Variability Fall, 2008.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
An article on peanut butter reported the following scores (quality ratings on a scale of 0 to 100) for various brands. Construct a comparative stem-and-leaf.
Statistics Unit 9 only requires us to do Sections 1 & 2. * If we have time, there are some topics in Sections 3 & 4, that I will also cover. They tie in.
Measures of Dispersion Section 4.3. The case of Fred and Barney at the bowling alley Fred and Barney are at the bowling alley and they want to know who’s.
Chapter 9 – The Normal Distribution Math 22 Introductory Statistics.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 5. Measuring Dispersion or Spread in a Distribution of Scores.
Chapter 4: Variability. Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Normal distributions Normal curves are used to model many biological variables. They can describe a population distribution or a probability distribution.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 4-6 Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
Welcome to… The Exciting World of Descriptive Statistics in Educational Assessment!
2.4 Measures of Variation The Range of a data set is simply: Range = (Max. entry) – (Min. entry)
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
Exploratory Data Analysis
The Normal distribution
Normal distributions x x
11. The Normal distributions
The normal distribution
Objectives The student will be able to:
Chapter 3.
Teacher Introductory Statistics Lesson 2.4 D
Measures in Variability
10.2 Variance Math Models.
Measures of Dispersion
Numerical Statistics Measures of Variability
Presentation transcript:

Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control

Problem 7: Dispersion Prepare 2 line graphs, one for males and one for females using the data presented below. Put both line graphs on the same axes.

Problem 7: Dispersion Attitudes on Race Relations MalesFemales XfXf

Problem 7: Dispersion

How can we quantify the difference between the men and the women in this problem. Compute the mean (average) for the men. Compute the mean (average) for the women.

Problem 7: Dispersion What are the highest and lowest scores for the men? What are the highest and lowest scores for the women? Count the number of scores from lowest to highest. This number is called the Range of the scores. In this case the Range doesn’t help us describe the difference between the males and the females. We need better measures of dispersion.

Problem 8: Dispersion For the following data: What is the highest and lowest score? What is the Range? (count the number of scores from the lowest to the highest.) What is the Mean (average)? How far is each person from the Mean? (Fill in the column. Always subtract the mean from the score. )

Problem 8: Dispersion Data Table SubjectScore X Distance from Mean x = (Score – Mean) Squared Distance from Mean Fred0 George1 Harry2 Jerry4 Larry5 Jennifer6 Jan7 Joan8 Jessica8 Juana9 N =Total = Mean = Total deviation =Sum Squares =

Problem 8: Dispersion Compute the “Sum of Squared Deviations from the Mean” (SS) for this data set (or sample or whatever you call it). Compute the variance of the sample. Compute the standard deviation of the sample.

Dispersion Definitions The range is the number of scores from the smallest to the largest. Deviation Score = Score – Mean – Always subtract the mean from the score – Always preserve the sign (positive or negative) – The total of the deviation scores is always zero Sum Squares = Total of the squared deviation scores. (SS) Variance = SS/N Standard Deviation = square root of variance

Standard Deviation Surely there is an easier way to measure dispersion than using all this squaring and square rooting. Turns out, the standard deviation is the exact point on a normal curve where the second derivative is zero. If you were skiing down the slope, it would get steeper and steeper then it would start to flatten out. That point is the standard deviation. That’s why it is the preferred measure of dispersion.

Standard Deviation

Problem 9 Given the following collection of scores: 2, 3, 5, 6, 6, 8 – Calculate the range of the scores – Calculate the sum of squares – Calculate the variance – Calculate the standard deviation

Problem 9 Data Table SubjectXDeviation score (x)x2x2 Fran2 Frank3 Frangelica5 Fonz6 Frieda6 Fabiano8 N =Total = Mean = SS =

Normal distributions e = … The base of the natural logarithm π = pi = … Normal—or Gaussian—distributions are a family of symmetrical, bell- shaped density curves defined by a mean  (mu) and a standard deviation  (sigma): N (  ). xx

A family of density curves Here the means are different (  = 10, 15, and 20) while the standard deviations are the same (  = 3). Here the means are the same (  = 15) while the standard deviations are different (  = 2, 4, and 6).

mean µ = 64.5 standard deviation  = 2.5 N(µ,  ) = N(64.5, 2.5) All Normal curves N  ) share the same properties Reminder: µ (mu) is the mean of the idealized curve, while is the mean of a sample. σ (sigma) is the standard deviation of the idealized curve, while s is the s.d. of a sample.  About 68% of all observations are within 1 standard deviation (  of the mean (  ).  About 95% of all observations are within 2  of the mean .  Almost all (99.7%) observations are within 3  of the mean. Inflection point

Definitions: Statistical Symbols In an actual sample – Scores are represented by – Mean = – Deviation Score – Standard Deviation = s – Variance = s 2 In a theoretical distribution (density curve) – Mean = μ – Standard Deviation = σ – Variance = σ 2