Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.

Slides:



Advertisements
Similar presentations
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
Advertisements

The Normal Distribution
Normal distribution. An example from class HEIGHTS OF MOTHERS CLASS LIMITS(in.)FREQUENCY
The Normal Distribution and the Rule p.html p.html.
C HAPTER 2: T HE N ORMAL D ISTRIBUTIONS. R ECALL SECTION 2.1 In section 2.1 density curves were introduced: A density curve is an idealized mathematical.
Today’s Question Example: Dave gets a 50 on his Statistics midterm and an 50 on his Calculus midterm. Did he do equally well on these two exams? Big question:
The Normal Curve Z Scores, T Scores, and Skewness.
Chapter 6 Introduction to Continuous Probability Distributions
Z - SCORES standard score: allows comparison of scores from different distributions z-score: standard score measuring in units of standard deviations.
PSY 307 – Statistics for the Behavioral Sciences
14.4 The Normal Distribution
Normal Distribution Z-scores put to use!
Statistics Normal Probability Distributions Chapter 6 Example Problems.
The Normal distributions PSLS chapter 11 © 2009 W.H. Freeman and Company.
1 Normal Distributions Heibatollah Baghi, and Mastee Badii.
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
Looking at Data - Distributions Density Curves and Normal Distributions IPS Chapter 1.3 © 2009 W.H. Freeman and Company.
Normal Distribution Z-scores put to use!
Warm-Up If the variance of a set of data is 12.4, what is the standard deviation? If the standard deviation of a set of data is 5.7, what is the variance?
Section 2.2, Part 1 Standard Normal Calculations AP Statistics Berkley High School/CASA.
Describing Location in a Distribution. Measuring Position: Percentiles Here are the scores of 25 students in Mr. Pryor’s statistics class on their first.
Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
And the Rule THE NORMAL DISTRIBUTION. SKEWED DISTRIBUTIONS & OUTLIERS.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Nature of Science Science Nature of Science Scientific methods Formulation of a hypothesis Formulation of a hypothesis Survey literature/Archives.
Copyright © 2012 by Nelson Education Limited. Chapter 4 The Normal Curve 4-1.
Answering Descriptive Questions in Multivariate Research When we are studying more than one variable, we are typically asking one (or more) of the following.
Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Measures of Dispersion & The Standard Normal Distribution 2/5/07.
The Standard Normal Distribution
Standardized Distributions Statistics Introduction Last time we talked about measures of spread Specifically the variance and the standard deviation.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
3 Some Key Ingredients for Inferential Statistics.
Chapter 5: z-Scores x = 76 (a) X = 76 is slightly below average x = 76 (b) X = 76 is slightly above average 3 70 x = 76 (c) X = 76 is far.
IE(DS)1 Many of the measures that are of interest in psychology are distributed in the following manner: 1) the majority of scores are near the mean 2)
IPS Chapter 1 © 2012 W.H. Freeman and Company  1.1: Displaying distributions with graphs  1.2: Describing distributions with numbers  1.3: Density Curves.
Stats 95. Normal Distributions Normal Distribution & Probability Events that will fall in the shape of a Normal distribution: –Measures of weight, height,
THE NORMAL DISTRIBUTION AND Z- SCORES Areas Under the Curve.
The Normal distribution and z-scores
Answering Descriptive Questions in Multivariate Research When we are studying more than one variable, we are typically asking one (or more) of the following.
Honors Advanced Algebra Presentation 1-6. Vocabulary.
+ Unit 4 – Normal Distributions Week 9 Ms. Sanchez.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 5. Measuring Dispersion or Spread in a Distribution of Scores.
Chapter 4 Lesson 4.4a Numerical Methods for Describing Data
Normal distributions Normal curves are used to model many biological variables. They can describe a population distribution or a probability distribution.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 16, 2009.
Unit 4: Normal Distributions Part 2 Statistics. Focus Points Given mean μ and standard deviation σ, convert raw data into z-scores Given mean μ and standard.
STANDARDIZED VALUES: LESSON 21. HOW CAN YOU USE STANDARDIZED VALUES TO COMPARE VALUES FROM TWO DIFFERENT NORMAL DISTRIBUTIONS? STANDARDIZED VALUE TELLS.
2.2 Standard Normal Calculations. Empirical vs. Standardizing Since all normal distributions share common properties, we have been able to use the
6.2 – USE NORMAL DISTRIBUTIONS Unit 6 – Data Analysis and Probability.
Homework Check.
Finding Percentages with z-scores
z-Scores, the Normal Curve, & Standard Error of the Mean
Interpreting Center & Variability.
Let’s look at how this is done.
z-Scores, the Normal Curve, & Standard Error of the Mean
Applications of the Normal Distribution
Interpreting Center & Variability.
Homework Check.
Many of the measures that are of interest
Warm Up – Tuesday The table to the left shows
Interpreting Center & Variability.
Measuring location: percentiles
Section 2.2 Standard Normal Calculations
Warm Up Your textbook provides the following data on the height of men and women. Mean Std. Dev Men Women ) What is the z score.
Normal Curve 10.3 Z-scores.
Normal Distributions.
6.2 Use Normal Distributions
Warm Up Your textbook provides the following data on the height of men and women. Mean Std. Dev Men Women ) What is the z score.
Presentation transcript:

Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control

Interpreting Scores What do the numbers mean?

Johnny came home from 4 th grade and told his mother he’d made 100 on his test. That’s good! But it was a 200 point test. That’s bad! But it was a very difficult test and Johnny’s score was one of the highest in the district. That’s good! But Johnny wasn’t the only one who got 100, the average score on the test was 100. That’s not so good.

What have we learned? The fact is that a raw score by itself is meaningless. To interpret a persons score you must know how everybody else scored. For a score to have meaning, you have to know where that score is in the distribution.

The two main things we need to know to interpret a score are: How far is is from the mean How spread out are the scores

The Deviation Score Deviation score commonly used in statistics to make a score more interpretable. Deviation score: how far the score is from the mean

Some Notation In statistics the raw score is symbolized by a UPPER CASE The mean of the raw scores is symbolized by The deviation score is symbolized by a lower case The deviation score is computed by subtracting the mean from the score:

If someone scores at the mean, the deviation score would be zero. If someone scores above average, the deviation score will be a positive number. If the score is below the mean the deviation score will be a negative number

If Johnny had come home and told his mother that his deviation score on the test was 0, she would have known immediately that he was average. (Johnny’s mother is a professor of statistics at the local college)

But that is not all. While the distance a persons score is from the mean is more meaningful than the raw score, the interpretation of the distance from the mean depends on how spread out the scores are.

The importance of Dispersion For example, if Johnny tells his mother he scored 10 points above the mean on a test, we know right away that he is above average. Question is, how much above average.

If the average score on the test is 55 and Johnny scores 65 and that is the highest score on the test then scoring 10 points above the mean is very good. (see figure1)

If on the other hand, the highest score on the test is 100, then a 65 is not so great.

So? What we really need is a way to express a score that takes into account both how far the score is from the mean and how spread out the scores are.

z-Scores The standard deviation is the parameter that measures the dispersion or spread of the distribution. z-scores measure the distance from the mean in standard deviation units.

z-Scores If a person scores 1 standard deviation (SD) above the mean, the z-score will be +1 If they score 1 SD below the mean the z-score will be –1 If they score 2 SD’s above the mean the z-score will be +2 If they score at the mean the z-score will be zero. Etc.

Areas Under the Normal Curve The proportion of the area under the normal curve can be interpreted as the probability that a score appears in that area. Areas here are shown for standard deviation units.

Areas Under the Curve As shown here, the percentage of the distribution in a standard deviation band is the same regardless of the shape of the distribution

Problem 10: Compute z-Scores SubjectScorex = X - Meanx2x2 z score S11 S24 S34 S45 S55 S66 S77 S88 N=Total = Mean = SS = s =

Problem 10: Compute z-Scores SubjectScorex = X - Meanx2x2 z score S S S S45000 S55000 S S77241 S N= 8Total = 40 Mean = 5 SS = 32 s = 2

Problem 11: Properties of z-Scores Subjectz – Scores (from Problem 10) Deviation score of the z’s Squared deviations of z’s S1 S2 S3 S4 S5 S6 S7 S8 N =Total of z’s = Mean of z’s = SS of z’s = Standard deviation of z’s =

Problem 11: Properties of z-Scores Subjectz – Scores (from Problem 10) Deviation score of the z’s Squared deviations of z’s S1-2 4 S S S4000 S5000 S S7111 S N = 8Total of z’s = 0 Mean of z’s = 0 SS of z’s = 8 Standard deviation of z’s = 1

Using the Standard Normal Distribution Because all Normal distributions share the same properties, we can us the standard normal distribution (the distribution of z- scores) for our computations and get the same results. In the distribution with mean of 64.5 and standard deviation of 2.5, 68% of the distribution is between 62 and 67 (-1 SD to +1 SD). In the standard normal distribution (with mean 0 and standard deviation 1), 68% of the distribution is between -1 SD and +1 SD. N(0,1) => N(64.5, 2.5) Standardized height (no units)

Problem 12: Women’s Heights The average woman is 64.5 inches tall. Mean = 64.5 Standard Deviation = 2.5

Problem 12: Women’s Heights Maria is 67 inches tall (5’ 7”). What is Maria’s z- score? What percent of women are shorter than Maria? What percent are taller?

Problem 12: Women’s Heights Alexis is 62 inches tall (5’ 2”). What is Alexis’ z- score? What percent of women are shorter than Alexis? What percent are taller?

Problem 12: Women’s Heights Barbie is 69.5 inches tall (5’ 9.5”). What is Barbie’s z-score? What percent of women are shorter than Barbie? What percent are between Alexis and Barbie?

Problem 12: Women’s Heights Leela is 68 ¾ inches tall (5’ 8 ¾ ”). What is Leela’s z- score? Can we compute the percent of women who are shorter than Leela? Why or why not?

Problem 12: Women’s Heights Leela is 68 ¾ inches tall. Her z-score is 1.5 Use to compute the percent of women who are shorter than Leela.

Problem 12: Women’s Heights How tall do you have to be to be taller than 50% of the women? How tall do you have to be to be taller than 84% of the women? How tall do you have to be to be taller than 97.6% of the women?

Problem 12: Women’s Heights Use for the following problems: How tall do you have to be to be taller than 95% of the women? How tall do you have to be to be taller than 99% of the women? We can be sure that 95% of the women are between what heights?

Problem 13 Use for the following problem: You have been timing how long it takes to get to work in the mornings. The mean is 22.6 minutes with a standard deviation of 8.16 minutes. You have to be at work at 8:30 am at the latest. How many minutes before 8:30 do you have to leave to be 95% confident that you will get there at or before 8:30? When do you have to leave to be 99% sure you’ll be there by 8:30?