Unit 3: Variation and the Normal Curve. Not normal doesn’t mean abnormal The “normal (Gaussian, bell-shaped) curve” is common, but NOT universal –salaries.

Slides:



Advertisements
Similar presentations
UNIT 8: Statistical Measures
Advertisements

Measures of Central Tendency and Variation 11-5
The Normal Distribution
The Normal Distribution
Normal Distribution; Sampling Distribution; Inference Using the Normal Distribution ● Continuous and discrete distributions; Density curves ● The important.
The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Unit 5: Regression. Psych Dept’s version Regression line is z y = r z x –i.e., change in x by 1 std unit changes y by the fraction r std unit. Algebraically.
A surveys ask high school freshman whether they would be willing to pay $5 for their yearbooks. Out of the 225 freshman surveyed, 198 said “yes”. What.
Graphing Linear Equations In Slope-Intercept Form.
1 The Basics of Regression. 2 Remember back in your prior school daze some algebra? You might recall the equation for a line as being y = mx + b. Or maybe.
2-5 : Normal Distribution
Unit 3: Variation and the Normal Curve. Review: Standard Units “z-score” (“std units”): z = ( x –  x ) / σ –the number of σ’s above average –(if negative,
Return to the definitions in Class Handout #2 to see some properties of a normal distribution, which is one type of bell-shaped (mound shaped) distribution.
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Unit 9: Statistics By: Jamie Fu and Neha Surapaneni.
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 10 Sampling Distributions.
Linear Regression.
STAT 5372: Experimental Statistics Wayne Woodward Office: Office: 143 Heroy Phone: Phone: (214) URL: URL: faculty.smu.edu/waynew.
Normal Distribution Links Standard Deviation The Normal Distribution Finding a Probability Standard Normal Distribution Inverse Normal Distribution.
Chapter 3 – Descriptive Statistics
Chapter 9 Sampling Distributions and the Normal Model © 2010 Pearson Education 1.
The Normal Distribution The “Bell Curve” The “Normal Curve”
Statistics Introduction to Statistic [stuh-tis-tik] noun. A numerical fact or datum, especially one computed from a sample.
Significance Tests: THE BASICS Could it happen by chance alone?
2.3 – Slopes, Forms of Lines. Slope Slope = measure of steepness of a line in the Cartesian plane for two points Slope = m = Two ways to calculate slope:
UNIT 8:Statistical Measures Measures of Central Tendency: numbers that represent the middle of the data Mean ( x ): Arithmetic average Median: Middle of.
Copyright © 2010 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Lesson 4-4 Example Determine the slope of the function y = 5x – 2. Step 1Complete a function table for the equation.
Objectives Vocabulary
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Stats 95.
Chapter 6 The Normal Curve. A Density Curve is a curve that: *is always on or above the horizontal axis *has an area of exactly 1 underneath it *describes.
Describing Location in a Distribution 2.1 Measures of Relative Standing and Density Curves 2.1 Measures of Relative Standing and Density Curves Text.
SP 225 Lecture 9 The Normal Curve.  ‘Bell’ Shaped  Unimodal in center  Tails extend to infinity in either direction.
Chapter 10 Correlation and Regression
5.3 Random Variables  Random Variable  Discrete Random Variables  Continuous Random Variables  Normal Distributions as Probability Distributions 1.
Final Exam Review!!!!!. Experimental Design Random Sampling Stratified Sampling Convenience Sampling Cluster Sampling Systematic Sampling Multistage Sampling.
Slide 6-1 Copyright © 2004 Pearson Education, Inc.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Part III Taking Chances for Fun and Profit Chapter 8 Are Your Curves Normal? Probability and Why it Counts.
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
Lesson 6-6 Example Example 2 Graph y = 3x + 2 and determine the slope of the line. 1.Complete a function table for the equation.
The Standard Deviation as a Ruler and the Normal Model
Anthony J Greene1 Where We Left Off What is the probability of randomly selecting a sample of three individuals, all of whom have an I.Q. of 135 or more?
Write linear equations from tables or graphs.
Introduction A Review of Descriptive Statistics. Charts When dealing with a larger set of data values, it may be clearer to summarize the data by presenting.
Data and Variation.
MFM 2P Minds On Plot the following points: (5, 7) (-2, 3) (-10, 0) (-8, -5) (0, 6) Using either a whiteboard or graph paper, draw the Cartesian Plane and.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 5, Slide 1 Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Slide Chapter 2d Describing Quantitative Data – The Normal Distribution Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley.
Statistics What is statistics? Where are statistics used?
Normal Distributions (aka Bell Curves, Gaussians) Spring 2010.
Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Chapter 131 Normal Distributions. Chapter 132 Thought Question 2 What does it mean if a person’s SAT score falls at the 20th percentile for all people.
Unit 2: Some Basics. The whole vs. the part population vs. sample –means (avgs) and “std devs” [defined later] of these are denoted by different letters.
Z-scores, normal distribution, and more.  The bell curve is a symmetric curve, with the center of the graph being the high point, and the two sides on.
Chapter 6 (part 2) WHEN IS A Z-SCORE BIG? NORMAL MODELS A Very Useful Model for Data.
MM150 Unit 9 Seminar. 4 Measures of Central Tendency Mean – To find the arithmetic mean, or mean, sum the data scores and then divide by the number of.
Chapter 3.3 – 3.4 Applications of the Standard Deviation and Measures of Relative Standing.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Graphing Lines Using Slope Intercept Form Algebra 1 Glencoe McGraw-HillLinda Stamper.
Daniel S. Yates The Practice of Statistics Third Edition Chapter 2: Describing Location in a Distribution Copyright © 2008 by W. H. Freeman & Company.
Measures of Central Tendency and Variation
Summary (Week 1) Categorical vs. Quantitative Variables
Presentation transcript:

Unit 3: Variation and the Normal Curve

Not normal doesn’t mean abnormal The “normal (Gaussian, bell-shaped) curve” is common, but NOT universal –salaries –small data sets Other common distributions are –uniform: all counts are the same (rolls of a die) –Poisson: life lengths, etc. Sums or averages of big samples from a population often have normal distributions –(We’ll see this later) –IQ’s: sums of many small mental properties –people heights: sums of many small bone lengths

Review: Standard Units “z-score” (“std units”): z = ( x –  x ) / σ –the number of σ’s above average –(if negative, below average) Ex: Data 3, 3, 5, 6, 7, 9:  x = 5.5 –differences: -2.5, -2.5, -.5,.5, 1.5, 3.5 –σ = RMS of differences ≈ 2.15 –z ≈ -1.17, -1.17, -.23,.23,.70, 1.63 NOT normally distributed

Ex: A list of 100 numbers, already in standard units, begins -5.8, -4.3, 6.1,.2, 10.2, Is something wrong? They seem large -- remember, 3σ away from μ, which is ±3 in std units, is very rare Can we check? Well, μ = 0, σ = 1, so sum of their squares should be 1 = 100/100 But (-5.8) 2 + (4.3) 2 + (6.1) is adding up to more than 100 fast In fact, (10.2) 2 alone is more than 100 So yes, they are too big to be in std units

Normal approximation Valid only IF you believe data is roughly normally distributed... and you need its avg and std dev Normal table in text gives percentages of data within z σ’s of μ, for normally distributed data in std units Switch to and from std units for other normally distributed data Don’t make an algorithm -- draw the bell curve(s) to decide what arithmetic to do

Normal table z Area(%) z Area(%) z Area(%) z Area(%) z Area(%)

Normal approx: Ex 1 Weights in the population of a city follow the normal curve, with  w = 140, σ = 30. About what % of pop weighs over 185? –In std units, 185 is ( )/30 = 1.5. Normal table says % > 1.5 or < -1.5 is ( )% = 13.36%. We only want right half: 13.36%/2 = 6.68% –Much too “accurate”; this is only approximation: 6.7%, or even 7%

Normal approx: Ex 2 Scores on a college entrance exam follow normal curve (odd!), with  x = 120 and σ = 40. –(a) About what score is the 80th %ile? –(b) About what is the IQR? (a)In normal table, we need z that gives percent in center, not 80%, but (80 - ( ))% = 60%, which is z =.85. So 80th %ile of scores is [undoing std units] (40) ≈ 154 (b) We need z so that 50% of the data is between z and -z, and that’s z =.70. So the 3rd quartile is (.70), the 1st is (-.70), and their difference is the IQR, 2(40(.70)) = 56

Normal approx: Ex 3 Data following the normal curve has avg 80 and std dev 10. –(a) What is the 15th %ile? –(b) What is the 83rd %ile? –(c) What % of data is between 85 and 95? –(d) What % of data is between 60 and 90?

Measurement errors (Scientific) measurements are sum of –the true value (an ideal concept) –bias (not prejudgment, but a consistent error built into the method of measurement) e.g., a measuring tape that has stretched too far –chance error (the result of many tiny changes that affect the measuring process) e.g., how you stand, humidity, phase of moon, etc. In the NB10 ex in text, true value is 10g (by law), bias is about 400μg (consistent error), chance error is small differences from that

The terms might be grouped as... by an idealist: true value + error –Profs. Valente & Schult have a short note relating this to Plato’s metaphor of the cave by an experimenter: (consistent measurement) + (chance error) –because you can’t distinguish between true value and bias without changing the method of measurement

Foreshadowing... The fact that chance errors in measurement are (regarded as) normally distributed is the basis for the significance tests called z-tests that we’ll study at the end of the course.

Recall some elementary algebra “Plotting points”: Points in plane ↔ ordered pairs of real numbers Graph of y = mx + b is set of points whose pairs that satisfy it –It’s a line –(y-)intercept b is value where line hits y-axis (i.e., x=0) –m = slope = rise / run = (diff in y-values) / (diff in x-values) for any two points on line –Ex: Line through (1,4) and (3,2) has slope -1, y-intercept 5 Useful form: Point-slope form: –y - (y-value) = (slope) (x - (x-value)) –Ex: Equation of line through (2,3) and (5,7)