Statistics: Intro Statistics or What’s normal about the normal curve, what’s standard about the standard deviation, and what’s co-relating in a correlation?

Slides:



Advertisements
Similar presentations
The Simple Regression Model
Advertisements

Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Sampling Distributions
Segment 3 Introduction to Random Variables - or - You really do not know exactly what is going to happen George Howard.
Business Statistics for Managerial Decision
CHAPTER 13: Binomial Distributions
U eatworms.swmed.edu/~leon u
Theoretical Probability Distributions We have talked about the idea of frequency distributions as a way to see what is happening with our data. We have.
Linear Transformation and Statistical Estimation and the Law of Large Numbers Target Goal: I can describe the effects of transforming a random variable.
Overview Revising z-scores Revising z-scores Interpreting z-scores Interpreting z-scores Dangers of z-scores Dangers of z-scores Sample and populations.
Probability & Certainty: Intro Probability & Certainty.
Statistics for the Social Sciences
The Basics of Regression continued
1.2: Describing Distributions
Introduction to Probability and Statistics Linear Regression and Correlation.
Need to know in order to do the normal dist problems How to calculate Z How to read a probability from the table, knowing Z **** how to convert table values.
1 The Sample Mean rule Recall we learned a variable could have a normal distribution? This was useful because then we could say approximately.
Statistics Alan D. Smith.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Probability & Certainty: Intro Probability & Certainty.
Lecture 6: Descriptive Statistics: Probability, Distribution, Univariate Data.
1 Binomial Probability Distribution Here we study a special discrete PD (PD will stand for Probability Distribution) known as the Binomial PD.
Inferential Statistics
Review of Probability and Binomial Distributions
The standard error of the sample mean and confidence intervals How far is the average sample mean from the population mean? In what interval around mu.
Chapter 6 Probability.
1 CY1B2 Statistics Aims: To introduce basic statistics. Outcomes: To understand some fundamental concepts in statistics, and be able to apply some probability.
Chapter 5 Sampling Distributions
Concepts and Notions for Econometrics Probability and Statistics.
© Copyright McGraw-Hill CHAPTER 6 The Normal Distribution.
1 9/8/2015 MATH 224 – Discrete Mathematics Basic finite probability is given by the formula, where |E| is the number of events and |S| is the total number.
1 9/23/2015 MATH 224 – Discrete Mathematics Basic finite probability is given by the formula, where |E| is the number of events and |S| is the total number.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
Probability, contd. Learning Objectives By the end of this lecture, you should be able to: – Describe the difference between discrete random variables.
Statistics Recording the results from our studies.
Theory of Probability Statistics for Business and Economics.
Describing Location in a Distribution. Measuring Position: Percentiles Here are the scores of 25 students in Mr. Pryor’s statistics class on their first.
Applied Business Forecasting and Regression Analysis Review lecture 2 Randomness and Probability.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
Measures of Dispersion & The Standard Normal Distribution 2/5/07.
+ Recitation 3. + The Normal Distribution + Probability Distributions A probability distribution is a table or an equation that links each outcome of.
1 G Lect 4a G Lecture 4a f(X) of special interest: Normal Distribution Are These Random Variables Normally Distributed? Probability Statements.
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
Measures of Dispersion & The Standard Normal Distribution 9/12/06.
Introduction to Behavioral Statistics Probability, The Binomial Distribution and the Normal Curve.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Stats Exam Prep. Dr. Lin Lin. WARNING The goal of this workshop is to go over some basic concepts in probability and statistic theories required for IS.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
7.2 Means and variances of Random Variables (weighted average) Mean of a sample is X bar, Mean of a probability distribution is μ.
Overview Of Probability Distribution. Standard Distributions  Learning Objectives  Be familiar with the standard distributions (normal, binomial, and.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Quadratic Regression ©2005 Dr. B. C. Paul. Fitting Second Order Effects Can also use least square error formulation to fit an equation of the form Math.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Binomial Distributions Chapter 5.3 – Probability Distributions and Predictions Mathematics of Data Management (Nelson) MDM 4U.
From the population to the sample The sampling distribution FETP India.
Chapter 7 Random Variables and Discrete Distributions.
Warsaw Summer School 2015, OSU Study Abroad Program Normal Distribution.
STROUD Worked examples and exercises are in the text Programme 29: Probability PROGRAMME 29 PROBABILITY.
Hypothesis test flow chart
Binomial Distributions Chapter 5.3 – Probability Distributions and Predictions Mathematics of Data Management (Nelson) MDM 4U Authors: Gary Greer (with.
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
Chapter 7: Random Variables 7.2 – Means and Variance of Random Variables.
7.2 Day 1: Mean & Variance of Random Variables Law of Large Numbers.
Chapter 5 Sampling Distributions
The normal distribution
Probability Key Questions
Chapter 5 Sampling Distributions
Chapter 6: Random Variables
Presentation transcript:

Statistics: Intro Statistics or What’s normal about the normal curve, what’s standard about the standard deviation, and what’s co-relating in a correlation?

Statistics: Intro Overview What’s normal about the normal curve? –The nature of the confusion –One formal answer –An intuitive answer (real-time demo) What’s standard about a standard deviation? –Z-scores What’s co-relating in a correlation?

Statistics: Intro What’s normal about the normal curve(s)? There are a number of ways of mathematically defining and estimating the normal distribution (which defines a class of curves, not one single curve) The main question I want to address today is: what does that math mean? Why are so many things normally distributed? What makes sure that those things stay distributed normally? What stops other things from being normally distributed at all?

Statistics: Intro U: Why do you think height is distributed normally? L: Come again? (sarcastic) U: Why is it that women's height can be graphed using a normal curve? L: That's a strange question. U: Strange? L: No one's ever asked me that before..... (thinking to herself for a while) I guess there are 2 possible theories: Either it's just a fact about the world, some guy collected a lot of height data and noticed that it fell into a normal shape..... U: Or? L: Or maybe it's just a mathematical trick. U: A trick? How could it be a trick? From: Wilensky, U., (1997). What is Normal Anyway? Therapy for Epistemological Anxiety. Educational Studies in Mathematics. Special Issue on Computational Environments in Mathematics Education. Noss R. (Ed.) Volume 33, No. 2. pp

Statistics: Intro L: Well... Maybe some mathematician somewhere just concocted this crazy function, you know, and decided to say that height fit it. U: You mean... L: You know the height data could probably be graphed with lots of different functions and the normal curve was just applied to it by this one guy and now everybody has to use his function. U: So you’re saying that in the one case, it's a fact about the world that height is distributed in a certain way, and in the other case, it's a fact about our descriptions but not about height? L: Yeah. U: Well, if you had to commit to one of these theories, which would it be? L: If I had to choose just one? U: Yeah. L: I don't know. That's really interesting. Which theory do I really believe? I guess I've always been uncertain which to believe and it's been there in the background you know, but I don't know. I guess if I had to choose, if I have to choose one, I believe it's a mathematical trick, a mathematician's game.....What possible reason could there be for height,....for nature, to follow some weird bizarro function?

Statistics: Intro Formal answer 1: The binomial distribution I The chance of an event of probability p happening r times out of n tries: P(r) = n!/(r! (n - r)!) * p r * (1 - p) n-r (Recall: We wondered about this generalization last class.)

Statistics: Intro Formal answer 1: The binomial distribution II Why is it called the binomial distribution? Bi = 2 Nom = thing = the two-thing distribution It can be used wherever: 1.Each trial has two possible outcomes (say, success and failure; or heads and tails) 2.The trials are independent = the outcome of one trial has no influence over the outcome of another trial. 3. The trials are mutually exclusive 4. The events are randomly selected

Statistics: Intro Let’s try it out (Example 6.3 from last class) What are the odds of there being exactly one seven out of two rolls? one way is to roll 7 first, but not second - the odds of this are 1/6 * 5/6 (independent events) = the odds of rolling 7 second are 5/6 * 1/6 (independent events) = since these two outcomes are mutually exclusive, we can add them to get = 0.277

Statistics: Intro The generalization (Example 6.3 from last class) An event of probability p happens r times out of n tries: P(r) = n!/(r! (n - r)!) * p r * (1 - p) n-r p = 1/6; N = 2; r = 1 2!/(1!1!)*1/6 1 *5/6 1 = What are the odds of there being exactly one seven out of two rolls?

Statistics: Intro What does this have to do with the normal distribution?

Statistics: Intro What does this have to do with the normal distribution?

Statistics: Intro Why does this normal distribution happen? [See for the StarLogoT demo used in class. Can you understand: What effect changing the probabilities of each event has? What has to change to skew a normal curve?]

Statistics: Intro The standard deviation Given the non-linear shape of the normal distribution, one has two choices: –A.) Keep the amount of variation in each division constant, but vary the size of the divisions –B.) Keep the size of each division constant, but vary the the amount of variation in each division From:

Statistics: Intro The standard deviation (SD) The SD takes the second approach: it keeps the size of each division constant, but varies the the amount of variation in each division The SD is a measure of average deviation (difference) from the mean It is the square root of the variance, which is the average squared difference from the mean.

Statistics: Intro Z-scores If we express differences by dividing them by SDs, we have z-scores: standard units of difference from the mean THESE Z-SCORES WILL COME IN EXTREMELY USEFUL! –For example, we might want to know: If a 12-foot elephant is taller (compared to the height of average elephants) than a 230 pound man is heavy (compared to weight of average men) If a person with a WAIS IQ of 140 is rarer than a person with a GPA of 3.9 —Etc.

Statistics: Intro What co-relates in a correlation? In a correlation, we want to find the equation for the (one and only) line (the line of regression) which describes the relation between variables with the least error. –This is done mathematically, but the idea is simply that we draw a line such that the squared distances on two (or more) dimensions of points from the line would not be less for any other line

Statistics: Intro What co-relates in a correlation? R = The covariance of x and y / the product of the SDs of X and Y Covariance is related to variance = the mean value of all the pairs of differences from the mean for X multiplied by the differences from the mean for Y (the mean product of differences from the means) When X and Y are related, large numbers will be systematically multiplied by large numbers with the same sign (for differences on both sides of the mean) = covariance will be large & close to the product of the SDs of X and Y, so R will be close to 1.