Stor 155, Section 2, Last Time Prediction in Regression –Given new point X 0, predict Y 0 –Confidence interval for mean –Prediction Interval for value.

Slides:



Advertisements
Similar presentations
Sta220 - Statistics Mr. Smith Room 310 Class #14.
Advertisements

Lecture 3 HSPM J716. Efficiency in an estimator Efficiency = low bias and low variance Unbiased with high variance – not very useful Biased with low variance.
6-1 Stats Unit 6 Sampling Distributions and Statistical Inference - 1 FPP Chapters 16-18, 20-21, 23 The Law of Averages (Ch 16) Box Models (Ch 16) Sampling.
© 2004 Prentice-Hall, Inc.Chap 5-1 Basic Business Statistics (9 th Edition) Chapter 5 Some Important Discrete Probability Distributions.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
Stat 155, Section 2, Last Time Big Rules of Probability: –Not Rule ( 1 – P{opposite}) –Or Rule (glasses – football) –And rule (multiply conditional prob’s)
© 2003 Prentice-Hall, Inc.Chap 5-1 Basic Business Statistics (9 th Edition) Chapter 5 Some Important Discrete Probability Distributions.
ฟังก์ชั่นการแจกแจงความน่าจะเป็น แบบไม่ต่อเนื่อง Discrete Probability Distributions.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Statistics.
Stat 31, Section 1, Last Time Paired Diff’s vs. Unmatched Samples
Using Statistics to Analyze your Results
AP Statistics Chapter 16. Discrete Random Variables A discrete random variable X has a countable number of possible values. The probability distribution.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 4-1 Introduction to Statistics Chapter 5 Random Variables.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Statistics.
Stat 217 – Day 10 Review. Last Time Judging “spread” of a distribution “Empirical rule”: In a mound-shaped symmetric distribution, roughly 68% of observations.
Chapter 6 Random Variables
Variance Fall 2003, Math 115B. Basic Idea Tables of values and graphs of the p.m.f.’s of the finite random variables, X and Y, are given in the sheet.
Stor 155, Section 2, Last Time 2-way Tables –Sliced populations in 2 different ways –Look for independence of factors –Chi Square Hypothesis test Simpson’s.
Scot Exec Course Nov/Dec 04 Ambitious title? Confidence intervals, design effects and significance tests for surveys. How to calculate sample numbers when.
Stat 155, Section 2, Last Time Pepsi Challenge: When are results “significant” vs. “random”? Independence –Conditional Prob’s = Unconditional Prob’s –Special.
Stat 1510: Introducing Probability. Agenda 2  The Idea of Probability  Probability Models  Probability Rules  Finite and Discrete Probability Models.
Week71 Discrete Random Variables A random variable (r.v.) assigns a numerical value to the outcomes in the sample space of a random phenomenon. A discrete.
Variance Formula. Probability A. The importance of probability Hypothesis testing and statistical significance Probabilistic causation - because error.
Review of Econ424 Fall –open book –understand the concepts –use them in real examples –Dec. 14, 8am-12pm, Plant Sciences 1129 –Vote Option 1(2)
Stat 31, Section 1, Last Time T distribution –For unknown, replace with –Compute with TDIST & TINV (different!) Paired Samples –Similar to above, work.
Stor 155, Section 2, Last Time Inference for Regression –Least Square Fits –Sampling distrib’ns for slope and intercept Regression Tool –Gave many useful.
Statistics 303 Chapter 4 and 1.3 Probability. The probability of an outcome is the proportion of times the outcome would occur if we repeated the procedure.
Last Time Hypothesis Testing –Yes – No Questions –Assess with p-value P[what saw or m.c. | Boundary] –Interpretation –Small is conclusive –1-sided vs.
Probability, contd. Learning Objectives By the end of this lecture, you should be able to: – Describe the difference between discrete random variables.
1 Chapter 5 Sampling Distributions. 2 The Distribution of a Sample Statistic Examples  Take random sample of students and compute average GPA in sample.
Applied Business Forecasting and Regression Analysis Review lecture 2 Randomness and Probability.
GrowingKnowing.com © Expected value Expected value is a weighted mean Example You put your data in categories by product You build a frequency.
PROBABILITY CONCEPTS Key concepts are described Probability rules are introduced Expected values, standard deviation, covariance and correlation for individual.
5.3 Random Variables  Random Variable  Discrete Random Variables  Continuous Random Variables  Normal Distributions as Probability Distributions 1.
Outline Random processes Random variables Probability histograms
Stor 155, Section 2, Last Time Review…. Stat 31 Final Exam: Date & Time: Tuesday, May 8, 8:00-11:00 Last Office Hours: Thursday, May 3, 12:00 - 5:00 Monday,
Chapter 7 Random Variables
Stat 155, Section 2, Last Time Binomial Distribution –Normal Approximation –Continuity Correction –Proportions (different scale from “counts”) Distribution.
Section 10.1 Confidence Intervals
Stat 155, Section 2, Last Time Interpreted Midterm Results Variance of Random Variables –From Probability Table –Properties: Ignores shift Multiples comes.
A random variable is a variable whose values are numerical outcomes of a random experiment. That is, we consider all the outcomes in a sample space S and.
Stat 31, Section 1, Last Time Independence –Special Case of “And” Rule –Relation to Mutually Exclusive Random Variables –Discrete vs. Continuous –Tables.
Welcome to MM570 Psychological Statistics
Inference: Probabilities and Distributions Feb , 2012.
Random Variables Ch. 6. Flip a fair coin 4 times. List all the possible outcomes. Let X be the number of heads. A probability model describes the possible.
Sampling and estimation Petter Mostad
AP Statistics Chapter 16. Discrete Random Variables A discrete random variable X has a countable number of possible values. The probability distribution.
Administrative Matters Midterm II Results Take max of two midterm scores:
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
Stat 31, Section 1, Last Time Big Rules of Probability –The not rule –The or rule –The and rule P{A & B} = P{A|B}P{B} = P{B|A}P{A} Bayes Rule (turn around.
1 Stat 350 Lab Session GSI: Yizao Wang Section 016 Mon 2pm30-4pm MH 444-D Section 043 Wed 2pm30-4pm MH 444-B.
A random variable is a variable whose values are numerical outcomes of a random experiment. That is, we consider all the outcomes in a sample space S and.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
A statistic from a random sample or randomized experiment is a random variable. The probability distribution of this random variable is called its sampling.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Business Statistics,
Simulations with Binomials Mean and S.D. of Binomials Section
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
Chap 4-1 Chapter 4 Using Probability and Probability Distributions.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Probability Distributions Chapter 6.
Discrete Probability Distributions
Last Time Proportions Continuous Random Variables Probabilities
Final Examination Thursday, April 30, 4:00 – 7:00
Stat 31, Section 1, Last Time Sampling Distributions
The binomial applied: absolute and relative risks, chi-square
Bayes Net Learning: Bayesian Approaches
Stat 217 – Day 28 Review Stat 217.
Advanced Placement Statistics Section 7.2: Means & Variance of a
Putting It All Together: Which Method Do I Use?
Producing good data through sampling and experimentation
Presentation transcript:

Stor 155, Section 2, Last Time Prediction in Regression –Given new point X 0, predict Y 0 –Confidence interval for mean –Prediction Interval for value Review…

Stat 31 Final Exam: Date & Time: Tuesday, May 8, 8:00-11:00 Last Office Hours: Thursday, May 3, 12:00 - 5:00 Monday, May 7, 10:00 - 5:00 & by appointment (earlier) Bring with you, to exam: Single (8.5" x 11") sheet of formulas Front & Back OK

Review Slippery Issues Major Confusion: Population Quantities Vs. Sample Quantities

Response to a Request You said at the end of today's class that you would be willing to take class time to "reteach" concepts that might still be unknown to us. Well, in my case, it seems that probability and probability distribution is a hard concept for me to grasp. On the first midterm, I missed … and on the second midterm, I missed … I seem to be able to grasp the other concepts involving binomial distribution, normal distribution, t- distribution, etc fairly well, but probability is really killing me on the exams. If you could reteach these or brush up on them I would greatly appreciate it.

Levels of Probability Simple Events –Big Rules of Prob (Not, And, Or) –Bayes Rule Distributions (in general) –Defined by Tables Summary of discrete probs Get probs by summing –Uniform Get probs by finding areas

Levels of Probability Distributions (in general) Named (& Useful) Distributions –Binomial Discrete distribution of Counts Compute with BINOMDIST & Normal Approx. –Normal Continuous distribution of Averages Compute with NORMDIST & NORMINV –T Similar to Normal, for estimated s.d. Compute with TDIST & TINV

Detailed Look Simple Events: Big Rules of Probability: –Not Rule ( 1 – P{opposite}) –Or Rule (glasses – football) –And rule (multiply conditional prob’s) –Use in combination for real power Bayes Rule –Turn around conditional probabilities –Write hard ones in terms of easy ones –Recall surprising disease testing result

Detailed Look Distributions (in general) –Defined by Tables Summary of discrete probs Get probs by summing Easy to forget after so much other stuff… Studied in Notes: 2/20, 2/22, 3/1 Some highlights…

Highlights of Dist’ns in Tables Distributions (in general) –Defined by Tables Summary of discrete probs Get probs by summing Easy to forget after so much other stuff… Studied in Notes: 2/20, 2/22, 3/1 Some highlights…

Random Variables Die rolling example, for X = “net winnings”: Win $9 if 5 or 6, Pay $4, if 1, 2 or 4 Probability Structure of X is summarized by: P{X = 9} = 1/3 P{X = -4} = 1/2 P{X = 0} = 1/6 Convenient form: a table Winning9-40 Prob.1/31/21/6

Summary of Prob. Structure In general: for discrete X, summarize “distribution” (i.e. full prob. Structure) by a table: Where: i.All are between 0 and 1 ii. (so get a prob. funct’n as above) Valuesx1x1 x2x2 …xkxk Prob.p1p1 p2p2 …pkpk

Summary of Prob. Structure Summarize distribution, for discrete X, by a table: Power of this idea: Get probs by summing table values Special case of disjoint OR rule Valuesx1x1 x2x2 …xkxk Prob.p1p1 p2p2 …pkpk

Summary of Prob. Structure E.g. Die Rolling game above: P{X = 9} = 1/3 P{X < 2} = P{X = 0} + P{X = -4} =1/6+1/2 = 2/3 P{X = 5} = 0 (not in table!) Winning9-40 Prob.1/31/21/6

Summary of Prob. Structure E.g. Die Rolling game above: Winning9-40 Prob.1/31/21/6

Mean of Discrete Distributions Frequentist approach to mean: a weighted average of values where weights are probabilities

Mean of Discrete Distributions E.g. Above Die Rolling Game: Mean of distribution = = (1/3)(9) + (1/6)(0) +(1/2)(-4) = = 1 Interpretation: on average (over large number of plays) winnings per play = $1 Conclusion: should be very happy to play Winning9-40 Prob.1/31/21/6

Variance of Random Variables So define: Variance of a distribution As: random variable

Variance of Random Variables E. g. above game: =(1/2)*5^2+(1/6)*1^2+(1/3)*8^2 Note: one acceptable Excel form, e.g. for exam (but there are many) Winning9-40 Prob.1/31/21/6

Standard Deviation Recall standard deviation is square root of variance (same units as data) E. g. above game: Standard Deviation =sqrt((1/2)*5^2+(1/6)*1^2+(1/3)*8^2) Winning9-40 Prob.1/31/21/6

And Now for Something Completely Different Thought Provoking Movie…

Review Slippery Issues Major Confusion: Population Quantities Vs. Sample Quantities

Recall Pepsi Challenge In class taste test: Removed bias with randomization Double blind approach Asked which was: –Better –Sweeter –which

Recall Pepsi Challenge Results summarized in Recall Eyeball impressions: a. Perhaps no consensus preference between Pepsi and Coke? –Is 54% "significantly different from 50%? Result of "marketing research"???

Recall Pepsi Challenge b. Perhaps no consensus as to which is sweeter? Very different from the past, when Pepsi was noticeably sweeter This may have driven old Pepsi challenge phenomenon Coke figured this out, and matched Pepsi in sweetness

Recall Pepsi Challenge c. Most people believe they know –Serious cola drinkers, because now flavor driven –In past, was sweetness driven, and there were many advertising caused misperceptions! d. People tend to get it right or not??? (less clear) –Overall 71% right. Seems like it, but again is that significantly different from 50%?

Recall Pepsi Challenge e. Those who think they know tend to be right??? –People who thought they knew: right 71% of the time f. Those who don't think they know seem to right as well. Wonder why? –People who didn't: also right 70% of time? Why? "Natural sampling variation"??? –Any difference between people who thought they knew, and those who did not think so?

Recall Pepsi Challenge g. Coin toss was fair (or is 57% heads significantly different from %50?) How accurate are those ideas? Will build tools to assess this Called “hypo tests” and “P-values” Revisit this now

Pepsi – Coke Taste Test Data and Analysis: Hypothesis Tests: Proportions based (i.e. think about p) Interesting Hypos: Recall Sampling Distribution:

Pepsi – Coke Taste Test Data and Analysis: P-value: P{what saw or m.c. | p = 0.5} Under assumption p = 0.5, So compute P-value as: Area obs’d

Pepsi – Coke Taste Test Data and Analysis: Compute P-value as: Area obs’d =NORMDIST(ABS(phat – 0.5),0, 1/(2*SQRT(n),TRUE)

Pepsi – Coke Taste Test Conclusions (P-values): No consensus, Pepsi vs. Coke (0.46) No consensus, Sweeter (0.81) Most think know (e-5, very strong) Get It Right (0.0006, very strong) Fair Coin Toss (0.21, seems OK) Thought Right, Were Right (0.003,yes) Thought Not, Were Right (0.09, perhaps too modest?)

Pepsi – Coke Taste Test Some interesting history of this test: First Attempts –Pepsi was preferred –Pepsi was sweeter –Many got it wrong (even if thought new) –Reason for “Pepsi challenge”? New Coke Came Out –Response to Pepsi Challenge?

Pepsi – Coke Taste Test Some interesting history of this test: New Coke Came Out –People thought they hated it… –Anger over changing the flavor… –So Coke Classic came out Fun for me: New Coke vs. Coke Classic

Pepsi – Coke Taste Test Some interesting history of this test: Taste test: New Coke vs. Coke Classic –New Coke preferred to Coke Classic! –New Coke was sweeter –Most got it wrong (even if thought new) Changes Over Time –Appears Coke Classic slowly morphed into New Coke…