Stat 31, Section 1, Last Time Independence –Special Case of “And” Rule –Relation to Mutually Exclusive Random Variables –Discrete vs. Continuous –Tables.

Slides:



Advertisements
Similar presentations
Mean, Proportion, CLT Bootstrap
Advertisements

Sta220 - Statistics Mr. Smith Room 310 Class #14.
Rules for Means and Variances
Sampling Distributions
6-1 Stats Unit 6 Sampling Distributions and Statistical Inference - 1 FPP Chapters 16-18, 20-21, 23 The Law of Averages (Ch 16) Box Models (Ch 16) Sampling.
Stat 155, Section 2, Last Time Big Rules of Probability: –Not Rule ( 1 – P{opposite}) –Or Rule (glasses – football) –And rule (multiply conditional prob’s)
MM207 Statistics Welcome to the Unit 7 Seminar Prof. Charles Whiffen.
Expected Value, the Law of Averages, and the Central Limit Theorem
Chapter 5 Understanding Randomness
Stat 155, Section 2, Last Time Producing Data: How to Sample? –Placebos –Double Blind Experiment –Random Sampling Statistical Inference –Population “parameters”,,
Linear Transformation and Statistical Estimation and the Law of Large Numbers Target Goal: I can describe the effects of transforming a random variable.
AP Statistics Chapter 16. Discrete Random Variables A discrete random variable X has a countable number of possible values. The probability distribution.
Confidence Intervals for Proportions
Section 5.1 Random Variables
1 Many people debate basic questions of chance in games such as lotteries. The Monty Hall problem is a fun brain teaser that Marilyn vos Savant addressed.
1 Business 260: Managerial Decision Analysis Professor David Mease Lecture 2 Agenda: 1) Assign Homework #1 (due Thursday 3/19) 2) Basic Probability (Stats.
Variance Fall 2003, Math 115B. Basic Idea Tables of values and graphs of the p.m.f.’s of the finite random variables, X and Y, are given in the sheet.
Determining the Size of
Chapter 19: Confidence Intervals for Proportions
Albert Gatt Corpora and Statistical Methods. Probability distributions Part 2.
The Marriage Problem Finding an Optimal Stopping Procedure.
Random Variables A random variable A variable (usually x ) that has a single numerical value (determined by chance) for each outcome of an experiment A.
Stat 155, Section 2, Last Time Pepsi Challenge: When are results “significant” vs. “random”? Independence –Conditional Prob’s = Unconditional Prob’s –Special.
Application of Random Variables
Unit 4 Starters. Starter Suppose a fair coin is tossed 4 times. Find the probability that heads comes up exactly two times.
Chapter 3 Section 3.5 Expected Value. When the result of an experiment is one of several numbers, (sometimes called a random variable) we can calculate.
1 9/8/2015 MATH 224 – Discrete Mathematics Basic finite probability is given by the formula, where |E| is the number of events and |S| is the total number.
Chapter 6 Random Variables. Make a Sample Space for Tossing a Fair Coin 3 times.
The mean of a set of observations is their ordinary average, whereas the mean of a random variable X is an average of the possible values of X The mean.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
Theory of Probability Statistics for Business and Economics.
AP Statistics Section 7.2A Mean & Standard Deviation of a Probability Distribution.
Lecture 9. If X is a discrete random variable, the mean (or expected value) of X is denoted μ X and defined as μ X = x 1 p 1 + x 2 p 2 + x 3 p 3 + ∙∙∙
A study of education followed a large group of fourth-grade children to see how many years of school they eventually completed. Let x be the highest year.
5.3 Random Variables  Random Variable  Discrete Random Variables  Continuous Random Variables  Normal Distributions as Probability Distributions 1.
1 M14 Expected Value, Discrete  Department of ISM, University of Alabama, ’95,2002 Lesson Objectives  Understand the meaning of “expected value.” (Know.
Outline Random processes Random variables Probability histograms
Stor 155, Section 2, Last Time Prediction in Regression –Given new point X 0, predict Y 0 –Confidence interval for mean –Prediction Interval for value.
The mean of a set of observations is their ordinary average, whereas the mean of a random variable X is an average of the possible values of X The mean.
Stat 155, Section 2, Last Time Binomial Distribution –Normal Approximation –Continuity Correction –Proportions (different scale from “counts”) Distribution.
7.2 Means and Variances of Random Variables.  Calculate the mean and standard deviation of random variables  Understand the law of large numbers.
7.2 Means and variances of Random Variables (weighted average) Mean of a sample is X bar, Mean of a probability distribution is μ.
Probability Distribution
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Chapter 19 Confidence intervals for proportions
QR 32 Section #6 November 03, 2008 TA: Victoria Liublinska
Welcome to MM570 Psychological Statistics
L56 – Discrete Random Variables, Distributions & Expected Values
AP Statistics Chapter 16. Discrete Random Variables A discrete random variable X has a countable number of possible values. The probability distribution.
Section 7.2 P1 Means and Variances of Random Variables AP Statistics.
Stat 31, Section 1, Last Time Big Rules of Probability –The not rule –The or rule –The and rule P{A & B} = P{A|B}P{B} = P{B|A}P{A} Bayes Rule (turn around.
MATH 256 Probability and Random Processes Yrd. Doç. Dr. Didem Kivanc Tureli 14/10/2011Lecture 3 OKAN UNIVERSITY.
A random variable is a variable whose values are numerical outcomes of a random experiment. That is, we consider all the outcomes in a sample space S and.
Discrete Probability Distributions. Random Variable A random variable X takes on a defined set of values with different probabilities. For example, if.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
Chapter 15 Random Variables. Introduction Insurance companies make bets. They bet that you are going to live a long life. You bet that you are going to.
Copyright © 2010 Pearson Education, Inc. Chapter 16 Random Variables.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
Statistics 16 Random Variables. Expected Value: Center A random variable assumes a value based on the outcome of a random event. –We use a capital letter,
Chapter 7: Random Variables 7.2 – Means and Variance of Random Variables.
The Law of Averages. What does the law of average say? We know that, from the definition of probability, in the long run the frequency of some event will.
Lesson 96 – Expected Value & Variance of Discrete Random Variables HL2 Math - Santowski.
Stat 31, Section 1, Last Time Linear transformations
Introduction to Probability Distributions
Stat 31, Section 1, Last Time Sampling Distributions
Last Time Histograms Notions of Center
Probability Distributions; Expected Value
Means and Variances of Random Variables
Presentation transcript:

Stat 31, Section 1, Last Time Independence –Special Case of “And” Rule –Relation to Mutually Exclusive Random Variables –Discrete vs. Continuous –Tables of Probabilities for Discrete R.V.s –Areas as Probabilities for Continuous R.V.s

Means and Variances (of random variables) Text, Sec. 4.4 Idea: Above population summaries, extended from populations to probability distributions Connection: frequentist view Make repeated draws, from the distribution

Discrete Prob. Distributions Recall table summary of distribution: Taken on by random variable X, Probabilities: P{X = x i } = p i (note: big difference between X and x!) Valuesx1x1 x2x2 …xkxk Prob.p1p1 p2p2 …pkpk

Discrete Prob. Distributions Table summary of distribution: Recall power of this: Can compute any prob., by summing p i Valuesx1x1 x2x2 …xkxk Prob.p1p1 p2p2 …pkpk

Mean of Discrete Distributions Frequentist approach to mean:

Mean of Discrete Distributions Frequentist approach to mean: a weighted average of values where weights are probabilities

Mean of Discrete Distributions E.g. Above Die Rolling Game: Mean of distribution = = (1/3)(9) + (1/6)(0) +(1/2)(-4) = = 1 Interpretation: on average (over large number of plays) winnings per play = $1 Conclusion: should be very happy to play Winning9-40 Prob.1/31/21/6

Mean of Discrete Distributions Terminology: mean is also called: “Expected Value” E.g. in above game “expect” $1 (per play) (caution: on average over many plays)

Expected Value HW: (2.45) 4.61

Expected Value An application of Expected Value: Assess “fairness” of games (e.g. gambling) Major Caution: Expected Value is not what is expected on one play, but instead is average over many plays. Cannot say what happens in one or a few plays, only in long run average

Expected Value E.g. Suppose have $5000, and need $10,000 (e.g. you owe mafia $5000, clean out safe at work. If you give to mafia, you go to jail, so decide to try to raise additional $5000 by gambling) And can make even bets, where P{win} = 0.48 (can really do this, e.g. bets on Red in roulette at a casino)

Expected Value E.g. Suppose have $5000, and need $10,000 and can make even bets, w/ P{win} = 0.48 Pressing Practical Problem: Should you make one large bet? Or many small bets? Or something in between?

Expected Value E.g. Suppose have $5000, and need $10,000 and can make even bets, w/ P{win} = 0.48 Expected Value analysis: E(Winnings) = P{lose} x $0 + P{win} x $2 = 0.52 x $ x $2 = = $0.96 Thus expect to lose $0.04 for every dollar bet

Expected Value E.g. Suppose have $5000, and need $10,000 and can make even bets, w/ P{win} = 0.48 Expect to lose $0.04 for every dollar bet This is why gambling is very profitable (for the casinos, been to Las Vegas?) They play many times So expected value works for them And after many bets, you will surely lose So should make fewer, not more bets?

Expected Value E.g. Suppose have $5000, and need $10,000 and can make even bets, w/ P{win} = 0.48 Another view: Strategy P{get $10,000} one $5000 bet 0.48 ~ 1/2 two $2500 bets ~ (0.48) 2 ~ 1/4 four $1250 bets ~ (0.48) 2 ~ 1/16 “many” “no chance”

Expected Value E.g. Suppose have $5000, and need $10,000 and can make even bets, w/ P{win} = 0.48 Surprising (?) answer: Best to make one big bet Not much fun… But best chance at winning Casino Folklore: This really happens Folks walk in, place one huge bet….

Expected Value Warning about Expected Value: Excellent for some things, but not all decisions e.g. if will play many times e.g. if only play once (so don’t have long run)

Expected Value Real life decisions against Expected Value: 1.State Lotteries –State sells tickets –Keeps about half of $$$ –Gives rest to ~ one (randomly chosen) player –So Expected Value is clearly negative –Why do people play? Totally irrational? –Players buy faint hope of humongous gain –Could be worth joy of thinking about it

Expected Value Real life decisions against Expected Value: 1.State Lotteries –Want one in North Carolina? –You will be asked to decide Interesting (and deep) philosophical balances: –Only totally voluntary tax –Yet tax burden borne mostly by poor –Is that fair? –But we lose revenue to other states…

Expected Value Real life decisions against Expected Value: 2.Casino Gambling –Always lose in long run (expected value…) –Yet people do it. Are they nuts? –Depends on how many times they play –If really enjoy being ahead sometimes –Then could be worth price paid for the thrill –Serious societal challenge: (some are totally consumed by thrill)

Expected Value Real life decisions against Expected Value: 3.Insurance –Everyone pays about 2 x Expected Loss –Insurance Company keeps the rest! –So very profitable. –But e.g. car insurance is required by law! –Sensible, since if lose, can lose very big –Yet purchase is totally against Expected Value –OK, since you only play once (not many times) –Insurance Co’s play many times (Expected Value works for them) –So they are an evening out mechanism

And now for something completely different Interesting Suggestion / Request By Katie Baer Well supported with Data / Analysis!

SIMPLE MATH: Date of the 2005 NCAA Men’s Basketball Tournament Final: Monday, April 4 th, 2005 Date of the Stat 31 Midterm #2: Tuesday, April 5 th, 2005

WHY SHOULD STEVE RESCHEDULE THE EXAM? STATISTICAL EVIDENCE:

BinFrequency Probability of a #1 Seed Reaching the Final Four Final Four Data: P{FF} = 43/104 =

How many of these #1 seeds actually win the Tourney? P{Champ} = 12/25 = %

However, this assumes that North Carolina has an equal probability of winning the Tourney as the other predicted #1 Seeds (Illinois, Wake Forest, and Boston College) NBC Sports, msnbc.com

So we all know that… Illinois is undefeated Illinois beat Wake Forest and is ranked #1 in the Big 10 Wake Forest beat North Carolina North Carolina is ranked #1 in the ACC and is 4-2 versus ranked teams Boston College has lost only one game and is #1 in the Big Least, I mean East

How do we determine which team is better? RPI is derived from three component factors: Div. I winning percentage (25)%, schedule strength (50)%; and opponent's schedule strength (25)%. How do the #1 Seeds’ RPI’s compare to the rest of the Top 25?

As expected, teams with higher rankings have higher ranking RPI’s. This indicates that the best teams are going to be at the bottom left corner of the graph. BUT… RPI’s are not an entirely accurate way of measuring team’s ability (as seen with mediocre R^2) RPI does not take into account factors such as margin of victory, location of game, etc.

A different approach… A study found that approximately 62.8% of all college students consume alcohol on a regular basis * Considering that this percentage does not take into account specific drinking statistics at UNC nor the fact that a national championship is at stake, this is a conservative figure Number of students in Steve’s Stat. 31 class: 92 (from class exam data) 92*0.628 ≈ 58 people This number estimates the number of people enrolled in Stat 31, section 1 that consume alcohol on a regular basis

A study by the NCAA showed that 87% of university students strongly believe that supporting collegiate sports is an integral part of college life Taking into account that watching sports and drinking alcohol are major aspects of college students’ lives, what is the probability that a college student will support college sports AND consume alcohol at the same time? P{A} = 0.628, P{S} = 0.87 P {A and S} = P{A}*P{S} = 0.628*0.87 = (54.6%) THUS, over half the class (approx. 50 people) will probably drink alcohol the night of the final game of the NCAA Tourney

Conclusions: Carolina has a considerable chance of reaching the Final Four and winning the NCAA tourney as a #1 seed as seen in past tournament data They have fierce competition, as seen with in the graph of RPI vs. Rank, for the title Over half of the class will probably consume alcohol the night of April 4 th, resulting in difficulty in studying for a midterm scheduled the next day Note that these figures are very conservative percentages, given that students will most likely drink more when their team is in the final game and especially if it is a close, exciting match-up

PLEASE MOVE THE TEST, STEVE! GO HEELS!!!

And now for something completely different Now about that exam change request… It is possible But we all need to agree Some choices: Thursday, April 7 or Tuesday, April 12 Please objections to either

Functions of Expected Value Important Properties of the Mean: i.Linearity: Why? i. e. mean “preserves linear transformations”

Functions of Expected Value Important Properties of the Mean: ii.summability: Why is harder, so won’t do here i. e. can add means to get mean of sums i. e. mean “preserves sums”

Functions of Expected Value E. g. above game: If we “double the stakes”, then want: “mean of 2X” Recall $1 before i.e. have twice the expected value Winning9-40 Prob.1/31/21/6

Functions of Expected Value E. g. above game: If we “play twice”, then have Same as above? But isn’t playing twice different from doubling stake? Yes, but not in means Winning9-40 Prob.1/31/21/6

Functions of Expected Value HW: (70)

Indep. Of Random Variables Independence: Random Variables X & Y are independent when knowledge of value of X does not change chances of values of Y

Indep. Of Random Variables HW: 4.64 (Indep., Dep., Dep.) 4.65

Independence Application: Law of Large Numbers IF are independent draws from the same distribution, with mean, THEN: (needs more mathematics to make precise, but this is the main idea)

Independence Application: Law of Large Numbers Note: this is the foundation of the “frequentist view of probability” Underlying thought experiment is based on many replications, so limit works….

Variance of Random Variables Again consider discrete random variables: Where distribution is summarized by a table, Valuesx1x1 x2x2 …xkxk Prob.p1p1 p2p2 …pkpk

Variance of Random Variables Again connect via frequentist approach:

Variance of Random Variables Again connect via frequentist approach:

Variance of Random Variables So define: Variance of a distribution As: random variable

Variance of Random Variables E. g. above game: =(1/2)*5^2+(1/6)*1^2+(1/3)*8^2 Note: one acceptable Excel form, e.g. for exam (but there are many) Winning9-40 Prob.1/31/21/6

Standard Deviation Recall standard deviation is square root of variance (same units as data) E. g. above game: Standard Deviation =sqrt((1/2)*5^2+(1/6)*1^2+(1/3)*8^2) Winning9-40 Prob.1/31/21/6

Variance of Random Variables HW: C14: Find the variance and standard deviation of the distribution in ( 1.21, 1.10 )

Properties of Variance i.Linear transformation I.e. “ignore shifts” var( ) = var ( ) (makes sense) And scales come through squared (recall s.d. on scale of data, var is square)

Properties of Variance ii.For X and Y independent (important!) I. e. Variance of sum is sum of variances Here is where variance is “more natural” than standard deviation:

Properties of Variance E. g. above game: Recall “double the stakes”, gave same mean, as “play twice”, but seems different Doubling: Play twice, independently: Note: playing more reduces uncertainty (var quantifies this idea, will do more later) Winning9-40 Prob.1/31/21/6

Variance of Random Variables HW: 4.74 ( (a) 550, 5.7, (b) 0, 5.7, (c) 1022, 10.3 ) 4.75