Stat 13, Thu 5/3/12. 1. Correction on normal notation. 2. Normal percentiles. 3. Normal probability plots. 4. Bernoulli and Binomial random variables.

Slides:



Advertisements
Similar presentations
Assumptions and Conditions
Advertisements

Ch. 8 – Practical Examples of Confidence Intervals for z, t, p.
Chapter 8: Binomial and Geometric Distributions
CHAPTER 13: Binomial Distributions
Section 7.4 (partially). Section Summary Expected Value Linearity of Expectations Independent Random Variables.
CONTINUOUS RANDOM VARIABLES These are used to define probability models for continuous scale measurements, e.g. distance, weight, time For a large data.
Normal Distribution ch5.
Chapter 6 The Normal Distribution
1 Sociology 601, Class 4: September 10, 2009 Chapter 4: Distributions Probability distributions (4.1) The normal probability distribution (4.2) Sampling.
Discrete probability distributions Chapter 6 - Sullivan
Discrete Random Variables: The Binomial Distribution
Probability Distributions: Finite Random Variables.
Objectives (BPS chapter 13) Binomial distributions  The binomial setting and binomial distributions  Binomial distributions in statistical sampling 
Fall 2013Biostat 5110 (Biostatistics 511) Week 7 Discussion Section Lisa Brown Medical Biometry I.
Examples of continuous probability distributions: The normal and standard normal.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
CHAPTER 6 Random Variables
Probability Models Chapter 17.
Chapter 5 Sampling Distributions
Stat 13, Tue 6/5/ Hand in hw7. 2. Practice problems. Final exam is Thur, 6/7, in class. It has 20 multiple choice questions. Only about 50% are on.
The Mean of a Discrete Probability Distribution
Stat 13, Thu 5/10/ CLT again. 2. CIs. 3. Interpretation of a CI. 4. Examples. 5. Margin of error and sample size. 6. CIs using the t table. 7. When.
June 10, 2008Stat Lecture 9 - Proportions1 Introduction to Inference Sampling Distributions for Counts and Proportions Statistics Lecture 9.
5.5 Distributions for Counts  Binomial Distributions for Sample Counts  Finding Binomial Probabilities  Binomial Mean and Standard Deviation  Binomial.
AP Statistics Section 8.1: The Binomial Distribution.
Stat 35b: Introduction to Probability with Applications to Poker Outline for the day: 1.Midterm. 2.Review of Bernoulli and binomial random variables. 3.Geometric.
Sampling distributions - for counts and proportions IPS chapter 5.1 © 2006 W. H. Freeman and Company.
Stat 13, Tue 5/8/ Collect HW Central limit theorem. 3. CLT for 0-1 events. 4. Examples. 5.  versus  /√n. 6. Assumptions. Read ch. 5 and 6.
Stat 13, Tue 5/15/ Hand in HW5 2. Review list. 3. Assumptions and CLT again. 4. Examples. Hand in Hw5. Midterm 2 is Thur, 5/17. Hw6 is due Thu, 5/24.
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
Introduction to Behavioral Statistics Probability, The Binomial Distribution and the Normal Curve.
Binomial Formulas Target Goal: I can calculate the mean and standard deviation of a binomial function. 6.3b h.w: pg 404: 75, 77,
Binomial Probability Distribution
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 7 Section 5 – Slide 1 of 21 Chapter 7 Section 5 The Normal Approximation to the Binomial.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 6: Random Variables Section 6.3 Day 1 Binomial and Geometric Random.
LECTURE 14 TUESDAY, 13 OCTOBER STA 291 Fall
1 Since everything is a reflection of our minds, everything can be changed by our minds.
Stat 13, Tue 5/1/ Midterm Expected value, lotto and roulette. 3. Normal calculations. Read ch4. Hw4 is due Tue, 5/8, and Midterm 2 is Thur,
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 6 Random Variables 6.3 Binomial and Geometric.
1 Chapter 2: The Normal Distribution 2.1Density Curves and the Normal Distributions 2.2Standard Normal Calculations.
© 2010 Pearson Prentice Hall. All rights reserved 7-1.
Binomial Distributions Chapter 5.3 – Probability Distributions and Predictions Mathematics of Data Management (Nelson) MDM 4U.
Stat 13, Thu 4/19/ Hand in HW2! 1. Resistance. 2. n-1 in sample sd formula, and parameters and statistics. 3. Probability basic terminology. 4. Probability.
Lesson The Normal Approximation to the Binomial Probability Distribution.
A statistic from a random sample or randomized experiment is a random variable. The probability distribution of this random variable is called its sampling.
Binomial Distributions Chapter 5.3 – Probability Distributions and Predictions Mathematics of Data Management (Nelson) MDM 4U Authors: Gary Greer (with.
Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.
CHAPTER 6 Random Variables
Supplemental Lecture Notes
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Binomial and Geometric Random Variables
Chapter 5 Joint Probability Distributions and Random Samples
Discrete random variable X Examples: shoe size, dosage (mg), # cells,…
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
Chapter 5 Sampling Distributions
ASV Chapters 1 - Sample Spaces and Probabilities
CHAPTER 6 Random Variables
Chapter 5 Sampling Distributions
Chapter 6: Random Variables
Chapter 5 Sampling Distributions
CHAPTER 6 Random Variables
Copyright © Cengage Learning. All rights reserved.
CHAPTER 6 Random Variables
CHAPTER 6 Random Variables
12/12/ A Binomial Random Variables.
Chapter 8: Binomial and Geometric Distributions
A statistic from a random sample or randomized experiment is a random variable. The probability distribution of this random variable is called its sampling.
Chapter 5: Sampling Distributions
Presentation transcript:

Stat 13, Thu 5/3/ Correction on normal notation. 2. Normal percentiles. 3. Normal probability plots. 4. Bernoulli and Binomial random variables. 5. E(x+y). 6. Independence and alleles. Hw4 is due Tue, 5/8. Midterm 2 is Thur, 5/17. When submitting hw, please make 4 stacks: section 1a, 1b, 1c, & 1d. Ignore the chapter on the “continuity correction”. Gradegrubbing procedure: if you would like a question (or more than one question) reevaluated, submit your exam or homework and a WRITTEN explanation of why you think you deserve more points and how many more points you think you deserve to your TA. The TA will then give it to me, and I will consider it, and then give it back to the TA to give back to you. 1

1. Correction on normal notation. Last time I said N(µ,  2 ) means a normal random variable with mean µ and sd . However, in ch4, your book just calls this N(µ,  ). I will use the book’s notation from now on, and I actually changed day7.ppt since last class, so it is now consistent with this as well. 2. Normal percentiles. The main point from last time was that if X is N(µ,  ), and Z =, then Z is N(0,1), i.e. standard normal. So, P(X < c) = P( < ) = P(Z < ), which you can find in the table in your book. Now, how about if, instead of P(X < c), you want to find c such that P(X < c) = k%? That is, suppose you want to find the kth percentile of your distribution? For instance, IQs are N(100, 15). What is your IQ if you are in the 90 th percentile? We want c, where 90% of people score less than c. That is, if X is a randomly chosen person, then we want c so that P(X < c) = 90%. Look in the body of the table til you see 90%. Find P(Z < 1.28) = ~ 90%. So, 90% = P(Z < 1.28) = P( < 1.28) = P(X - µ < 1.28  ) = P(X < 1.28  + µ) = P(X < 1.28(15) + 100) = P(X < 119.2). Thus, the answer is Previously, to convert to standard units, c -> (c-µ)/ , and here we know what the answer c is in standard units, so to convert c from standard units to IQ points, we do the opposite, i.e. c -> c  + µ. 2

3. Normal probability plots. How can you tell if a distribution is normal, or approximately normal? You could look at the histogram and see if it looks like the normal curve. With enough data, this isn’t a problem. However, with less data, it can be hard to tell. An alternative approach is the normal probability plot. It’s also called a normal Q-Q plot. The Qs stand for quantile. Sort the data, and for data point X i, it’s the i/n quantile of your data. Figure out what you’d expect the i/n quantile of the standard normal to be, call it q i, and plot X i vs. q i. (In most computer packages, the default is actually to plot q i vs. X i. Book calls q i n-scores. Linear means good fit. Curvature typically means one or both tails of the distribution are non-normal. 3

4

4. Bernoulli and Binomial random variables. Often a variable has only 2 possible outcomes, like a coin flip, or people’s genders, or if your response variable is whether someone has a disease or not. A useful convention is to score the responses 0 or 1. (not -1 or 1). Such random variables are then called Bernoulli, or Bernoulli (p), if p = probability of scoring 1. If X is Bernoulli (p), then E(X) = p, and SD(X) =. Now suppose you flip n coins, or sample n subjects and record their genders. Sometimes you are interested in the total number of 1s in your sample. The 0-1 convention is helpful, because this total number is the same as the sum of your sample values. If each obs. is 0 or 1, and Y = the total number of 1s in your sample, and if the observations are independent, then we say Y is a binomial(n,p) random variable. P(Y=k) is C(n,k) p k q n-k, where q = 1-p, for k = 0, 1, 2,..., n. For instance, flip 8 coins. P(Y=3) = P(HHHTTTTT or HTHTHTTT or...). There are C(8,3) different places you could put the H’s, and each such ordering has prob. p 3 q 5. If Y is binomial(n,p), then E(Y) = np, and SD(Y) =. 5

5. E(X+Y). In general, E(X+Y) = E(X) + E(Y), even if X and Y might not be independent! For example, suppose you bet $10 on number 32 in roullette. Your expected profit is 10 x -5.3 cents = -53 cents. Now suppose instead you put $5 on number 32 and $5 on 33. Your total expected profit from the two bets is E(profit on 32 + profit on 33) = E(profit on 32) + E(profit on 33) = 5 x x -5.3 = -53 cents. Now suppose you put $5 on number 32 on one spin, and then $5 on 33 on the next spin. Your total expected profit is still E(profit on 32 + profit on 33) = E(profit on 32) + E(profit on 33) = 5 x x -5.3 = -53 cents. 6

6. Independence and alleles. People often multiply allele frequencies to estimate the probability of a match. This assumes independence of alleles! P(randomly selected person matches allele #1 and allele #2 and... and allele #6) = P(randomly selected person matches allele #1) x P(randomly selected person matches allele #2) x... x P(randomly sel. person matches #6) = 12/240 x 37/750 x... This assumes independence! For instance, in the OJ Simpson trial, Bruce Weir multiplied probabilities to claim that there was a 1 in 140 million chance that a randomly selected person’s blood would have matched the blood found at the crime scene in all the tested alleles. The allele frequencies, however, were based on samples of just a few hundred people. Don’t assume independence unless there is a good reason to do so! (coins, dice, roullette spins, sampling with replacement, sampling from a large population, can verify that the events are independent because P(AB) = P(A)P(B), or you are told that the events or variables are independent). 7