Probability Lecture 2. Probability Why did we spend last class talking about probability? How do we use this?

Slides:



Advertisements
Similar presentations
Lecture Discrete Probability. 5.2 Recap Sample space: space of all possible outcomes. Event: subset of of S. p(s) : probability of element s of.
Advertisements

Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Chi-Square Test A fundamental problem is genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Segment 3 Introduction to Random Variables - or - You really do not know exactly what is going to happen George Howard.
Mathematics.
Binomial Distribution & Bayes’ Theorem. Questions What is a probability? What is the probability of obtaining 2 heads in 4 coin tosses? What is the probability.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Using Statistics to Analyze your Results
22C:19 Discrete Structures Discrete Probability Fall 2014 Sukumar Ghosh.
Chapter 3 Section 3.3 Basic Rules of Probability.
Probability Probability Principles of EngineeringTM
Statistics for the Social Sciences Psychology 340 Spring 2005 Sampling distribution.
Hypothesis testing 1.Make assumptions. One of them is the “hypothesis.” 2.Calculate the probability of what happened based on the assumptions. 3.If the.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Parameters and Statistics Probabilities The Binomial Probability Test.
Statistics for the Social Sciences Psychology 340 Spring 2005 Hypothesis testing.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
CHAPTER 6 Random Variables
Section 6.2 ~ Basics of Probability Introduction to Probability and Statistics Ms. Young.
Mathematics in Today's World
Bell Work: Collect like terms: x + y – 1 – x + y + 1.
Hypothesis Testing. A Research Question Everybody knows men are better drivers than women. Hypothesis: A tentative explanation that accounts for a set.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
1 Algorithms CSCI 235, Fall 2012 Lecture 9 Probability.
Apr. 8 Stat 100. To do Read Chapter 21, try problems 1-6 Skim Chapter 22.
Evidence Based Medicine
The Binomial Distribution Permutations: How many different pairs of two items are possible from these four letters: L, M. N, P. L,M L,N L,P M,L M,N M,P.
Individual values of X Frequency How many individuals   Distribution of a population.
Chapter 3 Section 3.2 Basic Terms of Probability.
Unit 8 Section : z Test for a Mean  Many hypotheses are tested using the generalized statistical formula: Test value = (Observed Value)-(expected.
Introduction In probability, events are either dependent or independent. Two events are independent if the occurrence or non-occurrence of one event has.
Associate Professor Arthur Dryver, PhD School of Business Administration, NIDA url:
Chi-Square Test A fundamental problem in genetics is determining whether the experimentally determined data fits the results expected from theory. How.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
The Big Picture: Counting events in a sample space allows us to calculate probabilities The key to calculating the probabilities of events is to count.
Chapter 20 Testing Hypothesis about proportions
22C:19 Discrete Structures Discrete Probability Spring 2014 Sukumar Ghosh.
Introduction to probability. Take out a sheet of paper and a coin (if you don’t have one I will give you one) Write your name on the sheet of paper. When.
POSC 202A: Lecture 4 Probability. We begin with the basics of probability and then move on to expected value. Understanding probability is important because.
Welcome to MM570 Psychological Statistics
Binomial Distribution
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 6 Random Variables 6.1 Discrete and Continuous.
Statistical Techniques
MATH 256 Probability and Random Processes Yrd. Doç. Dr. Didem Kivanc Tureli 14/10/2011Lecture 3 OKAN UNIVERSITY.
Discrete Math Section 16.3 Use the Binomial Probability theorem to find the probability of a given outcome on repeated independent trials. Flip a coin.
No Warm-Up today. You have a Quiz Clear your desk of everything but a calculator and something to write with.
1 Outline 1.Count data 2.Properties of the multinomial experiment 3.Testing the null hypothesis 4.Examples.
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
DRAWING INFERENCES FROM DATA THE CHI SQUARE TEST.
Hypothesis Tests Hypothesis Tests Large Sample 1- Proportion z-test.
Chi Square Pg 302. Why Chi - Squared ▪Biologists and other scientists use relationships they have discovered in the lab to predict events that might happen.
Binomial Probability Theorem In a rainy season, there is 60% chance that it will rain on a particular day. What is the probability that there will exactly.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
AP Statistics From Randomness to Probability Chapter 14.
The Fine Art of Knowing How Wrong You Might Be. To Err Is Human Humans make an infinitude of mistakes. I figure safety in numbers makes it a little more.
Lesson 10: Using Simulation to Estimate a Probability Simulation is a procedure that will allow you to answer questions about real problems by running.
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
Random Variables.
CHAPTER 12: Introducing Probability
Unit 1: Probability and Statistics
Probability Principles of Engineering
Probability Principles of Engineering
Statistics for the Social Sciences
Introduction In probability, events are either dependent or independent. Two events are independent if the occurrence or non-occurrence of one event has.
Chapter 9 Section 1 Probability Review.
Significance Tests: The Basics
WARM - UP After an extensive review of weather related accidents an insurance company concluded the following results: An accident has a 70% chance of.
Probability Probability Principles of EngineeringTM
Probability Principles of Engineering
Presentation transcript:

Probability Lecture 2

Probability Why did we spend last class talking about probability? How do we use this?

You’re the FDA A company wants you to approve a new drug They run an experimental trial –40 people have the disease –20 get drug, 20 get placebo –Random assignment –Conducted perfectly

You’re the FDA Results: –Placebo group: 10 of 20 live –Drug group: 11 of 20 live Does the drug work? –Would you approve it? –Why or why not?

You’re the FDA Different study, same design Results: –Placebo: 2 of 20 live –Drug: 18 of 20 live Does the drug work? –Would you approve it? –Why or why not?

You’re the FDA Different study, same design Results: –Placebo: 8 of 20 live –Drug: 12 of 20 live Does the drug work? –Would you approve it? –Why or why not? How big of a difference do we need?

Why probability Probability provides the answer –Set of agreed on rules –All based on mathematical formula

Example How many of you would accept the following wager: –If no two people in the class have the same birthday (month and day) you get an automatic A. –If two or more people in class have the same birthday, you get an automatic F. Not ethical for me to accept the wager

Example Would you have won?

Example Would you have won? What is the probability? –Not 60/365 –Think of the complement –How many possible pairs are there in the class? Me and each student = N First student and every other student = N-1 Second student and every remaining student = N-2 … Last two students == 1770

Example P of any pair matches is 1/365 = P any pair doesn’t match is –= We have 1770 pairs. –Remember the rule –Joint probability of all not matching is: –P(first pair not match)*P(second pair not match)*…*P(last pair not match)

What is random? What are the odds that the first flip is a heads? –½ –Each outcome is equally likely The second flip? –½ So what are the odds that both are? –Four outcomes: HH, HT, TH, TT so ¼ (each equally likely)

What is random? Odds the third flip is a heads? –½ Odds that all three are heads? –8 outcomes –HHH, HHT, HTH, HTT, THH, THT, TTH, TTT –So, 1/8 Odds the fourth flip is a heads? –½ All four? –1/16

What is random? Odds that five in a row are heads? –1/32 Odds that six in a row? –1/64 If we did this as a probability they would be: –0.5 –0.25 –0.125 – – – Each is the previous probability multiplied by 0.5

Example P of any pair matches is 1/365 = P any pair doesn’t match is –= We have 1770 pairs. –Remember the rule –Joint probability of all not matching is: –P(first pair not match)*P(second pair not match)*…*P(last pair not match) –= –= Seems likely that at least one would match

Rules of probability and math let us determine how likely an event is. Want to be able to determine “statistical significance” –Can we conclude that the pattern we see didn’t happen by chance?

What is “statistical significance?” First, let’s be clear about what statistical significance is NOT. A finding that a relationship between some X and some Y is “statistically significant” does NOT mean that the relationship is “strong.” (It might be strong, but not because it’s statistically significant.)

This is a common mistake Many people think that a “statistically significant” relationship is by definition a “strong” one. In fact, many people think that “statistical significance” IS ITSELF a test of the strength of the relationship. It’s not.

Then what is statistical significance? It is a probabilistic statement—typically, 95% confidence—that the relationship we observe in the sample, no matter how strong or weak, exists in the population.

But, as always… There is a 5% chance we could be wrong—that is, that despite what we observe in the sample, there really is no relationship in the population.

How do we demonstrate statistical significance? We perform something called “hypothesis testing.” We actually begin with a statement called the “null hypothesis.” It is always a statement that there is not a relationship between two variables.

Why a Null Hypothesis? We want to know if there is a relationship Our theory is not strong enough to tell us how large the effect is –Theory: Gender helps determine vote choice –Hypothesis: Women were more likely to vote for Obama than men were –Problem: How much more? We don’t know. How large of a difference would be big enough?

Null Hypothesis Big enough to not happen by chance Ok, but how much is enough to be “not by chance?” This is where probability comes in Anything is possible—the normal distributions is unbounded.

Null Hypothesis Everything may be possible, but everything is not probable We want to know the probability that a relationship could exist in the data by chance

Example: Gender Gap FemaleMale McCain281 (31%)230 (37%) Obama625 (69%)391 (63%

Probability If we make some assumptions we can calculate how probable any outcome is. What do we assume? –There is no difference between treatments –What the probability distribution is (this is technical and I will tell you what matters). With these, we can calculate P(data occurred by chance).

Probability But that isn’t exactly what we want to know. We want to know probability that there is a difference, this would be probability that there is no difference. Unfortunately that is as good as we can do

Probability So, what is the null hypothesis (since that is where this started)? It is the hypothesis that there is no relationship (thus, “null”). This is what we can test. It is the inverse of what we want to know. So, if our theory is right the null hypothesis is wrong and we will reject the null hypothesis. If our theory is wrong, we will accept the null hypothesis

What does this mean? How likely are we to see this by chance? If there were no difference between genders, the probability of seeing this difference is 0.01 FemaleMale McCain281 (31%) 230 (37%) Obama625 (69%) 391 (63%

0.01 That is pretty unlikely, but what does that mean? One of three things occurred 1.The data are wrong 2.We were really unlucky 3.The assumption of no relationship is wrong –Conclusion is the last one. We have a relationship.

Probability How unlikely does the null have to be for us to reject it? 1 out of 20 (5%) Why? Vestige of pre-computer days –Norm