Elementary statistics Ruth Anderson CSE 160 University of Washington 1.

Slides:



Advertisements
Similar presentations
Elementary statistics Michael Ernst CSE 190p University of Washington.
Advertisements

Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.
A statistical hypothesis is an assumption about a population parameter. This assumption may or may not be true
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Objectives Use simulations and hypothesis testing to compare treatments from a randomized experiment.
+ Chapter 10 Section 10.4 Part 2 – Inference as Decision.
Probability - 1 Probability statements are about likelihood, NOT determinism Example: You can’t say there is a 100% chance of rain (no possibility of.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Elementary statistics Ruth Anderson CSE 140 University of Washington 1.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Significance Testing Statistical testing of the mean (z test)
Chapter 8 Hypothesis Testing I. Chapter Outline  An Overview of Hypothesis Testing  The Five-Step Model for Hypothesis Testing  One-Tailed and Two-Tailed.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Chapter 3 Investigating Independence Objectives Students will be able to: 1) Understand what it means for attempts to be independent 2) Determine when.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
1 Statistics 300: Elementary Statistics Section 8-2.
AP Stat Review Descriptive Statistics Grab Bag Probability
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
CHAPTER 9 Testing a Claim
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chapter 8 Hypothesis Testing I. Significant Differences  Hypothesis testing is designed to detect significant differences: differences that did not occur.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Introduction to Probability – Experimental Probability.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
T tests comparing two means t tests comparing two means.
Chapter 13 Understanding research results: statistical inference.
Probability Class Causation and Probability We are interested in finding the effect of Dr. Wong’s exploring teaching methodology on the statistics.
The Chi Square Equation Statistics in Biology. Background The chi square (χ 2 ) test is a statistical test to compare observed results with theoretical.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
Hypothesis Testing and Statistical Significance
Modern Languages Row A Row B Row C Row D Row E Row F Row G Row H Row J Row K Row L Row M
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
The Fine Art of Knowing How Wrong You Might Be. To Err Is Human Humans make an infinitude of mistakes. I figure safety in numbers makes it a little more.
Chapter 9: Hypothesis Tests for One Population Mean 9.5 P-Values.
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
Introduction to Hypothesis Testing: The Binomial Test
Section Testing a Proportion
One-Sample Inference for Proportions
Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.
Statistical inference: distribution, hypothesis testing
Introduction to Hypothesis Testing: The Binomial Test
CHAPTER 9 Testing a Claim
Inferential Statistics
INTEGRATED LEARNING CENTER
Elementary statistics
Elementary statistics
Making Data-Based Decisions
CHAPTER 9 Testing a Claim
Hypothesis Testing and Comparing Two Proportions
CHAPTER 9 Testing a Claim
Chapter 3 Probability Sampling Theory Hypothesis Testing.
Hypothesis Testing and Confidence Intervals (Part 2): Cohen’s d, Logic of Testing, and Confidence Intervals Lecture 9 Justin Kern April 9, 2018.
CHAPTER 9 Testing a Claim
Calculating Probabilities
CHAPTER 9 Testing a Claim
Elementary statistics
CHAPTER 9 Testing a Claim
Presentation transcript:

Elementary statistics Ruth Anderson CSE 160 University of Washington 1

A dice-rolling game Two players each roll a die The higher roll wins – Goal: roll as high as you can! Repeat the game 6 times 2

Hypotheses regarding the outcome Luck Fraud – loaded die – inaccurate reporting How likely is luck? How do we decide? 3

Questions that statistics can answer I am flipping a coin. Is it fair? How confident am I in my answer? I have two bags of beans, each containing some black and some white beans. I have a handful of beans. Which bag did the handful come from? I have a handful of beans, and a single bag. Did the handful come from that bag? Does this drug improve patient outcomes? Which website design yields greater revenue? Which baseball player should my team draft? What premium should an insurer charge? Which chemical process leads to the best-tasting beer? 4

What can happen when you roll a die? What is the likelihood of each? 5

What can happen when you roll two dice? How likely are you to roll 11 or higher? This probability is known as the “p value”. 6

How to compute p values Via a statistical formula – Requires you to make assumptions and know which formula to use Computationally (simulation) – Run many experiments – Count the fraction with a better result Requires a metric/measurement for “better” – Requires you to be able to run the experiments – We will use this approach exclusively 7

Analogy between hypothesis testing and mathematical proofs “The underlying logic [of hypothesis testing] is similar to a proof by contradiction. To prove a mathematical statement, A, you assume temporarily that A is false. If that assumption leads to a contradiction, you conclude that A must actually be true.” From the book Think Statistics by Allen Downey 8

Interpreting p values p value of 5% or less = statistically significant – This is a convention; there is nothing magical about 5% Two types of errors may occur in statistical tests: false positive (or false alarm or Type I error): no real effect, but report an effect (through good/bad luck or coincidence) – If no real effect, a false positive occurs about 1 time in 20 false negative (or miss or Type II error): real effect, but report no effect (through good/bad luck or coincidence) The larger the sample, the less the likelihood of a false positive or negative 9

A false positive 10

11

Summary of statistical methodology 1.Decide on a metric (bigger value = better) 2.Observe what you see in the real world 3.Hypothesize that what you saw is normal/typical This is the “null hypothesis” 4.Simulate the real world many times 5.How different is what you observed from the simulations? What percent of the simulation values are the real world values bigger than? 6.If the percentage is 95% or more, reject the null hypothesis 12

A common error 1.Observe what you see in the real world 2.Decide on a metric (bigger value = better) This is backwards For any observation, there is something unique about it. Example: Roll dice, then be amazed because what are the odds you would get exactly that combination of rolls? 13

Correlation  causation Ice cream sales and murder rates are correlated 14

Statistical significance  practical importance 15

Don’t trust your intuition People have very bad statistical intuition It’s much better to follow the methodology and do the experiments 16