Elementary statistics
UW CSE 160, Winter 2017
A dice-rolling game
- Two players each roll a die; the higher roll wins
- Goal: roll as high as you can!
- Repeat the game 6 times
- Have one student roll the die, and the other report the outcome
Hypotheses regarding the outcome
- Luck
- Fraud: a loaded die, or inaccurate reporting
How likely is luck? How do we decide?
Questions that statistics can answer
- I am flipping a coin. Is it fair? How confident am I in my answer?
- I have two bags of beans, each containing some black and some white beans. I have a handful of beans. Which bag did the handful come from?
- I have a handful of beans, and a single bag. Did the handful come from that bag?
- Does this drug improve patient outcomes?
- Which website design yields greater revenue?
- Which baseball player should my team draft?
- What premium should an insurer charge?
- Which chemical process leads to the best-tasting beer?
(The "Student" of Student's t-test was William Gosset, a chemist working for the Guinness brewery in Dublin.)
What can happen when you roll a die? What is the likelihood of each?
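For a fair die, each of the six faces is equally likely, with probability 1/6 (about 0.167). A minimal simulation sketch in Python; the number of rolls is an arbitrary illustrative choice:

```python
import random
from collections import Counter

# Simulate many rolls of a fair six-sided die; each face should come up
# about 1/6 (roughly 0.167) of the time.
rolls = [random.randint(1, 6) for _ in range(60000)]
counts = Counter(rolls)
for face in range(1, 7):
    print(face, counts[face] / len(rolls))
```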
What can happen when you roll two dice?
[Histogram of the sum of two dice: possible sums 2 through 12]
How likely are you to roll 11 or higher? The probability of a result at least that extreme is known as the "p value".
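Exact counting gives the answer: 3 of the 36 equally likely outcomes (5+6, 6+5, 6+6) sum to 11 or more, so the probability is 3/36, about 0.083. A simulation sketch that estimates the same value; the trial count is arbitrary:

```python
import random

def roll_two_dice():
    """Return the sum of two fair six-sided dice."""
    return random.randint(1, 6) + random.randint(1, 6)

trials = 100000
high = sum(1 for _ in range(trials) if roll_two_dice() >= 11)
print(high / trials)   # roughly 3/36 = 0.083
```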
How to compute p values
- Via a statistical formula
  - Requires you to make assumptions and know which formula to use
- Computationally, by simulation (a sketch follows this list)
  - Run many experiments
  - Count the fraction with a better result
  - Requires a metric/measurement for "better"
  - Requires you to be able to run the experiments
  - We will use this approach exclusively
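One way to package the computational approach as a reusable sketch; the function name, parameters, and trial count are illustrative choices, not part of the course materials:

```python
import random

def simulated_p_value(observed_value, run_experiment, num_trials=10000):
    """Estimate a p value: the fraction of simulated experiments whose
    result is at least as large as the observed value."""
    at_least_as_extreme = sum(
        1 for _ in range(num_trials) if run_experiment() >= observed_value)
    return at_least_as_extreme / num_trials

# Example: how surprising is a sum of 11 or higher with two fair dice?
def two_dice():
    return random.randint(1, 6) + random.randint(1, 6)

print(simulated_p_value(11, two_dice))   # roughly 0.083
```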
Analogy between hypothesis testing and mathematical proofs
"The underlying logic [of hypothesis testing] is similar to a proof by contradiction. To prove a mathematical statement, A, you assume temporarily that A is false. If that assumption leads to a contradiction, you conclude that A must actually be true."
From the book Think Stats by Allen Downey
Interpreting p values
- A p value of 5% or less is conventionally called "statistically significant"; there is nothing magical about 5%.
- Two types of errors may occur in statistical tests:
  - False positive (false alarm, or Type I error): there is no real effect, but the test reports one (through coincidence or bad luck). If there is no real effect, a false positive occurs about 1 time in 20 at the 5% threshold (see the sketch below).
  - False negative (miss, or Type II error): there is a real effect, but the test reports none.
- The larger the sample, the lower the likelihood of a false negative; the false positive rate is set by the significance threshold.
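A sketch illustrating the 1-in-20 figure: when the null hypothesis really is true and we declare significance at p ≤ 0.05, about 5% of experiments come out as false positives. The experiment here (counting heads from a fair coin) and all the counts are illustrative assumptions:

```python
import random

def p_value_for_fair_coin(num_flips=100, num_simulations=1000):
    """Flip a genuinely fair coin, then estimate the p value of the
    observed number of heads by simulating more fair coins."""
    observed = sum(random.randint(0, 1) for _ in range(num_flips))
    as_extreme = sum(
        1 for _ in range(num_simulations)
        if sum(random.randint(0, 1) for _ in range(num_flips)) >= observed)
    return as_extreme / num_simulations

# The coin is fair, yet roughly 1 experiment in 20 still looks
# "significant" at the 5% level: a false positive.
experiments = 100
false_positives = sum(1 for _ in range(experiments)
                      if p_value_for_fair_coin() <= 0.05)
print(false_positives / experiments)   # roughly 0.05 (a bit less, since coin counts are discrete)
```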
A false positive http://xkcd.com/882/
Summary of statistical methodology
1. Decide on a metric (bigger value = better)
2. Observe what you see in the real world
3. Hypothesize that what you saw is normal/typical; this is the "null hypothesis"
4. Simulate the real world many times
5. How different is what you observed from the simulations? What percent of the simulation values is the real-world value bigger than?
6. If the percentage is 95% or more, reject the null hypothesis
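A sketch of the whole recipe applied to the coin question from earlier in the lecture; the observed count (63 heads in 100 flips) and all names are made up for illustration:

```python
import random

def num_heads(num_flips):
    """Simulate num_flips flips of a fair coin and return the number of heads."""
    return sum(random.randint(0, 1) for _ in range(num_flips))

# 1. Metric: number of heads (a bigger value is more suspicious of a biased coin).
# 2. Real-world observation (made up): 63 heads out of 100 flips.
observed = 63
# 3. Null hypothesis: the coin is fair.
# 4. Simulate the fair-coin world many times.
trials = 10000
simulated = [num_heads(100) for _ in range(trials)]
# 5. What percent of the simulated values is the observed value bigger than?
fraction_beaten = sum(1 for s in simulated if observed > s) / trials
print(fraction_beaten)
# 6. If the percentage is 95% or more, reject the null hypothesis (a fair coin).
print("reject the null hypothesis" if fraction_beaten >= 0.95
      else "cannot reject the null hypothesis")
```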
A common error
- Observe what you see in the real world
- Decide on a metric (bigger value = better)
This order is backwards: choose the metric before looking at the data. For any observation, there is something unique about it. Example: roll some dice, then be amazed, because what are the odds you would get exactly that combination of rolls?
Correlation ≠ causation
Ice cream sales and the rate of drowning deaths are correlated (both rise in summer), but neither causes the other.
http://xkcd.com/552/
Statistical significance ≠ practical importance
Don’t trust your intuition
People have very bad statistical intuition. It’s much better to follow the methodology and do the experiments.