Stat 217 – Day 10 Review. Last Time Judging “spread” of a distribution “Empirical rule”: In a mound-shaped symmetric distribution, roughly 68% of observations.

Slides:



Advertisements
Similar presentations
Stat 301 – Day 17 Tests of Significance. Last Time – Sampling cont. Different types of sampling and nonsampling errors  Can only judge sampling bias.
Advertisements

Chapter 7 Sampling Distributions
Stat 301 – Day 28 Review. Last Time - Handout (a) Make sure you discuss shape, center, and spread, and cite graphical and numerical evidence, in context.
Suppose we are interested in the digits in people’s phone numbers. There is some population mean (μ) and standard deviation (σ) Now suppose we take a sample.
Stat 512 Day 4: Quantitative Data. Last Time p-values and statistical significance  What p-values tell us (and do not tell us) For now, approximating.
Stat 512 – Lecture 18 Multiple Regression (Ch. 11)
Stat 217 – Day 7 Quantitative data (Topic 7) Preliminary question #1: What do you think is a typical weight for a male Olympic rower? Handout.
Stat 217 – Day 22 Review. Last Time – Subtle Issues Make sure you have a random sample or interval doesn’t tell you much! Make sure you have a sample!
Stat 301 – Day 14 Review. Previously Instead of sampling from a process  Each trick or treater makes a “random” choice of what item to select; Sarah.
Stat 217 – Day 5 Random Assignment (Topic 5). Last Time – Random Sampling Issue #1: Do I believe the sample I have is representative of the population.
Lecture 4 Chapter 11 wrap-up
Stat 512 – Lecture 12 Two sample comparisons (Ch. 7) Experiments revisited.
Normal Distribution (Topic 12)
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Stat 217 – Day 27 Chi-square tests (Topic 25). The Plan Exam 2 returned at end of class today  Mean.80 (36/45)  Solutions with commentary online  Discuss.
Stat 217 – Week 10. Outline Exam 2 Lab 7 Questions on Chi-square, ANOVA, Regression  HW 7  Lab 8 Notes for Thursday’s lab Notes for final exam Notes.
Stat 217 – Day 10 Measures of spread, position (Topic 9)
Stat 512 – Lecture 13 Chi-Square Analysis (Ch. 8).
Stat 301 – Day 21 Large sample methods. Announcements HW 4  Updated solutions Especially Simpson’s Paradox  Should always show your work and explain.
Stat 301 – Day 18 Normal probability model (4.2).
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Stat 512 Day 6: Sampling. Last Time Get lots of sleep! Characteristics of the distribution of a quantitative variable  Shape, center, spread, outliers.
Stat 217 – Day 13 Sampling Distributions (Topic 13) Submit Activity 12-6?
Stat 217 – Day 20 Comparing Two Proportions The judge asked the statistician if she promised to tell the truth, the whole truth, and nothing but the truth?
Stat 512 – Lecture 11 Type I/Type II Errors Open Applets page Review.
Stat 512 – Day 5 Statistical significance with quantitative response variable.
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 10 Sampling Distributions.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Confidence Intervals. Estimating the difference due to error that we can expect between sample statistics and the population parameter.
A P STATISTICS LESSON 9 – 1 ( DAY 1 ) SAMPLING DISTRIBUTIONS.
June 2, 2008Stat Lecture 18 - Review1 Final review Statistics Lecture 18.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Tue, Oct 23, 2007.
June 18, 2008Stat Lecture 11 - Confidence Intervals 1 Introduction to Inference Sampling Distributions, Confidence Intervals and Hypothesis Testing.
Chapter 9 Sampling Distributions and the Normal Model © 2010 Pearson Education 1.
Normal Distribution Z-scores put to use!
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Estimation: Sampling Distribution
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Day 3: Sampling Distributions. CCSS.Math.Content.HSS-IC.A.1 Understand statistics as a process for making inferences about population parameters based.
February 2012 Sampling Distribution Models. Drawing Normal Models For cars on I-10 between Kerrville and Junction, it is estimated that 80% are speeding.
 What do we mean by “volume” in measurement?  How do we measure volume?
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2015 Room 150 Harvill.
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
Reasoning in Psychology Using Statistics Psychology
 The point estimators of population parameters ( and in our case) are random variables and they follow a normal distribution. Their expected values are.
Rejecting Chance – Testing Hypotheses in Research Thought Questions 1. Want to test a claim about the proportion of a population who have a certain trait.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Fri, Nov 12, 2004.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Wed, Nov 1, 2006.
Testing Hypotheses about a Population Proportion Lecture 31 Sections 9.1 – 9.3 Wed, Mar 22, 2006.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Statistics 19 Confidence Intervals for Proportions.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Lecture #8 Thursday, September 15, 2016 Textbook: Section 4.4
Statistics 200 Lecture #4 Thursday, September 1, 2016
CHAPTER 1 Exploring Data
Stat 217 – Day 28 Review Stat 217.
Stat 217 – Day 17 Review.
Reasoning in Psychology Using Statistics
Sampling Distributions
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2018 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Reasoning in Psychology Using Statistics
Pull 2 samples of 20 pennies and record both averages (2 dots).
Reasoning in Psychology Using Statistics
Significance Test for a Mean
Presentation transcript:

Stat 217 – Day 10 Review

Last Time Judging “spread” of a distribution “Empirical rule”: In a mound-shaped symmetric distribution, roughly 68% of observations fall within one standard deviation of the mean, 95% within two standard deviations of the mean  2SD = width of middle 68% of distribution Z-scores measure the relative position of an observation and provide us a unitless measuring stick for how far an observation falls from mean  Very useful for comparing values from different distributions Boxplots – visual display of five number summary  Helpful for comparing distributions (spread, center)

Comments on HW 2 Problem 2: Identify terms  Sampling frame is the list of the population used to select the sample Does not include the response variable information! (b) average number of words on a page of a textbook (d) tend to gain an average of 15 lbs?

Comments on HW 2 Act 5-14: Studies from Blink  (a) and (b) only had response variables, observational studies  (c) and (d) had 2 variables and the explanatory variable was randomly assigned, experiments  So in (c) and (d) can potentially draw cause and effect conclusions  “Generalizability” means can you take information from sample and apply it to the larger population? “There was not a significance difference in SAT performance in the sample so I don’t think there is in the population as well” Yes if have random sample, so maybe only in (a)

Comments on HW 2 Question 4: Hand hold (b) Can the status of the EV be determined by Ashleigh? Gender of participant vs. gender of researcher  Random sampling vs. random assignment

Comments on HW 2 Cause and effect vs. generalizing to population YesNo Yes No Were groups randomly assigned? Were obs units randomly selected? Can draw cause-and- effect conclusions Can generalize to larger population

Comments on HW 2 Question 5: AIDS testing  Most of you got the table right but then read the wrong proportion from the table Of those who tested positive, what proportion had AIDS = 4885/78515 =.062 Of those who have AIDS, what proportion test positive = 4885/5000 =.977 (sensitivity) Positive testNegative testTotal Carries AIDS virus(2) 4885(2) 115(1) 5000 Does not carry AIDS(3)73630(3) (1) 995,000 Total(4) 78515(4) ,000,000

Lab 2 Notes (model online) Comparing groups  Are people yawning a lot vs. does the yawn seed group yawn more often Overall proportion vs. Difference in conditional proportions  4.4% vs. 4.4 percentage points  Yawned “a lot more” vs. “yawned a lot more often” Interpreting p-value vs. conclusions from p- value  Probably want to explicitly compare p-value to some cut-off

Lab 2 Notes Interpretation of p-value  If those subjects were going to yawn, regardless of which condition they were in, how often would the random assignment process alone lead to such a large difference in the conditional proportions? Each dot represents one (fake) random assignment Observation units = 1000 fake random assignments Variable = difference in conditional proportions Roughly 51% of fake random assignments (null model) saw a difference at least this large Don’t consider this a small p-value since >.05

Lab 2 Notes Effect of sample size

Challenge Question Why was “random assignment” used in the study? Why did we shuffle the cards and deal them into 2 groups?

Lab 3 Randomization distribution  If everyone was going to remember the same number of letters regardless of which sequence they got, how often would the random assignment process alone lead to such a big difference in the group means? Each dot represents one random assignment Observation units = 1000 fake random assignments Variable = difference in group means Where is the observed difference in means in this distribution?

About Exam 1 50 minutes, 50 points  Will include one of the self-check activities Bring calculator, pencil, eraser  Could be asked to use Minitab and/or to interpret Minitab output  No cell phone calculators (square root) One 8.5x11 sheet of own notes  Both sides ok I will supply paper

Some advice for studying Review handout, problems online  See also p. 627? Review lecture notes, text, hws, labs  See me for old homework, inclass activities Work problems Start with ideas that we have emphasized more often

Some advice during exam If you get stuck on a problem, move on  later parts, later problems Try to hit the highlights in your answer (e.g., not all sources of bias, just the most serious)  Be succinct (think before you write) Read the question carefully Show all of your work, explain well  communication points Read entire question before writing anything

Some big, big ideas Observational units, variable Random assignment vs. random sampling  Implementation  Purpose  Consequence (Scope of conclusions) What see in sample vs. saying something beyond the sample  Statistic vs. Parameter  Statistical significance Interpretations, reasoning  Properties, “what if” questions… How are you deciding this?

Activity 4-19: Voter Turnout (p. 70) Statistic:.682 proportion claiming to vote Parameter:.490 proportion claiming to vote What are some possible explanations for why these values differ?  Those in sample do not represent population  Those in sample were not honest  Statistics vary from sample to sample and may differ from parameter by chance Which of these explanations can we eliminate?

No longer believe it was just “by chance”…

Questions?