Random Thoughts 2012 (COMP 066) Jan-Michael Frahm Jared Heinly source: fivethirtyeight.com.

Slides:



Advertisements
Similar presentations
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Nebraska Dave Olson
Advertisements

© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Rhode Island Dave Olson
Sampling.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Louisiana Dave Olson
1. Exams 2. Sampling Distributions 3. Estimation + Confidence Intervals.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in North Carolina Dave Olson
The Diversity of Samples from the Same Population Thought Questions 1.40% of large population disagree with new law. In parts a and b, think about role.
1. Estimation ESTIMATION.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 7: Demand Estimation and Forecasting.
1 BABS 502 Lecture 10 March 23, 2011 (C) Martin L. Puterman.
An Introduction to Logistic Regression
The Research Process. Purposes of Research  Exploration gaining some familiarity with a topic, discovering some of its main dimensions, and possibly.
CHAPTER 18 Models for Time Series and Forecasting
© 2004 by David T. Olson Sample - Not for Public Use1 A Sample Presentation of The State of the Church in Florida and the Largest 5 Metro Areas
© 2004 by David T. Olson Sample - Not for Public Use1 A Sample Presentation of The State of the Church in Wisconsin and the Milwaukee and Madison Metro.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Wisconsin Dave Olson
© 2004 by David T. Olson Sample - Not for Public Use1 A Sample Presentation of The State of the Church in North Carolina and the Charlotte, Raleigh, and.
Business Forecasting Chapter 4 Data Collection and Analysis in Forecasting.
Random Thoughts 2012 (COMP 066) Jan-Michael Frahm Jared Heinly source: fivethirtyeight.com.
How We Form Political Opinions Political Opinions Personal Beliefs Political Knowledge Cues From Leaders.
Voting Behavior of Naturalized Citizens: Sarah R. Crissey Thom File U.S. Census Bureau Housing and Household Economic Statistics Division Presented.
PUBLIC OPINION AND POLITICAL SOCIALIZATION
Sampling Defined / The idea – Making inference about a larger population What is the population – Some particular value in the population estimating.
Random Thoughts 2012 (COMP 066) Jan-Michael Frahm Jared Heinly.
8.2 Estimating Population Means LEARNING GOAL Learn to estimate population means and compute the associated margins of error and confidence intervals.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Virginia Dave Olson
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Kentucky Dave Olson
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Mississippi Dave Olson
The Demand Side: Consumption & Saving. Created By: Reem M. Al-Hajji.
Statistical Inference: Making conclusions about the population from sample data.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 7 Estimates and Sample Sizes 7-1 Review and Preview 7-2 Estimating a Population Proportion.
Random Thoughts 2012 (COMP 066) Jan-Michael Frahm Jared Heinly.
© 2004 by David T. Olson Sample - Not for Public Use1 A Sample Presentation of The State of the Church in Florida and the Largest 5 Metro Areas
Time Series Analysis and Forecasting
Managerial Economics Demand Estimation & Forecasting.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Ohio Dave Olson
Chapter 15 Sampling and Sample Size Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Sampling Methods and Sampling Distributions
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in New York Dave Olson
Chapter 16 Data Analysis: Testing for Associations.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Hawaii Dave Olson
CONFIDENCE STATEMENT MARGIN OF ERROR CONFIDENCE INTERVAL 1.
Random Thoughts 2012 (COMP 066) Jan-Michael Frahm Jared Heinly.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Indiana Dave Olson
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Florida Dave Olson
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
MARKET APPRAISAL. Steps in Market Appraisal Situational Analysis and Specification of Objectives Collection of Secondary Information Conduct of Market.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in New Jersey Dave Olson
© 2004 by David T. Olson Sample - Not for Public Use1 A Sample Presentation of The State of the Church in West Virginia Dave Olson
Multiple Regression Analysis Regression analysis with two or more independent variables. Leads to an improvement.
1 1 Chapter 6 Forecasting n Quantitative Approaches to Forecasting n The Components of a Time Series n Measures of Forecast Accuracy n Using Smoothing.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in New Hampshire Dave Olson
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in West Virginia Dave Olson
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Missouri Dave Olson
Margin of Error S-IC.4 Use data from a sample survey to estimate a population mean or proportion; develop a margin of error through the use of simulation.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 7 Sampling Distributions Section 7.1 How Sample Proportions Vary Around the Population.
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
© 2004 by David T. Olson Sample - Not for Public Use1 The State of the Church in Tennessee Dave Olson
Sampling Distribution Models
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Sampling.
What is Correlation Analysis?
The Diversity of Samples from the Same Population
Predicting Presidential Elections
Natalie Jackson Senior Polling Editor, The Huffington Post
The State of the Church in Massachusetts
The State of the Church in Delaware
Why 25% of Voter Polls are Wrong
Presentation transcript:

Random Thoughts 2012 (COMP 066) Jan-Michael Frahm Jared Heinly source: fivethirtyeight.com

Halloween Statistics 41 million potential trick or treaters (kids in the US) 106 million stops giving out candy $2.9 billion spend on costumes  $370 on pet costumes  $1.1 billion on kids costumes  $1.4 billion on adult costumes average candy consumption 1.2 lb per person (~280 M&Ms or 33 fun size Snickers) about 2700 calories heaviest pumpkin 1810 lb (2010) 2

Where to spend Halloween Transylvania County, N.C., with 29,499 residents. Tombstone, Ariz., with a population of 1,537. Pumpkin Center, N.C. (population 2,228) and Pumpkin Bend township, Ark. (population 307). Cape Fear township (New Hanover Co.), N.C.; and Cape Fear township (Chatham Co.), N.C. (populations of 15,711 and 1,170, respectively). Skull Creek township, Neb., with a population of 295. (Source: The U.S. Census Zombies) 3

Predicting Election Results How to predict the result of the election Challenges:  many different polls out there  different dates for polls  some states are polled less than others  how to project out in time till election day  how to convert poll results into election outcome One popular site is fivethirtyeight.com by Nathan Silver  uses available polls to predict the election outcome  sophisticated forecast model with careful data selection/weighting 4

Today’s Election Forecast 5

State by State Probability 6

How is the Election Forecast Done 1.Polling Average: Aggregate polling data, and weight it according to reliability scores. 7

Reliability Score of a Poll The reliability is dependent on three factors  pollster’s historical accuracy  sample size of the poll  recentness of the poll Pollsters reliability score tries to measure errors introduced by the pollster’s technique of polling (PIE) measured by: Total Error = MOE + Temporal Error + PIE 8

Errors MOE is what the poll reports based on its sample size (see previous class on how to compute) Temporal error is larger the farther out from the election for measuring the PIE it can be assumed zero when using only polls that are within a small time frame PIE is error from poor methodology (see class on polling)  increases the MOE beyond the one reported  varies significantly from pollster to pollster  computed from historical data on a per pollster basis PIE = Actual Result of Election – Predicted Result - MOE Iteratively compare PIE of a pollster with results of others (iterated Average Error IAE)  measures how well a pollster does 9

PIE & IAE 10 measures how well pollster does compared to others

Long run PIE Long run PIE accumulates the PIE of pollster over multiple elections 11

Converting Pie into Weighting How to use PIE to weight samples?  effectively PIE increases MOE  equivalent to reduce sample size Use PIE to compute effective sample size (see previous class) the ratio of sample size to effective sample size is used as weight of poll 12

Effective Sample Size vs Sample Size 13

How is the Election Forecast Done 1. Polling Average: Aggregate polling data, and weight it according to our reliability scores. 2. Trend Adjustment: Adjust the polling data for current trends. 14

Trend Adjustment Tries to account for difference in polling frequency Adjust data from states that have not been polled using data of recently polled states  use recent polls of similar states to correct last poll data  use national polls to correct Uses regression process to determine the change, which is then smoothed over time 15

How is the Election Forecast Done 1. Polling Average: Aggregate polling data, and weight it according to our reliability scores. 2. Trend Adjustment: Adjust the polling data for current trends. 3. Demographic Regression: Analyze demographic data in each 16

Demographic Regression remove the influence of demographics on polls estimates the dependency of the vote on 16 factors:  home state of candidate  fundraising share of candidate in the state  percentage of candidates vote in primaries in the state  score for conservativeness of state (Massachusetts most liberal, Utah most conservative)  Religious distribution in state (Evangelic., Catholic, Mormon)  Ethnic & Racial score (African-American, Hispanic, American)  Economic (Per capita income, Manifacturing)  Demographic (Senior, Twenty, Education, Suburban) 17

How is the Election Forecast Done 1. Polling Average: Aggregate polling data, and weight it according to our reliability scores. 2. Trend Adjustment: Adjust the polling data for current trends. 3. Demographic Regression: Analyze demographic data in each 4. Snapshot: Combine the polling data with the regression  uses trend adjusted data  regression adjusted data  reflects what would happen if election would be today 18

How is the Election Forecast Done 1. Polling Average: Aggregate polling data, and weight it according to our reliability scores. 2. Trend Adjustment: Adjust the polling data for current trends. 3. Demographic Regression: Analyze demographic data in each 4. Snapshot: Combine the polling data with the regression 5. Projection: Translate the snapshot into a projection of what will happen in November 19

Projection of Polls allocates undecided voters  50/50 split between candidates adjusts values to tighten up polls closer to election day (regression to the mean) 20

How is the Election Forecast Done 1. Polling Average: Aggregate polling data, and weight it according to our reliability scores. 2. Trend Adjustment: Adjust the polling data for current trends. 3. Demographic Regression: Analyze demographic data in each 4. Snapshot: Combine the polling data with the regression 5. Projection: Translate the snapshot into a projection of what will happen in November 6. Simulation: Simulate our results 10,000 times to get a robust probabilistic assessment of what will happen in each state as well as in the nation as a whole. 21

Simulation of Election accounts for uncertainty in interpreting polls state-specific movements national movements 22

How is the Election Forecast Done 1. Polling Average: Aggregate polling data, and weight it according to our reliability scores. 2. Trend Adjustment: Adjust the polling data for current trends. 3. Demographic Regression: Analyze demographic data in each 4. Snapshot: Combine the polling data with the regression 5. Projection: Translate the snapshot into a projection of what will happen in November 6. Simulation: Simulate our results 10,000 times to get a robust probabilistic assessment of what will happen in each state as well as in the nation as a whole. 23