Do the math: Outsmarting Stats Holly Hacker EWA Webinar Jan. 2012.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

SPSS Review CENTRAL TENDENCY & DISPERSION
Newsroom math Prof. Steve Doig Cronkite School, ASU.
The Normal Distribution
Normal distribution. An example from class HEIGHTS OF MOTHERS CLASS LIMITS(in.)FREQUENCY
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
SOC 3155 SPSS CODING/GRAPHS & CHARTS CENTRAL TENDENCY & DISPERSION.
Standard Scores & Correlation. Review A frequency curve either normal or otherwise is simply a line graph of all frequency of scores earned in a data.
Measures of Central Tendency
Examples of continuous probability distributions: The normal and standard normal.
Understanding Research Results
STAT 13 -Lecture 2 Lecture 2 Standardization, Normal distribution, Stem-leaf, histogram Standardization is a re-scaling technique, useful for conveying.
Warm-up Day of Ch. 7 Review Free Response A h.s. math teacher believes the greater number of hours of sleep before taking an AP exam, the better the score.
M28- Categorical Analysis 1  Department of ISM, University of Alabama, Categorical Data.
Department of Quantitative Methods & Information Systems
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
MGQ 201 WEEK 4 VICTORIA LOJACONO. Help Me Solve This Tool.
Normal Distributions.
Student Engagement Survey Results and Analysis June 2011.
Statistics. Intro to statistics Presentations More on who to do qualitative analysis Tututorial time.
Copyright © 2011 Pearson Education, Inc. Putting Statistics to Work Discussion Paragraph 6B 1 web 26. Web Data Sets 1 world 27. Ranges in the News 28.
Random Thoughts 2012 (COMP 066) Jan-Michael Frahm Jared Heinly.
Using Standard Deviation to Analyze Reports Understanding the Distribution of a Set of Data.
Chapter 4 Variability. Variability In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In.
Research and Statistics AP PSYCHOLOGY RESEARCH METHODS.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Statistics Numerical Representation of Data Part 2 – Measure of Variation.
The Standard Deviation as a Ruler and the Normal Model
BUS304 – Data Charaterization1 Other Numerical Measures  Median  Mode  Range  Percentiles  Quartiles, Interquartile range.
V pátek nebude přednáška. Cvičení v tomto týdnu bude.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
© 2010 Pearson Prentice Hall. All rights reserved. CHAPTER 12 Statistics.
SOC 3155 SPSS CODING/GRAPHS & CHARTS CENTRAL TENDENCY & DISPERSION h458 student
Introduction A Review of Descriptive Statistics. Charts When dealing with a larger set of data values, it may be clearer to summarize the data by presenting.
Practice Page 65 –2.1 Positive Skew Note Slides online.
SOC 3155 SPSS Review CENTRAL TENDENCY & DISPERSION.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
Chapter 2 Describing Distributions with Numbers. Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range)
M3U9D2 Warm-up: 1. At the doctor’s office, there are three children in the waiting room. The children are ages 3, 4, and 5. Another 4 year old child enters.
Chapter 4 – Measurements of Location and Position Math 22 Introductory Statistics.
Math for communicators. In general Many press releases/stories/ads are based on numbers. Familiarity with basic math is necessary to convey the info the.
The Normal Approximation for Data. History The normal curve was discovered by Abraham de Moivre around Around 1870, the Belgian mathematician Adolph.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.
Chapter 3.3 – 3.4 Applications of the Standard Deviation and Measures of Relative Standing.
SOC 3155 SPSS CODING/GRAPHS & CHARTS CENTRAL TENDENCY & DISPERSION h458 student
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Don’t know much about algebra: Basic math in stories David Donald Training Director IRE and NICAR Western Knight Center, April 23, 2005.
Last lecture summary Five numbers summary, percentiles, mean Box plot, modified box plot Robust statistic – mean, median, trimmed mean outlier Measures.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Measures of Variation. Variation Variation describes how widely data values are spread out about the center of a distribution.
SPSS CODING/GRAPHS & CHARTS CENTRAL TENDENCY & DISPERSION
Practice Page Practice Page Positive Skew.
Describing Location in a Distribution
Statistics Unit Test Review
Objective: Given a data set, compute measures of center and spread.
IB Psychology Today’s Agenda: Turn in:
IB Psychology Today’s Agenda: Turn in:
Newsroom Math and Statistics
Statistics: The Interpretation of Data
Summary (Week 1) Categorical vs. Quantitative Variables
The Standard Deviation as a Ruler and the Normal Model
Summary (Week 1) Categorical vs. Quantitative Variables
`.
CHAPTER 12 Statistics.
Which Test Should I Use?.
Normal Distributions and the Empirical Rule
Presentation transcript:

Do the math: Outsmarting Stats Holly Hacker EWA Webinar Jan. 2012

Innumeracy: Excuses, excuses No math required by my j-school Math ignorance as badge of honor We’ve got our nerd My source will give me the numbers we need

Why learn stats? The dreaded SAT story

Percentage change ( N ew – O ld) O ver O ld NOOO!!! ,000 students 13,500 students Difference: (18,000-13,500)/13,500 =

What if it’s reversed? ,500 students 18,000 students Difference: (13,500-18,000)/18,000 =

Texans paid an average of $855 a year for coverage on their homes, about 47 percent more than the U.S. average of $455, according to the study. CORRECTIONS, CLARIFICATIONS: The story … incorrectly compared U.S. and Texas rates. It should have said that U.S. residents pay about 47 percent less than Texans do for homeowner coverage.

Beware of small bases District A grew from 100,000 to 110,000 students – 10% increase District B grew from 1,000 to 11,000 students – 1,000% increase

Students per teacher (turn your PER into a division sign) Total students Total teachers /32 = 15.6 =15.6 students per teacher Rates

The “Duh” factor California has the most fertility clinics in the nation, with most concentrated in the Bay Area and Los Angeles, according to a new report... "Fertility procedures parallel population size, and California has the most women of childbearing age of any state," said report author Dr. Victoria Wright, public health analyst at the Centers for Disease Control and Prevention.

Newsroom stats Mean (average): Add the values, then divide by the number of values Median: Sort the values, then find the middle one

Weighted average Don’t average averages. Teacher pay = $37,000 Janitor pay = $20,000 Principal pay = $75,000 Avg district pay = $44,000 ?

Weighted average 10,000 x $37,000 =$370 m 2,000 x $20,000 =$ 40 m 2,000 x $20,000 =$ 40 m 500 x $75,000 =$ 37.5 m 500 x $75,000 =$ 37.5 m 12,500$447.5 m $447,500,000/12,500 Avg district pay = $35,800

Public opinion surveys Survey vs. census A random sample is necessary Size of the population being sampled doesn’t matter – only sample size matters

Sampling error Rule: The bigger the sample, the smaller the error the smaller the error Sampling error = 1/√N N = / √100 = 1/10 = +/- 10 pts. N = / √400 = 1/20 = +/- 5 pts. N = / √900 = 1/30 = +/- 3.3 pts.

Reporting poll results Don’t report unscientific polls.

Reporting poll results Don’t report unscientific polls. Beware of big error margins on subgroups.

1,007 x.45 = 453 1/√453 =.047 +/- 4.7 points Do the math

Reporting poll results Don’t report unscientific polls. Beware of big error margins on subgroups. Don’t forget that a poll at best is a snapshot of now, not a predictor of the future.

Distribution

Standard deviation The most common measure of spread – that is, how much things are spread out or clustered around the average.

Scale scores

Percentile A measure that indicates the percent of a distribution that is equal to or below it. If you received a score of 95 on a math test and this score was greater than or equal to the scores of 88% of the students taking the test, then you would be in the 88th percentile.

Correlation With correlation, you can look at how often two or more things happen together – that there is some direct or inverse relationship.

Regression A tool that examines the relationship between two or more variables (such as poverty and test scores). It then allows you to make predictions based on that relationship.

Numbers v. words No more than two or three numbers in a sentence or (gasp!) paragraph percent vs. “about a third” 24 percent vs. “one in four” 2011 vs. “last year” 2011 vs. “last year”

Numbers v. words Avoid false precision.

Numbers v. words Avoid false precision. Always double-check your math answers. No, better triple-check.

Numbers v. words When all the above fails, numbers belong in graphics – bar charts, tables and at last resort a bulleted break out box.