Ratchet up your investigation with statistics David Donald, The Center for Public Integrity Jennifer LaFleur, CIR.

Slides:



Advertisements
Similar presentations
Selecting a Data Analysis Technique: The First Steps
Advertisements

Logistic Regression Psy 524 Ainsworth.
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2013.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Inference1 Data Analysis Inferential Statistics Research Methods Gail Johnson.
Developing the Research Question
Data Analysis Statistics. Inferential statistics.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
By Wendiann Sethi Spring  The second stages of using SPSS is data analysis. We will review descriptive statistics and then move onto other methods.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Topic 3: Regression.
Statistical Evaluation of Data
Data Analysis Statistics. Inferential statistics.
Today Concepts underlying inferential statistics
1 Relationships We have examined how to measure relationships between two categorical variables (chi-square) one categorical variable and one measurement.
Business 205. Review Exam Least Square Regression Simple Linear Regression.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Internal Consistency Reliability Analysis PowerPoint.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Statistical Analysis KSE966/986 Seminar Uichin Lee Oct. 19, 2012.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Leedy and Ormrod Ch. 11 Gray Ch. 14
Regression Part 3 of 3 Parts Multiple Regression Overview Examples Hypothesis Tests MR ANOVA Table Interpretation Indicator Variables Assumptions Homework.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Methodological Digression. Overview Clinical Trials Regression Analysis Structure Terminology.
Class Meeting #11 Data Analysis. Types of Statistics Descriptive Statistics used to describe things, frequently groups of people.  Central Tendency 
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
Statistics Definition Methods of organizing and analyzing quantitative data Types Descriptive statistics –Central tendency, variability, etc. Inferential.
Statistical Evaluation of Data
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Data Analysis (continued). Analyzing the Results of Research Investigations Two basic ways of describing the results Two basic ways of describing the.
Bivariate Descriptive Analysis First step in analyzing your data Three components Cross-tabulations and frequency distributions Significance testing Correlations.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2008.
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
Linear correlation and linear regression + summary of tests
Education 795 Class Notes Data Analyst Pitfalls Difference Scores Effects Sizes Note set 12.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Scatterplots & Regression Week 3 Lecture MG461 Dr. Meredith Rolfe.
Regression & Correlation. Review: Types of Variables & Steps in Analysis.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Review. Statistics Types Descriptive – describe the data, create a picture of the data Mean – average of all scores Mode – score that appears the most.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
There be dragons Avoid stats pitfalls Jennifer LaFleur, ProPublica.
Nonparametric Statistics
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
STATISTICAL TESTS USING SPSS Dimitrios Tselios/ Example tests “Discovering statistics using SPSS”, Andy Field.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
GIS and Spatial Analysis1 Summary  Parametric Test  Interval/ratio data  Based on normal distribution  Difference in Variances  Differences are just.
Other tests of significance. Independent variables: continuous Dependent variable: continuous Correlation: Relationship between variables Regression:
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
ScWk 298 Quantitative Review Session
Nonparametric Statistics
Ass. Prof. Dr. Mogeeb Mosleh
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Ratchet up your investigation with statistics David Donald, The Center for Public Integrity Jennifer LaFleur, CIR

These may all be nuts

But they do different things and need different tools

These may all be data

But they too require the right tool

First step: Understand your data Categorical data Dichotomous: yes/no, 1/0

On Off  UP DOWN 

Next step: Know what you can do with it Frequencies

Cross-tabs Categorical  Dichotomous

Stepping up the data ladder Categorical data Dichotomous: yes/no, 1/0 Multiple categories

Frequencies Categorical  Multi-categorical

Frequencies Cross-tabs Categorical  Multi-categorical

Stepping up the data ladder Continuous data Categorical data Dichotomous: yes/no, 1/0 Multiple categories

Continuous data Mean Median Range Rank Working with continuous data

More with continuous data N-tiles Rates Correlation Regression But wait! There’s more

Correlation

Distribution

Regression

When you run a regression, you get a result called an R- square. That will help you see how much the independent variable predicts the dependent variable. Model Summary ModelRR Square Adjusted R Square Std. Error of the Estimate 1.911(a) percent of the variation in test scores is explained by change in poverty

Regression Then use that information to “adjust” the data or to see if entities are doing better or worse than they should.

Mixing it up Categorical and continuous T-tests or nonparametric tests can be used to compare two groups

Mixing it up Categorical and continuous ANOVA – analysis of variance Good for comparing more than 2 groups Is variation across groups greater than within groups

Mixing it up Categorical and continuous AND your outcome variables is dichotomous Logistic regression

Don’t reinvent the wheel Know the tools (or find them)

Index A measurement of movement used to simplify or compare a set of numbers

Indexes FBI Crime index: total of seven crimes Diversity Index: probability that two people pulled out of a given area would be of the same race Herfindahl-Hirshman Index: measures competition in a market Gini Coefficient: measures income disparity

How to build your own index Make sure what your index is valid. Here’s an example: No index measures people’s attitudes toward pro-development versus pro-environment. And it’s hard to ask people up front, which are you? One way is in a survey to ask a bunch of questions on a scale, such as whether more material consumption makes happier people. You ask on a scale of 1-5 if you agree or disagree with the statement.

Your own index Say you ask questions along this line. Then you can see if they correlate. You want some correlation but not too much. a Pearson “r” between.2 and.5 Run a test called Chronbach’s Alpha to examine affect each question has on the others. You want the score to be above.7 at least,.8 is better. If you find that to be met, then you can average the scores on the seven questions (or however many correlate) and come up with one score that measures your difficult concept.

Your own index Using a panel of questions and a statistical test known as Cronbach’s Alpha, David Donald, then with the Savannah Morning News, showed that those who are most pro-development were lower income people, those who had been left out of the American Dream. Material success apparently allows you to become more pro-environment.

Knowing all of these tools can get confusing

So when you’re not sure which tool to use, ask someone.

Dos and Don’ts Put your findings in terms readers can understand Explain your methodology Put complicated numbers in graphics Duplicate your results

Dos and Don’ts Watch your wording: A farm city of 122,000 known as the "Salad Bowl of the World," Salinas had become a cornucopia of gang violence. The number of gangs more than tripled since In 1994, the per-capita murder rate in Salinas, largely attributed to warring gangs, surpassed that of San Jose, at 38 and 24 respectively.

Dos and Don’ts Bounce findings off experts/targets Don’t say significant – when you don’t really now (but you can test for that) Beware the spurious correlation While your analysis may generate lots of Rs and Ps --- you may want to go back to descriptives in print.

Questions?