Lecture note on statistics, data analysis planning – week 14 Elspeth Slayter, M.S.W., Ph.D.

Slides:



Advertisements
Similar presentations
Regression Analysis: A statistical procedure used to find relationships among a set of variables.
Advertisements

David Pieper, Ph.D. STATISTICS David Pieper, Ph.D.
INFERENTIAL STATISTICS. Descriptive statistics is used simply to describe what's going on in the data. Inferential statistics helps us reach conclusions.
Regression With Categorical Variables. Overview Regression with Categorical Predictors Logistic Regression.
Statistical Tests Karen H. Hagglund, M.S.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Topic 3: Regression.
Analyzing quantitative data – section III Week 10 Lecture 1.
An Introduction to Logistic Regression
Lecture 5: Simple Linear Regression
Summary of Quantitative Analysis Neuman and Robson Ch. 11
DESIGNING, CONDUCTING, ANALYZING & INTERPRETING DESCRIPTIVE RESEARCH CHAPTERS 7 & 11 Kristina Feldner.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Leedy and Ormrod Ch. 11 Gray Ch. 14
Understanding Research Results
February  Study & Abstract StudyAbstract  Graphic presentation of data. Graphic presentation of data.  Statistical Analyses Statistical Analyses.
Chapter 13: Inference in Regression
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
(a.k.a: The statistical bare minimum I should take along from STAT 101)
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Week 4: Multiple regression analysis Overview Questions from last week What is regression analysis? The mathematical model Interpreting the β coefficient.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION © 2012 The McGraw-Hill Companies, Inc.
Chapter 9 Analyzing Data Multiple Variables. Basic Directions Review page 180 for basic directions on which way to proceed with your analysis Provides.
Choosing the Appropriate Statistics Dr. Erin Devers October 17, 2012.
Go to Table of Content Single Variable Regression Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Logistic (regression) single and multiple. Overview  Defined: A model for predicting one variable from other variable(s).  Variables:IV(s) is continuous/categorical,
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Preparing for the final - sample questions with answers.
Regression Analysis A statistical procedure used to find relations among a set of variables.
Chapter Sixteen Copyright © 2006 McGraw-Hill/Irwin Data Analysis: Testing for Association.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
Chapter 16 Data Analysis: Testing for Associations.
Chapter 13 Multiple Regression
Discussion of time series and panel models
Regression & Correlation. Review: Types of Variables & Steps in Analysis.
Academic Research Academic Research Dr Kishor Bhanushali M
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Examining Relationships in Quantitative Research
Chapter 9 Correlational Research Designs. Correlation Acceptable terminology for the pattern of data in a correlation: *Correlation between variables.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
NIH and IRB Purpose and Method M.Ed Session 2.
CORRELATION ANALYSIS.
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.
Nonparametric Statistics
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Regression Analysis: A statistical procedure used to find relations among a set of variables B. Klinkenberg G
Topics, Summer 2008 Day 1. Introduction Day 2. Samples and populations Day 3. Evaluating relationships Scatterplots and correlation Day 4. Regression and.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Lecture notes on statistics Elspeth Slayter, M.S.W., Ph.D.
PSY 325 AID Education Expert/psy325aid.com FOR MORE CLASSES VISIT
Chapter 22 Inferential Data Analysis: Part 2 PowerPoint presentation developed by: Jennifer L. Bellamy & Sarah E. Bledsoe.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical.
Methods of Presenting and Interpreting Information Class 9.
Statistics & Evidence-Based Practice
Research in Social Work Practice Salem State University
BINARY LOGISTIC REGRESSION
Bivariate & Multivariate Regression Analysis
Regression Analysis.
BUS 308 HELPS Perfect Education/ bus308helps.com.
CORRELATION ANALYSIS.
ADVANCED DATA ANALYSIS IN SPSS AND AMOS
15.1 The Role of Statistics in the Research Process
Regression Part II.
Presentation transcript:

Lecture note on statistics, data analysis planning – week 14 Elspeth Slayter, M.S.W., Ph.D.

Today’s class  Administrative matters & check-in  Review, worksheet  Take-home message: Match stats to question 2 more types of tests Quantitative data analysis planning  Group consultations

Assignment 3  Introduction RQ for the literature  Literature review Starts with annotated bibliography  Recommendation What you pulled from literature  Proposed evaluation plan RQ for the evaluation Sub-sections

4 basic types of statistical tests: Mean, standard deviation Median, Mode Percentage, frequency Description Pearson’s correlation Correlation Student’s t tests Chi-square tests ANOVA Odds ratios Comparison OLS regression Logit regression Prediction

What you need to know: I. Purpose of test Match question to test II. Logistics for test Structure of variable needed for test ○ Continuous ○ Nominal Number of variables needed for test III. Interpret test findings Basic knowledge of the “squigglies” in reporting statistical results

Review: Variable structure  Continuous (a.k.a. numeric) Examples?  Nominal (a.k.a. dummy variable, categorical variables) Examples?

Breathe, people.

What, exactly, is regression?  Researchy-language-lite: A statistical procedure used to find relationships among a set of variables  Vernacular language: How well do all of these independent variables explain the variation in the dependent variable How well does this “model” explain the outcome of interest

Ordinary least squares (OLS) regression  Vernacular language:  There is a dependent variable, which is the one you are trying to explain, and 1+ independent variables that are related to it  Researchy-language:  Tests a ‘model’ of how a group of independent variables explain variation in the dependent variable Requirements: Dependent variable is continuous Certain conditions are met among independent variables

Two major factors to assess for/consider before running an OLS regression  Multicollinearity: Occurs when one or more of your independent variables are related to one another  Omitted Variables: If independent variables that have significant relationships with the dependent variable are left out of the model, results will not be as good as if they are included

Some relationships are perfectly linear  Your cell phone bill, for instance, may be: Total Charges = Base Fee + 30¢ (overage minutes) + ε If you know the base fee and the number of overage minutes, you can predict the total charges exactly.

Other relationships are not so linear  Weight - function of height  Variations that height does not explain  If you take a sample of actual heights and weights, you might see something like the graph to the right.

Examples with one CONTINUOUS dependent variable, 2+ independent variables OF ANY TYPE  Weight = Height + X1 + X2… + ε  What factors are known to impact weight? Separate relationships with weight  What is a “model” for explaining weight?  This is multiple regression analysis

Interpreting “coefficients” in OLS regression analysis  The coefficient for each independent variable shows how much an increase of one in its value will change the dependent variable, holding all other independent variables constant.  The p-value is a percentage. It tells you how likely it is that the coefficient for that independent variable emerged by chance and does not describe a real relationship.

Interpret: OLS regression results  Standardized coefficient is what to focus on  Interpret like a Pearson’s correlation  Association with dependent variable  Look at R 2

Logistic regression (a.k.a. logit)  Vernacular language:  There is a dependent variable, which is the one you are trying to explain, and 1+ independent variables that are related to it  Researchy-language:  ‘Models’ how a set of independent variables function as collective predictors of the dependent variable Requirements: Dependent variable is nominal (2 values only)

How ‘Good’ is the ‘Model?’ OLS Regression  How well the model explains the data measured via R 2  Tells you percent of differences explained by model  R 2=.68 means: 68% of the variance in the dependent variable is explained by the model 32% of those differences remains unexplained Logit/Logistic Regression  Nagelkerke R 2  Cox & Snell R 2  Same idea, different formulae used to get it

Interpret: Logistic Regression results  Interpret the power of the model Nagelkerke R 2 = 0.33  Separate from reporting of odds ratios with lots of control variables

…put your quantitative hat on…

Moving in on the questions – but what to do w/the answers?  Developing a data analysis plan ahead of time is crucial to conducting good research  Relates to research design Cross-sectional, longitudinal?  Relates to how you ask questions: Topic, question structure, answer design

Introduction to table shells:  What is a table shell?  Tables with no data in them – and a process  Take each question and think about how you will use it  Use table shells to: Clarify your thinking pre-data collection Prepare for your data’s arrival Get a jump on preparing for data interpretation, writing your final report

Thinking about groupings, or not  Table for overall data Frequencies, means  Table for comparative data Frequencies, means and test results Two groups (Chi-square, t-test, odds ratio, etc.) ○ Choose test based on variable structure Three groups or more (ANOVA, etc.)

Table shells – your examples  What results do you envision?  Are you comparing groups?  What groups are you comparing?  On what variables?

Time to slice and dice:

Data analysis basics:  First – run frequencies on all Conduct data cleaning for data entry errors See what the emerging “story” is  Second – run means on all data measured continuously See what the emerging story is  Third – run correlations per plan  Fourth – run comparisons per plan  Fifth – look at additional unexpected analysis possibilities