Introduction to Research Design Statlab Workshop, Fall 2010 Jeremy Green Nancy Hite.

Slides:



Advertisements
Similar presentations
Everything I wish I had known about research design and data analysis… Statlab Workshop Spring 2005 Heather Lord and Melanie Dirks.
Advertisements

Design of Experiments Lecture I
Pengolahan dan Analisa Data Indra Budi Fasilkom UI.
Anita M. Baker, Ed.D. Jamie Bassell Evaluation Services Program Evaluation Essentials Evaluation Support 2.0 Session 2 Bruner Foundation Rochester, New.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Research methods – Deductive / quantitative
Writing with Data: Incorporating Statistics Into Causal Research Statlab Workshop Spring 2011 Brian Fried and Kevin Callender.
Research Academy The Research Process A training to “ what is research”
Reading the Dental Literature
The research process Psych 231: Research Methods in Psychology.
CORRELATIONAL RESEARCH II Relations in Observational and Survey Research Lawrence R. Gordon Psychology Research Methods I.
Statistics for the Social Sciences
A Simple Guide to Using SPSS© for Windows
SOWK 6003 Social Work Research Week 10 Quantitative Data Analysis
How Do We Identify Causes? The criteria of causation © Pine Forge Press, an imprint of Sage Publications, 2006.
An Introduction to Educational Research Statistics Graham McMahon MD MMSc 1.
So, You’re Going to Write an Empirical Paper Statlab Workshop October 31 st, 2003 David Nickerson.
PSYC512: Research Methods PSYC512: Research Methods Lecture 9 Brian P. Dyre University of Idaho.
Ecology reporting and statistical analysis
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Everything I wish I had known about research design and data analysis… Statlab Workshop Fall 2006 Kyle Hood and Frank Farach.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Chapter 14 Inferential Data Analysis
Business Research SMS Varanasi.
Writing the Research Paper
IBC464 International College
QUANTITATIVE METHODS I203 Social and Organizational Issues of Information.
Biostatistics Analysis Center Center for Clinical Epidemiology and Biostatistics University of Pennsylvania School of Medicine Minimum Documentation Requirements.
Introduction to Linear Regression and Correlation Analysis
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Variation, Validity, & Variables Lesson 3. Research Methods & Statistics n Integral relationship l Must consider both during planning n Research Methods.
Literature Review and Parts of Proposal
Proposal Type One: Corpus-Based. The following is a list of items typically included in a Type One research proposal for MA in translation studies. The.
Choosing and using statistics to test ecological hypotheses
Evaluating a Research Report
Analyzing and Interpreting Quantitative Data
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
The Literature Search and Background of the Problem.
Introduction to research methods 10/26/2004 Xiangming Mu.
What is Computer Science?  Three paradigms (CACM 1/89) Theory (math): definitions, theorems, proofs, interpretations Abstraction (science): hypothesize,
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
PY 603 – Advanced Statistics II TR 12:30-1:45pm 232 Gordon Palmer Hall Jamie DeCoster.
MK346 – Undergraduate Dissertation Preparation Part II - Data Analysis and Significance Testing.
Review of Research Methods. Overview of the Research Process I. Develop a research question II. Develop a hypothesis III. Choose a research design IV.
Analyzing Statistical Inferences How to Not Know Null.
EDCI 696 Dr. D. Brown Presented by: Kim Bassa. Targeted Topics Analysis of dependent variables and different types of data Selecting the appropriate statistic.
11/25/2015Marketing Research2 Observation (Variables) Theory (Concepts)
The research process Psych 231: Research Methods in Psychology.
Chapter 6: Analyzing and Interpreting Quantitative Data
The research process Psych 231: Research Methods in Psychology.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
PSY6010: Statistics, Psychometrics and Research Design Professor Leora Lawton Spring 2007 Wednesdays 7-10 PM Room 204.
Chapter Eight: Quantitative Methods
Introduction to Validity True Experiment – searching for causality What effect does the I.V. have on the D.V. Correlation Design – searching for an association.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
1 PEER Session 02/04/15. 2  Multiple good data management software options exist – quantitative (e.g., SPSS), qualitative (e.g, atlas.ti), mixed (e.g.,
The research process Psych 231: Research Methods in Psychology.
Chapter 13 Understanding research results: statistical inference.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 7 Analyzing and Interpreting Quantitative Data.
The research process Psych 231: Research Methods in Psychology.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Research And Evaluation Differences Between Research and Evaluation  Research and evaluation are closely related but differ in four ways: –The purpose.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Lecture 11: Honours Thesis Structure
The Literature Search and Background of the Problem
Chapter Eight: Quantitative Methods
Research Methods & Statistics
Presentation transcript:

Introduction to Research Design Statlab Workshop, Fall 2010 Jeremy Green Nancy Hite

Outline of a paper Introduction Theory Data Description Analysis Conclusion

Identifying a Question Tradeoff between work in and results Tradeoff between work in and results Easy to do, trivial results Easy to do, trivial results Result is interesting, but difficulty is high Result is interesting, but difficulty is high New tools open up new questions New tools open up new questions New statistical or computational tools make formerly difficult questions approachable New statistical or computational tools make formerly difficult questions approachable New theory opens up new questions New theory opens up new questions

Introduction Topic Most general level Most general levelQuestion What is the question you want to answer? What is the question you want to answer? Be specific Be specific Ask only what you can answer Ask only what you can answer Review the Literature “Stay the course” “Stay the course”

Theory Categorize your theory Descriptive vs. causal Descriptive vs. causal Write down your theory In paragraph form In paragraph form Using a statistical model Using a statistical model

Hypothesis Identify testable hypotheses. - how well does the hypothesis test the theory - what is the counterfactual argument - What is the scope of the hypothesis test - Spurious factors, contamination, endogenous factors

Do you need statistics after all? Quantitative v. Qualitative research Quantitative v. Qualitative research Mixed methods Mixed methods Trade-off between various data generating styles Trade-off between various data generating styles Archival Archival Pre-existing data Pre-existing data Primary data collection Primary data collection Surveys Surveys Experiments Experiments Downstream analysis Downstream analysis Function (TIME, BUDGET, ACCESS, FEASABILITY) Function (TIME, BUDGET, ACCESS, FEASABILITY)

Methodological Concerns: Consort Checklist Intro- scientific background and explanation of rationale Intro- scientific background and explanation of rationale Eligibility criteria for participants and settings and locations where the data were collected Eligibility criteria for participants and settings and locations where the data were collected Interventions- Interventions- Objectives Objectives Outcomes Outcomes Samples size Samples size Randomization Randomization Blinding (masking) Blinding (masking) Statistical methods Statistical methods Results- Results- Recruitment Recruitment Baseline data Baseline data Numbers Numbers Estimation Estimation Ancillary analysis Ancillary analysis Adverse events - indicate whether there was opportunity for treatment to spill e control Adverse events - indicate whether there was opportunity for treatment to spill e control Discussion Discussion Interpretation Interpretation Generalizability Generalizability Overall evidence Overall evidence

Variables Dependent Variable ( response, outcome, criterion) Independent Variables ( explanatory or predictor variables) Treatment Variable Treatment Variable Covariates / Confounding Variables Covariates / Confounding Variables Categorical and Continuous Variables Remember: Types of variables we choose, determine the statistics we use

You need Data Think about analyses early! Collecting your own data Retrospective, prospective, experimental & observational methods Retrospective, prospective, experimental & observational methods Can find most data you’ll need on-line! Statlab Webpage ( Advisors Advisors Yale StatCat ( Yale StatCat ( ICPSR ( ICPSR ( Reference Librarian (Julie Linden) Reference Librarian (Julie Linden)

So, you want to make a survey Extensive on-line resources and software Extensive on-line resources and software Question types determine analyses Question types determine analyses Open vs. close ended questions, Likert scales, rank order data Open vs. close ended questions, Likert scales, rank order data Assumptions of normality Assumptions of normality Validity Validity Internal & External validity Internal & External validity Pilot testing Pilot testing You need variance to analyze! You need variance to analyze! Sample size Sample size It depends; power, effect size, cost (UCLA power calculator) It depends; power, effect size, cost (UCLA power calculator)

Once You’ve Found or Collected your data Download the data and documentation StatTransfer (Statlab) StatTransfer (Statlab) Determine data file type Probably a text file (.txt,.dat,.raw) Probably a text file (.txt,.dat,.raw) Converting text & delimited files Choose a statistical software program SPSS, Stata, SAS, Matlab, Excel, R, C++ SPSS, Stata, SAS, Matlab, Excel, R, C++

Managing your data Back up all Master Data Files CDR/CDRW, USB Key Codebook All codes All codes Adding variables, cases, computing new variables Adding variables, cases, computing new variables Keep a roadmap Keep a log of all analyses with what you have done Keep a log of all analyses with what you have done Save syntax files Save syntax files

Data Entry - Codebook Always create a codebook that contains: Always create a codebook that contains: Instructions for entering data Instructions for entering data Instructions for making decisions when data are ambiguous Instructions for making decisions when data are ambiguous Instructions for handling missing observation Instructions for handling missing observation Numerical codes you will use for categorical data Numerical codes you will use for categorical data General troubleshooting information General troubleshooting information Treat it as a working document Treat it as a working document

Cleaning your data In order to minimize errors while manually entering data, you can set ranges in Excel so that if a value outside the range is entered, the cell will change color. In order to minimize errors while manually entering data, you can set ranges in Excel so that if a value outside the range is entered, the cell will change color. To to this go to Format - Conditional Formatting and specify the ranges for which a different format should show up. To to this go to Format - Conditional Formatting and specify the ranges for which a different format should show up. Also, you can use the data validation options. Go to Data - Validation Also, you can use the data validation options. Go to Data - Validation

Keeping Track of Data Sets Ever time you make changes to your data, save it with the current date Ever time you make changes to your data, save it with the current date Keep a document with a list of the major changes with each version Keep a document with a list of the major changes with each version A good idea is to keep a folder with the original data sets and create different subfolders as you make changes to the data set. Sometimes it is also a good idea to keep a working directory for currently active files A good idea is to keep a folder with the original data sets and create different subfolders as you make changes to the data set. Sometimes it is also a good idea to keep a working directory for currently active files Always make backup copies Always make backup copies

Keeping Track of Syntaxes and Outputs Save all the syntax you write Save all the syntax you write Save all the output you produce and try to annotate it as much as possible Save all the output you produce and try to annotate it as much as possible Save your syntax and output with the data file name a brief description of the analyses and the current date Save your syntax and output with the data file name a brief description of the analyses and the current date Save syntax and output in a separate folder from your data Save syntax and output in a separate folder from your data

So, how do I analyze my data? Correlation Correlation allows you to quantify relationships between variables (r, r-squared) Correlation allows you to quantify relationships between variables (r, r-squared) Regression allows prediction of dependent variable based on one or more independent variables Regression allows prediction of dependent variable based on one or more independent variables Group differences t-test & ANOVA t-test & ANOVA Chi-square for categorical and frequency data Chi-square for categorical and frequency data Significance v. effect size More Complex Models

Take Away Messages 1) Determine your question, methods and statistics before you start 2) Keep a codebook of everything 3) Keep a log of all commands issued 4) Save data at every step 5) Ask for help 6) Don’t get in over your head