DataToText: A Consumer-Oriented Approach to Data Analysis David A. Kenny University of Connecticut

Data Analysis for Methodologists Begin with a hypothesis that is embedded within a statistical model. Develop a research and measurement design to estimate the parameters of that model. Gather data; test model assumptions; and estimate the model's parameters. Choose the best model.

Data Analysis for Practitioners What do I have to click in SPSS? What from my SPSS output do I put where in my results section?

Consumer versus Industry Perspective For data analysis, quantitative psychologists are the industry or at least part of the industry. The spirit of DataToText is an attempt to get us to become more consumer-oriented.

DataToText Project Have the researcher tell DataToText what the research question is. DataToText performs the requisite analyses. DataToText gives the results from those analyses: computer output and a written description.
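
The "written description" step can be illustrated with a minimal sketch. This is not the DataToText macro itself; the function name `describe_effect`, its parameters, the wording of the sentences, and the 0.05 cutoff are all assumptions made for illustration, and the numbers in the example call are invented.

```python
def describe_effect(predictor: str, outcome: str, b: float, p: float,
                    alpha: float = 0.05) -> str:
    """Turn one regression coefficient into a plain-English sentence.

    Hypothetical helper: the sentence templates and the alpha cutoff are
    illustrative assumptions, not the wording used by DataToText.
    """
    if p >= alpha:
        return (f"The effect of {predictor} on {outcome} is not statistically "
                f"significant (b = {b:.2f}, p = {p:.3f}).")
    direction = "increases" if b > 0 else "decreases"
    return (f"For every one-unit increase in {predictor}, {outcome} {direction} "
            f"by {abs(b):.2f} units (p = {p:.3f}).")


# Invented values, used only to show the output format.
print(describe_effect("X", "Y", 0.5, 0.01))
```

The design point is that the text is generated from the same numbers that appear in the computer output, so the two cannot drift apart.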

Example 1 Moderation Analysis Livi et al.: The effect of Noise Sensitivity (X) on Stress (Y) is moderated by the Need for Cognitive Closure (M)
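
For readers unfamiliar with moderation, the usual way to test it is a regression with a product term. The slides do not show the exact specification the macro uses, so the equation below is the standard textbook form, with coefficient names $b_0, \dots, b_3$ chosen here for illustration.

```latex
% Standard moderated regression (assumed form; not taken from the slides)
Y = b_0 + b_1 X + b_2 M + b_3 X M + e

% Effect of X on Y at a given value of the moderator M (the "simple slope")
\frac{\partial Y}{\partial X} = b_1 + b_3 M
```

In this form, moderation is present when $b_3$ differs from zero: the effect of Noise Sensitivity on Stress then depends on the level of Need for Cognitive Closure.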

Moderation Example Syntax. Gives results in words, tables, and pictures. Tests assumptions. Gives warnings. Specialized output.

Example 2 [Figure: path diagram. Variables: How Positively the Wife Sees the Husband, Wife Satisfaction, Husband Satisfaction. Paths: Actor Wife, Actor Husband, Partner Husband, Partner Wife.] Syntax. Picture. Specialized Output.
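
The Actor and Partner labels in Example 2 follow the usual actor-partner logic: an actor effect is the effect of a person's own predictor on that person's own outcome, and a partner effect is the effect of the other person's predictor on that outcome. A generic sketch of the two equations is given below; the symbols ($X_w$, $X_h$ for each spouse's predictor score, $Y_w$, $Y_h$ for each spouse's satisfaction, $a$ and $p$ for actor and partner effects) are assumptions for illustration and are not taken from the slide.

```latex
% Generic actor-partner equations for a husband-wife dyad (assumed notation)
Y_w = c_w + a_w X_w + p_w X_h + e_w
Y_h = c_h + a_h X_h + p_h X_w + e_h
```

The two residuals $e_w$ and $e_h$ are usually allowed to correlate, because members of the same couple are not independent observations.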

Advantages and Disadvantages

DataToText is Mindless! Thought and intelligence are needed for: What analysis to do The execution of the analysis The interpretation of the analysis

High school student Jenna Smith randomly throwing variables into the macro.

Researcher Brad Anderson not knowing anything about testing moderation.

Definitely bad, probably terrible. However, it might be better than what would have otherwise been done.

Additionally Sometimes warnings may alert the user that the wrong analysis was done. For example, both macros provide a warning if a dichotomous variable is analyzed.

DataToText is Mindless! about: What analysis to do The execution of the analysis The interpretation of the analysis

Data Analysis Requires Thought Not all problems have a flow-chart structure: CFA, ARIMA modeling

However Some problems have a flow-chart structure (though we might disagree somewhat about that flow-chart). For many analyses we do have explicit or implicit standards for reporting of results.

Also Keeps everything straight. Avoids errors. Although DataToText may typically fail to perform the best analysis, it might often create a better analysis than that done by even a skilled data analyst. Provides warnings. Makes assumptions explicit and can provide statistical tests of some of them.

Warnings DataToText issues warnings for: a dichotomous outcome variable, outliers, collinearity, low power.
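
Checks of this kind are simple to automate. The sketch below is not the macro's code: the function names `pearson_r` and `check_data`, the cutoffs (3 standard deviations, |r| > .9, n < 30), and the message wording are all assumptions chosen for illustration, and the sample-size rule is only a crude stand-in for a real power analysis.

```python
from statistics import mean, stdev


def pearson_r(x, y):
    """Pearson correlation; returns 0.0 if either variable is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def check_data(outcome, predictors, z_cut=3.0, r_cut=0.9, min_n=30):
    """Return a list of warning strings (all thresholds are assumptions)."""
    warnings = []
    # Dichotomous outcome: exactly two distinct values.
    if len(set(outcome)) == 2:
        warnings.append("Outcome is dichotomous.")
    # Outliers: outcome values far from the mean in SD units.
    m, s = mean(outcome), stdev(outcome)
    n_out = sum(1 for y in outcome if s > 0 and abs(y - m) / s > z_cut)
    if n_out:
        warnings.append(
            f"{n_out} outcome value(s) lie more than {z_cut} SDs from the mean.")
    # Collinearity: any pair of predictors correlated above the cutoff.
    names = list(predictors)
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            r = pearson_r(predictors[a], predictors[b])
            if abs(r) > r_cut:
                warnings.append(
                    f"Predictors {a} and {b} are highly correlated (r = {r:.2f}).")
    # Low power: crude proxy based on sample size only.
    if len(outcome) < min_n:
        warnings.append(f"Small sample (n = {len(outcome)}); power may be low.")
    return warnings


# Invented data, used only to trigger the warnings.
for msg in check_data([0, 1, 0, 1, 1, 0],
                      {"x1": [1, 2, 3, 4, 5, 6], "x2": [2, 4, 6, 8, 10, 12.5]}):
    print(msg)
```

Returning the warnings as a list, rather than stopping the analysis, matches the spirit of the slides: the analysis still runs, but the user is told it may be the wrong one.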

Assumption Testing Is Like Flossing Something we know that we should do but do not do as often as we should. DataToText can test certain assumptions: level of measurement, distributional, outliers.

DataToText is Mindless! about: What analysis to do The execution of the analysis The interpretation of the analysis

The Researcher Needs to Understand the Results Good data analysis is more than doing the right analysis; the researcher must still understand the meaning of the results.

However Because DataToText provides a verbal summary of the results, some researchers might better understand their results by using the approach.

Moreover Output produced by DataToText may be more intelligible to the reader, and so, although the data analyst may not understand the results, the reader may be able to!

Packages vs. Open Source Packages: PASW (née SPSS), SAS. Accessible but costly. Updates sometimes without backwards compatibility. Open Source: R

Feedback Please let me know what you think.

Special Thanks to My Lab Thomas Ledermann Amanda Snook Randi Garcia