Data Analysis of Coded Chats Study of correlation and regression between different dimension variables Progress Report, VMT Meeting, Jan. 19 th 2005 Fatos.

Slides:



Advertisements
Similar presentations
A Brief Introduction to Spatial Regression
Advertisements

Lesson 10: Linear Regression and Correlation
C 3.7 Use the data in MEAP93.RAW to answer this question
Correlation and Regression By Walden University Statsupport Team March 2011.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Ch11 Curve Fitting Dr. Deshi Ye
Quantitative Techniques
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
The Simple Regression Model
Examining Relationship of Variables  Response (dependent) variable - measures the outcome of a study.  Explanatory (Independent) variable - explains.
SIMPLE LINEAR REGRESSION
REGRESSION AND CORRELATION
Introduction to Probability and Statistics Linear Regression and Correlation.
Ch 11: Correlations (pt. 2) and Ch 12: Regression (pt.1) Nov. 13, 2014.
Correlation and Regression Analysis
Linear Regression 2 Sociology 5811 Lecture 21 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
By Jayelle Hegewald, Michele Houtappels and Melinda Gray 2013.
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Correlation and Linear Regression
Example of Simple and Multiple Regression
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
Regression and Correlation Methods Judy Zhong Ph.D.
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, November 19 Chi-Squared Test of Independence.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
ASSOCIATION BETWEEN INTERVAL-RATIO VARIABLES
Relationships between Variables. Two variables are related if they move together in some way Relationship between two variables can be strong, weak or.
Simple Linear Regression Models
CORRELATION & REGRESSION
Correlation.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Regression. Correlation and regression are closely related in use and in math. Correlation summarizes the relations b/t 2 variables. Regression is used.
Ch4 Describing Relationships Between Variables. Pressure.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
CHAPTER 14 MULTIPLE REGRESSION
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
 Relationship between education level, income, and length of time out of school  Our new regression equation: is the predicted value of the dependent.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Multiple regression.
Data Analysis of Coded Chats Tracking the collaboration activity of a student in different powwows Progress Report, VMT Meeting, Feb. 2 nd, 2005 Fatos.
Explorative Thread-based Analysis of Patterns of Collaborative Interaction in Chat Nan Zhou Murat Cakir.
رگرسیون چندگانه Multiple Regression
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Modeling in R Sanna Härkönen.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
CHAPTER 29: Multiple Regression*
Linear Regression.
Example 1 5. Use SPSS output ANOVAb Model Sum of Squares df
Correlation and Simple Linear Regression
Simple Linear Regression and Correlation
Algebra Review The equation of a straight line y = mx + b
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Data Analysis of Coded Chats Study of correlation and regression between different dimension variables Progress Report, VMT Meeting, Jan. 19 th 2005 Fatos Xhafa VMT Project

January 19 th, VMT Meeting Outline The variables under study Test for Normal distribution of variables Correlation between different variables Regression between different variables Discussion  From statistical perspective  From interaction based / CA perspective

January 19 th, VMT Meeting The variables under study Social Reference Pbm Solving Math Move - Still at the first level of analysis - The same sample of six powwows

January 19 th, VMT Meeting Test for Normal distributions (I) In correlation and regression variables under study are assumed to approximate a Normal distribution We tested the normality distribution of the dimension variables:  Social reference  Problem Solving  Math Move

January 19 th, VMT Meeting Test for Normal distributions (II) Social reference dimension variable:  Not a good approximation to Normal distribution  Could be indicating outlier/s

January 19 th, VMT Meeting Test for Normal distributions (III) Social reference dimension variable:  Pow18 shows to be an outlier  After removing it from the sample a “perfect” approximation to Normal distribution is obtained

January 19 th, VMT Meeting Test for Normal distributions (IV) The Pbm Solving and Math Move show good approximations to Normal distribution Correlation and regression between:  Social reference and Pbm Solving  Social Reference and Math Move can be studied (pow18 excluded) Correlation and regression between:  Pbm Solving and Math Move can be studied for the whole sample

January 19 th, VMT Meeting Correlations Percentage Social reference postings Percentage Pbm Solving postings Percentage Math Move postings Percentage Social reference postings Pearson Correlation (**) -.942(*) Sig. (2-tailed) N 555 Percentage Pbm Solving postings Pearson Correlation -.970(**)1.967(**) Sig. (2-tailed) N 555 Percentage Math Move postings Pearson Correlation -.942(*).967(**) 1 Sig. (2-tailed) N 555 ** Correlation is significant at the 0.01 level (2-tailed). * Correlation is significant at the 0.05 level (2-tailed).

January 19 th, VMT Meeting Regression: Social reference vs. Pbm Solving The two variables are strongly and negatively correlated (-.970) What type of correlation? How are they correlated?

January 19 th, VMT Meeting Regression: Social reference vs. Pbm Solving

January 19 th, VMT Meeting Analytically… Model Summary ModelR R Square Adjusted R SquareStd. Error of the Estimate 1.970(a) a Predictors: (Constant), Percentage Social reference postings ANOVA(b) Model Sum of Squaresdf Mean SquareFSig. 1Regression (a) Residual Total a Predictors: (Constant), Percentage Social reference postings b Dependent Variable: Percentage Pbm Solving postings

January 19 th, VMT Meeting Analytically… Coefficients(a) Model Unstandardized Coefficients Standard ized Coefficie ntstSig. B Std. ErrorBeta 1(Constant) Percentage Social reference postings a Dependent Variable: Percentage Pbm Solving postings

January 19 th, VMT Meeting Regression: Social reference vs. Math Move The two variables are strongly and negatively correlated (-.942) What type of correlation? How are they correlated?

January 19 th, VMT Meeting Regression: Social reference vs. Math Move

January 19 th, VMT Meeting Analytically… Model Summary ModelR R Square Adjusted R Square Std. Error of the Estimate 1.942(a) a Predictors: (Constant), Percentage Social reference postings ANOVA(b) Model Sum of Squaresdf Mean SquareFSig. 1Regression (a) Residual Total a Predictors: (Constant), Percentage Social reference postings b Dependent Variable: Percentage Math Move postings

January 19 th, VMT Meeting Analytically… Coefficients(a) Model Unstandardized Coefficients Standardize d CoefficientstSig. B Std. ErrorBeta 1(Constant) Percentage Social reference postings a Dependent Variable: Percentage Math Move postings

January 19 th, VMT Meeting Regression: Pbm Solving vs. Math Move The two variables are strongly and positively correlated (.967) What type of correlation? How are they correlated?

January 19 th, VMT Meeting Regression: Pbm Solving vs. Math Move

January 19 th, VMT Meeting Regression: Math Move vs. Pbm Solving

January 19 th, VMT Meeting Discussion: correlations (I) The Social reference is strongly and negatively correlated to Pbm Solving (-.970) and Math Move (-.942) The degree of the correlation may vary by enlarging the sample size The strong correlation indicates that such a tendency is expected:  by enlarging the sample size (the sample was ‘randomly’ chosen)  even if coders might have influenced the strong correlation Pow18 shows to be an outlier and requires a careful examination

January 19 th, VMT Meeting Discussion: correlations (II) Question1: Why the “production” of Social reference influences negatively the “production” of Pbm Solving and Math Move? A first interpretation  The math pbm solving activity takes place during a fixed amount of time (roughly an hour).  The more effort in “production” of Social Reference, less “production” of Math  Question2: Does this have anything to do with “exploratory” vs. “expository” mode?  e.g. pow2-1 vs. pow2-2  we see that there is a considerable “distance” between the two (cf. regression)

January 19 th, VMT Meeting Discussion: correlations (III) Study at the second level (subcategories)  Two codes from Social Ref. dimension seem particularly interesting:  References to individual actions vs. group actions seem to be a key point! Code: Individual reference = Any utterance with a reference to the self or another member. This refers to the collaboration in a broader sense (an activity that has been done or will be done by the self or another group member) Code: Group reference = Any utterance with a reference to the group. This refers to the collaboration in a broader sense (an activity that has been done or is assumed to be done or will be done by the group) Let’s look at pow2-1 vs. pow2-2

January 19 th, VMT Meeting Individual vs. group references in Pbm Solving POWWOW2-1POWWOW2-2 I thought of factoring (n + 2)^2 and n(n + 5)  Pbm Solving (Tactic) & Individual Ref. we could find a range  Pbm Solving (Tactic) & Group Ref.

January 19 th, VMT Meeting This leads to… Hypothesis:  in “expository” powwows there is more Individual ref. than Group Ref. and,  in “exploratory” powwows there is more Group Ref. than Individual ref. that we will study from  Statistical approach (second level of analysis)  distribution of freqs of individual vs. group refs  distribution of freqs of other subcategories  Thread analysis  computing and visualizing individual-like threads and group- like threads and combinations of them  CA approach

January 19 th, VMT Meeting Discussion: from CA perspective How does the “social activity” unfolds sequentially during the pbm solving? And, specifically, how does the individual vs. group reference unfolds?

January 19 th, VMT Meeting Discussion: from CA perspective (I) Handle Posting Soc. Ref Pbm Solving Math Move AVR it's okay PIN hahaa Ss SUP my internet explorer wouldnt open PIN ena you gotta hurtet! Ci PIN haha jk Ss PIN hurry* AVR so now for the new triangle we have: = 1/2bh CgPGeo AVR do you follow me? Cg PIN hey its ChNc PIN cuz look AVR Rs AVR and do the calculation PIN we agree it is CgChNc SUP then einstein over here was confusing me Io PIN or no? AVR yes we do CgCh Powwow2-1

January 19 th, VMT Meeting Discussion: from CA perspective (II) Handle Posting Soc. Ref.Pbm Solv.Math Move REA I got 15 R MCP I'm getting 15 also ChNc REA I'll explain Ci AH3 Yep, that's right– I got 15 also CiChNc REA now AH3 For the extra, let REA first i got the area to both triangles CiTGeo REA With the first one with edgelengths of 9 REA I used the fourmla CiPGeo Powwow2-2

January 19 th, VMT Meeting Discussion: from CA perspective (III) HandlePosting Soc. Ref Pbm Solving Math Move AVR so now we add the two areas CgT SUP just a little PIN its RNc AVR exactly Ch AVR or as I got it :-) CiRe AVR multiply it by two PNc AVR and you get = bh CiPGeo PIN we should get the exact measure ment CgChNc Powwow2-1

January 19 th, VMT Meeting Discussion: from CA perspective (IV) Handle Posting Soc. Ref Pbm Sol Math Move OFF hey... Gr SUP what do we fdomwith the area Ss AVR off spring do SO not rule! OFF lol Ss AVR especially if you are a woman! AVR no jk jk Ss OFF lol Ss OFF im no woman PIN lol Ss AVR well I am SUP hey hey SUP women are great Ss GER why don't the three old timers explain what you have figured out OFF oh AVR women are great... Ss SUP ok AVR but pain-enduring SUP they wont explain it to me Cg AVR okay, let's explain Cg Powwow2-1

January 19 th, VMT Meeting Discussion: regression Significant linear regressions between:  Social reference and Pbm Solving  Social reference and Math Move  Pbm Solving and Math Move  Coefficients in each equation show the estimation for each case.

January 19 th, VMT Meeting Annex

January 19 th, VMT Meeting LLR Smoother (for the whole sample) A smoother is a trend line that shows how the two variables (X and Y) are related to one another. It is not a statistical test !!! of the relationship of X and Y, although in most cases it is possible to infer the practical significance of the relationship.

January 19 th, VMT Meeting Correlation Pbm Solving vs. Math Move (without removing pow18) Correlations Percenta ge Pbm Solving postings Percenta ge Math Move postings Percentage Pbm Solving postings Pearson Correlation 1.956(**) Sig. (2-tailed)..003 N 66 Percentage Math Move postings Pearson Correlation.956(**) 1 Sig. (2-tailed).003. N 66 ** Correlation is significant at the 0.01 level (2-tailed).

January 19 th, VMT Meeting Individual vs. group action references in Social Activity (count; for percents look at slide 23) POWWOW2-1POWWOW2-2 I thought of factoring (n + 2)^2 and n(n + 5)  Pbm Solving (Tactic) & Individual Ref. we could find a range  Pbm Solving (Tactic) & Group Ref. Check Orientation Perform Result Restate Reflect Strategy Tactic Count Social Ref. Collaboration group Collaboration individual Identify self Resource Composition of Pbm Solving in terms of Social Reference