Data Analysis of Coded Chats Study of correlation and regression between different dimension variables Progress Report, VMT Meeting, Jan. 19 th 2005 Fatos Xhafa VMT Project
January 19 th, VMT Meeting Outline The variables under study Test for Normal distribution of variables Correlation between different variables Regression between different variables Discussion From statistical perspective From interaction based / CA perspective
January 19 th, VMT Meeting The variables under study Social Reference Pbm Solving Math Move - Still at the first level of analysis - The same sample of six powwows
January 19 th, VMT Meeting Test for Normal distributions (I) In correlation and regression variables under study are assumed to approximate a Normal distribution We tested the normality distribution of the dimension variables: Social reference Problem Solving Math Move
January 19 th, VMT Meeting Test for Normal distributions (II) Social reference dimension variable: Not a good approximation to Normal distribution Could be indicating outlier/s
January 19 th, VMT Meeting Test for Normal distributions (III) Social reference dimension variable: Pow18 shows to be an outlier After removing it from the sample a “perfect” approximation to Normal distribution is obtained
January 19 th, VMT Meeting Test for Normal distributions (IV) The Pbm Solving and Math Move show good approximations to Normal distribution Correlation and regression between: Social reference and Pbm Solving Social Reference and Math Move can be studied (pow18 excluded) Correlation and regression between: Pbm Solving and Math Move can be studied for the whole sample
January 19 th, VMT Meeting Correlations Percentage Social reference postings Percentage Pbm Solving postings Percentage Math Move postings Percentage Social reference postings Pearson Correlation (**) -.942(*) Sig. (2-tailed) N 555 Percentage Pbm Solving postings Pearson Correlation -.970(**)1.967(**) Sig. (2-tailed) N 555 Percentage Math Move postings Pearson Correlation -.942(*).967(**) 1 Sig. (2-tailed) N 555 ** Correlation is significant at the 0.01 level (2-tailed). * Correlation is significant at the 0.05 level (2-tailed).
January 19 th, VMT Meeting Regression: Social reference vs. Pbm Solving The two variables are strongly and negatively correlated (-.970) What type of correlation? How are they correlated?
January 19 th, VMT Meeting Regression: Social reference vs. Pbm Solving
January 19 th, VMT Meeting Analytically… Model Summary ModelR R Square Adjusted R SquareStd. Error of the Estimate 1.970(a) a Predictors: (Constant), Percentage Social reference postings ANOVA(b) Model Sum of Squaresdf Mean SquareFSig. 1Regression (a) Residual Total a Predictors: (Constant), Percentage Social reference postings b Dependent Variable: Percentage Pbm Solving postings
January 19 th, VMT Meeting Analytically… Coefficients(a) Model Unstandardized Coefficients Standard ized Coefficie ntstSig. B Std. ErrorBeta 1(Constant) Percentage Social reference postings a Dependent Variable: Percentage Pbm Solving postings
January 19 th, VMT Meeting Regression: Social reference vs. Math Move The two variables are strongly and negatively correlated (-.942) What type of correlation? How are they correlated?
January 19 th, VMT Meeting Regression: Social reference vs. Math Move
January 19 th, VMT Meeting Analytically… Model Summary ModelR R Square Adjusted R Square Std. Error of the Estimate 1.942(a) a Predictors: (Constant), Percentage Social reference postings ANOVA(b) Model Sum of Squaresdf Mean SquareFSig. 1Regression (a) Residual Total a Predictors: (Constant), Percentage Social reference postings b Dependent Variable: Percentage Math Move postings
January 19 th, VMT Meeting Analytically… Coefficients(a) Model Unstandardized Coefficients Standardize d CoefficientstSig. B Std. ErrorBeta 1(Constant) Percentage Social reference postings a Dependent Variable: Percentage Math Move postings
January 19 th, VMT Meeting Regression: Pbm Solving vs. Math Move The two variables are strongly and positively correlated (.967) What type of correlation? How are they correlated?
January 19 th, VMT Meeting Regression: Pbm Solving vs. Math Move
January 19 th, VMT Meeting Regression: Math Move vs. Pbm Solving
January 19 th, VMT Meeting Discussion: correlations (I) The Social reference is strongly and negatively correlated to Pbm Solving (-.970) and Math Move (-.942) The degree of the correlation may vary by enlarging the sample size The strong correlation indicates that such a tendency is expected: by enlarging the sample size (the sample was ‘randomly’ chosen) even if coders might have influenced the strong correlation Pow18 shows to be an outlier and requires a careful examination
January 19 th, VMT Meeting Discussion: correlations (II) Question1: Why the “production” of Social reference influences negatively the “production” of Pbm Solving and Math Move? A first interpretation The math pbm solving activity takes place during a fixed amount of time (roughly an hour). The more effort in “production” of Social Reference, less “production” of Math Question2: Does this have anything to do with “exploratory” vs. “expository” mode? e.g. pow2-1 vs. pow2-2 we see that there is a considerable “distance” between the two (cf. regression)
January 19 th, VMT Meeting Discussion: correlations (III) Study at the second level (subcategories) Two codes from Social Ref. dimension seem particularly interesting: References to individual actions vs. group actions seem to be a key point! Code: Individual reference = Any utterance with a reference to the self or another member. This refers to the collaboration in a broader sense (an activity that has been done or will be done by the self or another group member) Code: Group reference = Any utterance with a reference to the group. This refers to the collaboration in a broader sense (an activity that has been done or is assumed to be done or will be done by the group) Let’s look at pow2-1 vs. pow2-2
January 19 th, VMT Meeting Individual vs. group references in Pbm Solving POWWOW2-1POWWOW2-2 I thought of factoring (n + 2)^2 and n(n + 5) Pbm Solving (Tactic) & Individual Ref. we could find a range Pbm Solving (Tactic) & Group Ref.
January 19 th, VMT Meeting This leads to… Hypothesis: in “expository” powwows there is more Individual ref. than Group Ref. and, in “exploratory” powwows there is more Group Ref. than Individual ref. that we will study from Statistical approach (second level of analysis) distribution of freqs of individual vs. group refs distribution of freqs of other subcategories Thread analysis computing and visualizing individual-like threads and group- like threads and combinations of them CA approach
January 19 th, VMT Meeting Discussion: from CA perspective How does the “social activity” unfolds sequentially during the pbm solving? And, specifically, how does the individual vs. group reference unfolds?
January 19 th, VMT Meeting Discussion: from CA perspective (I) Handle Posting Soc. Ref Pbm Solving Math Move AVR it's okay PIN hahaa Ss SUP my internet explorer wouldnt open PIN ena you gotta hurtet! Ci PIN haha jk Ss PIN hurry* AVR so now for the new triangle we have: = 1/2bh CgPGeo AVR do you follow me? Cg PIN hey its ChNc PIN cuz look AVR Rs AVR and do the calculation PIN we agree it is CgChNc SUP then einstein over here was confusing me Io PIN or no? AVR yes we do CgCh Powwow2-1
January 19 th, VMT Meeting Discussion: from CA perspective (II) Handle Posting Soc. Ref.Pbm Solv.Math Move REA I got 15 R MCP I'm getting 15 also ChNc REA I'll explain Ci AH3 Yep, that's right– I got 15 also CiChNc REA now AH3 For the extra, let REA first i got the area to both triangles CiTGeo REA With the first one with edgelengths of 9 REA I used the fourmla CiPGeo Powwow2-2
January 19 th, VMT Meeting Discussion: from CA perspective (III) HandlePosting Soc. Ref Pbm Solving Math Move AVR so now we add the two areas CgT SUP just a little PIN its RNc AVR exactly Ch AVR or as I got it :-) CiRe AVR multiply it by two PNc AVR and you get = bh CiPGeo PIN we should get the exact measure ment CgChNc Powwow2-1
January 19 th, VMT Meeting Discussion: from CA perspective (IV) Handle Posting Soc. Ref Pbm Sol Math Move OFF hey... Gr SUP what do we fdomwith the area Ss AVR off spring do SO not rule! OFF lol Ss AVR especially if you are a woman! AVR no jk jk Ss OFF lol Ss OFF im no woman PIN lol Ss AVR well I am SUP hey hey SUP women are great Ss GER why don't the three old timers explain what you have figured out OFF oh AVR women are great... Ss SUP ok AVR but pain-enduring SUP they wont explain it to me Cg AVR okay, let's explain Cg Powwow2-1
January 19 th, VMT Meeting Discussion: regression Significant linear regressions between: Social reference and Pbm Solving Social reference and Math Move Pbm Solving and Math Move Coefficients in each equation show the estimation for each case.
January 19 th, VMT Meeting Annex
January 19 th, VMT Meeting LLR Smoother (for the whole sample) A smoother is a trend line that shows how the two variables (X and Y) are related to one another. It is not a statistical test !!! of the relationship of X and Y, although in most cases it is possible to infer the practical significance of the relationship.
January 19 th, VMT Meeting Correlation Pbm Solving vs. Math Move (without removing pow18) Correlations Percenta ge Pbm Solving postings Percenta ge Math Move postings Percentage Pbm Solving postings Pearson Correlation 1.956(**) Sig. (2-tailed)..003 N 66 Percentage Math Move postings Pearson Correlation.956(**) 1 Sig. (2-tailed).003. N 66 ** Correlation is significant at the 0.01 level (2-tailed).
January 19 th, VMT Meeting Individual vs. group action references in Social Activity (count; for percents look at slide 23) POWWOW2-1POWWOW2-2 I thought of factoring (n + 2)^2 and n(n + 5) Pbm Solving (Tactic) & Individual Ref. we could find a range Pbm Solving (Tactic) & Group Ref. Check Orientation Perform Result Restate Reflect Strategy Tactic Count Social Ref. Collaboration group Collaboration individual Identify self Resource Composition of Pbm Solving in terms of Social Reference