Analyzing Chat Dialogue with Taghelper Tools Catherine Chase Stanford University PSLC Summer Institute June 22, 2007.

Slides:



Advertisements
Similar presentations
Literacy Subject Leaders
Advertisements

Summer Institute 2014 Technology Integration Matrix Laurie Kirkland.
What is Differentiation?
An adaptive hierarchical questionnaire based on the Index of Learning Styles Alvaro Ortigosa, Pedro Paredes, Pilar Rodriguez Universidad Autónoma de Madrid.
Inclusion Literature & Bibliotherapy Marina Louari Marita Paparousi
Academic Warning B Status Standing Workshop What is Academic Warning? How did I get here? How do I get off Academic Warning? The Center For Educational.
What I Learned In CIS 101! By, Billy Johns. Excel is a GREAT program to create spreadsheets!!! I learned how to create cells, edit cells, and entering.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
September 23, mood vs. tone Ho mework: Story elements test on Monday, September 29th! Bring a copy of Stargirl by this Friday! Ob jective I can.
ENGINEERING YOUR FUTURE
Click to edit the title text format An Introduction to TuTalk: Developing Dialogue Agents for Learning Studies Pamela Jordan University of Pittsburgh Learning.
Student views about homework Toby Leonard Eleanor Scott Jessica Hollis Nima Habibzadeh Lauren Danbury Alice Ward.
Peer Leader Selection and Training. Peer Leader Selection  Important attributes Excellent interpersonal skills: Interactive, communicative, supportive,
How to Make a Survey.
Content Analysis Renee Meyers UWS Leadership Site April 08.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
What is Academic Warning? How did I get here? How do I get off Academic Warning? The Center For Educational and Retention Counseling Presents Academic.
Computational Methods to Vocalize Arabic Texts H. Safadi*, O. Al Dakkak** & N. Ghneim**
1.4 MULTIPLICATION WITH NEGATIVE NUMBERS 1.5 DIVISION WITH NEGATIVE NUMBERS Catherine Conway Math 081.
Short Introduction to Machine Learning Instructor: Rada Mihalcea.
Kristina Lerman Aram Galstyan USC Information Sciences Institute Analysis of Social Voting Patterns on Digg.
Evaluating the Effect of Social Conversation on Learning, Interaction, and Perceived Interdependence in a Collaborative Math Problem Solving Environment.
TagHelper: Basics Part 1 Carolyn Penstein Rosé Carnegie Mellon University Funded through the Pittsburgh Science of Learning Center and The Office of Naval.
1 / 12 PSLC Summer School, June 21, 2007 Identifying Students’ Gradual Understanding of Physics Concepts Using TagHelper Tools Nava L.
TagHelper and InfoMagnets Technologies for Exploring the effect of Language Interactions in Learning Carolyn Penstein Rosé, Jaime Arguello, Yue Cui, Rohit.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Click to edit the title text format An Introduction to TuTalk: Developing Dialogue Agents for Learning Studies Pamela Jordan University of Pittsburgh Learning.
1 Statistical Techniques Chapter Linear Regression Analysis Simple Linear Regression.
Noboru Matsuda Human-Computer Interaction Institute
湖南长郡卫星远程学校 2013 年下学期制作 13. 湖南长郡卫星远程学校 2013 年下学期制作 13 Discussion: What words or phrases can we use to describe our negative emotions?
PIER Research Methods Protocol Analysis Module Hua Ai Language Technologies Institute/ PSLC.
Personal Identity Mind/Soul Brain Body Memory Story.
My 1 st Grade Year. I will be going to a new school next year. My new school will be Tucker Elementary in Ponchatoula, Louisiana.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
Warm up Pick 1 of the sentences below to complete I am proud of my ability to… If I could change anything about myself, it would be….
Carolyn Penstein Rosé Language Technologies Institute Human-Computer Interaction Institute School of Computer Science With funding from the National Science.
Design Reuse Earlier we have covered the re-usable Architectural Styles as design patterns for High-Level Design. At mid-level and low-level, design patterns.
Evaluation of DBMiner By: Shu LIN Calin ANTON. Outline  Importing and managing data source  Data mining modules Summarizer Associator Classifier Predictor.
Machine Learning: Decision Trees Homework 4 assigned courtesy: Geoffrey Hinton, Yann LeCun, Tan, Steinbach, Kumar.
Dealing with Problems in Daily Life – Unit 337
Machine Learning in Practice Lecture 6 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
High Throughput and Programmable Online Traffic Classifier on FPGA Author: Da Tong, Lu Sun, Kiran Kumar Matam, Viktor Prasanna Publisher: FPGA 2013 Presenter:
Creative Problem Solving Yadira Rosales and Daisy Padilla CCE 576.
Pupils’ Perceptions as they end KS3 Mathematics Sheila Smyth.
I believe that mathematical reality lies outside us, … our function is to discover or observe it, and that … which we describe … as our 'creations' are.
Using PLCs to support MTSS Florida Association of Staff Developers September 24, 2012 Florida Association of Staff Developers September 24, 2012.
AZURE MACHINE LEARNING Bringing New Value To Old Data SQL Saturday #
The pre- post assessment 3 points1 point Introduction Purpose Learning targets Instruments Eliminate bias & distortion Writing conventions.
Develop and assess new curriculum material for middle school science classes Link genomics and evolution Relate learning to community health Incorporate.
TagHelper Track Overview Carolyn Penstein Rosé Carnegie Mellon University Language Technologies Institute & Human-Computer Interaction Institute School.
Accountability Pillar: Continuous Improvement – School Improvement Detail.
Research in natural settings 2 Qualitative research: Surveys and fieldwork Macau University of Science and Technology.
Introduction to Machine Learning, its potential usage in network area,
The Three-Minute Classroom Walk-Through
Video Enhanced Observation in teacher development
What is a CAT? What is a CAT?.
ME 142 Engineering Computation I
Presented by Khawar Shakeel
Advanced data mining with TagHelper and Weka
The Uh-Oh Syndrome with Steve Robbins
INFORMATION AND PROGRESS
Title: Validating a theoretical framework for describing computer programming processes 29 November 2017.
Introduction to Data Mining, 2nd Edition by
An Inteligent System to Diabetes Prediction
Change your words Change your mindset.
Discourse Analytics Carolyn Penstein Rosé
GCSE English Language 2017/18 Session 5
Machine Learning in Practice Lecture 6
All About Me By Raeem.
Meet SIP I am the School Improvement Plan.
Presentation transcript:

Analyzing Chat Dialogue with Taghelper Tools Catherine Chase Stanford University PSLC Summer Institute June 22, 2007

Study Design (Kumar et al., 2007) 16 dyads of 6 th grade students Math Talk tutor built with CTAT and Tutalk Two conditions: Mathtalk with social dialogue enhancers Mathtalk alone Dependent measure analyzed: Chat dialogue from collaborative Math Talk sessions Kumar, R., Gweon, G., Joshi, M., Cui, Y., & Rose, C. (2007). Supporting Students Working Together on Math with Social Dialogue.

Math Talk

Social Dialogue Agent Tutor asks Student1: “Do you like tacos or hamburgers?” Tutor asks Student2: “Do you like to hang out on the weekends or after school?” Tutor uses students responses to generate a story problem: Alice thought she bought enough tacos for 11 of her friends one day on the weekend. But 8 of her friends finished 2/3 of the food. How many times more food should she have bought?

Research Questions Do social dialogue agents affect students’ sense of belonging to a group? Do social dialogue agents affect the attributions students make towards solutions to a math problem? Do social dialogue agents impact student affect?

The Process Data clean-up in Excel Developed and applied coding schemes Created coding models using TagHelper Tools Evaluated model performance Debugged model Altered features in model creation Coded more data or different types of data

Coding Scheme: Sense of belonging Trained a model to code for group, self, and partner-referenced language using only simple user-defined rules Group = we, us, our, ours, ourselves Self = I, I am, I’m, Im, my, mine, me Partner = you, u, your, yours, you’re Using a decision tree algorithm, the model achieved 94% reliability

Coding Scheme: Attributions Trained a model to code for attributions to any problem-solving step using a set of pre- coded data Group: “we did it!” Self: “I figured it out” Partner: “u multiplied by the wrong number” Only achieved reliability of 34% Fewer instances of attributions in the data More ambiguous language made it more difficult for the program to find meaningful patterns Is “nice work!” an attribution to the partner?

Coding Scheme: Affect Part 1 Coded for specific types of affective states or behaviors Insult: “you are stoopid”, “FOOL” Annoyance: “This is annoying” Boredom: “I am bored” Achieved low reliability level of 39% The model used meaningless keywords like “it” and “I” to classify codes into certain categories Sample of coded data for model training was not representative of the full distribution of examples within each category

Coding Scheme: Affect Part 2 Coded broader categories of positive and negative emotion Positive: “sweet”, “awesome!”, “yeah!” Negative: “this is annoying”, “you are stoopid”, “stop it” Achieved high reliability level of 69%, however, data coded by the trained model contained many incorrectly classified cases Coded as Negative by the model: “I know that” Coded as Positive by the model: “it should be 12!” Coded data for training was not representative of all instances of positive/negative affect

Next Steps Conduct statistical analyses on sense of belonging data Continue to “debug” training models for attribution and affect coding schemes Code more data Code more varied examples within data Test out different algorithm types and user- defined features Examine erroneous coding to determine problems with the model

Lessons Learned  Both typical and atypical coded examples are necessary to train an accurate model  Experimentation with various features and algorithms often leads to successful models  TagHelper is particularly useful for:  large data sets  training a model to analyze multiple data sets

Special thanks to… Carolyn Rosé Rohit Kumar Pittsburgh Science of Learning Center