PSLC DataShop Introduction Slides current to DataShop version 4.1.8 John Stamper DataShop Technical Director.

Slides:



Advertisements
Similar presentations
Copyright © 2006 Knovel Corporation Streamline Your Science and Engineering Research
Advertisements

PSLC DataShop Closing Remarks Slides current to DataShop version John Stamper DataShop Technical Director.
Educational data mining overview & Introduction to Exploratory Data Analysis Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction.
Improving learning by improving the cognitive model: A data- driven approach Cen, H., Koedinger, K., Junker, B. Learning Factors Analysis - A General Method.
Slide Set I: PARS Overview
©2012 Carnegie Learning, Inc. In-vivo Experimentation Steve Ritter Founder and Chief Scientist Carnegie Learning.
Importing and Exporting DataShop Data Slides current to DataShop version Brett Leber Interaction Designer.
Data mining with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction & Psychology Carnegie Mellon University Ryan S.J.d.
© 2004, The Trustees of Indiana University 1 OneStart Workflow Basics Brian McGough, Manager, Systems Integration, UITS Ryan Kirkendall, Lead Developer.
Educational Data Mining Overview John Stamper PSLC Summer School /25/2011 1PSLC Summer School 2011.
Computers Are Your Future © 2008Prentice-Hall, Inc.
Educational data mining overview & Introduction to Exploratory Data Analysis with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer.
Introduction to Systems Analysis and Design
 MODERN DATABASE MANAGEMENT SYSTEMS OVERVIEW BY ENGINEER BILAL AHMAD
DataShop: An Educational Data Mining Platform for the Learning Science Community John Stamper Pittsburgh Science of Learning Center Human-Computer Interaction.
John Stamper Human-Computer Interaction Institute Carnegie Mellon University Technical Director Pittsburgh Science of Learning Center DataShop.
Customer Portal – Customer User. You will receive an indicating that your Customer Portal registration is complete. A link to the Customer Portal,
FPDS- NG Reports Overview December 16, Today’s Goals Provide an overview of the FPDS-NG reporting capability Demonstrate each of the reporting tools.
Concept demo System dashboard. Overview Dashboard use case General implementation ideas Use of MULE integration platform Collection Aggregation/Factorization.
Introduction to database systems
OCAN College Access Program Data Submissions Vonetta Woods HEI Analyst, Ohio Board of Regents
PSLC DataShop Introduction Slides current to DataShop version John Stamper DataShop Technical Director.
Introduction With TimeCard users can tag SharePoint events with information that converts them into time sheets. This way they can report.
School Census Summer 2008 for Secondary Schools Jim Haywood – Version 1.1.
DataShop v7.1 Release Event Friday, November 1, 2013 LearnLabdatashop.org LearnLab
PSLC DataShop Introduction Slides current to DataShop version John Stamper DataShop Technical Director.
Administrator – Employee Overview September, 2011.
Instructors begin using McGraw-Hill’s Homework Manager by creating a unique class Web site in the system. The Class Homepage becomes the entry point for.
TPM Software within Good Spirit School Division. TPM Software is an integrated Student Services Software Solution Forms / Printouts / Reports Integrated.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
1 NetInsight Demonstration 10 October 2007
Weka: Experimenter and Knowledge Flow interfaces Neil Mac Parthaláin
Database Applications – Microsoft Access Lesson 1 Introduction 26 Slides in Presentation Updated 8/12.
DataShop Import Workshop Tuesday, June 14, 2011 pslcdatashop.org PSLC
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
MEMBERSHIP AND IDENTITY Active server pages (ASP.NET) 1 Chapter-4.
Create Content Capture Content Review Content Edit Content Version Content Version Content Translate Content Translate Content Format Content Transform.
Applying the Redundancy Principle ( Chapter 7) And using e-learning data for CTA Ken Koedinger 1.
Data mining with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction & Psychology Carnegie Mellon University.
Excel part 5 Working with Excel Tables, PivotTables, and PivotCharts.
1. October 25, 2011 Louis Everett & John Yu Division of Undergraduate Education National Science Foundation October 26, 2011 Don Millard & John Yu Division.
CIAF Summary Report 2012/13 TPM Software within Good Spirit School Division.
1 CA202 Spreadsheet Application Focusing on Specific Data using Filters Lecture # 5.
Institute for the Protection and Security of the Citizen HAZAS – Hazard Assessment ECCAIRS Technical Course Provided by the Joint Research Centre - Ispra.
Using DataShop Tools to Model Students Learning Statistics Marsha C. Lovett Eberly Center & Psychology Acknowledgements to: Judy Brooks, Ken Koedinger,
HEI/OCAN College Access Program Data Submissions.
© Copyright 2015 salesforce.com, inc. All rights reserved. Various trademarks held by their respective
July 8, 2008In vivo experimentation: 1 Step by Step In Vivo Experimentation Lecture 3 for the IV track of the 2011 PSLC Summer School Philip Pavlik Jr.
Data Mining Lab Student performance evaluation. Rate of learning varies from student to student May depend on similarity of the problem Is it possible.
Introduction to Core Database Concepts Getting started with Databases and Structure Query Language (SQL)
Introduction to the Power BI Platform Presented by Ted Pattison.
1 Access Lesson 1 Understanding Access Fundamentals Microsoft Office 2010 Fundamentals Story / Walls.
Maintaining and Updating Windows Server 2008 Lesson 8.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Web Content And Customer Relationship Management Solution. Transforming web sites into a customer-focused, revenue generating channel with less stress.
Emdeon Office Batch Management Services This document provides detailed information on Batch Import Services and other Batch features.
SCC P2P – Collaboration Made Easy Contract Management training
What Business Analytics Can Do For You!
Data Virtualization Demoette… CIS Rights
1Z0-477 VCE Questions
Welcome to our first session!
SIRS and STARS: Now What?
Introduction to PSLC DataShop
VALIDATION BEST PRACTICES
Lessons Vocabulary Access 2016.
Agenda About Excel/Calc Spreadsheets Key Features
Support for parents and guardians
Populating the Knowledge Base,Entering Questions, and Analytics
Slide Set I: PARS Overview
Presentation transcript:

PSLC DataShop Introduction Slides current to DataShop version John Stamper DataShop Technical Director

John Stamper –DataShop Technical Director Sandy Demi –QA (Quality Assurance – Testing) Brett Leber –Interaction Designer Alida Skogsholm –DataShop Manager, Developer Duncan Spencer –DataShop Developer Shanwen Yu –DataShop Developer The DataShop Team 2

Central Repository –Secure place to store & access research data Every LearnLab and every study –Supports various kinds of research Primary analysis of study data Exploratory analysis of course data Secondary analysis of any data set Analysis & Reporting Tools –Focus on student-tutor interaction data –Learning curves & error reports provide summary and low-level views of student performance –Performance Profiler aggregates across various levels of granularity (problem, dataset levels, knowledge components, etc.) –Data Export Tab delimited tables you can open with your favorite spreadsheet program or statistical package –New tools created to meet highest demands What is DataShop? 3

1 Collect Tutors log to a standard log format Data is captured in files or sent to the logging database Older data can be transformed to the tutor logging format Store 2 DataShop …converts log data to a uniform XML Format …anonymizes participant identifiers …imports data into a relational database Report 3 DataShop …provides visualization and reports for researchers to analyze their data …can export the data to a tab-delimited format How do I get data in? 4

Directly –Some tutors are logging directly to the PSLC logging database –CTAT-based tutors (when configured correctly) Indirectly –Other tutors are logging to their own file formats or their own databases –These data require a conversion process –Many studies are in this category How do I get data in? 5

Explore data through the DataShop tools Where is DataShop? – –Linked from DataShop homepage and learnlab.org Getting to DataShop 6

Creating an account On DataShop's home page, click "Sign up now". Complete the form to create your DataShop account. "Sign up now" If you’re a CMU student/staff/faculty, click “Log in with WebISO” to create your account. 7

Getting access to datasets By default, you will have access to the public datasets. Of these, we recommend three for getting started: –Geometry Area ( ) –Joint Explanation - Electric Fields - Pitt - Spring 2007 –Chinese Vocabulary Fall 2006 For access to other datasets, contact us: 8

Public datasets that you can view only. Private datasets you can’t view. us and the PI to get access. Datasets you can view or edit. You have to be a project member or PI for the dataset to appear here. DataShop – Dataset selection 9

Important Terms Transaction Step Sample KC (Knowledge Component) Opportunity Observation LFA AIC BIC see 10

Important Terms Transaction –A transaction is an interaction between the student and the tutoring system. –Students may make incorrect entries or ask for hints before getting a step correct. Each hint request, incorrect attempt, or correct attempt is a transaction; and a step can involve one or more transactions. Step –A step is an observable part of the solution to a problem. Because steps are observable, they are partly determined by the user interface available to the student for solving the problem. 11

Important Terms Sample –a subset of the data –the Sample Selector provides the ability to filter on various aspects of the data –You can use samples to: Compare across conditions Narrow the scope of data analysis to a specific time range, set of students, problem category, or unit of a curriculum (for example) –The columns available to filter on are organized into categories. The categories are: Condition, Dataset Level, Problem, School, Student, Tutor Transaction 12

Important Terms KC, Knowledge Component –a piece of information that can be used to accomplish tasks, perhaps along with other knowledge components. Knowledge component is a generalization of everyday terms like concept, principle, fact, or skill, and cognitive science terms like schema, production rule, misconception, or facet. Opportunity –An opportunity is a chance for a student to demonstrate whether he or she has learned a given knowledge component. An opportunity exists each time a step is present with the associated knowledge component. 13

Important Terms Observation –An observation is a group of transactions for a particular student working on a particular step within a problem view. If within these constraints there is only one transaction recorded, an observation will still exist for that single transaction. –Put another way, an observation is available each time a student takes an opportunity to demonstrate a knowledge component 14

Important Terms LFA, Learning Factors Analysis –a logistic regression method which uses a set of customized Item-Response models to predict how a student will perform for each knowledge component on each learning opportunity. LFA was developed at Carnegie Mellon by Hao Cen, Kenneth Koedinger, and Brian Junker. In DataShop, the LFA algorithm is run over each KC model of each dataset, producing data that populates predicted learning curves and other reports. 15

Important Terms AIC, Akaike Information Criterion –a measure of the goodness of fit of a statistical model, in this case, the LFA model. It is an operational way of trading off the complexity of the estimated model against how well the model fits the data 2. In this way, it penalizes the model based on its complexity (the number of parameters). A lower AIC value is better. 2 BIC, Bayesian Information Criterion –a measure of goodness of fit of the LFA model. The BIC penalizes free parameters more strongly than does the Akaike information criterion (AIC) 3. A lower BIC value is better. 3 16