Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 25, 2013.

Slides:



Advertisements
Similar presentations
Advanced Excel Class Glenn Harris Microsoft Certified Trainer
Advertisements

Advanced Excel Class Glenn Harris Microsoft Certified Trainer Office Master Instructor Excel Class NYC.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 18, 2013.
Mgt 240 Lecture Decision Support Systems March 3, 2005.
Educational Data Mining Overview John Stamper PSLC Summer School /25/2011 1PSLC Summer School 2011.
Feature Engineering Studio February 23, Let’s start by discussing the HW.
Brainstorming. Brainstorming Sequence One team member should review the topic of the brainstorm using "why", "how", or "what" questions. Example: The.
Feature Engineering Week 3 Video 3. Feature Engineering.
CS150 Final Exam Review 1 CS150. What you can bring with you  The paper (cheat sheet) that you used on the excel exam 2 CS150.
Feature Engineering Studio March 30, Iterative Feature Refinement.
Spreadsheet Productivity Session Roger Grinde, Ph.D.
Advanced Features of Microsoft Excel session 3 10/10/2003 Bart Baesens.
Feature Engineering Studio September 23, Welcome to Mucking Around Day.
Feature Engineering Studio September 9, Welcome to Problem Proposal Day Rules for Presenters Rules for the Rest of the Class.
Feature Engineering Studio September 23, Let’s start by discussing the HW.
Feature Engineering Studio October 14, Iterative Feature Refinement.
Learning Analytics: Process & Theory March 24, 2014.
Feature Engineering Studio March 1, Let’s start by discussing the HW.
Feature Engineering Studio September 30, Quick Note Please me for appointments rather than just showing up at my office – I’m always glad.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 27, 2013.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Feature Engineering Studio October 7, Welcome to Bring Me a Rock Day 2.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 6, 2013.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Feature Engineering Studio February 2, Welcome to Problem Proposal Day Rules for Presenters Rules for the Rest of the Class.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
MS-EXCEL PART 3. Use data validation in Excel to make sure that users enter certain values into a cell. Data Validation Example In this example, we restrict.
MEDIATION. What is your conflict style? How do you resolve conflicts? Are you aggressive (my way of the highway) Compromising (let’s work it out) Appeasing.
Argument and Persuasion Close Read Questions with Persuasive Devices.
Advanced spreadsheet tips & tricks
What are good next studies to run?*
Feature Engineering Studio
Linking Images Finding a common story
8th Grade Bell Work- Monday 18, 2013
Information Systems and Databases
Linking Images Finding a common story
“The Things They Carried”
Making Your Last Paragraph Count
Presents RAP Week 6 MARCH 11TH 2013.
QM222 A1 More on Excel QM222 Fall 2017 Section A1.
Special Topics in Educational Data Mining
QM222 A1 Visualizing data using Excel graphs
You will then answer the questions based on the problem set
Fundamentals of Excel WORKSHOP DESCRIPTION Register Here
Today: Review for quiz on Monday.
Question What is a Question?
Ms. Curtis Ms. Skellington 614
Fostering Creativity in Yourself and Your Team
Core Methods in Educational Data Mining
Introduction to bivariate data
Bellwork—August 31, 2015 In your journal, please write a response to the following question: Describe your favorite show and explain why it is your favorite.
“The Things They Carried”
Feature Engineering Studio
Slide 1 -Introduce yourself, as volunteers
MS Excel Scaffolding START.
Evaluation Context Concepts and Examples.
Developing a spreadsheet
Level 4 questions Level 4 questions What is s/he going to say?
Climate Change Unit Lesson 2
Math Break Work Review of the concepts.
Core Methods in Educational Data Mining
Session 1, Program Introduction and Overview
What did you do this morning which used Electricity?
Core Methods in Educational Data Mining
DO NOW Copy down your homework: 2-2 Lesson check on page 91
Accuracy of Averages.
Digital Literacies for learning
Workshop on Excel Dr P Sravan Kumar MBA PhD
Presentation transcript:

Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 25, 2013

Today’s Class Feature Engineering and Distillation - What

Special Rules for Today Everyone Votes Everyone Participates

Feature Engineering Not just throwing spaghetti at the wall and seeing what sticks

Construct Validity Matters! Crap features will give you crap models Crap features = reduced generalizability/more over-fitting Nice discussion of this in Sao Pedro paper I assigned

What’s a good feature? A feature that is potentially meaningfully linked to the construct you want to identify

Let’s look at some features used in real models Split into groups of 3-4 Take a sheet of features Which features (or combinations) can you come up with “just so” stories for why they might predict the construct? Are there any features that seem utterly irrelevant?

Each group Tell us what your construct is Tell us your favorite “just so story” (or two) from your features Tell us which features look like junk Everyone else: you have to give the feature a thumbs-up or thumbs-down

Now… Let’s take a break

I need 3 volunteers

Volunteers #1, #2: “Wee dee dee dee” #3: “Weema wompa way”

Everyone else Has to sing a verse of “In the jungle…” With an animal that no one else has mentioned yet

In the jungle….

Now that we’re all feeling creative

Break into *different* 3-4 person groups than last time

Now that we’re all feeling creative Make up features for Assignment 4 You need to – Come up with a new feature – Justify how you can would it from the data set – Justify why it would work

I need a volunteer

Your task is to write down the features suggested And the counts for thumbs up/thumbs down

Now… Each group needs to read their favorite feature to the class and justify it Who thinks this feature will improve prediction of off-task behavior? Who doesn’t? Thumbs up, thumbs down!

Comments or Questions About Assignment 4?

Special Request Bring a print-out of your Assignment 4 solution to class

Next Class Monday, February 27 Feature Engineering and Distillation – HOW Assignment Due: 4. Feature Engineering

Excel Plan is to go as far as we can by 5pm We will continue after next class session Vote on which topics you most want to hear about

Topics Using average, count, sum, stdev (asgn. 4 data set) Relative and absolute referencing (made up data) Copy and paste values only (made up data) Using sort, filter (asgn. 4 data set) Making pivot table (asgn. 4 data set) Using vlookup (Jan. 28 class data set) Using countif (asgn. 4 data set) Making scatterplot (Jan. 28 class data set) Making histogram (asgn. 4 data set) Equation Solver (Jan. 28 class data set) Z-test (made up data) 2-sample t-test (made up data) Other topics?

The End