Core Methods in Educational Data Mining

Slides:



Advertisements
Similar presentations
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Advertisements

Chapter 4. Validity: Does the test cover what we are told (or believe)
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Beacon Media Supporting Christian schooling worldwide Inquiry-based learning.
Feature Engineering Studio February 23, Let’s start by discussing the HW.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Feature Engineering Week 3 Video 3. Feature Engineering.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Learning Analytics: Process & Theory March 24, 2014.
Feature Engineering Studio March 1, Let’s start by discussing the HW.
Feature Engineering Studio September 30, Quick Note Please me for appointments rather than just showing up at my office – I’m always glad.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 27, 2013.
Engineers create what has never existed!
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 6, 2013.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
What is Design – Tom Kelley Not just problem solving – creative leap Messy – No right answer Takes a point of view – or many Calls for vision and multiple.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
So what happened in the election ?. I’m a bit confused by those figures. Harvey got 47.4% of what? Let me explain.
Feature Engineering Studio October 7, Welcome to Bring Me Another Rock.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 25, 2013.
What is Brainstorming? Brainstorming is a process when you focus on a problem and come up with as many solutions as possible. One of the reasons it is.
Today Project Introduction Design Thinking EA: New Paradigm access:Tufts Demo Tools & Takeaways Conversation.
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Lab 05: Coordinated Multiple Views
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Design.
This is a focusing stage empathize define test ideate prototype.
Core Methods in Educational Data Mining
Foundations of Technology Creativity and Brainstorming
Good Morning Everyone!! Our Warm Up today is finishing the exam we began on Monday. You will have exactly 30 mins in class today before we need to move.
Core Methods in Educational Data Mining
The 6 Traits of Writing.
Presents RAP Week 6 MARCH 11TH 2013.
Using Design Thinking to Help Your Campus Consider a Breakthrough
Core Methods in Educational Data Mining
Ideation CPSC 481: HCI I Fall 2014
Big Data, Education, and Society
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Big Data, Education, and Society
Feature Engineering Studio
Core Methods in Educational Data Mining
DimensionX: Dream and Discover
The Take-Away What are they learning?.
Exploring Daily Check-In Meetings
Software Product Management Metrics
Foundations of Technology Creativity and Brainstorming
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Language Arts: Monday 2-25 I.N. 15
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
The Writing Process Please take out some paper, you will need to take notes. Please label these notes “The Writing Process”
Core Methods in Educational Data Mining
One Page Target Planning
Presentation transcript:

Core Methods in Educational Data Mining EDUC545 Spring 2017

Last Lecture Slides 34-36

Textbook part 1

Pelanek example W002V004v5 Who would like me to review this?

Feature Engineering Not just throwing spaghetti at the wall and seeing what sticks

Construct Validity Matters! Crap features will give you crap models Crap features = reduced generalizability/more over-fitting Nice discussion of this in the Sao Pedro paper

What’s a good feature? A feature that is potentially meaningfully linked to the construct you want to identify

Baker’s feature engineering process Brainstorming features Deciding what features to create Creating the features Studying the impact of features on model goodness Iterating on features if useful Go to 3 (or 1)

What’s useful? Brainstorming features Deciding what features to create Creating the features Studying the impact of features on model goodness Iterating on features if useful Go to 3 (or 1)

What’s missing? Brainstorming features Deciding what features to create Creating the features Studying the impact of features on model goodness Iterating on features if useful Go to 3 (or 1)

How else could it be improved?

IDEO tips for Brainstorming 1. Defer judgment 2. Encourage wild ideas 3. Build on the ideas of others 4. Stay focused on the topic 5. One conversation at a time 6. Be visual 7. Go for quantity http://www.openideo.com/fieldnotes/openideo-team-notes/seven-tips-on-better-brainstorming

Your thoughts?

Deciding what features to create Trade-off between the effort to create a feature and how likely it is to be useful Worth biasing in favor of features that are different than anything else you’ve tried before Explores a different part of the space

General thoughts about feature engineering?

Activity

Special Rules for Today Everyone Votes Everyone Participates

Let’s look at some features used in real models

Let’s look at some features used in real models Split into 6 groups Take a sheet of features Which features (or combinations) can you come up with “just so” stories for why they might predict the construct? Are there any features that seem utterly irrelevant?

Each group Tell us what your construct is Tell us your favorite “just so story” (or two) from your features Tell us which features look like junk Everyone else: you have to give the feature a thumbs-up or thumbs-down

Textbook part 2

Automated Feature Generation What are the advantages of automated feature generation, as compared to feature engineering? What are the disadvantages?

Automated Feature Selection What are the advantages of automated feature selection, as compared to having a domain expert decide? (as in Sao Pedro paper from Monday) What are the disadvantages?

A connection to make

A connection to make Correlation filtering Eliminating collinearity in statistics In this case, increasing interpretability and reducing over-fitting go together At least to some positive degree

Outer-loop forward selection What are the advantages and disadvantages to doing this?

Knowledge Engineering What is knowledge engineering?

Knowledge Engineering What is the difference between knowledge engineering and EDM?

Knowledge Engineering What is the difference between good knowledge engineering and bad knowledge engineering?

Knowledge Engineering What is the difference between (good) knowledge engineering and EDM? What are the advantages and disadvantages of each?

How can they be integrated?

FCBF: What Variables will be kept? (Cutoff = 0.65) What variables emerge from this table? G H I J K L Predicted .7 .8 .4 .3 .72 .6 .5 .38 .82 .1 .75 .65 .42

Other questions, comments, concerns about textbook?

No Class Next Week Spring Break

Next Class Association Rule Mining. Guest lecture, Miguel Andres. Wednesday, March 15 Baker, R.S. (2015) Big Data and Education. Ch. 5, V3. Merceron, A., Yacef, K. (2008) Interestingness Measures for Association Rules in Educational Data. Proceedings of the 1st International Conference on Educational Data Mining,57-66. Bazaldua, D.A.L., Baker, R.S., San Pedro, M.O.Z. (2014) Combining Expert and Metric-Based Assessments of Association Rule Interestingness. Proceedings of the 7th International Conference on Educational Data Mining

Special Request Bring a print-out of your Assignment C2 solution to class on the day it’s due In 2 weeks

The End