Feature Engineering Studio September 9, 2013. Welcome to Problem Proposal Day Rules for Presenters Rules for the Rest of the Class.

Slides:



Advertisements
Similar presentations
Case Studies M.Sc. in Applied Statistics Dr. Órlaith Burke Michaelmas Term 2012.
Advertisements

Feature Engineering Studio January 21, Welcome to Feature Engineering Studio Design studio-style course teaching how to distill and engineer features.
We’ll be spending a few minutes talking about Quiz 2 on Sections that you’ll be taking the next class session, before you work on Practice Quiz.
Using Rubrics to Assess Learning Tamara H. Rosier Assistant Director for Assessment, Pew FTLC, Fall 2007.
Software Engineering Lab Session Session 4 – Feedback on Assignment 1 © Jorge Aranda, 2005.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 7, 2013.
Feature Engineering Studio Special Session January 26, 2015.
Introduce the Peer Review Project
Lecture 5 CS171: Game Design Studio 1I UC Santa Cruz School of Engineering 18 Feb 2010.
CSCE790: Security and Privacy for Emerging Ubiquitous Communication system Wenyuan Xu Department of Computer Science and Engineering University of South.
1 An Excel-based Data Mining Tool Chapter The iData Analyzer.
Please open your laptops, log in to the MyMathLab course web site, and open Daily Quiz 18. You will have 10 minutes for today’s quiz. The second problem.
IntroductionTaskProcess Evaluation Conclusion Don’t you think the world would be more beautiful if it looked more like this? How can we change what our.
+ A user guide for students Welcome to MyPlan Create a Plan Audit Your Progress Search for Courses Work with Your Adviser.
Feature Engineering Studio February 23, Let’s start by discussing the HW.
Feature Engineering Week 3 Video 3. Feature Engineering.
Feature Engineering Studio Special Session September 11, 2013.
L1: INTRODUCTION Getting started with Stata Angela Ambroz May 2015.
Preparing and Giving Presentations
Graphical Analysis. Why Graph Data? Graphical methods Require very little training Easy to use Massive amounts of data can be presented more readily Can.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Three Secrets about Learning Objects Rachel S. Smith Director, Development & Programs NMC: The New Media Consortium September 15, 2004.
Masterful Meetings September 26, 2007 LEARNERS = LEADERS.
Writing & Getting Published Uwe Grimm (based on slides by Claudia Eckert) MCT, The Open University.
Moodle (Course Management Systems). Forums, Chats, and Messaging.
Feature Engineering Studio September 23, Welcome to Mucking Around Day.
Welcome to the Apex Learning System An online alternative.
Before you are seated, please look inside the back of your nametag for a slip of colored paper. Please seat yourself at the table bearing a sheet of paper.
Updated Today's talk should help you to understand better  what your responsibilities for this module  how you will be taught  how you.
Thank a Teacher! Every Monday Matters. Teacher Appreciation Week is here! Wchttps://
Research and Writing Seminar Thursday, – 16 35, room C To find an up-to-date version of the schedule and to read the papers check the website
How to give a PowerPoint Presentation Staff Development Group and Central Computing Services.
Feature Engineering Studio September 23, Let’s start by discussing the HW.
We will create a unit on interesting people and present this to each other. You will create a power point and publisher document that shows us why YOU.
SMART Steps to MEET & GREET
Feature Engineering Studio October 14, Iterative Feature Refinement.
Statistics: Analyzing 2 Categorical Variables MIDDLE SCHOOL LEVEL  Session #1  Presented by: Dr. Del Ferster.
AuthorAID Workshop on Research Writing Sri Lanka March 2010.
Feature Engineering Studio March 1, Let’s start by discussing the HW.
Feature Engineering Studio September 30, Quick Note Please me for appointments rather than just showing up at my office – I’m always glad.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
FUTURA: Week 5 Wonders of the World: Week 3. Agenda/Reminders Renzulli Assignment: Building Big due 10/27 Possibly: Materials for Center Work or Seven.
IADSR International Conference 2012 Aiwan-e-Iqbal Lahore, Pakistan 27–29 April 2012.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 27, 2013.
Can social information change behaviour? The results of a study with student & national trust volunteers Facilitator: Ben Lee Presenters: Professor Oliver.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Feature Engineering Studio October 7, Welcome to Bring Me a Rock Day 2.
Feature Engineering Studio April 29, Assignment Problem Shift “The Fresh Mind”
Feature Engineering Studio September 9, Welcome to Feature Engineering Studio Design studio-style course teaching how to distill and engineer features.
Feature Engineering Studio Special Session September 25, 2013.
Feature Engineering Studio February 2, Welcome to Problem Proposal Day Rules for Presenters Rules for the Rest of the Class.
U.S. History Group Project.  In the remaining weeks of school, you, the students, will be put in the position of teacher. You will be broken up into.
An Excel-based Data Mining Tool Chapter The iData Analyzer.
We’ll be spending a few minutes talking about Quiz 2 on Sections that you’ll be taking the next class session, before you work on Practice Quiz.
1 Required , Google Group 1.Send the professor (This is also listed in the –In the Subject,
Wednesday, September 3 rd,  Is it possible to predict which color will be most popular in a bag of M&M’s?  Are the colors evenly distributed?
Language Learning for Busy People These documents are private and confidential. Please do not distribute.. Intermediate: I Disagree.
Successful Peer Review Strategies. Getting Ready for Peer Review What you get out of peer review depends on what you put into it. Your job as a writer.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 25, 2013.
Feature Engineering Studio
MATH-138 Elementary Statistics
Feature Engineering Studio Special Session
Feature Engineering Studio
Feature Engineering Studio
Guidelines for Group Projects and Papers
Guidelines for Group Projects and Papers
Feature Engineering Studio
BR: T1D18 Describe the issue that you are addressing in your political cartoon. Why did you choose it? How are you making your point(s)? I’ll be giving.
Home-School Communication
Presentation transcript:

Feature Engineering Studio September 9, 2013

Welcome to Problem Proposal Day Rules for Presenters Rules for the Rest of the Class

Rules for Presenters Talk for 3 minutes on: – Data set – What variable will you predict? – What kind of variables will you use to predict it? – Why is this worth doing? Remember to send me your slides (if any)

Rules for Audience After the presentation – Ask quick questions – Give quick suggestions

Criteria Everyone – Is the problem genuinely important? (usable or publishable) – Is there a good measure of ground truth? Only if you know what you’re talking about – Is there rich enough data to distill meaningful features? – Is there enough data to be able to take advantage of data mining?

Rules for Audience Be polite! No interrupting No rambling No being mean

First Step Get into the right collaborative spirit You are officially encouraged (though not required) to sing along – 0:25

Presentations Alphabetical Order Based on Last Name – Tie-Breaker: First Name

For next week Think about how to improve your problem proposal Rewrite your problem proposal based on the feedback you got today Then it to me for further feedback and a “thumbs-up” before the next class

Assignment 2 Data Familiarization “Mucking Around” Get your data set Open it in Excel Look at your ground truth label (if you have one) Look at other key variables What does each variable mean semantically? If numerical, what are its max, min, average, stdev? Create histograms of key variables. If categorical, what is the distribution of each value?

Assignment 2 Data Familiarization “Mucking Around” Write a brief report for me You don’t need to prepare a presentation But be ready to discuss what you learn about your data

What if you don’t have data yet? 1.Get your data 2.If you can’t get your data before class, me at least 48 hours before class and I’ll send you a practice data set

How to compute in Excel If numerical, what are its max, min, average, stdev? If categorical, what is the distribution of each value? Using Class2Data

How to do a histogram in Excel Using Class2Data

Next Class 9/23 Feature distillation in Excel (Asgn.2 due) – Do the assignment – Read the readings

Upcoming Classes 9/23 Feature distillation in Excel (Asgn.2 due) 9/25 Special session on prediction models – Come to this if you don’t know why student-level cross- validation is important, or if you don’t know what J48 is 9/30 Advanced feature distillation in Excel (Asgn. 3 due) 10/2 Special session on RapidMiner – Come to this if you’ve never built a classifier or regressor in RapidMiner (or a similar tool) – Statistical significance tests using linear regression don’t count…