Feature Engineering Studio Special Session September 25, 2013.

Slides:



Advertisements
Similar presentations
Feature Engineering Studio January 21, Welcome to Feature Engineering Studio Design studio-style course teaching how to distill and engineer features.
Advertisements

Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 7, 2013.
How computers answer questions An introduction to machine learning Peter Barnum August 7, 2008.
COMP 328: Midterm Review Spring 2010 Nevin L. Zhang Department of Computer Science & Engineering The Hong Kong University of Science & Technology
Text Classification: An Implementation Project Prerak Sanghvi Computer Science and Engineering Department State University of New York at Buffalo.
1 Homework  What’s important (i.e., this will be used in determining your grade): Finding features that make a difference You should expect to do some.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
CS 732: Advance Machine Learning Usman Roshan Department of Computer Science NJIT.
Evaluation of Results (classifiers, and beyond) Biplav Srivastava Sources: [Witten&Frank00] Witten, I.H. and Frank, E. Data Mining - Practical Machine.
Computational Analysis of USA Swimming Data Junfu Xu School of Computer Engineering and Science, Shanghai University.
Feature Engineering Studio February 23, Let’s start by discussing the HW.
CS 5604 Spring 2015 Classification Xuewen Cui Rongrong Tao Ruide Zhang May 5th, 2015.
Machine Learning CS 165B Spring 2012
SVMLight SVMLight is an implementation of Support Vector Machine (SVM) in C. Download source from :
Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)
Case Study – San Pedro Week 1, Video 6. Case Study of Classification  San Pedro, M.O.Z., Baker, R.S.J.d., Bowers, A.J., Heffernan, N.T. (2013) Predicting.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 13, 2012.
Project MLExAI Machine Learning Experiences in AI Ingrid Russell, University.
An Example of Course Project Face Identification.
Feature Engineering Studio September 23, Welcome to Mucking Around Day.
Hospitalization Prediction From Health Care Claims Adithya Renduchintala, Benjamin Martin, & Lance Legel University of Colorado Boulder  Data Mining 
Data Mining Teaching experience at the FIB. What is Data Mining? A broad set of techniques and algorithms brought from machine learning and statistics.
Feature Engineering Studio September 9, Welcome to Problem Proposal Day Rules for Presenters Rules for the Rest of the Class.
The Perceptron. Perceptron Pattern Classification One of the purposes that neural networks are used for is pattern classification. Once the neural network.
Overview of Part I, CMSC5707 Advanced Topics in Artificial Intelligence KH Wong (6 weeks) Audio signal processing – Signals in time & frequency domains.
Machine learning system design Prioritizing what to work on
Classification of microarray samples Tim Beißbarth Mini-Group Meeting
Today Ensemble Methods. Recap of the course. Classifier Fusion
Prediction of Influencers from Word Use Chan Shing Hei.
Feature (Gene) Selection MethodsSample Classification Methods Gene filtering: Variance (SD/Mean) Principal Component Analysis Regression using variable.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Feature Engineering Studio October 7, Welcome to Bring Me a Rock Day 2.
Data Analytics CMIS Short Course part II Day 1 Part 1: Introduction Sam Buttrey December 2015.
***Classification Model*** Hosam Al-Samarraie, PhD. CITM-USM.
Feature Engineering Studio September 9, Welcome to Feature Engineering Studio Design studio-style course teaching how to distill and engineer features.
Introduction to Weka ML Seminar for Rookies Byoung-Hee Kim Biointelligence Lab, Seoul National University.
CS 732: Advance Machine Learning
Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Machine Learning in CSC 196K
Classification Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 24, 2015.
Machine Learning. Machine Learning: what is it? We have data.
BNFO 615 Fall 2016 Usman Roshan NJIT. Outline Machine learning for bioinformatics – Basic machine learning algorithms – Applications to bioinformatics.
Course Outline (6 Weeks) for Professor K.H Wong
Core Methods in Educational Data Mining
6. Kernel Regression.
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Core Methods in Educational Data Mining
Estimating Link Signatures with Machine Learning Algorithms
Prediction of NBA games based on Machine Learning Methods
FUNDAMENTALS OF MACHINE LEARNING AND DEEP LEARNING
Machine Learning Week 1.
Feature Engineering Studio Special Session
TED Talks – A Predictive Analysis Using Classification Algorithms
دانشگاه صنعتی امیرکبیر Instructor : Saeed Shiry & Bishop Ch. 1
Prediction of Wine Grade
Machine Learning Course.
Lecture 6: Introduction to Machine Learning
Chapter 12 Linear Regression and Correlation
Classification Boundaries
Variable Selection - Accelerator
Multivariate Methods Berlin Chen
AI-ECON Research Center Department of Economics National Chengchi University.
Analysis on Accelerated Learning Cohorts
Practice Project Overview
MACHINE LEARNING IN ASIC BACKEND DESIGN
What is Artificial Intelligence?
Presentation transcript:

Feature Engineering Studio Special Session September 25, 2013

RapidMiner 5.3 Data file and rapidminer xml file are on course webpage

Look at data

Look at process step-by-step

Build classifier

Goodness Criteria Kappa AUC (Warning!) Accuracy (Warning!) Precision Recall

Turn cross-validation off

Other types of cross-validation Student-level cross-validation Population-level cross-validation Content-level cross-validation When you use these….

Setting up other types of cross-validation BatchXValidation SetRole

CompleteFeatureGeneration

RemoveCorrelatedFeatures

Other Classification Algorithms W-J48 W-JRip W-KStar

Set up a Regression

Regression Algorithms Linear Regression W-RepTree W-M5P Neural Networks Support Vector Machines

Goodness Criteria Correlation RMSE/MAD

Many other things RapidMiner can do… These are just two types of common prediction models

For a broader overview of prediction modeling… Come to next week’s special session

Questions? Concerns?