Worcester Polytechnic Institute CS548 Spring 2016 Decision Trees Showcase By Yi Jiang and Brandon Boos ---- Showcase work by Zhun Yu, Fariborz Haghighat,

Slides:



Advertisements
Similar presentations
DECISION TREES. Decision trees  One possible representation for hypotheses.
Advertisements

CS548 Spring 2015 Showcase By Yang Liu, Viseth Sean, Azharuddin Priyotomo Showcasing work by Le, Abrahart, and Mount on "M5 Model Tree applied to modelling.
Introduction Training Complexity, Pruning CART vs. ID3 vs. C4.5
1 Data Mining Classification Techniques: Decision Trees (BUSINESS INTELLIGENCE) Slides prepared by Elizabeth Anglo, DISCS ADMU.
Decision Tree.
Comparison of Data Mining Algorithms on Bioinformatics Dataset Melissa K. Carroll Advisor: Sung-Hyuk Cha March 4, 2003.
Application of Stacked Generalization to a Protein Localization Prediction Task Melissa K. Carroll, M.S. and Sung-Hyuk Cha, Ph.D. Pace University, School.
Chapter 7 – Classification and Regression Trees
Chapter 7 – Classification and Regression Trees
ID3 Algorithm Abbas Rizvi CS157 B Spring What is the ID3 algorithm? ID3 stands for Iterative Dichotomiser 3 Algorithm used to generate a decision.
Decision Tree Algorithm
Poster Guidelines put into a Poster Format Tyler, J.A and Biology & Biotechnology Faculty 1 Worcester Polytechnic Institute, Biology and Biotechnology.
Lecture 5 (Classification with Decision Trees)
Covering Algorithms. Trees vs. rules From trees to rules. Easy: converting a tree into a set of rules –One rule for each leaf: –Antecedent contains a.
Decision Trees (2). Numerical attributes Tests in nodes are of the form f i > constant.
Classification II.
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part II Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Ordinal Decision Trees Qinghua Hu Harbin Institute of Technology
Ensemble Learning (2), Tree and Forest
Data Mining and Decision Tree CS157B Spring 2006 Masumi Shimoda.
Introduction to Directed Data Mining: Decision Trees
PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning RASTOGI, Rajeev and SHIM, Kyuseok Data Mining and Knowledge Discovery, 2000, 4.4.
DATA MINING : CLASSIFICATION. Classification : Definition  Classification is a supervised learning.  Uses training sets which has correct answers (class.
Fall 2004 TDIDT Learning CS478 - Machine Learning.
Lecture 14 Multiple Regression Model
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
Slides for “Data Mining” by I. H. Witten and E. Frank.
1 Data Mining Lecture 3: Decision Trees. 2 Classification: Definition l Given a collection of records (training set ) –Each record contains a set of attributes,
Chapter 9 – Classification and Regression Trees
Lecture 7. Outline 1. Overview of Classification and Decision Tree 2. Algorithm to build Decision Tree 3. Formula to measure information 4. Weka, data.
Comparing Univariate and Multivariate Decision Trees Olcay Taner Yıldız Ethem Alpaydın Department of Computer Engineering Bogazici University
Decision Trees DefinitionDefinition MechanismMechanism Splitting FunctionSplitting Function Issues in Decision-Tree LearningIssues in Decision-Tree Learning.
Decision Trees. Decision trees Decision trees are powerful and popular tools for classification and prediction. The attractiveness of decision trees is.
Artificial Intelligence in Game Design N-Grams and Decision Tree Learning.
Business Intelligence and Decision Modeling Week 9 Customer Profiling Decision Trees (Part 2) CHAID CRT.
MACHINE LEARNING 10 Decision Trees. Motivation  Parametric Estimation  Assume model for class probability or regression  Estimate parameters from all.
Decision Trees Binary output – easily extendible to multiple output classes. Takes a set of attributes for a given situation or object and outputs a yes/no.
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
Data Mining Practical Machine Learning Tools and Techniques By I. H. Witten, E. Frank and M. A. Hall Chapter 6.2: Classification Rules Rodney Nielsen Many.
Data Mining Practical Machine Learning Tools and Techniques By I. H. Witten, E. Frank and M. A. Hall DM Finals Study Guide Rodney Nielsen.
Random Forests Ujjwol Subedi. Introduction What is Random Tree? ◦ Is a tree constructed randomly from a set of possible trees having K random features.
Decision Tree Learning Presented by Ping Zhang Nov. 26th, 2007.
Lecture Notes for Chapter 4 Introduction to Data Mining
Using Classification Trees to Decide News Popularity
Machine Learning: Decision Trees Homework 4 assigned courtesy: Geoffrey Hinton, Yann LeCun, Tan, Steinbach, Kumar.
Classification Today: Basic Problem Decision Trees.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
Classification and Regression Trees
Machine Learning Recitation 8 Oct 21, 2009 Oznur Tastan.
A Decision Tree Classification Model For Determining The Location For Solar Power Plant A PRESENTATION BY-  DISHANT MITTAL  DEV GAURAV VIT UNIVERSITY,VELLORE.
Machine Learning in Practice Lecture 2 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Eco 6380 Predictive Analytics For Economists Spring 2016 Professor Tom Fomby Department of Economics SMU.
Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.
BY International School of Engineering {We Are Applied Engineering} Disclaimer: Some of the Images and content have been taken from multiple online sources.
Data Mining CH6 Implementation: Real machine learning schemes(2) Reporter: H.C. Tsai.
Chapter 14 Introduction to Multiple Regression
Introduction to Machine Learning and Tree Based Methods
Decision Trees.
Issues in Decision-Tree Learning Avoiding overfitting through pruning
Introduction to Data Mining and Classification
Decision Tree Saed Sayad 9/21/2018.
NBA Draft Prediction BIT 5534 May 2nd 2018
CS548 Fall 2017 Decision Trees / Random Forest Showcase by Yimin Lin, Youqiao Ma, Ran Lin, Shaoju Wu, Bhon Bunnag Showcasing work by Cano,
Introduction to Data Mining, 2nd Edition by
Data Mining – Chapter 3 Classification
Introduction to Predictive Modeling
Decision Trees By Cole Daily CSCI 446.
Bias-variance Trade-off
MIS2502: Data Analytics Classification Using Decision Trees
STT : Intro. to Statistical Learning
Presentation transcript:

Worcester Polytechnic Institute CS548 Spring 2016 Decision Trees Showcase By Yi Jiang and Brandon Boos ---- Showcase work by Zhun Yu, Fariborz Haghighat, Benjamin C.M. Fung, and Hiroshi Yoshino on A decision tree method for building energy demand modeling

Worcester Polytechnic Institute References [1] Zhun Yu, Fariborz Haghighat, Benjamin C.M. Fung, and Hiroshi Yoshino. “A decision tree method for building energy demand modeling,” Energy and Building, Vol. 48, no. 10, pp , Oct [2] James, Witten, Hastie and Tibshirani. An Introduction to Statistical Learning with Applications in R. Springer Texts in Statistics Vol. 103,

Worcester Polytechnic Institute Why Predicting EUI matters? EUI stands for Energy Use Intensity Energy consumption throughout the world increased significantly. For efficient building design 3 Taken from baidu.com/imaghttp://news.zhulong.com/read html

Worcester Polytechnic Institute Overview of Decision tree What is a tree? A tree is a prediction method with simple rules to divide the range of variables into smaller and smaller sections. Taken from

Worcester Polytechnic Institute Cons & Pros Comparison among three method in this paper Decision tree wins!(the accuracies are almost the same) Tree methodsRegression models ANN models Advantage understandable interpretable easy to execute simple and efficient can model complex relationships Disadvantage too easyhard to interpret complicated to operate

Worcester Polytechnic Institute Data Set In this project, field surveys on energy related data and other relevant information were carried out in 80 residential buildings in six different districts in Japan 13 observations have missing value. They use 55 observations in training data set Target variable:EUI: high or low Variables: Taken from [1]

Worcester Polytechnic Institute Decision Tree Generation Attribute Selection Criterion Splitting dataset into training and test data Generating decision tree using training data Estimating the accuracy Improve the model

Worcester Polytechnic Institute Results - The Decision Tree 8 Taken from [1] ●Decision Tree from training data ●Confusion matrix for training data using decision tree

Worcester Polytechnic Institute Results - Prediction on Test Data 9 Taken from [1]

Worcester Polytechnic Institute Results - Nodes pt Taken from [1] Non-Leaf Node: ●Node # ●# of Data Instances ●Entropy Value ●Split Attribute

Worcester Polytechnic Institute Result - Nodes pt Taken from [1] Leaf Node: ●Node # ●# of Data Instances ●Avg. EUI ●EUI Class ●Stopping Criteria Met

Worcester Polytechnic Institute Results - Decision Rules 12 Example Rule for Node 10: If TEMP is high and HLC < 3.89 and ELA < 4.41 and HWS is electric then EUI is LOW Taken from [1] and modified

Worcester Polytechnic Institute Observations - Important Attributes 13

Worcester Polytechnic Institute Observations - Interesting The importance of attributes for high and low temperature areas are different High temperature areas benefit from a certain value of equivalent leakage area as long as the heat loss coefficient is low enough 14

Worcester Polytechnic Institute Conclusions The decision tree provides an easily understood model which can help building designers and owners know which attributes to prioritize in order to lower energy use Non-binary classification could improve the results but would also increase chance of misclassification Larger data set needed 15