SAS Homework 2 Review Decision trees

Slides:



Advertisements
Similar presentations
 Decide if it’s a permutation or a combination, then find how many are possible:  Your class is having an election. There are 7 candidates, and they.
Advertisements

Chapter 7 – Classification and Regression Trees
Chapter 7 – Classification and Regression Trees
Decision Tree Models in Data Mining
Daniel Kroening and Ofer Strichman Decision Procedures An Algorithmic Point of View Deciding ILPs with Branch & Bound ILP References: ‘Integer Programming’
Decision Procedures An Algorithmic Point of View
Lecture Notes 4 Pruning Zhangxi Lin ISQS
Building And Interpreting Decision Trees in Enterprise Miner.
Chapter 9 – Classification and Regression Trees
JMB Chapter 1EGR Spring 2010 Slide 1 Probability and Statistics for Engineers  Descriptive Statistics  Measures of Central Tendency  Measures.
Zhangxi Lin ISQS Texas Tech University Note: Most slides are from Decision Tree Modeling by SAS Lecture Notes 5 Auxiliary Uses of Trees.
Final Exam Review. The following is a list of items that you should review in preparation for the exam. Note that not every item in the following slides.
Exam 3 Sample Decision Trees Cluster Analysis Association Rules Data Visualization SAS.
Stats Rock Using Pet Rocks to Find the Mean, Median and Interquartile Range (IQR)
SAS Homework 4 Review Clustering and Segmentation
Summary Statistics Review
Exam 3 Review Decision Trees Cluster Analysis Association Rules Data Visualization SAS.
Machine Learning II 부산대학교 전자전기컴퓨터공학과 인공지능연구실 김민호
Homework Questions. Measures of Center and Spread Unit 5, Statistics.
EXAM REVIEW MIS2502 Data Analytics. Exam What Tool to Use? Evaluating Decision Trees Association Rules Clustering.
Loan Default Model Saed Sayad 1www.ismartsoft.com.
DECISION TREE Ge Song. Introduction ■ Decision Tree: is a supervised learning algorithm used for classification or regression. ■ Decision Tree Graph:
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/20/12 Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory.
Measures of Central Tendency Mean, Median, Mode, and Range.
MDH Chapter 1EGR 252 Fall 2015 Slide 1 Probability and Statistics for Engineers  Descriptive Statistics  Measures of Central Tendency  Measures of Variability.
Decimals Day 2 Multiply and Divide Decimals. DO NOT line up decimals Multiply with first number, then continue with next number Place decimal point in.
Stem and Leaf Plots (tens) (ones) Stem and Leaf Plots emphasize place value. The stems represent the tens digit and the leaves represent the ones 22, 24,
Describing & Comparing Data Unit 7 - Statistics. Describing Data  Shape Symmetric or Skewed or Bimodal  Center Mean (average) or Median  Spread Range.
 Insight – extracting conceptually appealing information from data  Exposition – displaying the decision tree results in a form to communicate insight.
Tree Diagrams Objective: To calculate probability using a tree diagram. Lesley Soar Valley College Five balls are put into a bag. Three are red.
Tree Diagrams Objective: To calculate probability using a tree Lesley Hall Five balls are put into a bag. Three are red. Two are blue.
Plot Diagram.
Organ Donation The gift of Life.
Data Management Culminating Task
Decision Trees an introduction.
Probability and Statistics for Engineers
Population Mini Census
Introduction to Data Mining and Classification
Advanced Analytics Using Enterprise Miner
Probability and Statistics for Engineers
Data Analysis AMA Collegiate Marketing Research Certificate Program.
Introduction to Data Mining, 2nd Edition by
Introduction to Data Mining, 2nd Edition by
Statistics: Stem-and-Leaf Plots
Classification by Decision Tree Induction
Stem & Leaf Plots How to make a Stem & Leaf Plot.
מדינת ישראל הוועדה לאנרגיה אטומית
Remember hundreds, tens and units!
MIS2502: Data Analytics Classification using Decision Trees
Expressions – Day 1 Objective: I can evaluate algebraic expressions.
Probability and Statistics for Engineers
Multiple Decision Trees ISQS7342
Prime Factorization Practice
Decision Trees By Cole Daily CSCI 446.
Probability and Statistics for Engineers
Probability and Statistics for Engineers
Measures of Central Tendency
Find the average of the following sets of numbers.
Welcome Back and Welcome to Honors Statistics!
Probability and Statistics for Engineers
1.7 Addition Rule - Tree Diagrams (1/3)
Probability and Statistics for Engineers
Many coffee shop and many people
MIS2502: Data Analytics Classification Using Decision Trees
Probability and Statistics for Engineers
Math 145 January 24, 2007.
Review of 6th grade material to help with new Statistics unit
Stem and Leaf Plots Stem and Leaf Plots emphasize place value.
Math 341 January 24, 2007.
Stem & Leaf Plots How to make a Stem & Leaf Plot.
Presentation transcript:

SAS Homework 2 Review Decision trees MIS2502 Data Analytics

SAS Homework 2 Review Decision Trees Using Organics Data Set from exercise #1. If Organics is wrong then your Decision Tree will be wrong Partition: 50% Training, 50% Validation Add a Decision Tree using defaults (max number branches 2) Evaluate default Decision Tree using Average Square Error Add another Decision Tree but this time customize by changing the max number of branches from 2 to 3 Assess this Decision Tree using Average Square Error Compare default Tree (2 branch max) to customized Tree (3 branch max) and determine which model is ‘better’ Answer some questions regarding the customized Decision Tree

Partition

Decision Tree – Diagram Right Click> Run after adding objects Difference is Maximum Branch

Assessing the 1st Decision Tree 2 branches Age is 1st branch

Assessing the 1st Decision Tree using average square error subtree assessment plot View>Model>Subtree Assessment Plot The line is the Optimal Leaf count for that tree Note the Leaf Count and the Validation: Average Square Error

Assessing the 2nd Decision Tree

Assessing the 2nd Decision Tree Note change in average square error In general, less error is better

What is the probability that a 39 What is the probability that a 39.5 year old male with an affluence grade of 15 buys organics ? Age = 39.5, AfflGrade > 11.5 Gender = M Look at the ‘Validation’ stats 1 = buy, 0 = no buy Navigating the Tree

In Class

In Class 1. Answers: a. 45.2% b. 66.0% c. 57.2% d. 48.2% 2. It means that for those people who made more than three donations, the region’s average median home value doesn’t make a difference in whether they make an additional donation. 3. Answers: a. GiftCnt36 – Gift Count 36 Months; because it has the highest logworth value, which means it does the best job of differentiating between future donors and non‐donors. b. Gift Count 36 Months is the first variable chosen by SAS to split the tree.