Application of Decision Tree: Bankruptcy Prediction 2004/05/07.

Slides:



Advertisements
Similar presentations
Data Mining Lecture 9.
Advertisements

DECISION TREES. Decision trees  One possible representation for hypotheses.
Random Forest Predrag Radenković 3237/10
Paper By - Manish Mehta, Rakesh Agarwal and Jorma Rissanen
Huffman code and ID3 Prof. Sin-Min Lee Department of Computer Science.
Hunt’s Algorithm CIT365: Data Mining & Data Warehousing Bajuna Salehe
Decision Tree Approach in Data Mining
Classification: Definition Given a collection of records (training set ) –Each record contains a set of attributes, one of the attributes is the class.
Decision Tree.
Deriving rules from data Decision Trees a.j.m.m (ton) weijters.
Classification Techniques: Decision Tree Learning
Chapter 7 – Classification and Regression Trees
Chapter 7 – Classification and Regression Trees
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Decision Tree Rong Jin. Determine Milage Per Gallon.
Data Quality Class 9. Rule Discovery Decision and Classification Trees Association Rules.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Decision Tree Algorithm
About ISoft … What is Decision Tree? Alice Process … Conclusions Outline.
Classification Continued
Lecture 5 (Classification with Decision Trees)
Three kinds of learning
Tree methods: Dependent variable is categorical
Classification.
BİL744 Derleyici Gerçekleştirimi (Compiler Design)1.
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part II Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Ordinal Decision Trees Qinghua Hu Harbin Institute of Technology
Part I: Classification and Bayesian Learning
Introduction to Data Mining Engineering Group in ACL.
Detection and Resolution of Anomalies in Firewall Policy Rules
Comparison of Classification Methods for Customer Attrition Analysis Xiaohua Hu, Ph.D. Drexel University Philadelphia, PA, 19104
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
DATA MINING : CLASSIFICATION. Classification : Definition  Classification is a supervised learning.  Uses training sets which has correct answers (class.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Data Mining and Application Part 1: Data Mining Fundamentals Part 2: Tools for Knowledge Discovery Part 3: Advanced Data Mining Techniques Part 4: Intelligent.
1 Data Mining Lecture 3: Decision Trees. 2 Classification: Definition l Given a collection of records (training set ) –Each record contains a set of attributes,
Knowledge Discovery and Data Mining Evgueni Smirnov.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Categorical data. Decision Tree Classification Which feature to split on? Try to classify as many as possible with each split (This is a good split)
SOCIAL NETWORKS ANALYSIS SEMINAR INTRODUCTORY LECTURE #2 Danny Hendler and Yehonatan Cohen Advanced Topics in on-line Social Networks Analysis.
Classification and Prediction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot Readings: Chapter 6 – Han and Kamber.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Part I Data Mining Fundamentals Chapter 1 Data Mining: A First View Jason C. H. Chen, Ph.D. Professor.
ASSESSING LEARNING ALGORITHMS Yılmaz KILIÇASLAN. Assessing the performance of the learning algorithm A learning algorithm is good if it produces hypotheses.
Chapter 6 Classification and Prediction Dr. Bernard Chen Ph.D. University of Central Arkansas.
Software Quality in Use Characteristic Mining from Customer Reviews Warit Leopairote, Athasit Surarerks, Nakornthip Prompoon Department of Computer Engineering,
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
CS112 Scientific Computation Department of Computer Science Wellesley College AI designs a forest Decision trees.
ISQS 7342 Dr. zhangxi Lin By: Tej Pulapa. DT in Forecasting Targeted Marketing - Know before hand what an online customer loves to see or hear about.
Decision Tree Learning Presented by Ping Zhang Nov. 26th, 2007.
Lecture Notes for Chapter 4 Introduction to Data Mining
1 Classification: predicts categorical class labels (discrete or nominal) classifies data (constructs a model) based on the training set and the values.
An Interval Classifier for Database Mining Applications Rakes Agrawal, Sakti Ghosh, Tomasz Imielinski, Bala Iyer, Arun Swami Proceedings of the 18 th VLDB.
Miloš Kotlar 2012/115 Single Layer Perceptron Linear Classifier.
Classification Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 24, 2015.
Monday, February 22,  The term analytics is often used interchangeably with:  Data science  Data mining  Knowledge discovery  Extracting useful.
CS416 Compiler Design1. 2 Course Information Instructor : Dr. Ilyas Cicekli –Office: EA504, –Phone: , – Course Web.
DATA MINING TECHNIQUES (DECISION TREES ) Presented by: Shweta Ghate MIT College OF Engineering.
Genetic Programming. What is Genetic Programming? GP for Symbolic Regression Other Representations for GP Example of GP for Knowledge Discovery Outline.
DECISION TREE INDUCTION CLASSIFICATION AND PREDICTION What is classification? what is prediction? Issues for classification and prediction. What is decision.
A Smart Tool to Predict Salary Trends of H1-B Holders
Chapter 6 Classification and Prediction
Classification and Prediction
Prepared by: Mahmoud Rafeek Al-Farra
Prepared by: Mahmoud Rafeek Al-Farra
כריית נתונים.
Classification and Prediction
CSCI N317 Computation for Scientific Applications Unit Weka
Authors: Wai Lam and Kon Fan Low Announcer: Kyu-Baek Hwang
©Jiawei Han and Micheline Kamber
Presentation transcript:

Application of Decision Tree: Bankruptcy Prediction 2004/05/07

Outline The prediction of bankruptcy Model development –Problem analysis –Feature selection –Data collection –Model building –Model evaluation Other applications

The prediction of bankruptcy Corporate bankruptcy brings with it losses to management, stockholders, employees, customers, and others. =>develop prediction models An unprecedented number of corporations are filing for bankruptcy in Korea during 1997 to 1998.

Model development (Problem analysis) Developing model, which –Predicts bankruptcy –Generates understandable rules, –Without much computation, –Handles both continuous and categorical variables, –Provides an indication of important variables. =>Decision Tree

Model development (Feature selection)

Model development (Data collection) Bankrupt firms were referred to as an act of filing a petition for bankruptcy reported by the KSE. 75 firms were identified. After removing small firms and noise data, 29 bankrupt firms remained. 49 nonbankrupt were selected under certain criteria. All financial data collected were gathered from the KSE.

Model development (Model building) Decision tree induction approaches construct a decision tree using a training data set, where the tree is a simple recursive structure for representing a decision procedure in which a new case is assigned to one of the predefined classes. A nonterminal node in the tree represents a decision attribute value test, and a terminal node denotes a decision class.

Model development (Model building)

Model development (Model evaluation) Cross-validation approach –Divide whole data sets as training set and test set. –Using training set to build model –Using test set to evaluate model Jackknife approach –For n cases, using n-1 cases to build mode and using 1 case to evaluate model

Model development (Model evaluation)

Other application Widely applying on decision support problems, such as: –Information retrieval Ex. Text classification –Business/Management Ex. Credit prediction –Healthcare/Biology Ex. Health care fraud and abuse detection –Computer science Ex. Intrusion detection