1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.

Slides:



Advertisements
Similar presentations
Supporting End-User Access
Advertisements

By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Overview of Data Mining & The Knowledge Discovery Process Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
Week 9 Data Mining System (Knowledge Data Discovery)
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Data Mining By Archana Ketkar.
Data Mining – Intro.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Oracle Data Mining Ying Zhang. Agenda Data Mining Data Mining Algorithms Oracle DM Demo.
Data Warehousing 資料倉儲 Min-Yuh Day 戴敏育 Assistant Professor 專任助理教授 Dept. of Information Management, Tamkang University Dept. of Information ManagementTamkang.
Data Mining: A Closer Look
Data Mining.
CIT 858: Data Mining and Data Warehousing Course Instructor: Bajuna Salehe Web:
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
Enterprise systems infrastructure and architecture DT211 4
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
Data Mining Using IBM Intelligent Miner Presented by: Qiyan (Jennifer ) Huang.
Chapter 5: Data Mining for Business Intelligence
Data Mining Techniques
Shilpa Seth.  What is Data Mining What is Data Mining  Applications of Data Mining Applications of Data Mining  KDD Process KDD Process  Architecture.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Techniques As Tools for Analysis of Customer Behavior
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
Data Mining and Application Part 1: Data Mining Fundamentals Part 2: Tools for Knowledge Discovery Part 3: Advanced Data Mining Techniques Part 4: Intelligent.
Chapter 1 Introduction to Data Mining
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
DATA MINING 1. 2 Data Mining Extracting or “mining” knowledge from large amounts of data Data mining is the process of autonomously retrieving useful.
2015年10月18日星期日 2015年10月18日星期日 2015年10月18日星期日 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
2015年10月22日星期四 2015年10月22日星期四 2015年10月22日星期四 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
Data Mining By Dave Maung.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Introduction to Data-Mining Marko Grobelnik Institut Jozef Stefan.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
MIS2502: Data Analytics Advanced Analytics - Introduction.
January 8, 2016Data Mining: Concepts and Techniques1 Data Mining: Trends and Applications.
Data Mining and Decision Support
Academic Year 2014 Spring Academic Year 2014 Spring.
February 13, 2016 Data Mining: Concepts and Techniques 1 1 Data Mining: Concepts and Techniques These slides have been adapted from Han, J., Kamber, M.,
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
2016年6月12日星期日 2016年6月12日星期日 2016年6月12日星期日 Introduction to Data Mining 1 Chapter 1 Introduction to Data Mining Chen. Chun-Hsien Department of Information.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
July 7, 2016 Data Mining: Concepts and Techniques 1 1.
Data Mining Concept Submitted TO: Mrs. MONIKA SUBMITTED BY: SHALU 4717.
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining – Intro.
MIS2502: Data Analytics Advanced Analytics - Introduction
Introduction to Data Mining
MIS 451 Building Business Intelligence Systems
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Data Mining: Concepts and Techniques Course Outline
כריית מידע -- מבוא ד"ר אבי רוזנפלד.
Data Warehousing and Data Mining
Prepared by: Mahmoud Rafeek Al-Farra
Data Mining: Concepts and Techniques
Supporting End-User Access
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
CSE591: Data Mining by H. Liu
Presentation transcript:

1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart University Bangkok, Thailand

2 Content PART I  Introduction to data mining  Data mining technique: association rule discovery  Data mining technique: data classification PART II  Improving quality of graduate students by data mining Conclusion

3 What Is Data Mining ? Knowledge Discovery from Data: KDD (Data Mining): The process of nontrivial extraction of patterns from data. Patterns that are: implicit, previously unknown, and potentially useful Patterns must be comprehensible for human users.

4 Knowledge Discovery Process: Iterative & Interactive Process Data sources Databases, flat files, Complex data Data Warehouses Preprocessing data Gathering, cleaning and selecting data Search for patterns: Data Mining Neural nets, machine learning, statistics and others Analyst reviews output Report findings Take actions based on findings Interpret results Mining Objective

5 What kind of data can be mined? Relational databases Data warehouses Transactional databases and Flat files Advanced DB systems and information repositories Object-oriented and object-relational databases Spatial databases Time-series data and temporal data Text databases, multimedia databases Heterogeneous and legacy databases World Wide Web Bioinformatic data Databases Data Warehouse

6 Two modes of data mining Predictive data mining Predict behavior based on historic data Use data with known results to build a model that can be later used to explicitly predict values for different data Methods: classification, prediction, … etc. Descriptive data mining Describe patterns in existing data that may be used to guide decisions Methods: Associations rule discovery, Sequence pattern discovery, Clustering, … etc.

7 Data Mining Techniques Data Clustering Association rule discovery Data Classification Outlier detection Data regression Etc.

8

9 Classification is the process of assigning new objects to predefined categories or classes  Given a set of labeled records  Build a model  Predict labels for future unlabeled records Example:  Age, Educational background, Annual income, Current debts, Housing location => Making Decision  Degree= “ Master ” and Income=7500 => Credit= “ Excellent ” Data Classification

10 Three-Step Process of Classification Model construction Model Evaluation Classification Classifier Model Training Data Testing Data Classifier Model Unseen Data

11 Data Mining Tools ANGOSS KnowledgeStudio IBM Intelligent Miner Metaputer PolyAnalyst SAS Enterprise Miner SGI Mineset SPSS Clementine Many others More at

12 Data Mining Projects Checklist: Start with well-defined questions Define measures of success and failure Main difficulty: No automation Understanding the problem Data preparation Selection of the right mining methods Interpretation

13 Using Data Mining for Improving Quality of Engineering Graduates Objective: Discover knowledge from large databases of engineering student records. Discovered knowledge are useful in: - Assisting in development of new curricula, - Improvement of existing curricula, - Helping students to select the appropriate major

14 Using a data mining technique to help students in selecting their majors Motivation: - Student major selection is very important factor for his/her success. - Lack of experience and information on each major. Solution: - Find out the profiles of good students for each major using student profile database and course enrollment student databases (10 years) - Determine the most appropriate major for each student

15 A Data Mining based Approach for Improving Quality of Engineering Graduates DB2 SQL Server course enrollment student databases student profile database Data Mining Tool Java Servlet User

16 Data for Data Mining Stu_codeSexAddressSch_GPA..... GPA mal e Bangko k mal e Songkla Stu_codeSub_codeTermYearGrade C D B+ Student profile database course enrollment student databases

17 Data preparation a classification model Stu_co de Se x Addr ess Sch_G PA..... GP A ma le Bang kok ma le Song kla Stu_c ode Sub_c ode Ter m Ye ar Gra de C D B+ Stu_co de Sex … GPA m ale Medi um Low m ale High

18 Global Classification Model Global Decision Tree which determines which majors should be appropriate to which students. Each internal node represents a test on student’s profile. Each leaf node represents an appropriate major to be selected

19 Drawbacks of Global Classification Model - Low Precision ~ 50% due to the large number of majors - Number of students is different in each department => the model cannot predict correctly the best major to be selected. - The model proposes a unique major to be selected, a set of possible majors ordered by appropriateness score would be preferred.

20 Classification Model for Each Major -Decision tree predicts whether a student is likely to be a good student in a given major. - Good students are those that graduate within 4 years and are at the first 40% ranking in a given major. - Leaf nodes represent two class: Good and Bad

21 Advantage of Major ’ s Classification Model  Good precision 80%  The model predicts the best major to be selected even if number of students in each major is different  Its proposes a set of possible majors to be selected ordered by appropriateness score. Encountered problems Database size Other factors that could affect student’s decision:  Teacher Preference, etc.

22 Presentation of Discovered Knowledge

23 Applying Association rule discovery for Grade prediction Basket Analysis Mediu m High Mediu m Mediu m Education

24 Grade Prediction for the Coming Term

25 Presentation of Discovered Knowledge

26 Conclusion & Future works Application of data mining in Education Use data mining techniques for improving quality of engineering students Apply data mining techniques to several other educational domains.