Data Mining: Data Mining Research


Data Mining: Data Mining Research
Romi Satria Wahono
romi@romisatriawahono.net
http://romisatriawahono.net
+6281586220090

Romi Satria Wahono
SD Sompok Semarang (1987)
SMPN 8 Semarang (1990)
SMA Taruna Nusantara, Magelang (1993)
Bachelor's, Master's, and Doctoral (on leave) studies, Department of Computer Sciences, Saitama University, Japan (1994-2004)
Research Interests: Software Engineering and Intelligent Systems
Founder, IlmuKomputer.Com
Researcher, LIPI (2004-2007)
Founder and CEO, PT Brainmatics Cipta Informatika

Course Outline
1. Introduction to Data Mining
2. The Data Mining Process
3. Evaluation and Validation in Data Mining
4. Data Mining Methods and Algorithms
5. Data Mining Research

Data Mining Research

Data Mining Research
Standard Research Process in Data Mining
Journal Publications on Data Mining
Research on Classification
Research on Clustering
Research on Prediction
Research on Association Rule

Standard Research Process in Data Mining

Data Mining Standard Process (CRISP-DM)
A cross-industry standard was clearly required, one that is industry-neutral, tool-neutral, and application-neutral.
The Cross-Industry Standard Process for Data Mining (CRISP-DM) was developed in 1996 (Chapman, 2000).
CRISP-DM provides a nonproprietary and freely available standard process for fitting data mining into the general problem-solving strategy of a business or research unit.

CRISP-DM (slide figure not included in the transcript)

1. Business Understanding Phase
Enunciate the project objectives and requirements clearly in terms of the business or research unit as a whole.
Translate these goals and restrictions into the formulation of a data mining problem definition.
Prepare a preliminary strategy for achieving these objectives.

2. Data Understanding Phase
Collect the data.
Use exploratory data analysis to familiarize yourself with the data and discover initial insights.
Evaluate the quality of the data.
If desired, select interesting subsets that may contain actionable patterns.
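As an illustration of the data understanding steps, the sketch below profiles a small data set: it counts missing values, computes basic statistics, and looks at the class distribution. The records, the column names (`age`, `income`, `approved`), and the helper functions `profile` and `class_distribution` are all hypothetical and are not taken from the slides.

```python
from statistics import mean, median

# Hypothetical toy records (not from the slides); None marks a missing value.
records = [
    {"age": 25, "income": 3200, "approved": "yes"},
    {"age": 41, "income": None, "approved": "no"},
    {"age": 37, "income": 5100, "approved": "yes"},
    {"age": None, "income": 2800, "approved": "no"},
    {"age": 52, "income": 7400, "approved": "yes"},
]

def profile(rows, column):
    """Summarize one numeric column: row count, missing values, basic statistics."""
    values = [r[column] for r in rows if r[column] is not None]
    return {
        "count": len(rows),
        "missing": len(rows) - len(values),
        "min": min(values),
        "max": max(values),
        "mean": mean(values),
        "median": median(values),
    }

def class_distribution(rows, target):
    """Count how often each value of the target attribute occurs."""
    counts = {}
    for r in rows:
        counts[r[target]] = counts.get(r[target], 0) + 1
    return counts

age_profile = profile(records, "age")
income_profile = profile(records, "income")
distribution = class_distribution(records, "approved")

print(age_profile)
print(income_profile)
print(distribution)
```

A profile like this gives a first impression of data quality (how many values are missing) and of how balanced the target classes are, before any preparation work starts.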

3. Data Preparation Phase
Prepare from the initial raw data the final data set that is to be used for all subsequent phases. This phase is very labor intensive.
Select the cases and variables you want to analyze and that are appropriate for your analysis.
Perform transformations on certain variables, if needed.
Clean the raw data so that it is ready for the modeling tools.
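The preparation steps can be sketched on a toy data set: select variables, drop incomplete cases, and transform numeric variables. Min-max scaling is used here only as one possible transformation; the records and the helper names (`select_variables`, `drop_incomplete`, `min_max_scale`) are assumptions made for this example.

```python
# Hypothetical raw records (not from the slides); None marks a missing value.
raw = [
    {"id": 1, "age": 25, "income": 3200, "approved": "yes"},
    {"id": 2, "age": 41, "income": None, "approved": "no"},
    {"id": 3, "age": 37, "income": 5100, "approved": "yes"},
    {"id": 4, "age": 52, "income": 7400, "approved": "no"},
]

def select_variables(rows, columns):
    """Keep only the variables chosen for the analysis."""
    return [{c: r[c] for c in columns} for r in rows]

def drop_incomplete(rows):
    """Clean the data by removing cases with any missing value."""
    return [r for r in rows if all(v is not None for v in r.values())]

def min_max_scale(rows, column):
    """Transform a numeric variable to the range [0, 1]."""
    values = [r[column] for r in rows]
    low, high = min(values), max(values)
    span = high - low
    return [{**r, column: (r[column] - low) / span if span else 0.0} for r in rows]

selected = select_variables(raw, ["age", "income", "approved"])
cleaned = drop_incomplete(selected)
prepared = min_max_scale(min_max_scale(cleaned, "age"), "income")

for row in prepared:
    print(row)
```

Dropping incomplete cases is the simplest cleaning strategy; imputing missing values is a common alternative when data is scarce.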

4. Modeling Phase
Select and apply appropriate modeling techniques.
Calibrate model settings to optimize results.
Remember that often, several different techniques may be used for the same data mining problem.
If necessary, loop back to the data preparation phase to bring the form of the data into line with the specific requirements of a particular data mining technique.
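To make the modeling steps concrete, the sketch below applies two techniques to the same problem, a majority-class baseline and k-nearest neighbors, and calibrates the k-NN setting `k` on a validation set. The choice of these two techniques, the data points, and the function names are assumptions for illustration; the slides do not prescribe any particular technique here.

```python
from collections import Counter
import math

# Hypothetical, already prepared data: (feature vector, class label).
train = [
    ((0.10, 0.20), "no"), ((0.20, 0.10), "no"), ((0.15, 0.30), "no"),
    ((0.80, 0.90), "yes"), ((0.90, 0.80), "yes"), ((0.85, 0.70), "yes"),
    ((0.30, 0.25), "no"),
]
validation = [
    ((0.12, 0.18), "no"), ((0.88, 0.85), "yes"),
    ((0.25, 0.30), "no"), ((0.75, 0.80), "yes"),
]

def majority_baseline(training):
    """Technique 1: always predict the most frequent training class."""
    label = Counter(y for _, y in training).most_common(1)[0][0]
    return lambda x: label

def knn(training, k):
    """Technique 2: k-nearest neighbors with Euclidean distance."""
    def predict(x):
        nearest = sorted(training, key=lambda item: math.dist(item[0], x))[:k]
        return Counter(y for _, y in nearest).most_common(1)[0][0]
    return predict

def accuracy(model, data):
    """Fraction of examples for which the model predicts the true label."""
    return sum(model(x) == y for x, y in data) / len(data)

# Several techniques and settings for the same problem.
candidates = {"baseline": majority_baseline(train)}
for k in (1, 3, 5):
    candidates[f"knn(k={k})"] = knn(train, k)

# Calibrate by comparing every candidate on the validation set.
scores = {name: accuracy(model, validation) for name, model in candidates.items()}
best = max(scores, key=scores.get)

print(scores)
print("selected:", best)
```

Note that k-NN depends on distances, so it expects numeric features on comparable scales. A requirement like this is what can send a project back to the data preparation phase.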

5. Evaluation Phase
Evaluate the one or more models delivered in the modeling phase for quality and effectiveness before deploying them for use in the field.
Determine whether the model in fact achieves the objectives set for it in the first phase.
Establish whether some important facet of the business or research problem has not been accounted for sufficiently.
Come to a decision regarding use of the data mining results.
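One way to sketch the evaluation steps is to compute quality measures from a confusion matrix and compare them with the objectives set in the business understanding phase. The labels, predictions, and target values in `objectives` below are invented for illustration, and accuracy, precision, and recall are only one possible choice of measures.

```python
# Hypothetical hold-out labels and model predictions (not from the slides).
actual    = ["yes", "yes", "no", "no", "yes", "no", "yes", "no", "no", "yes"]
predicted = ["yes", "no",  "no", "no", "yes", "yes", "yes", "no", "no", "yes"]

def confusion_counts(actual, predicted, positive="yes"):
    """Count true/false positives and negatives for one positive class."""
    counts = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for a, p in zip(actual, predicted):
        if p == positive:
            counts["tp" if a == positive else "fp"] += 1
        else:
            counts["fn" if a == positive else "tn"] += 1
    return counts

def metrics(c):
    """Derive accuracy, precision, and recall from the confusion counts."""
    total = sum(c.values())
    return {
        "accuracy": (c["tp"] + c["tn"]) / total,
        "precision": c["tp"] / (c["tp"] + c["fp"]) if c["tp"] + c["fp"] else 0.0,
        "recall": c["tp"] / (c["tp"] + c["fn"]) if c["tp"] + c["fn"] else 0.0,
    }

# Assumed objectives from phase 1; the thresholds are made up.
objectives = {"accuracy": 0.75, "recall": 0.85}

counts = confusion_counts(actual, predicted)
results = metrics(counts)
unmet = [name for name, target in objectives.items() if results[name] < target]
decision = "deploy" if not unmet else "revisit earlier phases"

print(counts)
print(results)
print("unmet objectives:", unmet, "->", decision)
```

Checking each measure against an explicit target keeps the decision tied to the objectives of the first phase instead of to a single headline number.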

6. Deployment Phase
Make use of the models created; model creation does not signify the completion of a project.
Example of a simple deployment: generate a report.
Example of a more complex deployment: implement a parallel data mining process in another department.
For businesses, the customer often carries out the deployment based on your model.
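The simple deployment case, generating a report, might look like the sketch below. The project name, model names, scores, and date are invented placeholders, and `build_report` is a hypothetical helper, not part of any library.

```python
from datetime import date

def build_report(project, scores, chosen, report_date):
    """Render evaluation results as a plain-text report, best model first."""
    lines = [
        f"Data mining report: {project}",
        f"Date: {report_date.isoformat()}",
        "",
        "Model results:",
    ]
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    for name, score in ranked:
        marker = " (selected)" if name == chosen else ""
        lines.append(f"  {name}: accuracy {score:.2f}{marker}")
    return "\n".join(lines)

# Made-up numbers, used only to show the report layout.
report = build_report(
    project="credit approval",
    scores={"k-NN": 0.79, "decision tree": 0.84},
    chosen="decision tree",
    report_date=date(2024, 1, 15),
)
print(report)
```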

Exercise
Study and understand Case Studies 1-5 in Chapter 1 of Larose (2005).
Study and understand how CRISP-DM is applied in Firmansyah's (2011) thesis on applying the C4.5 algorithm to determine creditworthiness.

Journal Publications on Data Mining

Transactions and Journals
Review Paper (survey and state-of-the-art):
ACM Computing Surveys (CSUR)
Research Paper (technical):
ACM Transactions on Knowledge Discovery from Data (TKDD)
ACM Transactions on Information Systems (TOIS)
IEEE Transactions on Knowledge and Data Engineering
Springer Data Mining and Knowledge Discovery
International Journal of Business Intelligence and Data Mining (IJBIDM)

Cognitive Assignment III
Read one scientific paper, published in a journal between 2010 and 2012, that relates to the data mining methods we have studied.
Summarize it as slides with the following structure:
1. Research Background
2. Problem Statements
3. Research Questions
4. Research Objective
5. Existing Methods
6. Proposed Method
7. Results
8. Conclusion
Present it in front of the class at the next lecture.

References
Ian H. Witten, Eibe Frank, and Mark A. Hall, Data Mining: Practical Machine Learning Tools and Techniques, 3rd Edition, Elsevier, 2011
Daniel T. Larose, Discovering Knowledge in Data: An Introduction to Data Mining, John Wiley & Sons, 2005
Florin Gorunescu, Data Mining: Concepts, Models and Techniques, Springer, 2011
Jiawei Han and Micheline Kamber, Data Mining: Concepts and Techniques, 2nd Edition, Elsevier, 2006
Oded Maimon and Lior Rokach, Data Mining and Knowledge Discovery Handbook, 2nd Edition, Springer, 2010
Warren Liao and Evangelos Triantaphyllou (eds.), Recent Advances in Data Mining of Enterprise Data: Algorithms and Applications, World Scientific, 2007