Weka: a useful tool in data mining and machine learning Team 5 Noha Elsherbiny, Huijun Xiong, and Bhanu Peddi.

Slides:



Advertisements
Similar presentations
Florida International University COP 4770 Introduction of Weka.
Advertisements

Weka & Rapid Miner Tutorial By Chibuike Muoh. WEKA:: Introduction A collection of open source ML algorithms – pre-processing – classifiers – clustering.
Kansas State University Department of Computing and Information Sciences Laboratory for Knowledge Discovery in Databases (KDD) KDD Group Research Seminar.
Department of Computer Science, University of Waikato, New Zealand Eibe Frank WEKA: A Machine Learning Toolkit The Explorer Classification and Regression.
WEKA (sumber: Machine Learning with WEKA). What is WEKA? Weka is a collection of machine learning algorithms for data mining tasks. Weka contains.
Chapter 9 DATA WAREHOUSING Transparencies © Pearson Education Limited 1995, 2005.
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
WEKA Evaluation of WEKA Waikato Environment for Knowledge Analysis Presented By: Manoj Wartikar & Sameer Sagade.
Department of Computer Science, University of Waikato, New Zealand Eibe Frank WEKA: A Machine Learning Toolkit The Explorer Classification and Regression.
March 25, 2004Columbia University1 Machine Learning with Weka Lokesh S. Shrestha.
DATA WAREHOUSING.
An Extended Introduction to WEKA. Data Mining Process.
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
Jump to first page The objective of our final project is to evaluate several supervised learning algorithms for identifying pre-defined classes among web.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
Presenter: Teng-Chih Yang Professor: Ming-Puu Chen Date: 10/ 28/ 2009 Data mining in course management systems: Moodle case study and tutorial Romero,
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
1 How to use Weka How to use Weka. 2 WEKA: the software Waikato Environment for Knowledge Analysis Collection of state-of-the-art machine learning algorithms.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
Data Mining Techniques
An Exercise in Machine Learning
 The Weka The Weka is an well known bird of New Zealand..  W(aikato) E(nvironment) for K(nowlegde) A(nalysis)  Developed by the University of Waikato.
Contributed by Yizhou Sun 2008 An Introduction to WEKA.
Department of Computer Science, University of Waikato, New Zealand Geoff Holmes WEKA project and team Data Mining process Data format Preprocessing Classification.
Appendix: The WEKA Data Mining Software
In part from: Yizhou Sun 2008 An Introduction to WEKA Explorer.
1 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent.
Data Mining Applied to Document Imaging Jeff Rekoske.
1 1 Slide Evaluation. 2 2 n Interactive decision tree construction Load segmentchallenge.arff; look at dataset Load segmentchallenge.arff; look at dataset.
Data Mining – A First View Roiger & Geatz. Definition Data mining is the process of employing one or more computer learning techniques to automatically.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Department of Computer Science, University of Waikato, New Zealand Eibe Frank WEKA: A Machine Learning Toolkit The Explorer Classification and Regression.
Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana
Machine Learning with Weka Cornelia Caragea Thanks to Eibe Frank for some of the slides.
W E K A Waikato Environment for Knowledge Analysis Branko Kavšek MPŠ Jožef StefanNovember 2005.
Artificial Neural Network Building Using WEKA Software
Department of Computer Science, University of Waikato, New Zealand Eibe Frank WEKA: A Machine Learning Toolkit The Explorer Classification and Regression.
27-18 września Data Mining dr Iwona Schab. 2 Semester timetable ORGANIZATIONAL ISSUES, INDTRODUCTION TO DATA MINING 1 Sources of data in business,
Weka – A Machine Learning Toolkit October 2, 2008 Keum-Sung Hwang.
October 2-3, 2015, İSTANBUL Boğaziçi University Prof.Dr. M.Erdal Balaban Istanbul University Faculty of Business Administration Avcılar, Istanbul - TURKEY.
Introduction to Weka Xingquan (Hill) Zhu Slides copied from Jeffrey Junfeng Pan (UST)
W E K A Waikato Environment for Knowledge Aquisition.
An Exercise in Machine Learning
Berendt: Advanced databases, winter term 2007/08, 1 Advanced databases – Inferring implicit/new.
***Classification Model*** Hosam Al-Samarraie, PhD. CITM-USM.
WEKA's Knowledge Flow Interface Data Mining Knowledge Discovery in Databases ELIE TCHEIMEGNI Department of Computer Science Bowie State University, MD.
In part from: Yizhou Sun 2008 An Introduction to WEKA Explorer.
FNA/Spring CENG 562 – Machine Learning. FNA/Spring Contact information Instructor: Dr. Ferda N. Alpaslan
@relation age sex { female, chest_pain_type { typ_angina, asympt, non_anginal,
WEKA: A Practical Machine Learning Tool WEKA : A Practical Machine Learning Tool.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
An Introduction to WEKA
An Enhanced Support Vector Machine Model for Intrusion Detection
Waikato Environment for Knowledge Analysis
WEKA.
An Introduction to WEKA
Machine Learning with WEKA
Machine Learning with WEKA
Machine Learning with Weka
An Introduction to WEKA
Data Science introduction.
Opening Weka Select Weka from Start Menu Select Explorer Fall 2003
Machine Learning with Weka
Dept. of Computer Science University of Liverpool
Lecture 10 – Introduction to Weka
Data Mining CSCI 307, Spring 2019 Lecture 7
Presentation transcript:

Weka: a useful tool in data mining and machine learning Team 5 Noha Elsherbiny, Huijun Xiong, and Bhanu Peddi

What does it mean really to mine data? Data mining is an experimental science. Data mining finds valuable information hidden in large volumes of data Data mining is the analysis of data and the use of software techniques for finding patterns and regularities in sets of data Data mining encompasses varied fields, including: – Databases – Statistics – Machine Learning – High Performance Computing – Visualization – Mathematics

How does WEKA come into play? "Drowning in Data yet Starving for Knowledge“ There is no single machine learning scheme is suitable to all data mining problems. WEKA(Waikato Environment for knowledge Analysis)

What is in WEKA, that makes it special? Provides many different algorithms for data mining and machine learning This is an open source and freely available It is platform-independent It is easily useable by people who are not data mining specialists It provides flexible facilities for scripting experiments Its has kept up-to-date, with new algorithms being added as they appear in research literature.

How do one implement WEKA, then? Apply a learning method to a dataset and analyze its output to learn more about the data. Use learned models to generate prediction on new instances. Apply several different learners and compare their performance in order to chose best one for prediction.

How do you actually use it? All algorithms take their input form of a single relational table in the ARFF format. The learning methods are called classifiers. – weka.classifiers.IBk: k-nearest neighbour learner – weka.classifiers.trees.J48: decision trees – weka.classifiers.NaiveBayes: Naive Bayes with/without kernels – weka.classifiers.SMO: support vector machines There are also pre-processing tools, called filters

Show us how to use it!

Decision Trees on weka

Naïvebayes on Weka

Support Vector machines

References Witten, Ian: Data Mining: Practical Tools and Techniques KDNuggets, McNicholas, P. D. and Zhao, Y. C. (2009), Association rules: An overview, in Y. Zhao, C. Zhang & L. Cao, eds, 'Post-Mining of Association Rules: Techniques for Effective Knowledge Extraction', IGI Global, pp Available at international.org/downloads/excerpts/33406.pdf process.html University of Waikato, New Zealand