WEKA – Knowledge Flow & Simple CLI

Slides:



Advertisements
Similar presentations
Florida International University COP 4770 Introduction of Weka.
Advertisements

CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
Lecture 22: Evaluation April 24, 2010.
Lesson 10-Controlling User Processes. Overview Managing and processing processes. Managing jobs. Exiting/quitting when jobs have been stopped.
Machine Learning in Practice Lecture 3 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
WEKA (sumber: Machine Learning with WEKA). What is WEKA? Weka is a collection of machine learning algorithms for data mining tasks. Weka contains.
Mapping Between Taxonomies Elena Eneva 30 Oct 2001 Advanced IR Seminar.
Classifier Decision Tree A decision tree classifies data by predicting the label for each record. The first element of the tree is the root node, representing.
Three kinds of learning
Evaluation of MineSet 3.0 By Rajesh Rathinasabapathi S Peer Mohamed Raja Guided By Dr. Li Yang.
An Extended Introduction to WEKA. Data Mining Process.
1 Statistical Learning Introduction to Weka Michel Galley Artificial Intelligence class November 2, 2006.
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part II Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Ranga Rodrigo April 5, 2014 Most of the sides are from the Matlab tutorial. 1.
LOGO Chapter V Formattings 1. LOGO Overview  Conditional formatting  Working with tables  Filtering  Sorting  Freeze panes  Pivot tables  How to.
Midterm Review. 1-Intro Data Mining vs. Statistics –Predictive v. experimental; hypotheses vs data-driven Different types of data Data Mining pitfalls.
Today Evaluation Measures Accuracy Significance Testing
An Exercise in Machine Learning
Carolina Environmental Program UNC Chapel Hill The Analysis Engine – A New Tool for Model Evaluation, Sensitivity and Uncertainty Analysis, and more… Alison.
WEKA - Explorer (sumber: WEKA Explorer user Guide for Version 3-5-5)
WEKA and Machine Learning Algorithms. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of.
Appendix: The WEKA Data Mining Software
The Knowledge Flow Interface 김개원 데이터베이스 연구실. 1. Overview.
1 1 Slide Evaluation. 2 2 n Interactive decision tree construction Load segmentchallenge.arff; look at dataset Load segmentchallenge.arff; look at dataset.
The WinMine Toolkit Max Chickering. Build Statistical Models From Data Dependency Networks Bayesian Networks Local Distributions –Trees Multinomial /
Hands-on predictive models and machine learning for software Foutse Khomh, Queen’s University Segla Kpodjedo, École Polytechnique de Montreal PASED - Canadian.
Weka: Experimenter and Knowledge Flow interfaces Neil Mac Parthaláin
Today Ensemble Methods. Recap of the course. Classifier Fusion
W E K A Waikato Environment for Knowledge Analysis Branko Kavšek MPŠ Jožef StefanNovember 2005.
1 1 Slide Using Weka. 2 2 Slide Data Mining Using Weka n What’s Data Mining? We are overwhelmed with data We are overwhelmed with data Data mining is.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
WEKA Machine Learning Toolbox. You can install Weka on your computer from
Weka Just do it Free and Open Source ML Suite Ian Witten & Eibe Frank University of Waikato New Zealand.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 5-1 Data Mining Methods: Classification Most frequently used DM method Employ supervised.
W E K A Waikato Environment for Knowledge Aquisition.
***Classification Model*** Hosam Al-Samarraie, PhD. CITM-USM.
A Brief Introduction and Issues on the Classification Problem Jin Mao Postdoc, School of Information, University of Arizona Sept 18, 2015.
Evaluating Classifiers Reading: T. Fawcett, An introduction to ROC analysis, Sections 1-4, 7 (linked from class website)An introduction to ROC analysis.
WEKA's Knowledge Flow Interface Data Mining Knowledge Discovery in Databases ELIE TCHEIMEGNI Department of Computer Science Bowie State University, MD.
Copyright  2004 limsoon wong Using WEKA for Classification (without feature selection)
Machine Learning in Practice Lecture 2 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
AdaptJ Sookmyung Women’s Univ. PSLAB. 1. 목차 1. Overview 2. Collecting Trace Data using the AdaptJ Agent 2.1 Recording a Trace 3. Analyzing Trace Data.
Downloading the MAXENT Software
In part from: Yizhou Sun 2008 An Introduction to WEKA Explorer.
WEKA: A Practical Machine Learning Tool WEKA : A Practical Machine Learning Tool.
Evaluating Classifiers
Introduction to WEKA Mark Hall Data Mining WEKA - what is it? WEKA UIs
MMS Software Deliverables: Year 1
Waikato Environment for Knowledge Analysis
WEKA.
Sampath Jayarathna Cal Poly Pomona
Machine Learning Week 1.
Machine Learning Platform Life-Cycle Management
Weka Package Weka package is open source data mining software written in Java. Weka can be applied to your dataset from the GUI, the command line or called.
Weka Free and Open Source ML Suite Ian Witten & Eibe Frank
Machine Learning with Weka
Tutorial for LightSIDE
DataMining, Morgan Kaufmann, p Mining Lab. 김완섭 2004년 10월 27일
Opening Weka Select Weka from Start Menu Select Explorer Fall 2003
CSCI N317 Computation for Scientific Applications Unit Weka
CS4705 – Natural Language Processing Thursday, September 28
Intro to Machine Learning
Machine Learning with WEKA
Lecture 10 – Introduction to Weka
Statistical Learning Introduction to Weka
Assignment 1: Classification by K Nearest Neighbors (KNN) technique
Assignment 8 : logistic regression
Assignment 7 Due Application of Support Vector Machines using Weka software Must install libsvm Data set: Breast cancer diagnostics Deliverables:
Data Mining CSCI 307, Spring 2019 Lecture 7
Data Mining CSCI 307, Spring 2019 Lecture 8
Presentation transcript:

WEKA – Knowledge Flow & Simple CLI

Knowledge Flow

Knowledge Flow .. A graphical front end to Weka's core algorithms It is a work in progress It can handle data either incrementally or in batches (the Explorer handles batch data only). Currently in Weka there are five classifiers that can handle data incrementally: NaiveBayesUpdateable, IB1, IBk, LWR (locally weighted regression).

Features intuitive data flow style layout process data in batches or incrementally process multiple batches or streams in parallel! (each separate flow executes in its own thread) chain filters together view models produced by classifiers for each fold in a cross validation visualize performance of incremental classifiers during processing (scrolling plots of classification accuracy, RMS error, predictions etc)

Components DataSources DataSinks Filters Classifiers Clusterers All of Weka's loaders are available DataSinks All of Weka's savers are available Filters All of Weka's filters are available Classifiers All of Weka's classifiers are available Clusterers All of Weka's clusterers are available

Components .. Evaluation TrainingSetMaker - make a data set into a training set TestSetMaker - make a data set into a test set CrossValidationFoldMaker - split any data set, training set or test set into folds TrainTestSplitMaker - split any data set, training set or test set ClassAssigner - assign a column to be the class for any data set ClassValuePicker - choose a class value as the "positive" class. ClassifierPerformanceEvaluator - evaluate the performance of batch trained/tested classifiers IncrementalClassifierEvaluator - evaluate the performance of incrementally trained classifiers ClustererPerformanceEvaluator - evaluate the performance of batch trained/tested clusterers PredictionAppender - append classifier predictions to a test set.

Components .. Visualization DataVisualizer - visualizing data in a single large 2D scatter plot ScatterPlotMatrix - containing a matrix of small scatter plots (clicking on a small plot pops up a large scatter plot) AttributeSummarizer - component that can pop up a panel containing a matrix of histogram plots - one for each of the attributes in the input data ModelPerformanceChart - component that can pop up a panel for visualizing threshold (i.e. ROC style) curves. TextViewer - component for showing textual data. Can show data sets, classification performance statistics etc. GraphViewer - pop up a panel for visualizing tree based models StripChart - component that can pop up a panel that displays a scrolling plot of data

Example

Result

Simple CLI

Simple CLI .. The Simple CLI provides full access to all Weka classes, i.e., classifiers, filters, clusterers, etc. It offers a simple Weka shell with separated commandline and output.

Commands java <classname> [<args>]: invokes a java class with the given arguments (if any) break: stops the current thread, e.g., a running classifier, in a friendly manner kill: stops the current thread in an unfriendly fashion cls: clears the output area exit: exits the Simple CLI help [<command>]: provides an overview of the available commands if without a command name as argument, otherwise more help on the specified command

Example java weka.classifiers.trees.J48 -t e:\data\iris.arff