Neural Networks Weka Lab

Slides:



Advertisements
Similar presentations
FUNCTION FITTING Student’s name: Ruba Eyal Salman Supervisor:
Advertisements

Machine Learning Homework
Florida International University COP 4770 Introduction of Weka.
KompoZer. This is what KompoZer will look like with a blank document open. As you can see, there are a lot of icons for beginning users. But don't be.
AI Practice 05 / 07 Sang-Woo Lee. 1.Usage of SVM and Decision Tree in Weka 2.Amplification about Final Project Spec 3.SVM – State of the Art in Classification.
Application of Stacked Generalization to a Protein Localization Prediction Task Melissa K. Carroll, M.S. and Sung-Hyuk Cha, Ph.D. Pace University, School.
An Extended Introduction to WEKA. Data Mining Process.
Evaluation of Results (classifiers, and beyond) Biplav Srivastava Sources: [Witten&Frank00] Witten, I.H. and Frank, E. Data Mining - Practical Machine.
Ranga Rodrigo April 5, 2014 Most of the sides are from the Matlab tutorial. 1.
CSc288 Term Project Data mining on predict Voice-over-IP Phones market Huaqin Xu.
CSCI 347 / CS 4206: Data Mining Module 06: Evaluation Topic 01: Training, Testing, and Tuning Datasets.
Data Mining – Algorithms: OneR Chapter 4, Section 4.1.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
WEKA - Explorer (sumber: WEKA Explorer user Guide for Version 3-5-5)
Appendix: The WEKA Data Mining Software
To make a graph in Excel, first you have to highlight the data you want to graph. To do this, click and drag over the “blocks” you need. Be sure to include.
Mail merge I: Use mail merge for mass mailings Perform a complete mail merge Now you’ll walk through the process of performing a mail merge by using the.
StAR web server tutorial for ROC Analysis. ROC Analysis ROC Analysis: This module allows the user to input data for several classifiers to be tested.
1 1 Slide Evaluation. 2 2 n Interactive decision tree construction Load segmentchallenge.arff; look at dataset Load segmentchallenge.arff; look at dataset.
1 ADVANCED MICROSOFT EXCEL Lesson 9 Applying Advanced Worksheets and Charts Options.
Targeted Projection Pursuit Click here for an introduction.
CSC350: Learning Management Systems COMSATS Institute of Information Technology (Virtual Campus)
ISU Basic SAS commands Laboratory No. 1 Computer Techniques for Biological Research Animal Science 500 Ken Stalder, Professor Department of Animal Science.
W E K A Waikato Environment for Knowledge Analysis Branko Kavšek MPŠ Jožef StefanNovember 2005.
Artificial Neural Network Building Using WEKA Software
1 1 Slide Using Weka. 2 2 Slide Data Mining Using Weka n What’s Data Mining? We are overwhelmed with data We are overwhelmed with data Data mining is.
Weka – A Machine Learning Toolkit October 2, 2008 Keum-Sung Hwang.
WEKA Machine Learning Toolbox. You can install Weka on your computer from
An Artificial Neural Network Approach to Surface Waviness Prediction in Surface Finishing Process by Chi Ngo ECE/ME 539 Class Project.
W E K A Waikato Environment for Knowledge Aquisition.
***Classification Model*** Hosam Al-Samarraie, PhD. CITM-USM.
Introduction to Weka ML Seminar for Rookies Byoung-Hee Kim Biointelligence Lab, Seoul National University.
Machine Learning (ML) with Weka Weka can classify data or approximate functions: choice of many algorithms.
In part from: Yizhou Sun 2008 An Introduction to WEKA Explorer.
Machine Learning Homework Gaining familiarity with Weka, ML tools and algorithms.
WEKA: A Practical Machine Learning Tool WEKA : A Practical Machine Learning Tool.
Enlisted Association of the National Guard of the United States Data Extract Instructional Guide.
Perform a complete mail merge Lesson 14 By the end of this lesson you will be able to complete the following: Use the Mail Merge Wizard to perform a basic.
Data Science Credibility: Evaluating What’s Been Learned
An Introduction to WEKA
Presented by Yuting Liu
Development with Eclipse
Computer Science and Engineering, Seoul National University
RFPMonkey.com RFP Triage
shapeAnalysisMANCOVA_Wizard
Prepared by Kimberly Sayre and Jinbo Bi
Advanced Analytics Using Enterprise Miner
An Introduction to WEKA
Machine Learning with WEKA
Machine Learning with WEKA
Weka Package Weka package is open source data mining software written in Java. Weka can be applied to your dataset from the GUI, the command line or called.
Machine Learning with Weka
An Introduction to WEKA
CS539: Project 3 Zach Pardos.
Tutorial for WEKA Heejun Kim June 19, 2018.
Opening Weka Select Weka from Start Menu Select Explorer Fall 2003
CSCI N317 Computation for Scientific Applications Unit Weka
CS4705 – Natural Language Processing Thursday, September 28
Machine Learning with Weka
Forecasting Electricity Demand and Prices with Machine Learning
Machine Learning with WEKA
Lecture 10 – Introduction to Weka
Computer Based Testing Practice
Dimensionality Reduction
Assignment 1: Classification by K Nearest Neighbors (KNN) technique
Assignment 8 : logistic regression
A task of induction to find patterns
A task of induction to find patterns
Data Mining CSCI 307, Spring 2019 Lecture 8
Presentation transcript:

Neural Networks Weka Lab Villanova University Machine Learning Project

Weka Practice We are going to explore using Weka for neural nets The most common NN classifier in Weka is the Multilayer Perceptron, which is located under functions. And we will us the weather.numeric arff file.

First let’s look at the data Open Weka and open weather.numeric.arff in the Preprocess tab. Click on the “Edit…” button to see the actual data. In the Viewer window you can reorder the data by clicking on a column head. And in the window on the bottom right you can click on Visualize All to see each attribute mapped against “Play” as a class

Edit window, reordered This has been modified to order by Play, the highlighted column. Reordering by various columns makes it easier to look at the effect of any one attribute.

Visualize All window Looking at these it’s pretty clear that none of the predictive attributes works well by itself. Humidity or outlook probably does better than the others. In fact, OneR uses outlook. But it’s not very good.

Question 1: How does it look? Just looking at it, can you predict Play easily? Which do you think will do better, ZeroR or OneR? Try the default for both, using cross- validation and the default settings. Which did better? Not easy. ZeroR does better, at 65%, because we have more “play” than “don’t play”. OneR only gives 42% accuracy.

Question 2: More classifiers Try J48 and IBk. What results did you get? For J48 the default is 2 instances/leaf. We have a very small dataset. Does letting it be a single leaf do better (set minNumObj to 1)? For IBk we the default number of neighbors is 1 (KNN). Will more neighbors do better? Try it. Why do you think you got what you did? J48 default is 65%. IBk is about 78%.

Question 3: Okay, on to Neural Nets First let’s look at the standard Weka Interface. In the Classify tab, under Functions choose Multilayer Preceptron as the classifier. Try it with the default options. What accuracy do you get? How many nodes are described? How many hidden layers does this NN have? Accuracy is 78.57. 6 nodes, zero through five. One hidden layer; Zero and One are the output nodes for yes and no, respectively.

Question 4:Applying the model Right-click on the result in the result list to visualize the classifier errors. Rectangles are errors, Xs are correct. You probably want to increase jitter so you can see instances more clearly Look at the accurate predictions (Xs). Chick on one to see the actual instance data. Where did most of the Xs fall? In this case, the currect predictions are largely in the “yes/yes” corner.

Question 5: Classifier Errors This shows instance 10. 1 and 9 are also errors. Classifier Errors are rectangles. Clicking on one opens that instance. Which three instances are the errors?

Question 6: Different Parameters In the GenericObjectEditor you can change the default parameters for the classifier. Try different number of hidden layers and nodes in each layer. learning rate and momentum training time Did you get your accuracy to change? . hiddenLayers -- This defines the hidden layers of the neural network. This is a list of positive whole numbers. 1 for each hidden layer. Comma separated. To have no hidden layers put a single 0 here. For hidden layers, 4,4,4 says three hidden layers with four nodes each. Note that for this small sample size there are not a lot of changes based on these options.

Question 7: A bigger data set Load the diabetes dataset, which has many more instances, and experiment again with the options. Did you see an effect? Can you improve on the default accuracy? Note: these can take a while. Initial results start showing up, but if the STOP button is not grayed out it’s still working. The default parameters give 75.4% accuracy. Students should be able to improve somewhat on this. For instance, 2,2,2 will give 77.3% accuracy.

Question 8.The NN GUI There is a neural net GUI, selected by the first option in the GenericObjectEditor. This editor can be used to change the NN on the fly, while it is being built. Section 11.4, pages 469-472, in the Weka text describes how to use this gui. Work through the examples on those pages. How good a network can you build?(measured by accuracy). Save and submit your final model. The Weka text is Data Mining: Practical Machine Learning Tools and Techniques, Third Edition. Ian H. Witten , Eibe Frank, Mark A. Hall, Morgan Kaufmann, 2011) ISBN-10: 0123748569