Data mining by WEKA Karunamuni P Silva CIS 675. Data Description Cost Data for U.S. Airlines, 90 Oservations On 6 Firms For 15 Years, 1970-1984 Source:

Slides:



Advertisements
Similar presentations
Florida International University COP 4770 Introduction of Weka.
Advertisements

Statistical Methods Lecture 29
 Will help you gain knowledge in: ◦ Improving performance characteristics ◦ Reducing costs ◦ Understand regression analysis ◦ Understand relationships.
Unsupervised Learning
Principle Components & Neural Networks How I finished second in Mapping Dark Matter Challenge Sergey Yurgenson, Harvard University Pasadena, 2011.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 20 Curved Patterns.
Part 1: Simple Linear Model 1-1/301-1 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Linear vs. exponential growth Linear vs. exponential growth: t = 0 A = 1x(1+1) 0 = 1 A = 1x0 + 1 = 1.
1 On Constructing Efficient Shared Decision Trees for Multiple Packet Filters Author: Bo Zhang T. S. Eugene Ng Publisher: IEEE INFOCOM 2010 Presenter:
How much rent do you pay per month during the academic year? (Enter DK if you don’t know.)
Environmental Regulation in Oligopoly Markets: A Study of Electricity Restructuring Erin T. Mansur UC Berkeley and UC Energy Institute March 22, 2002 POWER.
Bagging LING 572 Fei Xia 1/24/06. Ensemble methods So far, we have covered several learning methods: FSA, HMM, DT, DL, TBL. Question: how to improve results?
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
 relatively small economies of scale  many firms  product differentiation  close but not perfect substitutes  product characteristics, location, services.
Excel Data Analysis Tools Descriptive Statistics – Data ribbon – Analysis section – Data Analysis icon – Descriptive Statistics option – Does NOT auto.
Study Case Results PC06 High NG Price PC07 Low NG Price W ESTERN E LECTRICITY C OORDINATING C OUNCIL.
Decision for the location of Intermodal terminals in a rail-road network Anupam Kulshreshtha IIM - Lucknow.
Ranga Rodrigo April 5, 2014 Most of the sides are from the Matlab tutorial. 1.
Comparing the Parallel Automatic Composition of Inductive Applications with Stacking Methods Hidenao Abe & Takahira Yamaguchi Shizuoka University, JAPAN.
1 Chapter Seven Introduction to Sampling Distributions Section 1 Sampling Distribution.
Carolina Environmental Program UNC Chapel Hill The Analysis Engine – A New Tool for Model Evaluation, Sensitivity and Uncertainty Analysis, and more… Alison.
Air Transport Association May 21,2002 NET INCOME U.S. Scheduled Airlines
Chapter Fourteen Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or.
Learning user preferences for 2CP-regression for a recommender system Alan Eckhardt, Peter Vojtáš Department of Software Engineering, Charles University.
Statistical Methods Statistical Methods Descriptive Inferential
Then click the box for Normal probability plot. In the box labeled Standardized Residual Plots, first click the checkbox for Histogram, Multiple Linear.
Recent Growth Rate of US GDP Since the Great Recession nited-states/gdp-growth.
Copyright © Cengage Learning. All rights reserved.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
FUEL SAVING TECHNIQUES David Popwell, Yancey Bros. Co.
Quantifying change order impact on productivity by using ANN approach ECE 539 Project Presentation (Order: 316) Min-Jae Lee Construction Management Program.
Production Schedule. Production Table Production Table for a Linear Production Function Equation: Patients served = 5*Doctors + 5* Nurses.
Intro to Econometrics Using Excel for Estimation.
Prognostic Prediction of Breast Cancer Using C5 Sakina Begum May 1, 2001.
Estimated Weighted Least Squares Profits and Market Structure for Highly Advertising Companies J.M. Vernon and R.E.M. Nourse (1973). “Profit Rates and.
CpSc 881: Machine Learning
Environmental Benefits of Renewable Portfolio Standards in an Age of Coal Plant Retirements September 10 th, 2015 Energy Policy Research Conference Denver,
American Wind Energy Association Wind in The Portfolio Mike Jacobs American Wind Energy Association Managing Your Energy Portfolio in a Greener World January.
LP Examples Solid Waste Management. A SOLID WASTE PROBLEM Landfill Maximum capacity (tons/day) Cost of transfer to landfill ($/ton) Cost of disposal at.
Machine Learning in CSC 196K
Elnaz Siami-Irdemoosaa,Saeid R. Dindarloo
Linear Regulator Fundamentals
2013 Load Impact Evaluation Capacity Bidding Program (CBP) Steve Braithwait, Dan Hansen, and Dave Armstrong Christensen Associates Energy Consulting DRMEC.
Chapter 8: Short-Run Costs and Output Decisions. Firm’s Decisions.
1 A Conversation about MISO Paul Spicer November 8, 2007.
PROFIT MAXIMIZATION. Profit Maximization  Profit =  Total Cost = Fixed Cost + Variable Cost  Fixed vs. Variable… examples?  Fixed – rent, loan payments,
Electricity and Renewables
Given Slope & y-Intercept
Introduction to Linear Programs
Market Update With Mark Chrencik September 13, :30 AM.
Lecture 3: Linear Regression (with One Variable)
NET INCOME U.S. Scheduled Airlines $ Billions
3.2 Productivity Understand what is meant by productivity.
A Simple Artificial Neuron
Predict House Sales Price
NEURAL NETWORK APPROACHES FOR AUTOMOBILE MPG PREDICTION
Year 2 Updates.
ريكاوري (بازگشت به حالت اوليه)
التدريب الرياضى إعداد الدكتور طارق صلاح.
Lower fuel prices and operating costs
Dr Sh Salleh bin Sh Ahmad
INNOvation in TRAINING BUSINESS ANALYSTS HAO HElEN Zhang UniVERSITY of ARIZONA

Factor Analysis (Principal Components) Output
Assignment 5 Example of multivariate regression
Cases. Simple Regression Linear Multiple Regression.
Motivation It can effectively mine multi-modal knowledge with structured textural and visual relationships from web automatically. We propose BC-DNN method.
Data Mining CSCI 307, Spring 2019 Lecture 7
Why Have Economic Reforms in Mexico Not Generated Growth
Data Mining CSCI 307, Spring 2019 Lecture 8
Presentation transcript:

Data mining by WEKA Karunamuni P Silva CIS 675

Data Description Cost Data for U.S. Airlines, 90 Oservations On 6 Firms For 15 Years, Source: These data are a subset of a larger data set provided to the author by Professor Moshe Kim. They were originally constructed by Christensen Associates of Madison, Wisconsin. I consider following data set Cost vs Fuel price and Load factor, the average capacity utilization of the fleet. Output

Histogram of Output

Histogram of Fuel Price

Histogram of Load Factor

Histogram of Cost

Linear Regression 66% of training set & 34% test set

Visualization

76% of training set & 24% test set

Visualization

86% of training set & 14% test set

Visualization

Alpha=0.05 Iterations=200

References nel/ E2010/files/2010/11/wekatutorial.pdf