Data mining techniques for analysing the weather patterns in Kumeu multi-sensor data Subana Shanmuganathan Geoinformatics Research Centre (GRC) Auckland.

Slides:



Advertisements
Similar presentations
Good Morning. Im the meteorologist, ______ ________. Yesterday I predicted ________. My prediction was_________, because it _________. I ___ed. Here is.
Advertisements

Daily Weather Data - This information will be entered into your science notebook at the beginning of each class day as the bell rings. - It will be checked.
Ensemble Learning – Bagging, Boosting, and Stacking, and other topics
CSE594 Fall 2009 Jennifer Wong Oct. 14, 2009
Mrs. Horn 5 th Grade Science 3 rd Nine Weeks. Temperature Meteorologist- scientist who studies weather Thermometer- measures air temperature and tells.
Describing Weather What factors do we use to describe weather?
Reconstruction of highly resolved atmospheric forcing fields for Northern Europe since 1850 AD Frederik Schenk & Eduardo Zorita EMS Annual Meeting & European.
VISIT: Virtual Intelligent System for Informing Tourists Kevin Meehan Intelligent Systems Research Centre Supervisors: Dr. Kevin Curran, Dr. Tom Lunney,
Weather Tools What are they? What do they do?. The Most Common Weather Tools Are: Thermometer Wind Vane Anemometer Barometer Rain Gauge.
Weather Journal Warm-up. First Page Current Almanac Predictions for the area – Include predictions from both almanacs Predicted Reflection – Is there.
Influences on Weather. EQ: What has an effect on the weather?
Strategic Directions in Real- Time & Embedded Systems Aatash Patel 18 th September, 2001.
Center for Advanced Transportation Education and Research University of Nevada, Reno Md. Arafat Hossain Khan Center for Advanced Transportation Education.
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
A Look into America’s Weather NSF North Mississippi GK-8 Project Created By: Leah Craft Grade Level: 7th.
REBECCA KOLLMEYER EAS 4480 APRIL 24 TH, 2012 Impacts on Atlanta weather during winter by Central Pacific and Eastern Pacific El Nino.
ACCURATE TELEMONITORING OF PARKINSON’S DISEASE SYMPTOM SEVERITY USING SPEECH SIGNALS Schematic representation of the UPDRS estimation process Athanasios.
An Evaluation of A Commercial Data Mining Suite Oracle Data Mining Presented by Emily Davis Supervisor: John Ebden.
The University of Mississippi Geoinformatics Center NASA RPC – March, Evaluation for the Integration of a Virtual Evapotranspiration Sensor Based.
Dynamic Earth : Weather Systems © 2008 Michael Doig Day 1 - Weather Instruments.
Weather Instruments Can you name any instruments or tools used to predict or describe weather?
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Xing Wan Predicting Blood Donor’s Attrition with Data Mining Methods Predicting Blood Donor’s Attrition with Data Mining Methods.
Particle Competition and Cooperation in Networks for Semi-Supervised Learning with Concept Drift Fabricio Breve 1,2 Liang Zhao 2
Introducing model-data fusion to graduate students in ecology Topics of discussion: The impact of NEON on ecology What are the desired outcomes from a.
Techniques for Evaluating Wildfire Smoke Impact on Ozone for Possible Exceptional Events Daniel Alrick 1, Clinton MacDonald 1, Brigette Tollstrup 2, Charles.
What are they? What do they do?
Experiment Management with Microsoft Project Gregor von Laszewski Leor E. Dilmanian Link to presentation on wiki 12:13:33Service Oriented Cyberinfrastructure.
What are they? What do they do?
Department of Automation Xiamen University
Lesson 1: What is Weather?
IEEE International Conference on Multimedia and Expo.
Weather Above and Below the Equator. For the next several days we are going to collect weather data from two cities: *One north of the equator *One an.
A Decision Tree Classification Model For Determining The Location For Solar Power Plant A PRESENTATION BY-  DISHANT MITTAL  DEV GAURAV VIT UNIVERSITY,VELLORE.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
and Statistics, 2016, Vol. 4, No. 1, 1-8. doi: /ajams-4-1-1
Weather Station Model.
before and after rehabilitation
Weather Instruments.
What are they? What do they do?
What are they? What do they do?
What are they? What do they do?
Different tools are used to describe the weather.
What are they? What do they do?
Weather.
Progress Report Meeting
Thursday, May 21, 2015 Castle Learning Week # 38
SLMS 7th Grade Science Weather and Climate
Influences on Weather.
What are they? What do they do?
Study these weather words!
The Origin and Structure of Southern California Climate Variations
Non-parametric Structural Change Detection in Predictive Relationships
What are they? What do they do?
What are they? What do they do?
What are they? What do they do?
What are they? What do they do?
Environmental Statistics
What are they? What do they do?
Research Paper Overview.
What are they? What do they do?
Year range Rainfall (mm) ∆s in rainfall Temp (°C) ∆s in temp RH (mm)
What are they? What do they do?
Physical Geography Geography of Canada.
What are they? What do they do?
What are they? What do they do?
Weather.
Std. Error of the Estimate
What are they? What do they do?
Presentation transcript:

Data mining techniques for analysing the weather patterns in Kumeu multi-sensor data Subana Shanmuganathan Geoinformatics Research Centre (GRC) Auckland University of Technology(AUT) 24 May 2013

overview Background Literature Methodology Data Results Conclusions

background

live web display

literature 1.Sallis, P. and Hernandez. (2011). A precision agronomic state-space estimation method for event anticipation using dynamic multivariate continuous data. to be published in the Journal of Computer Science and Computational Mathematics JCSCMA precision agronomic state-space estimation method for event anticipation using dynamic multivariate continuous data 2.Sallis, P., Claster, W., and Hernandez, S. (2011). A machine-learning algorithm for wind gust prediction. Journal of Computers and the Geosciences, Elsevier Press. 37 (2011) A machine-learning algorithm for wind gust prediction 3.Sallis, P. and Hernandez. (2011). An event-state depiction algorithm using CPA methods with continuous feed data. pp Fifth Asia Modelling Symposium. ISBN /11 © 2011 IEEE DOI /AMS An event-state depiction algorithm using CPA methods with continuous feed data 4.Sallis, P., Hernandez, S. and Shanmuganathan, S. (2011). Dynamic multivariate continuous data state-space estimation for agrometeorological event anticipation. In proceedings of [eds] Thatcher, S., rd International Conference on Machine Learning and Computing (ICMLC 2011) Singapore, February 2011, ISBN /11 ©2011 IEEE Vol 1 pp Dynamic multivariate continuous data state-space estimation for agrometeorological event anticipation 5.Sallis, P., and Hernandez, S. (2011). Geospatial state space estimation using an Ensemble Kalman Filter, International Journal of Simulation, Systems, Science and Technology. Vol 11 (6) May 2011 pp: Geospatial state space estimation using an Ensemble Kalman Filter

The methodology Chapman, Pete, et al., et al. (2000) CRISP-DM 1.0, Step-by-step data mining guide. SPSS Inc. CRISPMWP-1104, pp73. Page 10 & page 12

The multi-sensor data 1.id 2.DateTime 3.NodeId 4.Pressure_Rel 5.Ind_Temp 6.Ind_Hum 7.Out_Temp 8.Out_Hum 9.Dewp 10.Windc 11.Winds 12.Wind_Dir 13.Gust 14.Rain_Rate 15.Act_rain 16.Rain_Today 17.Pressure_Abs 18.VinyardId 19.Rain_Total 20.Heat_Indx 21.High_Gust 1.id 2.Date Time 3.Pressure_Rel 4.Out_Temp 5.Out_Hum 6.Dewp 7.Windc 8.Winds 9.Wind_Dir 10.Gust 11. class 12.Heat_Indx 13.E_code Record high temperature 32.8°C Record low temperature -8.9°C Record high gust km/h Record high average km/h Record daily rain 82.6 mm Record low wind chill -11.5°C Record high barometer hPa Record low barometer hPa From

Data distribution gust classes 20,"very high"

Data mining C5.0 C&RT (classification and regression trees-B) CAHID (Chi-squared Automatic Interaction) Detector ANN Regression PCA Sw: SPSS clementine

C5.0 for high gust

C5.0 for very very high gust

C5.0 rule for high gust Rule 118 for high gust (8; 0.75) if Pressure_Rel > 909 and Out_Temp > -9.8 and Winds > 4.9 and Dewp <= 18 and Winds <= 9.9 and Heat_Indx <= 0 and Out_Temp > 2.5 and Winds > 7.3 and Dewp > 8.3 and Out_Temp > 14.9 and Out_Hum <= 90 and Pressure_Rel <= and Winds <= 8.8 and Wind_Dir <= 242 and Out_Hum > 45 and Winds > 8 and Pressure_Rel > and Out_Temp <= 24.9 and Out_Hum > 69 and Dewp <= 16.4 and Wind_Dir <= 135 and Out_Hum > 70 and Winds <= 8.3 and Wind_Dir <= 67 and Wind_Dir <= 22 and Out_Hum <= 72 and Pressure_Rel > and Pressure_Rel <= then high gust

C&RT gust prediction

C&RT rules for Gust prediction

C&RT class: error & correct readings Wind speed Pressure relative & wind sp Wind direction & wind sp Outdoor temp wind dir & wind sp Wind sp

CHAID

CHAID wind speed <=0

Wind speed >13.7

ANN predict gust class

ANN

PCA

Conclusions Different primary predictors – C5.0=>pressure relative – C&RT => wind speed – CHAID => wind speed – Regression test model => wine speed, pressure relative, outdoor humidity, wind direction, wind chill, outdoor temperature, dew point – PCA=> pressure relative outdoor temperature, outdoor humidity, dew point, wind chill, w speed, w direction Future work – Deploy online – Test other location data

acknowledgements Sara, Akbar & Philip – for access to data