Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University

Slides:



Advertisements
Similar presentations
The new German project KLIWEX-MED: Changes in weather and climate extremes in the Mediterranean basin Andreas Paxian, University of Würzburg MedCLIVAR.
Advertisements

Statistical downscaling of extreme precipitation and temperature – a systematic and rigorous inter-comparison of methods T. Schmith (1), C.M. Goodess (2)
The Canadian Climate Impacts Scenarios (CCIS) Project is funded by the Climate Change Action Fund and provides climate change scenarios and related information.
Comparing statistical downscaling methods: From simple to complex
Machine learning methods with applications to precipitation and streamflow William W. Hsieh Dept. of Earth, Ocean & Atmospheric Sciences, The University.
Statistical Downscaling of Climate Extremes Xuebin Zhang, Jiafeng Wang, Elaine Barrow Supported by CCAF.
An Overview of Machine Learning
Task: (ECSK06) Regional downscaling Regional modelling with HadGEM3-RA driven by HadGEM2-AO projections National Institute of Meteorological Research (NIMR)/KMA.
Clim.pact – a tool for empirical-statistical downscaling Monika Cahynová cas.cz Institute of Atmospheric Physics & Faculty of Science, Charles.
Hidden Markov Model based 2D Shape Classification Ninad Thakoor 1 and Jean Gao 2 1 Electrical Engineering, University of Texas at Arlington, TX-76013,
Relational Learning with Gaussian Processes By Wei Chu, Vikas Sindhwani, Zoubin Ghahramani, S.Sathiya Keerthi (Columbia, Chicago, Cambridge, Yahoo!) Presented.
Estimating future changes in daily precipitation distribution from GCM simulations 11 th International Meeting on Statistical Climatology Edinburgh,
Dennis P. Lettenmaier Alan F. Hamlet JISAO Center for Science in the Earth System Climate Impacts Group and Department of Civil and Environmental Engineering.
Progress in Downscaling Climate Change Scenarios in Idaho Brandon C. Moore.
Large-scale atmospheric circulation characteristics and their relations to local daily precipitation extremes in Hesse, central Germany Anahita Amiri Department.
Clementine Server Clementine Server A data mining software for business solution.
Eric Salathé Climate Impacts Group (JISAO/SMA) University of Washington Constructing regional climate change scenarios.
1 11th INTERNATIONAL MEETING on STATISTICAL CLIMATOLOGY, EDINBURGH, JULY 12-16, 2010 Downscaling future climate change using statistical ensembles E. Hertig,
1 Ensembles of Nearest Neighbor Forecasts Dragomir Yankov, Eamonn Keogh Dept. of Computer Science & Eng. University of California Riverside Dennis DeCoste.
Statistical Learning: Pattern Classification, Prediction, and Control Peter Bartlett August 2002, UC Berkeley CIS.
General Mining Issues a.j.m.m. (ton) weijters Overfitting Noise and Overfitting Quality of mined models (some figures are based on the ML-introduction.
The Canadian Climate Impacts Scenarios (CCIS) Project is funded by the Climate Change Action Fund and provides climate change scenarios and related information.
Statistical Methods for long-range forecast By Syunji Takahashi Climate Prediction Division JMA.
The Canadian Climate Impacts Scenarios (CCIS) Project is funded by the Climate Change Action Fund and provides climate change scenarios and related information.
Introduction to machine learning
Combining Supervised and Unsupervised Learning for Zero-Day Malware Detection © 2013 Narus, Inc. Prakash Comar 1 Lei Liu 1 Sabyasachi (Saby) Saha 2 Pang-Ning.
Transfer Learning From Multiple Source Domains via Consensus Regularization Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He.
MACHINE LEARNING 張銘軒 譚恆力 1. OUTLINE OVERVIEW HOW DOSE THE MACHINE “ LEARN ” ? ADVANTAGE OF MACHINE LEARNING ALGORITHM TYPES  SUPERVISED.
Anomaly detection with Bayesian networks Website: John Sandiford.
Downscaling in time. Aim is to make a probabilistic description of weather for next season –How often is it likely to rain, when is the rainy season likely.
Experimental seasonal hydrologic forecasting for the Western U.S. Dennis P. Lettenmaier Andrew W. Wood, Alan F. Hamlet Climate Impacts Group University.
STARDEX STAtistical and Regional dynamical Downscaling of EXtremes for European regions A project within the EC 5th Framework Programme EVK2-CT
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Uncertainty in climate scenarios when downscaling with an RCM M. Tadross, B. Hewitson, W Gutowski & AF07 collaborators Water Research Commission of South.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
LOGO Ensemble Learning Lecturer: Dr. Bo Yuan
Where the Research Meets the Road: Climate Science, Uncertainties, and Knowledge Gaps First National Expert and Stakeholder Workshop on Water Infrastructure.
Assessing the impacts of climate change on Atbara flows using bias-corrected GCM scenarios SIGMED and MEDFRIEND International Scientific Workshop Relations.
June 16th, 2009 Christian Pagé, CERFACS Laurent Terray, CERFACS - URA 1875 Julien Boé, U California Christophe Cassou, CERFACS - URA 1875 Weather typing.
M Machine Learning F# and Accord.net. Alena Dzenisenka Software architect at Luxoft Poland Member of F# Software Foundation Board of Trustees Researcher.
Young-Kwon Lim, D.W. Shin, S. Cocke, T. E. LaRow, J. J. O’Brien, and E. P. Chassignet Center for Ocean-Atmospheric Prediction Studies, Florida State University,
Investigation of Various Factorization Methods for Large Recommender Systems G. Takács, I. Pilászy, B. Németh and D. Tikk 10th International.
1 STAT 5814 Statistical Data Mining. 2 Use of SAS Data Mining.
1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 12 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
Climate Scenario and Uncertainties in the Caribbean Chen,Cassandra Rhoden,Albert Owino Anthony Chen,Cassandra Rhoden,Albert Owino Climate Studies Group.
Machine Learning Extract from various presentations: University of Nebraska, Scott, Freund, Domingo, Hong,
Regional Climate Model Evaluation System based on satellite and other observations for application to CMIP/AR downscaling Peter Lean 1, Jinwon Kim 1,3,
CSE 5331/7331 F'07© Prentice Hall1 CSE 5331/7331 Fall 2007 Machine Learning Margaret H. Dunham Department of Computer Science and Engineering Southern.
Mike Dettinger USGS, La Jolla, CA DOWNSCALING to local climate.
Evaluating the ability of climate models to simulate extremes Eric Robinson Natalie McLean Christine Radermacher Ross Towe Yushiang Tung Project 6.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
The Canadian Climate Impacts Scenarios (CCIS) Project is funded by the Climate Change Action Fund and provides climate change scenarios and related information.
1 Multi-Task Semi-Supervised Underwater Mine Detection Lawrence Carin, Qiuhua Liu and Xuejun Liao Duke University Jason Stack Office of Naval Research.
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
© Vipin Kumar IIT Mumbai Case Study 2: Dipoles Teleconnections are recurring long distance patterns of climate anomalies. Typically, teleconnections.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Introduction to Gaussian Process CS 478 – INTRODUCTION 1 CS 778 Chris Tensmeyer.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
NOAA Northeast Regional Climate Center Dr. Lee Tryhorn NOAA Climate Literacy Workshop April 2010 NOAA Northeast Regional Climate.
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
Data Mining: Concepts and Techniques1 Prediction Prediction vs. classification Classification predicts categorical class label Prediction predicts continuous-valued.
Carlo DeMarchi Dept. of Earth, Environmental, and Planetary Sciences Case Western Reserve University Case Western Reserve University Brent Lofgren NOAA.
DATA MINING and VISUALIZATION Instructor: Dr. Matthew Iklé, Adams State University Remote Instructor: Dr. Hong Liu, Embry-Riddle Aeronautical University.
DATA MINING © Prentice Hall.
Considerations in Using Climate Change Information in Hydrologic Models and Water Resources Assessments JISAO Center for Science in the Earth System Climate.
Overview of Supervised Learning
Data Warehousing and Data Mining
Predictability of Indian monsoon rainfall variability
Statistical interpretation of ECMWF EPS
Predictors Observed rainfall up to May.
Presentation transcript:

Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University

Research Overview Spatio-Temporal Data Mining Link Mining Cluster Analysis Anomaly Detection Predictive Modeling Pattern Discovery Techniques and Algorithm Development New problems and challenges New ways for problem solving

Research Overview Spatio-Temporal Data Mining Link Mining Cluster Analysis Anomaly Detection Predictive Modeling Pattern Discovery Techniques and Algorithm Development spatio-temporal data, fusion of simulation & observation data, handling of noise, multi-resolution, extremes Kernel learning, multi-task learning, transfer learning, ensemble learning, semi-supervised learning Analysis and modeling of climate and earth science data

Time Series Prediction Predictor variables (X) Predictand variable (y) Training data Test data (Future) Transfer Function f: X  y

Supervised vs Semi-supervised Learning ðSupervised: Construct function using training data only ðSemi-supervised: Construct function using training + test data (predictor variables only) Predictor variables Predictand variable Training data Test data (Future)

Hidden Markov Regression ØState transition A=[a ij ] ØInitial probability:  = {  1,  2,…,  N } ØEach state is associated with a regression function: ØObservations are generated by:

Experimental Results ðUC Riverside Time Series Repository ðRoot mean square error (RMSE) ØMore than 10% improvement on Greatlake, Steam, Leaf, and C12full data sets

Statistical Downscaling Predictor variables (e.g., SLP, Z-500, u, v) Predictand variable (e.g., precip) Training (NCEP Reanalysis) Test (GCM) Training (OBS)

Application to Statistical Downscaling ðData: Canadian Climate Change Scenario Network ØPredictors: 25 variables (SLP, wind direction, etc) ØPredictand: mean temperature at a location Ø40 randomly selected sites in Canada

Application to Statistical Downscaling Semi-HMMR not much better than supervised HMMR Training and future data come from different sources (GCM vs NCEP reanalysis)

Bias Correction with Covariance Alignment ðCovariance matrices for training and future data: ðGoal is to find a transformation: X U  X U  such that A and B’ are “aligned” with each other

Covariance Alignment ðObjective function: ØSolved using gradient descent algorithm ðAfter alignment, apply semiHMMR on X L and X U 

Semi-Supervised HMMR after Alignment Without covariance alignment With covariance alignment

Collaboration ðNSF-CNH: Towards an Integrated Framework for Climate Change Impact Assessments for International Market Systems with Long-Term Investments ØDrs J.Winkler (PI), J.Andresen, S.Zhong, R.Black, S.Thornsbury, S.Loveridge, A.Iezzoni, J. Zhao ðOther challenges in downscaling: ØSimultaneous downscaling of multiple predictands/locales Multi-task and transfer learning ØSkewed or zero inflated data (e.g., precipitation) Simultaneous classification & regression (Abraham & Tan, 2010) ØModeling of extreme values

Questions? Thank You Website:

Value of (Unlabeled) Predictor Data