Presentation is loading. Please wait.

Presentation is loading. Please wait.

Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University

Similar presentations


Presentation on theme: "Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University"— Presentation transcript:

1 Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University http://www.cse.msu.edu/~ptan

2 Research Overview Spatio-Temporal Data Mining Link Mining Cluster Analysis Anomaly Detection Predictive Modeling Pattern Discovery Techniques and Algorithm Development New problems and challenges New ways for problem solving

3 Research Overview Spatio-Temporal Data Mining Link Mining Cluster Analysis Anomaly Detection Predictive Modeling Pattern Discovery Techniques and Algorithm Development spatio-temporal data, fusion of simulation & observation data, handling of noise, multi-resolution, extremes Kernel learning, multi-task learning, transfer learning, ensemble learning, semi-supervised learning Analysis and modeling of climate and earth science data

4 Time Series Prediction Predictor variables (X) Predictand variable (y) Training data Test data (Future) Transfer Function f: X  y

5 Supervised vs Semi-supervised Learning ðSupervised: Construct function using training data only ðSemi-supervised: Construct function using training + test data (predictor variables only) Predictor variables Predictand variable Training data Test data (Future)

6 Hidden Markov Regression ØState transition A=[a ij ] ØInitial probability:  = {  1,  2,…,  N } ØEach state is associated with a regression function: ØObservations are generated by:

7 Experimental Results ðUC Riverside Time Series Repository ðRoot mean square error (RMSE) ØMore than 10% improvement on Greatlake, Steam, Leaf, and C12full data sets

8 Statistical Downscaling Predictor variables (e.g., SLP, Z-500, u, v) Predictand variable (e.g., precip) Training (NCEP Reanalysis) Test (GCM) Training (OBS)

9 Application to Statistical Downscaling ðData: Canadian Climate Change Scenario Network ØPredictors: 25 variables (SLP, wind direction, etc) ØPredictand: mean temperature at a location Ø40 randomly selected sites in Canada

10 Application to Statistical Downscaling Semi-HMMR not much better than supervised HMMR Training and future data come from different sources (GCM vs NCEP reanalysis)

11 Bias Correction with Covariance Alignment ðCovariance matrices for training and future data: ðGoal is to find a transformation: X U  X U  such that A and B’ are “aligned” with each other

12 Covariance Alignment ðObjective function: ØSolved using gradient descent algorithm ðAfter alignment, apply semiHMMR on X L and X U 

13 Semi-Supervised HMMR after Alignment Without covariance alignment With covariance alignment

14 Collaboration ðNSF-CNH: Towards an Integrated Framework for Climate Change Impact Assessments for International Market Systems with Long-Term Investments ØDrs J.Winkler (PI), J.Andresen, S.Zhong, R.Black, S.Thornsbury, S.Loveridge, A.Iezzoni, J. Zhao ðOther challenges in downscaling: ØSimultaneous downscaling of multiple predictands/locales Multi-task and transfer learning ØSkewed or zero inflated data (e.g., precipitation) Simultaneous classification & regression (Abraham & Tan, 2010) ØModeling of extreme values

15 Questions? Thank You Email: ptan@cse.msu.eduptan@cse.msu.edu Website: http://www.cse.msu.edu/~ptanhttp://www.cse.msu.edu/~ptan

16 Value of (Unlabeled) Predictor Data 12 13 11


Download ppt "Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University"

Similar presentations


Ads by Google