Arizona State University1 Fast Mining of a Network of Coevolving Time Series Wei FanHanghang TongPing JiYongjie Cai.

Slides:



Advertisements
Similar presentations
Statistics for Improving the Efficiency of Public Administration Daniel Peña Universidad Carlos III Madrid, Spain NTTS 2009 Brussels.
Advertisements

FUNNEL: Automatic Mining of Spatially Coevolving Epidemics Yasuko Matsubara, Yasushi Sakurai (Kumamoto University) Willem G. van Panhuis (University of.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
Learning to Recommend Hao Ma Supervisors: Prof. Irwin King and Prof. Michael R. Lyu Dept. of Computer Science & Engineering The Chinese University of Hong.
Robust Network Compressive Sensing Lili Qiu UT Austin NSF Workshop Nov. 12, 2014.
A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University.
Yuan Yao Joint work with Hanghang Tong, Feng Xu, and Jian Lu Predicting Long-Term Impact of CQA Posts: A Comprehensive Viewpoint 1 Aug 24-27, KDD 2014.
Locally Constraint Support Vector Clustering
1 Abstract This paper presents a novel modification to the classical Competitive Learning (CL) by adding a dynamic branching mechanism to neural networks.
Margin Based Sample Weighting for Stable Feature Selection Yue Han, Lei Yu State University of New York at Binghamton.
Sparsity, Scalability and Distribution in Recommender Systems
A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University.
Spatial and Temporal Databases Efficiently Time Series Matching by Wavelets (ICDE 98) Kin-pong Chan and Ada Wai-chee Fu.
Host Load Prediction in a Google Compute Cloud with a Bayesian Model Sheng Di 1, Derrick Kondo 1, Walfredo Cirne 2 1 INRIA 2 Google.
Artificial Neural Networks
Cut-And-Stitch: Efficient Parallel Learning of Linear Dynamical Systems on SMPs Lei Li Computer Science Department School of Computer Science Carnegie.
Da Yan, Zhou Zhao and Wilfred Ng The Hong Kong University of Science and Technology.
Sparse Coding for Specification Mining and Error Localization Runtime Verification September 26, 2012 Wenchao Li, Sanjit A. Seshia University of California.
WEMAREC: Accurate and Scalable Recommendation through Weighted and Ensemble Matrix Approximation Chao Chen ⨳ , Dongsheng Li
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Yan Yan, Mingkui Tan, Ivor W. Tsang, Yi Yang,
Non Negative Matrix Factorization
Tracking with Unreliable Node Sequences Ziguo Zhong, Ting Zhu, Dan Wang and Tian He Computer Science and Engineering, University of Minnesota Infocom 2009.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
Fast Mining and Forecasting of Complex Time-Stamped Events Yasuko Matsubara (Kyoto University), Yasushi Sakurai (NTT), Christos Faloutsos (CMU), Tomoharu.
Evaluation of Non-Uniqueness in Contaminant Source Characterization based on Sensors with Event Detection Methods Jitendra Kumar 1, E. M. Zechman 1, E.
Prognosis of gear health using stochastic dynamical models with online parameter estimation 10th International PhD Workshop on Systems and Control a Young.
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
1 LD-Sketch: A Distributed Sketching Design for Accurate and Scalable Anomaly Detection in Network Data Streams Qun Huang and Patrick P. C. Lee The Chinese.
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
Efficient computation of Robust Low-Rank Matrix Approximations in the Presence of Missing Data using the L 1 Norm Anders Eriksson and Anton van den Hengel.
Mingyang Zhu, Huaijiang Sun, Zhigang Deng Quaternion Space Sparse Decomposition for Motion Compression and Retrieval SCA 2012.
Gap-filling and Fault-detection for the life under your feet dataset.
Lei Li Computer Science Department Carnegie Mellon University Pre Proposal Time Series Learning completed work 11/27/2015.
1 Panther: Fast Top-K Similarity Search on Large Networks Jing Zhang 1, Jie Tang 1, Cong Ma 1, Hanghang Tong 2, Yu Jing 1, and Juanzi Li 1 1 Department.
A Trust Based Distributed Kalman Filtering Approach for Mode Estimation in Power Systems Tao Jiang, Ion Matei and John S. Baras Institute for Systems Research.
NCAF Manchester July 2000 Graham Hesketh Information Engineering Group Rolls-Royce Strategic Research Centre.
The Restricted Matched Filter for Distributed Detection Charles Sestok and Alan Oppenheim MIT DARPA SensIT PI Meeting Jan. 16, 2002.
ICDCS 2014 Madrid, Spain 30 June-3 July 2014
Local Linear Matrix Factorization for Document Modeling Institute of Computing Technology, Chinese Academy of Sciences Lu Bai,
Sparse Granger Causality Graphs for Human Action Classification Saehoon Yi and Vladimir Pavlovic Rutgers, The State University of New Jersey.
Streaming Pattern Discovery in Multiple Time-Series Jimeng Sun Spiros Papadimitrou Christos Faloutsos PARALLEL DATA LABORATORY Carnegie Mellon University.
U of Minnesota DIWANS'061 Energy-Aware Scheduling with Quality of Surveillance Guarantee in Wireless Sensor Networks Jaehoon Jeong, Sarah Sharafkandi and.
ICONIP 2010, Sydney, Australia 1 An Enhanced Semi-supervised Recommendation Model Based on Green’s Function Dingyan Wang and Irwin King Dept. of Computer.
Unsupervised Streaming Feature Selection in Social Media
An Energy-Efficient Approach for Real-Time Tracking of Moving Objects in Multi-Level Sensor Networks Vincent S. Tseng, Eric H. C. Lu, & Kawuu W. Lin Institute.
D YNA MM O : M INING AND S UMMARIZATION OF C OEVOLVING S EQUENCES WITH M ISSING V ALUES Lei Li joint work with Christos Faloutsos, James McCann, Nancy.
Facets: Fast Comprehensive Mining of Coevolving High-order Time Series Hanghang TongPing JiYongjie CaiWei FanQing He Joint Work by Presenter:Wei Fan.
Hao Ma, Dengyong Zhou, Chao Liu Microsoft Research Michael R. Lyu
Feature learning for multivariate time series classification Mustafa Gokce Baydogan * George Runger * Eugene Tuv † * Arizona State University † Intel Corporation.
1 / 24 Distributed Methods for High-dimensional and Large-scale Tensor Factorization Kijung Shin (Seoul National University) and U Kang (KAIST)
Carnegie Mellon School of Computer Science Forecasting with Cyber-physical Interactions in Data Centers Lei Li PDL Seminar 9/28/2011.
SketchVisor: Robust Network Measurement for Software Packet Processing
Forecasting with Cyber-physical Interactions in Data Centers (part 3)
State Space Representation
CARP: Context-Aware Reliability Prediction of Black-Box Web Services
Supervised Time Series Pattern Discovery through Local Importance
MATRIX FACTORIZATION TECHNIQUES FOR RECOMMENDER SYSTEMS
Predicting Long-Term Impact of CQA Posts: A Comprehensive Viewpoint
Pre Proposal Time Series Learning completed work
Kijung Shin1 Mohammad Hammoud1
Chao Zhang1, Yu Zheng2, Xiuli Ma3, Jiawei Han1
Latent Space Model for Road Networks to Predict Time-Varying Traffic
Probabilistic Models with Latent Variables
Recommending Mobile Apps - A Collaborative Filtering Viewpoint
State Space Analysis UNIT-V.
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
Asymmetric Transitivity Preserving Graph Embedding
GANG: Detecting Fraudulent Users in OSNs
Fusing Rating-based and Hitting-based Algorithms in Recommender Systems Xin Xin
Presentation transcript:

Arizona State University1 Fast Mining of a Network of Coevolving Time Series Wei FanHanghang TongPing JiYongjie Cai

Arizona State University Ubiquitous Coevolving Time Series 2 b) Chlorine concentration levels in water distribution network a) Temperature monitoring in a smart building Coevolving Time Series c) Marker tracking in motion capture d) Physiological signal in health care

Arizona State University Contextual Network  Embedded with contextual information (Networks) 3 (a) A Simplified Sensor Network (b) Measured Temperature Time Series. The time series are inter-connected with each other by its embedded network.

Arizona State University Contextual Network (cont.)  Appear in many applications, e.g., 4 (a) Water Quality Monitoring (b) Motion Capture(c) Epilepsy Signaling Contextual Network Coevolving Time Series

Arizona State University Problem Definition  NoT Missing Value Recovery Problem Given: NoT - a network of time series R = Recover: its missing parts indicated by the indicator W

Arizona State University Singular Value Decomposition (SVD) 6 Coevolving time seriesMatrix representation X t1t1 t2t2 t7t7 t17t … … … 11… … … … … … 21… … … … … … … … … … … … t8t8 t9t9 t 18 t 19 … … … TS 1 TS 2 TS 3 TS 4 TS 5 TS 6 TS 7 Morning rush hours Time Traffic Volume Afternoon rush hours

Arizona State University SVD (cont.)  Singular vectors for correlation detection 7 ≈ × × … … … … … … … … … … … … … … … … … … … … … … … … … … … MR: Morning rush hours AR: Afternoon rush hours AR MR + AR P1P1 P2P2 strength of P 1 strength of P 2 TS 1 TS 2 TS 3 TS 4 TS 5 TS 6 TS 7 Limitations: Contextual Information Temporal Smoothness P1P1 P2P2 U Σ Z X MR AR

Arizona State University Outline  Motivation  Dynamic Contextual Matrix Factorization  Experiments  Conclusion 8

Arizona State University Step 1. Encode Correlation Among Time Series 9 coevolving time seriestime series matrix X X ≈ U×Z coefficient matrix … … … … … … … … … … … … … … … … … … … … … ≈ TS 1 TS 2 TS 3 TS 4 TS 5 TS 6 TS 7 × … … … … … … P1P1 P2P2 TS 1 TS 2 TS 3 TS 4 TS 5 TS 6 TS 7 P1P1 P2P2 time series latent factor t7t7 t17t17 t8t8 t9t9 t 18 t 19 … … … indicator matrix Morning rush hours Time Traffic Volume Afternoon rush hours AR MR

Arizona State University Step 2. Encode Contextual Information 10 contextual informationcontextual matrix S S ≈ U×V ≈ × contextual latent factor TS 1 TS 2 TS 3 TS 4 TS 5 TS 6 TS 7 P1P1 P2P2 TS 1 TS 2 TS 3 TS 4 TS 5 TS 6 TS TS 1TS 2TS 3TS 4TS 5TS 6TS 7 P1P1 P2P TS 1TS 2TS 3TS 4TS 5TS 6TS 7 coefficient matrix

Arizona State University Step 3. Encode Temporal Smoothness … … Z:Z: AR MR

Arizona State University Put It All Together - DCMF #3 #2 #1 #2 ok #3 #1 12

Arizona State University Proposed Algorithm  Algorithm – Key idea: EM algorithm – Sketch: Group Z and V, group U to θ Alternatively update {Z,V} and θ(U) Forward and backward algorithms for Z 13  Properties – Converge to a local optimum – Time complexity: Linear in T #3#3 #3#3 S ≈U x V #2#2 #2#2 #1#1 #1#1 #2#2 #2#2 ok #3#3 #3#3 #1#1 #1#1

Arizona State University Relations with Existing Tools  Kalman filter and smoother  DynaMMo [Li+ KDD 2009]  SoRec [Ma+ CIKM 2008]  PMF [Salakhutdinov+ NIPS 2007]  SmoothSoRec, SmoothPMF 14 All special cases of our DCMF model

Arizona State University Outline  Motivation  Dynamic Contextual Matrix Factorization  Experiments  Conclusion 15

Arizona State University Experimental Evaluations  Parameter Sensitivity – how robust is our DCMF algorithm?  Effectiveness – how accurate is our DCMF algorithm in terms of recovering the missing values of the input time series?  Efficiency – how does our DCMF algorithm scale w.r.t. T ? 16

Arizona State University Parameter Sensitivity (a) Impact of l(b) Impact of λ 17  l: dimensionality of latent factors. – RMSE stabilizes after l reaches 15.  λ: weight to control the contribution of contextual network

Arizona State University Effectiveness Results Evaluation of missing value recovery. Lower is better 18 (a) Motes (n=54, T= 14400) (b) Chlorine (n=166, T= 4310) Ours

Arizona State University A Case Study – Running Instance (b) y-coordinate (c) z-coordinate 19 * (a) Marker positions* Partially missing: LWRB Closest in U: RWRB, RWRA, LWRA

Arizona State University Scalability  T: the length of time series 20

Arizona State University Conclusion  NoT: a network of coevolving time series  Main contributions – Model formulation multiple time series + contextual information +temporal smoothness – Algorithm EM algorithm Linear scalability in the length of time series – Empirical evaluation outperform all the existing competitors 21

Arizona State University Thank you! Yongjie Cai Source code: Q & A

Arizona State University Effectiveness Results (cont.)  Motion Capture Dataset 23

Arizona State University Social Recommendation  Social network - Correlations from social connections to personal behaviors  User-item rating matrix factorization  Social network matrix factorization 24 … … … … … … … … … … … … … … … … … … … … … ≈ … … … … … … × U Z ≈ × U V X S

Arizona State University  Find user latent factor shared by users’ social network S and the rating matrix X – Improve the recommendation accuracy Social Recommendation (cont.) 25 [Ma+ CIKM 2008] Lack of temporal smoothness