Detection, Recognition, and Localization of Multiple Cyber/Physical Attacks through Event Unmixing Wei Wang, Yang Song, Li He, Penn Markham, Hairong Qi,

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Nonnegative Matrix Factorization with Sparseness Constraints S. Race MA591R.
Marzieh Parandehgheibi
Object Specific Compressed Sensing by minimizing a weighted L2-norm A. Mahalanobis.
Local Discriminative Distance Metrics and Their Real World Applications Local Discriminative Distance Metrics and Their Real World Applications Yang Mu,
Slide 1 Bayesian Model Fusion: Large-Scale Performance Modeling of Analog and Mixed- Signal Circuits by Reusing Early-Stage Data Fa Wang*, Wangyang Zhang*,
Structured Sparse Principal Component Analysis Reading Group Presenter: Peng Zhang Cognitive Radio Institute Friday, October 01, 2010 Authors: Rodolphe.
Presenter: Raghu Ranganathan ECE / CMR Tennessee Technological University March 22th, 2011 Smart grid seminar series Yao Liu, Peng Ning, and Michael K.
Multi-layer Orthogonal Codebook for Image Classification Presented by Xia Li.
Fast Bayesian Matching Pursuit Presenter: Changchun Zhang ECE / CMR Tennessee Technological University November 12, 2010 Reading Group (Authors: Philip.
Patch-based Image Deconvolution via Joint Modeling of Sparse Priors Chao Jia and Brian L. Evans The University of Texas at Austin 12 Sep
An Introduction to Sparse Coding, Sparse Sensing, and Optimization Speaker: Wei-Lun Chao Date: Nov. 23, 2011 DISP Lab, Graduate Institute of Communication.
Learning With Dynamic Group Sparsity Junzhou Huang Xiaolei Huang Dimitris Metaxas Rutgers University Lehigh University Rutgers University.
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
1 Applications on Signal Recovering Miguel Argáez Carlos A. Quintero Computational Science Program El Paso, Texas, USA April 16, 2009.
Real-time Estimation of Accident Likelihood for Safety Enhancement Jun Oh, Ph.D., PE, PTOE Western Michigan University March 14, 2007.
A Versatile Depalletizer of Boxes Based on Range Imagery Dimitrios Katsoulas*, Lothar Bergen*, Lambis Tassakos** *University of Freiburg **Inos Automation-software.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Volkan Cevher, Marco F. Duarte, and Richard G. Baraniuk European Signal Processing Conference 2008.
Principal Component Analysis
Image Denoising via Learned Dictionaries and Sparse Representations
An Introduction to Kernel-Based Learning Algorithms K.-R. Muller, S. Mika, G. Ratsch, K. Tsuda and B. Scholkopf Presented by: Joanna Giforos CS8980: Topics.
7th IEEE Technical Exchange Meeting 2000 Hybrid Wavelet-SVD based Filtering of Noise in Harmonics By Prof. Maamar Bettayeb and Syed Faisal Ali Shah King.
Independent Component Analysis (ICA) and Factor Analysis (FA)
Basic Concepts and Definitions Vector and Function Space. A finite or an infinite dimensional linear vector/function space described with set of non-unique.
Improving the Accuracy of Continuous Aggregates & Mining Queries Under Load Shedding Yan-Nei Law* and Carlo Zaniolo Computer Science Dept. UCLA * Bioinformatics.
Radial Basis Function Networks
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
Applying the Distribution System in Grid Restoration/NERC CIP-014 Risk Assessment Srijib Mukherjee, Ph.D., P.E. UC Synergetic.
Synchrophasor: Implementation,Testing & Operational Experience
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Wavelets and Denoising Jun Ge and Gagan Mirchandani Electrical and Computer Engineering Department The University of Vermont October 10, 2003 Research.
Image Classification 영상분류
Statistical Sampling-Based Parametric Analysis of Power Grids Dr. Peng Li Presented by Xueqian Zhao EE5970 Seminar.
Algorithms for Wireless Sensor Networks Marcela Boboila, George Iordache Computer Science Department Stony Brook University.
Progress in identification of damping: Energy-based method with incomplete and noisy data Marco Prandina University of Liverpool.
Mingyang Zhu, Huaijiang Sun, Zhigang Deng Quaternion Space Sparse Decomposition for Motion Compression and Retrieval SCA 2012.
Solution of the Inverse Problem for Gravitational Wave Bursts Massimo Tinto JPL/CIT LIGO Seminar, October 12, 2004 Y. Gursel & M. Tinto, Phys. Rev. D,
©2009 Mladen Kezunovic. Improving Relay Performance By Off-line and On-line Evaluation Mladen Kezunovic Jinfeng Ren, Chengzong Pang Texas A&M University,
Sparse Signals Reconstruction Via Adaptive Iterative Greedy Algorithm Ahmed Aziz, Ahmed Salim, Walid Osamy Presenter : 張庭豪 International Journal of Computer.
Endmember Extraction from Highly Mixed Data Using MVC-NMF Lidan Miao AICIP Group Meeting Apr. 6, 2006 Lidan Miao AICIP Group Meeting Apr. 6, 2006.
Lecture 22 Unbalanced Faults and Power System Protection Professor Tom Overbye Department of Electrical and Computer Engineering ECE 476 POWER SYSTEM ANALYSIS.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Review of Spectral Unmixing for Hyperspectral Imagery Lidan Miao Sept. 29, 2005.
Locally Optimized Precipitation Detection over Land Grant Petty Atmospheric and Oceanic Sciences University of Wisconsin - Madison.
Large-Scale Matrix Factorization with Missing Data under Additional Constraints Kaushik Mitra University of Maryland, College Park, MD Sameer Sheoreyy.
1 © A. Kwasinski, 2015 Cyber Physical Power Systems Fall 2015 Security.
Mathematical Analysis of MaxEnt for Mixed Pixel Decomposition
Irfan Ullah Department of Information and Communication Engineering Myongji university, Yongin, South Korea Copyright © solarlits.com.
A Kernel Approach for Learning From Almost Orthogonal Pattern * CIS 525 Class Presentation Professor: Slobodan Vucetic Presenter: Yilian Qin * B. Scholkopf.
Network Anomography Yin Zhang Joint work with Zihui Ge, Albert Greenberg, Matthew Roughan Internet Measurement.
Comparative Analysis of Spectral Unmixing Algorithms Lidan Miao Nov. 10, 2005.
Dynamic Background Learning through Deep Auto-encoder Networks Pei Xu 1, Mao Ye 1, Xue Li 2, Qihe Liu 1, Yi Yang 2 and Jian Ding 3 1.University of Electronic.
Non-parametric Methods for Clustering Continuous and Categorical Data Steven X. Wang Dept. of Math. and Stat. York University May 13, 2010.
Biao Wang 1, Ge Chen 1, Luoyi Fu 1, Li Song 1, Xinbing Wang 1, Xue Liu 2 1 Shanghai Jiao Tong University 2 McGill University
Jianchao Yang, John Wright, Thomas Huang, Yi Ma CVPR 2008 Image Super-Resolution as Sparse Representation of Raw Image Patches.
1 Chapter 5 Branch-and-bound Framework and Its Applications.
Minimum spanning tree diameter estimation in random sensor networks in fractal dimension Students: Arthur Romm Daniel Kozlov Supervisor: Dr.Zvi Lotker.
PIs: Hairong Qi, Qing Cao, Yilu Liu, and Leon Tolbert
Injong Rhee ICMCS’98 Presented by Wenyu Ren
T. Chernyakova, A. Aberdam, E. Bar-Ilan, Y. C. Eldar
Hairong Qi, Gonzalez Family Professor
Sparselet Models for Efficient Multiclass Object Detection
Optimal sparse representations in general overcomplete bases
Wei Wang, Yang Song, Li He, Penn Markham, Hairong Qi, Yilu Liu
M. Kezunovic (P.I.) S. S. Luo D. Ristanovic Texas A&M University
Overview: Chapter 2 Localization and Tracking
Presentation transcript:

Detection, Recognition, and Localization of Multiple Cyber/Physical Attacks through Event Unmixing Wei Wang, Yang Song, Li He, Penn Markham, Hairong Qi, Yilu Liu Electrical Engineering and Computer Science Department University of Tennessee Contact: Hairong Qi, 1-1

Motivation and Challenge Objective  To detect, recognize, and localize (both temporally and spatially) attacks from multiple sources using data collected from the ultra-wide-area monitoring network (e.g., FNET) Motivation  Conventional power systems are designed to be robust to accidental failures (e.g., N-1, N-2, or even N-3 contingencies). Nevertheless, under the post 9/11 environment, simultaneous coordinated strikes become a realistic threat, which will lead to N-X operations under emergency.  Researchers at the Trustworthy Cyber Infrastructure for the Power Grid (TCIP) Cyber Trust Center also reported [1] that through the usage of a commercially available Power simulator and publicly available power flow data, a small set of breakers was found whose tripping would lead to a blackout almost the scale of the August 2003 blackout. This will put the interconnected power network in a greater danger that the original power system planner had never envisioned. 1-2

Background: Mixture and Unmixing Target detection at the subpixel level in remote sensing Speaker identification - The cocktail party problem Image restoration Heat source analysis from surface temperature evolution pattern in bulk metallic glass (BMG) Hidden weapon detection using terahertz images 1-3 Event Unmixing

Rationale – Event Unmixing Events seldom occur in an isolated fashion. Cascading events are more common and realistic which create multiple disturbances. The electromechanical waves generated from multiple disturbances will interfere with each other and the measurements taken at an FDR would more than likely be a mixture. Linear mixture analysis has been widely used due to its effectiveness and simplicity, where the sensor readout at a single location is given by x=As+n  x: an l-element column vector, the measured mixture or observation  A: an lxc source matrix with each column indicating a root event signature  s: a cx1 column vector or abundance vector, indicating the mixing coefficients satisfying certain constraints  n: the noise vector If given A, i.e., the signature matrix, s is traditionally estimated using methods such as Unsupervised Fully Constrained Least Squares (UFCLS) or Nonnegatively Constrained Least Squares (NCLS). Event detection can be conducted by identifying the event signature with a non-zero (or comparatively larger) corresponding abundance. The problems in traditional abundance estimation methods include  The estimated abundance may have values on each signature – not suitable for rare event detection  Very computationally intensive 1-4

Initial Trial x=As+n Unsupervised unmixing using minimum volume constraints, J(A) Failed! What are the challenges?  The construction of signature matrix, A  The dynamics of cascading events What is a good constraint?  The sparsity constraint  Signature training and learning 1-5

Root Event Signatures Generator trip (gt) Line trip (lt) Load drop (ld) Oscillation 1-6

Algorithm - Sparsity-constrained Unmixing x=As+n Abundance estimation via sparse coding  The sparse coding formulation (an NP-hard problem): minimize the number of non-zero elements in s while s is subject to the least-square constraint  If s is sufficiently sparse, we can solve for s by instead minimizing the l 1 -norm  “Feature sign search” is used to solve the optimization problem 1-7

Sparsity-constrained Unmixing – Dictionary Construction Signature dictionary learning – Design an overcomplete dictionary that incorporates temporal information  Training root event signature (done offline)  Generator trip (547 from EI, 415 from WECC, 189 from ERCOT) and load shedding (160 from EI, 346 from WECC) data are retrieved from the FNET database  Since FNET doesn’t detect line trips yet, we use PSS/E to generate signatures for line trips. A 16,000-bus model of the EI was used for simulation. Approximately 75 buses corresponding to actual FDRs were selected as the measurement points, and lines adjacent to these buses were tripped one at a time (257 training cases)  K-mean clustering is used to extract 6 (i.e., k=6) representing root event signatures for each event from all the above training cases 1-8

Signatures Learned 1-9

Dictionary Construction – Cont’  Construction of overcomplete dictionary - Temporal span of root event signature (done online)  For each root event signature learned above (6 for each of the three events), time-shift the signature by 0.1t seconds, t =1,...,200, to the right to generate all possible occurrence time of that event. Note that the interval 0.1 second can be changed with higher resolution, e.g., 0.01 second, which means the algorithm can resolve multiple events occurred at a finer scale. 1-10

Results – Simulated Events The model: A 23-bus model supplied from PSS/E is used. The model represents a small power system with 3,200 MW of load. The system contains several different voltage levels ranging from 13.8 kV at the generator buses to 500 kV at the transmission buses. It represents a variety of generation sources including nuclear, thermal, and hydro. Root event signatures: since this is a small power grid, we simply extracted 5 generator trip signatures (gt), 3 line trip generators (lt), and 5 load drop signatures (ld) to form the root event signature matrix Both single event and multiple event detections are accurate in terms of both detection and temporal localization Single event detection Multiple event detection 1-11 Two cascading events of gt101 (1 st sec) and lt (15 th sec) Two cascading events of gt101 (1 st sec) and gt3011 (15 th sec)Single event of gt101 (1 st sec) Single event of lt (1 st sec) Left: original signal vs. reconstructed signal. Mid: sparse coefficient or abundance. Right: event type detection

Simulation – Single Event 1-12 Single event of gt101 (1 st sec) Single event of lt (1 st sec)

Simulation – Multiple Events 1-13 Two cascading events of gt101 (1 st sec) and lt (15 th sec) Two cascading events of gt101 (1 st sec) and gt3011 (15 th sec)

Results – Real Case (Single Event) 1-14 Single Event Detection and Temporal Localization - One single generator trips were successfully detected from 10 out of 10 FDRs. - Each FDR detected the events with different time delay which can be further utilized for event localization purpose. Event detection on FDR 2: one event is detected and temporally localized as the occurring time of the largest coefficient. Plots of 10 raw FDR signals without denoising. Temporally localized on different FDRs! Why different occurring time? The FDRs will receive event wave at different time. The delays are very important for spatial localization!

Results – Real Case (Multiple Events) 1-15 Multiple Event Detection and Temporal Localization - Two generator trips (event3 and event4) were successfully detected from 16 out of 18 FDRs and two line trips were successfully detected from 17 out of 18 FDRs. - Each FDR detected the events with different time delay which can be further utilized for event spatial localization purpose. Event detection on FDR 14 Plots of 18 raw FDR signals without denoising. (Denoising is necessary before performing event detection algorithm!) Each individual event is temporally localized on different FDRs!

Event Spatial Localization 1-16 Traditional Localization Method - Wave-front Arrival Time (detected based on an empirical threshold!) - Geographic and Geometrical Triangulation [3]. Assumption: the time delay is linearly related to the distance between the FDR location and the event location. Wave-front Arrival Time! [3] [3] T. Xia, H. Zhang, R. Gardner, J. Bank, J. Dong, J. Zuo, Y. Liu, L. Beard, P. Hirsch, G. Zhang, and R. Dong, “Wide-area frequency based event location estimation,” in Power Engineering Society General Meeting, IEEE, 2007, pp. 1–7.

Event Spatial Localization 1-17 Limitations of Traditional Localization Method * Only can handle single event, cannot discriminate multiple events involved cascading event! * The wave-front arrival time is not accurate enough for spatial localization! Advantages of the Proposed Event Unmixing Algorithm * Can unmix each individual single event from a mixture signal that multiple events cascadingly involved! * The detected occurring time of each individual event should be more stable! (More robust to noise and more accurate) Triangulation * Use the same triangulation algorithm but with good spatial localization performance.

Event Spatial Localization – Single Event Localization error: 60 miles

Event Spatial Localization 1-19 Example of a Multiple-Event Individual Event Separation Apply Wave-front arrival detection on each individual event

Event Spatial Localization – Multiple Event 1-20 Localization error: GT (105 mi), LT (122 mi)

Challenge of LTB 1-21 Similarity: Different disturbances may cause the similar reaction on certain buses Ground truth: a line trip between bus Ground truth: a load drop on bus 3 Ground truth: a generator trip on bus 92

New idea Signal on each bus, NPCC grid with 140 buses Trip a generator on bus 21 Signal on each bus, NPCC grid with 140 buses Cluster Selection/ /Mean

New idea 1-23 D D Basic idea: Unmixing/sparse coding is based on a group signals instead of a single signal.

Experiment results 1-24 Comparison: previous strategy and new strategy ( different signal extraction methods and different spares coefficient analysis method)

Next Step … Further improve signal quality or frequency estimation accuracy Signature dictionary learning  Traditional parametric models using a fixed and finite number of parameters, e.g., k-means, can suffer from over- or under-fitting of data when there is a misfit between the complexity of the model.  The Bayesian nonparametric approach is an alternative to parametric modeling and selection. By using a model with an unbounded complexity, underfitting is mitigated, while the Bayesian approach of computing or approximating the full posterior over parameters mitigates overfitting.  The Dirichlet Process (DP), one of the most popular Bayesian nonparametric models, will be used for the learning of representative root event signatures  Not much improvement 1-25