Predictive Analytics derived from HVAC and PMU data at UCSD Chuck Wells Industry Principal OSIsoft, LLC 1.

Slides:



Advertisements
Similar presentations
DCSP-12 Jianfeng Feng
Advertisements

DCSP-13 Jianfeng Feng Department of Computer Science Warwick Univ., UK
3-D Computational Vision CSc Image Processing II - Fourier Transform.
UNLOCKING THE SECRETS HIDDEN IN YOUR DATA Part 3 Data Analysis.
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 4 – Digital Image Representation Klara Nahrstedt Spring 2009.
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
Aimé Lay-Ekuakille University of Salento. Index: 1.Problem statement 2.Main motivation 3.FDM-Filter Diagonalization Method (mono) 4.DSD-Decimate Signal.
ACHIZITIA IN TIMP REAL A SEMNALELOR. Three frames of a sampled time domain signal. The Fast Fourier Transform (FFT) is the heart of the real-time spectrum.
1 Chapter 16 Fourier Analysis with MATLAB Fourier analysis is the process of representing a function in terms of sinusoidal components. It is widely employed.
Intro to Spectral Analysis and Matlab. Time domain Seismogram - particle position over time Time Amplitude.
Intro to Spectral Analysis and Matlab Q: How Could you quantify how much lower the tone of a race car is after it passes you compared to as it is coming.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Spectrum Analyzer. Another Oscilloscope??? Like an oscilloscope Oscilloscope in time domain Spectrum analyzer in frequency domain (selectable)
School of Computing Science Simon Fraser University
1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.
Surface Variation and Mating Surface Rotational Error in Assemblies Taylor Anderson UGS June 15, 2001.
DEVON BRYANT CS 525 SEMESTER PROJECT Audio Signal MIDI Transcription.
Image Enhancement in the Frequency Domain Part I Image Enhancement in the Frequency Domain Part I Dr. Samir H. Abdul-Jauwad Electrical Engineering Department.
Chapter 12 Fourier Transforms of Discrete Signals.
Continuous Time Signals All signals in nature are in continuous time.
Discrete Time Periodic Signals A discrete time signal x[n] is periodic with period N if and only if for all n. Definition: Meaning: a periodic signal keeps.
Review of Probability.
Vibrationdata 1 Unit 5 The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
R ESEARCH BY E LAINE C HEW AND C HING -H UA C HUAN U NIVERSITY OF S OUTHERN C ALIFORNIA P RESENTATION BY S EAN S WEENEY D IGI P EN I NSTITUTE OF T ECHNOLOGY.
EVENT MANAGEMENT IN MULTIVARIATE STREAMING SENSOR DATA National and Kapodistrian University of Athens.
The Story of Wavelets.
Fourier series. The frequency domain It is sometimes preferable to work in the frequency domain rather than time –Some mathematical operations are easier.
Transforms. 5*sin (2  4t) Amplitude = 5 Frequency = 4 Hz seconds A sine wave.
Spectral Analysis AOE March 2011 Lowe 1. Announcements Lectures on both Monday, March 28 th, and Wednesday, March 30 th. – Fracture Testing –
Implementing a Speech Recognition System on a GPU using CUDA
Jacob Zurasky ECE5526 – Spring 2011
Copyright © 2003 OPNET Technologies, Inc. Confidential, not for distribution to third parties. Session 1341: Case Studies of Security Studies of Intrusion.
1 Prof. Nizamettin AYDIN Digital Signal Processing.
Vibrationdata 1 Unit 5 The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.
Why We Use Three Different, Equivalent Forms of the Fourier Series.
Digital Image Processing Chapter 4 Image Enhancement in the Frequency Domain Part I.
Lecture 3 The Digital Image – Part I - Single Channel Data 12 September
Communication Delays in Wide Area Measurement Systems (WAMS) Biju Naduvathuparambil, Matthew C. Valenti, and Ali Feliachi Lane Department of Comp. Sci.
Using Computer Assisted Audit Tools and Techniques (CAATTs)
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 6. Dimensionality Reduction.
Inverse DFT. Frequency to time domain Sometimes calculations are easier in the frequency domain then later convert the results back to the time domain.
GG313 Lecture 24 11/17/05 Power Spectrum, Phase Spectrum, and Aliasing.
Antenna and Microwave Laboratory Babol Noshirvani University of Technology Antenna and Microwave Laboratory Babol Noshirvani University.
SIGNALS AND SIGNAL SPACE
Using Big Data to Enhance Transmission System Planning and Operations Jun Wen Southern California Edison 1.
CS 376b Introduction to Computer Vision 03 / 17 / 2008 Instructor: Michael Eckmann.
By Dr. Rajeev Srivastava CSE, IIT(BHU)
Creation of a Vibration Monitoring System
Digital Image Processing Image Enhancement in Spatial Domain
CS654: Digital Image Analysis Lecture 11: Image Transforms.
Fourier Analyses Time series Sampling interval Total period Question: How perturbations with different frequencies contribute to the turbulent kinetic.
The Frequency Domain Digital Image Processing – Chapter 8.
Conjoint Analysis. 1. Managers frequently want to know what utility a particular product feature or service feature will have for a consumer. 2. Conjoint.
Data statistics and transformation revision Michael J. Watts
DSP First, 2/e Lecture 18 DFS: Discrete Fourier Series, and Windowing.
Topic: Waveforms in Noesis
CS 591 S1 – Computational Audio
Section II Digital Signal Processing ES & BM.
ARTIFICIAL NEURAL NETWORKS
Data Mining, Distributed Computing and Event Detection at BPA
Supervised Time Series Pattern Discovery through Local Importance
Baselining PMU Data to Find Patterns and Anomalies
Computer Vision Lecture 16: Texture II
Lecture 18 DFS: Discrete Fourier Series, and Windowing
Fourier Analyses Time series Sampling interval Total period
Lecture 2: Frequency & Time Domains presented by David Shires
Data Mining, Distributed Computing and Event Detection at BPA
Discrete Fourier Transform
Lec.6:Discrete Fourier Transform and Signal Spectrum
Presentation transcript:

Predictive Analytics derived from HVAC and PMU data at UCSD Chuck Wells Industry Principal OSIsoft, LLC 1

Outline What is big data Structured versus unstructured data sets Tools used for developing analytics Feature extraction methods Focus on PMU data analytics 2

Big Data Typically large volumes of structured or unstructured data. Example of the HVAC data set at UCSD – 2 years of one minute interval data from 106,000 sensors – Over 530 Gbytes of structured text – Each day is one file of 1440 rows, 106,000 columns 3

PMU data sets An Example: Planned Indonesia Wide Area Measurement System (WAMS) PMU based measurement system Twenty regions – Each with 40 PMUs Each PMU has 40 measurement – Sampling rate is 50 Hz – Each PMU generates 2000 samples per second 4

PMU data archives Raw data (1.416 Gbytes per day) Calculated angle differences (91.88 GB/d) Computed FFTs ( GB/day) ~7.2 PB/d 5

Moving Window FFTs Use FFT spectra to extract features from the data sets Typically use 1024 wide windows running at 50 or 60 Hz rate to compute spectrum 6

Examples 7

PMU Frequency Data

Sampling PMU Frequency Data and Fast Fourier Transformation (FFT) Transforming Frequency Data to FFT Data – 23 samples of Frequency Data was taken from the PMU at different times – The FFT was computed for each sample – Each FFT was standardized by setting the max value to 1 – The following slides are the standardized FFT for the various time samples

X-Axis = FrequencyY-Axis: Magnitude FFT at Various Time (1 of 4)

FFT at Various Time (2 of 4) X-Axis = FrequencyY-Axis: Magnitude

FFT at Various Time (3 of 4) X-Axis = FrequencyY-Axis: Magnitude

FFT at Various Time (4 of 4) X-Axis = FrequencyY-Axis: Magnitude

Time Series Representation and Similarity Measure Transforming FFT Data into FFT Bins – For each preceding sample, FFT Frequencies are discretized into 25 bins – For each bin the mean and the sum are calculated – Correlation matrix comparing the corresponding event and control frequency bins

FFT Correlation Matrix Control GroupEvent Group

Simple Anomaly Detection Benford’s Law – Also called the First-Digit Law, refers to the frequency distribution of digits in many (but not all) real-life sources of data. In this distribution, the number 1 occurs as the leading digit about 30% of the time, while larger numbers occur in that position less frequently: 9 as the first digit less than 5% of the time – Benford's Law also concerns the expected distribution for digits beyond the first, which approach a uniform distribution

Benford Distribution Between Compressed and Uncompressed Data

Benford Distribution Between Control and Event

Big Data from Microgrids Complexities introduced by the large amount of multivariate and heterogeneous data streaming from complex sensor networks Extremely large, complex sensor networks, enabling a novel feature reduction method that scales well

20

21