Anomaly Detection LCLS data: Introduction Research Conclusion

Slides:



Advertisements
Similar presentations
Inferences About Process Quality
Advertisements

Inference for One-Sample Means
INTRODUCTORY STATISTICS Chapter 2 DESCRIPTIVE STATISTICS PowerPoint Image Slideshow.
Author’s Name/s Goes Here, Author’s Name/s Goes Here, Author’s Name/s Goes Here Address/es Goes Here, Address/es Goes Here, Address/es Goes Here Acknowledgements.
Chapter 23 Inference for One- Sample Means. Steps for doing a confidence interval: 1)State the parameter 2)Conditions 1) The sample should be chosen randomly.
Author’s Name/s Goes Here, Author’s Name/s Goes Here, Author’s Name/s Goes Here Address/es Goes Here, Address/es Goes Here, Address/es Goes Here Acknowledgements.
The Single-Sample t Test Chapter 9. The t Distributions >Distributions of Means When the Parameters Are Not Known >Using t distributions Estimating a.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Point Estimates point estimate A point estimate is a single number determined from a sample that is used to estimate the corresponding population parameter.
Chapter 12 Introduction to Analysis of Variance
Chapter 9 Introduction to the t Statistic
The Electromagnetic Spectrum High Harmonic Generation
Mathematical Derivation of Probability
Jonathan Donkers1, Lin Zhang1+
Measuring the Thickness of Cryosheet Jets
Madhu Karra1, Jason Koglin1,2
Chapter 9 Roadmap Where are we going?.
Sentiment Analysis and Data visualization Techniques of Experiments
Complications and Troubleshooting
Image Processing For Soft X-Ray Self-Seeding
Structure and Function: Ubiquitination of NEMO and Optineurin
Characterizing Optics with White-Light Interferometry
Matter in Extreme Conditions at LCLS:
Kevin Shimbo1, Matt Hayes+
X-Ray Shielding Introduction Application Research Conclusions
Crystallography images
Hutch 4 (XCS) Laser System
MAKING WAY FOR LCLS II HERRIOTT CELL IGNIS THE NEED FOR LCLS-II
A remote EPICS monitoring tool
EASE: Alert Management
Designing and Prototyping Optics Table and Shelves
Managing Diffraction Beam
Online Conditional Outlier Detection in Nonstationary Time Series
Integration of Blu-Ice into
Effects of Wrapping Vacuum
Scintillation Material Encapsulation Fixture
LCLS CXI Sample Chambers and Engineering Solutions
Values to Normalize Data
Designing a molecular probe for detection
LAMP Vacuum System Documentation
G6PD Complex Structural Study Crystallization Setup
Your Name, Elias Catalano, Robert Sublett, Robin Curtis
Determining a Detector’s Saturation Threshold
HXRSD Helium Enclosure
Crystallography Labeling Using Machine Learning
IT Inventory and Optimization
Structural study of soluble ammonia monooxygenase
Creation and development of a PRP database
IT Inventory and Optimization
Electro Optic Sampling of Ultrashort Mid-IR/THz Pulses
Margaret Koulikova1, William Colocho+
Analytical Solutions for Heat Distribution in KB Mirror Systems
Lecture Slides Elementary Statistics Twelfth Edition
Repeatability Test of Attocube Piezo Stages
Analytical Solutions for Temperature
Barron Montgomery1,2, Christopher O’Grady2, Chuck Yoon2, Mark Hunter2+
Elizabeth Van Campen, Barrack Obama2, Stephen Curry1,2, Steve Kerr2+
Advanced Methods of Klystron Phasing
Ryan Dudschus, Alex Wallace, Zachary Lentz
Jennifer Hoover1, William Colocho2
X-Ray Spectrometry Using Cauchois Geometry For Temperature Diagnostics
Redesigning of The SED Calendars
ACKNOWLEDGEMENTS (optional section)
Author’s Name/s Goes Here1, Author’s Name/s Goes Here2
Automatic compression control for the LCLS injector laser
Russell Edmonds1, Matt Hayes2+ 1LCLS Summer Research Intern
Prototype and Design of High-Speed X-Ray Chopper
Title of Your Poster Introduction Research Conclusions Acknowledgments
Reconstruction algorithms Challenges & Future work
Accelerator Safety Workshop SLAC
Presentation transcript:

Anomaly Detection LCLS data: Introduction Research Conclusion Kassandra Munguia, Daniel Ratner, Danielle Sanzone, Nora Norwell Please upload your posters (PowerPoint format) on: https://portal.slac.stanford.edu/sites/conf_public/lcls_ssps/2016/ The posters will be printed for you on a 36 inch (width) paper which you can pick up before the poster session. You will be responsible for posting the poster on the wall. For Group A, the due date is Tuesday, August 9th 2016. For Group B, the due date is Tuesday, September 6th 2016. There will be prizes for best posters. 1Intern 2Linac Coherent Light Source, SLAC National Accelerator Laboratory, 2575 Sand Hill Road, Menlo Park, CA 94025, USA. +Contact: dratner@slac.stanford.edu Introduction Our group’s main goals were to detect anomalies for data analysis and potentially create a real-time alert system to notify or any potential problems.. We are trying to get a better look at anomaly detection at SLAC. Definitions: Anomaly- a point(s) that deviate from the standard Breakout- a change from one steady standard to another standard Twitter Twitter Fig 2. Example of a breakout Fig 1. Example of anomalies Research Seasonal Hybrid ESD Twitter modified the generalized ESD to account for seasonality. It uses time series decomposition and robust statistical metrics. Two methods of finding anomalies: AnomalyTS- takes in a 2 column data frame: time series + count AnomalyVec- takes in a single data frame of counts Function: AnomalyDetectionVec = function(x, max_anoms=0.10, direction='pos', alpha=0.05, period=NULL, only_last=F, threshold='None', e_value=F, longterm_period=NULL, plot=F, y_log=F, xlabel='', ylabel='count', title=NULL, verbose=FALSE) The algorithm produces a list of the timestamp/index where the anomaly is present and the numeric value of the anomaly. Along with it, there is a visual that graphs the data in context. Dump Vacuum Gage 365 pressure In order to understand the Twitter algorithm, we examined the Student's T-Distribution, Extreme Studentized Deviate (ESD), and the Generalized ESD. Student’s T-distribution Extreme Studentized Deviate(ESD) Generalized ESD Fig 4. Anomalies on plot is for the modulator high voltage for the of one of the RF stations It found 30 anomalies. Things that can cause these anomalies are pressure increasing if air gets in the pipe, valves are opening or closing, an RF changes power supply, or the beam hits the wall. Fig 1. Description of the figure above. Publication quality images must be at least 300 dpi. Vector image format, such as PNG or EPS, resizes better compared to JPEG. LCLS data: A t distribution uses the sample standard deviation instead of the population’s standard deviation. This equation is the equation of the colored lines you see in the graph. The black line is the normal distribution. As you increase the degrees of freedom, your t distribution will look more and more like normal distribution. The parameter that I changed to produce the following graphs is max_ anoms . By increasing the max_anoms, which indicates the max number of anomalies that S-H-ESD will detect as a percentage of the data, the algorithm was able to catch more anomalies and highlight them in the plot. 25-8 Beam volt Conclusion Anomaly detection is hard to generalize for all applications. We had trouble with implementing timestamps with our count and using the timestamp decomposition method. However, these results are promising because it is the basis for a real time notification system and for a key data analysis tool at SLAC. ESD is used to find outliers in univariate data assuming that the data is normally distributed. It detects one outlier at a time. Once an outlier is detected it is removed from the data. You compare the G value to the critical value and if the above conditional is true, then the point is an outlier. Fig 3. Anomalies on plot is for the modulator high voltage for the of one of the RF stations Acknowledgments It found 1644 anomalies. This detection is useful because if implemented for all RF stations, see which one trips more often in order to see which ones need more work. It is a modified ESD calculation. You find G and find the critical value and compare it to each other; however, the critical value and the mean updates as the data set changes each iteration. Use of the Linac Coherent Light Source (LCLS), SLAC National Accelerator Laboratory, is supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences under Contract No. DE-AC02-76SF00515. Date: 08/11/2016