CERN Data Analytics Use Cases

Slides:



Advertisements
Similar presentations
Tivoli Software from IBM Storage Resource Management Webcast
Advertisements

© 2013 IBM Corporation October 4, 2013 IT Analytics and Big Data IBM Solutions Paul Smith (Smitty) Service Management Architect.
Evaluation of NoSQL databases for DIRAC monitoring and beyond
Extensible Scalable Monitoring for Clusters of Computers Eric Anderson U.C. Berkeley Summer 1997 NOW Retreat.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Overview of Data Management solutions for the Control and Operation of the CERN Accelerators Database Futures Workshop, CERN June 2011 Zory Zaharieva,
CERN IT Department CH-1211 Genève 23 Switzerland t Integrating Lemon Monitoring and Alarming System with the new CERN Agile Infrastructure.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Openlab Workshop on Data Analytics 16 th of November 2012 Axel Voitier – CERN EN-ICE.
Data & Storage Services CERN IT Department CH-1211 Genève 23 Switzerland t DSS From data management to storage services to the next challenges.
Control System Data Analysis Future Vision Author: Axel Voitier CERN EN-ICE.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Data Mining By Dave Maung.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Overlook of Messaging.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
Transaction-based Grid Data Replication Using OGSA-DAI Presented by Yin Chen February 2007.
6 May 2014 CERN openlab IT Challenges workshop, Kacper Szkudlarek, CERN Manuel.
8-Dec-15T.Wildish / Princeton1 CMS analytics A proposal for a pilot project CMS Analytics.
CERN IT Department CH-1211 Geneva 23 Switzerland t CF Computing Facilities Agile Infrastructure Monitoring CERN IT/CF.
CERN openlab technical workshop
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
XROOTD AND FEDERATED STORAGE MONITORING CURRENT STATUS AND ISSUES A.Petrosyan, D.Oleynik, J.Andreeva Creating federated data stores for the LHC CC-IN2P3,
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Agile Infrastructure Monitoring HEPiX Spring th April.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
A Validation System for the Complex Event Processing Directives of the ATLAS Shifter Assistant Tool G. Anders (CERN), G. Avolio (CERN), A. Kazarov (PNPI),
CERN IT Department CH-1211 Genève 23 Switzerland t CERN Agile Infrastructure Monitoring Pedro Andrade CERN – IT/GT HEPiX Spring 2012.
Streaming Analytics with Spark 1 Magnoni Luca IT-CM-MM 09/02/16EBI - CERN meeting.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
Time Series Data Repository #ODSummit - The Generic, Extensible, and Elastic Data Repository in OpenDaylight for Advanced Analytics.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES The Common Solutions Strategy of the Experiment Support group.
European Organization For Nuclear Research CERN Accelerator Logging Service Overview Focus on Data Extraction for Offline Analysis Ronny Billen & Chris.
CERN openlab Machine Learning and Data Analytics workshop
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Our experience with NoSQL and MapReduce technologies Fabio Souto.
7/8/2016 OAF Jean-Jacques Gras Stephen Jackson Blazej Kolad 1.
IT Monitoring Service Status and Progress 1 Alberto AIMAR, IT-CM-MM.
Database 12.2 and Oracle Enterprise Manager 13c Liana LUPSA.
Data Analytics Challenges Some faults cannot be avoided Decrease the availability for running physics Preventive maintenance is not enough Does not take.
Pilot Kafka Service Manuel Martín Márquez. Pilot Kafka Service Manuel Martín Márquez.
Daniele Bonacorsi Andrea Sciabà
EView/390z Management for IBM Mainframe for HPE Operations Manager i (OMi) Extending the cross-platform capabilities of Hewlett Packard Enterprise Software.
Monitoring Evolution and IPv6
Backup and Recovery for Hadoop: Plans, Survey and User Inputs
Database Services Katarzyna Dziedziniewicz-Wojcik On behalf of IT-DB.
Discovering Computers 2010: Living in a Digital World Chapter 14
Report from WLCG Workshop 2017: WLCG Network Requirements GDB - CERN 12th of July 2017
OptiView™ XG Network Analysis Tablet
POW MND section.
HP BSM implementation summary
CERN Openlab: my transition from science to business
Time Series Data Repository
Joseph JaJa, Mike Smorul, and Sangchul Song
Data Analytics CERN openlab Open Day Manuel Martin Marquez.
Experiment Dashboard overviw of the applications
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
Dagmar Adamova (NPI AS CR Prague/Rez) and Maarten Litmaath (CERN)
A Messaging Infrastructure for WLCG
Remote Monitoring solution
Monitoring of the infrastructure from the VO perspective
به نام خدا Big Data and a New Look at Communication Networks Babak Khalaj Sharif University of Technology Department of Electrical Engineering.
Designed for Big Data Visual Analytics, Zoomdata Allows Business Users to Quickly Connect, Stream, and Visualize Data in the Microsoft Azure Platform MICROSOFT.
Logsign All-In-One Security Information and Event Management (SIEM) Solution Built on Azure Improves Security & Business Continuity MICROSOFT AZURE APP.
Technical Capabilities
Big DATA.
CRM DMP – a marriage of two acronyms
Presentation transcript:

CERN Data Analytics Use Cases Manuel Martin Marquez Stefano Alberto Russo

Manuel Martin Marquez – CERN openlab Today’s Objectives Overview - Data Analytics at CERN Use Cases Context Status Technologies applied Limitations Future Plans and Challenges Real Time Analytics Batch Analytics Repository AaaS - Analytics as a Service Manuel Martin Marquez – CERN openlab

Overview: Data Analytics at CERN Manuel Martin Marquez – CERN openlab

Overview: Data Analytics at CERN Huge interest and potential benefits for different domains at CERN IT – Information Technology BE – Beams EN – Engineering PH – Physics No one size fits all Real time, batch, data management. Different solutions, technologies, approaches Manuel Martin Marquez – CERN openlab

Use Cases: Accelerator Data Analysis Context Logging Service persist several million signals Core infrastructure: Electricity, Industrial Data: Cryogenics, Vacuum, Beam data: position, currents, losses, Critical Service About 800 extraction clients About 120 custom applications More than 5 million request per day Massive increase (2-3x) in 2014 Extensive use of Oracle Technologies Manuel Martin Marquez – CERN openlab

Use Cases: Accelerator Data Analysis Current Status: Data analysis focused on the service itself Improving reaction time and performance Basic streaming analysis: Complex Event Processing Based on system knowledge Basic in-database analysis built-in (PL/SQL) Embeded in the extraction tool Complex cases pushed to the users Matlab, Python, Labview, etc. Manuel Martin Marquez – CERN openlab

Use Cases: Accelerator Data Analysis Future Plans and Challenges: Extract more value from the data Focus on accelerator operations Make common data analysis use cases easier Save and share analysis results Simple and fast access to data Domain specific language Not replacing the existing infrastructure and tools Manuel Martin Marquez – CERN openlab

Use Cases: Industrial Control Systems Analytics Context: Support for Large industrial control systems Five major installations with million of parameters ALICE ATLAS CMS LHb Accelerator Complex Many Equiment groups Cryo Gas Vacuum Machine Protection Manuel Martin Marquez – CERN openlab

Use Cases: Industrial Control Systems Analytics Current Status: Monitoring and control problem solved Alerting and reporting system Manually configured Based on threshold Huge data volumen acquiere and stored OS logs, performances metrics, device status, Measurements, Alarms Not much efficiently exploited Manuel Martin Marquez – CERN openlab

Use Cases: Industrial Control Systems Analytics Current Status: P.O.C already started: Control System Health Alarm statistical Analysis Gas System Breakdown Evaluation of different technologies Drools WatchCAT Facing Problems: Data Access (Sensible and Protected) Integration of different data sources Common Data Analysis problems: classification, completeness, Manuel Martin Marquez – CERN openlab

Use Cases: Industrial Control Systems Analytics Future Plans and Challenges: AaaS: Common framework for all the subsystems Configurable analysis flow by user High scalability of analysis processes Near real time and batch analysis Stream based data processing engine: CEP, Storm NoSQL data storage engine Manuel Martin Marquez – CERN openlab

Use Cases: Atlas Distributed DM System Analytics Context: The Distributed Data Management Project (DDM) manage ATLAS data on the GRID 150 PB 1000 Active Users 500 million files Manuel Martin Marquez – CERN openlab

Use Cases: Atlas Distributed DM System Analytics Current Status: Use both SQL and NoSLQ NoSQL complemetary to RDBMS Different uses cases Popularity analysis Data Aggregation and statistical analysis Manuel Martin Marquez – CERN openlab

Use Cases: Atlas Distributed DM System Analytics Future Plans and Challenges: More complex use cases Trace Mining Analysis of the client interactions Replicas automatically managed Deletion Creation Forecasting Future dataset popularity Manuel Martin Marquez – CERN openlab

Use Cases: Intelligent data placement for CMS Context CMS Grid resources for storage and offline analysis Hundreds users Daily up to 500.000 jobs Data sample replicated 23 PB of data 18 PB transfer last year Manuel Martin Marquez – CERN openlab

Use Cases: Intelligent data placement for CMS Current Status: Current Data Management model Manpower intensive Inneficient disk usage Data Popularity Services Cleaning Agent Automatic Deletion of obsolete replicas Implented using Oracle DB Metrics (Number of accesses, users, sites, processing time, etc) Multiple aggregations (Jobs success/failure, set of files, etc) Manuel Martin Marquez – CERN openlab

Use Cases: Intelligent data placement for CMS Future Plans and Challenges: LHC Run 2 implies a 6x factor in computing resources Critical to optimize resources Jobs time in accessing analysing data Minimizing number of replicas Extract further knowledge from Monitoring data Classify analysis activities and predict resources Recommendation systems Learn from past trends and patters Manuel Martin Marquez – CERN openlab

Use Cases: Intelligent data placement for CMS Future Plans and Challenges: Near real time Based on knowledge extracted from the data (Bacth) CEP Batch analysis Different technologies R In-database analytics - Oracle R Entreprise Hadoop Elastic Search Manuel Martin Marquez – CERN openlab

Use Cases: Network Monitoring WLCG Context: WLCG relies heavily on the underlying networks Interconnect sites and resources PerfSONAR - Network Perfomance Measurement and Monitoring Manuel Martin Marquez – CERN openlab

Use Cases: Network Monitoring WLCG Current Status: PerfSONAR deployed on 70% of infrastructure A Lot of data but making sense out of it not trivial at all Measurements span different time periods They measure different things (while all related to network) They might be affected by other measurements and/or events Manuel Martin Marquez – CERN openlab

Use Cases: Network Monitoring WLCG Future Plan and Challenges: From Monitoring to Intelligent & Predicting Monitoring Time correlation During a PS throughput test, was there any known activity in the same link? There is packet loss, does this appears as degraded performance somewhere at the same time Loss of performance Is it a network problem and where? Is it a storage problem? Analyze the existing data, mine the information looking for known issues in the past Manuel Martin Marquez – CERN openlab

Use Cases: IT Monitoring and Analytics Context: Monitoring in IT covers a wide range of resources Hardware, OS, applications, files, jobs, etc. Many high level resources are interdependent Several application-specific monitoring solutions Similar needs and architecture Publish metric results, aggregate results, alarms, etc. Different technologies and tool-chains Some based on commercial solutions Similar limitations and problems Limited sharing of monitoring data Manuel Martin Marquez – CERN openlab

Use Cases: IT Monitoring and Analytics Current Status: Data Storage and Analysis Store all monitoring data in a common location Feed the system with processed data Use one single common data format (JSON) Permanent storage for historical data Data Visualization and Alarms Easy-to-use dashboards Efficient delivery of notifications Manuel Martin Marquez – CERN openlab

Use Cases: IT Monitoring and Analytics Future Plan and Challenges: Intelligent and Predictive Monitoring Real time analytics Dashboard and interactive analytics Batch analysis: Data mining – exploratory AaaS based on several technologies Hadoop Elastic Search Kibana Storm Manuel Martin Marquez – CERN openlab

Use Cases: Analytics in Castor Context: The CERN Advanced Storage Manager (CASTOR) is the mass storage solution of CERN, including LHC data hierarchical storage, both disks and tape 12K disks, 30K tapes, More than 100 PB of data lot of monitoring/log data up to 20 GB per day (~100M lines of log) totaling ~10 TB per year. stored in Hadoop/HBase and processed live for display in a cockpit Manuel Martin Marquez – CERN openlab

Use Cases: Analytics in Castor Current Status: Monitoring system in production for CASTOR and being extended to EOS. online (simple) analysis in the Cockpit (time series and histograms) offline analyses on the long-term storage based on Hadoop/HBase. auditing, error recovery historical studies (e.g. usage of protocols) Service availability being registered on the Service Level Status (SLS) board Manuel Martin Marquez – CERN openlab

Use Cases: Analytics in Castor Future Plan and Challenges: The current system is not covering two important topics: Expert system: spotting "interesting" time series out of a large monitoring data Early warning system: find predictive power to forecast potential dangerous situations Investigation of these topics started together with the DB group as part of the openlab data analytics activity finding the best models is challenging lot of data from different sources (avoiding time-consuming eye inspection) (e.g. overload conditions!) Data Analytics DB SLS Cockpit (standardization required!) Manuel Martin Marquez – CERN openlab

Analytics as a Service (AaaS) Real-time data analysis Complex Event Processing Pattern Recognition Streaming data analysis Batch data analytics Forecasting Modelling Knowledge discovery for later apply to real time Manuel Martin Marquez – CERN openlab

Analytics as a Service (AaaS) Data analytics repository Flexible data repository infrastructure Problem driven – no technology driven A combination of RDMS and noSQL Integrating existing data sources and systems Analytics framework (AaaS) Real-time analytics Batch Data analytics repo Data Analytics Visualization Manuel Martin Marquez – CERN openlab

Manuel Martin Marquez – CERN openlab Conclusions Huge interest and potential benefits for CERN IT, BE, PH, EN departments Improve our Monitoring and control systems by mean of Data Analytics Intelligent Proactive Predictive Manuel Martin Marquez – CERN openlab

Manuel Martin Marquez – CERN openlab Conclusions Challenges Real time analytics based on CERN use case Based on domain knowledge and hidden knowledge extracted by batch analytics CEP, Storm Batch analytics Correlation analysis Forecasting modeling Knowledge discovering Data analytics repository AaaS Manuel Martin Marquez – CERN openlab