Jacek Otwinowski (Data Preparation Group)

Slides:



Advertisements
Similar presentations
Instant JChem - current status and what's coming soon. Tim Dudgeon Solutions for Cheminformatics.
Advertisements

Information Retrieval in Practice
GLAST LAT ProjectOnline Peer Review – July 21, Integration and Test J. Panetta 1 Gamma-ray Large Area Space Telescope GLAST Large Area Telescope:
Overview of Search Engines
Passage Three Introduction to Microsoft SQL Server 2000.
Implementing search with free software An introduction to Solr By Mick England.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
LIGO-G E ITR 2003 DMT Sub-Project John G. Zweizig LIGO/Caltech Argonne, May 10, 2004.
Classroom User Training June 29, 2005 Presented by:
GRITS Working with AVM Data Astronomy Visualization Metadata June 11th, 2010 Casey Rosenthal
Computer and Automation Research Institute Hungarian Academy of Sciences Presentation and Analysis of Grid Performance Data Norbert Podhorszki and Peter.
LAB CVP 2009 ‘Leveraging the LIMS Investment’. Invested in a Laboratory Information Management System (LIMS) Solution is limited to Storing and Reporting.
LIGO-G9900XX-00-M ITR 2003 DMT Sub-Project John G. Zweizig LIGO/Caltech.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Infrastructure for QA and automatic trending F. Bellini, M. Germain ALICE Offline Week, 19 th November 2014.
CS5604: Final Presentation ProjOpenDSA: Log Support Victoria Suwardiman Anand Swaminathan Shiyi Wei Department of Computer Science, Virginia Tech December.
Summary of User Requirements for Calibration and Alignment Database Magali Gruwé CERN PH/AIP ALICE Offline Week Alignment and Calibration Workshop February.
The DCS Databases Peter Chochula. 31/05/2005Peter Chochula 2 Outline PVSS basics (boring topic but useful if one wants to understand the DCS data flow)
Analysis experience at GSIAF Marian Ivanov. HEP data analysis ● Typical HEP data analysis (physic analysis, calibration, alignment) and any statistical.
Validation db status and plans (what happened since the Collaboration meeting) Hans Wenzel 10th Physics Lists and Validation Tools working group meeting.
The ALICE data quality monitoring Barthélémy von Haller CERN PH/AID For the ALICE Collaboration.
uses of DB systems DB environment DB structure Codd’s rules current common RDBMs implementations.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
ALICE Full Dress Rehearsal ALICE TF Meeting 02/08/07.
Monthly video-conference, 18/12/2003 P.Hristov1 Preparation for physics data challenge'04 P.Hristov Alice monthly off-line video-conference December 18,
LHCb Computing 2015 Q3 Report Stefan Roiser LHCC Referees Meeting 1 December 2015.
Information Retrieval in Practice
QC-specific database(s) vs aggregated data database(s) Outline
Experiments Monitoring Plans and Progress
Monitoring Evolution and IPv6
F. Bellini for the DQM core DQM meeting, 04th October 2012
Database Replication and Monitoring
Hadoop and Analytics at CERN IT
CMS High Level Trigger Configuration Management
DPG Activities DPG Session, ALICE Monthly Mini Week
Complex Geometry Visualization TOol
ALICE analysis preservation
Collecting heterogeneous data into a central repository
Experience in CMS with Analytic Tools for Data Flow and HLT Monitoring
Calibrating ALICE.
CUAHSI HIS Sharing hydrologic data
Commissioning of the ALICE HLT, TPC and PHOS systems
Savannah to Jira Migration
Jacek Otwinowski (for the DPG QA tools and WP7 groups)
Post-relational databases What's wrong with web development?
Experience between AMORE/Offline and sub-systems
Data Preparation Group Summary of the Activities
Analysis framework - status
Data Quality Monitoring of the CMS Silicon Strip Tracker Detector
QA tools – introduction and summary of activities
Data Warehouse.
Monitoring of the infrastructure from the VO perspective
Populating a Data Warehouse
The Re3gistry software and the INSPIRE Registry
Populating a Data Warehouse
Renouncing Hotel’s Data Through Queries Using Hadoop
Get your ETL flow under statistical process control
Data Management Innovations 2017 High level overview of DB
End to End Monitoring Solution using Open Source Technology where webMethods 9.10 is used as ESB IBM Confidential.
DQM for the RPC subdetector
MIS2502: Data Analytics MySQL and MySQL Workbench
Reportnet 3.0 Database Feasibility Study – Approach
Planning next release of GAUDI
Indexing with ElasticSearch
BIOPAMA Data Management
Offline framework for conditions data
SDMX IT Tools SDMX Registry
Presentation transcript:

Jacek Otwinowski (Data Preparation Group) QA tools overview Jacek Otwinowski (Data Preparation Group) 2-Nov-16 Alice Offline Week

Lesson from CMS, ATLAS and LHCb Databases for QA information CMS (trending information) ATLAS (validation decisions) LHCb (trigger decisions) ROOT histograms and trees for storing QA information Custom tools for data analytics (regression tests, statistical tests, classifiers…) Custom tools for QA visualization 2-Nov-16 Alice Offline Week

ALICE QA information sources Software release validation Performance benchmarks on data and MC simulations Validation with reference data and MC Online DQM and HLT Offline QA: detector, trigger, calibration, tracking, PID and analysis OCDB and AODB Logbook, MonAlisa, RCT, … 2-Nov-16 Alice Offline Week

ALICE QA tools - status Different solutions for DQM, HLT and offline QA Several solutions for the offline QA Detector QA Tacking QA QA tools developments ROOT tree based DB (TPC QA generalization) Elasticsearch (nosql database) + Kibana (visualisation) Overwatch (EMCAL QA) RCT Goal: Develop common set of tools for online and offline QA 2-Nov-16 Alice Offline Week

ROOT tree based DB development (TPC QA generalization) Marian Ivanov, Hans Beck, Jens Wiechula, Julius Gronefeld, Peter Malzacher Input information stored in the ROOT files Relations (e.g. TPC QA vs. TRD QA) made with friend trees Inner joins (SQL) to combine columns from one or more tables in a rational database implemented Access to the external information provided (RCT, Logbook,…) https://alice.its.cern.ch/jira/browse/ATO-46 Standardized cross queries possible (e.g. TPC QA vs. RCT) Functionality to transform ROOT tree to other formats (html table, cvs, json, xml) + exporting metadata https://alice.its.cern.ch/jira/browse/ATO-373 Can be used to export to e.g. Elasticsearch Time series query support for tree using AliTreePlayer https://alice.its.cern.ch/jira/browse/ATO-382 Data selection (acceptance cuts) based on the custom parameterizations (AliNDLocalRegression) https://alice.its.cern.ch/jira/browse/PWGPP-163 TPC QA visualization https://aliqatpc.web.cern.ch/aliqatpc/data/2015/LHC15o/pass2_lowIR 2-Nov-16 Alice Offline Week

Elasticsearch and Kibana https://www.elastic.co/products/elasticsearch Full-text search engine with HTTP web interface and schema-free json documents It can serve as distributed NoSQL database Several techniques to manage relational data (e.g. parent/child) Used for Grid monitoring at CERN https://www.elastic.co/elasticon/conf/2016/sf/grid-monitoring-at-cern-with-the-elastic-stack Kibana Data visualization plugin for Elasticsearch https://www.elastic.co/products/kibana Limitation in visualization of relationships (e.g. parent/child) Kibi Plugins for Elasticsearch and Kibana (relationships visualization implemented but only in enterprise edition) https://siren.solutions/ 2-Nov-16 Alice Offline Week

Overwatch https://aliceoverwatch.physics.yale.edu/monitoring Online processing and interface for online detector monitoring and basic QA using data from the HLT (originally implemented for EMCAL) Overwatch is python and ROOT based Utilizes ZMQ for data transport Histogram visualization based on JSROOT Architecture similar to data processing at Run3 (parallel processing -> merging -> QA -> visualization) 2-Nov-16 Alice Offline Week

Quality Control in Run3 ROOT objects 2-Nov-16 Alice Offline Week

Run Control Table (QA feedback status) http://alimonitor.cern.ch/QA/ 2-Nov-16 Alice Offline Week

ALICE QA/QC tools requirements Data/MC QA monitoring (trending) Possibility to trigger alarms Database for QA information Relationships (joins) possible Extended data analytics possible regression tests, statistical tests, classifiers… User interface and data visualization Interactive QA data analysis 2-Nov-16 Alice Offline Week

ALCE QA/QC tools selection They should fulfill experiment and data processing requirements in Run2/Run3 ROOT objects for storing QA information primary source of information Study existing solutions (before implementing the customize solutions) e.g. JSROOT library to visualize ROOT objects (histograms, trees) using browsers e.g. Jupyter, SWAN notebooks for data analysis and visualization (https://swan.web.cern.ch/) e.g. Elasticsearch as QA database Selection should based on extensive tests Online QA Offline (user/expert) QA 2-Nov-16 Alice Offline Week

DPG QA tools group Weekly meetings: Wednesdays 2:30 PM Please join and participate in developments and tests Mailing list: alice-dpg-qa-tools Please subscribe and stay updated 2-Nov-16 Alice Offline Week