AMI 30/11/2015S.A.1 AMI – Status April 2011. Solveig Albrand Jerome Fulachier Fabian Lambert.

Presentation transcript:

AMI – Status April 2011. Solveig Albrand, Jerome Fulachier, Fabian Lambert

Summary
Server problems.
ORACLE problems.
Security & Information Protection.
Developments.
– General
– Real Data
– MC
– Other applications
Plans.

In brief
Server problems. Some instability since the beginning of the year. See SIT Tag Collector talk for details. (extra slides)
Security & Information Protection. We are moving to VOMS for authentication (unless ATLAS management says "No"). Time scale to be fixed. No time to discuss here. See SIT Tag Collector talk for details.
ORACLE.
– "Back-up test": I dropped one of the config tag tables by accident; it was recovered.
– The underscore/case-insensitive sorting incompatibility bug manifested itself again in a new form following the latest ORACLE update, but once we spotted it we were able to get the behaviour we need. We used to get unpredictable results; now we get the opposite of what we expected. (see extra slides for more)

Dev – Dataset General
A general view of metadata has been started. A document is in preparation (with metadata coordination). It will lead to some actions, e.g. reworking the AMI dataset state engine and removing panic-inducing states when data is deleted.
Lost files: synchronized on DDM service. (see later)
Scalability of reading prodDB (Reminder: we read metadata XML for all finished jobs for all finished tasks.)
– Reading has been sequential so far. We knew it was not optimal, but it was not a problem up to now.
– We had a problem in February, so (at last) we are working on multi-threaded reading of finished tasks. Not a panacea, because the number of jobs in a task is not predictable, but ~50% improvement is anticipated.
– WARNING: the graph on the next page has an "advertiser's" X axis (number of AMI reads). It doesn't mean anything much. The AMI task runs 300 seconds after it last finished, so points are not evenly spaced in time in reality.
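The multi-threaded reading of finished tasks mentioned above could look roughly like the sketch below. It is only an illustration: `read_task_metadata` is a hypothetical stand-in for the real prodDB access (here it just counts jobs in an in-memory dictionary), and the worker count is an arbitrary assumption.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def read_task_metadata(task_id, jobs_by_task):
    """Hypothetical stand-in for reading the metadata XML of every
    finished job of one task; here it only counts the jobs."""
    jobs = jobs_by_task[task_id]
    return task_id, len(jobs)


def read_finished_tasks(jobs_by_task, max_workers=4):
    """Read all finished tasks concurrently.

    Returns a dict {task_id: number_of_jobs_read}. Tasks differ widely
    in size, so the speed-up is bounded by the largest task.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [
            pool.submit(read_task_metadata, task_id, jobs_by_task)
            for task_id in jobs_by_task
        ]
        for future in as_completed(futures):
            task_id, n_jobs = future.result()
            results[task_id] = n_jobs
    return results


# Illustrative data only: task and job names are invented.
finished = {"task_a": ["job1", "job2", "job3"], "task_b": ["job4"], "task_c": []}
summary = read_finished_tasks(finished)
```

One worker per task keeps the change from the sequential loop small, at the cost of uneven load when a single task holds most of the jobs.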

Scalability of reading FINISHED tasks from ProdDB
[Figure: AMI backlog (nTasks) versus number of AMI reads.]
20 days in February: 150 hours to catch up (AMI was down for maintenance for ~12 hours).

Real Data: Lost Luminosity Blocks.
– Lost files are marked once a week. (dq2 file consistency service)
– Lost files are marked in orange in the file list, and removed from the event and file count. The dataset status is changed.
– A comment is written to say when the file was lost.
– All files in data10 and mc10 and up have been marked with their input file(s). Information is obtained from prodDB.ejobdefbig.
– The file-to-file provenance is traced recursively to obtain the lumi blocks which were in the lost file, and the information is stored.
– The tracing is not 100% reliable, because of ejobdefbig problems (missing information; some surprises in the XML grammar, e.g. "inputESDFile=" but "inputTAGFile:"; badly formed XML) and the deleted files mechanism in AMI. (this can be fixed!)
– What do I do now? (need guidance from data prep and/or luminosity group) For example, we could trace all file lumi blocks for data11 reprocessing.
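The recursive file-to-file provenance trace described above can be sketched as follows. This is a minimal illustration under stated assumptions: the two dictionaries (`inputs_of`, `raw_lumi_blocks`) and all file names are invented for the example and do not reflect the AMI or prodDB schema.

```python
def lumi_blocks_for_file(name, inputs_of, raw_lumi_blocks, _seen=None):
    """Return the set of lumi blocks that contributed to file `name`.

    inputs_of:        {file: [input files]}  (hypothetical provenance map)
    raw_lumi_blocks:  {file: [lumi blocks]}  (known only for the first files
                      in the chain in this sketch)
    """
    if _seen is None:
        _seen = set()
    if name in _seen:          # guard against cycles and repeated ancestors
        return set()
    _seen.add(name)

    blocks = set(raw_lumi_blocks.get(name, ()))
    for parent in inputs_of.get(name, ()):
        blocks |= lumi_blocks_for_file(parent, inputs_of, raw_lumi_blocks, _seen)
    return blocks


# Illustrative data only.
inputs_of = {
    "AOD.1": ["ESD.1", "ESD.2"],
    "ESD.1": ["RAW.1"],
    "ESD.2": ["RAW.2", "RAW.3"],
}
raw_lumi_blocks = {"RAW.1": [10, 11], "RAW.2": [12], "RAW.3": [13]}

lost_from_aod = lumi_blocks_for_file("AOD.1", inputs_of, raw_lumi_blocks)
lost_from_esd2 = lumi_blocks_for_file("ESD.2", inputs_of, raw_lumi_blocks)
```

A file with no recorded inputs simply contributes nothing, which is one way the missing-information problem listed above would show up as an incomplete result.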


MC developments
DONE
– Transporting cross section values along the MC production chain (fewer clicks to get the values!). N.B. ~100 "physicsShorts" produce no cross section value.
– Reworking the "dataset numbers" broker, and extending it to hold production requests in the future.
– No longer reading the list of input parameters from Task Request (too many values are "NONE"). The reason is the hard-coded argument list for job transforms. We get values only from the metadata output of finished jobs, and the AMI tags.
NOT DONE
– Import of production requests from spreadsheet files (we know how to do it but the input is too messy).
– Pointers to job options files are broken (we lack a reliable way to do it).
MD workshop: "Meta-data interface looks a bit technical for the end user"

Other Developments
Data Periods:
– Collaboration with COMA (Elizabeth G.) and Data Preparation (Beate).
– Replaces text files.
– Data is in the COMA database.
– AMI "thinks" COMA is part of AMI.
– Data Prep writes, several apps read.
COMA Web interface and pyAMI web service.

AMI interface
[Screenshot annotations: runs loaded in COMA with selected project; links to COMA.]
See extra slides for more about COMA.

Next steps for Data periods
pyAMI commands for Data Period information (in beta testing):
– GetDataPeriodsForRun
– GetRunsForDataPeriod
– GetDataPeriodTree
– ListDataPeriods
Document it all for users! (we advocate a written Period nomenclature)
Extend to Physics Container creation.
Other extensions in discussion.
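To make the kind of lookup behind these commands concrete, here is a small in-memory sketch. It is not the pyAMI API: the function names, the two tables, and the run numbers are all hypothetical. Only the period names (VdM with members VdM1, VdM2, VdM3 in project data10_7TeV) are taken from the example report in the extra slides.

```python
# Hypothetical tables: leaf periods map to runs, group periods map to members.
PERIOD_RUNS = {
    ("data10_7TeV", "VdM1"): [1001, 1002],   # run numbers are invented
    ("data10_7TeV", "VdM2"): [1003],
    ("data10_7TeV", "VdM3"): [1004, 1005],
}
PERIOD_MEMBERS = {
    ("data10_7TeV", "VdM"): ["VdM1", "VdM2", "VdM3"],
}


def runs_for_data_period(project, period):
    """Runs of a period; a group period collects the runs of its members."""
    key = (project, period)
    if key in PERIOD_RUNS:
        return sorted(PERIOD_RUNS[key])
    runs = []
    for member in PERIOD_MEMBERS.get(key, []):
        runs.extend(runs_for_data_period(project, member))
    return sorted(runs)


def data_periods_for_run(project, run):
    """All periods (leaf and group) of a project that contain the run."""
    leaves = [
        period
        for (proj, period), runs in PERIOD_RUNS.items()
        if proj == project and run in runs
    ]
    groups = [
        group
        for (proj, group), members in PERIOD_MEMBERS.items()
        if proj == project and any(m in leaves for m in members)
    ]
    return sorted(leaves + groups)


vdm_runs = runs_for_data_period("data10_7TeV", "VdM")
periods_of_1003 = data_periods_for_run("data10_7TeV", 1003)
```

Keying every table on the (project, period) pair reflects the rule that a period is uniquely defined by the combination of project name and period name.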

Tracking of object sizes in reconstructed events.
A new application in AMI, in collaboration with SW dev. (Ilija Vukotic).
Currently in test on Tier 0. If it works well we will find a way to extend it to Grid tasks.
Has its own AMI/ORACLE resources.
Will lead to a new AMI graphics effort.

Other stuff.
Fruits of the ADC retreat in Napoli
– Can the "inputfile peeker" mechanism be replaced by consulting AMI?
– Can the configuration mechanism currently used by Tier 0 be extended to ProdDB tasks? See Rod Walker's talk yesterday.
DA user survey: the comments on AMI are interesting but not directly helpful to us (we already knew not everyone likes our web interface). It would be better to complain directly, or better still, help us design a new interface!
– "AMI web interface is awkward"
– "AMI is also a bad tool, the web page is slow, too complicated for what it should offer - help on the mailing list is often difficult to get"
We need a friendly user group to help with a complete redesign! (During shutdown?)


Dev – Partial "To Do Soon" list
Synchronizing with DQ2:
– The AMI client for the DQ2 STOMP ActiveMQ service has been working very well for several months.
– We would like to extend this service to add/remove primary datasets from dataset containers. This is URGENT.
File consistency. (not urgent since we already have something working)
Borut: 'No "automatic" way of marking datasets e.g. "September reprocessing"'. We have some ideas but don't see how it can be "automatic".
Armin has a procedure to inform TAGS, and he has proposed to inform AMI at the same time.

EXTRA SLIDES
– SLS + Load on AMI
– Information protection + security
– ORACLE & underscores
– COMA and Data periods

SLS for AMI
Degradation since January.
– We are not sure exactly why; it is not due to load. (see next two slides)
– We suspect that the connection between the APACHE cluster and the Tomcat servers breaks.
– The APACHE version changed in January.
– We have treated the problem empirically (stronger watchdog) and we are planning an upgrade of Tomcat.

[Figure: CCAMI02, number of commands per hour, 10 Feb to 28 Feb.]

[Figure annotation: nightlies restarted 01:00 28/2.]

From Alex Undrus:
No nightlies are launched between 11:00 and 13:00 and between 13:30 and 20:00. The period between 21:00 and 23:00 is very "hot" in the sense that the majority of nightly jobs are started during this period.

Security and Information Protection
Following a security audit of the AMI web site at CERN, we were asked to put access to the AMI replica behind SSO and to clean up some rather ugly responses to error conditions or attempts to inject JavaScript. This was done, but we had to take it away, as SSO:
– Does not allow pyAMI through.
– Does not protect any information from non-ATLAS members.
The main site at Lyon remains world readable, and we cannot use SSO at Lyon.
What we plan to do in the near future is to restrict world-readable rights to the top page, and to permit only members of ATLAS VOMS to read AMI catalogues. (Waiting for management to agree)
Everything is in place on the server side; some clients will need to adapt.

ORACLE behaviour
Query 1:
ALTER SESSION SET NLS_COMP=LINGUISTIC NLS_SORT=BINARY_CI;
SELECT count(LOGICALDATASETNAME) FROM DATASET WHERE LOGICALDATASETNAME LIKE '%data11_cos%';
Query 2:
ALTER SESSION SET NLS_COMP=LINGUISTIC NLS_SORT=BINARY_CI;
SELECT count(LOGICALDATASETNAME) FROM DATASET WHERE LOGICALDATASETNAME LIKE '%data11\_cos%' ESCAPE '\';
Which query treats "_" as a wild card?
Output 1: ALTER SESSION SET succeeded. COUNT(LOGICALDATASETNAME)
Output 2: ALTER SESSION SET succeeded. COUNT(LOGICALDATASETNAME)
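For reference, the conventional LIKE semantics that one would expect from the two queries can be modelled in a few lines of Python. This sketch does not reproduce ORACLE's behaviour under the NLS settings above (which, per the slides, was the opposite of what was expected). It only shows the textbook rule: an unescaped "_" matches any single character, an escaped one matches a literal underscore. The dataset names are invented, and case-insensitive matching is an assumption meant to mirror the intent of BINARY_CI.

```python
import re


def like_to_regex(pattern, escape=None):
    """Translate a SQL LIKE pattern into a compiled, case-insensitive regex."""
    parts = []
    i = 0
    while i < len(pattern):
        ch = pattern[i]
        if escape is not None and ch == escape and i + 1 < len(pattern):
            # Escaped character: always taken literally.
            parts.append(re.escape(pattern[i + 1]))
            i += 2
            continue
        if ch == "%":
            parts.append(".*")      # any run of characters
        elif ch == "_":
            parts.append(".")       # exactly one character
        else:
            parts.append(re.escape(ch))
        i += 1
    return re.compile("".join(parts), re.IGNORECASE | re.DOTALL)


def like(value, pattern, escape=None):
    """True if `value` matches the LIKE `pattern` over its whole length."""
    return like_to_regex(pattern, escape).fullmatch(value) is not None


# Illustrative names only.
names = ["data11_cos.00123.RAW", "data11Xcos.00123.RAW", "DATA11_COS.00456.ESD"]
unescaped = [n for n in names if like(n, "%data11_cos%")]
escaped = [n for n in names if like(n, "%data11\\_cos%", escape="\\")]
```

Under these textbook rules the unescaped pattern also picks up "data11Xcos...", while the escaped pattern keeps only names containing a real underscore.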

COMA – complete presentation by Elizabeth Gallas (link truncated in the transcript: ...contribId=13&sessionId=2&materialId=slides&confId=130606)

Introduction: ATLAS Data Periods
• A Data Period is a set of ATLAS Runs grouped for a purpose
– Defined by Data Preparation Coordinators
– Used in ATLAS data processing, assessment, and selection …
• Each Period is uniquely defined by a combination of
– Project name (e.g. ‘data10_7TeV’)
– Period name (e.g. ‘C1’, ‘C2’, ‘C’, ‘AllYear’ …)
• Before 2011, Data Periods were
– Described on a TWiki page
– Stored in a file-based system
– Edited by hand by Data Prep Coordination (experts)
• Structure evolved over the last year with experience
– This experience is valuable to decide/define a long-term solution
• New for 2011: Data Periods are stored in the COMA DB
– Thanks: Beate (DataPrep Coordinator), AMI team, DB experts.

Data Periods: Links to Reports and Services
The links/info below can be found on the revised TWiki page:
• Interactive USERS
– COMA Data Period Documentation Interface
– Comments:
• Programmatic USERS
– For systems needing period info (runQuery, beamspot, Data Quality, …), “Data Period Services” are provided via pyAMI
– Comments: AMI / Tag_Collector Team
• Data Preparation EXPERTS
– Entry Interface:
– Comments: AMI / Tag_Collector Team

Period Documentation Menu
Purpose: generate Period documentation for chosen input criteria. The report will include a description of all Periods:
• By Year, e.g. all ‘2010’
• By Project, e.g. ‘data10_7TeV’
• By specific Period or Group: click on the project and then your Period of interest
Wildcards can be entered in this optional section; then click on the Submit button.

Example Report: All 2010 Data Period Descriptions
Input criteria: shown in header.
-/+ highlighted links: these sections expand to show period members. Members of data10_7TeV.VdM are VdM1, VdM2, VdM3.
Links to COMA and runQuery multi-Run Reports for that Period.