Sphinx Server Sphinx Client Data Warehouse Submitter Generic Grid Site Monitoring Service Resource Message Interface Current Sphinx Client/Server Multi-threaded.

Slides:



Advertisements
Similar presentations
Network II.5 simulator ..
Advertisements

Building Portals to access Grid Middleware National Technical University of Athens Konstantinos Dolkas, On behalf of Andreas Menychtas.
LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Service Oriented Architecture for Mobile Applications Swarupsingh Baran University of North Carolina Charlotte.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
CERN LCG Overview & Scaling challenges David Smith For LCG Deployment Group CERN HEPiX 2003, Vancouver.
A Computation Management Agent for Multi-Institutional Grids
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
Managing the Information Technology Resource Jerry N. Luftman
Sergey Belov, LIT JINR 15 September, NEC’2011, Varna, Bulgaria.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
1 Software Testing and Quality Assurance Lecture 40 – Software Quality Assurance.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Grid Monitoring By Zoran Obradovic CSE-510 October 2007.
Enabling Grids for E-sciencE Medical image processing web portal : Requirements analysis. An almost end user point of view … H. Benoit-Cattin,
Module 18 Monitoring SQL Server 2008 R2. Module Overview Monitoring Activity Capturing and Managing Performance Data Analyzing Collected Performance Data.
Operating Systems.  Operating System Support Operating System Support  OS As User/Computer Interface OS As User/Computer Interface  OS As Resource.
ATLAS DQ2 Deletion Service D.A. Oleynik, A.S. Petrosyan, V. Garonne, S. Campana (on behalf of the ATLAS Collaboration)
Grid Data Management A network of computers forming prototype grids currently operate across Britain and the rest of the world, working on the data challenges.
FESR Consorzio COMETA Grid Introduction and gLite Overview Corso di formazione sul Calcolo Parallelo ad Alte Prestazioni (edizione.
Module 7: Fundamentals of Administering Windows Server 2008.
Open Science Grid The OSG Accounting System: GRATIA by Philippe Canal (FNAL) & Matteo Melani (SLAC) Mumbai, India CHEP2006.
Main Sphinx Design Concepts There are two primary design components which comprise Sphinx The Database Warehouse The Control Process The Database Warehouse.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
SSS Test Results Scalability, Durability, Anomalies Todd Kordenbrock Technology Consultant Scalable Computing Division Sandia is a multiprogram.
Pegasus-a framework for planning for execution in grids Ewa Deelman USC Information Sciences Institute.
Grid Workload Management Massimo Sgaravatto INFN Padova.
The huge amount of resources available in the Grids, and the necessity to have the most up-to-date experimental software deployed in all the sites within.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Author - Title- Date - n° 1 Partner Logo WP5 Summary Paris John Gordon WP5 6th March 2002.
CPT Demo May Build on SC03 Demo and extend it. Phase 1: Doing Root Analysis and add BOSS, Rendezvous, and Pool RLS catalog to analysis workflow.
Module 9: Implementing Caching. Overview Caching Overview Configuring General Cache Properties Configuring Cache Rules Configuring Content Download Jobs.
Grid Scheduler: Plan & Schedule Adam Arbree Jang Uk In.
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
Evolution of a High Performance Computing and Monitoring system onto the GRID for High Energy Experiments T.L. Hsieh, S. Hou, P.K. Teng Academia Sinica,
Pegasus-a framework for planning for execution in grids Karan Vahi USC Information Sciences Institute May 5 th, 2004.
AliEn AliEn at OSC The ALICE distributed computing environment by Bjørn S. Nilsen The Ohio State University.
Tier 3 Status at Panjab V. Bhatnagar, S. Gautam India-CMS Meeting, July 20-21, 2007 BARC, Mumbai Centre of Advanced Study in Physics, Panjab University,
Korea Workshop May GAE CMS Analysis (Example) Michael Thomas (on behalf of the GAE group)
DIRAC Pilot Jobs A. Casajus, R. Graciani, A. Tsaregorodtsev for the LHCb DIRAC team Pilot Framework and the DIRAC WMS DIRAC Workload Management System.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
Grid Compute Resources and Job Management. 2 Grid middleware - “glues” all pieces together Offers services that couple users with remote resources through.
Daniele Spiga PerugiaCMS Italia 14 Feb ’07 Napoli1 CRAB status and next evolution Daniele Spiga University & INFN Perugia On behalf of CRAB Team.
Module 6: Administering Reporting Services. Overview Server Administration Performance and Reliability Monitoring Database Administration Security Administration.
Microsoft ® Official Course Module 6 Managing Software Distribution and Deployment by Using Packages and Programs.
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
+ Support multiple virtual environment for Grid computing Dr. Lizhe Wang.
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
DataTAG is a project funded by the European Union International School on Grid Computing, 23 Jul 2003 – n o 1 GridICE The eyes of the grid PART I. Introduction.
OGSA-DAI.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Databases and DBMSs Todd S. Bacastow January 2005.
OpenPBS – Distributed Workload Management System
U.S. ATLAS Grid Production Experience
Database System Concepts and Architecture
ALICE Monitoring
Workload Management System ( WMS )
Introduction to Grid Technology
A Messaging Infrastructure for WLCG
a VO-oriented perspective
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
Initial job submission and monitoring efforts with JClarens
Wide Area Workload Management Work Package DATAGRID project
Production Manager Tools (New Architecture)
Presentation transcript:

Sphinx Server Sphinx Client Data Warehouse Submitter Generic Grid Site Monitoring Service Resource Message Interface Current Sphinx Client/Server Multi-threaded Control Process Planner DAG Reducer Information Gatherer Message Interface Prediction Engine

Sphinx Server Sphinx Client Data Warehouse Submitter Generic Grid Site Monitoring Service Resource Message Interface Sphinx Client/Server with V2 Components Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Message Interface

Sphinx Server Data Warehouse Stand alone Sphinx Server with light Client Sphinx Client Generic Grid Site Monitoring Service Resource Message Interface Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface

Sphinx Server Data Warehouse Message Tables Policy Information Tables Entity Accounting Tables Work (DAG, Job, etc) Tables State Unreduced Unpredicted Unaccepted (not admitted) Unplanned (not yet feasible) Unsent (assigned) Unfinished (executing) Remove (accounted or rejected) Entity Resource Property Requirements Dependency Requirements Do not start before “event” I/O QoS Requests Resource Assignment for next k steps Application Tables (in general, distributed…) Transformation Catalogue Profiles (CPU, Disk, Bandwidth requirements) History (CPU, Disk, Bandwidth, actual use) Data Tables (in general, distributed…) Replica Catalogue (from RLS) Profiles (Size, Bandwidth requirements) History (Access rate, etc) Resource Property Tables CPU, Disk, Network Connection, middleware configuration, etc Grid Weather Tables CPU View Storage View Bandwidth View Data Warehouse Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface What about including Virtual Data Tables?

Message Interface Component Input (incoming) DAG Requirements QoS requests Status requests Output (incoming) DAG Table (“unreduced”) Job Table (“unreduced”) application Requirements QoS Input (outgoing) Message Output (outgoing) Status updates Sphinx Server Data Warehouse Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface

DAG Reducer (Decomposer?) Input DAG (“unreduced”) Available (existing) data Action Remove DAG nodes for which input data exists Add Jobs to Job table Output Pruned DAG (“unpredicted”) Jobs (“unpredicted”) Sphinx Server Data Warehouse Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface

Sphinx Server Prediction Engine Component Input: “unpredicted” Job Requirements (if available) CPU hours Storage hours Bandwidth Profile/History Output: Estimated Resource Usage “unaccepted” job Data Warehouse Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface

Sphinx Server Admission Control Component Input Entity Account Information “unaccepted” DAG Request Requirements QoS Predictions of Jobs (decomposed DAG) Resource Usages Policy Constraints Current schedule for next k steps Output Reject DAG Request “Remove” DAG “Remove” Jobs Accept DAG with Estimated QoS “Unplanned” DAG “Unplanned” Jobs Data Warehouse Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface

Sphinx Server Planning Component Job Planner (independent “work” that can be fully resolved onto resources at time t) Input “unplanned” Job Information Requirements Dependencies (I/O) Job QoS Requests Entity Account Information Policy Constraints Available Data Available Applications Available Resource Properties Prediction of Job Resource Usage Output QoS Deliverable “unsent” Job(s) schedule for the next k steps DAG Planner (intra-dependent “work” that can not all be fully resolved onto resources at time t) Input “unplanned” DAG Information Requirements DAG QoS Requests Current Job Schedule for next k steps Output QoS Deliverable Modified Job Schedule for next k steps Data Warehouse Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface

Sphinx Server Job Submission Component (also part of Sphinx Client) Input Fully planned Job: Job state “unsent” Assigned gatekeeper and jobmanager I/O data movement locations Action Construct Job DAG Set up environment Import input data (if necessary) Publish to DMC (if necessary) Run application Export output data (if necessary) Publish to DMC (if necessary) Cleanup environment Local I/O data (if necessary) Submit to Condor-G/DAGMan Job state “unfinished” Monitor Job statistics Output Exit status (if available) Final Job statistics (resource usage) CPU Storage Bandwidth Entity account update Job state “remove” Data Warehouse Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface

Sphinx Server Data Management Components “User Demanded Movement” Module Input Source(s) Destination(s) Output DMC update “Pre-Planned Movement” Module (most important to Sphinx) Input Available Data Available Resources Prediction of Data Access Patterns DAG (“reduced”) Jobs (“unsent”) Job(s) schedule for next k steps Output Data Movement DMC update Prediction Module Input: Historical Data Access Patterns Output : Future Data Access Patterns Data Warehouse Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface

Sphinx Server Information Gathering Module Resource Property Module Input Grid Information Systems (such as MDS using GLUE Schema, etc) Output Resource Property views Job Tracking Module Input Job Monitoring information (such as Condor-G, etc) Output Work Views Grid Weather Module Input Grid Monitoring systems (such as GEMS, MonALISA, etc) Output Grid Weather Views CPU Storage Bandwidth Grid Weather Prediction Module Input: Grid Weather History Output: Grid Weather Forcast for next k steps Data Warehouse Multi-threaded Control Process Planner DAG Reducer Information Gatherer Prediction Engine Data Management Admission Control Submitter Message Interface