Millions of Jobs or a few good solutions …. David Abramson Monash University MeSsAGE Lab X.

Slides:



Advertisements
Similar presentations
Nimrod/G GRID Resource Broker and Computational Economy
Advertisements

Computational Grids and Computational Economy: Nimrod/G Approach David Abramson Rajkumar Buyya Jonathan Giddy.
Nimrod/K: Towards Massively Parallel Dynamic Grid Workflows David Abramson, Colin Enticott, Monash Ilkay Altinas, UCSD.
National Institute of Advanced Industrial Science and Technology Ninf-G - Core GridRPC Infrastructure Software OGF19 Yoshio Tanaka (AIST) On behalf.
Höchstleistungsrechenzentrum Stuttgart SEGL Parameter Study Slide 1 Science Experimental Grid Laboratory (SEGL) Dynamical Parameter Study in Distributed.
Defining Decision Support System
School of Computing FACULTY OF ENGINEERING Grids and QoS Grid Computing has emerged in the last two decades, initially as a model for large-scale, resource-intensive.
Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
P. Kacsuk, G. Sipos, A. Toth, Z. Farkas, G. Kecskemeti and G. Hermann P. Kacsuk, G. Sipos, A. Toth, Z. Farkas, G. Kecskemeti and G. Hermann MTA SZTAKI.
A Dynamic World, what can Grids do for Multi-Core computing? Daniel Goodman, Anne Trefethen and Douglas Creager
1 Project Overview EconomyGrid Economic Paradigm For “Resource Management and Scheduling” for Service-Oriented Grid Computing Presenter Name: Sama GovindaRamanujam.
SALSA HPC Group School of Informatics and Computing Indiana University.
Master/Slave Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.
GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
Clouds from FutureGrid’s Perspective April Geoffrey Fox Director, Digital Science Center, Pervasive.
Resource Management of Grid Computing
Institute for Software Science – University of ViennaP.Brezany 1 Databases and the Grid Peter Brezany Institute für Scientific Computing University of.
Universität Dortmund Robotics Research Institute Information Technology Section Grid Metaschedulers An Overview and Up-to-date Solutions Christian.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Parallelization and Grid Computing Thilo Kielmann Bioinformatics Data Analysis and Tools June 8th, 2006.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
Panel Summary Andrew Hanushevsky Stanford Linear Accelerator Center Stanford University XLDB 23-October-07.
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
U.S. Department of the Interior U.S. Geological Survey David V. Hill, Information Dynamics, Contractor to USGS/EROS 12/08/2011 Satellite Image Processing.
SIDDHARTH MEHTA PURSUING MASTERS IN COMPUTER SCIENCE (FALL 2008) INTERESTS: SYSTEMS, WEB.
A.V. Bogdanov Private cloud vs personal supercomputer.
STRATEGIES INVOLVED IN REMOTE COMPUTATION
MobSched: An Optimizable Scheduler for Mobile Cloud Computing S. SindiaS. GaoB. Black A.LimV. D. AgrawalP. Agrawal Auburn University, Auburn, AL 45 th.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
Building a Real Workflow Thursday morning, 9:00 am Lauren Michael Research Computing Facilitator University of Wisconsin - Madison.
Using the WS-PGRADE Portal in the ProSim Project Protein Molecule Simulation on the Grid Tamas Kiss, Gabor Testyanszky, Noam.
Nimrod/G GRID Resource Broker and Computational Economy David Abramson, Rajkumar Buyya, Jon Giddy School of Computer Science and Software Engineering Monash.
Python: An Introduction
1.8History of Java Java –Based on C and C++ –Originally developed in early 1991 for intelligent consumer electronic devices Market did not develop, project.
Nimrod & NetSolve Sathish Vadhiyar. Nimrod Sources/Credits: Nimrod web site & papers.
Protein Molecule Simulation on the Grid G-USE in ProSim Project Tamas Kiss Joint EGGE and EDGeS Summer School.
Building Scalable Scientific Applications using Makeflow Dinesh Rajan and Douglas Thain University of Notre Dame.
SALSA HPC Group School of Informatics and Computing Indiana University.
1 Sergio Maffioletti Grid Computing Competence Center GC3 University of Zurich Swiss Grid School 2012 Develop High Throughput.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
DAME: A Distributed Diagnostics Environment for Maintenance Duncan Russell University of Leeds.
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
Using SWARM service to run a Grid based EST Sequence Assembly Karthik Narayan Primary Advisor : Dr. Geoffrey Fox 1.
Building a Real Workflow Thursday morning, 9:00 am Lauren Michael Research Computing Facilitator University of Wisconsin - Madison.
George Goulas, Christos Gogos, Panayiotis Alefragis, Efthymios Housos Computer Systems Laboratory, Electrical & Computer Engineering Dept., University.
WP1 : Applications Océan / atmosphère Cosmologie Site d'expertise algèbre linéaire creuse TLSE.
GCE Shell? GGF6 Chicago October Geoffrey Fox Marlon Pierce Indiana University
Recording Actor Provenance in Scientific Workflows Ian Wootten, Shrija Rajbhandari, Omer Rana Cardiff University, UK.
Economic and On Demand Brain Activity Analysis on Global Grids A case study.
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
OPTIMIZATION OF DIESEL INJECTION USING GRID COMPUTING Miguel Caballer Universidad Politécnica de Valencia.
OGCE Workflow and LEAD Overview Suresh Marru, Marlon Pierce September 2009.
Grid-enabled Probabilistic Model Checking with PRISM Yi Zhang, David Parker, Marta Kwiatkowska University of Birmingham.
INFSO-RI JRA2 Test Management Tools Eva Takacs (4D SOFT) ETICS 2 Final Review Brussels - 11 May 2010.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Holding slide prior to starting show. Scheduling Parametric Jobs on the Grid Jonathan Giddy
The Anatomy of a Large-Scale Hypertextual Web Search Engine S. Brin and L. Page, Computer Networks and ISDN Systems, Vol. 30, No. 1-7, pages , April.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
Implementation of Classifier Tool in Twister Magesh khanna Vadivelu Shivaraman Janakiraman.
Lecture 3 – MapReduce: Implementation CSE 490h – Introduction to Distributed Computing, Spring 2009 Except as otherwise noted, the content of this presentation.
Design rationale and status of the org.glite.overlay component
Auburn University COMP7330/7336 Advanced Parallel and Distributed Computing MapReduce - Introduction Dr. Xiao Qin Auburn.
Class project by Piyush Ranjan Satapathy & Van Lepham
Chris Gill CSE 522S – Advanced Operating Systems
Cardiac Modeling of Steady-State Transients in the Ventricular Myocyte
MapReduce: Simplified Data Processing on Large Clusters
Presentation transcript:

Millions of Jobs or a few good solutions …. David Abramson Monash University MeSsAGE Lab X

No shortage of applications How many jobs do they want/need? Physics Chemistry Environmental Science Biological Systems Engineering Astronomy

The Nimrod Tool Family Nimrod workflows for robust design and search –Vary parameters –Execute programs –Copy data in and out Sequential and parallel dependencies Computational economy drives scheduling Computation scheduled near data when appropriate Use distributed high performance platforms Upper middleware broker for resources discovery Wide Community adoption 3 Generate scenarios Execution Results gathered Analysis

4 Nimrod/G Grid Middleware Nimrod/ONimrod/E Nimrod Portal Actuators Plan File parameter pressure float range from 5000 to 6000 points 4 parameter concent float range from to points 2 parameter material text select anyof “Fe” “Al” task main copy compModel node:compModel copy inputFile.skel node:inputFile.skel node:substitute inputFile.skel inputFile node:execute./compModel results copy node:results results.$jobname endtask

5 Prepare Jobs using Portal Jobs Scheduled Executed Dynamically Sent to available machines Results displayed & interpreted

Parameter Sweeps and searches A full parameter sweep is the cross product of all the parameters –Too easy to generate millions! An optimization run minimizes some output metric and returns parameter combinations that do this –Limited concurrency (except GAs) Design of Experiments limits number of combinations further. –And old idea …. Results Nimrod/OResults

Issues for millions of jobs Generation issues –Don’t necessarily need 1,000,000 jobs! –Smarter ways of specifying problems Don’t want to see 1,000,000 jobs! Don’t necessarily generate all at once Performance issues –Nimrod/G: Server load Hierarchical resource management –Nimrod/K: Handling token load in matching store Need k-bounded loops ideas from 1980’s Fault tolerance –Engaging the user Don’t want to see 1,000,000 jobs –Distributed experiment management (p2p)?

Issues for millions of jobs Analysis issues –Need smarter ways of interacting with results Scientific visualisation, data mining, mega-pixel displays Commercial realities –License management Need parametric licenses like parallel ones. Appropriate infrastructure –Tera Grid class of machine not most appropriate –Parametric Clouds