F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Grid eXplorer (GdX) An Instrument for eXploring the GRID F. Cappello,

Slides:



Advertisements
Similar presentations
KANSEI TESTBED OHIO STATE UNIVERSITY. HETEREGENOUS TESTBED Multiple communication networks, computation platforms, multi-modal sensors/actuators, and.
Advertisements

Building a CFD Grid Over ThaiGrid Infrastructure Putchong Uthayopas, Ph.D Department of Computer Engineering, Faculty of Engineering, Kasetsart University,
XtreemOS IP project is funded by the European Commission under contract IST-FP XtreemOS: Building and Promoting a Linux-based Operating System.
Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
Rob Allan Daresbury Laboratory NW-GRID Training Event 26 th January 2007 NW-GRID Future Developments R.J. Allan CCLRC Daresbury Laboratory.
University of Chicago Department of Energy The Parallel and Grid I/O Perspective MPI, MPI-IO, NetCDF, and HDF5 are in common use Multi TB datasets also.
C3.ca in Atlantic Canada Virendra Bhavsar Director, Advanced Computational Research Laboratory (ACRL) Faculty of Computer Science University of New Brunswick.
Towards a Virtual European Supercomputing Infrastructure Vision & issues Sanzio Bassini
A Nation Wide Experimental Grid The Grid’5000 project: architecture and objectives Building a nation wide experimental platform for Grid researchers –
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
1 In VINI Veritas: Realistic and Controlled Network Experimentation Jennifer Rexford with Andy Bavier, Nick Feamster, Mark Huang, and Larry Peterson
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
The CrossGrid project Juha Alatalo Timo Koivusalo.
What Great Research ?s Can RAMP Help Answer? What Are RAMP’s Grand Challenges ?
Andrei Goldchleger, Fabio Kon, Alfredo Goldman and Marcelo Finger University of São.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Workload Management Massimo Sgaravatto INFN Padova.
Computer Science Perspective Ludek Matyska Faculty of Informatics, Masaryk University, Brno and also CESNET, Prague.
Scheduling in Heterogeneous Grid Environments: The Effects of Data Migration Leonid Oliker, Hongzhang Shan Future Technology Group Lawrence Berkeley Research.
Dual Stack Virtualization: Consolidating HPC and commodity workloads in the cloud Brian Kocoloski, Jiannan Ouyang, Jack Lange University of Pittsburgh.
1 Enabling Large Scale Network Simulation with 100 Million Nodes using Grid Infrastructure Hiroyuki Ohsaki Graduate School of Information Sci. & Tech.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
The MicroGrid: A Scientific Tool for Modeling Grids Andrew A. Chien SAIC Chair Professor Department of Computer Science and Engineering University of California,
DISTRIBUTED COMPUTING
Cluster Reliability Project ISIS Vanderbilt University.
INFSO-RI Enabling Grids for E-sciencE EGEODE VO « Expanding GEosciences On DEmand » Geocluster©: Generic Seismic Processing Platform.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
IMDGs An essential part of your architecture. About me
Large-scale Deployment in P2P Experiments Using the JXTA Distributed Framework Gabriel Antoniu, Luc Bougé, Mathieu Jan & Sébastien Monnet PARIS Research.
1 Martin Schulz, Lawrence Livermore National Laboratory Brian White, Sally A. McKee, Cornell University Hsien-Hsin Lee, Georgia Institute of Technology.
Grid Basics Adarsh Patil
07:44:46Service Oriented Cyberinfrastructure Lab, Introduction to BOINC By: Andrew J Younge
Development Timelines Ken Kennedy Andrew Chien Keith Cooper Ian Foster John Mellor-Curmmey Dan Reed.
The Globus Project: A Status Report Ian Foster Carl Kesselman
Heavy and lightweight dynamic network services: challenges and experiments for designing intelligent solutions in evolvable next generation networks Laurent.
Suzhen Lin, A. Sai Sudhir, G. Manimaran Real-time Computing & Networking Laboratory Department of Electrical and Computer Engineering Iowa State University,
© 2012 xtUML.org Bill Chown – Mentor Graphics Model Driven Engineering.
Parallel and Distributed Simulation Introduction and Motivation.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
The Scaling and Validation Programme PoC David Groep & vle-pfour-team VL-e Workshop NIKHEF SARA LogicaCMG IBM.
Authors: Ronnie Julio Cole David
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
Performance evaluation on grid Zsolt Németh MTA SZTAKI Computer and Automation Research Institute.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Emulation in Data Grid eXplorer. Emulation problematic Distributed applicationTarget environment Simulation Emulation App. Model Env. model Formal analysis.
August 3, March, The AC3 GRID An investment in the future of Atlantic Canadian R&D Infrastructure Dr. Virendra C. Bhavsar UNB, Fredericton.
1 Wide Area Network Emulation on the Millennium Bhaskaran Raman Yan Chen Weidong Cui Randy Katz {bhaskar, yanchen, wdc, Millennium.
7. Grid Computing Systems and Resource Management
Flip Flop Gates 1 Results Energy was higher than expected Our overall evaluation method sought to prove three hypotheses: (1) that voice-over-IP has actually.
Grid Computing Unit I Introduction. Information anytime anywhere!!! support computation across administrative domains Generally  virtualizing computing.
Auger & XtremWeb: Monte Carlo computation on A Global Computing platform O. Lodygensky, G. Fedak, V. Neri, A.Cordier, F. Cappello Laboratoire de l’Accelerateur.
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
Flip Flop Gates 1 Results Energy was higher than expected Our overall evaluation method sought to prove three hypotheses: (1) that voice-over- IP has actually.
IHP Im Technologiepark Frankfurt (Oder) Germany IHP Im Technologiepark Frankfurt (Oder) Germany ©
Group # 14 Dhairya Gala Priyank Shah. Introduction to Grid Appliance The Grid appliance is a plug-and-play virtual machine appliance intended for Grid.
LIOProf: Exposing Lustre File System Behavior for I/O Middleware
Northwest Indiana Computational Grid Preston Smith Rosen Center for Advanced Computing Purdue University - West Lafayette West Lafayette Calumet.
Page : 1 SC2004 Pittsburgh, November 12, 2004 DEISA : integrating HPC infrastructures in Europe DEISA : integrating HPC infrastructures in Europe Victor.
Grid Institute Scientific Council, September 10, 2008
Clouds , Grids and Clusters
Grid Computing.
GRID COMPUTING PRESENTED BY : Richa Chaudhary.
Class project by Piyush Ranjan Satapathy & Van Lepham
Shadow: Scalable and Deterministic Network Experimentation
CLUSTER COMPUTING.
Development & Evaluation of Network Test-beds
ModelNet: A Large-Scale Network Emulator for Wireless Networks Priya Mahadevan, Ken Yocum, and Amin Vahdat Duke University, Goal:
Defining the Grid Fabrizio Gagliardi EMEA Director Technical Computing
Presentation transcript:

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Grid eXplorer (GdX) An Instrument for eXploring the GRID F. Cappello, O. Richard, P. Sens LRI, ID-Imag, Lip6, etc.

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal What we know about Grid/P2P? Infrastructure features: Large scale distributed systems Large scale distributed systems Dynamic infrastructures (network, nodes) Dynamic infrastructures (network, nodes) Dynamic workloads Dynamic workloads Heterogeneous (parallel & uniproc. nodes) Heterogeneous (parallel & uniproc. nodes) High Computation/Communication perf. Ratio High Computation/Communication perf. Ratio

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal What we know about Grid/P2P? Application features: Large data sets (data bases, physics Large data sets (data bases, physics instruments, generation of very large results) instruments, generation of very large results) Huge computations (in terms of #operations) Huge computations (in terms of #operations) Long run times Long run times Large number of data operations (movements, transactions, etc.) Large number of data operations (movements, transactions, etc.)

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal What we know about Grid/P2P? Utilization: A large number of users (researcher, industry, individual, etc.) A large number of users (researcher, industry, individual, etc.) A large set of applications (research, business) A large set of applications (research, business) High throughput (file download, multi-parameters, computation, etc.) High throughput (file download, multi-parameters, computation, etc.) Real time (Sensors, Entertainment, HPC, etc.) Real time (Sensors, Entertainment, HPC, etc.)

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal What are open issues? Security (GSI, CAS) Security (GSI, CAS) Data Storage/consultation/movement Data Storage/consultation/movement Multi users/ Multi applications scheduling Multi users/ Multi applications scheduling Coordination (virtual, ephemeral infrastructure) Coordination (virtual, ephemeral infrastructure) Programming Programming Fault Tolerance! Fault Tolerance! Scalability Scalability Performance Performance Easy/efficient deployment techniques Easy/efficient deployment techniques Application characterization techniques Application characterization techniques Etc. Etc.

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal What is Grid today? Middleware: Globus, Legion, Netsolve, Unicore, DIET, Condor, XtremWeb, Boinc, NWS Middleware: Globus, Legion, Netsolve, Unicore, DIET, Condor, XtremWeb, Boinc, NWS  they are actually working! Testbed: DataGRID, TeraGRID, Etoile, Grads, XW, Boinc Testbed: DataGRID, TeraGRID, Etoile, Grads, XW, Boinc  Difficult to build (debug, human factor, etc.) Dedicated applications (SETI, Kazaa, Jabber, etc.) Dedicated applications (SETI, Kazaa, Jabber, etc.)  they are working already (at large scale!),  BUT they address much less issues!

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Why Grid is so difficult?  Grids are very complex systems (we still have problems with large scale parallel computers!, we have less control on Grid resources! )  Many issues are addressed simultaneously  We need a methodology enabling the study of Grid issues, independently but realistically.

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal What are the current approaches? Simulators: SimGRID, MicroGRID, etc. Simulators: SimGRID, MicroGRID, etc.  they have strong limitations (scalability, != than execution of real codes, validation) Experimental testbed (is there any?) Experimental testbed (is there any?)  Most testbed are for production, each testeb is specific, representativeness?  Most testbed are for production, each testeb is specific, representativeness?  We have no way to test: ideas independently, at a significant scale, ideas independently, at a significant scale, with realistic parameters and behaviors! with realistic parameters and behaviors!

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal What is missing? A full fledge scientific environment A full fledge scientific environment (reproducible realistic experimental conditions) Probes measuring the performance of real resources and networks (Ganglia, NWS, la grenouille) Probes measuring the performance of real resources and networks (Ganglia, NWS, la grenouille) Fully experimental testbed, (GRID 5000 would remove this lack) Fully experimental testbed, (GRID 5000 would remove this lack)  Not enough, we need instruments with parametrisable reproducible experimental conditions

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Methodology When physicists can’t measure a phenomenon When physicists can’t measure a phenomenon because a) the item to measure is not reachable or b) the phenomenon is hidden by others, because a) the item to measure is not reachable or b) the phenomenon is hidden by others,  They build instruments!  And they observe…

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Grid eXplorer An experimental conditions database or generation An experimental hardware platform A tool set for conducting experiments & measurements

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Grid eXplorer: Analogy with physic instruments A set of sensors Inside real platform A database of Real platform A Hardware Platform for Emulation Simulation A tool set for Observation Results Analysis Validation on real platforms

F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal Grid eXplorer (GdX): Instrument for Grid exploration: An instrument for exploring Grid middleware, algorithms, performance and applications under reproducible experimental conditions An instrument for exploring Grid middleware, algorithms, performance and applications under reproducible experimental conditions A tool set for emulation/simulation of large scale distributed systems A tool set for emulation/simulation of large scale distributed systems 1K CPU clusters + configurable network & OS 1K CPU clusters + configurable network & OS For Grid, P2P, etc. For Grid, P2P, etc. Potentially connected to Grid testbeds Potentially connected to Grid testbeds F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal

Grid eXplorer (GdX): eXperimental conditions database A set of sensors (Nodes, Networks): A set of sensors (Nodes, Networks):  Academic Networks (x K nodes, GRID 5000)  ADSL (la grenouille  60 K nodes) A common format for traces A common format for traces A tool set for accessing, managing traces A tool set for accessing, managing traces Tools for trace analysis Tools for trace analysis F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal

Grid eXplorer (GdX): Hardware platform for eXperiments: 1K CPU clusters 1K CPU clusters configurable network configurable network configurable OS configurable OS Multi-users Multi-users Potentially located/managed by IDRIS or CEA Potentially located/managed by IDRIS or CEA F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal

Grid eXplorer (GdX): Tool set: Emulators (folding 10K virtual nodes on 1K nodes) Emulators (folding 10K virtual nodes on 1K nodes) Parallel simulators (difficult!) Parallel simulators (difficult!) Virtual GRID environment (1k virtual nodes on 1k nodes) Virtual GRID environment (1k virtual nodes on 1k nodes) Measurement tools, Measurement tools, Visualization tools. Visualization tools. F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal

Grid eXplorer (GdX): eXperiments: Real scale pseudo-emulation (1:1 scale) Real scale pseudo-emulation (1:1 scale) Large scale emulation (10:1 scale) Large scale emulation (10:1 scale) Large scale simulation (100:1 scale) Large scale simulation (100:1 scale) Application test (bottleneck discovery, performance evaluation, optimization) Application test (bottleneck discovery, performance evaluation, optimization) Infrastructure test (bottleneck discovery, performance evaluation, optimization) Infrastructure test (bottleneck discovery, performance evaluation, optimization) Connection to Grid 5000, International Grids, etc. Connection to Grid 5000, International Grids, etc. F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal

Grid eXplorer (GdX): Instrument related projects: Instrument projects Instrument projects – Cluster management (ID Imag), – Sensor event data base (LRI-fci, la grenouille), – Event analysis (ID Imag), – Emulators (LRI-fci, etc.), – Experimental conditions injection, – Result visualization (???), – Result analysis, – Validation methodology, – User validation (physicists, biologists, chemists, etc.) F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal

Grid eXplorer (GdX): Grid/P2P research projects: Grid Grid – DataGRID technology (???) – GRIDRPC (LIP, Desprez?) – Code coupling (IRISA?) P2P P2P – Large scale desktop Grids (LRI-fci) – Large scale storage (Laria, Irisa?) – Large scale data consistency (Lip6-P. sens) – Large scale fault tolerance (LRI-Beauquier) Network Network – Emulation of latency, packet loss, bandwidth – Emulation of topology F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal

Grid eXplorer (GdX): User applications: Bio Grid (Genomining, IBBMC, etc.) Bio Grid (Genomining, IBBMC, etc.) Medi GRID (???) Medi GRID (???) Geo GRID (IFP, CGG) Geo GRID (IFP, CGG) Power GRID (EDF) Power GRID (EDF) Industry GRID (EADS, Alcatel Space) Industry GRID (EADS, Alcatel Space) Academic GRID (Orsay, Berkeley, SDSC, Suitzerland, etc.) Academic GRID (Orsay, Berkeley, SDSC, Suitzerland, etc.) Industry Partners (GridXpert???, IBM???) Industry Partners (GridXpert???, IBM???) F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal

Grid eXplorer (GdX) Project organisation A Virtual laboratory A Virtual laboratory A director (or a group), A director (or a group), A scientific director, an administrative director A scientific director, an administrative director A scientific council (with physicists), A scientific council (with physicists), A Researcher group A Researcher group A group of engineers, A group of engineers, A user group A user group A set of scientific events A set of scientific events F. Cappello, O. Richard, P. Sens ---oo Draft oo--- Contact us for experiment proposal

Grid eXplorer GdX A long term effort A long term effort – A medium term milestone: 4 years  A fully functional prototype Many scientific issues (large scale emulation, experimental conditions injection, distance to reality, etc.) Many scientific issues (large scale emulation, experimental conditions injection, distance to reality, etc.) A tool for Grid users or potential users A tool for Grid users or potential users A tool for Grid/P2P developers A tool for Grid/P2P developers