Grid Computing & Tuple Space Presented by Nelson Chu.

Slides:



Advertisements
Similar presentations
Giggle: A Framework for Constructing Scalable Replica Location Services Ann Chervenak, Ewa Deelman, Ian Foster, Leanne Guy, Wolfgang Hoschekk, Adriana.
Advertisements

The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
Database Architectures and the Web
Distributed Systems basics
Internet Technologies (Grid Computing (OGSA, WSRF) )
High Performance Computing Course Notes Grid Computing.
The Anatomy of the Grid Ian Foster, Carl Kesselman, Steven Tuecke Professor: Dr. M. Sadjadi Presenter: Sri Medam Antagonist: Tariq King.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
An Introduction to Grid Computing Richard Fujimoto Reference: The Grid 2, ch. 1-4, 7 Ian Foster & Carl Kesselman (eds.)
Computer Science Department 1 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
Globus Ian Foster and Carl Kesselman Argonne National Laboratory and University of Southern California
GRID COMPUTING & GRID SCHEDULERS - Neeraj Shah. Definition A ‘Grid’ is a collection of different machines where in all of them contribute any combination.
Grid Computing Net 535.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
Ali YILDIRIM Emre UZUNCAKARA
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
OPEN GRID SERVICES ARCHITECTURE AND GLOBUS TOOLKIT 4
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
DataGrid Middleware: Enabling Big Science on Big Data One of the most demanding and important challenges that we face as we attempt to construct the distributed.
DISTRIBUTED COMPUTING
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
SeLeNe - Architecture George Samaras Kyriakos Karenos Larnaca – April 2003 THE UNIVERSITY OF CYPRUS.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
The Anatomy of the Grid: An Integrated View of Grid Architecture Ian Foster, Steve Tuecke Argonne National Laboratory The University of Chicago Carl Kesselman.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
The Anatomy of the Grid Mahdi Hamzeh Fall 2005 Class Presentation for the Parallel Processing Course. All figures and data are copyrights of their respective.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
Grid Services I - Concepts
Transaction-based Grid Data Replication Using OGSA-DAI Presented by Yin Chen February 2007.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
7. Grid Computing Systems and Resource Management
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
SuperComputing 2003 “The Great Academia / Industry Grid Debate” ?
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Globus —— Toolkits for Grid Computing
Grid Computing.
University of Technology
Grid Computing B.Ramamurthy 9/22/2018 B.Ramamurthy.
Grid Services B.Ramamurthy 12/28/2018 B.Ramamurthy.
Introduction to Grid Technology
Proposed Grid Protocol Architecture Working Group
The Anatomy and The Physiology of the Grid
The Anatomy and The Physiology of the Grid
Presentation transcript:

Grid Computing & Tuple Space Presented by Nelson Chu

What is Grid? Computing Grid is a very large distributed computing network, Virtual Organizations (VO) Computing Grid is a very large distributed computing network, Virtual Organizations (VO) Grid is an infrastructure to collaborate many diverse computing resources in a network to accomplish a task(s). Grid is an infrastructure to collaborate many diverse computing resources in a network to accomplish a task(s).

What is Grid? Classification: Classification: Computational Grid (eg. WestGrid) Computational Grid (eg. WestGrid) Desktop Grid (eg. Desktop Grid (eg. Data Grid (eg. Earth Systems Grid) Data Grid (eg. Earth Systems Grid) Global Grid Forum (GGF), Globus, etc. Global Grid Forum (GGF), Globus, etc. Open Grid Services Architecture - OGSA Open Grid Services Architecture - OGSA Web Service Resource Framework - WSRF (OGSI) Web Service Resource Framework - WSRF (OGSI) ESG

Fabric: Physical resources Resource & Connectivity Protocols: secured access Collective Services: brokering, monitoring User Applications/Tools Grid Architecture

Fabric layer: resource management e.g. computational resources, storage resources, and network resources. Connectivity layer: Communication and authentication Collective layer: global-naming, brokering, scheduling, workload and workflow management, and data replication. Application layer: user defined tools and application

OGSA Architecture

OGSA provides a service oriented architecture to simplify grid services development and to build a transparent grid structure Physical resources (for example: storage, network, and computer) and logical resources services Web services (including WSRF) OGSA architect services (such as security management, data management, workflow management, file system management, etc.) Grid applications

Research Opportunities: Grid Security Multi-institutional users and heterogeneous resources Multi-institutional users and heterogeneous resources Authentication & authorization mechanisms Authentication & authorization mechanisms Different local name spaces, credentials, and/or accounts Different local name spaces, credentials, and/or accounts Trust mechanisms: domain-based, user-based, service- based, network-based, and scheduler-based Trust mechanisms: domain-based, user-based, service- based, network-based, and scheduler-based Fault Tolerance Detect and notify a fault occurrence Detect and notify a fault occurrence Handle any failure Handle any failure

Research Opportunities: Grid Scheduling Workflows in a service-based environment Workflows in a service-based environment Split and distribute jobs to resources Split and distribute jobs to resources Time, cost, performance, completeness, fairness, effectiveness, and security policies. Time, cost, performance, completeness, fairness, effectiveness, and security policies.  Resource discovery  System selection  Job submission: resources reservation and supporting operation  Monitoring and cleaning up

Current Research Trends Scheduling, Security, Fault Tolerance Scheduling, Security, Fault Tolerance Replica location services Replica location services Metadata catalog services Metadata catalog services Reliable file transfer services Reliable file transfer services Globus Toolkit 3 / 4: Replica Catalog, GridFTP, and (GASS) Global Access to Secondary Storage. Globus Toolkit 3 / 4: Replica Catalog, GridFTP, and (GASS) Global Access to Secondary Storage. Globus, IBM, Oracle 10g, Sun, etc. Globus, IBM, Oracle 10g, Sun, etc.

Research Opportunities: Data Data Versioning Data Versioning Global name space: data characterization, heterogeneous systems Global name space: data characterization, heterogeneous systems Data Security Data Security Fault tolerance: recovery Fault tolerance: recovery New challenges: inter-domains replication, data synchronization, replica registration, querying, and meta-catalog. New challenges: inter-domains replication, data synchronization, replica registration, querying, and meta-catalog.

Grid Environment Resources collaboration Resources collaboration Loosely coupling Loosely coupling Time, space, destination Time, space, destination Communication allows significant degree of interoperability, synchronization, and extensibility Communication allows significant degree of interoperability, synchronization, and extensibility Simple, flexible, and powerful mechanism Simple, flexible, and powerful mechanism

Grid System Open issues: data synchronization, fault tolerance, persistence, concurrency, transparency, etc. Open issues: data synchronization, fault tolerance, persistence, concurrency, transparency, etc. Tuple Space paradigm Tuple Space paradigm Simple operations provide powerful inter- process mechanism for communication and synchronization. Simple operations provide powerful inter- process mechanism for communication and synchronization. Flexible for function expansion: ordered tuple, leasing, … Flexible for function expansion: ordered tuple, leasing, …

Tuple Space Developed in a parallel and distributed programming system, Linda, at Yale U Developed in a parallel and distributed programming system, Linda, at Yale U Basic element ~ Tuple Basic element ~ Tuple Tuple: a vector of typed values / fields / variables Tuple: a vector of typed values / fields / variables Operations: Writing, Reading, Consuming & Operations: Writing, Reading, Consuming & Matching Matching

Tuple Space TS Vs DBS TS Vs DBS SpacesTwo SpacesOne A C Read Take “B” … … … wait/abort Write Processes Producers Consumers

Tuple Space Globally shared memory space (Communication buffer) Globally shared memory space (Communication buffer) Tuples are associatively addressed by matching Tuples are associatively addressed by matching Loosely coupling Loosely coupling Mechanism for inter-process communication and synchronization Mechanism for inter-process communication and synchronization Simple, flexible, powerful, … Simple, flexible, powerful, … ~> Grid Resource Collaboration

Tuple Space on the Grid Service Registration Service Registration Service Discovery Service Discovery Service Lookup Service Lookup Job Scheduler Job Scheduler Data Mgmt Data Mgmt Tuple Spaces (Services / Tasks / Data) Consumers Producers / Processes

Tuple Space on the Grid with Java Technology Grid Enabled Resource Mgmt (GRAM) Security Feature Fault Tolerant Data Mgmt Look up Service Discovery Mgmt Leasing Plug and Work Resources / Services Collaboration

Grid Job Scheduling Eg) WestGrid, … Eg) WestGrid, … Grid I Scheduler/ Resources Broker Resource Meta-Scheduler Grid II Centralized System Load balance Synchronization Fault Tolerance Policy Meta-Scheduler

Tuple Spaces Based Gird Job Scheduler Resource 1 Agent Resource N Agent Registrar Job Desc. Resources Required Job Desc. Resources Required Result SecurityCheck() PolicyCheck() TakeJob() WriteResult() Resources List Register () JobSpace JobSubmit() Registrar ResourcesList() ResourcesListExchange() JobExchange() Domain II Users Figure 1: Tuple Spaces Based Grid Scheduler GetResult() LeaseRenew () Fault Tolerance Management Fault Tolerance Management Leasing Structure for Dynamic Participation Leasing Structure for Dynamic Participation On-Demand Service On-Demand Service Automatic Load Balance Automatic Load Balance Global Name Space Global Name Space Security model ? Security model ?

View 3 View 1 Query … SearchView() GetJobTicket() QueueTicket() GridFTP() / TransferSchedule() SecCheck() PolicyCheck() TakeJob() UpdateView() EventNotify() UpdateData() … PriorityCheck() New Data Derived Data RenewL() TicketBox ViewSite OpenSpace Figure 2: Data Access Management with Tuple Spaces Registrar ResourcesListExchange() Domain II Resource Agent EventNotify() Update() EventNotify() Register() JobExchange () Replica Ticket: View(s)/Query(ies) Certificate User information Job description … Gird Data Management System with Tuple Spaces

References Ian Foster, Carl Kesselman, and Steven Tuecke, “The Anatomy of the Grid: Enabling Scalable Virtual Organizations”, Intl J. Supercomputer Applications, 2001 Ian Foster, Carl Kesselman, and Steven Tuecke, “The Anatomy of the Grid: Enabling Scalable Virtual Organizations”, Intl J. Supercomputer Applications, 2001 Ian Foster, Carl Kesselman, Jeffrey M. Nick, and Steven Tuecke, “The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration”, Ian Foster, Carl Kesselman, Jeffrey M. Nick, and Steven Tuecke, “The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration”, Gregory T. Byrd, and Praveenkumar Cheruvu, “Tuple Space Computing on the Grid”, North Carolina State University Gregory T. Byrd, and Praveenkumar Cheruvu, “Tuple Space Computing on the Grid”, North Carolina State University K.A. Hawick, H.A. James, and L.H. Pritchard, “Tuple-Space Based Middleware for Distributed Computing”, Technical Report DHPC- 128, University of Wales K.A. Hawick, H.A. James, and L.H. Pritchard, “Tuple-Space Based Middleware for Distributed Computing”, Technical Report DHPC- 128, University of Wales Globus, Globus, IBM, IBM,

Discussions Thank you !

Grid DBS Vs Distributed DBS Grid DB System is a Distributed DB System Grid DB System is a Distributed DB System Inter-domains, Policy, Security Inter-domains, Policy, Security Large amount of data intensive users/applications access to large amounts of data (terabytes or petabytes) Large amount of data intensive users/applications access to large amounts of data (terabytes or petabytes) Distributed data population (production, replication) Distributed data population (production, replication) Query processing techniques Query processing techniques Secured and efficient data transport protocol Secured and efficient data transport protocol Dynamic environment and data: Dynamic environment and data: Grid (loosely coupled) ~ DDBS (highly coupled) Grid (loosely coupled) ~ DDBS (highly coupled)