1a.1 Introduction to Grid Computing ITCS 4146/5146, UNC-Charlotte, B. Wilkinson, 2007 Jan 17, 2007.

Slides:



Advertisements
Similar presentations
Distributed Data Processing
Advertisements

High Performance Computing Course Notes Grid Computing.
State-Wide Collaborative Grid Computing Course Barry Wilkinson Department of Computer Science University of North Carolina at Charlotte and Mark Holliday.
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
Seminar Grid Computing ‘06 Hui Li Sep 18, Overview Brief Introduction Presentations –Architecture –Functionality/Middleware –Applications Projects.
1.1 Introduction to Grid Computing ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson.
Outline.1 Grid Computing Fall 2005 Tuesday/Thursday 11:00 am - 12:20 pm Instructors Dr. Barry Wilkinson University of North Carolina at Charlotte and Dr.
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
1 Short Course on Grid Computing Jornadas Chilenas de Computación 2010 INFONOR-CHILE 2010 November 15th - 19th, 2010 Antofagasta, Chile Dr. Barry Wilkinson.
1 July 30, 2005 Grid Computing Principles Consortium for Computational Science and High Performance Computing 2005 Summer Workshop, July 29-July 31, 2005.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
1 Teaching Grid Computing across North Carolina and Beyond Dr. Clayton Ferner University of North Carolina Wilmington Dr. Barry Wilkinson University of.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
1 Workshop 20: Teaching a Hands-on Undergraduate Grid Computing Course SIGCSE The 41st ACM Technical Symposium on Computer Science Education Friday.
Outline.1 Grid Computing Spring 2007 Tuesday/Thursday 11:00 am - 12:15 pm Instructors Dr. Barry Wilkinson University of North Carolina, Charlotte and Dr.
Outline.1 Grid Computing Spring 2010 Tuesday/Thursday 11:00 am - 12:15 pm Instructors © 2010 B. Wilkinson/Clayton Ferner. Modification date: Dec 20, 2009.
Globus Computing Infrustructure Software Globus Toolkit 11-2.
Tutorial on Distributed High Performance Computing 14:30 – 19:00 (2:30 pm – 7:00 pm) Wednesday November 17, 2010 Jornadas Chilenas de Computación 2010.
1 Globus Developments Malcolm Atkinson for OMII SC 18 th January 2005.
Globus 4 Guy Warner NeSC Training.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
Executive summary Grid computing has become an important concept for high performance computing. By taking advantage of the Internet, geographically distributed.
1 Grid Computing Barry Wilkinson Department of Computer Science University of North Carolina at Charlotte.
OPEN GRID SERVICES ARCHITECTURE AND GLOBUS TOOLKIT 4
DISTRIBUTED COMPUTING
CoG Kit Overview Gregor von Laszewski Keith Jackson.
Friday, October 20, 2006 Barry Wilkinson Department of Computer Science University of North Carolina Charlotte Grid Computing Activities within the Department.
1 “The grid virtualizes heterogeneous geographically disperse resources” "Introduction to Grid Computing with Globus," IBM Redbooks  Using geographically.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Grid Computing Activities within the Department of Computer Science at UNC-Charlotte ITSC 8110 Introduction to Information Technology Research 7:30 pm,
A Geographically-Distributed, Assignment-Structured Undergraduate Grid Computing Course Mark A. Holliday, Barry Wilkinson, Jeffrey House, Samir Daoud,
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
1-1.1 Sample Grid Computing Projects. NSF Network for Earthquake Engineering Simulation (NEES) 2004 – to date‏ Transform our ability to carry out research.
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
Grid Infrastructure group (Charlotte): Barry Wilkinson Jeremy Villalobos Nikul Suthar Keyur Sheth Department of Computer Science UNC-Charlotte March 16,
Panel #2 Discussion SIMVac Symposium 1:30 pm – 2:00 pm April 25, 2006 VisualGrid: An Infrastructure for Visualization and Environmental Research Fostering.
Grids - the near future Mark Hayes NIEeS Summer School 2003.
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Grid Services I - Concepts
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
Outline.1 Grid Computing Fall 2008 Tuesday/Thursday 11:00 am - 12:15 pm Instructors Grid Computing, B. Wilkinson, 2008 Aug Dr. Clayton Ferner.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
1 Barry Wilkinson University of North Carolina, Charlotte Clayton Ferner University of North Carolina, Wilmington NSF CCLI Showcase SIGCSE 2007 Friday,
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
SURAGrid Project Meeting Washington, DC Wednesday, February 22, 2006 Barry Wilkinson Department of Computer Science UNC-Charlotte SURAGrid and Grid Computing.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
1-1.1 Introduction to Grid Computing © 2011 B. Wilkinson/Clayton Ferner. Modification date: June 20, 2011.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
7. Grid Computing Systems and Resource Management
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
Parallel Computing Globus Toolkit – Grid Ayaka Ohira.
Bob Jones EGEE Technical Director
Clouds , Grids and Clusters
Grid Computing Course Development team: Barry Wilkinson and Clayton Ferner (Instructors), and Mark Holliday Student assistants: Jeff House and Sam Daoud.
Large Scale Distributed Computing
Grid Computing Software Interface
Presentation transcript:

1a.1 Introduction to Grid Computing ITCS 4146/5146, UNC-Charlotte, B. Wilkinson, 2007 Jan 17, 2007

1a.2 “The grid virtualizes heterogeneous geographically disperse resources” from "Introduction to Grid Computing with Globus," IBM Redbooks Using geographically distributed and interconnected computers together for computing and for resource sharing. Grid Computing

1a.3 Need to harness computers Original driving force behind grid computing same as behind the early development of networks that became the Internet: – Connecting computers at distributed sites for high performance computing.

1a.4 History Began in mid 1990’s with experiments using computers at geographically dispersed sites. Seminal experiment – “I-way” experiment at 1995 Supercomputing conference (SC’95), using 17 sites across the US running: –60+ applications. –Existing networks (10 networks).

1a.5 Grid computing is about collaborating and resource sharing as much as it is about high performance computing.

1a.6 Virtual Organizations Grid computing offers potential of virtual organizations: –groups of people, both geographically and organizationally distributed, working together on a problem, sharing computers AND other resources such as databases and experimental equipment. Crosses multiple administrative domains.

1a.7 Shared Resources Can share much more than just computers: Storage Sensors for experiments at particular sites Application Software Databases Network capacity, …

1a.8 Interconnections and Protocols Focus now on: using standard Internet protocols and technology, i.e. HTTP, SOAP, web services, etc.,

1a.9 Applications Originally e-Science applications –Computational intensive Traditional high performance computing addressing large problems Not necessarily one big problem but a problem that has to be solved repeatedly with different parameters. –Data intensive Computational but emphasis on large amounts of data to stored and processed –Experimental collaborative projects

1a.10 Now also e-Business applications –To improve business models and practices. –Sharing corporate computing resources and databases –On-demand grid computing

1a.11 Computational Grid Applications Biomedical research Industrial research Engineering research Studies in Physics and Chemistry …

1a.12 Sample Grid Computing Projects

NSF Network for Earthquake Engineering Simulation (NEES) Transform our ability to carry out research vital to reducing vulnerability to catastrophic earthquakes from I. Foster Environment/Earth

1a.14 DOE Earth System Grid Goal: address technical obstacles to sharing and analysis of high-volume data from advanced earth system models From I. Foster

1a.15 Earth System Grid I. Foster

1a.16 ox.ac.uk/ Medicine /Biology

1a.17

1a.18

1a.19 Large Hadron Collider experimental facility for complex particle experiments at CERN (European Center for Nuclear Research, near Geneva Switzerland). Physics CERN grid

1a.20

1a.21

1a.22 datagrid.web.cern.ch /eu-datagrid/ Data grids

1a.23 Grid computing infrastructure projects Not tied to one specific application

1a.24 Grid networks for collaborative grid computing projects Grids have been set up at local level, national level, and international level throughout the world, to promote grid computing Grid Networks

1a.25 TeraGrid Funded by NSF in 2002 to link 5 supercomputer sites with 40 Gb/s links

1a.26 TeraGrid

1a.27 Grid2003: An Operational National Grid  28 sites: Universities + national labs  2800 CPUs, 400–1300 jobs  Running since October 2003  Applications in HEP, LIGO, SDSS, Genomics Korea From “A Grid of One to a Grid of Many,” Miron Livny, UW-Madison, Keynote presentation, MIDnet conference, 2005.

1a.28 SURAGrid

1a.29 Close to home: From “Grid Computing in the Industry” by Wolfgang Gentzsch, presentation to Fall 2004 grid computing course. Full set of slides on course home page.

1a.30 National Grids Many countries have embraced grid computing and set-up grid computing infrastructure: UK e-Science grid Grid-Ireland NorduGrid DutchGrid POINIER grid (Poland) ACI grid (France) Japanese grid etc, etc., …

1a.31 UK e-Science Grid

1a.32 Campus Grids Several examples of grids within one university and across campuses Example University of Virginia Campus Grid

1a.33 Grid Computing Software

1a Distributed computing Remote Procedure calls (RPC) Concept of service registry Beginnings of service oriented architecture Object oriented approaches CORBA (Common Request Broker Architecture) Java Remote Method Invocation (RMI) Cluster computing Software Techniques: Computing platforms: Parallel computers Geographically distributed computers (Grid computing) Web services SC’95 experiment Adopted for grid infrastructure components Internet mark-up languages, HTML XML IP addresses, URLs, … ports, protocols Networks Globus toolkit 4.03.x2.x1.x

1a.35 Globus Project Open source software toolkit developed for grid computing. Roots in I-way experiment. Work started in Four versions developed to present time. Reference implementations of grid computing standards. Defacto standard for grid computing.

1a.36 Globus Toolkit: Recent History GT2 (2.4 released in 2002) –GRAM, MDS, GridFTP, GSI. GT3 (3.2 released mid-2004): redesign –OGSA (Open Grid Service Architecture) - OGSI (Open Grid Services Infrastructure) based. –Introduced “Grid services” as an extension of web services. –OGSI now abandoned. GT4 (release for April 2005): redesign –WSRF (Web service Resource Framework) based. –Grid standards merged with Web services.

1a.37 Globus A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this infrastructure Version 4 is web-services based Some non-web services code exists from earlier versions (legacy) or where not appropriate (for efficiency, etc.).

1a.38 Globus Toolkit Five parts: Common Runtime –GT Core for building new services Security –To provide secure access. Based upon Grid Security Infrastructure (GSI) Execution management –Initiation, monitoring, management, scheduling and coordination of executable programs (jobs) Data management –Discover, transfer, and access large data Information services –Discover & monitor dynamic services

1a.39 Each part comprises a set of web services and/or non-web service components. Some built upon earlier versions of Globus.

Data Management Security Common Runtime Execution Management Information Services Web Services Components Non-WS Components Pre-WS Authentication Authorization GridFTP Grid Resource Allocation Mgmt (Pre-WS GRAM) Monitoring & Discovery System (MDS2) C Common Libraries GT2GT2 WS Authentication Authorization Reliable File Transfer OGSA-DAI [Tech Preview] Grid Resource Allocation Mgmt (WS GRAM) Monitoring & Discovery System (MDS4) Java WS Core Community Authorization Service GT3GT3 Replica Location Service XIO GT3GT3 Credential Management GT4GT4 Python WS Core [contribution] C WS Core Community Scheduler Framework [contribution] Delegation Service GT4GT4 Globus Open Source Grid Software I Foster

1a.41 From “Globus Toolkit 4 Tutorial,” MCNC Jan-Feb, 2005, Pawel Plaszczak and Bogdan Lobodzinski, Gridwise Technologies. 2. discover resource 3. submit job 4. transfer data 1.secure environment GSI GRAM MDS GridFTP

1a.42 Supercomputing 2003 Demonstration We used Globus version 2.4 in a Supercomputing 2003 demo organized by the University of Melbourne. 21 countries involved, numerous sites.

1a.43

1a.44

1a.45 A re-implementation of version 2 based upon the Open Grid Service Architecture (OGSA) and Open Grid Service Infrastructure (OGSI) “standards”. The first move towards a web services implementation. We used version 3.2 for the Fall 2004 course. Version 3

1a.46 Grid Computing Course (Fall 2004) Originated from WCU on NCREN network. Broadcast to: –UNC-Wilmington –NC State University –UNC-Asheville –UNC-Greensboro –Appalachian State University –NC Central University –Cape Fear Community College –Elon University Instructors: –Barry Wilkinson and Clayton Ferner (UNC-W) Several faculty helped at various sites 43 students WCU teleclassroom

1a.47 Participating Sites

1a.48 Globus version 3 had a very short life (a little over 2 years, ). Underlying implementation of version 3.x used a type of extension to web services (OGSI) that was not embraced by the community. Globus Version 3

1a.49 Version 4 Released early OGSA kept but OGSI abandoned in favor of new implementation standard based around a more compatible version of web services called Web Services Resource Framework (WSRF) standard.

1a.50 Participating Sites, Fall 2005 Participating UNC campuses Private institutions

1a.51 Fall 2005 Course grid structure MCNCUNC-WUNC-ANCSUWCUUNC-CASU CA Backup facility, not actually used

1a.52 National Publicity Science Grid This Week Feature story Gridtoday.com

1a.53 Web Services-Based Grid Computing Grid Computing is now strongly based upon web services. Large number of newly proposed grid computing standards: –WS-Resource Framework –WS-Addressing –etc., etc. …..

1a.54 Other grid computing software National Science Foundation started NSF middleware initiative in 2001 for bringing together all important grid computing software including: Grid portals Web based interfaces to accessing and controlling grid resources and to communicate with other members of Virtual Organization

1a.55 Grid Computing Course (Spring 2007) Uses GT version 4 (most recent version) Redesigned course, now also with OGCE2/Gridsphere grid computing portal software. Three sites –UNC-Charlotte –UNC-Wilmington –UNC-Asheville

1a.56 Other software Meta-schedulers – to allow job to be scheduled across grid resources. Taken some time to develop meta-schedulers. Example Gridway Pre-existing local schedulers schedule jobs once at a local cluster

1a.57 Resource Discovery Still primitive and in research but ideal is to be able to submit a job and the system find the best grid resources for that job across the whole grid

1a.58 Other issues Account management Users need access to resources, which means one way or the other, users need accounts on all resource at their disposal (or access through common VO accounts). For the course, we set up accounts manually, but in real production grids, use automated tools to assist.

1a.59 Security A big issue. Has to cross administrative domains. Agreed mechanisms. Focus is on Internet security mechanisms, modified to handle the special needs of grid computing. Will look at this in detail later in course.

1a.60 VisualGrid project Project to develop a grid-enabled a bio- informatics algorithm hardware accelerator – Principal Investigators, Arun A Ravindran and Arindam Mukherjee (EE dept) Grid computing projects at UNC-C

1a.61 VisualGrid Project Goal: Collaborative environmental visualization research using a grid computing infrastructure Started Jan 2006 Involves two sites, UNC-Charlotte and UNC-Asheville plus Environment Protection Agency, Raleigh, NC (funding agency) EPA

1a.62 Project Structure at UNC-C (Virtual Organization) Visualization Charlotte Visualization Center Bill Ribrasky, Bank of America Endowed Chair of Information Technology (VisualGrid PI) Aidong Lu, Asst. Professor of Computer Science Environmental Studies Global Inst. of Energy & Environmental Syst. Hilary Inyang, Duke Energy Distinguished Professor Sunyoung Bae, Research Associate Grid Infrastructure Barry Wilkinson, Professor of Computer Science

1a.63 VisualGrid Infrastructure Group: Goal: To create a geographically distributed set of resources and facilitate collaboration between VisualGrid researchers. Team: Barry Wilkinson Jeremy Villalobos (MS student) Nikul Suthar (MS Student) Keyur Sheth (MS student) Jasper Land (BS student) Department of Computer Science UNC-Charlotte Infrastructure Support 52-node University Research Cluster Chuck Price, Director of University Research Computing Mike Mosley, Senior Systems Developer

1a.64 Development System (Four 3.4 Ghz dual Xeons) visualgrid.uncc.edu Visualization lab data server (4 Tbytes) Compute resources 52-node (104 processor) University Research Cluster Software: Globus 4.0, Condor. CA Certificate Authority UNC-Charlotte resources UNC-Asheville resources transylvania.tr.cs.unca.edu (8-node system) VisualGrid Configuration VisualGrid portal

1a.65 National Attention Listed as one of the portals to use OGCE2

1a.66

1a.67 UNC-Asheville Bioinformatics hardware accelerator 52-node UNC-Charlotte university research cluster UNC-C Dept of CS grid computing development system 4TB Windows 2003 data server reached through coit-grid02.uncc.edu (samba mount)

1a.68 Sample VisualGrid portlets One CMAQ script editing portlet CMAQ portlet, main page CMAQ settings portlet Tabs for various CMAQ actions

1a.69 VisualGrid Links VisualGrid Infrastructure group page VisualGrid portal VisualGrid Portal User’s Guide wiki

1a.70 There will (may) be multiple-choice quizzes in the course (on-line through WebCT). Quiz Question: What is a virtual organization? (a) An imaginary company. (b) A web-based organization. (c) A group of people geographically distributed that come together from different organizations to work on grid project. (d) A group of people that come together to work on a virtual reality grid project.

1a.71 Questions