Kento Aida, Tokyo Institute of Technology 1 Tutorial: Technology of the Grid 1. Definition 2. Components 3. Infrastructure Kento Aida Tokyo Institute of.

Slides:



Advertisements
Similar presentations
Kento Aida, Tokyo Institute of Technology Grid Working Group Meeting Aug. 27 th, 2003 Tokyo Institute of Technology Kento Aida.
Advertisements

A Proposal of Capacity and Performance Assured Storage in The PRAGMA Grid Testbed Yusuke Tanimura 1) Hidetaka Koie 1,2) Tomohiro Kudoh 1) Isao Kojima 1)
Gfarm v2 and CSF4 Osamu Tatebe University of Tsukuba Xiaohui Wei Jilin University SC08 PRAGMA Presentation at NCHC booth Nov 19,
Cross-site data transfer on TeraGrid using GridFTP TeraGrid06 Institute User Introduction to TeraGrid June 12 th by Krishna Muriki
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
High Performance Computing Course Notes Grid Computing.
PRAGMA19, Sep. 15 Resources breakout Migration from Globus-based Grid to Cloud Mason Katz, Yoshio Tanaka.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Application of GRID technologies for satellite data analysis Stepan G. Antushev, Andrey V. Golik and Vitaly K. Fischenko 2007.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
Presented by Scalable Systems Software Project Al Geist Computer Science Research Group Computer Science and Mathematics Division Research supported by.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
Grid Services at NERSC Shreyas Cholia Open Software and Programming Group, NERSC NERSC User Group Meeting September 17, 2007.
Workload Management Massimo Sgaravatto INFN Padova.
National Institute of Advanced Industrial Science and Technology Introduction to Grid Activities in the Asia Pacific Region jointly presented by Yoshio.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing.
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
Gridbus Resource Broker for Application Service Costs-based Scheduling on Global Grids: A Case Study in Brain Activity Analysis Srikumar Venugopal 1, Rajkumar.
Grid Computing. What is a Grid? Many definitions exist in the literature Early definitions: Foster and Kesselman, 1998 –“A computational grid is a hardware.
Grid ASP Portals and the Grid PSE Builder Satoshi Itoh GTRC, AIST 3rd Oct UK & Japan N+N Meeting Takeshi Nishikawa Naotaka Yamamoto Hiroshi Takemiya.
NAREGI WP4 (Data Grid Environment) Hideo Matsuda Osaka University.
Science Research: Journey to 10,000 Sources Presented by: Abe Lederman, President and Founder Deep Web Technologies, Inc. Special Libraries Association.
Kento Aida, Tokyo Institute of Technology Grid Challenge - programming competition on the Grid - Kento Aida Tokyo Institute of Technology 22nd APAN Meeting.
DISTRIBUTED COMPUTING
Grid Computing - AAU 14/ Grid Computing Josva Kleist Danish Center for Grid Computing
PRAGMA: Cyberinfrastructure, Applications, People Yoshio Tanaka (AIST, Japan) Peter Arzberger (UCSD, USA)
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Data GRID Activity in Japan Yoshiyuki WATASE KEK (High energy Accelerator Research Organization) Tsukuba, Japan
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
HYDRA: Using Windows Desktop Systems in Distributed Parallel Computing Arvind Gopu, Douglas Grover, David Hart, Richard Repasky, Joseph Rinkovsky, Steve.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
HYDRA: Using Windows Desktop Systems in Distributed Parallel Computing Arvind Gopu, Douglas Grover, David Hart, Richard Repasky, Joseph Rinkovsky, Steve.
10/24/2015OSG at CANS1 Open Science Grid Ruth Pordes Fermilab
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
The Grid the united computing power Jian He Amit Karnik.
Authors: Ronnie Julio Cole David
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 14 February 2006.
Kento Aida, Tokyo Institute of Technology Grid Working Group Aug. 29 th, 2003 Tokyo Institute of Technology Kento Aida.
Kento Aida, Tokyo Institute of Technology Grid working group meeting Jan. 26 th, 2005 Bangkok.
Parallel Computing using Condor on Windows PCs Peng Wang and Corey Shields Research and Academic Computing Division University Information Technology Services.
1 Grid Activity Summary » Grid Testbed » CFD Application » Virtualization » Information Grid » Grid CA.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
National Institute of Advanced Industrial Science and Technology Developing Scientific Applications Using Standard Grid Middleware Hiroshi Takemiya Grid.
Kento Aida, Tokyo Institute of Technology Joint Meeting Grid activities committee and Grid working group Jan. 28 th, 2004 Honolulu.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Data Infrastructure in the TeraGrid Chris Jordan Campus Champions Presentation May 6, 2009.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Accessing the VI-SEEM infrastructure
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Volunteer Computing for Science Gateways
Peter Kacsuk – Sipos Gergely MTA SZTAKI
Realization of a stable network flow with high performance communication in high bandwidth-delay product network Y. Kodama, T. Kudoh, O. Tatebe, S. Sekiguchi.
Grid Computing.
University of Technology
Grid Datafarm and File System Services
Wide Area Workload Management Work Package DATAGRID project
Jan. 24th, 2003 Kento Aida (TITECH) Sissades Tongsima (NECTEC)
Presentation transcript:

Kento Aida, Tokyo Institute of Technology 1 Tutorial: Technology of the Grid 1. Definition 2. Components 3. Infrastructure Kento Aida Tokyo Institute of Technology

Kento Aida, Tokyo Institute of Technology 2 Goal of the Tutorial What is the grid? definition What technology is needed to create the grid? component technology How is the grid environment is constructed? infrastructure

Kento Aida, Tokyo Institute of Technology 3 1. Definition

Kento Aida, Tokyo Institute of Technology 4 Definition of the Grid Definition [ The grid is an infrastructure to dynamically organize a virtual organization (or a virtual computer) on demand by virtualizing and integrating resources such as computers, data, experimental devices, sensors, people. (The original definition is written by Japanese.) What is the grid? A three point checklist [ coordinates resources that are not subject to centralized control using standard, open, general-purpose protocols and interfaces to deliver nontrivial qualities of service

Kento Aida, Tokyo Institute of Technology 5 What can we do using the grid? We can use information resources (services) on network securely (to guarantee security), stably (to use required resources on demand), and easily (without knowledge of network, computers, …). Internet virtual organization

Kento Aida, Tokyo Institute of Technology 6 Examples of Virtual Organizations Members in a collaborative research project Researchers in a collaborative research project share resources distributed over their sites, e.g. universities, institutes, laboratories, ….  large-scale scientific computing  large-scale distributed database Project team in a company Members in a project team share resources distributed over multiple branches in a company.  business  transaction

Kento Aida, Tokyo Institute of Technology 7 Definition of the Grid (again) Definition The grid is an infrastructure to dynamically organize a virtual organization (or a virtual computer) on demand by virtualizing and integrating resources …. What is the grid? A three point checklist coordinates resources that are not subject to centralized control  dynamic organization of VO using standard, open, general-purpose protocols and interfaces  access to resources by standardized protocols to deliver nontrivial qualities of service  Users do not have to have knowledge about network, computers, etc.

Kento Aida, Tokyo Institute of Technology 8 Grid? Grid = supercomputer + network? Grid = idle PCs + network? Grid = large-scale parallel processing on the internet? If we connect our resources to the grid, anonymous users’ jobs will run on our resources without owners’ knowledge? If we submit jobs to the grid, our job will run on resources in unknown sites?

Kento Aida, Tokyo Institute of Technology 9 Classification of the Grid Computing Grid ( high-performance computing ) Data Grid ( high-performance data processing ) Sensor Grid ( advanced sensing ) Access Grid ( support for collaboration ) Business Grid ( advanced web service ) PC Grid ( utilization of idle PCs ) science business

Kento Aida, Tokyo Institute of Technology 10 Computing Grid Grid computing high-performance computing service to utilize computers on the grid Merit of users reducing computation time expanding problem size receiving computation service Component technology security, resource management, job management, programming, problem solve environment (PSE), …

Kento Aida, Tokyo Institute of Technology 11 Data Grid Large-scale data processing/computing large-scale distributed database on the internet data processing service to access distributed data Merit of users high-speed access to distributed data high-performance and reliable processing using large- scale data Component technology security, high-speed data transfer, replica management, scheduling

Kento Aida, Tokyo Institute of Technology 12 Access Grid Communication support on the grid Example remote conference virtual laboratory remote medical service  SARS Grid (NCHC) entertainment  “KARAOKE” Grid (AIST)

Kento Aida, Tokyo Institute of Technology 13 Sensor Grid Advanced Monitoring coordination of autonomous sensors connected by network  wired network, wireless network, satellite, … advanced sensing, analysis, forecasting Example meteorology (weather forecast), ecology, agriculture, …

Kento Aida, Tokyo Institute of Technology 14 Component technology security, information service, resource management job management, scheduling data management programming problem solve environment (PSE) Infrastructure production grid Application applying to big science applying business Technical Issues of the Grid

Kento Aida, Tokyo Institute of Technology Components

Kento Aida, Tokyo Institute of Technology 16 Component Technology of the Grid resource management job management programming problem solve environment security application data management infrastructure (computer, network, experimental device, …) information service

Kento Aida, Tokyo Institute of Technology 17 Security Issues authentication, encryption of communication Single sign on user authentication on one host Authentication on other hosts is automatically performed. internet user authentication authentication is automatically performed. authentication Org. A Org. B Org. C

Kento Aida, Tokyo Institute of Technology 18 Resource Management internet user GW OS A GW OS B GW OS C com. a com. c com. b common command Org. A Org. B Org. C Common interfaces to the grid wrapping differences of commands/operations among different machines

Kento Aida, Tokyo Institute of Technology 19 Information Service internet GW Org. A Org. B Org. C CPU: … memory: … OS: … CPU: … memory: … OS: … info. service network monitoring Information about resources on the Grid

Kento Aida, Tokyo Institute of Technology 20 Big picture of the GT2 Client Site C Site B CA GRIS GIIS gatekeeper User Cert. Proxy Cert. Proxy Cert. Proxy Cert. grid-proxy-init Query Resource Status GridFTP Server Data Transfer process Process invocation Return result [source: Yoshio Tanaka, AIST]

Kento Aida, Tokyo Institute of Technology 21 Job Management user (0) (2) (1,3,4) (4) internet GW Org. A Org. B Org. C info. service resource broker Resource selection, Scheduling, Job control

Kento Aida, Tokyo Institute of Technology 22 Condor High Throughput Computing matching jobs and resources by ClassAds mechanism fault tolerance by check pointing Implementation on the Globus Tool Kit Condor-G Schedd Match maker Startd Client job [ ] owner: aaa CPU: 2GHz 以上 Memory: 512MB 以上 Disk: 10GB 以上 :

Kento Aida, Tokyo Institute of Technology 23 Scheduling Application scheduling Scheduling of a single application (job) on resources  How do we decompose an application program into tasks?  Where do we allocate tasks?  When do we start execution of tasks? Job scheduling Scheduling of multiple jobs on resources  Where do we dispatch jobs on resources?  When do we start execution of jobs? Goal minimizing the execution time, meeting the deadline, minimizing the cost, preserving fairness, …

Kento Aida, Tokyo Institute of Technology 24 Nimrod Job management system for parameter-survey applications computational economy deadline scheduling Implementation on the Globus Tool Kit Nimrod/G [ ] [source: D. Abramson, et.al., “High Performance Parametric Modeling with Nimrod/G: Killer Application for the Global Grid?,” IPDPS2000, 2000 ]

Kento Aida, Tokyo Institute of Technology 25 Data Management user GW Org. A Org. B Org. C replication high-speed file transfer file internet GW data management Distributed file management, High-speed file transfer, Replica management

Kento Aida, Tokyo Institute of Technology 26 Data Grid Applications High Energy Physics Earth Science, Astronomical Observation Bio informatics [source: Osamu Tatebe, AIST]

Kento Aida, Tokyo Institute of Technology 27 Grid Datafarm Peta-to-Exascale Global Filesystem on unified CPU/storage cluster Parallel I/O and parallel processing with local I/O scalability [source: Osamu Tatebe, AIST]

Kento Aida, Tokyo Institute of Technology 28 Trans-Pacific Gfarm Datafarm testbed: Network and cluster configuration 2.4G 10G 1G 2.4G(1G) 1G SuperSINET APAN/TransPAC Los Angeles 622M AIST Titech Maffin 10G APAN Tokyo XP SuperSINET Tsukuba WAN 10G 2.4G New York OC-12 ATM SC2003 Phoenix 32 nodes 23.3 TBytes 2 GB/sec 5G 16 nodes 11.7 TBytes 1 GB/sec 16 nodes 11.7 TBytes 1 GB/sec 7 nodes 3.7 TBytes 200 MB/sec 10 nodes 1 TBytes 300 MB/sec 147 nodes 16 TBytes 4 GB/sec Indiana Univ Kasetsert Univ, Thailand SDSC Trans-Pacific thoretical peak 3.9 Gbps Gfarm disk capacity 70 TBytes disk read/write 13 GB/sec Chicago Abilene KEK Univ Tsukuba NII 1G [2.34 Gbps] [950 Mbps] [500 Mbps] [source: Osamu Tatebe, AIST]

Kento Aida, Tokyo Institute of Technology 29 Programming MPI programming with Message Passing Interface  MPICH-G2 , GridMPI , … GridRPC programming with Remote Procedure Call (RPC) mechanism  Ninf-G , OmniRPC , NetSolve , … Master Worker Template template to develop master-worker programs  MW , AMWAT , …

Kento Aida, Tokyo Institute of Technology 30 GridRPC for (…) { grpc_call_async( ) } for (…) { grpc_call_async( ) } library program input data output data user program internet master worker

Kento Aida, Tokyo Institute of Technology 31 GridRPC ( cont’d ) for (i = start; i <= end; i++) { SDP_search(argv[1], i, &value[i]); } grpc_function_handle_init(&hdl, …, “SDP/search”); for (i = start; i <= end; i++) { grpc_call_async(&hdl, argv[1], i, &value[i]); } for (i = start; i <= end; i++) { SDP_search(argv[1], i, &value[i]); } grpc_function_handle_init(&hdl, …, “SDP/search”); for (i = start; i <= end; i++) { grpc_call_async(&hdl, argv[1], i, &value[i]); } Ninf-G [ ] reference implementation of GridRPC implementation on the Globus Toolkit  using security functions on the Globus (authentication, encrypted communication).

Kento Aida, Tokyo Institute of Technology 32 Problem Solve Environment (PSE) Portal frontend to search, run, monitor, and control applications on the grid  Web page cooperation with a workflow system Workflow mechanism to run multiple applications following their dependencies  representing dependencies among applications by a graph  initiation of applications following the workflow by the workflow engine

Kento Aida, Tokyo Institute of Technology 33 Example of PSE (UNICORE) [source:

Kento Aida, Tokyo Institute of Technology Infrastructure

Kento Aida, Tokyo Institute of Technology 35 Resources in Grid Infrastructure Computer PC, PC cluster, supercomputer, … Storage HDD, RAID, … [source: /R_System/Overview/index.html] [source: Matsuoka Lab, TITECH]

Kento Aida, Tokyo Institute of Technology 36 Resources in Grid Infrastructure (cont’d) Experimental device microscope, accelerator, … Sensor thermometer, camera, … Ultra-High Voltage Electron Microscope, Osaka University [source: Large Hadron Collider, CERN [source: Osamu Tatebe, AIST] EcoGrid, NCHC [source: Fang Pang Lin, NCHC]

Kento Aida, Tokyo Institute of Technology 37 Resources in Grid Infrastructure (cont’d) Network LAN, WAN, internet, … [ source: [source: supertitanet/index.ja.shtml]

Kento Aida, Tokyo Institute of Technology 38 Grid Infrastructure Classification by objectives test bed the grid environment construct to perform experiment.  temporally available production grid the grid environment for production use, or to run practical applications  permanently available.  Resources are fully operated for 24hrs. Classification by geographic sites department grid, campus grid, national grid, international grid

Kento Aida, Tokyo Institute of Technology 39 ACT-JST Testbed Grid testbed for running applications to solve large-scale optimization problem construction of 1000CPU scale testbed application development collaboration among Grid researchers and application scientists TITECH AIST TDU Tokushima U.

Kento Aida, Tokyo Institute of Technology 40 Grid Challenge Federation (GCF) Test bed constructed for the Grid Challenge event, programming contest on the grid Resources Grid Technology Research Center, AIST HPCS Lab., U. Tsukuba Yuba-Honda Lab., UEC Matsuoka Lab., TITECH Aida Lab., TITECH Ono Lab., Tokushima U. Hiraki Lab., U. Tokyo Chikayama-Taura Lab., U. Tokyo

Kento Aida, Tokyo Institute of Technology 41 ApGrid / PRAGMA [ source: Grid Partnership among Asia-Pacific region

Kento Aida, Tokyo Institute of Technology 42 Titech Grid [source:

Kento Aida, Tokyo Institute of Technology 43 NAREGI [source: ]

Kento Aida, Tokyo Institute of Technology 44 TeraGrid [source: The 40Gbps network connects sites. 20TeraFlops , 1PB resources CalTech, ANL, SDSC, NCSA, PSC

Kento Aida, Tokyo Institute of Technology 45 Operation of Infrastructure Objectives An organization/staff is required to stably provide a grid infrastructure to users.  The current internet is operated by experts (organizations) for network operation. Network Operation Center (NOC) Grid Operation Center organization to operate a grid infrastructure providing information of grid resources  resources in VO  load on computing resources, traffic on networks, … user support  accounting, documents archives, help desk, trouble shooting, …

Kento Aida, Tokyo Institute of Technology 46 PRAGMA GOC

Kento Aida, Tokyo Institute of Technology 47 Network Weather Map Thanks: Dr. Hirabaru and APAN Tokyo NOC team

Kento Aida, Tokyo Institute of Technology 48