Introduction to Taiwan UniGrid Yeh-Ching Chung Department of Computer Science National Tsing Hua University.

Slides:



Advertisements
Similar presentations
LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Advertisements

Legacy code support for commercial production Grids G.Terstyanszky, T. Kiss, T. Delaitre, S. Winter School of Informatics, University.
Welcome to Middleware Joseph Amrithraj
High Performance Computing Course Notes Grid Computing.
楊竹星 國立成功大學電機工程系 98學年第一學期
CoreGRID Workpackage 5 Virtual Institute on Grid Information and Monitoring Services Authorizing Grid Resource Access and Consumption Erik Elmroth, Michał.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
OxGrid, A Campus Grid for the University of Oxford Dr. David Wallom.
Introduction to Web Database Processing
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
UbiStream Motivation  Streaming data are abundant in our surroundings: Length of queue at cafeteria If the stadium is crowded or not Course.
Introduction to Web Interface Technology (CSE2030)
Multiple Tiers in Action
Webometrics 網路大學排名分析 資工系助理教授兼圖資處資訊應用組組長 陳嘉平 資料收集:圖資處資訊應用組潘筱薇、資工系許峰閣.
Decision Support Systems for Supply Chain Management Chap 10 王仁宏 助理教授 國立中正大學企業管理學系 ©Copyright 2001 製商整合科技中心.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
ASP.NET 2.0 Chapter 6 Securing the ASP.NET Application.
Chapter 6: An Introduction to System Software and Virtual Machines
What is adaptive web technology?  There is an increasingly large demand for software systems which are able to operate effectively in dynamic environments.
Taiwan UniGrid Yeh-Ching Chung Department of Computer Science National Tsing Hua University Hsin-Chu, 300, Taiwan.
Globus Computing Infrustructure Software Globus Toolkit 11-2.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Web Programming Language Dr. Ken Cosh Week 1 (Introduction)
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
INTRODUCTION TO WEB DATABASE PROGRAMMING
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Riccardo Bruno INFN.CT Sevilla, Sep 2007 The GENIUS Grid portal.
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
1 Guide to Novell NetWare 6.0 Network Administration Chapter 13.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Towards a Javascript CoG Kit Gregor von Laszewski Fugang Wang Marlon Pierce Gerald Guo
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Information Grid Services in the Polish Optical Internet PIONIER Cezary Mazurek, Maciej Stroiński, Jan Węglarz.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Invitation to Computer Science 5 th Edition Chapter 6 An Introduction to System Software and Virtual Machine s.
INVITATION TO COMPUTER SCIENCE, JAVA VERSION, THIRD EDITION Chapter 6: An Introduction to System Software and Virtual Machines.
Bookkeeping Tutorial. Bookkeeping & Monitoring Tutorial2 Bookkeeping content  Contains records of all “jobs” and all “files” that are created by production.
「串流代理伺服器平台」之設計與實做 Design and Implementation of a Streaming Proxy Server Platform for Internet Video Streaming 國科會自由軟體專案計畫 (NSC E ) 國立屏東科技大學資訊管理系.
Computing Infrastructure for Large Ecommerce Systems -- based on material written by Jacob Lindeman.
Module 10 Administering and Configuring SharePoint Search.
Grid Programming on Taiwan Unigrid Platform. Outline Introduction to Taiwan Unigrid How to use Taiwan Unigrid.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Java -- A very Hot Object- Oriented Language Instructors: Fu-Chiung Cheng ( 鄭福炯 ) Associate Professor Computer Science & Engineering Tatung Institute.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
The LSAM Proxy Cache - a Multicast Distributed Virtual Cache Joe Touch USC / Information Sciences Institute 元智大學 資訊工程研究所 系統實驗室 陳桂慧
CFTP - A Caching FTP Server Mark Russell and Tim Hopkins Computing Laboratory University of Kent Canterbury, CT2 7NF Kent, UK 元智大學 資訊工程研究所 系統實驗室 陳桂慧.
Dispatching Java agents to user for data extraction from third party web sites Alex Roque F.I.U. HPDRC.
Timeshared Parallel Machines Need resource management Need resource management Shrink and expand individual jobs to available sets of processors Shrink.
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
Gennaro Tortone, Sergio Fantinel – Bologna, LCG-EDT Monitoring Service DataTAG WP4 Monitoring Group DataTAG WP4 meeting Bologna –
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Fault – Tolerant Distributed Multimedia Streaming Web Application By Nirvan Sagar – Srishti Ganjoo – Syed Shahbaaz Safir
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Managing, Storing, and Executing DTS Packages
Distributed System 電機四 陳伯翰 b
University of Technology
Basic Grid Projects – Condor (Part I)
EGEE Middleware: gLite Information Systems (IS)
Presentation transcript:

Introduction to Taiwan UniGrid Yeh-Ching Chung Department of Computer Science National Tsing Hua University

Outline Introduction Portal and SSO Global Queue Resource Broker Job Scheduler Information Service Storage Service Applications

Introduction (1) The purpose of grid computing is to integrate various resources within a large network environment. The purpose of the UniGrid project is to build a platform for academic research using grid-related technologies in Taiwan.

Introduction (2) 8 institutes join to develop the system – 國網中心 – 清華大學資工系 – 中研院資科所 – 東華大學資工系 – 東海大學資科系 – 中華大學資工系 – 興國管理學院電子商務學系 – 靜宜大學資訊管理系

Introduction (3) 台灣大學電機系 台灣大學資工系 台灣師大資工系 台北大學資工系 淡江大學資工系 德明技術學院資科系 交通大學資工系 新竹教育大學資工所 中興大學資科系 逢甲大學資工系 台中教育大學資科系 國家高速網路與計算中心中群 修平技術學院資管系 彰化師大資工系 中正大學資工系 成功大學電機系 成功大學資工系 台南大學數位學習科技系 長榮大學資管系 立德管理學院資管系 中山大學電機系 義守大學資工系 高雄大學資工系 台東大學資訊管理學系 Over 20 institutes join Taiwan UniGrid platform

Introduction (4) All institutes that participate in the UniGrid project contribute some resources. These resources can be used in collaboration for large scale applications.

Introduction (5) System Architecture

Outline Introduction Portal and SSO Global Queue Resource Broker Job Scheduler Information Service Storage Service Applications

Portal and SSO (1) The UniGrid portal provides an interface for UniGrid users to use the resources available in the UniGrid system. Functionalities of the portal – Project information – Single sign-on – Resource Monitoring – User workflow management

Portal and SSO (2)

Single Sign-On (1) Single sign-on is a mechanism whereby a single authentication can permit a user to access all resources where he has access permission, without the need to enter multiple passwords. – All user account information are kept in a database at the portal site. – When a user requests a service, his/her verification data is passed to that service. – The request will be granted only if the identity is verified by the verification service

Single Sign-On (2) Using MyProxy server The proxy could provide – User’s limitations or not overdue proxy (for user) – Password (for RB or other components)

Resource Monitor (1) UniGrid users can examine the status of system resources through the portal. The portal gathers the current system information from the information service and present these information to the users.

Resource Monitor (2) Screenshot of the system status monitoring

Resource Monitor (3) Screenshot of open service monitor

User Workflow Management (1) A user can design and execute the workflow through the UniGrid portal. Workflow Management can handle job dependency and pass independent task to resource broker A user can also monitor the status of his workflow through the UniGrid portal.

User Workflow Management (2) Structure of a workflow sequential execution parallel execution Workflow

User Workflow Management (3) Screenshot of the workflow editing web page

User Workflow Management (4) Screenshot of the workflow monitoring web page

Outline Introduction Portal and SSO Global Queue Resource Broker Job Scheduler Information Service Storage Service Applications

Global Queue (1) All independent jobs from workflow manager is stored in global queue and waiting for scheduling Global queue uses database to store all job requirements and provides failure recover capability when program failures

Global Queue (2) Three queues with configurable capacity in UniGrid – Waiting queue (DB) Store all job information from G.Q. into database – Ready queue (Memory) Periodically grab DB for new jobs into ready queue When job in ready queue, perform scheduling – Running queue (Memory) Store running jobs (thread) Control parallel degree

Global Queue (3) Develop queue scheduler to control the queue behavior – JobDBCrawler Crawling DB for new jobs – SPSController Control when to call Scheduler

Global QueueResource Broker

Outline Introduction Portal and SSO Global Queue Resource Broker Job Scheduler Information Service Storage Service Applications

Resource Broker (1) Resource broker is designed to help users to perform job execution process automatically Main steps of resource broker – Query resource information – Resource matchmaking (job scheduler) – Submit jobs for execution – Retrieve and store results

Resource Broker (2) Each participating organization has a local scheduler (Condor) installed to schedule the jobs assigned to that organization. Condor – A scheduler for large collections of distributively owned computing resources – Developed by the researchers at University of Wisconsin – Specialized for compute-intensive jobs

Query resource information Obtain system information from information service – Static and dynamic resource – Dynamic network information Obtain local condor information from each condor master – Total/Available CPUs uniblade01.cs.nthu.edu.tw,16,4,12 zeta1.hpc.csie.thu.edu.tw,10,0,10 hkugrid01.hku.edu.tw,32,0,26 iisgrid01.iis.sinica.edu.tw,14,0,14 srbn01.csie.chu.edu.tw,4,0,3 grid1.ndhu.edu.tw,5,0,5 total, owner, free

Submit jobs to local scheduler Use multi-thread to submit and execute jobs to each sites Job execution flow – Obtain user proxy – Transfer program and data – Generate AP specific file (rsl, machinefile) – Execute

Retrieve and store results Retrieve result from job execution site when job finish or failure – Execution result (screen output) – Execution log (for debug) – Output file

Outline Introduction Portal and SSO Global Queue Resource Broker Job Scheduler Information Service Storage Service Applications

Job Scheduler (1) Job scheduler is used to control the scheduling and allocation policy of each jobs in queue. – Scheduler Control the job order in queue (ready queue) – Allocation Control which resource to submit

Job Scheduler (2) Implemented algorithms – Scheduling First come first serve (FCFS) Smallest job first (SJF) – Allocation Single Pool –Only can submit to one site Multi Pool –Can submit cross multi-site Single Pool Job Preference –Take user defined job preference such as CPU- bound or communication-bound into consider

Outline Introduction Portal and SSO Global Queue Resource Broker Job Scheduler Information Service Storage Service Applications

Information System (1) Information service include monitoring resource and network status Resource – Static CPU frequency, total memory, etc … – Dynamic CPU loading, free memory, etc … Network – Bandwidth – Latency

Information System (2) Network information model

Information System (3) All resource information are collected by Ganglia and presented in XML format

Outline Introduction Portal and SSO Global Queue Resource Broker Job Scheduler Information Service Storage Service Applications

Storage Service (1) The goal of storage service is to provide a collaborative space where UniGrid users can share their data and resources with others. Components of the storage service – Virtual storage system – Data management system

Storage Service (2) Five SRB Zone for different geographic distributed locations – Each Zone contain one MCAT server Each site provides at least one server to join different Zone to form SRB data grid

Storage Service (3) System architecture

Virtual Storage System (1) Virtual storage component diagram

Virtual Storage System (2) The virtual storage system is implemented with Java as a web service UniGrid services access the virtual storage system when they need to access user data A client program is available for users to manage his own storage space The files are stored in a master file server and replicas of the files are distributed to other SRB server

Virtual Storage System (3)

Virtual Storage System (4) Screenshot of the storage service client program

Data management system (1) Efficient file transfer Automatic replication Replica level

Data management system (2) Resc_1 Resc_2Resc_3Resc_4 Client replica_1replica_2replica_3replica_4 getData() Multi-source data transfer

Outline Introduction Portal and SSO Global Queue Resource Broker Job Scheduler Information Service Storage Service Applications

UbiStream Streaming data are abundant in our surroundings: – Length of queue at cafeteria – If the stadium is crowded or not – Live streaming of concerts or games – Course video/audio for e-learning Great demands to access these streaming data at any time, any place

System components Streaming source – Turn information in the surroundings into streaming data – Ex. Camera, sensor, counter Indexing mechanism – Make those data available to be searched Processing units – Further processing on raw data to provide better usage of them User interface – Display different kinds of streaming data

Scenario We want to show streaming data from tens or even hundreds of sources on our monitor screen simultaneously Machines on UniGrid are recruited to help shrinking the original screens to smaller size, and aggregate them in a single screen

Workflows Three main workflows involve: 1.Service discovery User queries, indexing server replies: –HTML layout –Addresses of decoders –Addresses of services 2.Interpreting the service Download decoders 3.Streaming data delivery Decoder fetches media streams

Sensor Indexing server User tier Processing units Camera Video-on- demand Processing tierSource tier Decoder server 1 2 3

Indexing server Query strings HTML Server translates XML metadata to HTML layout User tier Processing tierSource tier Workflow 1

Decoder server URL Browser downloads decoders (ActiveX) ActiveX (Optional:) Browser executes control logic (JavaScript) Logic interacts with decoders User tier Processing tierSource tier Workflow 2

ActiveX UniGrid …… Transcoding tree UniGrid portal Request Tree root Grid portal recruits machines User tier Processing tierSource tier Workflow 3

ActiveX UniGrid …… Transcoding tree Tree root Decoder fetches streaming data Customized query Media stream User tier Processing tierSource tier Workflow 3

Results

Conclusions and Future Work A prototype of Grid platform for researchers in Taiwan is established Invite more researchers join Taiwan UniGrid Participate in the Grid operation of NCHC Establish a Grid Computing Association Establish a Grid research office under NSC to promote Grid research in Taiwan