Grid Platform for Geospatial Applications & Fine Granule Scheduler Presented by Bin Zhou Bin Zhou, Jibo Xie, Chaowei Yang Joint Center for Intelligent.

Slides:



Advertisements
Similar presentations
Processes Management.
Advertisements

COURSE: COMPUTER PLATFORMS
Agreement-based Distributed Resource Management Alain Andrieux Karl Czajkowski.
SLA-Oriented Resource Provisioning for Cloud Computing
Chapter 1: Introduction
Universität Dortmund Robotics Research Institute Information Technology Section Grid Metaschedulers An Overview and Up-to-date Solutions Christian.
Introduction to Distributed Systems
Supporting Efficient Execution in Heterogeneous Distributed Computing Environments with Cactus and Globus Gabrielle Allen, Thomas Dramlitsch, Ian Foster,
The CrossGrid project Juha Alatalo Timo Koivusalo.
The Network Weather Service: A Distributed Resource Performance Forecasting Service for Metacomputing, Rich Wolski, Neil Spring, and Jim Hayes, Journal.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Present by Chen, Ting-Wei Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids Maria Chtepen, Filip H.A. Claeys, Bart Dhoedt,
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Workload Management Massimo Sgaravatto INFN Padova.
Software Issues Derived from Dr. Fawcett’s Slides Phil Pratt-Szeliga Fall 2009.
Introduction. Why Study OS? Understand model of operation –Easier to see how to use the system –Enables you to write efficient code Learn to design an.
Community Manager A Dynamic Collaboration Solution on Heterogeneous Environment Hyeonsook Kim  2006 CUS. All rights reserved.
SPRING 2011 CLOUD COMPUTING Cloud Computing San José State University Computer Architecture (CS 147) Professor Sin-Min Lee Presentation by Vladimir Serdyukov.
New Challenges in Cloud Datacenter Monitoring and Management
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Sergey Belov, Tatiana Goloskokova, Vladimir Korenkov, Nikolay Kutovskiy, Danila Oleynik, Artem Petrosyan, Roman Semenov, Alexander Uzhinskiy LIT JINR The.
Ajou University, South Korea ICSOC 2003 “Disconnected Operation Service in Mobile Grid Computing” Disconnected Operation Service in Mobile Grid Computing.
A.V. Bogdanov Private cloud vs personal supercomputer.
Cloud MapReduce : a MapReduce Implementation on top of a Cloud Operating System Speaker : 童耀民 MA1G Authors: Huan Liu, Dan Orban Accenture.
EMBEDDED SYSTEMS G.V.P.COLLEGE OF ENGINEERING Affiliated to J.N.T.U. By By D.Ramya Deepthi D.Ramya Deepthi & V.Soujanya V.Soujanya.
Part VII: Special Topics Introduction to Business 3e 18 Copyright © 2004 South-Western. All rights reserved. Using Information Technology.
RUNNING PARALLEL APPLICATIONS BEYOND EP WORKLOADS IN DISTRIBUTED COMPUTING ENVIRONMENTS Zholudev Yury.
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
An approach to Intelligent Information Fusion in Sensor Saturated Urban Environments Charalampos Doulaverakis Centre for Research and Technology Hellas.
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
ATLAS Off-Grid sites (Tier-3) monitoring A. Petrosyan on behalf of the ATLAS collaboration GRID’2012, , JINR, Dubna.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
DISTRIBUTED COMPUTING
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
Young Suk Moon Chair: Dr. Hans-Peter Bischof Reader: Dr. Gregor von Laszewski Observer: Dr. Minseok Kwon 1.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
المحاضرة الاولى Operating Systems. The general objectives of this decision explain the concepts and the importance of operating systems and development.
A Grid Computing Use case Datagrid Jean-Marc Pierson.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
The Value of Parallelism 16 th Meeting Course Name: Business Intelligence Year: 2009.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Silberschatz and Galvin  Operating System Concepts Module 1: Introduction What is an operating system? Simple Batch Systems Multiprogramming.
INFORMATION SYSTEM-SOFTWARE Topic: OPERATING SYSTEM CONCEPTS.
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
1 Alexandru V Staicu 1, Jacek R. Radzikowski 1 Kris Gaj 1, Nikitas Alexandridis 2, Tarek El-Ghazawi 2 1 George Mason University 2 George Washington University.
Operating System Principles And Multitasking
Introduction Object oriented design is a method where developers think in terms of objects instead of procedures or functions. SA/SD approach is based.
GRID activities in Wuppertal D0RACE Workshop Fermilab 02/14/2002 Christian Schmitt Wuppertal University Taking advantage of GRID software now.
Copyright © 2006, GemStone Systems Inc. All Rights Reserved. Increasing computation throughput with Grid Data Caching Jags Ramnarayan Chief Architect GemStone.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Introduction to Grid Computing and its components.
SensorWare: Distributed Services for Sensor Networks Rockwell Science Center and UCLA.
Bulk Data Transfer Activities We regard data transfers as “first class citizens,” just like computational jobs. We have transferred ~3 TB of DPOSS data.
Matthew Farrellee Computer Sciences Department University of Wisconsin-Madison Condor and Web Services.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
Grid Optical Burst Switched Networks
Introduction to Load Balancing:
Cloud Computing.
Basic Grid Projects – Condor (Part I)
Introduction to SOA and Web Services
Presentation transcript:

Grid Platform for Geospatial Applications & Fine Granule Scheduler Presented by Bin Zhou Bin Zhou, Jibo Xie, Chaowei Yang Joint Center for Intelligent Spatial Computing George Mason University

Agenda ➲ Grid Computing Introduction ➲ CISC & SURA Grid ➲ Geospatial Applications Require Grid ➲ CISC Fine Granule Scheduler ➲ Architecture,Strategy ➲ Progress Status

Grid Computing Introduction ➲ Definition Grid computing is an emerging computing infrastructure that treats all resources as a collection of manageable entities with common interfaces to such functionality as lifetime management, discoverable properties and accessibility via open protocols – wikipedia ➲ Popular Grid Middleware Condor Globus Condor-G Unicore

GMU grid environment SURAgrid GMU CISC GMU Grid can access the computing resources contributed by SURAgrid member universities

GMU grid environment LambdaRail GMU CISC Grid can setup 1-10Gbps connection to any of the LamdaRail supported Universities, Agencies, and Centers, such as GSFC & SDSC

CISC Computing Pool

Geospatial Requirements ➲ Large Data Set Map Data, Sensor Data, in Tera-bytes ➲ Reliability,Interoperability collaboration ➲ Intensive Computation More Complex Algorithms Adaptive Algorithms Intelligent Processing

Grid Computing Could Satisfy these requirements ➲ Reliable File Transfer ➲ Resource Management and Allocation ➲ Authorization & Control ➲ Job Control ➲ Web Service Oriented

Detecting Watersheds from multi-scale DEM ➲ Watershed boundaries are not known before processing massive data ➲ extract coarse watershed boundaries from multi-scale DEM ➲ Using the boundaries to decompose the massive data with some redundancy resample Extraction Xie 2006

Use 24 units to test the speed up (each unit is 3.08M) (Xie 2006)

CISC Test Applications s293s s Job Amount CPUs Executing Time Speed Up Efficiency s 1 1 Real Time Routing Test Result: The efficiency decreases with the CPU numbers because the overhead increase, but the major problem is Condor can’t handle small jobs efficient. Demonstrates the need for fine granule scheduler

Specific Applications: Fine- Grained Near Real Time Jobs ➲ Fine-Grained Very Short Executing Time Huge Amount Job Similarity ➲ Near Real Time Sensitive to scheduling latency example: Real-Time Routing, Short-Time stock prediction, Condor cannot be used for tasks that require less than 3.5 min to complete ---Gregg Cooke, IT Technical Council,"Evaluating Condor for Enterprise Use: A UBS Case Study"

CISC Scheduler ➲ Purpose improve near real time job response time improve mass Fine Granularity job throughput ➲ Scheduling Strategy Short Communicating Message Simple Match-Making Function Dynamic Index Multi-Dispatch

System Architecture TCP/UDP Socket File TransferProcessOther Lib Services Abstract Interface /APIs Message passingMemory System Function Dispatcher Collector Container Resource Manager Submitter Algorithm module Central ManagerWorkerUser Interface

Components

Job Work Flow

Prototype Overhead Test ➲ Test Case Insertion Sort 200,000 integers Dataset: 5.56M Execute File : 1.8M ➲ Test Platform OS: ubuntu 6.10 Network: 100Mbps CPU: Celeron M 1.6G Memory: 1G Job Amou nt File Transfer Time Job Executing Time Other Overhead Communicating Overhead Efficiency 11s27s0.4s18ms95.1% 54s154s1.2s20ms98.9%

Thanks Questions?