Virtualization Framework for Data Service on GLEON and CREON Fang-Pang Lin NCHC PRAGMA HK, March 2011.

Slides:



Advertisements
Similar presentations
Spatial Ontology Community of Practice Workshop, USGS, Dec.2, Using Knowledge to Facilitate Better Data Discovery, Access, and Utilization for CloudGIS.
Advertisements

Telescience 19. Presentations: 7 Presentations E-Culture (Osaka Knowledge Capital City Exhibition; Nara 1300 th Anniversary) (NICT) KLEON (KISTI)
Shinji Shimojo Fang-Pang Lin Telescience WG PRAGMA 17.
1 N C H C 2007 /09/28 Flood Grids Fang-Pang Lin PRAGMA Institute, NCSA, 28, Sept, 2007.
TDW Teams Presenter : Yi-Hsuan Chen Contact : National Center for High-performance Computing, Taiwan Date: 14/07/2009 A Distributed Architecture.
10 september 2002 A.Broersen Developing a Virtual Piano Playing Environment By combining distributed functionality among independent Agents.
Paul Hanson, Fang-Pang Lin, Miron Livny, Chin Wu, Chris Solomon, Many colleagues of the GLEON Transforming ecological sensor networks from data collectors.
Atlas Server – A Tool for Atlas Mapping Altai State Technical University Public Fund Altai 21-st Century Barnaul, Russia Irina Mikhailidi.
ASIAES Project Overview Satellite Image Network for Natural Hazard Management in ASEAN+3 region Pakorn Apaphant Geo-Informatics and Space Technology Development.
Walid Saleh Regional coordinator, MENA Region Enhancing the use of Science in International Waters projects to improve project results.
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Distributed Data Processing
INTERNATIONAL INSTITUTE FOR GEO-INFORMATION SCIENCE AND EARTH OBSERVATION Towards quality-aware Infrastructures for Geographic Information Services Richard.
NG-CHC Northern Gulf Coastal Hazards Collaboratory Simulation Experiment Integration Sandra Harper 1, Manil Maskey 1, Sara Graves 1, Sabin Basyal 1, Jian.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Venkatesh Merwade, Purdue University
A Java Architecture for the Internet of Things Noel Poore, Architect Pete St. Pierre, Product Manager Java Platform Group, Internet of Things September.
C van Ingen, D Agarwal, M Goode, J Gupchup, J Hunt, R Leonardson, M Rodriguez, N Li Berkeley Water Center John Hopkins University Lawrence Berkeley Laboratory.
What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.
International Centre for Integrated Mountain Development Kathmandu, Nepal International Centre for Integrated Mountain Development Kathmandu, Nepal Mobile.
BlogMyData A Virtual Research Environment for collaborative visualization of environmental data Andrew Milsted | 14 September 2010.
The Internet Useful Definitions and Concepts About the Internet.
Sheldon Brown, UCSD, Site Director Milton Halem, UMBC Director Yelena Yesha, UMBC Site Director Tom Conte, Georgia Tech Site Director Fundamental Research.
SESSION 9 THE INTERNET AND THE NEW INFORMATION NEW INFORMATIONTECHNOLOGYINFRASTRUCTURE.
SensIT PI Meeting, April 17-20, Distributed Services for Self-Organizing Sensor Networks Alvin S. Lim Computer Science and Software Engineering.
Development of Japanese GIS Tool for use in the Humanities ○ Masatoshi ISHIKAWA †, Yoichi KAWANISHI ††, Hidefumi OKUMURA †††, Shoichiro HARA †††† † University.
Client-Server Processing and Distributed Databases
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Internet GIS. A vast network connecting computers throughout the world Computers on the Internet are physically connected Computers on the Internet use.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Chapter 12 Designing Distributed and Internet Systems
About CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Interoperability ERRA System.
HOW ACCESS TO WWW Student Name : Hussein Alkhaldi.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED.
Water Web Services David R. Maidment Center for Research in Water Resources University of Texas at Austin Open Waters Symposium Delft, the Netherlands.
POAD Distributed System Case Study: A Medical Informatics System Instructor: Dr. Hany H. Ammar Dept. of Computer Science and Electrical Engineering, WVU.
Geoscience WG update PRAGMA 25 Whey-Fone Tsai Yoshio Tanaka Sarawut Ninsawat Sornthep Vannarat.
Challenges in Urban Meteorology: A Forum for Users and Providers OFCM Workshop Summaries Lt Col Rob Rizza Assistant Federal Coordinator for USAF/USA Affairs.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
MySQL and PHP Internet and WWW. Computer Basics A Single Computer.
1 MSCS 237 Overview of web technologies (A specific type of distributed systems)
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
TEMPLATE DESIGN © E-Eye : A Multi Media Based Unauthorized Object Identification and Tracking System Tolgahan Cakaloglu.
Internet of Things. IoT Novel paradigm – Rapidly gaining ground in the wireless scenario Basic idea – Pervasive presence around us a variety of things.
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
WEB SERVER SOFTWARE FEATURE SETS
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
COM: 111 Introduction to Computer Applications Department of Information & Communication Technology Panayiotis Christodoulou.
1 VRoIP (Virtual Reality over IP) NCHC TDW TaskForce Jacky Chih-Lung Chang
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
E-commerce Architecture Ayşe Başar Bener. Client Server Architecture E-commerce is based on client/ server architecture –Client processes requesting service.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
© 2007 IBM Corporation IBM Software Strategy Group IBM Google Announcement on Internet-Scale Computing (“Cloud Computing Model”) Oct 8, 2007 IBM Confidential.
Source: Paul Hanson. Collaboration in Environmental Science Global Lake Ecological Observatory Network A grassroots network of –People: lake scientists,
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Enhancements to Galaxy for delivering on NIH Commons
REAL-TIME DETECTOR FOR UNUSUAL BEHAVIOR
Distributed Control and Measurement via the Internet
2nd GEO Data Providers workshop (20-21 April 2017, Florence, Italy)
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
System Design of Internet-of-Things for Residential Smart Grid
CHAPTER 3 Architectures for Distributed Systems
Staying afloat in the sensor data deluge
Support for ”interactive batch”
WIS Strategy – WIS 2.0 Submitted by: Matteo Dell’Acqua(CBS) (Doc 5b)
Big DATA.
Presentation transcript:

Virtualization Framework for Data Service on GLEON and CREON Fang-Pang Lin NCHC PRAGMA HK, March 2011

GLEON: revolutionizing understanding of aquatic ecosystems through an international grassroots network of people, data, and lake observatories 28 Site Members (sites shown) 208 Individual Members (5Sep10)

Requirements revisit Connecting Sciences based on ecosystems of lakes & coral reefs: – Providing sociological and economic impacts in conservation, planning, decision making, risk management, climate change …etc. Reference Models – GLEON: based on mass conservation in dynamics of DOC (Dissolved Organic Carbon) of lake system. -CREON: yet to be listed. -NCHC currently uses Knowledge4Fish as a driver.

Wish list from GLEON Scale up Current GLEON data in a geographical distribution. Add Meteorological data Add coordinates or Geometry data – 2D and/or 3D depending on availability for sites of interest Land use: – land coverage, grass land, forests, soil types (mostly of remote sensing data) to be expected to connect to social economical variables. Hydrological information: – watersheds (boundary definitions), rivers, underground waters … etc.

Services provided in GLEON Central – Compute Service: CONDOR service: (virtualized in PRAGMA by phil et al.) – A front-end GUI allowing users to enter and to upload input data, and a clear separation of the backend CONDOR production system. Also provide a Web-based Viz system for 2D graphics for results. – Data Service: GLEON data set: web-UI based on a set of tools from Luke and CFL colleagues. Lake-base : (Paul Hanson et al.) – It provides internet scale synthesized data, harvested from internet and also outstandingly from national agency open data such as USGS. 2D Satellite Image service from AIST Geogrid (Sekiguchi, Tanaka, Ryosuke, Sarawut et al) - Introduced but not used (training ?!)

IT Challenges for GLEON Availability: – Real-time streaming and automation issues are not crucial momentarily, hence weaken the needs for scaling up the physical data network for GLEON sites. Yet we conjecture this will be the driver for new science. Performance: – Current DB is not big. If the wish list realized, we may expect big data. – Use file-based service in a Cloud fashion. It can handle simulation and observational data all together with performance. Needs both internal data policy and standards. GIS extension: – OGC standards are well supported in governmental agencies and used extensively in data exchange between major proprietary and public GIS systems. But OGC needs expert to work on!

Virtualization Framework: 4 Layers of Abstraction Observational System Data Center System Automation Knowledge Sharing

Layer 1: Generic Observing System Architecture Focus: Move computation into the field with Embedded Cyberinfrastructure Sensors Cluster Head: aggregation point for sensors. Last IP- addressable point in network Gateway Node: entry point to the Internet A generic architecture facilitates scalability, robustness, reproducibility, and efficiency. Source: Sameer Tilak Move intelligence closer to the local

Layer 2: Data Center Architecture based on OGC standards Source: Sameer Tilak Hide the complexity of resources provisioning

Layer 3: Simple but Broad Automation Data Meta-data Ontologies Acquisition protocols Acquisition protocols Argument/analysis Sensors Human reporters Scientists Models Analysis protocols Analysis protocols Source: Dave Robertson Enable understanding between components

Layer 4: Sharing Experiment Protocols ( request protocolrequest plugin OpenKnowledge kernel supplier Share knowledge for connecting sciences Source: Dave Robertson

GLEON Service Model Revisit GLEON Domain GLEON Central Site C Site B GLEON data policy GLEON Control vocabulary vega Site A Direct collaboration Data Center (e.g. PRAGMA- CONDOR)

3 Types of Service Models Typical Web Service Big Data Service Streaming Data Service

Typical Web Service db External client Query Result HTTP server Application server Application server Application server Application server Data center Examples: Web sites serving dynamic content Characteristics: Small queries and results Little client computation Moderate server computation Moderate data accessed per query Source: David OHallaron

Big Data Service Parallel compute server d1d1 d2d2 d3d3 External client Parallel data server Query Source dataset Derived datasets Parallel file system (e.g., GFS, HDFS) Result Data-intensive computing system (e.g. Hadoop) Parallel query server External data sources Examples: Search Photo scene completion Log processing Science analytics Characteristics: Small queries and results Massive data and computation performed on server Source: David OHallaron

Streaming Data Service Parallel compute server d1d1 d2d2 d3d3 Parallel data server Continuous query stream Source dataset Derived datasets Continuous query results Parallel query server External data sources Characteristics: Application lives on client Client uses cloud as an accelerator Data transferred with query Variable, latency sensitive HPC on server Often combines with Big Data service Examples: Perceptual computing on high data-rate sensors: real time brain activity detection, object recognition, gesture recognition External client and sensors Source: David OHallaron

Exmaple for CREON: Fish4Knowledge Architecture 4.2 GB & 5000 image files per minute Source: Bob Fisher

Source: Fish4Knowledge – EU FP-7 project

Live streaming: MonitorGrid Architecture Stream ReceiverImage Processor Image Managing & Browsing NFS Capture Devices Display Devices NFS (LCD, HDTV, Mobile screen, TDW, and etc.) (DV, HDV, CCTV, Web CAM, IP CAM, Capture card, and etc.) Retrieve and divide the stream into each frame sliders in its owned round-robin queue. Perform the motion detection / stream encoding in real- time. InI – Internet Navigation Interface. / Management interface.

Stream Receiver Image Processor Image Managing & Browsing NFS Capture Devices Display Devices NFS (LCD, HDTV, Mobile screen, TDW, and etc.) (DV, HDV, CCTV, Web CAM, IP CAM, Capture card, and etc.) Round-robin Queue

Image Processor Stream ReceiverImage Processor Image Managing & Browsing NFS Capture Devices Display Devices NFS (LCD, HDTV, Mobile screen, TDW, and etc.) (DV, HDV, CCTV, Web CAM, IP CAM, Capture card, and etc.) Codec MJPEG MPEG1/2/4 SWF/FLV WMV Motion Detection Image Segmentation Object Tracking Image Retrieval

Image Management and Browsing Stream ReceiverImage Processor Image Managing & Browsing NFS Capture Devices Display Devices NFS (LCD, HDTV, Mobile screen, TDW, and etc.) (DV, HDV, CCTV, Web CAM, IP CAM, Capture card, and etc.) InI for Web browsing Direct streaming History info. database Query

Display Interface