Experience with Fuzzy Distributed Computing (Cloud Computing) in Geoinformatics. M.N. Zhizhin, Geophysical Center and Space Research Institute RAS.



New technologies and innovations
- Long-term preservation with metadata and lineage (Virtual Observatories)
- Parallel/distributed data storage with interactive data query and network transfer of large datasets (MapReduce)
- Relational -> Object -> XML -> Array databases (SciDB)
- HPC data processing and modeling algorithms (Grid)
- Event detection, interrelation and data mining (AlphaSearch)
- Web technologies for visualization of different data types with geolocation (Neogeography)
- Collaborative data visualization (Videowalls)
- Scalable virtualization of CPU/network/storage resources (Cloud Computing)

Multiplets of regional earthquakes

Downhole multipoint measurement at Soultz geothermal reservoir

Global Lambda Integrated Facility: Available Advanced Network Resources
GLIF is a consortium of institutions, organizations, consortia and national research and education networks that voluntarily share optical networking resources and expertise to develop the Global LambdaGrid for the advancement of scientific collaboration and discovery.
Visualization courtesy of Bob Patterson, NCSA; data compilation by Maxine Brown, UIC. Source: Joe Mambretti

GLORIAD: 10Gb Worldwide Ring
Source: Natalia Bulashova

USA-Russia Lightpath for Fast Data Transfer of Terabyte-sized Scientific Datasets
- The National Center for Data Mining (NCDM) at the University of Illinois at Chicago, the Geophysical Center RAS and the Space Research Institute RAS moved 1.4 TB of data in 4.5 hours over a 1 Gbps lightpath between Chicago and Moscow as part of the Teraflow Network initiative.
- Using NCDM's open-source UDP-based Data Transfer protocol (UDT), we transferred the MS SQL database holding the SDSS astronomy catalog. The 2.5 TB database dump was compressed to 1.4 TB, split into 60 files, transferred over the 1 Gbps lightpath, then decompressed in Moscow and loaded back into MS SQL Server.
- The SkyServer portal and the SDSS database were developed by Jim Gray at MSR and Alex Szalay at JHU. A Russian-language mirror now resides in Moscow.
- A direct lightpath link from IKI in Moscow to NGDC NOAA in Boulder has been successfully tested.
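The transfer figures above imply an average throughput that can be checked with a few lines of arithmetic. The sketch below assumes decimal units (1 TB = 10^12 bytes), which the slide does not state; the variable names are illustrative only.

```python
# Figures quoted on the slide. Decimal units (1 TB = 10**12 bytes) are an assumption.
TB = 10**12
compressed_bytes = 1.4 * TB      # size actually sent over the lightpath
original_bytes = 2.5 * TB        # size of the uncompressed database dump
duration_s = 4.5 * 3600          # 4.5 hours in seconds
link_bps = 1e9                   # nominal 1 Gbps lightpath


def throughput_bps(num_bytes, seconds):
    """Average throughput in bits per second."""
    return num_bytes * 8 / seconds


wire_rate = throughput_bps(compressed_bytes, duration_s)
utilization = wire_rate / link_bps
effective_rate = throughput_bps(original_bytes, duration_s)
compression_ratio = original_bytes / compressed_bytes

print(f"average wire rate:   {wire_rate / 1e6:.0f} Mbps")
print(f"link utilization:    {utilization:.0%}")
print(f"effective data rate: {effective_rate / 1e6:.0f} Mbps (uncompressed payload)")
print(f"compression ratio:   {compression_ratio:.2f}")
```

Under these assumptions the link carried roughly 690 Mbps on average, about 69% of its nominal capacity. Counting the uncompressed payload, the effective rate exceeds the raw link speed, which is the benefit of compressing before transfer.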

Russian Skyserver mirror:

Past Observations + Predictive Model = Reanalysis
1. Direct observations in the past: raw and processed data, e.g. from a meteorological station or satellite; 10^5 observations of the atmosphere every 6 h.
2. Predictive numerical model: "knows" physics and uses direct observations as boundary values, e.g. a Global Circulation Model with 360 lat × 180 lon × 20 levels × 100 parameters = 1.3 × 10^8 data values every 6 hours.
3. Reanalysis: accumulated output of the numerical model forecasts, each corrected for the available direct observations, over a long time period: 50 years at a 6 h time step.
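The grid arithmetic above can be extended to a rough size estimate for the whole reanalysis. The sketch below uses the dimensions from the slide. The mean year length and the 4 bytes per value are assumptions added for illustration, not figures from the presentation.

```python
# Grid dimensions and time step quoted on the slide.
N_LAT, N_LON, N_LEVELS, N_PARAMS = 360, 180, 20, 100
STEP_HOURS = 6
YEARS = 50

# Assumptions, not from the slide: mean year length and 4-byte float storage.
DAYS_PER_YEAR = 365.25
BYTES_PER_VALUE = 4

values_per_step = N_LAT * N_LON * N_LEVELS * N_PARAMS
steps = int(YEARS * DAYS_PER_YEAR * 24 / STEP_HOURS)
total_values = values_per_step * steps
total_bytes = total_values * BYTES_PER_VALUE

print(f"values per 6 h step: {values_per_step:.3e}")
print(f"time steps in {YEARS} years: {steps}")
print(f"total values: {total_values:.3e}")
print(f"uncompressed size at {BYTES_PER_VALUE} bytes/value: {total_bytes / 1e12:.1f} TB")
```

With these assumptions the model output is about 1.3 × 10^8 values per step and on the order of 10^13 values over 50 years, tens of terabytes before compression.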

Why OGSA-DAI service container?
- Standard tool in the Grid community
- Supports distributed workflow (in version 3.*)
- Built-in support for asynchronous transactions
- Compatible with Web (Axis) and Grid (OMII, UNICORE, GT4)
- Looked at alternatives such as OPeNDAP, WCS, …; documentation of our analysis is available
- Problem 1: it is very complex. Solution: REST wrapper
- Problem 2: supports only File, SQL and XML data types and queries. Solution: implement additional data sources and functions for data in multidimensional arrays
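To illustrate the idea behind the two solutions, here is a minimal sketch of how a REST-style query string could be mapped onto a subset of a multidimensional array. It does not use the OGSA-DAI API. The query syntax (`name=start:stop`), the function names and the toy data are all hypothetical.

```python
from urllib.parse import parse_qs


def parse_slices(query, dims):
    """Turn a query such as 'time=1:2&lon=0:2' into one slice per dimension.

    Slices are returned in the order given by `dims`; a dimension that is
    missing from the query selects its full range.
    """
    params = parse_qs(query)
    slices = []
    for name in dims:
        if name in params:
            start, stop = params[name][0].split(":")
            slices.append(slice(int(start), int(stop)))
        else:
            slices.append(slice(None))
    return slices


def subset(array, slices):
    """Apply one slice per nesting level of a list-of-lists array."""
    if not slices:
        return array
    head, rest = slices[0], slices[1:]
    return [subset(item, rest) for item in array[head]]


# Toy 2 x 3 x 4 array indexed as (time, lat, lon); each value encodes its indices.
data = [[[100 * t + 10 * y + x for x in range(4)] for y in range(3)]
        for t in range(2)]

result = subset(data, parse_slices("time=1:2&lon=0:2", ("time", "lat", "lon")))
print(result)
```

A real wrapper would translate such a request into a data-service workflow rather than slicing in-memory lists. The sketch only shows how a flat URL can address a hyperslab of an array.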

Web technologies for visualization of different data types with geolocation
- KML & GeoRSS
- Web services for CDM data sources
- OGC Web Map Services WMS/WFS/WCS
- MS Virtual Earth
- Google Maps
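As an example of one of the listed interfaces, the sketch below builds an OGC WMS 1.1.1 `GetMap` request URL using the standard query parameters. The endpoint `http://example.org/wms` and the layer name `temperature` are placeholders, not services mentioned in the presentation.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def wms_getmap_url(endpoint, layer, bbox, width, height,
                   srs="EPSG:4326", image_format="image/png"):
    """Build a WMS 1.1.1 GetMap URL; bbox is (minx, miny, maxx, maxy)."""
    params = {
        "SERVICE": "WMS",
        "VERSION": "1.1.1",
        "REQUEST": "GetMap",
        "LAYERS": layer,
        "STYLES": "",  # empty value requests the default style
        "SRS": srs,
        "BBOX": ",".join(str(v) for v in bbox),
        "WIDTH": width,
        "HEIGHT": height,
        "FORMAT": image_format,
    }
    return endpoint + "?" + urlencode(params)


# Hypothetical endpoint and layer, for illustration only.
url = wms_getmap_url("http://example.org/wms", "temperature",
                     (-180, -90, 180, 90), 1024, 512)
print(url)

# Decode the query string again to confirm what a server would receive.
decoded = parse_qs(urlsplit(url).query, keep_blank_values=True)
print(decoded["BBOX"][0])
```

WMS 1.3.0 uses `CRS` instead of `SRS` and changes the axis order for some coordinate systems, so the parameter set above applies to version 1.1.1 only.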

TerraServer tile server by Jim Gray
- Large database on the Web (3 TB)
- Operational since June 1998
- Public access to USGS topo maps and aerial images
- Low resolution images
- No global coverage
- GPS market not ready

Core box set image pre-processing
- At the core warehouse, images are acquired for the whole box set
- To visualize them, we split them into separate samples
Figure labels: Original box sets; Processed

New ways to mashup raster data

Above the Clouds: A Berkeley View of Cloud Computing
Cloud Computing refers to both the applications delivered as services over the Internet and the hardware and systems software in the datacenters that provide those services. The services themselves have long been referred to as Software as a Service (SaaS). The datacenter hardware and software is what we call a Cloud. When a Cloud is made available in a pay-as-you-go manner to the public, we call it a Public Cloud; the service being sold is Utility Computing: Amazon Web Services, Google AppEngine, and Microsoft Azure.

Amazon AWS, Microsoft Azure and Google AppEngine compared

VM
- Amazon AWS: x86 32- and 64-bit architecture via Xen VM; computation elasticity allows scalability, but the developer must build the machinery, or a third party must provide it
- Microsoft Azure: Microsoft Common Language Runtime (CLR) VM; automatic load balancing
- Google AppEngine: predefined application in Python; persistent state stored in MegaStore; automatic scaling

Storage
- Amazon AWS: range of models from block store (EBS) to augmented key/blob store (SimpleDB); scaling varies from no scaling (EBS) to fully automatic (SimpleDB, S3); APIs vary from standardized (EBS) to proprietary (S3)
- Microsoft Azure: SQL Data Services (restricted view of SQL Server); Azure storage service
- Google AppEngine: MegaStore/BigTable

Network
- Amazon AWS: declarative specification of topology; Security Groups; Availability zones; Elastic IP addresses provide a persistent name
- Microsoft Azure: automatic, based on roles
- Google AppEngine: fixed topology for 3-tier webapps; automatic scaling

How to deploy SPIDR in Cloud?
Single instance (diagram labels):
- Services: S3, EBS, EC2
- Components: SPIDR webapp & web services; MySQL databases
- Operations: database dump; file system snapshot; VM snapshot; bundle VM image

Can we support multiple SPIDRs? In different Amazon cloud regions?
Yes! Either:
- Launch several instances of the SPIDR VM
- Configure DNS round-robin for load balancing
- Run the MySQL master on the first instance and MySQL slaves on the others
Or:
- Use third-party high-availability products for the Amazon cloud, such as RightScale
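The DNS round-robin step can be illustrated with a small simulation. The sketch below is a toy stand-in for a DNS server that rotates its address list on every query. The class name and the addresses (taken from the 192.0.2.0/24 documentation range) are hypothetical, and no real DNS or Amazon API is involved.

```python
from collections import Counter, deque


class RoundRobinResolver:
    """Toy resolver that rotates its list of addresses after each query."""

    def __init__(self, addresses):
        self._records = deque(addresses)

    def resolve(self):
        """Return the current address order, then rotate it by one."""
        answer = list(self._records)
        self._records.rotate(-1)
        return answer


# Hypothetical addresses of three SPIDR VM instances.
instances = ["192.0.2.10", "192.0.2.11", "192.0.2.12"]
resolver = RoundRobinResolver(instances)

# Clients commonly connect to the first address in the answer.
chosen = [resolver.resolve()[0] for _ in range(9)]
load = Counter(chosen)

for address in instances:
    print(address, load[address])
```

Each instance receives an equal share of new connections in this idealized model. In practice, client-side caching and record lifetimes make the distribution less even, and round-robin alone does not detect a failed instance.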

Clouds above Grid: Cumulus Nimbus experiment in SKIF-Grid, fall 2009

Cloud VMs managed as Grid jobs

Condor Grid deployed in Cloud