Design of Cloud Management Layer for High-Performance File Transfer

Outline
– Introduction
– Background
– System Design
– Implementation
– Experiments

Introduction
Motivation
– File transfer with load sharing and fault tolerance.

Introduction
[Figure: User (Client), Management Layer, Cluster Storage Server; arrows labeled "Upload Files" and "Files Download"]
Clients can upload files to the cloud, and the system is designed so that uploaded files are not lost.

Introduction (Cont.)
Surveillance Application
– Recording size: 0.2~0.5 GB per hour
– Storage: 2 TB
– Devices: 30 (sharing the host's storage)
– 2000 GB (2 TB) / 30 = 67 GB per device
– 67 GB / 0.3 GB per hour = 223 hours
– 223 hours / 24 hours = 9.3 days
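The retention estimate above can be reproduced with a few lines of Python. This is only a sketch of the slide's arithmetic. The function name `retention_days` and its parameters are illustrative, not part of the original system. The slide rounds each intermediate result, whereas the code rounds only at the end.

```python
def retention_days(total_gb: float, devices: int, gb_per_hour: float) -> float:
    """Days of recording each device can keep before its share of storage is full."""
    per_device_gb = total_gb / devices      # 2000 / 30, about 67 GB
    hours = per_device_gb / gb_per_hour     # about 222 h at 0.3 GB per hour
    return hours / 24


if __name__ == "__main__":
    # The slide uses 0.3 GB/h; 0.2 and 0.5 are the stated bounds of the range.
    for rate in (0.2, 0.3, 0.5):
        print(f"{rate} GB/h: {retention_days(2000, 30, rate):.1f} days")
```

At the slide's 0.3 GB per hour the result rounds to 9.3 days; the two bounds of the stated range show how strongly the retention time depends on the recording rate.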

Introduction (Cont.)
Platforms and APIs Used
– Hadoop
  – Hadoop Distributed File System (HDFS): file storage
  – HBase: user authentication
– Socket
  – File data transmission

Background
What is Cloud Computing?
– Scalable computing and storage resources.
– Cloud Source Models

Background (Cont.)
Hadoop
[Figure labels: Database, File System, Parallel Processing]
The other components of Hadoop
– Pig: dataflow language and parallel execution framework
– Hive: data warehouse infrastructure
– ZooKeeper: distributed coordination service
– Chukwa: system for collecting management data
– Avro: data serialization system

Background (Cont.)
What is the Hadoop Distributed File System (HDFS)?
[Figure: Clients (PC), Namenode, and Datanodes in the Cluster Storage Server; arrows labeled "Transmission"]

Background (Cont.)
HBase offers high availability, high performance, and flexible scalability.
Table: 't1'
– Row keys: r1, r2
– Column families: 'f1', 'f2', 'fn'
– Column qualifiers (*): 'c1', 'c2', 'c3', 'c4'
– Cell values: row r1 holds v1 and v2; row r2 holds v3 and v4

Outline
– Introduction
– Background
– System Design
– Implementation
– Experiments
– Conclusions
– Reference

System Design
Components
– Manager Selection
– Authentication
– Synchronization
– Cloud-Based File Transfer

System Design (Cont.)
Manager Selection
1. Connect to the management layer, which performs load balancing
2. Collect memory load information from each node
3. Select the node with the lowest memory load as the connection port (Node i)
4. Connect to Node i
5. Authentication
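Steps 2 and 3 of manager selection can be sketched as follows. This is a minimal illustration, not the deck's implementation: the `NodeLoad` type, the `select_manager` function, and the representation of memory load as a fraction between 0 and 1 are all assumptions, since the slides do not say how load is reported.

```python
from dataclasses import dataclass


@dataclass
class NodeLoad:
    name: str           # hypothetical node identifier
    memory_load: float  # assumed: fraction of memory in use, 0.0 to 1.0


def select_manager(loads: list[NodeLoad]) -> NodeLoad:
    """Return the node with the lowest reported memory load (Node i)."""
    if not loads:
        raise ValueError("no management node reported its load")
    # min() returns the first node on a tie, so the choice is deterministic.
    return min(loads, key=lambda node: node.memory_load)


if __name__ == "__main__":
    reported = [
        NodeLoad("node-1", 0.72),
        NodeLoad("node-2", 0.31),
        NodeLoad("node-3", 0.55),
    ]
    print(select_manager(reported).name)
```

The client would then open its connection to the selected node and proceed to authentication.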

System Design (Cont.)
Authentication
1. Enter the account and password
2. Authentication (HBase verification)
3. Synchronization
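The verification step might look like the sketch below. It makes several assumptions: a plain dictionary stands in for the HBase table (row key = account), the column names `auth:salt` and `auth:hash` are hypothetical, and storing a salted PBKDF2 hash rather than the password itself is a common practice that the slides neither confirm nor rule out.

```python
import hashlib
import hmac
import os


def hash_password(password: str, salt: bytes) -> bytes:
    """Derive a salted hash; iteration count chosen for illustration only."""
    return hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, 100_000)


def register(table: dict, account: str, password: str) -> None:
    """Store one row per account; `table` is a stand-in for an HBase table."""
    salt = os.urandom(16)
    table[account] = {"auth:salt": salt, "auth:hash": hash_password(password, salt)}


def authenticate(table: dict, account: str, password: str) -> bool:
    """Look up the row by account and compare hashes in constant time."""
    row = table.get(account)
    if row is None:
        return False
    candidate = hash_password(password, row["auth:salt"])
    return hmac.compare_digest(row["auth:hash"], candidate)


if __name__ == "__main__":
    users: dict = {}
    register(users, "alice", "s3cret")
    print(authenticate(users, "alice", "s3cret"))
    print(authenticate(users, "alice", "wrong"))
```

In a real deployment the dictionary lookups would be replaced by reads and writes against the HBase table through its client API.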

System Design (Cont.)
Synchronization
1. Synchronize HDFS files on the client side
2. Check file existence
3. Cloud-based file transfer
4. Background monitoring
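The existence check in step 2 can be pictured as a comparison of two file listings. The sketch below is an assumption about how that comparison could work. The slides do not specify the direction of synchronization or how conflicts are resolved, and the name `plan_sync` is illustrative.

```python
def plan_sync(local_files: set[str], remote_files: set[str]) -> dict[str, list[str]]:
    """Decide which files to transfer so client and HDFS hold the same set.

    Assumption: a file missing on one side is copied from the other side;
    files present on both sides are left alone (no content comparison).
    """
    return {
        "upload": sorted(local_files - remote_files),
        "download": sorted(remote_files - local_files),
    }


if __name__ == "__main__":
    local = {"cam1/0900.avi", "cam1/1000.avi"}
    remote = {"cam1/0900.avi", "cam2/0900.avi"}
    print(plan_sync(local, remote))
```

Each entry in the resulting plan would then be handed to the cloud-based file transfer step, and a background monitor could rerun the comparison when local files change.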

System Design (Cont.)
Cloud-Based File Transfer
1. Send a request over a socket
2. Analyze the packet (to extract the command and path)
3. Execute the command (Write/Send/Delete)
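Step 2, extracting the command and path from a request, can be sketched as below. The wire format is an assumption: the slides name the three commands but not the packet layout, so this example uses a hypothetical single text line of the form `COMMAND path`.

```python
VALID_COMMANDS = {"WRITE", "SEND", "DELETE"}


def parse_request(packet: bytes) -> tuple[str, str]:
    """Split a request packet into (command, path).

    Assumed format: b"COMMAND path\n", UTF-8 encoded. Everything after the
    first space is treated as the path, so paths may contain spaces.
    """
    text = packet.decode("utf-8").strip()
    command, separator, path = text.partition(" ")
    command = command.upper()
    if command not in VALID_COMMANDS or not separator or not path:
        raise ValueError(f"malformed request: {text!r}")
    return command, path


if __name__ == "__main__":
    print(parse_request(b"WRITE /user/alice/cam1/0900.avi\n"))
```

The management layer would then dispatch on the command: store incoming data for a write, stream the file back for a send, or remove the file for a delete.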

Implementation
Virtualization software
– VMware Workstation 8.0
OS environment
– Ubuntu Linux, Windows
Development tools
– Ubuntu Linux: C/C++, Java
– Windows: Borland C++Builder

Implementation (Cont.)
Client & Management Layer
– C++ Socket API
Management Layer
– Java HBase API
– C/C++ HDFS API
  – JNI (Java Native Interface)

Outline
– Introduction
– Background
– System Design
– Experiments
– Conclusions
– Reference

Experiments
Manager Selection
– Random assignment
– Load-balanced assignment
  – Least-loaded nodes selected: 5
  – Update interval: 10 seconds
  – Simultaneous connections allowed: 10
Experimental Environment 1
– Servers: 50, 100, 150, 200, 250
– Clients: 2000
Experimental Environment 2
– Servers: 50
– Clients: 1000, 2000, 3000, 4000,
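To make the two assignment policies concrete, the sketch below simulates them for one configuration (50 servers, 2000 clients). It does not reproduce the deck's experiments or results. The mapping of the parameters is an assumption: the 10-second update interval is modeled as refreshing the candidate list once per batch, the 10 simultaneous connections as a batch of 10 clients, and clients within a batch are spread round-robin over the 5 least-loaded servers. Load is counted as clients per server, whereas the system design uses memory load.

```python
import heapq
import random


def random_assignment(servers: int, clients: int, rng: random.Random) -> list[int]:
    """Each client connects to a uniformly random server."""
    loads = [0] * servers
    for _ in range(clients):
        loads[rng.randrange(servers)] += 1
    return loads


def balanced_assignment(servers: int, clients: int,
                        candidates: int = 5, batch: int = 10) -> list[int]:
    """Clients arrive in batches; each batch is spread over the least-loaded servers."""
    loads = [0] * servers
    assigned = 0
    while assigned < clients:
        # Refresh the candidate list: indices of the least-loaded servers.
        chosen = heapq.nsmallest(candidates, range(servers), key=loads.__getitem__)
        for i in range(min(batch, clients - assigned)):
            loads[chosen[i % len(chosen)]] += 1
            assigned += 1
    return loads


if __name__ == "__main__":
    rand = random_assignment(50, 2000, random.Random(0))
    bal = balanced_assignment(50, 2000)
    print("random   max load:", max(rand))
    print("balanced max load:", max(bal))
```

Under these assumptions the balanced policy keeps every server at the average of 40 clients, while random assignment leaves some servers noticeably above it.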

Experiments (Cont.)
Experiment 1
– Servers: 50, 100, 150, 200, 250
– Clients:

Experiments (Cont.)
Experiment 2
– Servers: 50
– Clients: 1000, 2000, 3000, 4000,