An Architecture for Distributed High Performance Video Processing in the Cloud 2012.06.28 Speaker : 吳靖緯 MA0G0101 2010 IEEE 3rd International Conference.

Slides:



Advertisements
Similar presentations
Digital Library Service – An overview Introduction System Architecture Components and their functionalities Experimental Results.
Advertisements

SDN + Storage.
LIBRA: Lightweight Data Skew Mitigation in MapReduce
Building Cloud-ready Video Transcoding System for Content Delivery Networks(CDNs) Zhenyun Zhuang and Chun Guo Speaker: 饒展榕.
EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
Load Rebalancing for Distributed File Systems in Clouds Hung-Chang Hsiao, Member, IEEE Computer Society, Hsueh-Yi Chung, Haiying Shen, Member, IEEE, and.
CLive Cloud-Assisted P2P Live Streaming
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Authors: Thilina Gunarathne, Tak-Lon Wu, Judy Qiu, Geoffrey Fox Publish: HPDC'10, June 20–25, 2010, Chicago, Illinois, USA ACM Speaker: Jia Bao Lin.
Efficient Support for Interactive Browsing Operations in Clustered CBR Video Servers IEEE Transactions on Multimedia, Vol. 4, No.1, March 2002 Min-You.
MapReduce in the Clouds for Science CloudCom 2010 Nov 30 – Dec 3, 2010 Thilina Gunarathne, Tak-Lon Wu, Judy Qiu, Geoffrey Fox {tgunarat, taklwu,
DAvinCi: A Cloud Computing Framework for Service Robots
PARALLEL DBMS VS MAP REDUCE “MapReduce and parallel DBMSs: friends or foes?” Stonebraker, Daniel Abadi, David J Dewitt et al.
Final Project: Video Transcoding on Cloud Environments Queenie Wong CMPT 880.
Exploiting Virtualization for Delivering Cloud based IPTV Services Speaker : 吳靖緯 MA0G IEEE Conference on Computer Communications Workshops.
U.S. Department of the Interior U.S. Geological Survey David V. Hill, Information Dynamics, Contractor to USGS/EROS 12/08/2011 Satellite Image Processing.
Ch 4. The Evolution of Analytic Scalability
B 葉彥廷 B 林廷韋 B 王頃恩. Why we choose this topic Introduction Programming Model Example Implementation Conclusion.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Introduction to Parallel Programming MapReduce Except where otherwise noted all portions of this work are Copyright (c) 2007 Google and are licensed under.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Cloud MapReduce : a MapReduce Implementation on top of a Cloud Operating System Speaker : 童耀民 MA1G Authors: Huan Liu, Dan Orban Accenture.
資訊工程系智慧型系統實驗室 iLab 南台科技大學 1 Optimizing Cloud MapReduce for Processing Stream Data using Pipelining 出處 : 2011 UKSim 5th European Symposium on Computer Modeling.
THE HOG LANGUAGE A scripting MapReduce language. Jason Halpern Testing/Validation Samuel Messing Project Manager Benjamin Rapaport System Architect Kurry.
Pepper: An Elastic Web Server Farm for Cloud based on Hadoop Author : S. Krishnan, J.-S. Counio Date : Speaker : Sian-Lin Hong IEEE International.
An Architecture for Video Surveillance Service based on P2P and Cloud Computing Yu-Sheng Wu, Yue-Shan Chang, Tong-Ying Juang, Jing-Shyang Yen speaker:
Authors: Jiann-Liang Chenz, Szu-Lin Wuy,Yang-Fang Li, Pei-Jia Yang,Yanuarius Teofilus Larosa th International Wireless Communications and Mobile.
Document Indexing: SPIMI
MapReduce: Simplified Data Processing on Large Clusters Jeffrey Dean and Sanjay Ghemawat.
SecureMR: A Service Integrity Assurance Framework for MapReduce Author: Wei Wei, Juan Du, Ting Yu, Xiaohui Gu Source: Annual Computer Security Applications.
An Approach for Processing Large and Non-uniform Media Objects on MapReduce-Based Clusters Rainer Schmidt and Matthias Rella Speaker: Lin-You Wu.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Introduction to Hadoop and HDFS
Optimizing Cloud MapReduce for Processing Stream Data using Pipelining 作者 :Rutvik Karve , Devendra Dahiphale , Amit Chhajer 報告 : 饒展榕.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
Pregel: A System for Large-Scale Graph Processing Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and.
Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters Hung-chih Yang(Yahoo!), Ali Dasdan(Yahoo!), Ruey-Lung Hsiao(UCLA), D. Stott Parker(UCLA)
An Architecture for Distributed High Performance Video Processing in the Cloud 作者 :Pereira, R.; Azambuja, M.; Breitman, K.; Endler, M. 出處 :2010 IEEE 3rd.
Performance Evaluation of Image Conversion Module Based on MapReduce for Transcoding and Transmoding in SMCCSE Speaker : 吳靖緯 MA0G IEEE.
Optimizing Cloud MapReduce for Processing Stream Data using Pipelining 2011 UKSim 5th European Symposium on Computer Modeling and Simulation Speker : Hong-Ji.
A Collaborative Framework for Scientific Data Analysis and Visualization Jaliya Ekanayake, Shrideep Pallickara, and Geoffrey Fox Department of Computer.
Parallelizing Video Transcoding Using Map-Reduce-Based Cloud Computing Speaker : 童耀民 MA1G0222 Feng Lao, Xinggong Zhang and Zongming Guo Institute of Computer.
Computing Scientometrics in Large-Scale Academic Search Engines with MapReduce Leonidas Akritidis Panayiotis Bozanis Department of Computer & Communication.
U N I V E R S I T Y O F S O U T H F L O R I D A Hadoop Alternative The Hadoop Alternative Larry Moore 1, Zach Fadika 2, Dr. Madhusudhan Govindaraju 2 1.
Research of P2P Architecture based on Cloud Computing Speaker : 吳靖緯 MA0G0101.
Massive Semantic Web data compression with MapReduce Jacopo Urbani, Jason Maassen, Henri Bal Vrije Universiteit, Amsterdam HPDC ( High Performance Distributed.
Toward Efficient and Simplified Distributed Data Intensive Computing IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 22, NO. 6, JUNE 2011PPT.
PDAC-10 Middleware Solutions for Data- Intensive (Scientific) Computing on Clouds Gagan Agrawal Ohio State University (Joint Work with Tekin Bicer, David.
A N I N - MEMORY F RAMEWORK FOR E XTENDED M AP R EDUCE 2011 Third IEEE International Conference on Coud Computing Technology and Science.
CloudPP: A Novel Cloud-based P2P Live Video Streaming Platform with SVC technology Speaker : 吳靖緯 MA0G th International Conference.
Video Streaming Transmission Over Multi-channel Multi-path Wireless Mesh Networks Speaker : 吳靖緯 MA0G WiCOM '08. 4th International.
Authors: Jiann-Liang Chenz, Szu-Lin Wuy, Yang-Fang Li, Pei-Jia Yang,
Load Rebalancing for Distributed File Systems in Clouds.
MapReduce: Simplied Data Processing on Large Clusters Written By: Jeffrey Dean and Sanjay Ghemawat Presented By: Manoher Shatha & Naveen Kumar Ratkal.
Sushant Ahuja, Cassio Cristovao, Sameep Mohta
Data Management on Opportunistic Grids
Advanced Topics in Concurrency and Reactive Programming: Case Study – Google Cluster Majeed Kassis.
Distributed Operating Systems
Sindhusha Doddapaneni
Database Applications (15-415) Hadoop Lecture 26, April 19, 2016
湖南大学-信息科学与工程学院-计算机与科学系
Cse 344 May 4th – Map/Reduce.
Ch 4. The Evolution of Analytic Scalability
Distributed computing deals with hardware
Lecture 16 (Intro to MapReduce and Hadoop)
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
The Gamma Database Machine Project
COS 518: Distributed Systems Lecture 11 Mike Freedman
MapReduce: Simplified Data Processing on Large Clusters
Distributed Systems and Concurrency: Map Reduce
Presentation transcript:

An Architecture for Distributed High Performance Video Processing in the Cloud Speaker : 吳靖緯 MA0G IEEE 3rd International Conference On Cloud Computing (CLOUD), Page(s): 482 – 489, July 2010 Authors: Rafael Pereira, Marcello Azambuja, Karin Breitman, Markus Endler

Outline Introduction Background and Problem Statement Distributed Video Compression Discussion Conclusion 2

Introduction Such variable demand can have different causes, such as an unanticipated burst of client requests, a time-critical simulation, or a high volume of simultaneous video uploads. In this paper, we propose the Split&Merge architecture for high performance video processing, a generalization of the MapReduce paradigm that rationalizes the use of resources by exploring on demand computing. 3

Background and Problem Statement The process is illustrated in Figure 1. 4

Background and Problem Statement One of the largest HaaS providers in the public Cloud is Amazon AWS, with its Elastic Cloud Computing (EC2) and Simple Storage Service (S3). However, the Map Reduce architecture isn’t generic enough to be used in all classes of problems, such as the use of different Reduce algorithms for some specific pieces of information, or the chunk ordering before the Reduce step. 5

Background and Problem Statement The order in which pieces of audio and video are recombined after having been processed must also be taken into account so as to avoid that significant distortions are incorporated in the output. Moreover, issues such as fault tolerance and scalability need to be thoroughly considered, so that the proposed architecture becomes robust enough to meet the requirements of different video compression applications. 6

Distributed Video Compression Figure 2 shows the speed of encoding of a scene. 7

Distributed Video Compression A.The Split Step Implemented using the Split&Merge architecture and illustrated in Figure 3. 8

Distributed Video Compression The idea is to break the media files into smaller files so that its multiple parts can be processed simultaneously on different machines, thereby reducing the total encoding time of the video file. The ideal here is to make chunks with a constant amount of frames, and not based on runtime. A good approach is to perform the split so that the number of chunks generated is equal to the number of nodes available for processing. 9

Distributed Video Compression When we split a video file into several chunks, we must repair the container of them, rewriting the header and trailer, most of the time. This process can be avoided with a very interesting method. If in the split step, instead of breaking the video file, we just identify the points of beginning and end of each chunk, then it would not be necessary to rewrite the container, which would consequently reduce the encoding time. 10

Distributed Video Compression B.The Process Step Once video is fragmented, the chunks generated should be distributed among the nodes to be processed. In this step, a compression algorithm is applied to each chunk. 11

Distributed Video Compression C.The Merge Step The first phase of the merge step is to join the chunks of processed video. Following, we remix the audio stream with the video, synchronizing the contents, and generating the expected output. We created a fully parallel and distributed video compression process, where the different pieces of content can be processed simultaneously in a cluster or, alternatively, using resources in the Cloud. 12

Distributed Video Compression D.Performance Tests In Figure 4, as follows we depict the comparison between the total times. 13

Distributed Video Compression Table 1, bellow, compares the traditional encoding process with the proposed Split&Merge approach. 14

Discussion A.Fault Tolerance The typical Map-Reduce implementation provides a single master node, responsible for the scheduling tasks to worker nodes. A single failure can result in the collapse of the entire system. The Split&Merge architecture tackles this problem by coupling a service to the master node, that periodically checks the conditions of its workers. 15

Discussion The main challenge in having two active masters is sharing state control between them. More specifically, state control sharing means that, whenever it delegates a task, the master node responsible for such operation must inform its mirror which task has been delegated to which worker node, so that both are able understand the processing status post from the worker(s). 16

Discussion B.Private Cluster Deployment Figure 5 illustrates the proposed architecture. 17

Discussion C.Public Cloud Deployment In Figure 6 we illustrate the Split&Merge architecture when used with public Cloud Environments. 18

Conclusion In this paper we are specifically interested in architectures that deal with large-scale video processing tasks, as most of the existing video processing techniques do not consider parallel computing in the cloud. A specific case that was presented was video compression, and, for this type of processing, we propose a more efficient architecture and that can be deployed both in private clusters, as in the Cloud. 19