Distributed Computing Systems Current Issues in DCS Dr. Sunny Jeong. Mr. Colin Zhang With Thanks to Prof. G. Coulouris,

Slides:



Advertisements
Similar presentations
Computer Science Internet and Web Technology High Performance Distributed Computing Parallel and Distributed Computer Systems Dr.-Ing. Thilo Kielmann.
Advertisements

Large Scale Computing Systems
2. Computer Clusters for Scalable Parallel Computing
Dinker Batra CLUSTERING Categories of Clusters. Dinker Batra Introduction A computer cluster is a group of linked computers, working together closely.
1 Distributed Systems Distributed Objects & Remote Invocation CORBA Dr. Sunny Jeong. Mr. Colin Zhang With Thanks.
I.1 Distributed Systems Prof. Dr. Alexander Schill Dresden Technical University Computer Networks Dept.
 Need for a new processing platform (BigData)  Origin of Hadoop  What is Hadoop & what it is not ?  Hadoop architecture  Hadoop components (Common/HDFS/MapReduce)
CS10 The Beauty and Joy of Computing Lecture #19 Distributed Computing Researchers at Indiana U used data mining techniques to uncover evidence.
Undergraduate Poster Presentation Match 31, 2015 Department of CSE, BUET, Dhaka, Bangladesh Wireless Sensor Network Integretion With Cloud Computing H.M.A.
Dr. Kalpakis CMSC621 Advanced Operating Systems Introduction.
Big Data A big step towards innovation, competition and productivity.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Introduction. Readings r Van Steen and Tanenbaum: 5.1 r Coulouris: 10.3.
Data Mining on the Web via Cloud Computing COMS E6125 Web Enhanced Information Management Presented By Hemanth Murthy.
Frankie Pike. 2010: 1.2 zettabytes 1.2 trillion gigabytes DVDs past the moon 2-way = 6 newspapers everyday ~58% growth per year Why care?
Facebook (stylized facebook) is a Social Networking System and website launched in February 2004, operated and privately owned by Facebook, Inc. As.
SOFTWARE SYSTEMS DEVELOPMENT MAP-REDUCE, Hadoop, HBase.
Big Data. What is Big Data? Big Data Analytics: 11 Case Histories and Success Stories
1 Distributed Systems : Server Load Balancing Dr. Sunny Jeong. Mr. Colin Zhang With Thanks to Prof. G. Coulouris,
Software Architecture
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
What is a Distributed System? n From various textbooks: l “A distributed system is a collection of independent computers that appear to the users of the.
W HAT IS H ADOOP ? Hadoop is an open-source software framework for storing and processing big data in a distributed fashion on large clusters of commodity.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
SEMINAR ON Guided by: Prof. D.V.Chaudhari Seminar by: Namrata Sakhare Roll No: 65 B.E.Comp.
Hadoop & Condor Dhruba Borthakur Project Lead, Hadoop Distributed File System Presented at the The Israeli Association of Grid Technologies.
Hadoop Ali Sharza Khan High Performance Computing 1.
Introduction. Readings r Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edn. 3 m Note: All figures from this book.
Types of Operating Systems
Performance Evaluation of Image Conversion Module Based on MapReduce for Transcoding and Transmoding in SMCCSE Speaker : 吳靖緯 MA0G IEEE.
Grid and Cloud Computing Globus Provision Dr. Guy Tel-Zur.
Introduction to Hbase. Agenda  What is Hbase  About RDBMS  Overview of Hbase  Why Hbase instead of RDBMS  Architecture of Hbase  Hbase interface.
Apache Hadoop Daniel Lust, Anthony Taliercio. What is Apache Hadoop? Allows applications to utilize thousands of nodes while exchanging thousands of terabytes.
Review of technologies for developing geospatial applications with a focus on open source (FOSS4G) and their implementation of cloud computing application.
© Hortonworks Inc Inside hadoop-dev Steve Loughran– Apachecon EU, November 2012.
Programming in Hadoop Guangda HU Huayang GUO
Types of Operating Systems 1 Computer Engineering Department Distributed Systems Course Assoc. Prof. Dr. Ahmet Sayar Kocaeli University - Fall 2015.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) Giuseppe Andronico INFN Sez. CT / Consorzio COMETA Beijing,
HADOOP Carson Gallimore, Chris Zingraf, Jonathan Light.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
GLite Middleware Administration Sara Bertocco INFN Padova 11 th International GridKa School 2013 – Big Data, Clouds and Grids.
{ Tanya Chaturvedi MBA(ISM) Hadoop is a software framework for distributed processing of large datasets across large clusters of computers.
The History of Clustering. What is computer clustering? Computer clustings is when a group of computers are linked together operating as one, sharing.
Cloud Distributed Computing Environment Hadoop. Hadoop is an open-source software system that provides a distributed computing environment on cloud (data.
Cluster computing. 1.What is cluster computing? 2.Need of cluster computing. 3.Architecture 4.Applications of cluster computing 5.Advantages of cluster.
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Learn Hadoop and Big Data Technologies. Hadoop  An Open source framework that stores and processes Big Data in distributed manner on a large groups of.
Distributed Computing Primer UMBC CMSC 491 Hadoop-Based Distributed Computing Spring 2016 Adam Shook Some content adapted from Dr. Kalpakis’s CMSC 621.
By: Joel Dominic and Carroll Wongchote 4/18/2012.
SERVICE ORIENTED ARCHITECTURE
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Hadoop Aakash Kag What Why How 1.
Clouds , Grids and Clusters
Introduction to Distributed Platforms
An Open Source Project Commonly Used for Processing Big Data Sets
Distributed Computation Framework for Machine Learning
Grid Computing.
Software Engineering Introduction to Apache Hadoop Map Reduce
Cloud Computing.
Meng Cao, Xiangqing Sun, Ziyue Chen May 28th, 2014
Ministry of Higher Education
Chapter 17: Database System Architectures
Distributed Databases
Distributed computing deals with hardware
Anjuman College of Engineering & Technology Computer Science & Engineering Department Subject Code: BECSE408T Subject Name: (ELECTIVE-III)Clustering &
Presentation transcript:

Distributed Computing Systems Current Issues in DCS Dr. Sunny Jeong. Mr. Colin Zhang With Thanks to Prof. G. Coulouris, Prof. A.S. Tanenbaum and Prof. S.C Joo

Cluster, Grid, Cloud …Now…Hadoop?  To Support hadling Big data  Hadoop HDFS + MapReduce –designed so that any node failures are automatically handled by the framework. –enables applications to work with thousands of computation- independent computers and petabytes of data.  Facebook Yahoo..of course GOOGLE

Overview  Hadoop  Computer Cluster  Grid Computing  Grid Computing Projects 3

Hadoop Architecture  Wikipedia

MapReduce Architecture  Yuan Luo

HDFS Architecture  Dhruba Borthakur

Clustering Computing  Consists of a set of loosely connected computers that work together as a single system.  The components of a cluster are usually connected to each other through fast LAN, each node with own OS  Availability of low cost microprocessors, high speed networks, and software for high performance distributed computing 7

Grid Computing  Computational Grid: –Focus on CPU scavenging/stealing technology for complicated operation  Data Grid): Managing and sharing distributed intensive data  Access Grid: For Cooperation between remote computer using audio or video applications  Equipment Grid : Controlling remote machine and analyzing data from the machine.

Grid Project BOINC in 2011)

Cloud Computing( comparison with Grid)  Service Oriented  Transaction oriented  No standardization Organization  No Standard Technology  More less homogeneous gathering (compare to Grid)  Loosely connected 10

Barriers of the Concepts  Shim young cheol