Lecture 1
Book: Hadoop in Action by Chuck Lam
Online course: “Cloud Computing Concepts” lecture notes by Indranil Gupta

Content
- Introduction
- Clouds
- MapReduce
- Understanding Hadoop and MapReduce

Grading policy
- Attendance: 10%
- Quizzes: 20%
- Midterm: 20%
- Assignments: 20%
- Final: 30%
- Total: 100%

Many Cloud Providers
- AWS: Amazon Web Services
  - EC2: Elastic Compute Cloud
  - S3: Simple Storage Service
  - EBS: Elastic Block Store
- Microsoft Azure
- Google Compute Engine
- Rightscale, Salesforce, EMC, Gigaspaces, 10gen, Datastax, Oracle, VMware, Yahoo, Cloudera
- And many, many more!

Two Categories of Clouds
A cloud can be either (i) a public cloud or (ii) a private cloud.
- Private clouds are accessible only to company employees.
- Public clouds provide service to any paying customer:
  - Amazon S3 (Simple Storage Service)
  - Amazon EC2 (Elastic Compute Cloud)
  - Google App Engine/Compute Engine

What is a Cloud?
- It’s a cluster!
- It’s a supercomputer!
- It’s a datastore!
- It’s a Superman!
- None of the above
- All of the above
Cloud = Lots of storage + computing cycles nearby

What is a Cloud?
A single-site cloud (aka “datacenter”) consists of:
- Compute nodes (grouped into racks)
- Switches, connecting the racks
- A network topology, e.g. hierarchical
- Storage nodes connected to the network
- A front-end for submitting jobs and receiving client requests
- Software services
A geographically distributed cloud consists of:
- Multiple such sites
- Each site perhaps with a different structure and services

A Cloudy history of Time

On-demand Access: *aaS
On-demand: renting a cab vs. renting a car or buying one.
- HaaS: Hardware as a Service. Access to barebones hardware machines. Not always a good idea because of security risks.
- IaaS: Infrastructure as a Service. Access to flexible computing and storage infrastructure. Ex: Amazon Web Services (AWS: EC2 and S3).
- PaaS: Platform as a Service. Access to flexible computing and storage infrastructure, coupled with a software platform.
- SaaS: Software as a Service. Access to software services (Service Oriented Architectures). Ex: Google Docs, MS Office on demand.

A Cloud...
A cloud consists of:
- Hundreds to thousands of machines in a datacenter (server side)
- Thousands to millions of machines accessing these services (client side)
Within it:
- Servers communicate amongst one another
- Clients communicate with servers
- Clients also communicate with each other

A Cloud... IS a Distributed System
- Servers communicate amongst one another -> a distributed system. Essentially a cluster!
- Clients communicate with servers -> also a distributed system!
- Clients may also communicate with each other, in peer-to-peer systems like BitTorrent -> also a distributed system!

Four Features of Clouds = All Distributed Systems Features!
I. Massive scale: many servers
II. On-demand nature: access (multiple) servers anywhere
III. Data-intensive nature: lots of data => need a cluster (multiple machines) to store it
IV. New cloud programming paradigms: Hadoop/MapReduce and NoSQL all need clusters

Distributed System = Many Processes Sending and Receiving Messages
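
The idea of processes exchanging messages can be sketched on a single machine. The following is a minimal illustration only: threads stand in for processes, in-memory queues stand in for the network, and the names `worker` and `run_demo` are hypothetical, not part of any cloud framework.

```python
import queue
import threading


def worker(worker_id, inbox, outbox):
    """Receive numbers until a None sentinel arrives; reply with each square."""
    while True:
        msg = inbox.get()
        if msg is None:  # sentinel: no more messages for this worker
            break
        outbox.put((worker_id, msg, msg * msg))


def run_demo(num_workers=3, values=(1, 2, 3, 4, 5, 6)):
    """Send each value to a worker round-robin and collect all replies."""
    inboxes = [queue.Queue() for _ in range(num_workers)]
    outbox = queue.Queue()
    threads = [
        threading.Thread(target=worker, args=(i, inboxes[i], outbox))
        for i in range(num_workers)
    ]
    for t in threads:
        t.start()
    for i, value in enumerate(values):
        inboxes[i % num_workers].put(value)  # "send" a message
    for inbox in inboxes:
        inbox.put(None)
    for t in threads:
        t.join()
    replies = []
    while not outbox.empty():  # safe: all senders have finished
        replies.append(outbox.get())
    # Replies arrive in no guaranteed order, so sort by the original value.
    return sorted(replies, key=lambda reply: reply[1])


if __name__ == "__main__":
    for worker_id, value, squared in run_demo():
        print(f"worker {worker_id}: {value} -> {squared}")
```

The explicit sort is the point of the sketch: the sender cannot assume replies come back in the order requests were sent.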

Many Challenges Abound...
- Failures: no longer the exception, but rather the norm
- Scalability: 1000s of machines, terabytes of data
- Asynchrony: clock skew and clock drift
- Concurrency: 1000s of machines interacting with each other and accessing the same data
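
A short calculation shows why failures become the norm at scale. This is a sketch under stated assumptions: machines fail independently, and the per-machine daily failure probability of 0.001 is an illustrative figure, not a measurement from the lecture.

```python
def prob_any_failure(num_machines, p_fail):
    """Probability that at least one machine fails.

    Assumes independent failures, each with probability p_fail:
    P(at least one) = 1 - (1 - p_fail) ** num_machines
    """
    return 1.0 - (1.0 - p_fail) ** num_machines


if __name__ == "__main__":
    p_fail = 0.001  # assumed per-machine daily failure probability
    for n in (1, 100, 1000, 10000):
        print(f"{n:>6} machines: P(some failure today) = "
              f"{prob_any_failure(n, p_fail):.4f}")
```

Under these assumptions a single machine almost never fails on a given day, while a cluster of 1000 machines sees at least one failure on most days.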

Hadoop
Doug Cutting saw an opportunity and led the charge to develop an open source version of Google's MapReduce system, called Hadoop. Today, Hadoop is a core part of the computing infrastructure for many web companies, such as Yahoo, Facebook, LinkedIn, and Twitter. An effective programmer today must have knowledge of relational databases, networking, and security, all of which were considered optional skills a couple of decades ago. Similarly, a basic understanding of distributed data processing will soon become an essential part of every programmer’s toolbox.

What is MapReduce

Map

Reduce
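
The map and reduce steps can be sketched with the customary word-count example. This is a single-machine illustration, not Hadoop's API: the function names are hypothetical, and the `shuffle` step stands in for the grouping by key that a framework such as Hadoop performs between the two phases across many machines.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    for word in document.lower().split():
        yield (word, 1)


def shuffle(pairs):
    """Group all emitted values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return (key, sum(values))


def word_count(documents):
    """Run map over every document, group by word, then reduce each group."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    groups = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in groups.items())


if __name__ == "__main__":
    docs = ["the cloud is a cluster", "the cluster is a cloud"]
    print(word_count(docs))
```

Each call to `map_phase` depends only on its own document and each call to `reduce_phase` only on its own key, which is what lets a cluster run them in parallel.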

Thank You