IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

Symantec 2010 Windows 7 Migration EMEA Results. Methodology Applied Research performed survey 1,360 enterprises worldwide SMBs and enterprises Cross-industry.
Repaso: Unidad 2 Lección 2
Symantec 2010 Windows 7 Migration Global Results.
1 A B C
Simplifications of Context-Free Grammars
Variations of the Turing Machine
AP STUDY SESSION 2.
1
Select from the most commonly used minutes below.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 4 Computing Platforms.
Processes and Operating Systems
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
David Burdett May 11, 2004 Package Binding for WS CDL.
Local Customization Chapter 2. Local Customization 2-2 Objectives Customization Considerations Types of Data Elements Location for Locally Defined Data.
CALENDAR.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt BlendsDigraphsShort.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt RhymesMapsMathInsects.
1 Chapter 12 File Management Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles,
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
The 5S numbers game..
Media-Monitoring Final Report April - May 2010 News.
Chapter 7: Steady-State Errors 1 ©2000, John Wiley & Sons, Inc. Nise/Control Systems Engineering, 3/e Chapter 7 Steady-State Errors.
Welcome. © 2008 ADP, Inc. 2 Overview A Look at the Web Site Question and Answer Session Agenda.
Break Time Remaining 10:00.
Turing Machines.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
CS 6143 COMPUTER ARCHITECTURE II SPRING 2014 ACM Principles and Practice of Parallel Programming, PPoPP, 2006 Panel Presentations Parallel Processing is.
1 The Royal Doulton Company The Royal Doulton Company is an English company producing tableware and collectables, dating to Operating originally.
Operating Systems Operating Systems - Winter 2010 Chapter 3 – Input/Output Vrije Universiteit Amsterdam.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
TESOL International Convention Presentation- ESL Instruction: Developing Your Skills to Become a Master Conductor by Beth Clifton Crumpler by.
Sample Service Screenshots Enterprise Cloud Service 11.3.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
1 TV Viewing Trends Rivière-du-Loup EM - Diary Updated Spring 2014.
Adding Up In Chunks.
SLP – Endless Possibilities What can SLP do for your school? Everything you need to know about SLP – past, present and future.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Artificial Intelligence
Before Between After.
Bell Busters! Unit 1 #1-61. Purposes of Government 1. Purposes of government 2. Preamble to the Constitution 3. Domestic tranquility 4. Common defense.
: 3 00.
5 minutes.
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
1 Let’s Recapitulate. 2 Regular Languages DFAs NFAs Regular Expressions Regular Grammars.
Types of selection structures
Speak Up for Safety Dr. Susan Strauss Harassment & Bullying Consultant November 9, 2012.
1 Titre de la diapositive SDMO Industries – Training Département MICS KERYS 09- MICS KERYS – WEBSITE.
Essential Cell Biology
Converting a Fraction to %
Numerical Analysis 1 EE, NCKU Tien-Hao Chang (Darby Chang)
Clock will move after 1 minute
famous photographer Ara Guler famous photographer ARA GULER.
PSSA Preparation.
Physics for Scientists & Engineers, 3rd Edition
Select a time to count down from the clock above
Copyright Tim Morris/St Stephen's School
1.step PMIT start + initial project data input Concept Concept.
1 Decidability continued…. 2 Theorem: For a recursively enumerable language it is undecidable to determine whether is finite Proof: We will reduce the.
1 Non Deterministic Automata. 2 Alphabet = Nondeterministic Finite Accepter (NFA)
HAMS Technologies 1
Presentation transcript:

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN ITO483-PRINCIPLES OF CLOUD COMPUTING Unit-5. CASE STUDY : Amazon Case Study. Introduction to MapReduce: Discussion of Google Paper, GFS, HDFS, Hadoop Framework. IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN AMAZON WEB SERVICE Using Amazon Web Services, an e-commerce web site can weather unforeseen demand with ease; a pharmaceutical company can “rent” computing power to execute large-scale simulations; a media company can serve unlimited videos, music, and more; and an enterprise can deploy bandwidth-consuming services and training to its mobile workforce IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN BENEFITS No contracts or commitments Pay as you go Transparent pricing Better economics Better use of your time Better environmental impact IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN MAP REDUCE The idea of Map, and Reduce is 40+ year old Present in all Functional Programming Languages. See, e.g., APL, Lisp and ML Alternate names for Map: Apply-All Higher Order Functions take function definitions as arguments, or return a function as output Map and Reduce are higher-order functions IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN MAP REDUCE F(x: int) returns r: int Let V be an array of integers. W = map(F, V) W[i] = F(V[i]) for all I i.e., apply F to every element of V IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN reduce: A Higher Order Function reduce also known as fold, accumulate, compress or inject Reduce/fold takes in a function and folds it in between the elements of a list. IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN Map/Reduce Implementation Idea MapReduce and Distributed File System framework for large commodity clusters Master/Slave relationship JobTracker handles all scheduling & data flow between TaskTrackers TaskTracker handles all worker tasks on a node Individual worker task runs map or reduce operation Integrates with HDFS for data locality IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN Hadoop Supported File Systems HDFS: Hadoop's own file system. Amazon S3 file system. Targeted at clusters hosted on the Amazon Elastic Compute Cloud server-on-demand infrastructure Not rack-aware CloudStore previously Kosmos Distributed File System like HDFS, this is rack-aware. FTP Filesystem stored on remote FTP servers. Read-only HTTP and HTTPS file systems. IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN HDFS: Hadoop Distr File System Designed to scale to petabytes of storage, and run on top of the file systems of the underlying OS. Master (“NameNode”) handles replication, deletion, creation Slave (“DataNode”) handles data retrieval Files stored in many blocks Each block has a block Id Block Id associated with several nodes hostname:port (depending on level of replication) IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN Hadoop v. ‘MapReduce MapReduce is also the name of a framework developed by Google Hadoop was initially developed by Yahoo and now part of the Apache group. Hadoop was inspired by Google's MapReduce and Google File System (GFS) papers. IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN MapReduce Hadoop Org Google Yahoo/Apache Impl C++ Java Distributed File Sys GFS HDFS Data Base Bigtable HBase Distributed lock mgr Chubby ZooKeeper IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

A Simple Hadoop Example http://wiki.apache.org/hadoop/WordCount wordCount A Simple Hadoop Example http://wiki.apache.org/hadoop/WordCount

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN Word Count Example Read text files and count how often words occur. The input is text files The output is a text file each line: word, tab, count Map: Produce pairs of (word, count) Reduce: For each word, sum up the counts. IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN WordCount Overview 3 import ... 12 public class WordCount { 13 14 public static class Map extends MapReduceBase implements Mapper ... { 17 18 public void map ... 26 } 27 28 public static class Reduce extends MapReduceBase implements Reducer ... { 29 30 public void reduce ... 37 } 38 39 public static void main(String[] args) throws Exception { 40 JobConf conf = new JobConf(WordCount.class); 41 ... 53 FileInputFormat.setInputPaths(conf, new Path(args[0])); 54 FileOutputFormat.setOutputPath(conf, new Path(args[1])); 55 56 JobClient.runJob(conf); 57 } 58 59 } IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN wordCount Mapper 14 public static class Map extends MapReduceBase implements Mapper<LongWritable, Text, Text, IntWritable> { 15 private final static IntWritable one = new IntWritable(1); 16 private Text word = new Text(); 17 18 public void map( LongWritable key, Text value, OutputCollector<Text, IntWritable> output, Reporter reporter) throws IOException { 19 String line = value.toString(); 20 StringTokenizer tokenizer = new StringTokenizer(line); 21 while (tokenizer.hasMoreTokens()) { 22 word.set(tokenizer.nextToken()); 23 output.collect(word, one); 24 } 25 } 26 } IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN wordCount Reducer 28 public static class Reduce extends MapReduceBase implements Reducer<Text, IntWritable, Text, IntWritable> { 29 30 public void reduce(Text key, Iterator<IntWritable> values, OutputCollector<Text, IntWritable> output, Reporter reporter) throws IOException { 31 int sum = 0; 32 while (values.hasNext()) { 33 sum += values.next().get(); 34 } 35 output.collect(key, new IntWritable(sum)); 36 } 37 } IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN wordCount JobConf 40 JobConf conf = new JobConf(WordCount.class); 41 conf.setJobName("wordcount"); 42 43 conf.setOutputKeyClass(Text.class); 44 conf.setOutputValueClass(IntWritable.class); 45 46 conf.setMapperClass(Map.class); 47 conf.setCombinerClass(Reduce.class); 48 conf.setReducerClass(Reduce.class); 49 50 conf.setInputFormat(TextInputFormat.class); 51 conf.setOutputFormat(TextOutputFormat.class); IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN WordCount main 39 public static void main(String[] args) throws Exception { 40 JobConf conf = new JobConf(WordCount.class); 41 conf.setJobName("wordcount"); 42 43 conf.setOutputKeyClass(Text.class); 44 conf.setOutputValueClass(IntWritable.class); 45 46 conf.setMapperClass(Map.class); 47 conf.setCombinerClass(Reduce.class); 48 conf.setReducerClass(Reduce.class); 49 50 conf.setInputFormat(TextInputFormat.class); 51 conf.setOutputFormat(TextOutputFormat.class); 52 53 FileInputFormat.setInputPaths(conf, new Path(args[0])); 54 FileOutputFormat.setOutputPath(conf, new Path(args[1])); 55 56 JobClient.runJob(conf); 57 } IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

Invocation of wordcount /usr/local/bin/hadoop dfs -mkdir <hdfs-dir> /usr/local/bin/hadoop dfs -copyFromLocal <local-dir> <hdfs-dir> /usr/local/bin/hadoop jar hadoop-*-examples.jar wordcount [-m <#maps>] [-r <#reducers>] <in-dir> <out-dir> IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN GFS Google File System (GFS or GoogleFS) is a proprietary distributed file system developed by Google Inc. for its own use. It is designed to provide efficient, reliable access to data using large clusters of commodity hardware. IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN HDFS Hadoop Distributed File System (HDFS™) is the primary storage system used by Hadoop applications. HDFS creates multiple replicas of data blocks and distributes them on compute nodes throughout a cluster to enable reliable, extremely rapid computations. IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING,N.ARIVAZHAGAN Review questions Part A 1) What is the Apache Hadoop? 2) Mention the uses of Amazon EC2 Cloud Computing services 3) What is meant by MapReduce? 4) Mention the hot spots of MapReduce framework 5) What are the different steps in MapReduce framework 6) What is the use of Map Partition function 7) Mention the uses of MapReduce function 8) Differentiate job tracker and task tracker 9) What is the algorithm used in scheduling of Hadoop? 10) What is meant by fair scheduler. Mention its uses 11) What is meant by capacity scheduler 12) Mention any four applications of Hadoop 13) Who are the all the main users of Hadoop 14) Mention any four commercially supported Hadoop related products IT0483-PRINCIPLES OF CLOUD COMPUTING,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN Part B 1) Draw and explain about Hadoop architecture 2) Explain about Hadoop File System 3) Explain about Amazon EC2 Cloud Computing Case Study for financial organization 4) Explain the concept of MapReduce 5) Explain the concept of Google File System IT0483-PRINCIPLES OF CLOUD COMPUTING ,N.ARIVAZHAGAN 3/25/2017

IT0483-PRINCIPLES OF CLOUD COMPUTING,N.ARIVAZHAGAN REFERENCES 1.WWW.WIKIPEDIA.ORG IT0483-PRINCIPLES OF CLOUD COMPUTING,N.ARIVAZHAGAN 3/25/2017