MapReduceCS-4513 D-term 20081 MapReduce CS-4513 Distributed Computing Systems (Slides include materials from Operating System Concepts, 7 th ed., by Silbershatz,

Slides:



Advertisements
Similar presentations
IntroductionCS-4513, D-Term Distributed Computing Systems Hugh C. Lauer Adjunct Professor (Slides include materials from Operating System Concepts,
Advertisements

Remote Procedure CallCS-4513, D-Term Remote Procedure Call CS-4513 Distributed Computing Systems (Slides include materials from Operating System.
Introduction to Operating Systems CS-2301 B-term Introduction to Operating Systems CS-2301, System Programming for Non-majors (Slides include materials.
Networks & Communications CS-4513, D-Term Networks & Communication (continued) CS-4513 Distributed Computing Systems (Slides include materials from.
1 CS4513 Distributed Computing Systems Bob Kinicki Term D04.
FilesCS-4513, D-Term Introduction to File Systems CS-4513 Distributed Computing Systems (Slides include materials from Operating System Concepts,
More on Replication and Consistency CS-4513, D-Term More on Replication and Consistency CS-4513 D-Term 2007 (Slides include materials from Operating.
CSCI 3 Introduction to Computer Science. CSCI 3 Course Description: –An overview of the fundamentals of computer science. Topics covered include number.
CS533 Concepts of Operating Systems Class 1 Course Overview and Entrance Exam.
Election AlgorithmsCS-4513 D-term Election Algorithms CS-4513 Distributed Computing Systems (Slides include materials from Operating System Concepts,
Atomic TransactionsCS-4513 D-term Atomic Transactions in Distributed Systems CS-4513 Distributed Computing Systems (Slides include materials from.
Atomic TransactionsCS-502 Fall Atomic Transactions in Distributed Systems CS-502, Operating Systems Fall 2007 (Slides include materials from Operating.
Bilkent University Department of Computer Engineering
NamingCS-4513, D-Term Naming CS-4513 Distributed Computing Systems (Slides include materials from Operating System Concepts, 7 th ed., by Silbershatz,
CS3161 Operating System Principles Lecturer: Victor Lee Office: B6415 Telephone: Homepage :
Programming Project #3CS-4513, D-Term Programming Project #3 Simple Web Server CS-4513 D-Term 2007 (Slides include materials from Operating System.
SynchronizationCS-4513, D-Term Synchronization in Distributed Systems CS-4513 D-Term 2007 (Slides include materials from Operating System Concepts,
Understanding BubbleSort CS-502 (EMC) Fall Understanding BubbleSort CS-502, Operating Systems Fall 2009 (EMC) (Slides include materials from Modern.
DisksCS-4513, D-Term Disks CS-4513 Distributed Computing Systems (Slides include materials from Operating System Concepts, 7 th ed., by Silbershatz,
More on AuthenticationCS-4513 D-term More on Authentication CS-4513 Distributed Computing Systems (Slides include materials from Operating System.
Replication and Consistency CS-4513 D-term Replication and Consistency CS-4513 Distributed Computing Systems (Slides include materials from Operating.
Synchronization in Distributed Systems CS-4513 D-term Synchronization in Distributed Systems CS-4513 Distributed Computing Systems (Slides include.
CS4513 Distributed Computer Systems Mark Claypool.
Practical Issues of RPCCS-4513, D-Term Remote Procedure Call Practical Issues CS-4513 Distributed Computing Systems (Slides include materials from.
MicrokernelsCS-502 (EMC) Fall Microkernels CS-502, Operating Systems Fall 2009 (EMC) (Slides include materials from Modern Operating Systems, 3 rd.
CS4513 Distributed Computer Systems Mark Claypool.
NamingCS-4513, D-Term Naming CS-4513 Distributed Computing Systems (Slides include materials from Operating System Concepts, 7 th ed., by Silbershatz,
More on Replication and Consistency CS-4513 D-term More on Replication and Consistency CS-4513 Distributed Computing Systems (Slides include materials.
CSE 515 Statistical Methods in Computer Science Instructor: Pedro Domingos.
Computational Thinking Related Efforts. CS Principles – Big Ideas  Computing is a creative human activity that engenders innovation and promotes exploration.
Introduction. Readings r Van Steen and Tanenbaum: 5.1 r Coulouris: 10.3.
Introduction to Operating Systems J. H. Wang Sep. 18, 2012.
Computer Network Fundamentals CNT4007C
Election AlgorithmsCS-4513, D-Term Synchronization (continued) CS-4513 D-Term 2007 (Slides include materials from Operating System Concepts, 7 th.
CSE 436 Software Engineering Workshop Course Overview Christopher Gill CSE 436 January 2007 Department of Computer Science and Engineering.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Introduction to Operating Systems J. H. Wang Sep. 18, 2015.
Syllabus. Instructor Dr. Hanan Lutfiyya Middlesex College 418 Ext Office Hours: Tuesday from 12:05-1:05 and Thursday from 11:05-1:05.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
CS212: DATA STRUCTURES Lecture 1: Introduction. What is this course is about ?  Data structures : conceptual and concrete ways to organize data for efficient.
Guiding Principles. Goals First we must agree on the goals. Several (non-exclusive) choices – Want every CS major to be educated in performance including.
Advanced Principles of Operating Systems (CE-403).
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved Introduction.
Introduction to Operating Systems J. H. Wang Sep. 15, 2010.
Syllabus. Instructor Dr. Hanan Lutfiyya Middlesex College 418 Ext Office Hours: Wednesday 5-6; Thursdays 4-6 or by appointment.
From Coulouris, Dollimore, Kindberg and Blair Distributed Systems: Concepts and Design Edition 5, © Addison-Wesley 2012 Design of Parallel and Distributed.
Caching Principles & Paging Performance CS-502 (EMC) Fall Caching Principles and Paging Performance CS-502, Operating Systems Fall 2009 (EMC) (Slides.
 Course Overview Distributed Systems IT332. Course Description  The course introduces the main principles underlying distributed systems: processes,
Introduction to CS739: Distribution Systems UNIVERSITY of WISCONSIN-MADISON Computer Sciences Department CS 739 Distributed Systems Andrea C. Arpaci-Dusseau.
Overview of Mobile File Systems Presented by Steve Todd For WSU CS 898T Mobile and Wireless Networks Class 5/3/04.
Introduction to Operating Systems J. H. Wang Sep. 13, 2013.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
1.1 Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition Lecture 1: Introduction & OS Structures (Part One, Chapters 1&2)
Introduction to Operating Systems
Atomic Transactions in Distributed Systems
CPE741: Distributed Systems Course Introduction
CPS : Information Management and Mining
CPE741: Distributed Systems Course Introduction
CS533 Concepts of Operating Systems Class 1
CPE741: Distributed Systems Course Introduction
Rapid Research - Data Innovations
CS-4513 Distributed Computing Systems Hugh C. Lauer
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Caching Principles and Paging Performance
Proposal for Term Project Operating Systems, Fall 2018
CS-3013 Operating Systems Hugh C. Lauer
Networks & Communication (continued)
Caching Principles and Paging Performance
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
CS533 Concepts of Operating Systems Class 1
Presentation transcript:

MapReduceCS-4513 D-term MapReduce CS-4513 Distributed Computing Systems (Slides include materials from Operating System Concepts, 7 th ed., by Silbershatz, Galvin, & Gagne, Distributed Systems: Principles & Paradigms, 2 nd ed. By Tanenbaum and Van Steen, and Modern Operating Systems, 2 nd ed., by Tanenbaum)

MapReduceCS-4513 D-term Why MapReduce An important new model of parallel and distributed computing Particularly for problems dealing with “big data” An abstraction to automate the mechanics of data handling and to let the programmer concentrate on semantics of the problem

MapReduceCS-4513 D-term From Operating System course Three fundamental models of parallel computing –Data Parallelism –Task Parallelism –Pipelined Parallelism Each requires a different set of tools Each requires a different mode of thinking

MapReduceCS-4513 D-term MapReduce A new model Fundamentally different from previous models Shares some elements with each one Promise (hope?) of solving new classes of problems that were previously very tedious to solve Not in textbooks Not in previous Distributed Systems courses at WPI

MapReduceCS-4513 D-term Learning about MapReduce Partition class into four teams Each team responsible for understanding and teaching the rest of the class about one subtopic minutes of class time per team Two teams on April 4 Two teams on April 8

MapReduceCS-4513 D-term MapReduce subtopics The abstraction itself and its algorithms Distributed MapReduce Class of problems that MapReduce can help solve Google File System to support MapReduce

MapReduceCS-4513 D-term MapReduce abstraction Explain the abstraction, what it does, etc. Explain the algorithms Show non-trivial programming examples Focus on how to think about a problem

MapReduceCS-4513 D-term Distributed MapReduce Show how it is naturally distributable and scalable Up to terabytes of data and more Show how mechanics of distribution and parallelization are automated Focus on Performance, Reliability, Fault-tolerance, Failure recovery

MapReduceCS-4513 D-term Classes of problems Identify classes of problems on which to use MapReduce Characterize them Why were they difficult before Why are people so excited about MapReduce Why did Google rewrite 10,000 existing programs in MapReduce form

MapReduceCS-4513 D-term Google File System What is so special about it? How different from traditional file systems How does it help MapReduce Focus on Performance, Reliability, Fault-tolerance, Failure recovery

MapReduceCS-4513 D-term Action items today Form teams (one for each subtopic) Roster to professor Get organized to Do reading Prepare topic

MapReduceCS-4513 D-term References See s See course web page