Web: courses.engr.illinois.edu/cs425/

Slides:



Advertisements
Similar presentations
CS 425 / ECE 428 Distributed Systems Fall 2014 Indranil Gupta (Indy) Lecture 1: Welcome, and Introduction Web: courses.engr.illinois.edu/cs425/ All slides.
Advertisements

CS542: Topics in Distributed Systems
Computer networks SATISH MISHRA,PGT CS,KV TRIMULGHERRY.
CS 425 / ECE 428 Distributed Systems Fall 2014 Indranil Gupta (Indy) August 26 – December 9, 2014 Lecture 1-29 Web: courses.engr.illinois.edu/cs425/ All.
Application Layer-11 CSE401N: Computer Networks Lecture-4 Application Layer Overview HTTP.
Chapter 2: Application Layer
Jacob Boston Josh Pfeifer. Definition of HyperText Transfer Protocol How HTTP works How Websites work GoDaddy.com OSI Model Networking.
Chapter 2 Application Layer Computer Networking: A Top Down Approach Featuring the Internet, 3 rd edition. Jim Kurose, Keith Ross Addison-Wesley, July.
Web, HTTP and Web Caching
Lecture 1-1 CS542: Topics in Distributed Systems Diganta Goswami.
ECS152BXin Liu 1 ECS 152B Computer Networks Fall 2003 Prof. Xin Liu
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
Introduction 1 Lecture 5 Application Layer slides are modified from J. Kurose & K. Ross University of Nevada – Reno Computer Science & Engineering Department.
Distributed Systems Lecture 1: Overview CS425/CSE424/ECE428 Fall 2011 Nikita Borisov.
19-1 Last time □ TCP ♦ Throughput ♦ Fairness ♦ Delay modeling □ TCP socket programming.
1 Application Layer Lecture 4 Imran Ahmed University of Management & Technology.
CS 3830 Day 7 Introduction : Application Layer 2 Processes communicating Process: program running within a host. r within same host, two processes.
70-291: MCSE Guide to Managing a Microsoft Windows Server 2003 Network Chapter 3: TCP/IP Architecture.
FTP (File Transfer Protocol) & Telnet
2: Application Layer1 CS 4244: Internet Software Development Dr. Eli Tilevich.
Rensselaer Polytechnic Institute Shivkumar Kalvanaraman, Biplab Sikdar 1 The Web: the http protocol http: hypertext transfer protocol Web’s application.
2: Application Layer1 Internet apps: their protocols and transport protocols Application remote terminal access Web file transfer streaming multimedia.
Week 11: Application Layer1 Web and HTTP First some jargon r Web page consists of objects r Object can be HTML file, JPEG image, Java applet, audio file,…
1 Computer Communication & Networks Lecture 28 Application Layer: HTTP & WWW p Waleed Ejaz
1.1 What is the Internet What is the Internet? The Internet is a shared media (coaxial cable, copper wire, fiber optics, and radio spectrum) communication.
Copyright © 2002 Pearson Education, Inc. Slide 3-1 CHAPTER 3 Created by, David Zolzer, Northwestern State University—Louisiana The Internet and World Wide.
Networks – Network Architecture Network architecture is specification of design principles (including data formats and procedures) for creating a network.
HOW WEB SERVER WORKS? By- PUSHPENDU MONDAL RAJAT CHAUHAN RAHUL YADAV RANJIT MEENA RAHUL TYAGI.
Jozef Goetz, Application Layer PART VI Jozef Goetz, Position of application layer The application layer enables the user, whether human.
Distributed Systems: Concepts and Design Chapter 1 Pages
TCP/IP fundamentals Unit objectives Discuss the evolution of TCP/IP Discuss TCP/IP fundamentals.
MySQL and PHP Internet and WWW. Computer Basics A Single Computer.
Sockets process sends/receives messages to/from its socket
Application Layer Khondaker Abdullah-Al-Mamun Lecturer, CSE Instructor, CNAP AUST.
Lecture 28-1 Computer Science 425 Distributed Systems Lecture 28 “Wrap-Up” Klara Nahrstedt.
Lecture 1-1 Computer Science 425 Distributed Systems CS 425 / CSE 424 / ECE 428 Fall 2012 Indranil Gupta (Indy) August 28, 2012 Lecture 1  2012, I. Gupta,
Computer Science, FSU1 CNT5505 DATA/COMUTER COMMUNICATIONS FALL 2008 Time: 3:35AM-4:50AM, MW Location: Love 103 Fall 2008, 3 credits.
2: Application Layer1 Chapter 2 Application Layer Computer Networking: A Top Down Approach Featuring the Internet, 3 rd edition. Jim Kurose, Keith Ross.
World Wide Web “WWW”, "Web" or "W3". World Wide Web “WWW”, "Web" or "W3"
TCP/IP (Transmission Control Protocol / Internet Protocol)
70-291: MCSE Guide to Managing a Microsoft Windows Server 2003 Network, Enhanced Chapter 3: TCP/IP Architecture.
Unit 9: Distributing Computing & Networking Kaplan University 1.
2: Application Layer 1 Chapter 2: Application layer r 2.1 Principles of network applications  app architectures  app requirements r 2.2 Web and HTTP.
Lecture 1-1  2002, M. T. Harandi, J. Hou, and I. Gupta (modified Y. Hu) ECE 428 Distributed Systems Yih-Chun Hu August 25, 2005.
27.1 Chapter 27 WWW and HTTP Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Internet Applications (Cont’d) Basic Internet Applications – World Wide Web (WWW) Browser Architecture Static Documents Dynamic Documents Active Documents.
CS 425 / ECE 428 Distributed Systems Fall 2015 Indranil Gupta (Indy) August 25 – December 8, 2015 Lecture 1-29 Web: courses.engr.illinois.edu/cs425/ All.
INTERNET AND PROTOCOLS For more notes and topics visit: eITnotes.com.
Computer Science, FSU1 CNT5505 DATA/COMUTER COMMUNICATIONS Fall 2010.
Lecture 29-1 Computer Science 425 Distributed Systems CS 425 / CSE 424 / ECE 428 Fall 2012 Indranil Gupta (Indy) August 27-December 11, 2012 Lecture 1-29.
Data Communications and Computer Networks Chapter 2 CS 3830 Lecture 7 Omar Meqdadi Department of Computer Science and Software Engineering University of.
Week 11: Application Layer 1 Web and HTTP r Web page consists of objects r Object can be HTML file, JPEG image, Java applet, audio file,… r Web page consists.
The Internet Technological Background. Topic Objectives At the end of this topic, you should be able to do the following: Able to define the Internet.
Distributed Systems Lecture 1 Introduction to distributed systems 1.
COMPUTER NETWORKS Hwajung Lee. Image Source:
2.2 Interfacing Computers MR JOSEPH TAN CHOO KEE TUESDAY 1330 TO 1530
Information Networks. Internet It is a global system of interconnected computer networks that link several billion devices worldwide. It is an international.
Chapter Objectives In this chapter, you will learn:
Block 5: An application layer protocol: HTTP
Lecture 2 Dr. Richard Spillman Fall 2009
Internet transport protocols services
Lecture 6: TCP/IP Networking By: Adal Alashban
Topic 5: Communication and the Internet
Computer Communication & Networks
湖南大学-信息科学与工程学院-计算机与科学系
CSC 724 Advanced Distributed Systems Introduction
Web: courses.engr.illinois.edu/cs425/
Web: courses.engr.illinois.edu/cs425/
Web: courses.engr.illinois.edu/cs425/
Computer Networks Protocols
Presentation transcript:

Web: courses.engr.illinois.edu/cs425/ CS 425 / ECE 428 Distributed Systems Fall 2015 Indranil Gupta (Indy) Lecture 1: Welcome, and Introduction Aug 25, 2015 Web: courses.engr.illinois.edu/cs425/ All slides © IG

Course Staff More about course later in this lecture… Myself (Indy): 3112 Siebel Center Office Hours Every Tuesday and Thursday right after class – see website TAs: (see course website for office hours) Yi Zhang Guangxiang Du Qi Wang Ayush Jain Alex Zahdeh More about course later in this lecture…

What This Course is About Politics Sports Space Travel Job Interviews Company Acquisitions (Not Kidding)

What This Course is Really About Distributed Systems How to Design Algorithms for them How to Design The Systems How they work in real life How to build real distributed systems

Our Main Goal Today To Define the Term Distributed System

Can you name some examples of Operating Systems?

Can you name some examples of Operating Systems? … Linux WinXP Vista 7/8 Unix FreeBSD Mac OSX 2K Aegis Scout Hydra Mach SPIN OS/2 Express Flux Hope Spring AntaresOS EOS LOS SQOS LittleOS TINOS PalmOS WinCE TinyOS iOS

What is an Operating System?

What is an Operating System? User interface to hardware (device driver) Provides abstractions (processes, file system) Resource manager (scheduler) Means of communication (networking) …

FOLDOC definition (FOLDOC = Free On-Line Dictionary of Computing) Operating System - The low-level software which handles the interface to peripheral hardware, schedules tasks, allocates storage, and presents a default interface to the user when no application program is running.

Can you name some examples of Distributed Systems?

Can you name some examples of Distributed Systems? Client-Server (NFS) The Web The Internet A wireless network DNS Gnutella or BitTorrent (peer to peer overlays) A “cloud”, e.g., Amazon EC2/S3, Microsoft Azure A datacenter, e.g., NCSA, a Google datacenter, AWS

What is a Distributed System?

FOLDOC definition A collection of (probably heterogeneous) automata whose distribution is transparent to the user so that the system appears as one local machine. This is in contrast to a network, where the user is aware that there are several machines, and their location, storage replication, load balancing and functionality is not transparent. Distributed systems usually use some kind of client-server organization.

Textbook definitions A distributed system is a collection of independent computers that appear to the users of the system as a single computer. [Andrew Tanenbaum] A distributed system is several computers doing something together. Thus, a distributed system has three primary characteristics: multiple computers, interconnections, and shared state. [Michael Schroeder] Contradictions: uh, doesn’t each computer need to know about a few other computers in the system (somewhere deep down there in the RAM)? If computers don’t exchange any information and make no progress, would this count as a distributed system?

Unsatisfactory Why are these definitions short? Why do these definitions look inadequate to us? Because we are interested in the insides of a distributed system design and implementation Maintenance Algorithmics (“protocols” or “distributed algorithms”)

“I shall not today attempt further to define the kinds of material I understand to be embraced within that shorthand description; and perhaps I could never succeed in intelligibly doing so. But I know it when I see it, and the motion picture involved in this case is not that.” [Potter Stewart, Associate Justice, US Supreme Court (talking about his interpretation of a technical term laid down in the law, case Jacobellis versus Ohio 1964) ]

Which is a Distributed System – (A) or (B)? (A) Facebook Social Network Graph among humans Source: https://www.facebook.com/note.php?note_id=469716398919

(B) The Internet (Internet Mapping Project, color coded by ISPs)

A working definition for us A distributed system is a collection of entities, each of which is autonomous, programmable, asynchronous and failure-prone, and which communicate through an unreliable communication medium. Entity=a process on a device (PC, PDA) Communication Medium=Wired or wireless network Our interest in distributed systems involves design and implementation, maintenance, algorithmics Designers and progammers interested in : algorithmics, maintenance. (informal definition, for us programmers of distributed systems) Ok: peer to peer systems Contradiction: a computer without ROM or disk drives that needs to boot over the network. Is a collection of these computers a distributed system?

Gnutella Peer to Peer System What are the “entities” (nodes)? What is the communication medium (links)? Source: GnuMap Project

Web Domains What are the “entities” (nodes)? What is the communication medium (links)? Source: http://www.vlib.us/web/worldwideweb3d.html

Datacenter What are the “entities” (nodes)? What is the communication medium (links)?

The Internet – Quick Refresher Underlies many distributed systems. A vast interconnected collection of computer networks of many types. Intranets – subnetworks operated by companies and organizations. Intranets contain LANs (local area networks). WAN – wide area networks, consists of subnets (intranets, LANs, etc.) ISPs – companies that provide modem links and other types of connections to users. Intranets (actually the ISPs’ core routers) are linked by backbones – network links of large bandwidth, such as satellite connections, fiber optic cables, and other high-bandwidth circuits. UC2B? Google Fiber? (MAN = Metropolitan Area Networks)

An Intranet & a distributed system Running over this Intranet is a distributed file system prevents unauthorized messages from leaving/entering; implemented by filtering incoming and outgoing messages via firewall “rules” (configurable)

Networking Stacks Application Underlying layer protocol smtp [RFC 821] telnet [RFC 854] http [RFC 2068] ftp [RFC 959] proprietary (e.g. RealNetworks) NFS (e.g., Skype) Underlying transport protocol TCP TCP or UDP typically UDP Application e-mail remote terminal access Web file transfer streaming multimedia remote file server Internet telephony Distributed System Protocols! Networking Protocols TCP=Transmission Control Protocol UDP=User Datagram Protocol (Implemented via sockets)

The History of Internet Standards Source: http://xkcd.com/927/

The Heart of the World Wide Web: the HTTP Standard HTTP: hypertext transfer protocol WWW’s application layer protocol client/server model client: browser that requests, receives, and “displays” WWW objects server: WWW server, which is storing the website, sends objects in response to requests http1.0: RFC 1945 http1.1: RFC 2068 Leverages same connection to download images, scripts, etc. http request PC running Explorer http response http request Server Running Apache Web server http response Mac running Safari

The HTTP Protocol: More http: TCP transport service: client initiates a TCP connection (creates socket) to server, port 80 server accepts the TCP connection from client http messages (application-layer protocol messages) exchanged between browser (http client) and WWW server (http server) TCP connection closed http is “stateless” server maintains no information about past client requests Why? Protocols that maintain session “state” are complex! past history (state) must be maintained and updated. if server/client crashes, their views of “state” may be inconsistent, and hence must be reconciled. RESTful protocols are stateless.

HTTP Example Suppose user enters URL www.cs.uiuc.edu/ time (contains text, references to 10 jpeg images) Suppose user enters URL www.cs.uiuc.edu/ 1a. http client initiates a TCP connection to http server (process) at www.cs.uiuc.edu. Port 80 is default for http server. 1b. http server at host www.cs.uiuc.edu waiting for a TCP connection at port 80. “accepts” connection, notifying client 2. http client sends a http request message (containing URL) into TCP connection socket 3. http server receives request messages, forms a response message containing requested object (index.html), sends message into socket time

HTTP Example (cont.) time 4. http server closes the TCP connection (if necessary). 5. http client receives a response message containing html file, displays html, Parses html file, finds 10 referenced jpeg objects 6. Steps 1-5 are then repeated for each of 10 jpeg objects time For fetching referenced objects, have 2 options: non-persistent connection: only one object fetched per TCP connection some browsers create multiple TCP connections simultaneously - one per object persistent connection: multiple objects transferred within one TCP connection

Your Shell as a Web browser 1. Telnet to your favorite WWW server: telnet www.google.com 80 Opens TCP connection to port 80 (default http server port) at www.google.com Anything typed in sent to port 80 at www.google.com 2. Type in a GET http request: By typing this in (may need to hit return twice), you send this minimal (but complete) GET request to http server GET /index.html Or GET /index.html HTTP/1.0 3. Look at response message sent by http server! What do you think the response is?

Does our Working Definition work for the http Web? A distributed system is a collection of entities, each of which is autonomous, programmable, asynchronous and failure-prone, and that communicate through an unreliable communication medium. Entity=a process on a device (PC, PDA) Communication Medium=Wired or wireless network Our interest in distributed systems involves design and implementation, maintenance, study, algorithmics Designers and progammers interested in : algorithmics, maintenance. (informal definition, for us programmers of distributed systems) Ok: peer to peer systems Contradiction: a computer without ROM or disk drives that needs to boot over the network. Is a collection of these computers a distributed system?

“Important” Distributed Systems Issues No global clock; no single global notion of the correct time (asynchrony) Unpredictable failures of components: lack of response may be due to either failure of a network component, network path being down, or a computer crash (failure-prone, unreliable) Highly variable bandwidth: from 16Kbps (slow modems or Google Balloon) to Gbps (Internet2) to Tbps (in between DCs of same big company) Possibly large and variable latency: few ms to several seconds Large numbers of hosts: 2 to several million

Many Interesting Design Problems Real distributed systems Cloud Computing, Peer to peer systems, Hadoop, key-value stores/NoSQL, distributed file systems, sensor networks, measurements, graph processing, stream processing, … Classical Problems Failure detection, Asynchrony, Snapshots, Multicast, Consensus, Mutual Exclusion, Election, … Concurrency RPCs, Concurrency Control, Replication Control, Paxos, … Security ACLs, Capabilities, … Others…

Typical Distributed Systems Design Goals Common Goals: Heterogeneity – can the system handle a large variety of types of PCs and devices? Robustness – is the system resilient to host crashes and failures, and to the network dropping messages? Availability – are data+services always there for clients? Transparency – can the system hide its internal workings from the users? (warning: term means the opposite of what the name implies!) Concurrency – can the server handle multiple clients simultaneously? Efficiency – is the service fast enough? Does it utilize 100% of all resources? Scalability – can it handle 100 million nodes without degrading service? (nodes=clients and/or servers) How about 6 B? More? Security – can the system withstand hacker attacks? Openness – is the system extensible?

“Important” Issues If you’re already complaining that the list of topics we’ve discussed so far has been perplexing… You’re right! It was meant to be (perplexing) The Goal for the Rest of the Course: see enough examples and learn enough concepts so these topics and issues will make sense We will revisit many of these slides in the very last lecture of the course!

“Concepts”? Which of the following inventions do you think is the most important? Car Wheel Bicycle “What lies beneath?” Concepts!

How will We Learn? All this information is contained in handout on course website: “Course Information and Schedule” Web: courses.engr.illinois.edu/cs425/ Textbook, Recommended but not Required Colouris, Dollimore, Kindberg and Blair (5th edition) CDK Textbook If you use a different or older version, be sure to check problem numbers, etc. Textbook is a great source of practice problems for the exams (midterm and final) Lectures

How will We Learn? (2) All this information contained in handout on course website: “Course Information and Schedule” Web: courses.engr.illinois.edu/cs425/ Homeworks (HWs) Four in total, two before midterm (mid-Oct) and two after About 2 weeks per homework Solutions need to be typed, figures can be hand-drawn Programming assignments (MPs, i.e., Machine Programming Assignments) Only for 4 credit students (3 credit students welcome to do it, but we won’t be able to grade it or take it as extra credit) You will be building a distributed stream processing system in stages Exams/quizzes Midterm (mid-Oct) + Final

What assistance is available to you? Lectures Lecture slides will be placed online at course website “Tentative” version before lecture “Final” version after lecture Asking Questions (most preferable first) Piazza: Your first stop for all questions (see if your question has already been asked) We will try to ensure that every question is answered within 24 hours of posting (during weekdays). Do not post solutions or code on Piazza – that’ll immediately earn you a zero on the HW/MP. Office Hours (posted on course website): Every weekday has at least one office hours scheduled. Email staff mailing list (don’t email individual instructor/TA except for private issues) Course Prerequisite: CS 241 or ECE 391 or equivalent OS/networking course (latter need instructor permission)

Course Staff (Again) Myself (Indy): 3112 Siebel Center Office Hours Every Tuesday and Thursday right after class – see website TAs: (see course website for office hours) Yi Zhang Guangxiang Du Qi Wang Ayush Jain Alex Zahdeh

Individual vs. Group Work Homeworks Individual only All work must be your own MPs In groups of 2 Within group: Can discuss everything For both HWs and MPs You can discuss with others (and course staff) lecture concepts and the HW/MP question/spec itself, but you cannot discuss solutions or even ideas You cannot copy solutions/code or look at someone else’s solutions We will check (we use Moss, we also compare HWs) First violation: zero on HW/MP. Second violation: F in course. (Both have happened in the past!)

Speaking of HWs and MPs HW1 and MP1 have both been released! Don’t worry – you have time But start early! Start now. You must let us know the composition of your group for MP by this Thursday 8/27 5 pm. Instructions on how to inform us are on the course website You can start on MP right away (don’t need lectures for it) Due in about 2 weeks MP1 due 9/13 HW1 due 9/17

Wrap-Up (Reading for today’s lecture: Relevant parts of Chapter 1) All students: Go to course website and fill out Student Survey sheet https://courses.engr.illinois.edu/cs425/ Fill by this Thursday EOD (8/27) Not yet registered? Fill out waitlist (linked from course webpage) Next two lectures will be… Flipped Classroom! Topics: Timestamps, Ordering, and Snapshots

Flipped Classroom? Lecture Videos for next two lectures are posted on the course website You must see the video lectures before you come to class See “Timestamps and Ordering” lectures (multiple short videos) before this Thursday class See “Snapshots” lectures before next Tuesday class During the class we will not have a lecture Instead during class the TAs will help you work on exercises Bring a pen or pencil (we will not provide one)