CSE 704 Data Center Computing Intro

Slides:



Advertisements
Similar presentations
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 6 2/13/2015.
Advertisements

Social Media Intro to Business & Marketing. The most three most trusted forms of advertising are: Recommendations from people I know - 90% Consumer opinions.
CS 345 Distributed Systems Fabián E. Bustamante, Winter 2004 Welcome to Advanced OS Fabián E. Bustamante (Instructor) Yi Qiao (Ad Honorem TA) Communication.
Front and Back End: Webpage and Database Management Prepared by Nailya Galimzyanova and Brian J Kapala Supervisor: Prof. Adriano Cavalcanti, PhD College.
Welcome to CS 450 Internet Security: A Measurement-based Approach.
EECE 571R (Spring 2010) Autonomic Computing (Building Self* Systems) Matei Ripeanu matei at ece.ubc.ca.
Welcome to CS 395/495 Measurement and Analysis of Online Social Networks.
Biology 475 Topics in Applied Microbiology. Biology 475 Official Course Description BIOL 475 LEC,SEM 0.50 Course ID: 1101 Topics in Applied Microbiology.
A step-by-step tutorial by Auckland Libraries 2011 Chinese Digital Community.
A step-by-step tutorial by Henry Liu Auckland City Libraries Make a start Chinese Digital Community.
Level 2 IT Users Qualification – Unit 1 Improving Productivity
CSE 486/586, Spring 2013 CSE 486/586 Distributed Systems Mid-Semester Overview Steve Ko Computer Sciences and Engineering University at Buffalo.
Advanced Topics in Distributed Systems Fall 2011 Instructor: Costin Raiciu.
Introduction. Readings r Coulouris, Dollimore and Kindberg Distributed Systems: Concepts and Design Edn. 3 m Note: All figures from this book.
Social Media 101 An Overview of Social Media Basics.
Welcome to EECS 395/495 Networking Problems in Cloud Computing.
Edmodo Training A Guide to Getting Started. 2 Free social learning network for teachers, students, schools and districts Safe and easy way to connect.
Visual Studio Windows Azure Portal Rest APIs / PS Cmdlets US-North Central Region FC TOR PDU Servers TOR PDU Servers TOR PDU Servers TOR PDU.
Unit 9: Distributing Computing & Networking Kaplan University 1.
CS614: Advanced Course in Computer Systems (Spring’04) Instructor: Ken Birman TA: non assigned (yet)
What is ? ●It is a Halton District School Board term ●It is HDSB use of Google Apps for education. It includes: ■ Google Docs ■ Google Drive ■ Google Applications.
Web 2.0 IS530 Fall 2009 Dr. Dania Bilal. Web 2.0 Is the Web that is being transformed into a computing platform for delivering web applications to end.
Operating Systems CMPSC 473 Introduction and Overview August 24, Lecture 1 Instructor: Bhuvan Urgaonkar.
AN INTRODUCTION TO FACEBOOK. Learning Objectives A brief introduction to the social networking site Facebook. Instructions to create an account. How to.
Successfully Implementing The Information System Systems Analysis and Design Kendall and Kendall Fifth Edition.
Surf smart training.
CLOUD ARCHITECTURE Many organizations and researchers have defined the architecture for cloud computing. Basically the whole system can be divided into.
Introduction Adult website business is very big and it has loads of cash. You cannot imagine how much a single famous porn site makes a day. There are.
Web Programming Language
Virtual Machine Monitors
Welcome to EECS 395/495 Networking Problems in Cloud Computing
Computer Network Fundamentals CNT4007C
How to Develop and Write a Research Paper.
Introduction to Operating Systems
Welcome to the Microsoft Outlook 2010 for Windows Tech Talk
Facebook – Schools???.
Distributed Programming in “Big Data” Systems Pramod Bhatotia wp
Computer Networks CNT5106C
EEL 6686: Embedded Systems Seminar
CSE 486/586 Distributed Systems Mid-Semester Overview
Boomerang Adds Smart Calendar Assistant and Reminders to Office 365 That Increase Productivity and Simplify Meeting Scheduling OFFICE 365 APP BUILDER.
Large Distributed Systems
Discover How Your Business Can Benefit from a Facebook Fanpage
Discover How Your Business Can Benefit from a Facebook Fanpage
CS533 Concepts of Operating Systems Class 1
TYPES OF SERVER. TYPES OF SERVER What is a server.
That Saves Money and Preserves Opportunities
Operating Systems and Systems Programming
Introduction to Operating Systems
It’s About Time – ScheduleMe Outlook Add-In for Office 365 Enables Users to Schedule Meetings Easily with People Outside of Your Organization Partner Logo.
Cloud Computing and Cloud Networking
Steve Ko Computer Sciences and Engineering University at Buffalo
WIRED Week 2 Syllabus Update Readings Overview.
Lecture 1: Multi-tier Architecture Overview
Syllabus and Introduction Keke Chen
Unit 1.4 Wired and Wireless Networks Lesson 3
Learn to use: Salesfloor Reporting.
Internet and Web Simple client-server model
Log in to the computer in front of you
Topics in Applied Microbiology
Course Overview CS 4640 Programming Languages for Web Applications
Journey to the Cloud – Guidance and Lessons Learned
Topics in Applied Microbiology
Objectives Explain the role of computers in client-server and peer-to-peer networks Explain the advantages and disadvantages of client- server and peer-to-peer.
CSE 486/586 Distributed Systems Distributed File Systems
Sarah Diesburg Operating Systems CS 3430
CSE 486/586 Distributed Systems Consistency --- 1
CS533 Concepts of Operating Systems Class 1
CSE 486/586 Distributed Systems Byzantine Fault Tolerance
Course Overview CS 4640 Programming Languages for Web Applications
Presentation transcript:

CSE 704 Data Center Computing Intro Steve Ko

Administrative Information Organizer: Steve Ko Ph.D., 2009, UIUC Interest: distributed systems, networking, and operating systems Office: 210 Bell Office hours: Wed 12pm – 3pm Email: stevko@buffalo.edu

Seminar Overview Data Center Computing Components Computing activities that utilize data centers An attempt to look at the whole spectrum “What technologies do you use when you access a Web service?” Components Front-end, processing, storage, networking, and virtualization

Seminar Credits 1 Credit 3 Credits Reading papers Writing reviews Presenting one paper (or two) Participating in discussions 3 Credits Additional research project Cannot be used for your master’s project Meet me after class

Reading Papers 24 papers, 2 papers per week Recommended reading: "How to Read a Paper" by S. Keshav (only 2 pages!) “A three-pass approach”

Reading Papers First-pass Read the title, abstract, and intro Read the titles of all sections and subsections Read the conclusion Goal: five C’s Category: which category is this paper in? Context: related papers? Correctness (on the assumptions) Contributions Clarity

Reading Papers Second-pass Third-pass Read with greater care, but still ignore details (e.g., proofs) Jot down key points, ideas, background readings, etc. Third-pass Virtually re-create the paper Put down the paper, start from the same assumptions, and re-create the work.

Writing Reviews Format Don’t need to be long What is the research problem? Do you agree that it’s a problem? Why or why not? What are the main approaches/ideas? Strengths and weaknesses? Other comments E.g., what you liked, what you didn’t understand, possible future work, compare & contrast with other papers, hidden assumptions, etc. Don’t need to be long

Presentation Prepare PowerPoint slides Lead a discussion for an hour (presentation + questions/answers/discussions) Incorporate others’ comments (e.g., what people didn’t understand well) Schedule a time with me for a practice run You can use other people’s slides Make sure you acknowledge them

Assignment for Today Email me whether or not you’re going to stay If you decide to stay, do the rest Look through the schedule Pick two papers you’d like to present 1st choice & 2nd choice FCFS Exception: OpenFlow & NOX (from “Networking – 1” on 10/6) should be presented together with some demo Email me your choices

Topics Overview What makes these possible?

Topics Overview Facebook Internet Google Amazon

Data Centers Buildings full of machines

Data Centers Hundreds of locations in the US

Inside Servers in racks Incredible amounts of engineering efforts Usually ~40 blades per rack ToR (Top-of-Rack) switch Incredible amounts of engineering efforts Power, cooling, etc.

Inside Network

Inside 3-tier for Web services

Topics Overview What makes these possible?

Components Front-end Web browsers File and storage systems Data processing frameworks Networking Virtualization Maybe not an exhaustive list, but a good set…

Front-End Web Browsers Why Web browsers? Practically, they are the OSes in the current generation of computing They run applications Maps, email clients, etc. (AJAX programs) have hundreds of thousands of LOC All traditional OS problems exist Protection, reliability, privacy, performance, etc. Exciting new area of research

A Glimpse into the Issues How much do you trust javascripts?

A Glimpse into the Issues SOP (Single-Origin Policy) How do you control the level of trust? http://integrator.com/ <script src=“http://provider.com/p.js”> </script>

File and Storage Systems

Facebook Statistics 13 M users update their statuses at least once each day 2.5 M users become fans of Pages each day 700 M new photos per month 4 M new videos per month 15 M pieces of content shared per month 2 M new events per month 19 M active groups

File and Storage Systems How do you store? How do you not lose? How do you provide good access latency? How do you maintain? …

Data Processing Frameworks Google 20+ billion web pages ~20KB each = 400 TB ~ 4 months just to read the data And growing… 1999 vs. 2009: ~ 100X Yahoo! US Library of Congress every day (20TB/day) 2 billion photos 2 billion mail + messenger sent per day

Data Processing Frameworks How do you process this large amounts of data? Page rank, ad click statistics, search query trend, user profiling, etc. Again, ~ 4 months to read the data

Virtualization VMM: a piece of software that exposes hardware interfaces (ISA, I/O, etc.) App App VMM OS OS VMM Hardware Hardware

Virtualization Many data centers are virtualized How did we get here? Ease of management (start, stop, migrate, etc.) Consolidation (multiplexing one physical machine) How did we get here?

Networking Obviously, we need a network… But, what kind?

Networking Issues Oversubscription Management TCP performance Etc.

Assignment for Today Email me whether or not you’re going to stay If you decide to stay, do the rest Look through the schedule Pick two papers you’d like to present 1st choice & 2nd choice FCFS Exception: OpenFlow & NOX (from “Networking – 1” on 10/6) should be presented together with some demo Email me your choices