CS 5413: High Performance Systems and Networking

Slides:



Advertisements
Similar presentations
CMPT 275 Software Engineering
Advertisements

Welcome to the seminar course
Unshackle the Cloud: Commoditization of the Cloud Hakim Weatherspoon Assistant Professor, Dept of Computer Science CS 5412, Guest Lecture, Cornell University.
CS 345 Distributed Systems Fabián E. Bustamante, Winter 2004 Welcome to Advanced OS Fabián E. Bustamante (Instructor) Yi Qiao (Ad Honorem TA) Communication.
CS 6410: Advanced Systems Fall 2010 Instructor: Hakim Weatherspoon TA: Ki Suh Lee.
CS 6410: Advanced Systems Fall 2009 Instructor: Hakim Weatherspoon TA: Dan Williams.
CS 6410: Advanced Systems Fall 2011 Instructor: Hakim Weatherspoon TA: Ji Yong Shin.
CS 6464: Advanced Distributed Storage Systems Spring 2009 Instructor: Hakim Weatherspoon.
COMS E Cloud Computing and Data Center Networking
Welcome to CS 450 Internet Security: A Measurement-based Approach.
Data Center Middleboxes Hakim Weatherspoon Assistant Professor, Dept of Computer Science CS 5413: High Performance Systems and Networking November 24,
Data Center Virtualization: Open vSwitch Hakim Weatherspoon Assistant Professor, Dept of Computer Science CS 5413: High Performance Systems and Networking.
Welcome to CS 395/495 Measurement and Analysis of Online Social Networks.
Topics Problem Statement Define the problem Significance in context of the course Key Concepts Cloud Computing Spatial Cloud Computing Major Contributions.
Data Center Traffic and Measurements: Available Bandwidth Estimation Hakim Weatherspoon Assistant Professor, Dept of Computer Science CS 5413: High Performance.
ECS15: Introduction to Computers Fall 2013 Patrice Koehl
COP4020/CGS5426 Programming languages Syllabus. Instructor Xin Yuan Office: 168 LOV Office hours: T, H 10:00am – 11:30am Class website:
Component 4: Introduction to Information and Computer Science Unit 10: Future of Computing Lecture 2 This material was developed by Oregon Health & Science.
Introduction to Operating Systems J. H. Wang Sep. 18, 2012.
CS 150 PERSONAL PRODUCTIVITY USING TECHNOLOGY Instructor: Dr. Xenia Mountrouidou.
Computer Network Fundamentals CNT4007C
Syllabus CS 765: Introduction to Database Management Systems Fall 2008 Text Database Management Systems Ramakrishnan/Gehrke, 3rd.
CS 450 MODELING AND SIMULATION Instructor: Dr. Xenia Mountrouidou (Dr. X)
CSE 501N Fall ‘09 00: Introduction 27 August 2009 Nick Leidenfrost.
Data Center Virtualization: Xen and Xen-blanket
CS 458 Internet Engineering Instructor: Prof. Jörg Liebeherr University of Virginia.
COMP Introduction to Programming Yi Hong May 13, 2015.
Computer Networks CEN 5501C Spring, 2008 Ye Xia (Pronounced as “Yeh Siah”)
CSSE 250 (First class) Dr. Yingwu Zhu Office: ENGR 530 Phone: Emai:
Lecture 1 Page 1 CS 239, Fall 2010 Introduction CS 239 Advanced Topics in Computer Security Peter Reiher September 23, 2010.
Component 4: Introduction to Information and Computer Science Unit 10b: Future of Computing.
WEEK-1 PRINCIPLES OF MANAGEMENT BUSN 107, Özge Can.
Introduction to Information Systems and Technology MIS 213, Spring 2015 CIS 2005, CIS 1007.
Introduction to Operating Systems J. H. Wang Sep. 18, 2015.
CS 101 Today’s class will start 5 minutes late. CS 101 Introduction to Computer Science Aaron Bloomfield University of Virginia Spring 2007.
CSE 534: Advanced Computer Networks
Advanced Systems and Network Security Fall 2015 Instructor: Kun Sun, Ph.D.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Basic Seminar Rules Stay on Topic If we seem to be getting off track or too much is going on at once, then I will ask everyone to HOLD UP. That is your.
CSSE 250 Dr. Yingwu Zhu Office: ENGR 530 Phone: Emai:
Principles of Computer Science I Honors Section Note Set 1 CSE 1341 – H 1.
1 CAP6133: Advanced Topics in Computer Security and Computer Forensics (spring’08) Class Overview Dr. Cliff Zou.
CM220: Unit 1 Seminar “You must be the change you wish to see in the world.” ~ Mohandas Gandhi.
Introduction to Operating Systems J. H. Wang Sep. 15, 2010.
IST 210: Organization of Data
CS 6410: Advanced Systems Fall 2013 Instructor: Hakim Weatherspoon TA: Erluo Li and Qin Jia.
CSCD 330 Network Programming Winter 2015 Lecture 1 - Course Details.
CS614: Advanced Course in Computer Systems (Spring’04) Instructor: Ken Birman TA: non assigned (yet)
Prof. James A. Landay Computer Science Department Stanford University Winter 2016 dt+UX 2 : USER EXPERIENCE DESIGN PROJECT Introduction & Course Overview.
Introduction to Operating Systems J. H. Wang Sep. 13, 2013.
Dr. Jeff Cummings MIS323 Business Telecommunications.
Advances in Cloud Computing CIS6930/CIS4930
Computer Networks CNT5106C
RANDY MODOWSKI COSC Cloud Computing. Road Map What is Cloud Computing? History of “The Cloud” Cloud Milestones How Cloud Computing is being used.
Discussion Context NIST Cloud definition and extension to address network and infrastructure issues Discussion of the ISPD-RG Infrastructure definition.
CS & CS ST: Probabilistic Data Management Fall 2016 Xiang Lian Kent State University Kent, OH
Computer Network Fundamentals CNT4007C
Introduction to Operating Systems
CSE 704 Data Center Computing Intro
Overview: Cloud Datacenters
Computer Networks CNT5106C
A PhD-oriented course about research in systems
MIS323 Business Telecommunications
CS 5413: High Performance Systems and Networking
CS 6410: Advanced Systems Prof. Hakim Weatherspoon
Computer Networks CNT5106C
MIS323 Business Telecommunications
CS 6410: Advanced Systems Prof. Hakim Weatherspoon
Office: ENGR 530 Phone: Emai:
Computer Networks CNT5106C
Presentation transcript:

CS 5413: High Performance Systems and Networking Hakim Weatherspoon Assistant Professor, Dept of Computer Science CS 5413: High Performance Systems and Networking August 27, 2014

Goals for Today Be brief! Background on Professor Why take this course? How does this class operate? Class details

Who am I? Prof. Hakim Weatherspoon (Hakim means Doctor, wise, or prof. in Arabic) Background in Education Undergraduate University of Washington Played Varsity Football Some teammates collectively make $100’s of millions I teach!!! Graduate University of California, Berkeley Some class mates collectively make $100’s of millions Background in Operating Systems Peer-to-Peer Storage Antiquity project - Secure wide-area distributed system OceanStore project – Store your data for 1000 years Network overlays Bamboo and Tapestry – Find your data around globe Tiny OS Early adopter in 1999, but ultimately chose P2P direction

Context The promise of the Cloud A computer utility; a commodity Catalyst for technology economy Revolutionizing for health care, financial systems, scientific research, and society The cloud completes the commoditization process and makes storage and computation a commodity. SEATTLE

Context The promise of the Cloud ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly provisioned and released with minimal management effort or service provider interaction. NIST Cloud Definition The cloud completes the commoditization process and makes storage and computation a commodity. Public vs private IaaS, PaaS, SaaS On demand (self service), network access, resource pooling, rapid elasticity, measured service SEATTLE

Context The promise of the Cloud ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly provisioned and released with minimal management effort or service provider interaction. NIST Cloud Definition The cloud completes the commoditization process and makes storage and computation a commodity. Public vs private IaaS, PaaS, SaaS On demand (self service), network access, resource pooling, rapid elasticity, measured service SEATTLE

Context The promise of the Cloud ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly provisioned and released with minimal management effort or service provider interaction. Requires fundamentals in distributed systems Networking Computation Storage NIST Cloud Definition The cloud completes the commoditization process and makes storage and computation a commodity. Public vs private IaaS, PaaS, SaaS On demand (self service), network access, resource pooling, rapid elasticity, measured service

High Performance Networks How to optimize a global network of data centers?

High Performance Networks How to optimize a global network of data centers? E.g. Need to optimize movement of data between DCs [NSDI 2013, NSDI 2008, FAST 2009, IMC 2010, DSN 2010]

High Performance Networks How to optimize a global network of data centers? E.g. Investigate novel data center designs [ToN 2013 and ANCS 2012; best paper] Core Switch (CS) Aggregate Switch (AS) … … … ToR … … … … … …

High Performance Storage Large organizations considering using the cloud New York Times Netflix Nintendo Cornell Library of Congress The more data you have, the harder it is to move Switching providers entails paying for bandwidth twice Inhibits opportunistic migration

High Performance Storage How hard is it to move a PetaByte? Titan tech boom, randy katz, 2008

High Performance Storage All my valuable data/computation is in the cloud Am I locked in to one provider forever? The more data you have, the harder it is to move RACS: Redundant Array of Cloud Storage Collaboration with the Internet Archive and IBM [SOCC 2010]; See Also [EuroSys 2007, FAST 2009, FAST 2013] k=3 n=4 RACS: Redundant Array of Cloud Storage Stripe data over n cloud storage repositories Any k-subset contains a complete copy of data Tolerate up to (n-k) failures Written RACS(k,n) RACS(3,4)

High Performance Storage All my valuable data/computation is in the cloud Am I locked in to one provider forever? The more data you have, the harder it is to move RACS: Redundant Array of Cloud Storage Collaboration with the Internet Archive and IBM See Also [EuroSys 2007, FAST 2009, FAST 2013] RACS(3,4) 33KB 33KB Object 100 KB Object 100 KB 33KB 33KB k=3 n=4 RACS: Redundant Array of Cloud Storage Stripe data over n cloud storage repositories Any k-subset contains a complete copy of data Tolerate up to (n-k) failures Written RACS(k,n) Relative Storage n/k Relative Upload Bandwidth Relative Download Bandwidth 1 RACS(3,4)

High Performance Storage Estimated Cost of Switching Cloud Providers 33KB RACS(3,4) 33KB Object 100 KB 33KB Relative Storage n/k Relative Upload Bandwidth Relative Download Bandwidth 1 33KB

High Performance Storage RACS: How do I optimize storage globally Collaboration with Internet Archive / IBM Gecko: How do I optimize storage locally Collaboration with Google and Microsoft 33KB 33KB Object 100 KB RACS(3,4) 33KB RACS and Gecko 33KB Relative Storage n/k Relative Upload Bandwidth Relative Download Bandwidth 1

High Performance Computation Can I compute in the cloud if some of my data is in a vault at home or on another provider Xen-Blanket and VirtualWire Collaboration with IBM [HotCloud 2012, EuroSys 2012] 33KB 33KB 33KB 33KB

High Performance Computation Can create your own Cloud-within-a-Cloud Migrate computation among different cloud providers Guest OS App 33KB Xen-Blanket VMM 1 Guest OS App 33KB Object 100 KB Guest OS App Xen-Blanket VMM 2 33KB Xen-Blanket VMM Xen-Blanket VMM 3 33KB Xen-Blanket VMM 4

High Performance Computation VM Switch Can create your own Cloud-within-a-Cloud Migrate computation among different cloud providers 33KB Xen-Blanket VMM 1 33KB Object 100 KB Xen-Blanket VMM 2 33KB Xen-Blanket VMM Xen-Blanket VMM 3 33KB Xen-Blanket VMM 4

My Contributions Cloud Networking Cloud Computation & Vendor Lock-in SoNIC in NSDI 2013 Wireless DC in ANCS 2012 (best paper) and NetSlice in ANCS 2012 Bifocals in IMC 2010 and DSN 2010 Maelstrom in ToN 2011 and NSDI 2008 Chaired Tudor Marian’s PhD 2010 (now at Google) Cloud Computation & Vendor Lock-in Plug into the Supercloud in IEEE Internet Computing-2013 Supercloud/Xen-Blanket in EuroSys-2012 and HotCloud-2011 Overdriver in VEE-2011 Chaired Dan William’s PhD 2012 (now at IBM) Cloud Storage Gecko in FAST 2013 / HotStorage 2012 RACS in SOCC-2010 SMFS in FAST 2009 Antiquity in EuroSys 2007 / NSDI 2006 Chaired Lakshmi Ganesh’s PhD 2011 (now at Facebook)

Goals for Today Be brief! Background on course and Professor Why take this course? How does this class operate? Class details

Why take this course Learn about systems abstractions, principles, and artifacts that have lead to the high performance systems and networks we see in the cloud, Understand attributes of systems research that is likely to have impact, Become comfortable navigating the statee of the art in systems and networking, Gain experience in thinking critically and analytically about systems research, and Acquire the background needed to work on cloud and data center problems currently under study at Cornell and elsewhere.

Who is the course “for”? MEng students MEng Project Students who have mastered 4410/4411 PhD students as well Serious undergraduates MEng Project Projects in this course can be used to satisfy MEng project requirements

Goals for Today Be brief! Background on course and Professor Why take this course? How does this class operate? Class details

How this class operates Instructor: Hakim Weatherspoon hweather@cs.cornell.edu Office Location: 427 Gates Hall TA: Ki Suh Lee and Han Wang kslee@cs.cornell.edu and hwang@cs.cornell.edu Lectures: CS 5413: M,W,F: 1:25–2:15 PM, 205 Thurston Hall Three slots reserved a week, ***but lecture will be twice a week on average***

Course Help Course staff, office hours, etc: MEng projects http://www.cs.cornell.edu/courses/cs5413/2014fa MEng projects http://www.cs.cornell.edu/courses/cs5413/2014fa/projects.htm

CS 5413: Overview Prerequisite: Class Structure Mastery of CS 4410 and 4411 material Fundamentals of OS design How parts of the OS are structured What algorithms are commonly used What are the mechanisms and policies used Programming in C/C++ Class Structure Lecture/Readings Labs/Homeworks Project In class Quizzes

CS 5413: Topics Overview High Performance Networking Basics Cloud computing, and Internet vs Data Center Networks High Performance Networking Basics Textbook networking vs Data Center Networks Network protocol stack: TCP/IP protocol stack High Performance Data Center Systems & Networks Basic Switching Technologies: 50Gb/s routers &NetFPGA Data Center Topologies, Software Router Designs, Alternative switching technologies, Data Center Transport Software defined networking, virtual networks Middleboxes, advanced topics Data Center traffic and analysis

CS 5413: Paper Readings Required reading is always one paper and/or book reading Book reading provides basic background knowledge Papers pulled from, best journals and conferences TOCS, SOSP, OSDI, … Read papers before each class and bring notes takes ~2 hrs, write notes and questions Write a review and turn in at least two hours before beginning of class Turn on online via Course Management System (CMS) No late reviews will be accepted

CS 5413: Writing Reviews Each student is required to prepare notes on each paper before class and to bring them to class for use in discussion. Your notes should list assumptions, innovative contributions and criticisms. Every paper in the reading list has at least one major weakness. Turn paper reviews in online before class via CMS Be succinct—One paragraph per paper Short summary of paper (two or three sentences) Two to three strengths/contributions and at least one weaknesses

CS 5413: Lecture Format 40 minute presentation All students are required to read ahead of time and participate! Counts in final grading.

CS 5413: Labs/Homeworks 4 labs/homeworks Facilities work in groups 1-3 weeks per lab/homework Topics Building a network proxy (singled, then multi-threaded) Implement an N-port switch Implement a softward-defined network (sdn) switch/controller Facilities Using cloud (Amazon EC2/S3 or local Fractus cloud)

CS 5413: Project One major project per group Groups include three people Group formation – early September Initial selection of project topic – due mid-September Survey of area (related works)–due begin of October Midterm draft paper – due begin of November Peer reviews—due a week later Final demo/presentation–due begin of December Final project report – due a week later

CS 5413: Project Suggestions One major project per group Groups include three people Group formation – early September Initial selection of project topic – due mid-September Survey of area (related works)–due begin of October Midterm draft paper – due begin of November Peer reviews—due a week later Final demo/presentation–due begin of December Final project report – due a week later

CS 5413: Project Infrastructure GENI: Global Environment for Networking Innovations SoNIC: Software Network Interface Cards NetFPGA Fractus: our very own (mini) cloud Amazon’s Cloud Infrastructure EC2/S3 Emulab PlanetLab Cornell’s Center for Advanced Computing (CAC) …

Academic Integrity Submitted work should be your own Acceptable collaboration: Clarify problem, C syntax doubts, debugging strategy You may use any idea from any other person or group in the class or out, provided you clearly state what you have borrowed and from whom. If you do not provide a citation (i.e. you turn other people's work in as your own) that is cheating. Dishonesty has no place in any community May NOT be in possession of someone else’s homework/project May NOT copy code from another group May NOT copy, collaborate or share homework/assignments University Academic Integrity rules are the general guidelines Penalty can be as severe as an ‘F’ in CS 6410 To draw a very clear line, you may use any idea from any other person or group in the class or out, provided you clearly state what you have borrowed and from whom. If you do not provide a citation---that is, you turn other people's work in as your own---that is cheating. Anything else is fair game. Of course, we will be grading you on the ideas you have added, but you should always borrow as much as you can as a starting point as there is no point in reinventing the wheel.

Stress, Health and Wellness Need to pace yourself to manage stress Need regular sleep, eating, and exercising Do not come to class sick (with the flu)! Email me ahead of time that you are not feeling well People not usually sick more than once in a semester

Before Next time Read one paper below and write review The Cost of a Cloud: Research Problems in Data Center Networks, A. Greenberg, J. Hamilton, D. A. Maltz, P. Patel. ACM SIGCOMM computer communication review, Volume 39, Issue 1 (January 2009), pages 68--73. http://dl.acm.org/citation.cfm?id=1496103 (can only access link within Cornell network). http://131.107.65.14/en-us/um/people/dmaltz/papers/DC-Costs-CCR-editorial.pdf (can access outside Cornell network) Check website for updated schedule