Millennium: Computer Systems, Computational Science and Engineering in the Large David Culler, J. Demmel, E. Brewer, J. Canny, A. Joseph, J. Landay, S.

Slides:



Advertisements
Similar presentations
-Grids and the OptIPuter Software Architecture Andrew A. Chien Director, Center for Networked Systems SAIC Chair Professor, Computer Science and Engineering.
Advertisements

All rights reserved © 2006, Alcatel Grid Standardization & ETSI (May 2006) B. Berde, Alcatel R & I.
Hardware & the Machine room Week 5 – Lecture 1. What is behind the wall plug for your workstation? Today we will look at the platform on which our Information.
System Area Network Abhiram Shandilya 12/06/01. Overview Introduction to System Area Networks SAN Design and Examples SAN Applications.
System Center 2012 R2 Overview
© Chinese University, CSE Dept. Software Engineering / Software Engineering Topic 1: Software Engineering: A Preview Your Name: ____________________.
Beowulf Supercomputer System Lee, Jung won CS843.
Unique Opportunities in Experimental Computer Systems Research - the Berkeley Testbeds David Culler U.C. Berkeley Grad.
Millennium Overview and Status David Culler and Jim Demmel Computer Science Division
NPACI Panel on Clusters David E. Culler Computer Science Division University of California, Berkeley
Millennium Overview and Status David Culler and Jim Demmel Computer Science Division
SimMillenniumSimMillennium: Computer Systems, Computational Science and Engineering in the Large Jim Demmel, David Culler E. Brewer, J. Canny, A. Joseph,
Millennium: Cluster Technology for Computational Science and Engineering David Culler E. Brewer, J. Canny, J. Demmel, A. Joseph, J. Landay, S. McCanne.
Towards I-Space Ninja Mini-Retreat June 11, 1997 David Culler, Steve Gribble, Mark Stemm, Matt Welsh Computer Science Division U.C. Berkeley.
Server Platforms Week 11- Lecture 1. Server Market $ 46,100,000,000 ($ 46.1 Billion) Gartner.
Developing a Cluster Strategy for NPACI All Hands Meeting Panel Feb 11, 2000 David E. Culler Computer Science Division University of California, Berkeley.
ProActive Infrastructure Eric Brewer, David Culler, Anthony Joseph, Randy Katz Computer Science Division U.C. Berkeley ninja.cs.berkeley.edu Active Networks.
Connecting the Invisible Extremes of Computing David Culler U.C. Berkeley Summer Inst. on Invisible Computing July,
IPPS 981 Berkeley FY98 Resource Working Group David E. Culler Computer Science Division U.C. Berkeley
Real Parallel Computers. Background Information Recent trends in the marketplace of high performance computing Strohmaier, Dongarra, Meuer, Simon Parallel.
NPACI: National Partnership for Advanced Computational Infrastructure August 17-21, 1998 NPACI Parallel Computing Institute 1 Cluster Archtectures and.
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
Chapter 1: Hierarchical Network Design
ArcGIS Workflow Manager An Introduction
COnvergence of fixed and Mobile BrOadband access/aggregation networks Work programme topic: ICT Future Networks Type of project: Large scale integrating.
Local Area Networks: Software © Abdou Illia, Spring 2007 School of Business Eastern Illinois University (Week 8, Thursday 3/1/2007)
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 1: Introduction What is an Operating System? Mainframe Systems Desktop Systems.
Nimrod/G GRID Resource Broker and Computational Economy David Abramson, Rajkumar Buyya, Jon Giddy School of Computer Science and Software Engineering Monash.
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
SUMA: A Scientific Metacomputer Cardinale, Yudith Figueira, Carlos Hernández, Emilio Baquero, Eduardo Berbín, Luis Bouza, Roberto Gamess, Eric García,
Multimedia is a combination of text, art, sound, animation, and video.
1 Introduction to Middleware. 2 Outline What is middleware? Purpose and origin Why use it? What Middleware does? Technical details Middleware services.
1 Introduction to Microsoft Windows 2000 Windows 2000 Overview Windows 2000 Architecture Overview Windows 2000 Directory Services Overview Logging On to.
© 2012 xtUML.org Bill Chown – Mentor Graphics Model Driven Engineering.
PARALLEL COMPUTING overview What is Parallel Computing? Traditionally, software has been written for serial computation: To be run on a single computer.
CLUSTER COMPUTING TECHNOLOGY BY-1.SACHIN YADAV 2.MADHAV SHINDE SECTION-3.
SimMillennium Project Overview David E. Culler Computer Science Division U.C. Berkeley NSF Site Visit March 2, 1998.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
ProActive Infrastructure Eric Brewer, David Culler, Anthony Joseph, Randy Katz Computer Science Division U.C. Berkeley ninja.cs.berkeley.edu Active Networks.
Millennium Executive Committee Meeting David E. Culler Computer Science Division
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Comprehensive Scientific Support Of Large Scale Parallel Computation David Skinner, NERSC.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
SimMillennium Systems Requirements and Challenges David E. Culler Computer Science Division U.C. Berkeley NSF Site Visit March 2, 1998.
Societal-Scale Computing: The eXtremes Scalable, Available Internet Services Information Appliances Client Server Clusters Massive Cluster Gigabit Ethernet.
Tackling I/O Issues 1 David Race 16 March 2010.
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
Southern California Infrastructure Philip Papadopoulos Greg Hidley.
Background Computer System Architectures Computer System Software.
History of Windows Operating System. Windows 1.0 Debuted in 1985 First version of Windows that was set up to use bitmap displays and mouse pointing devices.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
Windows 2008 Overview Lecture 1.
Grid Optical Burst Switched Networks
Berkeley Cluster Projects
Programming Models for SimMillennium
Chapter 18 MobileApp Design
U.C. Berkeley Millennium Project
Scaling for the Future Katherine Yelick U.C. Berkeley, EECS
University of Technology
Operating Systems and Systems Programming
IBM Pervasive Computing Visit June 9, 1997
Objective Understand the concepts of modern operating systems by investigating the most popular operating system in the current and future market Provide.
Infrastructure Services
Objective Understand the concepts of modern operating systems by investigating the most popular operating system in the current and future market Provide.
Cluster Computers.
Presentation transcript:

Millennium: Computer Systems, Computational Science and Engineering in the Large David Culler, J. Demmel, E. Brewer, J. Canny, A. Joseph, J. Landay, S. McCanne A. Neureuther, C. Papadimitrou, K. Yelick EECS, U.C. Berkeley Lucent Visit 3/11/99

Millennium2 Project Goals Enable major advances in Computational Science and Engineering –Simulation, Modeling, and Information Processing becoming ubiquitous Explore novel design techniques for large, complex systems –Fundamental Computer Science problems ahead are problems of scale Develop fundamentally better ways of assimilating and interacting with large volumes of information –and with each other Explore emerging technologies –networking, OS, devices

Millennium3 The Vision To work, think, and study in a computationally rich environment with deep information stores and powerful services –test ideas through simulation –explore and investigate data and information –share, manipulate, and interact through natural actions Organized in a manner consistent with the University setting

Millennium4 Topics Today David –Millennium Test bed –Cluster-base High Performance Computing –Towards a Computational Economy Jim –Computational Science and Engineering

Millennium5 Building the Millennium Test Bed

Millennium6 The Community School of Info. Mgmt and Sys. Computer Science Electrical Eng. Mechanical Eng. BMRC Nuclear Eng. IEOR Civil Eng. MSME Inst. Of Transport Business Chemistry Astro Physics Biology Economy Math

Millennium7 NT Workstations for Sci. & Eng. SIMS C.S. E.E. M.E. BMRC N.E. IEOR C. E. MSME Transport Business Chemistry Astro Physics Biology Economy Math

Millennium8 SMP => storage, small-scale parallelism SIMS C.S. E.E. M.E. BMRC N.E. IEOR C. E. MSME Transport Business Chemistry Astro Physics Biology Economy Math

Millennium9 Group Cluster of SMPs => Parallelism SIMS C.S. E.E. M.E. BMRC N.E. IEOR C. E. MSME NERSC Transport Business Chemistry Astro Physics Biology Economy Math

Millennium10 Campus Cluster => large-scale Parallelism SIMS C.S. E.E. M.E. BMRC N.E. IEOR C. E. MSME NERSC Transport Business Chemistry Astro Physics Biology Economy Math

Millennium11 Gigabit Ethernet Connectivity Gigabit Ethernet SIMS C.S. E.E. M.E. BMRC N.E. IEOR C. E. MSME NERSC Transport Business Chemistry Astro Physics Biology Economy Math

Millennium12 Physical Connectivity

Millennium13 Visualization and Novel User Interfaces

Millennium14 Industrial / Academic Collaboration Computers via Intel Technology 2000 grant –200 NT desktops – 16 department 4-way SMPs – 8 5x4 Group Clusters, – 1 ~100x4 Campus Cluster –PPro => Pentium II => Merced Additional storage via IBM SUR grant –0.5 TB this year => 4 TB NT tools via Microsoft grant Solaris x86 tools via SMCC grant Bay Networks discounts the gigabit Ethernet Campus provides Technical staff Research provides the prog. and system support 200 Gflop/s 150 GB memory 8 TB disk

Millennium15 Sample Applications (Jim’s Talk) Astrophysical Simulations –Star formation –Turbulence in geophysical flows –Data-mining Cosmic Microwave Background Radiation CEE Pacific Earthquake Eng. Research Center –Finite element modeling of earthquake impact Technology CAD –Simulation of E-beam and Optical Lithography National Aerospace System Emulation Phylogenetic History of Life

Millennium16 The CS Research Agenda High Performance Cluster Computing Environment –Fast communication on Clusters of SMPs –Compiler Techniques for Performance and Ease of use –Numerical Techniques and Solvers »Particles, FFT, AMR, Multigrid, Sparse and Dense Lin. Alg. Novel System Design Techniques –clusters of clusters –Computational Economy Novel modes of interacting with large amounts of data

Millennium17 Design of a Large Cluster for S&E Classic Architecture Problem “in the large” –Given fixed budget, what is the best partitioning of node, group and campus cluster resources? Basic node has several degrees of freedom –processors per node (4, 2, 1)- Disks –memory capacity- Space, Volume –PCI busses- Power Clustering adds additional degrees of freedom –network, network interfaces Cost is well-defined (Intel) Workload is defined by real applications Design against technology change –Quad PPro, Dual P II, P II, … Merced

Millennium18 Cluster Interconnect Design Proposed design based on MyriNet –16+8 port switch in fat-tree variant –today offers best latency, BW, simplicity, flexibility, and cost »source-based packet routing, open to the metal –link-by-link flow control with cut-through routing –almost reliable System Area Network (SAN) revolution –Tandem/Compaq ServerNet

Millennium19 Communication Interface Revolution Low Overhead Communication “Happens” Academic Research put it on the map –Active Messages (AM), FM, PM, …Unet –Memory Messaging (Get/Put, Reflective, VMMC, Mem. Chan.) Intel / Microsoft / Compaq recognized it –Virtual Interface Architecture 1.0 released 12/16/97 Berkeley VIA over Myrinet released on NT and Linux

Millennium20 Inter-Cluster Networking Gigabit Ethernet - what was the question? –ATM, FiberChannels, HPPI, Serial HPPI, HPPI 6400, SCI, P1394, … fading fast –standard due in April Not Grampa’s Ethernet –switched, full duplex - multiframe bursts –broadcast, multicast trees - level 3 switching –flow control - QoS support Fast Network Interfaces Switches clean and fast Clearly the Storage and Video Transport Is it also the Cluster solution? –VIA/IP

Millennium21 Inter-Cluster Research Agenda Vastly expands the scope of systems challenge –integrate well-connected resources according application needs, rather than physical packaging –resource allocation, management, and administration Network bandwidth matches display BW –Protocols and run-time sys. for visualization, media transport, interaction, and collaboration. Community can share non-trivial resources while preserving sense of ownership –Bandwidth translates into efficiency of exchange –Data can be anywhere Important networking technology in its own right. –Layer 3 switching, QoS, VLan

Millennium22 User Interaction High-quality 3D graphics emerging on cost- effective platforms –desktops and dedicated cluster nodes –NERSC team provides modern scientific visualization support Gigabit network allows this to be remote. New displays create “workbench” environment where large volumes of information can be viewed and manipulated. Trackers and Haptic interfaces greatly enhance degrees of user input –3D capture

Millennium23 A Millennium Cluster 16x2 Processor 400 MHz Pentium II 100 MHz Memory Bus 33 MHz 32-Bit PCI 100BaseTX Ethernet Myrinet M2F Windows NT 4.0 –Terminal Server Edition

Millennium24 Three New Technologies NT Distributed COM (DCOM) –For parallel remote execution of sort. River System –Automated management of distributed data flows Virtual Interface Architecture (VIA) –High performance user-level communication RIVER Sort Core GetPut Partitioner Net Sources Net Sinks

Millennium25 World-Record Datamation Sort Old Record (NOW)

Millennium26 Computational Economy Approach System has a supply of various resources Demand on resources revealed in price –distinct from the cost of acquiring the resources User has unique assessment of value Client agent negotiates for system resources on user’s behalf –submits requests, receives bids or participates in auctions –selects resources of highest value at least cost

Millennium27 Advantages of the Approach Decentralized load balancing –according to user’s perception of what is important, not system’s own metric –adapts to system and workload changes Creates Incentive to adopt efficient modes of use –exploit under-utilized resources –maximize flexibility (e.g., migratable, restartable applications) Establishes user-to-user feedback on resource usage –basis for exchange rate across resources Powerful framework for system design –Natural for client to be watchful, proactive, and wary –Generalizes from resources to services Rich body of theory ready for application

Millennium28 Millennium Resource Allocation Property rights establish “fair share” currency –each brings resources to the system Price determined by competition for the resource User (agent) determines value Provide enabling technology for Evolution of markets –bilateral trade –multilateral trade –standardized contracts –markets for resources and services Monitor how it progresses Elevate useful applications into Services

Millennium29 Approach: Focus on Services Most users use services (only) –such users don’t need accounts on all systems –easier to use, output is graphs/visualization –enables easy student/class usage –services solve specific problems »protein folding, SVD, simulations,... Some users will still log in, write apps Easy conversion of apps to services

Millennium30 Service Economics Services make economic models simpler! –Services simplify resource tracking over time –Build models for each service »can tie resource needs to service inputs »can bid well based on history –Services are well defined => pay per use Services abstract resources –enables high availability –enables varying resources over time Current Demonstration: TACC transformational services –transcend, wingman

Millennium31 System Administration Uniformity is key Clusters evolve and are constantly changing over time Administrative domains tend to diverge => create incentive to simplify administration –more uniform, higher value Build automated system providing weakly consistent database of the state of system health and inference rules –apply expert system diagnosis technology

Millennium32 Systems of Systems Design It is about making things work at large scale –things change, things break, demands extreme Make all components wary, reactive, and self- tuning Use implicit information whenever possible User behavior is critical to closing the loop –when there is personal responsibility Millennium is a good model of large scale systems challenges

Millennium33 What is Millennium About? An experiment in large-scale system design Advance the state of computational science and engineering Exploring novel design techniques Exploring important new technologies