Berkeley Cluster Projects


Berkeley Cluster Projects
David E. Culler
culler@cs.berkeley.edu
http://now.cs.berkeley.edu/
11/23, 1998

Goals
Make a fundamental change in how we design and construct large-scale systems
  market reality: 50%/year performance growth => cannot allow 1-2 year engineering lag
  technological opportunity: single-chip “Killer Switch” => fast, scalable communication
Highly integrated building-wide, campus-wide systems
Explore novel system design concepts in this new “cluster” paradigm
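The "market reality" point can be checked with a line of arithmetic. The sketch below assumes the 50%/year figure compounds annually and that a system's components are frozen at the start of its engineering cycle; the function name is hypothetical, not from the slides.

```python
# Annual commodity performance growth quoted on the slide (50%/year).
GROWTH_PER_YEAR = 0.50

def commodity_gain(lag_years: float, growth: float = GROWTH_PER_YEAR) -> float:
    """Factor by which commodity parts improve while a design is in engineering.

    Assumes simple annual compounding; lag_years is the engineering lag.
    """
    return (1.0 + growth) ** lag_years

for lag in (1, 2):
    print(f"{lag}-year lag: commodity parts are {commodity_gain(lag):.2f}x faster")
```

Under these assumptions a 1-year lag costs a factor of 1.5 and a 2-year lag a factor of 2.25 relative to parts available at ship time.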

100-node Ultra/Myrinet NOW

Fast Communication Challenge
Fast processors and fast networks
The time is spent in crossing between them
[Diagram: Killer Platform nodes, each with Comm. Software and Network Interface Hardware, joined by a Killer Switch; time-scale labels ns, µs, ms]

Opening: Intelligent Network Interfaces
Dedicated processing power and storage embedded in the network interface
An I/O card today
Tomorrow on chip?
[Diagram: Sun Ultra 170 nodes (blocks labeled P, $, M), each with a Myricom NIC on the I/O bus (S-Bus, 50 MB/s), attached to the Myricom net (160 MB/s)]
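The two rates on this slide suggest where the data path is limited. A minimal sketch, assuming both figures are peak rates and ignoring software overhead: the sustained rate through a chain of stages is bounded by the slowest one. The dictionary and function names are illustrative only.

```python
# Rates of the two stages named on the slide, in MB/s (assumed to be peak rates).
stages = {"Myricom net": 160, "I/O bus (S-Bus)": 50}

def bottleneck(stage_rates):
    """Return (name, rate) of the slowest stage, which bounds end-to-end bandwidth."""
    name = min(stage_rates, key=stage_rates.get)
    return name, stage_rates[name]

name, rate = bottleneck(stages)
print(f"End-to-end bandwidth is bounded by the {name} at {rate} MB/s")
```

On these numbers the I/O bus, not the network link, sets the ceiling, which is consistent with the slide's question about moving the interface on chip.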

NOW System Architecture
[Layered diagram, top to bottom]
Parallel Apps, Large Seq. Apps
Sockets, Split-C, MPI, HPF, vSM
Global Layer UNIX: Resource Management, Network RAM, Distributed Files, Process Migration
UNIX Workstation (x4)
Comm. SW (x4)
Net Inter. HW (x4)
Fast Commercial Switch (Myrinet)

Communication Performance => Direct Network Access
LogP: Latency, Overhead, and Bandwidth
Active Messages: lean layer supporting programming models
[Plot: axes labeled Latency and 1/BW]
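As a rough illustration of how LogP parameters turn into message times, the sketch below uses the standard LogP formulation (latency L, overhead o, gap g, processors P). The parameter values are invented for the example and are not measurements from NOW; the class and method names are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class LogP:
    L: float  # network latency for a small message (µs)
    o: float  # processor overhead to send or to receive one message (µs)
    g: float  # minimum gap between consecutive messages (µs); 1/g is message bandwidth
    P: int    # number of processors

    def one_way(self) -> float:
        """Send overhead + wire latency + receive overhead for one small message."""
        return self.o + self.L + self.o

    def round_trip(self) -> float:
        """Request plus reply, with no handler compute time in between."""
        return 2 * self.one_way()

    def pipelined(self, k: int) -> float:
        """Time until the last of k back-to-back small messages is received."""
        if k < 1:
            raise ValueError("k must be at least 1")
        interval = max(self.g, self.o)  # sender issues one message per interval
        return (k - 1) * interval + self.one_way()

# Hypothetical parameter values, chosen only to make the arithmetic easy to follow.
model = LogP(L=5.0, o=3.0, g=6.0, P=100)
print(model.one_way(), model.round_trip(), model.pipelined(10))
```

With these made-up values the overhead terms (2o = 6 µs) exceed the wire latency (5 µs), which is the kind of imbalance a lean layer such as Active Messages is meant to reduce.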

World-Record Disk-to-Disk Sort
Sustain 500 MB/s disk bandwidth and 1,000 MB/s network bandwidth

Massive Cheap Storage
Basic unit: 2 PCs double-ending four SCSI chains
Currently serving Fine Art at http://www.thinker.org/imagebase/

Cluster of SMPs (CLUMPS)
Four Sun E5000s
  8 processors
  3 Myricom NICs
Multiprocessor, Multi-NIC, Multi-Protocol

Information Servers
Basic Storage Unit:
  Ultra 2, 300 GB RAID, 800 GB tape stacker, ATM
  scalable backup/restore
Dedicated Info Servers
  web, security, mail, …
VLANs project into dept.

Millennium Computational Community
[Diagram labels: Gigabit Ethernet; SIMS, Business, BMRC, Chemistry, C.S., E.E., Biology, Astro, NERSC, M.E., Physics, N.E., IEOR, Math, Transport, Economy, C.E., MSME]

Millennium PC Clumps
Inexpensive, easy to manage Cluster
Replicated in many departments
Prototype for very large PC cluster

Proactive Infrastructure
Information appliances
Stationary desktops
Scalable Servers