NSF Visit Gordon Bell www.research.microsoft.com/~gbell Microsoft Research 4 October 2002.


1 NSF Visit Gordon Bell www.research.microsoft.com/~gbell Microsoft Research 4 October 2002

2 Topics How much have things changed since CISE was formed in 1986, yet remained the same? 10 year base case @CRA’s Grand Challenges? http://www.google.com/search?sourceid=navclient&q=cra+grand+challenges GB MyLifeBits: storing one’s entire life for recall, home media, etc. Clusters, Grids, and Centers…challenge is apps Supercomputing directions

3 Messages… The Grand Challenge for CISE is to work on applications in science, engineering, and bio/medicine/health care (e.g. NIH). Databases versus grepping. Revolution needed. Performance from software >= Moore’s Law. Big challenge moving forward will come from trying to manage and exploit all the storage. Supercomputing: Cray. Gresham's Law. Build on industry standards and efforts. Grid and “web services” must co-operate. Whatever happened to the first Grand Challenges? Minimize grant overhead… site visits.

4 IBM Sets Up Biotech Research Center U.S.-based IBM recently set up a biotechnology research and development center in Taiwan -- IBM Life Sciences Center of Excellence -- the company's first in the Asia Pacific region… the center will provide computation solutions and services from an integrated bio-information database linked to resources around the world. Local research institutes working in cooperation with the center include Academia Sinica, the Institute for Information Industry and National Yang Ming University. From HPCWire 30 September 2002

5 Retrospective: CISE formed in 1986 CISE spent about $100 million on research in 1987 Q: What areas of software research do you think will be the most vital in the next decade? A: Methods to design and build large programs and data bases in a distributed environment are central. Q: What software research areas are funded? A: We fund what the community considers to be important … object-oriented languages, data bases, & human interfaces; semantics; formal methods of design and construction; connectionism; and data and knowledge bases, including concurrency. We aren’t funding applications.

6 Software Productivity c1986 I believe the big gains in software will come about by eliminating the old style of programming, by moving to a new paradigm, rather than magic tools or techniques to make the programming process better. Visicalc and Lotus 1-2-3 are good examples of a dramatic improvement in programming productivity. In essence, programming is eliminated and the work put in the hands of the users. These breakthroughs are unlikely to come from the software research community, because they aren’t involved in real applications. Most likely they will come from people trained in another discipline who understand enough about software to be able to carry out the basic work that ultimately is turned over to the software engineers to maintain and evolve.

7 Software productivity c1986 Q: The recent Software Engineering Conference featured a division of opinion on mechanized programming. … developing a programming system to write programs can automate much of the mundane tasks… A: Mechanized programming is recreated and renamed every few years. In the beginning, it meant a compiler. The last time it was called automatic programming. A few years ago it was program generators and the programmer’s work bench. The better it gets, the more programming you do!

8 Parallelism c1986 To show my commitment to parallel processing, for the next 10 years I will offer two $1000 annual awards for the best, operational scientific or engineering program with the most speedup... Q: What …do you expect from parallelism in the next decade? A: Our goal is obtaining a factor of 100 … within the decade and a factor of 10 within five years. 10 will be easy because it is inherently in most applications right now. The hardware will clearly be there if the software can support it or the users can use it. Many researchers think this goal is aiming too low. They think it should be a factor of 1 million within 15 years. However, I am skeptical that anything more than our goal will be

9 No challenge, next decade of systems. Industry’s evolutionary path… ¿Que sera sera? Computing Research Association Grand Challenges. Gordon Bell, Microsoft Research, 26 June 2002. [Chart: Goodness versus Time, 2000 to 2012; curves labeled Base Case, Grand Challengeland, Death and Doldrums]

10 The “base case”: A standard and basis for planning “Grand Challenges” The hardware platforms, networks, interfaces, and programming environments for applications that will be built in 2012, with only modest R&D, are posited as a “base case” prediction, using computing’s 30-50 year evolution. In essence, it is “what is likely to be”. This base case was constructed to act both as a standard to measure against and as a base for the CRA Grand Challenges Workshop that met on June 24-26. It also includes risks that may inhibit the base formation and any other GCs. It does not include applications that might drive a GC or that might result from having such platforms, as these are often the result of serendipity, e.g. PCs and spreadsheets; ARPAnet and email; Internet and the web.

11 In a decade… the evolution (outline) The political environment Platforms we are likely to get or have, on which apps are built Each decade a new net/platform/interface occurs The environments to support possible apps Impediments that could thwart the evolution R & D challenges

12 In a decade, the evolution: We can count on: Moore’s Law provides ≈50-100x performance, const. $ 20% $ decrease/year => ½ per 5 years Terabyte personal stores => personal db managers Astronomical sized, by current standards, databases! Paper quality screens on watch, tablets… walls DSL wired, 3-4G/802.11 j nets (>10 Mbps) access Network Services: Finally computers can use|access the web. “It’s the Internet, Stupid.” –Enabler of intra-, extra-, inter-net commerce –Finally EDI/Exchanges/Markets Ubiquity rivaling the telephone. –Challenge: An instrument to supplant the phone? –Challenge: Affordability for everyone on planet <$1500/year Personal authentication to access anything of value Murphy’s Law continues with larger and more complex systems, requiring better fundamental understanding. An opportunity and need for “Autonomic Computing”
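The ≈50-100x per decade figure is what simple compounding gives for plausible doubling intervals. A minimal sketch, assuming performance doubles at a fixed interval; the 18-, 21-, and 24-month intervals and the function names are illustrative assumptions, not values from the slide:

```python
import math


def growth_factor(years: float, doubling_months: float) -> float:
    """Performance multiple after `years` if it doubles every `doubling_months`."""
    return 2.0 ** (years * 12.0 / doubling_months)


def doubling_months_for(factor: float, years: float) -> float:
    """Doubling interval (in months) needed to reach `factor` within `years`."""
    return years * 12.0 / math.log2(factor)


if __name__ == "__main__":
    # Assumed doubling intervals; the slide only states the decade-level result.
    for months in (18, 21, 24):
        print(f"doubling every {months} months -> "
              f"{growth_factor(10, months):.0f}x per decade")
    for target in (50, 100):
        print(f"{target}x per decade needs doubling every "
              f"{doubling_months_for(target, 10):.1f} months")
```

Under these assumptions, a doubling interval of roughly 18 to 21 months spans the slide's 50-100x range, while a 24-month interval falls short of it.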

13 In a decade, the evolution: We are likely to “have” 120M computers/yr. World population >1B. –increasing with decreasing price. 2x / -50% –X% are discarded. Result is 1 Billion. Smaller personals w/phones… video @PDA $ Almost adequate speech communication for commands, limited dictation, note taking, segmenting/indexing video Vision capable of tracking each individual in a relatively large crowd. With identity, everybody’s location is known, everywhere, anytime.

14 Inevitable wireless nets… body, home, …x-area nets will create new opportunities Need to construct these environments of platforms, networking protocols, and programming environments for each kind Each net has to research its own sensor/effector structure as f(application) e.g. body, outdoor, building, … Taxonomy includes these alternative dimensions: –Network function –master|slave vs. distributed… currently peripheral nets –permanent|dynamic –indoor|outdoor; –size and spatial diameter; –bandwidth and performance; –sensor/effector types; –security and noise immunity;

15 New environments can support a wide range of new apps Continued evolution of personal monitoring and assistance for health and personal care of all ages Personal platforms that provide “total recall” that will assist (25% of population) solving problems Platforms for changing education will be available. Limiters: Authoring tools & standards; content Transforming the scientific infrastructure is needed! –petabyte databases, petaflops performance –shared data notebooks across instruments and labs –new ways of performing experiments and –new ways of programming/visualizing and storing data. Serendipity: Something really new, like we get every decade but didn’t predict, will occur.

16 The worst case impediment! Economy continues to worsen No investment for: 1. IT. Industry cannot sustain Moore’s Law 2. Startups for new computer classes cannot form.

17 Impediments: The exogenous constraints “challenge” Intellectual Property: providing sufficiently secure payment and protection paralleling the “atoms” world is required –Books/e-books, CDs/songs, video, software are jeopardized and these industries collapse and cannot proceed to be electronic Accessibility and protection of one’s personal information, enabling commerce…healthcare Personal & organizational inertia brought about by: –Accelerated backlash/resistance of “automation” in services industries by “depression economy” e.g. healthcare thwarts growth and change –loss of privacy Incumbent ILECs thwart 4G/802 j build out CS Research is dominated by Government Agenda!!! Universities develop weapons instead of ideas. Continued, muddling wartime economy. Legacy apps & data inhibit new platforms and apps NO “converged”, consumer priced, high-bandwidth, net O(10-100 Mbps) Lack of the ubiquity (i.e. telephone) on ww basis with divergent standards

18 R & D Challenges Engineering, evolutionary construction, and non-trivial maintenance of billion-node fractal nets ranging from space, continent, campus, local, … to in-body nets Increasing information flows & vast sea of data –Large disks everywhere! personal to large servers across all apps –Akin to the vast tape libraries that are never read (bit rot) A modern healthcare system that each of us would be happy or unafraid of being admitted into. Cf. islands (incompatible systems) of automation and instruments floating on a sea of paper moved around by people who maintain a bloated and inefficient “services” industry/economy.

19 Bottom Lines aka “Killer apps” for storage everywhere we look 1. “MyLifeBits” recording almost everything 2. The most cost-effective, highest volume stores: consumer & home PCs – for video. 3. Small form factor drives: pocket form factor cameras, phones, tablets, … e-books 4. Largest stores include Operating System, database, and interconnection via LANs/WANs and in the “cloud”

20 MyLifeBits, The Challenge of a 0.001-1 Petabyte lifetime PC Cyberizing everything… I’ve written, said, presented (incl. video), photos of physical objects & a few things I’ve read, heard, seen and might “want to see” on TV

21 “The PC is going to be the place where you store the information … really the center of control” Billg 1/7/2001 MyLifeBits is an “on-going” project following CyberAll to “cyberize” all of personal bits! ► Memory recall of books, CDs, communication, papers, photos, video ► Photos of physical object collections ► Elimination of all physical stores & objects ► Content source for home media: ambiance, entertainment, communication, interaction Freestyle for CDs, photos, TV content, videos Goal: to understand the 1 TByte PC: need, utility, cost, feasibility, challenge & tools.

22 Storing all we’ve read, heard, & seen
Human data-types | /hr | /day (/4yr) | /lifetime
read text, few pictures | 200 K | 2-10 M/G | 60-300 G
speech text @120wpm | 43 K | 0.5 M/G | 15 G
speech @1KBps | 3.6 M | 40 M/G | 1.2 T
stills w/voice @100KB | 200 K | 2 M/G | 60 G
video-like 50Kb/s POTS | 22 M | .25 G/T | 25 T
video 200Kb/s VHS-lite | 90 M | 1 G/T | 100 T
video 4.3Mb/s HDTV/DVD | 1.8 G | 20 G/T | 1 P
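The per-hour column for the rate-based rows follows directly from the stated bit rates. A minimal sketch, assuming decimal units (1 K = 1000) and continuous capture for the full hour; the function names are hypothetical, and the per-day and lifetime columns are not reproduced because the slide does not state how many hours per day are assumed:

```python
SECONDS_PER_HOUR = 3600


def bytes_per_hour(bits_per_second: float) -> float:
    """Storage consumed by one hour of a constant-bit-rate stream."""
    return bits_per_second / 8 * SECONDS_PER_HOUR


def human(n_bytes: float) -> str:
    """Format a byte count with decimal K/M/G/T/P suffixes, as on the slide."""
    for suffix in ("", "K", "M", "G", "T"):
        if n_bytes < 1000:
            return f"{n_bytes:.3g} {suffix}".strip()
        n_bytes /= 1000
    return f"{n_bytes:.3g} P"


if __name__ == "__main__":
    streams = {
        "speech @1KBps": 8_000,             # 1 KByte/s expressed in bits/s
        "video-like 50Kb/s POTS": 50_000,
        "video 200Kb/s VHS-lite": 200_000,
        "video 4.3Mb/s HDTV/DVD": 4_300_000,
    }
    for name, bps in streams.items():
        print(f"{name}: {human(bytes_per_hour(bps))} per hour")
```

The results land close to the table's 3.6 M, 22 M, 90 M, and 1.8 G entries; the last row comes out slightly higher (about 1.9 G), which suggests the slide rounded or used a somewhat different rate.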

23 © 2002 Scenes from Media Center

24 CyberAll Nov. 1, 2001: 27.1K files & 42K .msg, 17.7 GB. By size: Music 6.9 GB (1.8K files, 180 CDs); Working 2.3 GB (432 folders, 2.9K files); Archive 5.1 GB (477 folders, 18.7K files); Video 2.6 GB (10 hours, low res); My Books 98 MB; Mail .7 GB (43K msgs). [Charts: files by size and by number; file types .xls, .jpg, .doc/html, .pdf, .ppt/ppt albums, .tif, .gif]

25 MyLifeBits use scenarios 1. Acquire from every potentially useful source including the web, voice and instant messages 2. Personal use of MLB for work to recall everything 3. Provide ambiance & entertainment: Personal/home broadcast, CD, Internet radio, TV screen saving 4. Creation of photo and video albums Events, places, trips, people, time intervals -------------- Database land -------------------------------------- 5. Personal/web hosted collections & catalogs 6. A Person (auto- or -biography) web hosted time line Historical events by type; Personal time line Compile a life’s story about (event types, range, etc.) 7. Individual…How I spent my year. A personal diary. 8. ISBQ: Interactive Story By Query

26 A “killer app” for Terabyte, Lifetime, PC? ► MyLifeBits demonstrates need for lifetime memory! ► MODI (Microsoft Office Document Imaging)! The most significant Office™ addition since HTML. ► Technology to support the vision: 1. Guarantee that data will live forever! 2. A single index that includes mail, conversations, web accesses, and books! 3. E-book…e-magazines reach critical mass! 4. Telephony and audio capture are needed 5. Photo & video “index serving” 6. More meta-information … Office, photos 7. Lots of GUIs to improve ease-of-use

27 Copyright Gordon Bell Clusters & Grids The Clusters – GRID Era CCGSC 2002 Lyon, France September 2002

28 Same observations as 2000 GRID was/is an exciting concept … – They can/must work within a community, organization, or project. Apps need to drive. – “Necessity is the mother of invention.” Taxonomy… interesting vs necessity – Cycle scavenging and object evaluation (e.g. seti@home, QCD) – File distribution/sharing for IP theft e.g. Napster – Databases &/or programs for a community (astronomy, bioinformatics, CERN, NCAR) – Workbenches: web workflow chem, bio… – Exchanges… many sites operating together – Single, large objectified pipeline… e.g. NASA. – Grid as a cluster platform! Transparent & arbitrary access including load balancing Web SVCs X

29 Grid n j. An arbitrary distributed, cluster platform A geographical and multi-organizational collection of diverse computers dynamically configured as cluster platforms responding to arbitrary, ill-defined jobs “thrown” at it. Costs are not necessarily favorable e.g. disks are less expensive than cost to transfer data. Latency and bandwidth are non-deterministic, thereby changing cluster characteristics Once a large body of data exists for a job, it is inherently bound to (set into) fixed resources. Large datasets & I/O bound programs need to be with their data or be database accesses… But are there resources there to share?

30 Bright spots… near term, user focus, a lesson for Grid suppliers Tony Hey, head of UK scientific computing: apps-based funding versus tools-based funding. Web services based Grid & data orientation. David Abramson - Nimrod. – Parameter scans… other low hanging fruit – Encapsulate apps! “Excel”-- language/control mgmt. – “Legacy apps are programs that users just want, and there’s no time or resources to modify code …independent of age, author, or language e.g. Java.” Andrew Grimshaw - Avaki – Making Legion vision real. A reality check. Lip 4 pairs of “web services” based apps Gray et al Skyservice and Terraservice Goal: providing a web service must be as easy as publishing a web page…and will occur!!!

31 SkyServer: delivering a web service to the astronomy community. Prototype for other sciences? Gray, Szalay, et al First paper on the SkyServer http://research.microsoft.com/~gray/Papers/MSR_TR_2001_77_Virtual_Observatory.pdf http://research.microsoft.com/~gray/Papers/MSR_TR_2001_77_Virtual_Observatory.doc Later, more detailed paper for database community http://research.microsoft.com/~gray/Papers/MSR_TR_01_104_SkyServer_V1.pdf http://research.microsoft.com/~gray/Papers/MSR_TR_01_104_SkyServer_V1.doc

32 What can be learned from Sky Server? It’s about data, not about harvesting flops 1-2 hr. query programs versus 1 wk programs based on grep 10 minute runs versus 3 day compute & searches Database viewpoint. 100x speed-ups – Avoid costly re-computation and searches – Use indices and PARALLEL I/O. Read / Write >>1. – Parallelism is automatic, transparent, and just depends on the number of computers/disks. Limited experience and talent to use dbases.
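The contrast between grep-style searching and using indices can be illustrated with a toy range query. This is a minimal sketch, not SkyServer's implementation: the records, the key field, and the `SortedIndex` class are hypothetical, and a sorted in-memory list stands in for a database index.

```python
from bisect import bisect_left, bisect_right


def scan_range(records, lo, hi):
    """Grep-style search: every record is examined on every query."""
    return [r for r in records if lo <= r[0] <= hi]


class SortedIndex:
    """Toy index: sort once by key, then answer range queries by binary search."""

    def __init__(self, records):
        self._records = sorted(records, key=lambda r: r[0])
        self._keys = [r[0] for r in self._records]

    def range(self, lo, hi):
        # Two binary searches locate the slice; only matching rows are touched.
        start = bisect_left(self._keys, lo)
        end = bisect_right(self._keys, hi)
        return self._records[start:end]


if __name__ == "__main__":
    # Hypothetical (key, object id) pairs, e.g. a brightness value per object.
    catalog = [(5.0, "a"), (1.0, "b"), (3.0, "c"), (9.0, "d")]
    index = SortedIndex(catalog)
    print(index.range(2.0, 6.0))
    print(sorted(scan_range(catalog, 2.0, 6.0)))
```

Both calls return the same rows; the difference is that the scan's cost grows with the size of the whole dataset, while the indexed lookup pays the sorting cost once and then grows only with the logarithm of the dataset plus the size of the answer.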

33 Some science is hitting a wall FTP and GREP are not adequate (Jim Gray) You can GREP 1 GB in a minute You can GREP 1 TB in 2 days You can GREP 1 PB in 3 years. You can FTP 1 MB in 1 sec. You can FTP 1 GB / min. … 2 days and 1K$ … 3 years and 1M$ 1PB ~10,000 >> 1,000 disks At some point you need indices to limit search, and parallel data search and analysis Goal using dbases. Make it easy to – Publish: Record structured data – Find data anywhere in the network. Get the subset you need! – Explore datasets interactively Database becomes the file system!!!
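The scaling behind these GREP figures is plain division. A minimal sketch, assuming the 1 GB per minute rate holds at every size and that parallel disks split the work evenly (both are simplifying assumptions; the function name is hypothetical):

```python
GB = 1e9
TB = 1e12
PB = 1e15

# Sequential scan rate taken from the slide: 1 GB per minute.
SCAN_BYTES_PER_SECOND = GB / 60


def scan_seconds(size_bytes: float, disks: int = 1,
                 rate: float = SCAN_BYTES_PER_SECOND) -> float:
    """Time to read `size_bytes` end to end, spread evenly over `disks`."""
    return size_bytes / (rate * disks)


if __name__ == "__main__":
    day = 86_400
    print(f"1 GB: {scan_seconds(GB):.0f} s")
    print(f"1 TB: {scan_seconds(TB) / day:.2f} days")
    print(f"1 PB: {scan_seconds(PB) / (365 * day):.2f} years")
    print(f"1 PB on 1,000 disks: {scan_seconds(PB, disks=1000) / 3600:.1f} hours")
```

A straight extrapolation gives somewhat under a day for a terabyte and about two years for a petabyte, so the slide's 2 days and 3 years presumably allow for a lower sustained rate. Either way the conclusion is the same: a linear scan of a petabyte is only practical when spread over many disks, or when an index avoids the scan altogether.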

34 Network concerns Very high cost – $(1 + 1) / GByte to send on the net; Fedex and 160 GByte shipments are cheaper – Disks cost $1/GByte to purchase!!! – DSL at home is $0.15 - $0.30 Disks cost less than $2/GByte to purchase Low availability of fast links (last mile problem) – Labs & universities have DS3 links at most, and they are very expensive – Traffic: Instant messaging, music stealing Performance at desktop is poor – 1- 10 Mbps; very poor communication links Manage: trade-in fast links for cheap links!!

35 Gray’s $2.4 K, 1 TByte Sneakernet aka Disk Brick Courtesy of Jim Gray, Microsoft Bay Area Research Cost to move a Terabyte Cost, time, and speed to move a Terabyte Cost of a “Sneaker-Net” TB We now ship NTFS/SQL disks. Not good format for Linux. Ship NFS/CIFS/ODBC servers (not disks). Plug “disk” into LAN. DHCP then file or DB serve… Web Service in long term

36 Cost to move a Terabyte

37 Cost, time of Sneaker-net vs Alts
Media | Count | Robot $ | Media $ | TB read + write time | Ship time | Total time/TB | Mbps | Cost (10 TB) | $/TB shipped
CD | 1500 | 2x800 | 240 | 60 hrs | 24 hrs | 6 days | 28 | $2 K | $208
DVD | 200 | 2x8K | 400 | 60 hrs | 24 hrs | 6 days | 28 | $20 K | $2,000
Tape | 25 | 2x15K | 1000 | 92 hrs | 24 hrs | 5 days | 18 | $31 K | $3,100
DiskBrick | 7 | 1K | 1,400 | 19 hrs | 24 hrs | 2 days | 52 | $2.6 K | $260
Courtesy of Jim Gray, Microsoft Bay Area Research
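The Mbps column can be read as the effective bandwidth of shipping media: bits moved divided by elapsed time. A minimal sketch, assuming a decimal terabyte and that read/write time and shipping time simply add (the function name and both assumptions are mine, not the slide's):

```python
def effective_mbps(terabytes: float, read_write_hours: float,
                   ship_hours: float) -> float:
    """Effective bandwidth, in megabits per second, of physically shipping data."""
    bits = terabytes * 1e12 * 8
    seconds = (read_write_hours + ship_hours) * 3600
    return bits / seconds / 1e6


if __name__ == "__main__":
    # Hours taken from the table: read + write time, then ship time, per TB.
    for media, rw_hours, ship_hours in (("CD", 60, 24), ("DVD", 60, 24),
                                        ("Tape", 92, 24), ("DiskBrick", 19, 24)):
        print(f"{media}: {effective_mbps(1, rw_hours, ship_hours):.1f} Mbps")
```

With the disk brick's 19 hours of read/write plus 24 hours of shipping this gives about 52 Mbps, matching the table; the CD, DVD, and tape rows come out within a couple of Mbps of the listed 28 and 18.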

38 Grids: Real and “personal” Two carrots, one downside. A bet. Bell will match any Gordon Bell Prize (parallelism, performance, or performance/cost) winner’s prize that is based on “Grid Platform Technology”. I will bet any individual or set of individuals of the Grid Research community up to $5,000 that a Grid application will not win the above by SC2005.

39 Copyright Gordon Bell LANL 5/17/2002 Technical computing: Observations on an ever changing, occasionally repetitious, environment

40 A brief, simplified history of HPC 1. Sequential & data parallelism using shared memory, Cray’s Fortran computers 60-02 (US:90) 2. 1978: VAXen threaten general purpose centers… 3. NSF response: form many centers 1988 - present 4. SCI: Search for parallelism to exploit micros 85-95 5. Scalability: “bet the farm” on clusters. Users “adapt” to clusters aka multi-computers with LCD program model, MPI. >95 6. Beowulf Clusters adopt standardized hardware and Linus’s software to create a standard! >1995 7. “Do-it-yourself” Beowulfs impede new structures and threaten g.p. centers >2000 8. 1997-2002: Let’s tell NEC they aren’t “in step”. 9. High speed networking enables peer2peer computing and the Grid. Will this really work?

41 What Is the System Architecture? (GB c1990) [Diagram: labels include GRID and SIMD; several entries crossed out]

42 Processor Architectures? VECTORS OR CS View MISC >> CISC >> Language directed RISC >> Super-scalar >> Extra-Long Instruction Word Caches: mostly alleviate need for memory B/W SC Designers View RISC >> VCISC (vectors)>> Massively parallel (SIMD) (multiple pipelines) Memory B/W = perf.

43 Results from DARPA’s SCI c1983 Many research and construction efforts … virtually all new hardware efforts failed except Intel and Cray. DARPA directed purchases… screwed up the market, including the many VC funded efforts. No Software funding! Users responded to the massive power potential with LCD software. Clusters, clusters, clusters using MPI. Beowulf! It’s not scalar vs vector, it’s memory bandwidth! – 6-10 scalar processors = 1 vector unit – 16-64 scalars = a 2 – 6 processor SMP

44 Dead Supercomputer Society ACRI Alliant American Supercomputer Ametek Applied Dynamics Astronautics BBN CDC Convex Cray Computer Cray Research Culler-Harris Culler Scientific Cydrome Dana/Ardent/Stellar/Stardent Denelcor Elexsi ETA Systems Evans and Sutherland Computer Floating Point Systems Galaxy YH-1 Goodyear Aerospace MPP Gould NPL Guiltech Intel Scientific Computers International Parallel Machines Kendall Square Research Key Computer Laboratories MasPar Meiko Multiflow Myrias Numerix Prisma Tera Thinking Machines Saxpy Scientific Computer Systems (SCS) Soviet Supercomputers Supertek Supercomputer Systems Suprenum Vitesse Electronics

45 What a difference 25 years AND spending >10x makes! LLNL 150 Mflops machine room c1978 ESRDC: 40 Tflops. 640 nodes (8 - 8GFl P.vec/node)

46 Japanese Earth Simulator Spectacular results for $400M. –Year to year gain of 10x. The greatest gain since the first (1987) Gordon Bell Prize. –Performance is 10x the nearest entrant –Performance/cost is 3x the nearest entrant –RAP (real application performance) >60% Peak Other machines are typically 10% of peak. –Programming was done in HPF (Fortran) that the US research community abandoned. NCAR was right in wanting to purchase an NEC super
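The 40 Tflops peak and the gap between 60% and 10% of peak can be made concrete. A minimal sketch, assuming "8 - 8GFl P.vec/node" means eight vector processors of 8 Gflops each per node, and treating the sustained fractions as fixed ratios (the function names are hypothetical):

```python
def peak_gflops(nodes: int, procs_per_node: int, gflops_per_proc: float) -> float:
    """Aggregate peak rate of a machine built from identical nodes."""
    return nodes * procs_per_node * gflops_per_proc


def sustained_gflops(peak: float, fraction_of_peak: float) -> float:
    """Real application performance as a fixed fraction of peak."""
    return peak * fraction_of_peak


if __name__ == "__main__":
    peak = peak_gflops(nodes=640, procs_per_node=8, gflops_per_proc=8.0)
    print(f"peak: {peak / 1000:.2f} Tflops")
    print(f"at 60% of peak: {sustained_gflops(peak, 0.60) / 1000:.1f} Tflops")
    print(f"at 10% of peak: {sustained_gflops(peak, 0.10) / 1000:.1f} Tflops")
```

Under that reading the peak is about 41 Tflops, and the same hardware delivers roughly six times more to applications at 60% of peak than it would at the 10% the slide calls typical of other machines.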

47 Computer types Networked Supers… Legion Condor Beowulf NT clusters VPPuni T3E SP2 (mP) NOW NEC mP SGI DSM clusters & SGI DSM NEC super Cray X…T (all mPv) Mainframes Multis WSs PCs -------- Connectivity-------- WAN/LAN SAN DSM SM micros vector Clusters GRID & P2P Old World

48 The Challenge leading to Beowulf NASA HPCC Program begun in 1992 Comprised Computational Aero-Science and Earth and Space Science (ESS) Driven by need for post processing data manipulation and visualization of large data sets Conventional techniques imposed long user response time and shared resource contention Cost low enough for dedicated single-user platform Requirement: – 1 Gflops peak, 10 Gbyte, < $50K Commercial systems: $1000/Mflops or $1M/Gflops

49 Innovation The Virtuous Economic Cycle drives the PC industry… & Beowulf Volume Competition Standards Utility/value DOJ Greater availability @ lower cost Creates apps, tools, training, Attracts users Attracts suppliers

50 Lessons from Beowulf An experiment in parallel computing systems Established vision- low cost high end computing Demonstrated effectiveness of PC clusters for some (not all) classes of applications Provided networking software Provided cluster management tools Conveyed findings to broad community Tutorials and the book Provided design standard to rally community! Standards beget: books, trained people, software … virtuous cycle that allowed apps to form Industry begins to form beyond a research project Courtesy, Thomas Sterling, Caltech.

51 Clusters: Next Steps Scalability… They can exist at all levels: personal, group, … centers Clusters challenge centers… given that smaller users get small clusters

52 Computing in small spaces @ LANL (RLX cluster in building with NO A/C) 240 processors @2/3 GFlops Fill the 4 racks -- gives a Teraflops

53

54 Internet II concerns given $0.5B cost Very high cost – $(1 + 1) / GByte to send on the net; Fedex and 160 GByte shipments are cheaper – DSL at home is $0.15 - $0.30 Disks cost $1/GByte to purchase! Low availability of fast links (last mile problem) – Labs & universities have DS3 links at most, and they are very expensive – Traffic: Instant messaging, music stealing Performance at desktop is poor – 1- 10 Mbps; very poor communication links

55 Scalable computing: the effects They come in all sizes; incremental growth 10 or 100 to 10,000 (100X for most users) debug vs run; problem growth Allows compatibility heretofore impossible 1978: VAX chose Cray Fortran 1987: The NSF centers went to UNIX Users chose sensible environment – Acquisition and operational costs & environments – Cost to use as measured by user’s time The role of gp centers e.g. NSF, state x is unclear. Necessity for support? – Scientific Data for a given community… – Community programs and data – Manage GRID discipline Are clusters ≈ Gresham’s Law? Drive out alts.

56 The end

