Problem is to compute: f(latitude, longitude, elevation, time)  temperature, pressure, humidity, wind velocity Approach: –Discretize the.

Slides:



Advertisements
Similar presentations
Parallel Programming and Algorithms : A Primer Kishore Kothapalli IIIT-H Workshop on Multi-core Technologies International Institute.
Advertisements

Slides Prepared from the CI-Tutor Courses at NCSA By S. Masoud Sadjadi School of Computing and Information Sciences Florida.
Multiprocessors— Large vs. Small Scale Multiprocessors— Large vs. Small Scale.
Lecture 6: Multicore Systems
Computer Architecture and the Fetch-Execute Cycle Parallel Processor Systems.
COE 502 / CSE 661 Parallel and Vector Architectures Prof. Muhamed Mudawar Computer Engineering Department King Fahd University of Petroleum and Minerals.
Higher Computing: Unit 1: Topic 3 – Computer Performance St Andrew’s High School, Computing Department Higher Computing Topic 3 Computer Performance.
Development of a track trigger based on parallel architectures Felice Pantaleo PH-CMG-CO (University of Hamburg) Felice Pantaleo PH-CMG-CO (University.
Climate modeling Current state of climate knowledge – What does the historical data (temperature, CO 2, etc) tell us – What are trends in the current observational.
GPUs. An enlarging peak performance advantage: –Calculation: 1 TFLOPS vs. 100 GFLOPS –Memory Bandwidth: GB/s vs GB/s –GPU in every PC and.
ECE669 L1: Course Introduction January 29, 2004 ECE 669 Parallel Computer Architecture Lecture 1 Course Introduction Prof. Russell Tessier Department of.
Instruction Level Parallelism (ILP) Colin Stevens.
Computational Astrophysics: Methodology 1.Identify astrophysical problem 2.Write down corresponding equations 3.Identify numerical algorithm 4.Find a computer.
Introduction What is Parallel Algorithms? Why Parallel Algorithms? Evolution and Convergence of Parallel Algorithms Fundamental Design Issues.
09/22/2008CS49601 CS4960: Parallel Programming Guest Lecture: Parallel Programming for Scientific Computing Mary Hall September 22, 2008.
Csci4203/ece43631 Review Quiz. 1)It is less expensive 2)It is usually faster 3)Its average CPI is smaller 4)It allows a faster clock rate 5)It has a simpler.
A History of Modern Weather Forecasting. The Stone Age Prior to approximately 1955, forecasting was basically a subjective art, and not very skillful.
Why Parallel Architecture? Todd C. Mowry CS 495 January 15, 2002.
Parallel Algorithms - Introduction Advanced Algorithms & Data Structures Lecture Theme 11 Prof. Dr. Th. Ottmann Summer Semester 2006.
Lecture 1: Introduction to High Performance Computing.
CMSC 611: Advanced Computer Architecture Parallel Computation Most slides adapted from David Patterson. Some from Mohomed Younis.
CSE 260 Parallel Computation Allan Snavely, Henri Casanova
08/21/2012CS4230 CS4230 Parallel Programming Lecture 1: Introduction Mary Hall August 21,
1 Parallel computing and its recent topics. 2 Outline 1. Introduction of parallel processing (1)What is parallel processing (2)Classification of parallel.
University of Veszprém Department of Image Processing and Neurocomputing Emulated Digital CNN-UM Implementation of a 3-dimensional Ocean Model on FPGAs.
ICT in Weather Forecasting
Higher Computing Computer Systems S. McCrossan 1 Higher Grade Computing Studies 3. Computer Performance Measures of Processor Speed When comparing one.
1 The Performance Potential for Single Application Heterogeneous Systems Henry Wong* and Tor M. Aamodt § *University of Toronto § University of British.
Parallel Computing Laxmikant Kale
Seaborg Cerise Wuthrich CMPS Seaborg  Manufactured by IBM  Distributed Memory Parallel Supercomputer  Based on IBM’s SP RS/6000 Architecture.
ECE 568: Modern Comp. Architectures and Intro to Parallel Processing Fall 2006 Ahmed Louri ECE Department.
20.5 Forecasting Weather Objectives
Sept COMP60611 Fundamentals of Concurrency Lab Exercise 2 Notes Notes on the finite difference performance model example – for the lab… Graham Riley,
M U N -March 10, Phil Bording1 Computer Engineering of Wave Machines for Seismic Modeling and Seismic Migration R. Phillip Bording March 10, 2005.
ECE 569: High-Performance Computing: Architectures, Algorithms and Technologies Spring 2006 Ahmed Louri ECE Department.
Introduction to Research 2011 Introduction to Research 2011 Ashok Srinivasan Florida State University Images from ORNL, IBM, NVIDIA.
IBM RS 6000 Greg Young, Nathan Einsig, Rich Zizik, Ben Noble.
Classic Model of Parallel Processing
Computing Environment The computing environment rapidly evolving ‑ you need to know not only the methods, but also How and when to apply them, Which computers.
CS591x -Cluster Computing and Parallel Programming
Morgan Kaufmann Publishers
© Wen-mei Hwu and S. J. Patel, 2005 ECE 511, University of Illinois Lecture 4: Microarchitecture: Overview and General Trends.
INEL6067 Technology ---> Limitations & Opportunities Wires -Area -Propagation speed Clock Power VLSI -I/O pin limitations -Chip area -Chip crossing delay.
Exascale climate modeling 24th International Conference on Parallel Architectures and Compilation Techniques October 18, 2015 Michael F. Wehner Lawrence.
Anton, a Special-Purpose Machine for Molecular Dynamics Simulation By David E. Shaw et al Presented by Bob Koutsoyannis.
CS433 Spring 2001 Introduction Laxmikant Kale. 2 Course objectives and outline You will learn about: –Parallel programming models Emphasis on 3: message.
Hybrid Multi-Core Architecture for Boosting Single-Threaded Performance Presented by: Peyman Nov 2007.
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-1.
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 3.
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
Using Cyberinfrastructure to Study the Earth’s Climate and Air Quality Don Wuebbles Department of Atmospheric Sciences University of Illinois, Urbana-Champaign.
ALPHA 21164PC. Alpha 21164PC High-performance alternative to a Windows NT Personal Computer.
Feeding Parallel Machines – Any Silver Bullets? Novica Nosović ETF Sarajevo 8th Workshop “Software Engineering Education and Reverse Engineering” Durres,
Pentium 4 Deeply pipelined processor supporting multiple issue with speculation and multi-threading 2004 version: 31 clock cycles from fetch to retire,
Weather.
CMSC 611: Advanced Computer Architecture
Technology on the Cutting Edge of Weather Research and Forecasting
5.2 Eleven Advanced Optimizations of Cache Performance
Parallel Computers.
Superscalar Pipelines Part 2
Levels of Parallelism within a Single Processor
What is a Climate Model?.
Patterns in environmental quality and sustainability
Coe818 Advanced Computer Architecture
CS 258 Parallel Computer Architecture
What is a Climate Model?.
Weather and Climate.
Levels of Parallelism within a Single Processor
Weather.
Husky Energy Chair in Oil and Gas Research
Presentation transcript:

Problem is to compute: f(latitude, longitude, elevation, time)  temperature, pressure, humidity, wind velocity Approach: –Discretize the domain, e.g., a measurement point every 10 km –Devise an algorithm to predict weather at time t+1 given t Source: Uses: -Predict major events, e.g., El Nino -Use in setting air emissions standards

Weather Forecasting An accurate long-range forecast requires huge amounts of cells and hence computations. Case Study: Global Climate Modelling  earth’s surface is approximately 5 x 10 8 km 2  Considering one cell per square km with 15 levels ( ground level up to 14 km high )  6 data values ( update once every minute ) : humidity, temperature, wind, latitude, longitude and height Throughput: 3 Gigabytes of data per second

One piece is modeling the fluid flow in the atmosphere –Solve Navier-Stokes problem Roughly 100 Flops per grid point with 1 minute timestep Computational requirements: –To match real-time, need 5x flops in 60 seconds = 8 Gflop/s –Weather prediction (7 days in 24 hours)  56 Gflop/s –Climate prediction (50 years in 30 days)  4.8 Tflop/s –To use in policy negotiations (50 years in 12 hours)  288 Tflop/s To double the grid resolution, computation is at least 8x State of the art models require integration of atmosphere, ocean, sea-ice, land models, plus possibly carbon cycle, geochemistry and more Current models are coarser than this Global Climate Modeling Computation

Weather Forecasting Computer Visualisation of a Hurricane

High Resolution Climate Modeling on NERSC-3 – P. Duffy, et al., LLNL

Protein Folding  One of the major challenges in molecular biology.  Proteins perform over a thousand different jobs. ( As enzymes they accelerate reactions. They also carry oxygen and antibodies to fight disease. )  Before proteins can go to work they must fold into the correct shape. ( The string of amino acids in the protein twist and fold to form the final protein ) Scientists are using supercomputers to discover the rules that describe why a string of amino acids folds into a particular protein.

Protein Folding Researchers at the Pittsburgh Computing Center tracked the folding of a small protein (300 amino acids) in water ( ~ atoms ). Folding time: 1 millisecond #FLOPS required: 3 x With a PetaFLOP computer the simulation would take a year. IBM are funding a $100M project called The Blue Gene Project to build a 1 PetaFLOP/s computer ( PetaScale Computing – Flops/s)

The Production of Toy Story 140,000 frames rendered for full-length feature film. 10,000 seconds required to render each frame. ~ operations Operations were distributed over dozens of Sun workstations, ~ 10 MIPS ( millions of instructions per second ) per Sun.

What is Parallel Architecture? A parallel computer is a collection of processing elements that cooperate to solve large problems fast Some broad issues: –Resource Allocation: how large a collection? how powerful are the elements? how much memory? –Data access, Communication and Synchronization how do the elements cooperate and communicate? how are data transmitted between processors? what are the abstractions and primitives for cooperation? –Performance and Scalability how does it all translate into performance? how does it scale?

Parallelism: Provides alternative to faster clock for performance Applies at all levels of system design Is a fascinating perspective from which to view architecture Is increasingly central in information processing Why Study Parallel Architecture?

Architectural Trends Greatest trend in VLSI generation is increase in parallelism –Up to 1985: bit level parallelism: 4-bit -> 8 bit -> 16-bit slows after 32 bit adoption of 64-bit now under way, 128-bit far (not performance issue) great inflection point when 32-bit micro and cache fit on a chip –Mid 80s to mid 90s: instruction level parallelism pipelining and simple instruction sets, + compiler advances (RISC) on-chip caches and functional units => superscalar execution greater sophistication: out of order execution, speculation, prediction –to deal with control transfer and latency problems –Next step: thread level parallelism

How far will ILP go? Infinite resources and fetch bandwidth, perfect branch prediction and renaming –real caches and non-zero miss latencies