1 Dr. Frederica Darema Senior Science and Technology Advisor NSF Future Parallel Computing Systems – what to remember from the past RAMP Workshop FCRC.

Slides:



Advertisements
Similar presentations
Building a CFD Grid Over ThaiGrid Infrastructure Putchong Uthayopas, Ph.D Department of Computer Engineering, Faculty of Engineering, Kasetsart University,
Advertisements

DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Technology Drivers Traditional HPC application drivers – OS noise, resource monitoring and management, memory footprint – Complexity of resources to be.
Priority Research Direction (I/O Models, Abstractions and Software) Key challenges What will you do to address the challenges? – Develop newer I/O models.
Clouds C. Vuerli Contributed by Zsolt Nemeth. As it started.
Presented by: Thabet Kacem Spring Outline Contributions Introduction Proposed Approach Related Work Reconception of ADLs XTEAM Tool Chain Discussion.
Chapter 13 Embedded Systems Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles,
OCIN Workshop Wrapup Bill Dally. Thanks To Funding –NSF - Timothy Pinkston, Federica Darema, Mike Foster –UC Discovery Program Organization –Jane Klickman,
Chapter 13 Embedded Systems
Figure 1.1 Interaction between applications and the operating system.
Strategic Directions in Real- Time & Embedded Systems Aatash Patel 18 th September, 2001.
Workload Management Massimo Sgaravatto INFN Padova.
Research Directions for On-chip Network Microarchitectures Luca Carloni, Steve Keckler, Robert Mullins, Vijay Narayanan, Steve Reinhardt, Michael Taylor.
1 Dr. Frederica Darema Senior Science and Technology Advisor NSF Research and Technology Advances in Systems Software for Emerging Computer Systems EDGE.
Educational Computer Architecture Experimentation Tool Dr. Abdelhafid Bouhraoua.
Instrumentation and Measurement CSci 599 Class Presentation Shreyans Mehta.
COM181 Computer Hardware Ian McCrumRoom 5B18,
Efficient Hardware dependant Software (HdS) Generation using SW Development Platforms Frédéric ROUSSEAU CASTNESS‘07 Computer Architectures and Software.
Advances in Language Design
Darema Dr. Frederica Darema NSF Dynamic Data Driven Application Systems (Symbiotic Measurement&Simulation Systems) “A new paradigm for application simulations.
Priority Research Direction Key challenges Fault oblivious, Error tolerant software Hybrid and hierarchical based algorithms (eg linear algebra split across.
OpenMP in a Heterogeneous World Ayodunni Aribuki Advisor: Dr. Barbara Chapman HPCTools Group University of Houston.
4.x Performance Technology drivers – Exascale systems will consist of complex configurations with a huge number of potentially heterogeneous components.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
Domestic Nuclear Detection Office (DNDO) NITRD Workshop What are the Biggest Opportunities in Networking Problem? Sept. 20, 2012 Timothy Ashenfelter, PhD.
CASTNESS‘11 Computer Architectures and Software Tools for Numerical Embedded Scalable Systems Workshop & School: Roma January 17-18th 2011 Frédéric ROUSSEAU.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
 What is an operating system? What is an operating system?  Where does the OS fit in? Where does the OS fit in?  Services provided by an OS Services.
Database System Concepts and Architecture
Computer Science Open Research Questions Adversary models –Define/Formalize adversary models Need to incorporate characteristics of new technologies and.
Composing Adaptive Software Authors Philip K. McKinley, Seyed Masoud Sadjadi, Eric P. Kasten, Betty H.C. Cheng Presented by Ana Rodriguez June 21, 2006.
Programming Models & Runtime Systems Breakout Report MICS PI Meeting, June 27, 2002.
A Proposal of Application Failure Detection and Recovery in the Grid Marian Bubak 1,2, Tomasz Szepieniec 2, Marcin Radecki 2 1 Institute of Computer Science,
High Performance Embedded Computing © 2007 Elsevier Lecture 3: Design Methodologies Embedded Computing Systems Mikko Lipasti, adapted from M. Schulte Based.
High Performance Embedded Computing © 2007 Elsevier Chapter 1, part 2: Embedded Computing High Performance Embedded Computing Wayne Wolf.
4.2.1 Programming Models Technology drivers – Node count, scale of parallelism within the node – Heterogeneity – Complex memory hierarchies – Failure rates.
Heterogeneous Multikernel OS Yauhen Klimiankou BSUIR
The Globus Project: A Status Report Ian Foster Carl Kesselman
Types of Operating Systems
COMPUTER ORGANIZATIONS CSNB123. COMPUTER ORGANIZATIONS CSNB123 Why do you need to study computer organization and architecture? Computer science and IT.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
Computers Operating System Essentials. Operating Systems PROGRAM HARDWARE OPERATING SYSTEM.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
MILAN: Technical Overview October 2, 2002 Akos Ledeczi MILAN Workshop Institute for Software Integrated.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
A few issues on the design of future multicores André Seznec IRISA/INRIA.
Memory Hierarchy Adaptivity An Architectural Perspective Alex Veidenbaum AMRM Project sponsored by DARPA/ITO.
Types of Operating Systems 1 Computer Engineering Department Distributed Systems Course Assoc. Prof. Dr. Ahmet Sayar Kocaeli University - Fall 2015.
CPSC 171 Introduction to Computer Science System Software and Virtual Machines.
CSC Multiprocessor Programming, Spring, 2012 Chapter 11 – Performance and Scalability Dr. Dale E. Parson, week 12.
Programmability Hiroshi Nakashima Thomas Sterling.
Distributed Computing Systems CSCI 6900/4900. Review Distributed system –A collection of independent computers that appears to its users as a single coherent.
Hybrid Multi-Core Architecture for Boosting Single-Threaded Performance Presented by: Peyman Nov 2007.
What’s Ahead for Embedded Software? (Wed) Gilsoo Kim
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
1 Reconfigurable Environment for Analysis and Test of Software Systems Sam Martin REATSS.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Addressing Data Compatibility on Programmable Network Platforms Ada Gavrilovska, Karsten Schwan College of Computing Georgia Tech.
Course Book Course Objective - The student will be able to describe various operating system concepts as they are applied to memory, process, file system.
Resource Optimization for Publisher/Subscriber-based Avionics Systems Institute for Software Integrated Systems Vanderbilt University Nashville, Tennessee.
Decisive Themes, July, JL-1 ARTEMIS Decisive Theme for Integrasys Pedro A. Ruiz Integrasys July, 2011.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Workload Management Workpackage
Dynamic Data Driven Application Systems
Structural Simulation Toolkit / Gem5 Integration
Chapter 18 MobileApp Design
Many-core Software Development Platforms
The Extensible Tool-chain for Evaluation of Architectural Models
Dynamic Data Driven Application Systems
Q: What Does the Future Hold for “Parallel” Languages?
Presentation transcript:

1 Dr. Frederica Darema Senior Science and Technology Advisor NSF Future Parallel Computing Systems – what to remember from the past RAMP Workshop FCRC 2007

2 Overview Systems Modeling and Analysis is important –Design cycle and support the runtime –measurements an important (static and runtime) –performance, dependability, reliability, power management, security, …. Multicore systems –New opportunities and something from the past –What we learned from shared memory systems of the early-mid80’s

3 Authenication / Authorization Fault Recovery Services Distributed Systems Management Distributed, Heterogeneous, Dynamic, Adaptive Computing Platforms and Networks Device Technology... CPU Technology Visualization Scalable I/O Data Management Archiving/Retrieval Services Collaboration Environments Distributed Applications Memory Technology Two Components of CSR Program: * (AES) Advanced Execution Systems * (SMA) System Modeling and Analysis Prog.Models Libraries Tools Compilers Advanced Execution Systems Parallel and Distributed Operating Systems System Modeling and Analysis

4 System Modeling and Analysis (SMA) Seeks to develop methods and tools for modeling, measuring, analyzing, evaluating, and predicting the performance and correctness of complex computing and communications systems SMA emphasizes the development of methods and tools for modeling, measuring, analyzing, evaluating, and predicting the performance and correctness of complex computing and communications systems Topics of Interest Hardware and Software modeling –methods tools and measurements, providing multimodal, hierarchical or multilevel modeling and analysis capabilities of such systems; –methods that describe components of the system, but also the system as a total, and enable assessment of the effects of individual hardware and software layers and components of these systems; –ability to describe the system in multiple levels of detail (characteristics and time-scales); –combine different methods of describing components and layers

5 System Modeling and Analysis (SMA) Topics of Interest (cont’d) Novel modeling and measurement approaches –Develop capabilities to describe, analyze and predict the behavior of the components as well as the systems; Analysis and prediction due to changes in the application, system software, hardware; multilevel approaches and multi-modal approaches Performance Frameworks –combine tools in “plug-and-play” fashion –multiple views of the system Emphasis on Multidisciplinary Research (across sub-areas of CS)

6 Multiple views of the system The applications’ view Authenication/ Authorization Dependability Services Distributed Systems Management Distributed, Heterogeneous, Dynamic, Adaptive Computing Platforms and Networks Device Technology... CPU Technology Visualization Scalable I/O Data Management Archiving/Retrieval Services Other Services... Collaboration Environments Distributed Applications Memory Technology Application Models OS Scheduler Models Architecture / Network Models Memory Models IO / File Models... Languages Libraries Tools Compilers

7 Authenication/ Authorization Dependability Services Distributed Systems Management Visualization Scalable I/O Data Management Archiving/Retrieval Services Other Services... Collaboration Environments Distributed Applications Distributed, Heterogeneous, Dynamic, Adaptive Computing Platforms and Networks Device Technology... CPU Technology Memory Technology Application Models Architecture / Network Models Memory Models OS Scheduler Models IO / File Models... Languages Libraries Tools Compilers Multiple views of the system The Operating Systems’ view

8 Dynamically Link & Execute The NGS Program developsTechnology for integrated feedback & control Runtime Compiling System (RCS) and Dynamic Application Composition Application Model Application Program Application Intermediate Representation Compiler Front-End Compiler Back-End Performance Measuremetns & Models Distributed Programming Model Application Components & Frameworks Dynamic Analysis Situation Launch Application (s) Distributed Platform Adaptable computing Systems Infrastructure Distributed Computing Resources MPPNOW SAR tac-com data base fire cntl fire cntl alg accelerator data base SP ….

9 Advanced Execution Systems (AES) (a component of the Computer Systems Research Program) (CSR Program) Seeks to create systems software to facilitate the development and runtime support of complex applications executing on large, heterogeneous high-end computing and grid platforms AES emphasizes runtime compiling systems and application composition systems interface with the underlying operating systems services and incorporating systems modeling and analysis methods and tools. Topics of Interest Novel Compiler Technology that go beyond the standard static notion of a compiler –for example by embedding a portion of the compiler in the runtime and endowing the system with resource awareness and adaptive mapping capabilities; –new compiler techniques for determining functional and data dependencies across multiple levels of memory hierarchy and across platforms; –mechanisms for matching an application’s resource needs to underlying resources when both are changing as the application executes

10 Advanced Execution Systems (AES) Topics of Interest Programming models and tools –expressing application partitioning across distributed, heterogeneous computing platforms; application-level checkpointing and recovery Application composition system (ACS) technology –constructing applications to fit the available resources and to adapt to changes in the underlying execution environment; –methods for automatically selecting application components; –creating knowledge bases for application components; interfacing with the underlying computing platform models to determine suitable application components; –and developing appropriate application component libraries and interfaces so the run-time portion of the RCS can link to such libraries.

A number of projects are developing: systems modeling methods and performance frameworks & runtime-compiler and application composition systems Projects can be found in the proceedings of the Next Generation Software (Workshop Series & PI-Meetings) I organize every year in conjunction with IPDPS

The RAMP effort is very important in enabling the modeling capabilities needed for present and future computer systems and environments Can lead to a valuable standard Performance Framework usable by the broader academic community and industry

13 With respect to the Emerging Multicore Systems: Things to remember from the past Coherent caches – issues of scalability to large numbers of processors On-chip networks: –Bus-based communication – contention - & scalability issues –Cross-bars - can have contention for 100s procs (let alone 100Ks of procs ) –Consider multistage switches –Consider combining networks Test&Set/Compare& Swap - Fetch&op (e.g Fetch&Add) for synchronization Shared data –Synchronization (place in a “fast” shared memory - cache) –Other application shared data (place in shared memory – stage in cache) –Past experiments showed that even if 99% of data are shared – accesses to the shared data were ~20% Memory wall - remains challenge; reducing latency remains challenge –Hardware – pipelining –Software – compilers/prefetching Temporary cacheable shared data (compiler enabled) Performance Tools - Parallel tracing – parallel simulations