Multidimensional Performance Modeling for Advanced Embedded Signal Processors Michael Stebnisky Carl Hein Lockheed Martin Advanced Technology Laboratories.

Slides:



Advertisements
Similar presentations
System-level Architectur Modeling for Power Aware Computing Dexin Li.
Advertisements

Optimization of Parallel Task Execution on the Adaptive Reconfigurable Group Organized Computing System Presenter: Lev Kirischian Department of Electrical.
ECOE 560 Design Methodologies and Tools for Software/Hardware Systems Spring 2004 Serdar Taşıran.
1 SECURE-PARTIAL RECONFIGURATION OF FPGAs MSc.Fisnik KRAJA Computer Engineering Department, Faculty Of Information Technology, Polytechnic University of.
*time Optimization Heiko, Diego, Thomas, Kevin, Andreas, Jens.
4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
MCTS GUIDE TO MICROSOFT WINDOWS 7 Chapter 10 Performance Tuning.
Presented by: Thabet Kacem Spring Outline Contributions Introduction Proposed Approach Related Work Reconception of ADLs XTEAM Tool Chain Discussion.
MotoHawk Training Model-Based Design of Embedded Systems.
SSP Re-hosting System Development: CLBM Overview and Module Recognition SSP Team Department of ECE Stevens Institute of Technology Presented by Hongbing.
1 HW/SW Partitioning Embedded Systems Design. 2 Hardware/Software Codesign “Exploration of the system design space formed by combinations of hardware.
High-level System Modeling and Power Management Techniques Jinfeng Liu Dept. of ECE, UC Irvine Sep
CS244-Introduction to Embedded Systems and Ubiquitous Computing Instructor: Eli Bozorgzadeh Computer Science Department UC Irvine Winter 2010.
Behavioral Design Outline –Design Specification –Behavioral Design –Behavioral Specification –Hardware Description Languages –Behavioral Simulation –Behavioral.
1 Dr. Frederica Darema Senior Science and Technology Advisor NSF Future Parallel Computing Systems – what to remember from the past RAMP Workshop FCRC.
Agent-Based Acceptability-Oriented Computing International Symposium on Software Reliability Engineering Fast Abstract by Shana Hyvat.
November 18, 2004 Embedded System Design Flow Arkadeb Ghosal Alessandro Pinto Daniele Gasperini Alberto Sangiovanni-Vincentelli
1 Platform-Based Design A paper by Alberto Sangiovanni-Vincentelli EE 249, 11/5/2002 Presenter: Mel Tsai.
Research Directions for On-chip Network Microarchitectures Luca Carloni, Steve Keckler, Robert Mullins, Vijay Narayanan, Steve Reinhardt, Michael Taylor.
1 FM Overview of Adaptation. 2 FM RAPIDware: Component-Based Design of Adaptive and Dependable Middleware Project Investigators: Philip McKinley, Kurt.
5/24/011 Advanced Tool Integration for Embedded Systems Assurance Insup Lee Department of Computer and Information Science University of Pennsylvania.
System Integration Management (SIM)
1 A survey on Reconfigurable Computing for Signal Processing Applications Anne Pratoomtong Spring2002.
(1) Introduction © Sudhakar Yalamanchili, Georgia Institute of Technology, 2006.
C OLUMBIA U NIVERSITY Lightwave Research Laboratory Embedding Real-Time Substrate Measurements for Cross-Layer Communications Caroline Lai, Franz Fidler,
Applied Transportation Analysis ITS Application SCATS.
S/W Project Management Software Process Models. Objectives To understand  Software process and process models, including the main characteristics of.
MCTS Guide to Microsoft Windows 7
Tufts Wireless Laboratory School Of Engineering Tufts University “Network QoS Management in Cyber-Physical Systems” Nicole Ng 9/16/20151 by Feng Xia, Longhua.
PCA READAPT V&V DARPATech Exhibit Demo Flight Viewer & CSIM PCA Model Screenshots Lockheed Martin Advanced Technology Laboratories Distributed Processing.
Section 10: Advanced Topics 1 M. Balakrishnan Dept. of Comp. Sci. & Engg. I.I.T. Delhi.
Programming Models & Runtime Systems Breakout Report MICS PI Meeting, June 27, 2002.
1 Embedded Distributed Real-Time Resource Management Carl Hein, Aron Rubin Lockheed Martin Advanced Technology Lab Cherry Hill NJ September 25, 2003 HPEC.
High Performance Embedded Computing © 2007 Elsevier Lecture 3: Design Methodologies Embedded Computing Systems Mikko Lipasti, adapted from M. Schulte Based.
DELMIA DPM Assembly This is the Master “Presentation title” page. Type the title of your presentation in the "Presentation title” field. Cette page est.
High Performance Embedded Computing © 2007 Elsevier Chapter 1, part 2: Embedded Computing High Performance Embedded Computing Wayne Wolf.
Framework for the Development and Testing of Dependable and Safety-Critical Systems IKTA 065/ Supported by the Information and Communication.
Page 1 Reconfigurable Communications Processor Principal Investigator: Chris Papachristou Task Number: NAG Electrical Engineering & Computer Science.
Content The system development life cycle
© 2012 xtUML.org Bill Chown – Mentor Graphics Model Driven Engineering.
Polymorphous Computing Architectures Run-time Environment And Design Application for Polymorphous Technology Verification & Validation (READAPT V&V) Lockheed.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
CS244-Introduction to Embedded Systems and Ubiquitous Computing Instructor: Eli Bozorgzadeh Computer Science Department UC Irvine Winter 2010.
Boost Verification Results by Bridging the Hw/Sw Testbench Gap by Matthew Ballance Verification Technologist Mentor Graphics.
DARPA ITO/MARS Project Update Vanderbilt University A Software Architecture and Tools for Autonomous Robots that Learn on Mission K. Kawamura, M. Wilkes,
MILAN: Technical Overview October 2, 2002 Akos Ledeczi MILAN Workshop Institute for Software Integrated.
MAPLD 2005/254C. Papachristou 1 Reconfigurable and Evolvable Hardware Fabric Chris Papachristou, Frank Wolff Robert Ewing Electrical Engineering & Computer.
Network UAV C3 Stage 1 Final Briefing Timothy X Brown University of Colorado at Boulder Interdisciplinary Telecommunications Program Electrical and Computer.
6. A PPLICATION MAPPING 6.3 HW/SW partitioning 6.4 Mapping to heterogeneous multi-processors 1 6. Application mapping (part 2)
Presentation by Tom Hummel OverSoC: A Framework for the Exploration of RTOS for RSoC Platforms.
Cognitive Radio: Next Generation Communication System
R ECONFIGURABLE SECURITY SUPPORT FOR EMBEDDED SYSTEMS 1 AKSHATA VARDHARAJ.
Requirements of an ITS/Simulation Interoperability Standard (I/SIS) Presented by:Dick Stottler Stottler Henke Associates, Inc.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Gedae, Inc. Gedae: Auto Coding to a Virtual Machine Authors: William I. Lundgren, Kerry B. Barnes, James W. Steed HPEC 2004.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Copyright 2002 Prentice-Hall, Inc. Chapter 4 Automated Tools for Systems Development 4.1 Modern Systems Analysis and Design.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Computer Simulation of Networks ECE/CSC 777: Telecommunications Network Design Fall, 2013, Rudra Dutta.
System Architecture Directions for Networked Sensors.
1 Reconfigurable Environment for Analysis and Test of Software Systems Sam Martin REATSS.
Hardware/Software Co-Design of Complex Embedded System NIKOLAOS S. VOROS, LUIS SANCHES, ALEJANDRO ALONSO, ALEXIOS N. BIRBAS, MICHAEL BIRBAS, AHMED JERRAYA.
MIT Lincoln Laboratory Dynamic Declarative Networking Exploiting Declarative Knowledge To Enable Energy Efficient Collaborative Sensing Daniel J. Van Hook.
POLITECNICO DI MILANO A SystemC-based methodology for the simulation of dynamically reconfigurable embedded systems Dynamic Reconfigurability in Embedded.
Resource Optimization for Publisher/Subscriber-based Avionics Systems Institute for Software Integrated Systems Vanderbilt University Nashville, Tennessee.
Fermilab Scientific Computing Division Fermi National Accelerator Laboratory, Batavia, Illinois, USA. Off-the-Shelf Hardware and Software DAQ Performance.
FPGA: Real needs and limits
User Interference Effect on Routing of Cognitive Radio Ad-Hoc Networks
HIGH LEVEL SYNTHESIS.
Methodology for run-time detection of requirement violations
Presentation transcript:

Multidimensional Performance Modeling for Advanced Embedded Signal Processors Michael Stebnisky Carl Hein Lockheed Martin Advanced Technology Laboratories 1 Federal Street A&E Building 2W Camden, New Jersey Systems Integration

2 Multidimensional Performance Modeling Problem:Problem: —Traditional performance modeling approaches \are unable to address emerging requirements and component technologies. This is a result of an increased awareness and need for dynamically adaptive or reconfigurable systems, particularly in the area of power dissipation/performance. Goal(s)/Objectives(s)Goal(s)/Objectives(s) —Define methods/algorithms to acurately model and optimize reconfigurable architectures and functions (services) required to support multidimensional performance modeling. —Apply ideas developed from InfoPad, ACS, PAC/C, DARES, PCA, and MSP to develop a uniqe new rapid prototyping/optimization capability. ApproachApproach —Define features required to support accurate performance and multidimensional modeling and optimization of DRAs. —Evaluate algorithms/methods for performing intelligent, reactive dynamic scheduling. —Evaluate algorithms/methods for performing offline analysis, data reduction, pattern recognition, and execution planning. DoD missions/systems require new approaches/ tools to exploit emerging reconfigurable technologies to form polymorphous/power aware systems.

3 A Methodology for Verifiable, Validatable Polymorphic Architectures DRA Optimization Process Flow Runtime System Model Parameterized Resource Executables Offline Optimiz ation: Speed Latency Power Energy QoS... DRA Object Library Runtime Optim- ization: Speed Latency Power Energy QoS... Application Flowgraphs Preprocessed Scheduling Information Schedule/ Component Analysis/ Feedback to Optimization Characteristics Estimates Local Designs COTS Testcase/ Scenario Testbench Dynamically Reconfigurable Architectures Based upon a two phase optimization process - offline and onlineBased upon a two phase optimization process - offline and online Offline optimizationOffline optimization —Overall goal: perform component selection, pruning, data extraction and sensitivity analysis that will maximize the effectiveness of the online optimization —Selects an optimum set of verified, validated components from existing libraries —Facilitates analysis to identify potential implementations more optimal to the application —Identifies dependencies/relationships in the data flow graph that will facilitate online scheduling Online optimizationOnline optimization —Components are selected for execution at runtime based upon dynamic figures of merit —Figures of merit are complex functions of component characteristics

4 Key Aspects of Approach Dealing with overwhelming flexibilityDealing with overwhelming flexibility —Management of complexity and methods of abstraction/simplification Two stage optimizationTwo stage optimization —Offline (component and subsystem) and online (scheduling) Methodology for generating/optimizing/ representing components and reconfigurable devicesMethodology for generating/optimizing/ representing components and reconfigurable devices —Develop operating points, analyze, improve, and abstract Dynamic scheduling with multidimensional goals/constraintsDynamic scheduling with multidimensional goals/constraints —“Toolkit” approach using offline extracted information Offline information extraction and optimization of schedulingOffline information extraction and optimization of scheduling —Analysis/characterization/abstraction of tasks/graphs Constraint/goal simplification using modesConstraint/goal simplification using modes —Abstraction method minimizing computation and enforcing interface s Reuse of existing capability to perform the required analysis and data visualizationReuse of existing capability to perform the required analysis and data visualization —Use the internally developed CSIM

5 Graphical Hierarchical Hardware Architecture Definition Graphical Hierarchical Software Architecture Definition Produces Animated and Static Simulation Timeline Displays as well as processor, communication, and memory statistical data Detailed information on CSIM can be found at: CSIM — HW/SW Virtual Prototyping Capabilities and Features CSIM has demonstrated the ability to model complex signal processor network behavior and performanceCSIM has demonstrated the ability to model complex signal processor network behavior and performance CSIM is a C-based, hierarchical simulation environment for modeling system, subsystem, and individual module/component HW/SW performanceCSIM is a C-based, hierarchical simulation environment for modeling system, subsystem, and individual module/component HW/SW performance CSIM’s independent use of HW/SW models provides a path for reducing development costs as well as the life cycle costsCSIM’s independent use of HW/SW models provides a path for reducing development costs as well as the life cycle costs CSIM provides a common environment for developing and verifying system, subsystem, and module HW/SW interfaces and overall system performanceCSIM provides a common environment for developing and verifying system, subsystem, and module HW/SW interfaces and overall system performance

6 Design space resulting from evaluating a representative signal processing application Automated Support to Offline Analysis/ Optimization Each task/component (i.e. fft, fir) is abstracted by several parameters (i.e. throughput, latency, power, size, etc.)Each task/component (i.e. fft, fir) is abstracted by several parameters (i.e. throughput, latency, power, size, etc.) Each task may have several differently optimized implementationsEach task may have several differently optimized implementations An online library of implementations is selectedAn online library of implementations is selected The application(s) of interest are simulated using a subset of the available componentsThe application(s) of interest are simulated using a subset of the available components The results of simulating each subset is characterized with respect to the parameters of interestThe results of simulating each subset is characterized with respect to the parameters of interest The characterizations are used to identify a subset of implementations that are most optimal across the parameters of interestThe characterizations are used to identify a subset of implementations that are most optimal across the parameters of interest

7 Processor Architecture and Environment This graphic illustrates a representative processor architecture and an associated CSIM visualization environment. The core CSIM capability has been augmented with a dynamic scheduler that can apply complex selection criteria to the scheduling process. In this example, these complex selection criteria are encapsulated as several “modes”; the buttons on the lower right are used to interactively switch between the modes. In addition, any of the processors can be “killed” or returned to the simulation, facilitating the evaluation of failover and similar analyses. Illustrative results are shown in the following slides.

8 Power- driven scheduling; “LoPwr” Complex criteria driven scheduling, “DDPSWC” Assigned task scheduling, “Base” Assigned speed scheduling, “Fast” Max speed- scheduling, “Fastest” Dynamically Reconfigurable Architectures Real time on-demand dynamic allocation of computational resourcesReal time on-demand dynamic allocation of computational resources Each task/component (i.e. fft, fir) is abstracted by several parameters (i.e. throughput, latency, power, size, etc.)Each task/component (i.e. fft, fir) is abstracted by several parameters (i.e. throughput, latency, power, size, etc.) Each task may have several differently optimized implementationsEach task may have several differently optimized implementations An online library of implementations is selectedAn online library of implementations is selected At run time the implementation best matched to the current selection criteria is executedAt run time the implementation best matched to the current selection criteria is executed For this example, the maximum throughput to minimum power dissipation ratio is improved >10X versus standard practiceFor this example, the maximum throughput to minimum power dissipation ratio is improved >10X versus standard practice SWEPT improvement >10X vs standard practice

9 Processors faulted/ switched out Processors returned to operation Dynamically Reconfigurable Architectures Validation and verification of next generation processor architecturesValidation and verification of next generation processor architectures Any processor may be temporarily or permanently removed from the architectureAny processor may be temporarily or permanently removed from the architecture Processors may be returned to the architectureProcessors may be returned to the architecture Facilitates development and analysis of failover/fault adaptation methodologiesFacilitates development and analysis of failover/fault adaptation methodologies Extends multidimensional optimization to architectures with intermittently inactive components (especially useful for throughput/ power trades)Extends multidimensional optimization to architectures with intermittently inactive components (especially useful for throughput/ power trades)

10 Other Applications of CSIM ProgramVNSAMRFS Advanced Surface Ship Model Avionics Mission Computing System Wireless Mobile Networks Description Network attack simulator for training IA operators Agent-based resource management Computing infrastructure model Model of JSF- Integrated Core Processor Dynamic ad hoc routing behaviors Customer US Army CECOM ONR LM NE&SS- Moorestown LM NE&SS- Egan DARPA (XGComms) US Army CECOM (WIN-T) Key Technology Token-based performance modeling, HLA Intelligent agents, dynamic scheduler Token-based performance modeling, pre-emptive multitasking CPU models Token-based performance modeling, hardware/ software co-simulation Abstract propagation and arbitration models, dynamic routing algorithms

11 Specification Requirements CSIM & MaC Run-time Environment and Design Application for Polymorphous Technology Verification & Validation (RE-ADAPT V&V) ImpactImpact —Design-Time Modeling and Simulation Environment for Verification and Validation (V&V) of PCA enabled applications —Run-Time Monitoring for PCA Enabled Application V&V —Approach Validation via application to PCA enabled avionics design New IdeasNew Ideas —Methodology for run-time detection of requirement violations —Framework for automatic generation of monitoring and checking components —Dynamic run-time corrector to force reconfiguration into a safe state Real-time Systems Group, University of Pennsylvania

12 Stream Processors indicated by filled boxes GP Processors indicated by outlined boxes Dynamic bar chart indicating total active processors, active stream processor active GP processors and active threaded processors Total active processor count display MaCS messages and status Mission statusRADAR Tasks Imaging Route Planning Self Test and MaCS Flight Control Threat Avoidance Mission Assignment Communi- cations PCA Virtual Processor State and Activity System State and Task Flow Real-time Systems Group, University of Pennsylvania DARPATech Demo