8th Biennial Ptolemy Miniconference, Berkeley, CA, April 16, 2009. Precision Timed (PRET) Architecture. Hiren D. Patel, Ben Lickly, Isaac Liu and Edward A. Lee, University of California, Berkeley.


8th Biennial Ptolemy Miniconference Berkeley, CA April 16, 2009 Precision Timed (PRET) Architecture Hiren D. Patel, Ben Lickly, Isaac Liu and Edward A. Lee University of California, Berkeley

Slide 2 of 14: Timing Properties in Computing Abstractions
Hiren D. Patel, Berkeley. Ptolemy Miniconference, April 16, 2009.
Most traditional computing abstractions hide timing properties of software.
- Advantages
  - Focus on functionality
  - Push for higher average-case performance
- Disadvantages for real-time embedded systems
  - Unpredictable
  - Non-repeatable
  - Brittle
- Abstractions listed
  - Programming models and languages
  - Multithreading
  - Speculative execution, caches, and deep pipelines
  - Compilers and ISAs

Slide 3 of 14: Resulting Real-time Embedded Systems
- Unpredictability: difficulty in determining timing behavior through analysis
- Non-repeatability: different executions may yield different timing behavior
- Brittleness: small changes have big effects on timing behavior
Time as a first-class citizen of embedded computing.

Slide 4 of 14: Precision Timed (PRET) Architectures
Stephen A. Edwards and Edward A. Lee, "The case for the Precision Timed (PRET) machine." In Proceedings of the 44th Annual Conference on Design Automation (San Diego, California, June 2007). DAC '07. ACM, New York, NY.
Predictable and repeatable timing.

Slide 5 of 14: Precision Timed Architecture
- Scratchpad memories
- Thread-interleaved pipeline
- Time-triggered arbitration
- Round-robin thread scheduling
- ISA with timing instructions

Slide 6 of 14: Timing Instructions: Deadline
- ISA extensions
  - dead [Ip & Edwards in 2006]
  - deadload
- Deadline instructions
  - Denote the required execution time of a block
  - When decoded:
    - Stall the instruction until the timer value is 0
    - Then set the timer value to the new value
Figure labels: Block 1, Block 2, Block 3.
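The deadline semantics on this slide (stall until the timer is 0, then load the new value) can be sketched as a small cycle-count model. This is an illustrative simulation only, not the PRET simulator: the function name `block_start_times`, the `(deadline, work)` block representation, and the choice to ignore the cost of the deadline instruction itself are all assumptions made for the example.

```python
def block_start_times(blocks):
    """Return the cycle at which each block's deadline instruction completes.

    blocks: list of (deadline, work) pairs (hypothetical representation).
    Each block starts with a deadline instruction that loads `deadline`
    into the timer, followed by `work` cycles of ordinary instructions.
    """
    now = 0      # current cycle
    timer = 0    # deadline timer; counts down once per cycle, never below 0
    starts = []
    for deadline, work in blocks:
        # Deadline instruction: stall until the timer has reached zero ...
        now += timer
        # ... then set the timer to the new value and continue.
        timer = deadline
        starts.append(now)
        # The block body runs while the timer keeps counting down.
        now += work
        timer = max(0, timer - work)
    return starts


# Two runs whose blocks take different amounts of work start every block
# at the same cycle, as long as each block fits within its deadline.
print(block_start_times([(10, 4), (10, 7), (5, 9)]))
print(block_start_times([(10, 9), (10, 2), (5, 1)]))
```

In this model a deadline value acts as a lower bound on the time spent in the preceding block: a block that overruns its deadline simply delays the next one, which is the case the exception instructions on the following slide address.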

Slide 7 of 14: Timing Instructions: Exceptions
- ISA extensions
  - deadbranch
  - deadloadbranch
- What happens when deadlines are missed?
  - Raise an exception and perform pre-specified actions
To control timing behaviors in software, we need a predictable underlying architecture.
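A missed deadline that raises an exception and runs a pre-specified action might be modeled roughly as follows. The slide does not give the exact semantics of `deadbranch` or `deadloadbranch`, so this sketch makes its own assumptions: the timer is checked before each step rather than asynchronously, and `DeadlineMissed`, `run_block`, and `run_with_handler` are hypothetical names.

```python
class DeadlineMissed(Exception):
    """Raised when the deadline timer would expire before a step finishes."""


def run_block(deadline, step_costs):
    """Run steps in order; return how many completed within the deadline."""
    timer = deadline
    done = 0
    for cost in step_costs:
        if cost > timer:
            # The timer would hit zero during this step: signal the miss.
            raise DeadlineMissed(done)
        timer -= cost
        done += 1
    return done


def run_with_handler(deadline, step_costs, on_miss):
    """Run a block; on a missed deadline, perform the pre-specified action."""
    try:
        return ("ok", run_block(deadline, step_costs))
    except DeadlineMissed as exc:
        # exc.args[0] is the number of steps completed before the miss.
        return ("missed", on_miss(exc.args[0]))


# The handler here just reports the partial progress, in the spirit of an
# anytime algorithm that returns its best result so far.
print(run_with_handler(10, [3, 3, 3], lambda n: n))
print(run_with_handler(10, [4, 4, 4], lambda n: n))
```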

Slide 8 of 14: Pipeline Architecture with Predictable Timing
- Traditional pipeline:
  - Stall pipeline
  - Dependencies result in complex timing behaviors
- Thread-interleaved pipeline:
  - Predictable timing behavior of instructions

Slide 9 of 14: Thread-interleaved Pipeline with Timing Instructions
- Thread stalls
  - Main memory access
  - Deadline instructions
- Replay mechanism
  - Execute same PC next iteration
  - Decrement deadline timers
  - Stall if deadline instruction
  - If not stalled, increment PC
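The replay steps listed on this slide can be sketched as a round-robin loop over hardware threads. This is a simplified model under stated assumptions: each thread gets one slot per round, its timer decrements once per slot, only deadline stalls are modeled (not main-memory stalls), and the instruction encoding `("op",)` / `("dead", n)` and the function name `run_interleaved` are invented for the example.

```python
def run_interleaved(programs, rounds):
    """Simulate round-robin thread interleaving with deadline replay.

    programs: one instruction list per hardware thread. Instructions are
    ("op",) for ordinary work or ("dead", n) for a deadline instruction.
    Returns, per thread, the PC fetched in each of its slots.
    """
    pcs = [0] * len(programs)
    timers = [0] * len(programs)
    traces = [[] for _ in programs]
    for _ in range(rounds):
        for t, prog in enumerate(programs):  # fixed round-robin order
            if pcs[t] >= len(prog):
                continue  # this thread has finished
            traces[t].append(pcs[t])
            # Decrement the thread's deadline timer (floor at zero).
            timers[t] = max(0, timers[t] - 1)
            kind, *args = prog[pcs[t]]
            if kind == "dead":
                if timers[t] > 0:
                    continue  # stall: the same PC is replayed next round
                timers[t] = args[0]  # timer expired: load the new deadline
            pcs[t] += 1  # not stalled: advance to the next instruction
    return traces


thread_a = [("dead", 3), ("op",), ("dead", 0), ("op",)]
thread_b = [("op",), ("op",), ("op",)]
print(run_interleaved([thread_a, thread_b], rounds=6))
```

A stalled thread only repeats its own slot, so in this model the trace of `thread_b` is the same whatever `thread_a` does.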

Slide 10 of 14: Memory Hierarchy with Predictable Timing
- Scratchpad memories
  - Software managed caches
  - Each thread has a uniquely defined address space
- Predictable timing behavior during cache accesses
- Shared data goes through to main memory
Figure labels: thread0, thread2, thread4; 1 cycle latency; 13 cycles latency.

Slide 11 of 14: Time-triggered Access to Main Memory
- Predictable timing behavior when accessing main memory
- Worst-case bound on access time: 13* = 90 cycles
- Each thread must make and complete access within its window
Figure labels: Memory wheel; Time-triggered access.
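A memory wheel of this kind can be modeled as a fixed rotation of per-thread windows. The sketch below assumes six hardware threads and a 13-cycle window that one access fully occupies, so an access can only begin at the start of the thread's own window. The thread count and the window length are assumptions, not taken from the transcript; they were chosen because, under them, the model reproduces the 90-cycle bound stated on the slide. `access_completion` is a hypothetical helper name.

```python
def access_completion(request_cycle, thread, n_threads=6, window=13):
    """Cycle at which a main-memory access requested by `thread` completes.

    Assumed model: the wheel rotates through one `window`-cycle slot per
    thread, and an access must start at the beginning of the thread's slot.
    """
    period = n_threads * window   # one full revolution of the wheel
    offset = thread * window      # where this thread's slot begins
    # Number of full revolutions to wait for the first slot start at or
    # after the request (ceiling division, clamped at zero).
    k = max(0, -(-(request_cycle - offset) // period))
    start = offset + k * period
    return start + window


# Latency seen by thread 0 for every possible request phase in a revolution.
latencies = [access_completion(c, thread=0) - c for c in range(6 * 13)]
worst_case = max(latencies)
best_case = min(latencies)
print(worst_case, best_case)
```

The worst case arises when a request arrives one cycle after the thread's window has opened: it waits out the rest of the revolution and then spends a full window on the access.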

Slide 12 of 14: Examples
- Video rendering for a computer game
  - Real-time requirements through deadline instructions
- Autonomous robot finding moving target
  - Anytime algorithms using timing exceptions
- Eliminating time-exploiting attacks in cryptosystems
  - Repeatable timing behavior through deadline instructions
  - RSA Encryption (RSAREF 2.0)
  - DSA Encryption from OpenSSL (0.9.8j)

Slide 13 of 14: Predictable Timing and High Performance
- Scratchpad memory allocation schemes
- Thread scheduling and synchronizations
- Real-time network on-chip
- Multi-PRET architecture
- Timing analysis
- PRET Machine
- Programming models and languages with time semantics
- Code generation from Giotto, SDF, and PTIDES

Slide 14 of 14: Conclusion
- Treat time as a first-class property of embedded computing
- Predictable and repeatable timing behaviors
- PRET cycle-accurate simulator
  - ISA extensions with timing instructions
  - Architecture with predictable timing behaviors
Download:

End