A High Performance SoC: Pkunity™


A High Performance SoC: Pkunity™
Chen Jie, Peking University Microprocessor R&D Center

Contents
- PkUnity SoC Introduction
- PkUnity SoC Low Power Design

Introduction
Our lab began developing processors in 1997. We now have the UniCore processor family, the Pkunity SoC family, and many other related chip products.

PKUnity-3 Architecture

UniCore fixed-point processor
- Frequency: 600 MHz
- 32-bit Harvard-architecture RISC CPU
- Compatible with the UniCore32 instruction set
- Adds conditional mov and BLX instructions
- 8-stage instruction pipeline
- Dynamic branch prediction policy: G-share
- Pipelined instruction and data caches
- Two-level TLB
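The slide names G-share as the prediction policy but gives no parameters. The following is a minimal sketch of a textbook gshare predictor, not the UniCore implementation: the 10-bit index, the 2-bit saturating counters, and the names `GsharePredictor` and `misprediction_rate` are all assumptions made for illustration.

```python
class GsharePredictor:
    """Toy gshare predictor: (PC XOR global history) indexes 2-bit counters.

    index_bits is an assumed table size; the slides do not state one.
    """

    def __init__(self, index_bits=10):
        self.mask = (1 << index_bits) - 1
        self.history = 0                          # global branch history register
        self.counters = [1] * (1 << index_bits)   # 2-bit counters, start weakly not-taken

    def _index(self, pc):
        # Drop the two low PC bits (word-aligned instructions), then XOR with history.
        return ((pc >> 2) ^ self.history) & self.mask

    def predict(self, pc):
        # Counter values 2 and 3 mean "predict taken".
        return self.counters[self._index(pc)] >= 2

    def update(self, pc, taken):
        i = self._index(pc)
        if taken:
            self.counters[i] = min(3, self.counters[i] + 1)
        else:
            self.counters[i] = max(0, self.counters[i] - 1)
        # Shift the actual outcome into the global history.
        self.history = ((self.history << 1) | int(taken)) & self.mask


def misprediction_rate(predictor, trace):
    """Run (pc, taken) pairs through the predictor and return the miss fraction."""
    misses = 0
    for pc, taken in trace:
        if predictor.predict(pc) != taken:
            misses += 1
        predictor.update(pc, taken)
    return misses / len(trace)


# A loop branch that is taken three times, then falls through, repeated.
loop_trace = [(0x1000, i % 4 != 3) for i in range(1000)]
rate = misprediction_rate(GsharePredictor(), loop_trace)
```

Because the global history distinguishes the fourth iteration from the first three, the predictor learns the loop exit after a short warm-up, which a single per-branch 2-bit counter cannot do.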

Performance Evaluation
- UniCore-II CPI increases by 10%-15%
- G-share prediction, pipelined caches, and the two-level TLB limit the CPI increase caused by the deeper pipeline
- UniCore-II MIPS increases by 70%-80%
- The performance improvement comes from improvements in both micro-architecture and process technology
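The two percentages can be related with the usual identity MIPS = f / (CPI × 10^6). The sketch below works out the clock-frequency ratio they imply. It assumes both figures are measured against the same earlier UniCore baseline, which the slide does not state; the function name is hypothetical.

```python
def implied_frequency_ratio(mips_gain, cpi_increase):
    """Return f_new / f_old implied by a MIPS gain and a CPI increase.

    From MIPS = f / (CPI * 1e6):
        f_new / f_old = (MIPS_new / MIPS_old) * (CPI_new / CPI_old)
    Both arguments are fractions, e.g. 0.70 for +70%.
    """
    return (1.0 + mips_gain) * (1.0 + cpi_increase)


# Corner cases taken from the slide: MIPS +70%..80%, CPI +10%..15%.
low = implied_frequency_ratio(0.70, 0.10)
high = implied_frequency_ratio(0.80, 0.15)
print(f"implied clock ratio: {low:.2f}x to {high:.2f}x")
```

Under these assumptions the clock would have to rise by roughly 1.9x to 2.1x, which is consistent with the slide's point that the gain comes from both a deeper pipeline and newer technology rather than from CPI.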

SoC Design Platform
To build:
- a chip-based infrastructure
- an integrated development environment
- a design and verification flow
In PkUnity-3:
- Configurable CPU
- Configurable bus
- Configurable interrupt system
- Configurable DMA
- Configurable frequency
- Power management

HW/SW Co-verification
- Coverage-oriented VERA verification flow
- SystemC-based HW/SW co-verification methodology
- FPGA prototype

Contents
- PkUnity SoC Introduction
- PkUnity SoC Low Power Design
  - Power research status
  - PkUnity low power design and power estimation
  - Future work

Power: New Challenge
- Power is a major issue for SoCs with billions of transistors
- Power has to be reduced for portable devices that require a dramatic increase in computation power
- Deep submicron technologies (90 and 65 nm) will bring a dramatic increase in leakage power
- Power is still too high for most SoCs
- SoC architectures (HW/SW, multiprocessor, multiple memories) are not well supported by CAD tools
- Reconfigurability and flexibility compromise low power
- Leakage and very low Vdd are serious problems

Low power design technology and research topics (LP research status), by design level:
- Technology: feature size shrink, low dielectric constant material, SOI technology
- Circuit design: low power standard cell library
- Gate design: low power logic chain (gated clock, gated Vdd)
- RTL: reduce switching activity (gated clock, state machine and glitch optimization)
- Micro-architecture: parallelism, pipelining, pre-computing
- Instruction: good task partition between HW/SW, low power instruction set design
- Compiler: saving power while improving performance, memory organization
- OS: dynamic voltage scaling, I/O devices, power and energy analysis of the OS
- Application: task partition, algorithm optimization

Power Estimation Research
[Figure: power estimation hierarchy]
- Levels: System, Algorithm, Register Transfer, Logic, Circuit
- Axes: analysis precision, analysis speed
- Labels: high-level architectural model; simulation vs. analysis; simulation with timing info; extract circuit parameters; adding technology info; gate-level simulation; analysis with extracted parameters
- Tools named: SimplePower, Wattch, CACTI, PrimePower, PowerCompiler, HSPICE

Power of Pkunity
Embedded processor: high performance vs. low power
Three methods to reduce chip power:
- Shut down unused modules
- Frequency scaling
- Shut down the PLL
Pkunity-3 targets:
- CPU: < 800 mW at 1.8 V / 600 MHz
- SoC: < 2000 mW at 1.8 V / 600 MHz
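To see why frequency scaling is on the list, recall that dynamic CMOS power is proportional to V² × f. The sketch below scales a reference power figure under that relation. It is only a rough illustration: it treats the whole 800 mW CPU budget as dynamic power and ignores leakage, and the 300 MHz operating point is an assumed example, not a figure from the slides.

```python
def scaled_dynamic_power_mw(p_ref_mw, f_ref_mhz, f_new_mhz, v_ref=1.8, v_new=1.8):
    """Scale dynamic power using P_dyn proportional to V^2 * f.

    p_ref_mw is the power at (v_ref, f_ref_mhz); leakage is ignored.
    """
    return p_ref_mw * (v_new / v_ref) ** 2 * (f_new_mhz / f_ref_mhz)


# CPU budget from the slide: 800 mW at 1.8 V / 600 MHz.
# Hypothetical operating point: halve the clock, keep the voltage.
half_clock_mw = scaled_dynamic_power_mw(800, 600, 300)
print(f"upper bound at 300 MHz: {half_clock_mw:.0f} mW")
```

With the voltage fixed, power falls only linearly with frequency; the quadratic term is what makes combined voltage and frequency scaling, listed later under future work, attractive.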

Power Estimation

Power optimization
- Shut down unused modules through clock gating
- Reduce chip power by switching among multiple run modes: Run, Idle, Sleep
- Change chip frequency through dynamic PLL configuration
- Input vector control in execution components
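The run modes and dynamic PLL configuration above can be pictured as a small software model. The sketch below is hypothetical throughout: the slides name the Run, Idle and Sleep modes but do not say which clocks each mode gates, so the `CLOCKS_ENABLED` table, the `PowerManager` class, and the rule that the PLL cannot be reprogrammed while it is off are assumptions for illustration.

```python
from enum import Enum


class Mode(Enum):
    RUN = "run"
    IDLE = "idle"
    SLEEP = "sleep"


# Assumed gating per mode (not taken from the slides):
# Idle gates the CPU clock only, Sleep also stops the bus clock and the PLL.
CLOCKS_ENABLED = {
    Mode.RUN:   {"cpu": True,  "bus": True,  "pll": True},
    Mode.IDLE:  {"cpu": False, "bus": True,  "pll": True},
    Mode.SLEEP: {"cpu": False, "bus": False, "pll": False},
}


class PowerManager:
    """Toy model of mode switching and dynamic PLL configuration."""

    def __init__(self, pll_mhz=600):
        self.mode = Mode.RUN
        self.pll_mhz = pll_mhz

    def clock_enabled(self, domain):
        return CLOCKS_ENABLED[self.mode][domain]

    def set_mode(self, mode):
        self.mode = mode

    def set_frequency(self, mhz):
        # Assumption: the PLL can only be reconfigured while it is running.
        if not self.clock_enabled("pll"):
            raise RuntimeError("PLL is off in this mode")
        self.pll_mhz = mhz


pm = PowerManager()
pm.set_mode(Mode.IDLE)      # CPU clock gated, bus still running
pm.set_frequency(300)       # lower the chip frequency while idle
```

Keeping the gating policy in one table makes it easy to compare alternative mode definitions when estimating power for each mode.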

Low power design and estimation flow
[Figure: work flow]

Future Work
Low power design:
- Memory architecture (cache, TLB, register file)
- Clock system (synchronous vs. asynchronous)
- Bus system
- Instruction set selection
- Voltage and frequency scaling
- Compiler optimization
- Task movement
Power estimation:
- Pre-analyze architecture and micro-architecture designs with a fast and accurate architectural-level power simulator
- Build a full-chip power simulator
- Make the power simulator parameters reconfigurable
- Build an accurate leakage power estimation model
- Build component-specific power models

Thank you