PS - Theo Claasen DAC 2000 - 1 First-time-right silicon, but…. to the right specification Theo A.C.M. Claasen Chief Technology Officer Philips Semiconductors.

Slides:



Advertisements
Similar presentations
FPGA (Field Programmable Gate Array)
Advertisements

Philips Research ICS 252 class, February 3, The Trimedia CPU64 VLIW Media Processor Kees Vissers Philips Research Visiting Industrial Fellow
1 of 24 The new way for FPGA & ASIC development © GE-Research.
Advanced Processor Architectures for Embedded Systems Witawas Srisa-an CSCE 496: Embedded Systems Design and Implementation.
System On Chip - SoC Mohanad Shini JTAG course 2005.
1/1/ / faculty of Electrical Engineering eindhoven university of technology Processor support devices Part 1:Interrupts and shared memory dr.ir. A.C. Verschueren.
A reconfigurable system featuring dynamically extensible embedded microprocessor, FPGA, and customizable I/O Borgatti, M. Lertora, F. Foret, B. Cali, L.
MotoHawk Training Model-Based Design of Embedded Systems.
Feng-Xiang Huang A Low-Cost SOC Debug Platform Based on On-Chip Test Architectures.
Some Thoughts on Technology and Strategies for Petaflops.
IP Re-Use: The Key Challenge in SOC (System- on-Chip) Product Development D Y Yang Chairman, Taiwan SoC Consortium Jan. 14, 2003.
Define Embedded Systems Small (?) Application Specific Computer Systems.
Configurable System-on-Chip: Xilinx EDK
Modern trends in computer architecture and semiconductor scaling are leading towards the design of chips with more and more processor cores. Highly concurrent.
6/30/2015HY220: Ιάκωβος Μαυροειδής1 Moore’s Law Gordon Moore (co-founder of Intel) predicted in 1965 that the transistor density of semiconductor chips.
Implementation of DSP Algorithm on SoC. Mid-Semester Presentation Student : Einat Tevel Supervisor : Isaschar Walter Accompaning engineer : Emilia Burlak.
From Concept to Silicon How an idea becomes a part of a new chip at ATI Richard Huddy ATI Research.
Using FPGAs with Embedded Processors for Complete Hardware and Software Systems Jonah Weber May 2, 2006.
A Flexible Architecture for Simulation and Testing (FAST) Multiprocessor Systems John D. Davis, Lance Hammond, Kunle Olukotun Computer Systems Lab Stanford.
1 A survey on Reconfigurable Computing for Signal Processing Applications Anne Pratoomtong Spring2002.
Detailed Technical Feature Presentation Background Information The Importance of Software Software Roadblocks Development Environment DSP Development Cycle.
Programmable Logic- How do they do that? 1/16/2015 Warren Miller Class 5: Software Tools and More 1.
1 Instant replay  The semester was split into roughly four parts. —The 1st quarter covered instruction set architectures—the connection between software.
PHY 201 (Blum) Buses Warning: some of the terminology is used inconsistently within the field.
- 1 - A Powerful Dual-mode IP core for a/b Wireless LANs.
Role of Standards in TLM driven D&V Methodology
Xilinx at Work in Hot New Technologies ® Spartan-II 64- and 32-bit PCI Solutions Below ASSP Prices January
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
Ross Brennan On the Introduction of Reconfigurable Hardware into Computer Architecture Education Ross Brennan
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
11 Using SPIRIT for describing systems to debuggers DSDP meeting February 2006 Hobson Bullman – Engineering Manager Anthony Berent – Debugger Architect.
THE PHILIPS NEXPERIA DIGITAL VIDEO PLATFORM. The Digital Video Revolution  Transition from Analog to Digital Video  Navigate, store, retrieve and share.
REXAPP Bilal Saqib. REXAPP  Radio EXperimentation And Prototyping Platform Based on NOC  REXAPP Compiler.
1 Integration Verification: Re-Create or Re-Use? Nick Gatherer Trident Digital Systems.
May 17, USB Semiconductor IP How to Integrate USB into Your Design Eric Huang inSilicon Corporation.
VLSI & ECAD LAB Introduction.
집적회로 Spring 2007 Prof. Sang Sik AHN Signal Processing LAB.
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
System Design with CoWare N2C - Overview. 2 Agenda q Overview –CoWare background and focus –Understanding current design flows –CoWare technology overview.
J. Christiansen, CERN - EP/MIC
® SPARTAN Series High Volume System Solution. ® Spartan/XL Estimated design size (system gates) 30K 5K180K XC4000XL/A XC4000XV Virtex S05/XL.
VLSI DESIGN CONFERENCE 1998 TUTORIAL Embedded System Design and Validation: Building Systems from IC cores to Chips Rajesh Gupta University of California,
Field Programmable Gate Arrays (FPGAs) An Enabling Technology.
Veronica Eyo Sharvari Joshi. System on chip Overview Transition from Ad hoc System On Chip design to Platform based design Partitioning the communication.
EE3A1 Computer Hardware and Digital Design
DIPARTIMENTO DI ELETTRONICA E INFORMAZIONE Novel, Emerging Computing System Technologies Smart Technologies for Effective Reconfiguration: The FASTER approach.
EKT303/4 PRINCIPLES OF PRINCIPLES OF COMPUTER ARCHITECTURE (PoCA)
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
2D/3D Integration Challenges: Dynamic Reconfiguration and Design for Reuse.
Electronic system level design Teacher : 蔡宗漢 Electronic system level Design Lab environment overview Speaker: 范辰碩 2012/10/231.
SOC Virtual Prototyping: An Approach towards fast System- On-Chip Solution Date – 09 th April 2012 Mamta CHALANA Tech Leader ST Microelectronics Pvt. Ltd,
This material exempt per Department of Commerce license exception TSU Xilinx On-Chip Debug.
Survey of multicore architectures Marko Bertogna Scuola Superiore S.Anna, ReTiS Lab, Pisa, Italy.
Lecture 7: Overview Microprocessors / microcontrollers.
Multiprocessor SoC integration Method: A Case Study on Nexperia, Li Bin, Mengtian Rong Presented by Pei-Wei Li.
Chapter 11 System-Level Verification Issues. The Importance of Verification Verifying at the system level is the last opportunity to find errors before.
Design with Vivado IP Integrator
SEPTEMBER 8, 2015 Computer Hardware 1-1. HARDWARE TERMS CPU — Central Processing Unit RAM — Random-Access Memory  “random-access” means the CPU can read.
System on a Programmable Chip (System on a Reprogrammable Chip)
Programmable Hardware: Hardware or Software?
Introduction ( A SoC Design Automation)
System On Chip - SoC E.Anjali.
ENG3050 Embedded Reconfigurable Computing Systems
Chapter 1: Introduction
Overview of Embedded SoC Systems
Spartan-II + Soft IP = Programmable ASSP
ChipScope Pro Software
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
HIGH LEVEL SYNTHESIS.
ChipScope Pro Software
Presentation transcript:

PS - Theo Claasen DAC First-time-right silicon, but…. to the right specification Theo A.C.M. Claasen Chief Technology Officer Philips Semiconductors

PS - Theo Claasen DAC Content Consumer systems as driver of design technology The design crisis and its solution: reuse Generations of reuse How to get the specifications right before making the chip –Rapid silicon prototyping Conclusions

PS - Theo Claasen DAC (R)evolution in the consumer market Consumer systems become –Digital –Programmable (downloadable) –User adaptive –Connected (with each other and to a network) –Multifunctional fashion items

PS - Theo Claasen DAC Example

PS - Theo Claasen DAC The environment Same functions wanted –At home –On the move –In the office All the time, anywhere With consistency of –Data –User interface

PS - Theo Claasen DAC Consequences for design IC design needs to handle –Integrated systems (SoC) –Increasing complexity –Fast time to market (too late = no business) –Hardware and software –High processing speed and / or low power This requires a design process that is –Predictable in time and performance –Efficient The key elements of such a design process are –High level of re-use –Silicon prototyping So what about the design crisis?

PS - Theo Claasen DAC Design crisis Process technology provides a 59% per year increase of complexity (Moore’s law) Design efficiency increases by “only” 25% per year Log # transistors Time Technology 59% / year Design 25% / year Design gap

PS - Theo Claasen DAC Design crisis revisited The design gap has been predicted for the last two decades Yet, in every new technology, we design chips of 1 cm 2 that are completely utilized Log # transistors Time Technology 59% / year average Design 25% / year & paradigm shifts New reuse method New process technology

PS - Theo Claasen DAC Reuse Reuse comes in generations

PS - Theo Claasen DAC Conditions for reuse A consistent methodology –Standards Interconnection of elements File and directory structure Test –Guidelines Design (power, clock) Test Debug CoReUse ® standards CoReUse ® constraints Wrapping of a coreContents of a core Directory structure All views present & consistent File name conventions All soft, firm or hard rules Previous + all maintenance rules All CoReUse ® guidelines

PS - Theo Claasen DAC Conditions for reuse A consistent methodology A comprehensive library of reuse elements –Compliant with the methodology –Proven on silicon (individually and in combination)

PS - Theo Claasen DAC Conditions for reuse A consistent methodology A comprehensive library of reuse elements Development tools Rapid silicon prototyping

PS - Theo Claasen DAC Rapid silicon prototyping It is no use to have “first-time-right” silicon if it was not designed with the right specification Specifications can only be right after extensive simulation –With real data –In a real environment Simulation –Needs to be bit-accurate –Should comprise hours of data (at 100Mhz = cycles) –Debugs both hardware and software The only simulation that is fast enough is using silicon –Running at (almost) same speed –Having one-to-one mapping to actual silicon

PS - Theo Claasen DAC Rapid silicon prototyping Design cycle benefits RSP Design Process Production Release Conventional Design Process (1 st time right Si) Production Release SW Development and Validation Silicon Fabrication HW Development and Validation Placement, Routing & Physical Verification Faster chip development –More than 50% total design cycle reduction –True HW / SW co-development Higher probability of first-pass success

PS - Theo Claasen DAC Management attention –Understanding of the benefits –Organization –Discipline –Rewards (for creation and usage) Conditions for reuse A consistent methodology A comprehensive library of reuse elements Development tools Rapid silicon prototyping

PS - Theo Claasen DAC Service –Support organization (help desk) –Documentation –Communication Conditions for reuse A consistent methodology A comprehensive library of reuse elements Development tools Rapid silicon prototyping Management attention

PS - Theo Claasen DAC The first generation reuse Standard cells Has been relatively easy –Extensive libraries for each technology generation Very successful after emergence of logic synthesis tools Various forms of RSP available –FPGA’s –HW simulation

PS - Theo Claasen DAC Standard cells

PS - Theo Claasen DAC The second generation reuse IP blocks (virtual components ) Virtual components have been around for some time Limited success so far, except for specific blocks –CPU cores –DSP cores –Memories –Some periphery devices Major problems –Slow emergence of standards (VSIA) –Business models and liabilities (VCX) –Rapid silicon prototyping –Support and service Extension to SW reusable modules needed –Speeding up HW design brings SW on critical path

PS - Theo Claasen DAC IP reuse Sea-of-IP 

PS - Theo Claasen DAC

PS - Theo Claasen DAC IP based RSP Reference Design Deconfigurable & Extendible Prototype Chip Made from Reusable Components PS own IP 3 rd Party IP Customer IP The busses, not the CPU, are the backbone of this strategy Prototype to finished ASIC added (integrated) modified (extended) removed (deconfigured) Production ASIC Deconfigured & Extended Customer Specific Solution

PS - Theo Claasen DAC VLSI Velocity TM The RSP7 board and development system From system-on-a-benchtop to system-on-a-chip –Wealth of development functionality supporting system, chip and software development Logic Analyzer HW Emulator Standard Plug-in Board On-Board GateField FPGA JTAG Control JumpStart ARM Host Compiler & Debugger Velocity™ Rapid Silicon Prototyping System User Interface FPGA Proto Board

PS - Theo Claasen DAC Logic Analyzer HW Emulator FPGA Proto Board Standard Plug-in Board JumpStart ARM Host Compiler & Debugger User Interface On-Board GateField FPGA JTAG Control Velocity™ Rapid Silicon Prototyping System

PS - Theo Claasen DAC The third generation reuse Architectures (Silicon System Platform) Silicon System Platform –Flexible architecture for hardware and software –Specific (programmable) components –Bus architecture –Software modules –Rules and guidelines for design of HW and SW Has been successful in PC’s –Dominance of a few players who specify and control architecture Application domain specific (difference in constraints) –Speed (compute power) –Dissipation –Costs –Real / non-real time data

PS - Theo Claasen DAC Architecture reuse System DSP ON-CHIP BUSES CPUDRAMAnalog Firmware Different Systems-on-Silicon Platform DSP core CPU core bus Memory Specific blocks OS API Applications Drivers One platform per application domain

PS - Theo Claasen DAC Architecture reuse

PS - Theo Claasen DAC Nexperia TM Application domain specific (e.g. digital video, digital audio, telecom handsets) architectural concept for programmable functions, comprising –Processors (CPU for control and DSP for streaming data) –Connected with low-speed and high-speed busses to a shared memory –A comprehensive set of peripherals –Software components ranging from drivers and API to application modules Scalable from high end to low end –Cost –Performance –Features

PS - Theo Claasen DAC TM-xxxx D$ I$ TriMedia CPU DEVICE I/P BLOCK DVP System Silicon VLIW Media Processor: 100 to 300+ MHz 32-bit or 64-bit Nexperia System Busses PI bus Memory bus bit PI BUS SDRAM MMI DVP MEMORY BUS DEVICE I/P BLOCK PRxxxx D$ I$ MIPS CPU DEVICE I/P BLOCK PI BUS General Purpose RISC Processor 50 to 300+ MHz 32-bit or 64-bit Library of Device Blocks Image coprocessors DSPs UART 1394 USB … and more TriMedia TM MIPS TM Nexperia TM DVP Hardware architecture Flexible architecture for digital video applications

PS - Theo Claasen DAC Nexperia TM Scalability VLIW SDRAM MMI TriMedia CPU + Device blocks when control functions are minimal MIPS CPU + Trimedia CPU replacing some Device blocks RISC SDRAM VLIW MMI Single architecture, multiple product configurations –Processor core options - TM32, TM64, MIPS32, MIPS64... –Device block options Highly programmable to weakly programmable MIPS CPU + Device blocks + Software RISC SDRAM MMI

PS - Theo Claasen DAC NAPA Nexperia TM Advanced Prototyping Architecture

PS - Theo Claasen DAC NAPA Nexperia TM Advanced Prototyping Architecture Is the next generation Rapid Silicon Prototyping –Expands on the capabilities of Velocity™ –Builds nicely on the platform architecture Allows mix and match of processors and peripherals –Upgrading the processor(s) –Upgrading the peripherals –Independently

PS - Theo Claasen DAC The NAPA system Start with a platform architecture with two processors (a CPU and a DSP) and a memory controller. Add a high speed bus to accommodate a high speed memory interface for streaming data. DMA Gate MM/S DMA Gate M/SM MM MM Peripheral M/S MEMC Processor Periph Processor Periph Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Crossover Bridge M/S Bridge Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S

PS - Theo Claasen DAC MEMC Processor Periph Tunnel MEMC Processor Periph Tunnel Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S DMA Gate MM/S DMA Gate M/SM Crossover Bridge M/S BridgeTunnelBridge Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Bridge (optional) Split the system into logical parts by using bridges and tunnels Bridge: no latency modest speed Tunnel: some latency high speed In many systems one of the memory controllers will be disabled The NAPA system

PS - Theo Claasen DAC The NAPA system MEMC Processor Periph Tunnel For each of these parts a prototype chip is made MEMC Processor Periph Tunnel Separate boards for processor and memory and for periphery Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S DMA Gate MM/S DMA Gate M/SM Crossover Bridge M/S Bridge Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Tunnel Bridge Peripherals CPUs/Memory

PS - Theo Claasen DAC NAPA card cage CPUs/Memory Peripherals FPGA card Backplane allows probing and logic analyzer connections Cards can be inserted in any order

PS - Theo Claasen DAC MEMC Processor Periph Tunnel MEMC Processor Periph Tunnel Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S DMA Gate MM/S DMA Gate M/SM Crossover Bridge M/S BridgeTunnelBridge Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Bridge Building an IC from the prototype ASIC or CSIC MEMC Processor Periph Tunnel MEMC Processor Periph Tunnel Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S DMA Gate MM/S DMA Gate M/SM Crossover Bridge M/S BridgeTunnelBridge Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Bridge Put it all together

PS - Theo Claasen DAC MEMC Processor Periph Tunnel MEMC Processor Periph Tunnel Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S Peripheral M/S Peripheral MM/S Peripheral M/SM Peripheral M/S DMA Gate MM/S DMA Gate M/SM Crossover Bridge M/S BridgeTunnelBridge Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Bridge Building an IC Deconfigure: remove unwanted components MEMC Processor Periph Processor Periph Peripheral MM/S Peripheral M/SM Peripheral M/S Peripheral M/S Peripheral MM/S Peripheral M/S DMA Gate MM/S DMA Gate M/SM Crossover Bridge M/S Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Bridge

PS - Theo Claasen DAC MEMC Processor Periph Processor Periph Peripheral MM/S Peripheral M/SM Peripheral M/S Peripheral M/S Peripheral MM/S Peripheral M/S DMA Gate MM/S DMA Gate M/SM Crossover Bridge M/S Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Bridge Building an IC Extend: add prototyped (FPGA) components This will provide the database for the chip MEMC Processor Periph Processor Periph Peripheral MM/S Peripheral M/SM Peripheral M/S Peripheral M/S Peripheral MM/S Peripheral M/S DMA Gate MM/S DMA Gate M/SM Crossover Bridge M/S Peripheral M/S Peripheral M/S Peripheral M/S Peripheral M/S Bridge Peripheral M/S Peripheral MM/S Peripheral M/S Peripheral M/S

PS - Theo Claasen DAC The fourth generation reuse IC reuse We need to go to 100M+ transistor chips in deep sub-micron Mask costs get very expensive Too expensive to spin silicon –For correcting design errors –For correcting specification errors –For designing variants Basic technology: retargetable / reconfigurable systems No methodology yet A number of start ups develop technology

PS - Theo Claasen DAC IC reuse Problems to be addressed –Silicon efficiency (area) –Compiler efficiency (code size and speed) –Computational efficiency (speed) –HW / SW co-design –Reconfigurable interconnectivity –SW design for reuse No prototyping issues anymore Guaranteed fast time-to-market

PS - Theo Claasen DAC Conclusions Consumer IC’s generate the need for fast and efficient design methods The design efficiency increases slower than Moore’s law New generations of reuse technology are the solution to the corresponding design gap IP reuse is emerging The next wave will be architecture reuse (Silicon System Platforms) Rapid silicon prototyping is an essential element of these reuse technologies The new wave may be IC reuse with retargetable architectures