THE PHILIPS NEXPERIA DIGITAL VIDEO PLATFORM. The Digital Video Revolution  Transition from Analog to Digital Video  Navigate, store, retrieve and share.

Slides:



Advertisements
Similar presentations
Chapter 7: System Buses Dr Mohamed Menacer Taibah University
Advertisements

Computer Architecture
System Integration and Performance
Philips Research ICS 252 class, February 3, The Trimedia CPU64 VLIW Media Processor Kees Vissers Philips Research Visiting Industrial Fellow
AMD OPTERON ARCHITECTURE Omar Aragon Abdel Salam Sayyad This presentation is missing the references used.
Datorteknik BusInterfacing bild 1 Bus Interfacing Processor-Memory Bus –High speed memory bus Backplane Bus –Processor-Interface bus –This is what we usually.
Lecture 01: Introduction to Microcomputer & Embedded Systems.
S. Barua – CPSC 440 CHAPTER 8 INTERFACING PROCESSORS AND PERIPHERALS Topics to be covered  How to.
1 Hardware and Software Architecture Chapter 2 n The Intel Processor Architecture n History of PC Memory Usage (Real Mode)
A New Network Processor Architecture for High-speed Communication Xiaoning Nie; Gazsi, L.; Engel, F.; Fettweis, G. Signal Processing Systems, SiPS.
Architectural Considerations for CPU and Network Interface Integration C. D. Cranor; R. Gopalakrishnan; P. Z. Onufryk IEEE Micro Volume: 201, Jan.-Feb.
University College Cork IRELAND Hardware Concepts An understanding of computer hardware is a vital prerequisite for the study of operating systems.
1 EE249 Discussion A Method for Architecture Exploration for Heterogeneous Signal Processing Systems Sam Williams EE249 Discussion Section October 15,
TECH CH03 System Buses Computer Components Computer Function
GCSE Computing - The CPU
Parallel Computer Architectures
CPU Chips The logical pinout of a generic CPU. The arrows indicate input signals and output signals. The short diagonal lines indicate that multiple pins.
Module I Overview of Computer Architecture and Organization.
Lecture 12 Today’s topics –CPU basics Registers ALU Control Unit –The bus –Clocks –Input/output subsystem 1.
CS-334: Computer Architecture
Input/Output. Input/Output Problems Wide variety of peripherals —Delivering different amounts of data —At different speeds —In different formats All slower.
PS - Theo Claasen DAC First-time-right silicon, but…. to the right specification Theo A.C.M. Claasen Chief Technology Officer Philips Semiconductors.
Cis303a_chapt06_exam.ppt CIS303A: System Architecture Exam - Chapter 6 Name: __________________ Date: _______________ 1. What connects the CPU with other.
1 Lecture 20: I/O n I/O hardware n I/O structure n communication with controllers n device interrupts n device drivers n streams.
Windows Operating System Internals - by David A. Solomon and Mark E. Russinovich with Andreas Polze Unit OS6: Device Management 6.1. Principles of I/O.
CHAPTER 3 TOP LEVEL VIEW OF COMPUTER FUNCTION AND INTERCONNECTION
Computers organization & Assembly Language Chapter 0 INTRODUCTION TO COMPUTING Basic Concepts.
Top Level View of Computer Function and Interconnection.
System bus.
© 2007 Xilinx, Inc. All Rights Reserved This material exempt per Department of Commerce license exception TSU Hardware Design INF3430 MicroBlaze 7.1.
Chapter 1 Introduction. Objectives To explain the definition of computer architecture To discuss the history of computers To describe the von-neumann.
F. Gharsalli, S. Meftali, F. Rousseau, A.A. Jerraya TIMA laboratory 46 avenue Felix Viallet Grenoble Cedex - France Embedded Memory Wrapper Generation.
I/O Computer Organization II 1 Interconnecting Components Need interconnections between – CPU, memory, I/O controllers Bus: shared communication channel.
EEE440 Computer Architecture
2 Systems Architecture, Fifth Edition Chapter Goals Describe the system bus and bus protocol Describe how the CPU and bus interact with peripheral devices.
ECEG-3202 Computer Architecture and Organization Chapter 3 Top Level View of Computer Function and Interconnection.
Chapter 4 MARIE: An Introduction to a Simple Computer.
L/O/G/O Input Output Chapter 4 CS.216 Computer Architecture and Organization.
VLSI Algorithmic Design Automation Lab. THE TI OMAP PLATFORM APPROACH TO SOC.
Chapter 13 – I/O Systems (Pgs ). Devices  Two conflicting properties A. Growing uniformity in interfaces (both h/w and s/w): e.g., USB, TWAIN.
By Fernan Naderzad.  Today we’ll go over: Von Neumann Architecture, Hardware and Software Approaches, Computer Functions, Interrupts, and Buses.
Dr Mohamed Menacer College of Computer Science and Engineering, Taibah University CE-321: Computer.
Playstation2 Architecture Architecture Hardware Design.
Assoc. Prof. Dr. Ahmet Turan ÖZCERİT.  What Operating Systems Do  Computer-System Organization  Computer-System Architecture  Operating-System Structure.
Chapter 6 Storage and Other I/O Topics. Chapter 6 — Storage and Other I/O Topics — 2 Introduction I/O devices can be characterized by Behaviour: input,
Overview von Neumann Architecture Computer component Computer function
Review of Computer System Organization. Computer Startup For a computer to start running when it is first powered up, it needs to execute an initial program.
Multiprocessor SoC integration Method: A Case Study on Nexperia, Li Bin, Mengtian Rong Presented by Pei-Wei Li.
Chapter 3 System Buses.  Hardwired systems are inflexible  General purpose hardware can do different tasks, given correct control signals  Instead.
Chapter 11 System Performance Enhancement. Basic Operation of a Computer l Program is loaded into memory l Instruction is fetched from memory l Operands.
1 load [2], [9] Transfer contents of memory location 9 to memory location 2. Illegal instruction.
BIC 10503: COMPUTER ARCHITECTURE
An Introduction to Software Architecture
William Stallings Computer Organization and Architecture
Presentation transcript:

THE PHILIPS NEXPERIA DIGITAL VIDEO PLATFORM

The Digital Video Revolution  Transition from Analog to Digital Video  Navigate, store, retrieve and share digital programs as well as access to new interactive services and connectivity possibility  Home entertainment systems will be implemented with a number of Digital Video appliances  Digital Televisions (DTVs)  DVD Players  Digital Video Recorders  Set-top Boxes

The Philips Nexperia Platform Approach  Philips Semiconductors decided to serve the application domains of digital video and mobile with a platform approach : Nexperia  Nexperia : ‘next experience’  Nexperia platform properties  Flexibility  Innovation  Future-proof  Using architecture framework and IP blocks

Nexperia-DVP Platform Conceps  Nexperia-DVP is a Reference Architecture  A set of documents the describes how the products of the Digital Video Product Family will be partitioned into subsystems and how functionality will be split over theses subsystems.  Nexperia-DVP Soc Reference Architecture  Nexperia-DVP Software Reference Architecture  Nexperia-DVP System Reference Architecture

Nexperia-DVP SoC Reference Architecture. Scalable VLIW Media Processor:  100 to 300+ MHz  32-bit or 64-bit Nexperia ™ System Buses  bit General-purpose Scalable RISC Processor  50 to 300+ MHz  32-bit or 64-bit Library of Device IP Blocks  Image coprocessors  DSPs  UART  1394  USB …and more TM-xxxxD$ I$ TriMedia CPU DEVICE IP BLOCK DVP SYSTEM SILICON PI BUS SDRAM MMI DVP MEMORY BUS DEVICE IP BLOCK PRxxxx D$ I$ MIPS CPU DEVICE IP BLOCK PI BUS TriMedia ™ MIPS ™

Three Levels of Abstraction Level 1 : DVP Software-Hardware Rules  Deals with the software view of the hardware  Unified Memory Architecture  Rules for Data Movement  Endianess  Ordering & Coherency  Interrupts  Data formats, including Pixel Formats  Trimedia-MIPS Communication  Protection  Boot

Three Levels of Abstraction Level 2 : Device Transaction Level (DTL)  Deals with point-to-point transfers between Device IPs and the Connection Network, specifying a Device IP partition and architecture that is compatible with Level 1  Two main types of ports : Typical of Device IP  Device Control & Status Ports  DMA ports

Three Levels of Abstraction Level 3 : Connection Network  Deals with the more traditional Bus Hierarchy & Bus Level  In second generation Nexperia-DVP  The Device Control & Status (DCS) Network - Primarily a low latency communication path for the CPUs and other initiators to access the control & status register in the Device IPs  The Memory Connection Network - It will automate the generation of structures - current generation is implemented with two key elements : Memory Transaction Level (MTL) ports and protocol & Pipelined Memory Access Network

Chiplet-based Design  Chiplet : partitioning divides the logic hierarchy into manageable sized blocks  A chiplet is a group of modules which are placed together because either they are synchronous to each other or they are not timing critical.

Chiplet-based Design (Cont.)  The partitioning of the top-level netlist among chiplets followed these guidelines  There should be as few synchronous signal crossings between chiplets as possible  The clock module is placed into a separate chiplet because of its complexity  Cross-chiplet scan chains are not allowed

Nexperia-DVP Software Architecture  Scalable from low-end to high-end  Consistent API (on MIPS or TriMedia)  Single Streaming Architecture for MIPS and TriMedia  Aligned to Nexperia-DVP hardware architecture and IP blocks  Operating system independent software layers  Re-use of software components on any instance of the platform

Nexperia-DVP System Reference Architecture  Performance Characteristics  The system must work under normal conditions  The system must have a given (short) maximum input/output delay  The system must behave gracefully under exceptional condition  The system must be as cost effective as possible  Critical Resources  Memory Bandwidth  CPU Cycles  (RAM) Memory Size

Effect of bandwidth on transaction latency  The memory transaction latency experienced by a CPU is influenced by four different factors  Minimum transaction cost  Transactions of other blocks that take precedence  Pending transactions of blocks that do not take precedence  Other transactions of the CPU itself

Effect of transaction latency on CPU performance  Typical average behavior of two types of code on a Nexperia-DVP system DSP codeControl Code Characteristics Long expressions Highly repetitive loops Lots of switch/if statements Many function calls Working Set Size Typically smaller than CPUs I-cache size Typically much larger than CPUs I-cache size Typical instruction repetition rate before cache line invalidate 40 or moreBetween 1 and 3 Typical CPU stall cycles due to cache misses on moderately loaded system 20% and below on TM80% and above on TM 60% and above on MIPS Predominant Cache Misses D-CacheI-Cache Typical bandwidth consumed for each effective CPU Mcycle 400KB/s on TM6.4MB/s on TM 1.5MB/s on MIPS

Memory Arbitration  To regulate the distribution of available bandwidth over the requesting blocks : Utilize an advanced DDR arbitration scheme  Fair distribution of bandwidth over requesting blocks  Shortest possible transaction latency for CPUs  Nexperia-DVP memory arbiter deploys a sophisticated algorithm for distribution of bandwidth over requesting blocks  Guaranteed bandwidth  Priority based distribution of remainder

Scheduling Techniques  Scheduling of Hardware Accelerators time A1 A2 “Perform A2 after A1 is finished” “Run slice of A2 when A1 finished a slice” “Start A2 after a slice of A1, and guarantee that A2 will not overtake A1”

Scheduling Techniques (Cont.)  Properties of Scheduling Techniques SequencingSlicingStaggering Software effortLowest Higher, depends on slice size Low Buffer memoryLargestSmallestMiddle Input/output delayLongestShorter Requires of processors Nothing Must support slicing Must be sequential and localized