Faculty of Electrical Engineering, Eindhoven University of Technology: Architectures of Digital Information Systems, part 5: Special and weird ‘processor’ types

Presentation transcript:

Faculty of Electrical Engineering, Eindhoven University of Technology
Architectures of Digital Information Systems
Part 5: Special and weird ‘processor’ types
dr.ir. A.C. Verschueren
Eindhoven University of Technology, Section of Digital Information Systems

Very Large Instruction Word machines

VLIW machines use ‘super-instructions’ which explicitly control all functional units in parallel.

A single super-instruction, > 100 bits long, with one sub-instruction going to each functional unit:
[ integer 1 | integer 2 | load/store | floating pt. | jump/call ]

– All scheduling must be done by the compiler
– Unused sub-instructions must be ‘No operation’

Trimedia: variable length instructions
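To make the compiler's job concrete, here is a minimal sketch of packing operations into super-instructions. It is not a real VLIW compiler: the names `schedule`, `UNIT_SLOTS` and `program`, the register names and the one-bundle latency are all assumptions for illustration, and it only tracks read-after-write dependencies.

```python
# Slot names follow the five sub-instruction fields on the slide.
SLOTS = ("integer 1", "integer 2", "load/store", "floating pt.", "jump/call")

# Hypothetical mapping from operation kind to the slots that can execute it.
UNIT_SLOTS = {
    "int": ("integer 1", "integer 2"),
    "mem": ("load/store",),
    "fp": ("floating pt.",),
    "jump": ("jump/call",),
}


def schedule(ops):
    """Greedily place each operation in the earliest bundle that has a free
    slot of the right kind and comes after the producers of its sources.
    Assumes every result is usable one bundle later; ignores WAR/WAW hazards."""
    bundles = []    # each bundle maps slot name -> operation text or "nop"
    ready_at = {}   # register -> first bundle index that may read it
    for name, unit, dest, sources in ops:
        index = max((ready_at.get(reg, 0) for reg in sources), default=0)
        while True:
            while index >= len(bundles):
                # New bundles start out completely filled with 'No operation'.
                bundles.append(dict.fromkeys(SLOTS, "nop"))
            free = [s for s in UNIT_SLOTS[unit] if bundles[index][s] == "nop"]
            if free:
                bundles[index][free[0]] = name
                break
            index += 1
        if dest is not None:
            ready_at[dest] = index + 1
    return bundles


# (text, unit kind, destination register, source registers)
program = [
    ("add r1,r2,r3", "int", "r1", ("r2", "r3")),
    ("sub r4,r5,r6", "int", "r4", ("r5", "r6")),
    ("mul r7,r1,r4", "int", "r7", ("r1", "r4")),
    ("load r8,[r9]", "mem", "r8", ("r9",)),
]
bundles = schedule(program)
for bundle in bundles:
    print(" | ".join(bundle[slot] for slot in SLOTS))
```

In this sketch the `mul` has to wait for both integer results, so the second bundle carries one useful sub-instruction and four ‘No operation’ fields, which is the kind of waste that variable length instructions avoid.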

Single Instruction-Multiple Data machines

Lots of ALUs performing the same operation.

Systolic array:
– Number of ALUs equals number of data elements
– Can have local storage and neighbour connections

Vector processor:
– Number of ALUs below number of data elements
– Lack of dependencies allows VERY fast pipelines

Intel’s MMX is a kind of SIMD!
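The difference between the two ALU counts can be sketched in a few lines. This is only a software model under assumed names (`simd_execute`, `num_alus`); real hardware would run the lanes truly in parallel.

```python
import operator


def simd_execute(op, a, b, num_alus):
    """Apply one operation to all element pairs, num_alus elements per step.
    Returns the results and the number of steps (instruction issues) needed."""
    results = []
    steps = 0
    for start in range(0, len(a), num_alus):
        # Every ALU in this step performs the same operation on its own data.
        lanes = zip(a[start:start + num_alus], b[start:start + num_alus])
        results.extend(op(x, y) for x, y in lanes)
        steps += 1
    return results, steps


data_a = [1, 2, 3, 4, 5, 6, 7, 8]
data_b = [10, 10, 10, 10, 10, 10, 10, 10]

# As many ALUs as data elements: one step.
print(simd_execute(operator.add, data_a, data_b, num_alus=8))
# Fewer ALUs than data elements: the same instruction is repeated over slices.
print(simd_execute(operator.add, data_a, data_b, num_alus=2))
```

Because no element depends on any other, the slices in the second call could be streamed through a pipeline without stalls.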

‘Intelligent memory’ SIMD machines

Uses layout and technology of modern RAM chips to get massive parallelism.

Figure labels (RAM chip): row address, column address, 2-D memory bit cell array, ‘row’ select, ‘column’ bit line, (leaky) capacitor, read row, write/‘refresh’ row, read/write amplifiers and data latches, read one bit, write one bit.

Figure labels (intelligent memory): parallel one-bit ‘processors’, column address, control.

Each one-bit ‘processor’: ALU, carry bit, local storage bits, global AND/OR test, left/right neighbour communication.

Takes < 20% extra chip area!
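One way to picture what a row of one-bit ‘processors’ with a carry bit can do is bit-serial addition. The sketch below assumes, without support from the slide, that each number is stored down one column with one bit per memory row; the helper names `to_bit_planes`, `from_bit_planes` and `bit_serial_add` are hypothetical. A Python integer stands in for one whole memory row, so each bitwise operation acts on all columns at once.

```python
def to_bit_planes(values, bits):
    """Store values column-wise: planes[i] holds bit i of every value."""
    planes = []
    for i in range(bits):
        row = 0
        for column, value in enumerate(values):
            row |= ((value >> i) & 1) << column
        planes.append(row)
    return planes


def from_bit_planes(planes, columns):
    """Read the column-wise numbers back out of the bit planes."""
    return [
        sum(((row >> column) & 1) << i for i, row in enumerate(planes))
        for column in range(columns)
    ]


def bit_serial_add(a_planes, b_planes):
    """Add all columns in parallel, one memory row (bit position) per step.
    'carry' models the carry bit kept in every one-bit processor."""
    carry = 0
    sum_planes = []
    for a_row, b_row in zip(a_planes, b_planes):
        sum_planes.append(a_row ^ b_row ^ carry)
        carry = (a_row & b_row) | (carry & (a_row ^ b_row))
    return sum_planes  # the final carry is dropped (wrap-around)


a = [3, 5, 7, 200]
b = [1, 10, 9, 100]
sums = from_bit_planes(bit_serial_add(to_bit_planes(a, 8), to_bit_planes(b, 8)), 4)
print(sums)
```

The addition takes as many steps as there are bits per number, regardless of how many columns there are, which is where the massive parallelism would come from.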

Dataflow processing

It is possible to write functions like (A + B) × (C – D) as an Acyclic Directed Graph (a ‘dataflow graph’).

Figure: inputs A and B feed a ‘+’ node giving (A+B); inputs C and D feed a ‘–’ node giving (C–D); both results feed a ‘×’ node giving (A+B) × (C–D).

– ‘arc’s carry (intermediate) data values
– ‘node’s perform actual operations
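A minimal sketch of evaluating such a graph, reading the operator between the two brackets as a multiplication. The node names `sum`, `diff` and `product` and the function `run_dataflow` are made up for this example.

```python
import operator

# Each node: (operation, names of the arcs it reads). The arc leaving a node
# is named after the node itself.
GRAPH = {
    "sum": (operator.add, ("A", "B")),
    "diff": (operator.sub, ("C", "D")),
    "product": (operator.mul, ("sum", "diff")),
}


def run_dataflow(graph, inputs):
    """Fire every node whose input arcs all carry a value, until none is left."""
    arcs = dict(inputs)      # arc name -> data value
    pending = dict(graph)    # nodes that have not fired yet
    while pending:
        ready = [name for name, (_, sources) in pending.items()
                 if all(source in arcs for source in sources)]
        if not ready:
            raise ValueError("graph has a cycle or a missing input")
        # All nodes in 'ready' are independent and could fire in parallel.
        for name in ready:
            operation, sources = pending.pop(name)
            arcs[name] = operation(*(arcs[source] for source in sources))
    return arcs


values = run_dataflow(GRAPH, {"A": 1, "B": 2, "C": 7, "D": 3})
print(values["product"])
```

Note that there is no program counter: the order of execution follows purely from which arcs already carry data.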

Dataflow graph memory and decisions

These functions require special node types! EXAMPLES:

Memory node:
– read: ‘read address’ in, ‘read data’ out
– write: ‘write address’ and ‘data to write’ in

Decision node (decision: A > 15), with input A:
– output T carries A when A > 15, nil when A <= 15
– output F carries nil when A > 15, A when A <= 15
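Both special node types can be sketched in a few lines. Here Python's `None` stands in for ‘nil’, and the names `decision_node` and `MemoryNode` are assumptions made for the example.

```python
def decision_node(value, predicate):
    """Return (t_out, f_out): the value leaves on the arm the decision selects,
    the other arm carries None, standing in for 'nil'."""
    if predicate(value):
        return value, None
    return None, value


class MemoryNode:
    """Memory node with separate read and write inputs."""

    def __init__(self):
        self.cells = {}

    def write(self, address, data):
        self.cells[address] = data

    def read(self, address):
        # An address that was never written yields None in this sketch.
        return self.cells.get(address)


def is_above_15(a):
    return a > 15


print(decision_node(20, is_above_15))  # value leaves on the T output
print(decision_node(15, is_above_15))  # value leaves on the F output

memory = MemoryNode()
memory.write(4, 99)
print(memory.read(4))
```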

Dataflow processor architecture

Data stored in ‘packets’ (‘tokens’) with arc ID.

Block diagram labels (abstract EXAMPLE):
– input, output
– packet buffers (RAM) & queueing
– packet grouping and operation selection
– graph structure memory
– node operations
– arbitration & scheduling
– (universal) ALUs, each with inputs data_1, data_2, operation/ID and output ‘out’
– routing using packet ID
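The interplay of these blocks can be sketched as a token loop. This is a toy model, not the architecture on the slide: the tables `ROUTES` and `NODES`, the arc names and the function `run_tokens` are assumptions, every node has exactly two operands, and a single software loop plays the role of all ALUs.

```python
import operator
from collections import deque

# Graph structure memory: arc ID -> (destination node, operand position).
ROUTES = {
    "A": ("add", 0), "B": ("add", 1),
    "C": ("sub", 0), "D": ("sub", 1),
    "add_out": ("mul", 0), "sub_out": ("mul", 1),
}
# Node operations: node -> (operation, arc ID of its output).
NODES = {
    "add": (operator.add, "add_out"),
    "sub": (operator.sub, "sub_out"),
    "mul": (operator.mul, "result"),
}


def run_tokens(tokens):
    """Process (arc ID, value) packets until the packet buffer is empty."""
    buffer = deque(tokens)   # packet buffer with queueing
    waiting = {}             # packet grouping: node -> {position: value}
    outputs = {}
    while buffer:
        arc, value = buffer.popleft()
        if arc not in ROUTES:
            outputs[arc] = value          # no destination node: leaves the graph
            continue
        node, position = ROUTES[arc]      # routing using the packet's arc ID
        operands = waiting.setdefault(node, {})
        operands[position] = value
        if len(operands) == 2:            # both operands present: node fires
            operation, out_arc = NODES[node]
            del waiting[node]
            buffer.append((out_arc, operation(operands[0], operands[1])))
    return outputs


result = run_tokens([("A", 1), ("C", 7), ("B", 2), ("D", 3)])
print(result)
```

The packets may arrive in any order; grouping by destination node is what makes the result independent of arrival order.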

A ‘fuzzy logic’ example rule-set

Control the brakes of a car with a radar sensor:
– Measuring frequency shift gives speed difference ΔV (due to Doppler effect)
– Measuring transit time gives distance D

1) IF (ΔV negative) THEN do not brake
2) IF (ΔV low AND D large) THEN do not brake
3) IF (ΔV low AND D small) THEN brake gently
4) IF (ΔV high AND D large) THEN brake gently
5) IF (ΔV high AND D small) THEN brake strongly

Fuzzy sets, or, ‘what is high and low?’

A fuzzy set classifies a ‘scalar’ value as a set of ‘truth values’ in the range 0..1.

Figure: truth (0 to 1) against relative speed ΔV (km/hr, axis labelled up to +50) for the sets ‘ΔV negative’, ‘ΔV low’ and ‘ΔV high’.

Figure: truth (0 to 1) against distance D (m) for the sets ‘D small’ and ‘D large’.

Figure annotations:
– Needed for smooth braking (switch between rules 1 and 3)
– ‘still low, but also a bit high’
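The scalar-to-fuzzy-set conversion can be sketched with piecewise-linear membership functions. The breakpoints below (-5, 0, 10 and 30 km/hr) are invented for illustration; the actual set shapes were in the figure and are not recoverable here. The function names are hypothetical as well.

```python
def falling(x, start, end):
    """Truth 1 up to 'start', falling linearly to 0 at 'end'."""
    if x <= start:
        return 1.0
    if x >= end:
        return 0.0
    return (end - x) / (end - start)


def rising(x, start, end):
    """Truth 0 up to 'start', rising linearly to 1 at 'end'."""
    return 1.0 - falling(x, start, end)


def fuzzify_speed(dv):
    """Classify a relative speed (km/hr) into three truth values in 0..1."""
    return {
        "negative": falling(dv, -5.0, 0.0),
        # 'low' is a plateau between the two overlapping slopes.
        "low": min(rising(dv, -5.0, 0.0), falling(dv, 10.0, 30.0)),
        "high": rising(dv, 10.0, 30.0),
    }


print(fuzzify_speed(15.0))
```

With these assumed shapes, a relative speed of 15 km/hr is mostly ‘low’ but already partly ‘high’, which is the overlap that lets the braking force change smoothly instead of jumping between rules.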

Rewriting logic into fuzzy rules

Replace AND by MIN(imum), OR by MAX(imum).

Combine rules 1 & 2 using normal logic:
  do not brake IFF (ΔV negative) OR (ΔV low AND D large)

Combination of rules 1 & 2 in fuzzy logic:
  do not brake := MAX(ΔV negative, MIN(ΔV low, D large))

Combination of rules 3 & 4 in fuzzy logic:
  brake gently := MAX(MIN(ΔV low, D small), MIN(ΔV high, D large))

Rule 5 in fuzzy logic:
  brake strongly := MIN(ΔV high, D small)
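The three combined rules translate directly into code. The function name `brake_rules` and its parameter names are assumptions; the MIN/MAX structure is taken from the rules as written, with the ΔV set in rule 2 read as ‘low’ to match the rule-set.

```python
def brake_rules(dv_negative, dv_low, dv_high, d_small, d_large):
    """Evaluate the five braking rules on truth values in the range 0..1.
    AND becomes min(), OR becomes max()."""
    return {
        "do not brake": max(dv_negative, min(dv_low, d_large)),
        "brake gently": max(min(dv_low, d_small), min(dv_high, d_large)),
        "brake strongly": min(dv_high, d_small),
    }


# Example truth values, chosen arbitrarily for the demonstration.
outputs = brake_rules(dv_negative=0.0, dv_low=0.75, dv_high=0.25,
                      d_small=0.6, d_large=0.4)
print(outputs)
```

When every input is exactly 0 or 1 the functions reduce to ordinary Boolean AND and OR, so normal logic is a special case of the fuzzy rules.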

From fuzzy sets back to scalar values

Figure: truth (0 to 1) against braking force (m/s², axis ticks 0 and 5) for the sets ‘do not brake’, ‘brake gently’ and ‘brake strongly’.

How to calculate the scalar value ‘Braking force’ from the fuzzy set ‘do not brake’ = 0.2, ‘brake gently’ = 0.8 and ‘brake strongly’ = 0.4:

1: scale fuzzy set graphs to their truth values
2: calculate centre of gravity and surface area under the individual scaled graphs
3: calculate weighted sum of centres of gravity and its position
4: read scalar result value as position of weighted sum along the X-axis
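The four steps can be sketched for triangular output sets, taking the surface areas as the weights in step 3. The triangle corner positions in `OUTPUT_SETS` are invented, since the real shapes were in the figure; `defuzzify` is a hypothetical name. For a triangle with corners at left, peak and right on the X-axis, the centre of gravity lies at (left + peak + right) / 3, and scaling the height does not move it.

```python
def defuzzify(truths, triangles):
    """Centre-of-gravity defuzzification for triangular output sets.
    truths: set name -> truth value; triangles: set name -> (left, peak, right)."""
    total_area = 0.0
    total_moment = 0.0
    for name, truth in truths.items():
        left, peak, right = triangles[name]
        # Steps 1 and 2: scale the triangle height to the truth value, then
        # take the surface area and centre of gravity of the scaled graph.
        area = truth * (right - left) / 2.0
        centre = (left + peak + right) / 3.0
        # Step 3: accumulate the area-weighted sum of the centres.
        total_area += area
        total_moment += area * centre
    if total_area == 0.0:
        raise ValueError("all truth values are zero")
    # Step 4: the position of the weighted sum along the X-axis.
    return total_moment / total_area


# Assumed set shapes on a braking force axis in m/s^2.
OUTPUT_SETS = {
    "do not brake": (0.0, 1.0, 2.0),
    "brake gently": (1.0, 2.5, 4.0),
    "brake strongly": (3.0, 4.0, 5.0),
}
force = defuzzify(
    {"do not brake": 0.2, "brake gently": 0.8, "brake strongly": 0.4},
    OUTPUT_SETS,
)
print(round(force, 2))
```

With these assumed shapes the truth values from the slide give a braking force of about 2.67 m/s²; other shapes would give a different number.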

Fuzzy logic processors

Non time-critical applications: normal processors
– Compilers exist: fuzzy set + logic rules → C program

Fuzzy logic rules (‘MIN’/‘MAX’) processors exist
– Sometimes include scalar → fuzzy set conversion
– Almost never include fuzzy set → scalar conversion
– Advantageous to build them as dataflow machine

Fuzzy logic calculations need not be very precise
– Possible to build ‘processor’ with analog electronics!

Massive parallel fuzzy: neural networks

Example: handwriting recognition

Figure: neural network with three layers of neurons: feature extract, feature combine, decide and encode. The output is (hex) = ASCII ‘A’. One element is marked ‘a neuron (one of many)’.

Lots of inputs, fewer outputs: each output is a combination of inputs.

A single neuron

Figure: each input is multiplied by a constant (×c1, ×c2, ×c3 ... ×cn); the multiply results are summed; the transfer function (decide, scale & clip) produces the output.

Constants can be negative
– ‘Inhibit’ or ‘blocking’ input

Transfer function can have many forms

Implemented like fuzzy
– Neural to C compiler
– Dataflow architecture

No high precision needed
– Analog electronics possible

Can be made self-learning
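The multiply, sum and transfer steps of one neuron can be sketched as follows. The clipping transfer function is just one of the many possible forms, and the names `neuron` and `clip_transfer` as well as the example constants are assumptions.

```python
def clip_transfer(x, low=0.0, high=1.0):
    """One possible transfer function: clip the sum to the range low..high."""
    return max(low, min(high, x))


def neuron(inputs, constants, transfer=clip_transfer):
    """Multiply each input by its constant, sum the results, apply the transfer."""
    if len(inputs) != len(constants):
        raise ValueError("need exactly one constant per input")
    total = sum(x * c for x, c in zip(inputs, constants))
    return transfer(total)


# The negative third constant makes its input an 'inhibit' input.
print(neuron([1.0, 0.5, 1.0], [0.5, 1.0, -0.25]))
```

A layer of the network is then a list of such neurons that share the same inputs but have their own constants; self-learning would mean adjusting the constants, which this sketch does not attempt.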