Download presentation
Presentation is loading. Please wait.
Published byJaylen Renshaw Modified over 9 years ago
1
Multi-cellular paradigm The molecular level can support self- replication (and self- repair). But we also need cells that can be designed to fit the specific application and at the same time able to support bio- inspired mechanisms for self-replication and fault tolerance.
2
Cellular differentiation Cells adapt their physical structure to fit the “application” Can circuits/processors do the same? Physically? No Logically? Yes, but… Can they do it easily (dare we say, automatically)?
3
Conventional processors
4
Fetch, decode and control unit
5
Instruction encoding Instructions encode both the operation and the operands. For example, in the MIPS architecture
6
Instruction encoding in “real life”
7
The Arithmetic and Logic Unit
8
Common processor components
9
State of the art computing
10
Bio-inspired processors However, none of these “standard” architectures is quite flexible enough to implement many of the behaviours required for bio- inspired computing Needed: adaptable cellular architecture That is, a processor architecture that is Customizable Compact Powerful Easy to design and modify Amenable to evolution and learning Possible solution: MOVE architectures
11
The MOVE paradigm One single instruction : move Data displacements trigger operations Architecture based around data ≠ operation centric Regular structure : functional units + data network Scalable and modular architecture Example: Sum of two values Conventional architecture: add R1, R2, R3; MOVE architecture: move O(Fxxx), I1(Fsum) move O(Fyyy), I2(Fsum) move O(Fsum), I(Fzzz)
12
Example – add operation
13
Cellular differentiation Main features: Only one instruction (OK, maybe two) that MOVEs data to and from the CUs and FUs (dataflow architecture) Conventional fetch/decode mechanism – compatible with bio-inspired mechanisms No pipeline: computation carried out in specialized functional units (FU) Communication carried out in specialized communication units (CU)
14
Cellular differentiation Main advantages: Can be easily customized by introducing application- specific functional and communication units. Perfectly fits the requirements of systolic arrays (arbitrarily complex communication patterns). The introduction of custom components does not affect the assembler language, the code structure, the fetch and decode units, or the transport bus.
15
Genotype Layer Phenotype Layer Example – Automatic Synthesis Application-specific (parallel) functions Developmental algorithm Genetic code Mapping Layer
16
Example – Automatic Synthesis Phenotype Layer Mapping Layer Genotype Layer Totipotent Cell
17
Example – Automatic Synthesis Totipotent Cell Programmable Logic
18
Example – Automatic Synthesis Programmable Logic Cellular Array
19
What kind of applications can take advantage of this kind of system? Complex "real-world" streaming applications computation is carried out sequentially can be represented by a DAG of computation nodes each node processes data locally then forwards them to the next node in the graph Applications ×+÷≠ FFT + × DCT INOUT
20
READDCTQNTZCMPRWRT Example: JPEG Specialized MOVE functional units can be designed for each of these steps INOUT
21
Programmable substrate ×+÷≠ FFT + × DCT Context INOUT Problem: task or resource allocation – i.e. how do we map the graph nodes to the array? Specifically: dynamic allocation
22
Self-Scaling Stream Processing Source Funct A Funct B Funct C JoinFunct A Funct C Funct A Funct C Funct A Funct C
24
SSSP The MJPEG application consists of a four-stage computation pipeline. The data to be compressed are composed of 192 bytes corresponding to an 8x8 array of pixels using 24- bit colour. The maximum rate achievable (determined by the input rate) is of 700 packets per second - roughly 1 MBit/second. With a single pipeline, the performance tops at about 60 packets per second.
26
SSSP When performance peaks, the average output rate is of 675 packets per second (out of a maximum of 700): this technique allows to multiply the throughput by a factor of 11 using 28 processors.
27
Next lecture What kind of tools have to be developed to implement a complete system? How do we determine optimal Fus for a given application? Idea: let’s use evolution!
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.