Download presentation
Presentation is loading. Please wait.
1
Alternative Parallel Processing Approaches
Some people argue that real breakthroughs in computational power will occur only by abandoning the von Neumann model. Numerous efforts are now underway to devise systems that could change the way that we think about computers and computation. In this section, we will look at three of these: dataflow computing, neural networks, and systolic processing. 1 1
2
Alternative Parallel Processing Approaches – Dataflow Computing
The ole Von Neumann machines exhibit sequential control flow: A linear stream of instructions is fetched from memory, and they act upon data. Program flow changes under the direction of branching instructions. In dataflow computing, program control is directly controlled by data dependencies. There is no program counter or shared storage. Data flows continuously and is available to multiple instructions simultaneously. A data flow graph represents the computation flow in a dataflow computer. Its nodes contain the instructions and its arcs indicate the data dependencies. 2 2
3
Alternative Parallel Processing Approaches – Dataflow Computing
When a node has all of the data tokens it needs, it fires, performing the required operation, and consuming the token. The result is placed on an output arc. The architecture of a dataflow computer consists of processing elements that communicate with one another. Each processing element has an enabling unit that sequentially accepts tokens and stores them in memory. If the node to which this token is addressed fires, the input tokens are extracted from memory and are combined with the node itself to form an executable packet. 3 3
4
Alternative Parallel Processing Approaches – Neural Networks
Neural network computers consist of a large number of simple processing elements that individually solve a small piece of a much larger problem. They are particularly useful in dynamic situations that are an accumulation of previous behavior, and where an exact algorithmic solution cannot be formulated. Neural networks can deal with imprecise, probabilistic information, and allow for adaptive interactions. Neural network processing elements (PEs) multiply a set of input values by an adaptable set of weights to yield a single output value. The computation carried out by each PE is simplistic when compared to a traditional microprocessor. Their power lies in their massively parallel architecture and their ability to adapt to the dynamics of the problem space. Neural networks learn from their environments. A built-in learning algorithm directs this process. 4 4
5
Alternative Parallel Processing Approaches – Neural Networks
The simplest neural net PE (processing element) is the perceptron. Perceptrons are trainable neurons. A perceptron produces a Boolean output based upon the values that it receives from several inputs. Perceptrons are trainable because the threshold and input weights are modifiable. In this example, the output Z is true (1) if the net input, w1x1 + w2x wnxn is greater than the threshold T. The biggest problem with neural nets is that when they consist of more than 10 or 20 neurons, it is impossible to understand how the net is arriving at its results. They can derive meaning from data that are too complex to be analyzed by people. Despite early setbacks, neural nets are gaining credibility in sales forecasting, data validation, and facial recognition. 5 5
6
Alternative Parallel Processing Approaches – Systolic Arrays
Systolic Arrays are a network of processing elements the methodically compute data by circulating it through the system. Systolic arrays, a variation of SIMD computers, have simple processors that process data by circulating it through vector pipelines. Systolic arrays can sustain great throughout because they employ a high degree of parallelism. Connections are short, and the design is simple and scalable. They are robust, efficient, and cheap to produce. They are, however, highly specialized and limited as to they types of problems they can solve. 6 6
7
Future Alternatives Computers, as we know them are binary, transistor-based systems. But transistor-based systems are strained to keep up with our computational demands. Transistors are becoming so small that it is hard for them to hold electrons in the way in which we're accustomed to. Thus, an alternative to transistor-based systems is Optical or Photonic Computing systems Rather than electrons performing logic, photons of laser light are used Speed of light could be the upper limit in speed No heat dissipation issues And light beams can travel in parallel – increasing performance even more Many years before this alternative makes it to the mainstream Another alternative to transistor-based systems is Biological Computing systems Uses components from living organisms instead of inorganic silicon ones. Example 1: using neurons from leeches – able to control the behavior Example 2: using DNA as software and enzymes as hardware (called DNA computing) – use DNA strands to test all solutions at once and output a correct answer Example 3: using bacteria that can turn genes on and off in predictable ways 7 7
8
Future Alternatives Another alternative to transistor-based systems is Quantum Computing systems Quantum computing uses quantum bits (qubits) that can be in multiple states at once. The "state" of a qubit is determined by the spin of an electron. A thorough discussion of "spin" is under the domain of quantum physics. A qubit can be in multiple states at the same time. This is called superpositioning. A 3-bit register can simultaneously hold the values 0 through 7 8 operations can be performed at the same time. This phenomenon is called quantum parallelism. A system with 600 qbits can superposition states 8 8
9
Quantum Computing Quantum computers may be applied in the areas of cryptography, true random-number generation, and in the solution of other intractable problems. Making effective use of quantum computers requires rethinking our approach to problems and the development of new algorithms. There is a law (Rose’s Law) that states that the number of qubits that can be assembled to successfully perform computations will double every 12 months; this has been precisely the case for the past nine year One of the largest obstacles to the progress of quantum computation is the tendency for qubits to decay into a state of decoherence. Decoherence causes uncorrectable errors. Advanced error-correction algorithms have been applied to this problem and show promise. Much research remains to be done, however. 9 9
10
- Quantum Computing - The realization of quantum computing has raised questions about technological singularity. Technological singularity is the theoretical point when human technology has fundamentally and irreversibly altered human development This is the point when civilization changes to an extent that its technology is incomprehensible to previous generations. Are we there, now ?????????? 10 10
11
Pipelining T ime Pipelining is a way of effectively organizing concurrent activities in improving performance I I I 1 2 3 F E F E F E 1 1 2 2 3 3 (a) Sequential execution Interstage buffer B1 Instruction Ex ecution fetch unit unit (b) Hardware organization T ime Clock cycle 1 2 3 4 Instruction I F E 1 1 1 I F E 2 2 2 I F E 3 3 3 (c) Pipelined execution Figure 8.1. Basic idea of instruction pipelining.
12
Chapter 8 section 1 - Pipelining
13
Chapter 8 section 1 - Pipelining
Pipelining only works if the various tasks take about the same amount of time. Fetching from main memory can be ten-fold longer than other tasks (caching). This stall is called a “data hazard” because the source or destination operand is not available when expected in the pipeline.
14
Chapter 8 section 1 - Pipelining
You can also experience a stall due to an instruction NOT being available when expected. This stall called an “instruction hazard” or “control hazard” – caused by a missed caching opportunity which causes the processor to fetch from main memory. Instruction I2 fetch is delayed due to cache miss Instruction I1 fetch was fine
15
Chapter 8 section 1 - Pipelining
You can also experience a stall due to TWO instructions requiring the use of the same hardware resources This stall called a “structural hazard”. A A good example would be, one instruction needing access to memory for an EXECUTE or WRITE, while some other instruction need access at the same time for a FETCH. Having separate instruction and data caches can help with this problem. Example: Load X(R1),R2 Operand read is written into register R2 in cycle 6 Memory access takes place in cycle 5 Memory address X+[R1] is computed in step E2 Causes stall for I3 because both require access to the SAME register during cycle 6
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.