Software and Hardware Implementation of Cellular Automata for Structural Analysis and Design Zafer Gürdal * & Mark T. Jones ** Virginia Tech *Depts. of.

Software and Hardware Implementation of Cellular Automata for Structural Analysis and Design Zafer Gürdal * & Mark T. Jones ** Virginia Tech *Depts. of Aerospace and Ocean Eng., & Engineering Science and Mechanics **The Bradley Department of Electrical and Computer Engineering 06/17/03 National Institute of Aerospace, Hampton VA Support  NASA LaRC, NRA 98, Innovative Algorithms for Aerospace Engineering Analysis and Optimization, PM: Jarek Sobieski  NASA LaRC, Mechanics and Durability Branch, PM: Damodar Ambur  Virginia Tech, ASPIRES Program

CA Software Hardware Implementation June 17, 2003 Outline  Introduction  Evolutionary Design  Elements of Cellular Automata  CA applied to Engineering Design  Truss Domain  Composite Laminate Design  Hardware Implementation  Configurable Computing – FPGAs  CA Implementation Results  Multigrid Acceleration

CA Software Hardware Implementation June 17, 2003 Evolutionary Design  Mimic natural evolution of biological systems for structural design  Evolutionary design often relies on local optimality/decision making of independent parts  Examples: Reaction wood Bone growth  Cellular Automata: Decomposition of a seemingly complex macro behavior into basic small local problems

CA Software Hardware Implementation June 17, 2003 Evolutionary Design of Structures Evolutionary Design Genetic Algorithms Species ESO,MMD,CA Individual Designs Cellular Automata Local Evolution of Analysis and Design ESO, MMD Local Rules for Design, Global Analysis

CA Software Hardware Implementation June 17, 2003 Cellular Automata  Weiner (1946), Ulam (1952), von Neumann (1966) –Automata Networks –Cell Dynamic Scheme  Idealizations of complex natural systems –Flock behavior –Diffusion of gaseous systems –Solidification and crystal growth –Hydrodynamic flow and turbulence  General characteristics –Locality –Vast Parallelism –Simplicity

CA Software Hardware Implementation June 17, 2003 Elements of Cellular Automata  Cell Definitions  Lattice Configurations  Neighborhoods  Boundaries  Update rules  Iteration Schemes

CA Software Hardware Implementation June 17, 2003 Elements of Cellular Automata  Definition for state of a cell and update rule time step cell ID  Two-dimensional Lattice Configurations Rectangular Triangular Hexagonal Neighborhood cells Center cell

CA Software Hardware Implementation June 17, 2003 Neighborhood Definition  Rectangular Neighborhoods von Neumann Moore MvonN N S E W N S E W SE NENW SW N S E W SE NENW SW EE SS WW NN  Boundaries  Periodic  Location Specific

CA Software Hardware Implementation June 17, 2003 Update Rule – 2D Truss Domain Analysis Ground Structure uCuC vCvC C N S E NWNW NENE SWSW SE u SE vSE vSE W  Single Cell  Displacement Update:

CA Software Hardware Implementation June 17, 2003 Sample Truss Analysis Results UndeformedCA AnalysisFEM Analysis Linear Analysis Nonlinear Analysis Applied force or displacement

CA Software Hardware Implementation June 17, 2003 Linear vs. Nonlinear Analysis Linear analysis Nonlinear analysis # of iterations total reaction 1641 iterations 2985 iterations

CA Software Hardware Implementation June 17, 2003 Sizing/Design Rules  Local Optimization Formulation  Sequential Move and Size  Fully Stressed Design Geometry & Basic Ground Structure CDF = 1 75 kN 100 kN 40 m 60 m Dense Truss Solution (CDF = 40)

CA Software Hardware Implementation June 17, 2003 Design of Fiber Reinforced Panels  Minimum Compliance Design where  (x,y): fiber angle distribution Principal Strain Direction  Minimum Strain Energy Density (Pedersen 1990)

CA Software Hardware Implementation June 17, 2003 Panel with a Circular Hole in Shear Optimality Criteria (OC) Design Quarter Panel Model

CA Software Hardware Implementation June 17, 2003 Panel with a Circular Hole in Shear Pattern Matching + OC Design Pattern Matching + Discrete Design

CA Software Hardware Implementation June 17, 2003 Panel with a Circular Hole in Shear Topology + Orientation Design Topology + Discrete Fiber Orientation

CA Software Hardware Implementation June 17, 2003  Domain Modeled === Hardware Domain  Current parallel architectures are limited  Specialized CA machines mimicking CA domains Hardware Integration

CA Software Hardware Implementation June 17, 2003 Configurable Computing and Field Programmable Gate Arrays (FPGAs)

CA Software Hardware Implementation June 17, 2003 Definitions and Potential  Configurable computers are a relatively new class of computer architecture in which hardware circuits are (re-)configured for a specific algorithm  Offer “ASIC-like” speeds without the cost of designing and fabricating a chip –ASIC cost can run into many millions –General-purpose CPUs are slow  Configurable computers are often built using FPGAs because of their widespread availability (>>$1B market)

CA Software Hardware Implementation June 17, 2003 Field Programmable Gate Array (FPGA) Layout  An FPGA consists of a large array of Configurable Logic Blocks (CLBs) - typically 1,000 to 8,000 CLBs per chip  Each CLB contains registers and LUTs, where each LUT can implement a 4-input logic operation  By programming the CLBs and interconnections large circuits can be represented in the FPGA  One Xilinx XC2V4000 FPGA can represent a circuit up to 1M gates

CA Software Hardware Implementation June 17, 2003 DINI DN3000k10 Board  DINI DN3000k10 is an FPGA based PCI card  Contains five Xilinx XCV4000 FPGAs connected by a 226 bit wide bus  One of the FPGAs has a separate connection for communicating to a PC via the PCI bus  FPGAs can be configured through the PCI bus or configurations can be stored on board

CA Software Hardware Implementation June 17, 2003 Algorithms for FPGAs  Target FPGA strengths: parallel, pipelined, customized –Goal is to have every part of the chip actively computing at the highest possible clock speed  Do: re-think the algorithm to –Expose the natural parallelism –Pipeline time-consuming operations –Examine the precision that is really necessary  Do not: Implement algorithms as you would in software on a traditional computer

CA Software Hardware Implementation June 17, 2003 Multiplier Options Usage (% CLBs)* *Percentage of CLBs used in a XC2V4000, the XC2C4000 contains 5760 CLBs

CA Software Hardware Implementation June 17, 2003 Application Performance  HokieGene – Genome Matching Project (2003) –Matching engine executes on one FPGA (XC2V1000) –Performs 200 billion cell updates per second –1,200 billion operations per second (1.2 TOPS)  BYU - Network Intrusion Detection Systems (2002) –Hardware implementation uses one FPGA (XC2V1000) –Outperformed software version running on P3 – 750MHz: Up to 400 times more throughput than software version Up to 1000 times less latency than software version  Xilinx – High Performance DES Encryption (2000) –Implemented on one small FPGA (XCV150) –Maximum throughput 10.75 GB/sec –Outperformed best ASIC implementation  University of Texas at Austin – Target Recognition System (2000) –System built using one FPGA (ORCA 40k) and Myrinet interfacing –Capable of processing 900 templates per second –2,800 billion operations per second (2.8 TOPS)

CA Software Hardware Implementation June 17, 2003 Iterative Methods for Linear Systems  Consider Jacobi’s method –D x i+1 = (D-A) x i + b –In software, we would select either single or double precision floating point  On a configurable computer we can select any format in which to store/compute value –Choose the desired precision of the solution –Reconstruct the method for fast computation

CA Software Hardware Implementation June 17, 2003 Iterative Methods Continued  Re-cast as iterative improvement scheme r i = b - A x i  Compute in n bits  x i = A –1 r i  Compute in k bits x i+1 = x i +  x i = A –1 r i  Compute in n bits  Use Jacobi to solve for  x i in compact, fast k-bit hardware (cost ~ bits 2 )  Thm: Convergence rate is independent of k  Thm: Optimal choice of k ~ n/(# iterations) 1/3

CA Software Hardware Implementation June 17, 2003 Convergence  Solution Error vs. Number of Iterations  K= 3,6,9 decimal digits  No difference in convergence rate

CA Software Hardware Implementation June 17, 2003 Performance Advantage  Execution Cost (number of bit operations) vs. the size of the matrix  Compares cost of normal vs. modified algorithm  Convergence for each algorithm is identical

CA Software Hardware Implementation June 17, 2003 Euler Beam Formulation x y F hh w L,θ L w C,θ C w R,θ R FCFC Control Volume MCMC FRFR MRMR FLFL MLML d(x)  Cell Neighborhood  Cell Equilibrium

CA Software Hardware Implementation June 17, 2003 Cellular Automata Model Multiple Cells per Processing Element

CA Software Hardware Implementation June 17, 2003 Beam Design  residual  error  correction  Equilibrium Update  Design Update Converged Design Update Converged End Equilibrium Update NO YES

CA Software Hardware Implementation June 17, 2003 Algorithm Strategy  The limited precision algorithm illustrated for Jacobi’s method earlier is applied to CA –Much smaller, faster circuits for applying CA rule updates in k-bit operations –Built-in 18x18 multipliers compute residual  Built-in high-speed memories provide –Storage for intermediate and permanent quantities –Many customizable word-lengths –Extremely high memory bandwidth

CA Software Hardware Implementation June 17, 2003 Processing Element

CA Software Hardware Implementation June 17, 2003 FPGA Performance Cell Updates Per Second (Millions)

CA Software Hardware Implementation June 17, 2003 CA Performance

CA Software Hardware Implementation June 17, 2003 Multigrid Acceleration x y F lattice 8h lattice 4h lattice 2h lattice h E : Equilibrium update to convergence : Equilibrium updated α times S S S S E S S S h 2h2h 4h4h 8h8h lattice V - cycle S S S E S E S S S E S E S S S h 2h2h 4h4h 8h8h lattice W - cycle : Restriction (on r) : Prolongation (on e)

CA Software Hardware Implementation June 17, 2003 Prolongation lattice 2h lattice h

CA Software Hardware Implementation June 17, 2003 Prolongation/Restriction lattice 2h lattice h  Correction Prolongation  Residual Restriction where Prolongation Operator

CA Software Hardware Implementation June 17, 2003 Design with 5 Cells: Design with 17 Cells: Design with 65 Cells: ~ Design with 257 Cells: ~ ~ Design with 3 Cells: Nested Iteration for MG accelerated CA d(x)

CA Software Hardware Implementation June 17, 2003 CA Design Performance with Full MG 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 10 8 1101001000 Number of Cells Total number of cell updates

CA Software Hardware Implementation June 17, 2003 Concluding Remarks  Summary –CA paradigm has been demonstrated for various structural systems –CA paradigm matches well with Configurable Computing acceleration –Full Multigrid acceleration for CA improves design convergence  Future Work –Expand the design capabilities in terms of structural details and the types of field problems that can be solved –Tools that will enable engineers to effortlessly use configurable computers for CA applications –Continue to investigate algorithms to improve CA performance

Software and Hardware Implementation of Cellular Automata for Structural Analysis and Design Zafer Gürdal * & Mark T. Jones ** Virginia Tech *Depts. of.

Similar presentations

Presentation on theme: "Software and Hardware Implementation of Cellular Automata for Structural Analysis and Design Zafer Gürdal * & Mark T. Jones ** Virginia Tech *Depts. of."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Software and Hardware Implementation of Cellular Automata for Structural Analysis and Design Zafer Gürdal * & Mark T. Jones ** Virginia Tech *Depts. of.

Similar presentations

Presentation on theme: "Software and Hardware Implementation of Cellular Automata for Structural Analysis and Design Zafer Gürdal * & Mark T. Jones ** Virginia Tech *Depts. of."— Presentation transcript:

Similar presentations

About project

Feedback