 Device and architecture co-optimization – Large search space – Need fast yet accurate power and delay estimator for FPGAs  Trace-based power and delay.

Slides:



Advertisements
Similar presentations
F-tests continued.
Advertisements

Device and Architecture Co-Optimization for FPGA Power Reduction Lerong Cheng, Phoebe Wong, Fei Li, Yan Lin, and Prof. Lei He EE Department, UCLA Partially.
Non-Gaussian Statistical Timing Analysis Using Second Order Polynomial Fitting Lerong Cheng 1, Jinjun Xiong 2, and Lei He 1 1 EE Department, UCLA *2 IBM.
Slide 1 Bayesian Model Fusion: Large-Scale Performance Modeling of Analog and Mixed- Signal Circuits by Reusing Early-Stage Data Fa Wang*, Wangyang Zhang*,
Recent Challenges. 2 Soft Errors Scaling:  SEU (Single-event upset): −Ionizing radiation corrupts data stored  Cause: −Radioactive impurities in device.
1 Low-Dose Dual-Energy CT for PET Attenuation Correction with Statistical Sinogram Restoration Joonki Noh, Jeffrey A. Fessler EECS Department, The University.
On the Need for Statistical Timing Analysis Farid N. Najm University of Toronto
Use of Kalman filters in time and frequency analysis John Davis 1st May 2011.
Data Modeling and Parameter Estimation Nov 9, 2005 PSCI 702.
Stochastic Analog Circuit Behavior Modeling by Point Estimation Method
An Efficient Method for Chip-Level Statistical Capacitance Extraction Considering Process Variations with Spatial Correlation W. Zhang, W. Yu, Z. Wang,
0 1 Width-dependent Statistical Leakage Modeling for Random Dopant Induced Threshold Voltage Shift Jie Gu, Sachin Sapatnekar, Chris Kim Department of Electrical.
Parameterized Timing Analysis with General Delay Models and Arbitrary Variation Sources Khaled R. Heloue and Farid N. Najm University of Toronto {khaled,
The continuous scaling trends of smaller devices, higher operating frequencies, lower power supply voltages, and more functionalities for integrated circuits.
Statistical Full-Chip Leakage Analysis Considering Junction Tunneling Leakage Tao Li Zhiping Yu Institute of Microelectronics Tsinghua University.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Non-Linear Statistical Static Timing Analysis for Non-Gaussian Variation Sources Lerong Cheng 1, Jinjun Xiong 2, and Prof. Lei He 1 1 EE Department, UCLA.
Chapter 3 Simple Regression. What is in this Chapter? This chapter starts with a linear regression model with one explanatory variable, and states the.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Statistical Crosstalk Aggressor Alignment Aware Interconnect Delay Calculation Supported by NSF & MARCO GSRC Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego.
Dimensional reduction, PCA
Design Sensitivities to Variability: Extrapolations and Assessments in Nanometer VLSI Y. Kevin Cao *, Puneet Gupta +, Andrew Kahng +, Dennis Sylvester.
Power Modeling and Architecture Evaluation for FPGA with Novel Circuits for Vdd Programmability Yan Lin, Fei Li and Lei He EE Department, UCLA
Stochastic Physical Synthesis for FPGAs with Pre-routing Interconnect Uncertainty and Process Variation Yan Lin and Lei He EE Department, UCLA
© 2005 Altera Corporation © 2006 Altera Corporation Placement and Timing for FPGAs Considering Variations Yan Lin 1, Mike Hutton 2 and Lei He 1 1 EE Department,
 Y. Hu, V. Shih, R. Majumdar and L. He, “Exploiting Symmetries to Speedup SAT-based Boolean Matching for Logic Synthesis of FPGAs”, TCAD  Y. Hu,
SAMSON: A Generalized Second-order Arnoldi Method for Reducing Multiple Source Linear Network with Susceptance Yiyu Shi, Hao Yu and Lei He EE Department,
Fast Buffer Insertion Considering Process Variation Jinjun Xiong, Lei He EE Department University of California, Los Angeles Sponsors: NSF, UC MICRO, Actel,
Circuit Performance Variability Decomposition Michael Orshansky, Costas Spanos, and Chenming Hu Department of Electrical Engineering and Computer Sciences,
Statistical Gate Delay Calculation with Crosstalk Alignment Consideration Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego
Trace-Based Framework for Concurrent Development of Process and FPGA Architecture Considering Process Variation and Reliability 1 Lerong Cheng, 1 Yan Lin,
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
ECE 506 Reconfigurable Computing Lecture 8 FPGA Placement.
Component Reliability Analysis
Hypothesis Testing in Linear Regression Analysis
1 Miodrag Bolic ARCHITECTURES FOR EFFICIENT IMPLEMENTATION OF PARTICLE FILTERS Department of Electrical and Computer Engineering Stony Brook University.
Power Reduction for FPGA using Multiple Vdd/Vth
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Eng. Mohammed Timraz Electronics & Communication Engineer University of Palestine Faculty of Engineering and Urban planning Software Engineering Department.
Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.
Geographic Information Science
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Statistical Sampling-Based Parametric Analysis of Power Grids Dr. Peng Li Presented by Xueqian Zhao EE5970 Seminar.
Predictive Design Space Exploration Using Genetically Programmed Response Surfaces Henry Cook Department of Electrical Engineering and Computer Science.
Design Space Exploration for Application Specific FPGAs in System-on-a-Chip Designs Mark Hammerquist, Roman Lysecky Department of Electrical and Computer.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Chapter 4: Introduction to Predictive Modeling: Regressions
1 MODELING MATTER AT NANOSCALES 4. Introduction to quantum treatments The variational method.
STA with Variation 1. 2 Corner Analysis PRCA (Process Corner Analysis):  Takes 1.nominal values of process parameters 2.and a delta for each parameter.
On the Assumption of Normality in Statistical Static Timing Analysis Halim Damerdji, Ali Dasdan, Santanu Kolay February 28, 2005 PrimeTime Synopsys, Inc.
1 Carnegie Mellon University Center for Silicon System Implementation An Architectural Exploration of Via Patterned Gate Arrays Chetan Patel, Anthony Cozzie,
1 Chapter 4: Introduction to Predictive Modeling: Regressions 4.1 Introduction 4.2 Selecting Regression Inputs 4.3 Optimizing Regression Complexity 4.4.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
SGPP: Spatial Gaussian Predictive Process Models for Neuroimaging Data Yimei Li Department of Biostatistics St. Jude Children’s Research Hospital Joint.
Department of Electrical and Computer Engineering University of Wisconsin - Madison Optimizing Total Power of Many-core Processors Considering Voltage.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
University of Michigan Advanced Computer Architecture Lab. 2 CAD Tools for Variation Tolerance David Blaauw and Kaviraj Chopra University of Michigan.
F-tests continued.
Chapter 4 Basic Estimation Techniques
Chapter 4b Process Variation Modeling
Chapter 3 Component Reliability Analysis of Structures.
Chapter 4a Stochastic Modeling
Statistical Methods For Engineers
Statistical Optimal Hash-based Longest Prefix Match
Chapter 4a Stochastic Modeling
Filtering and State Estimation: Basic Concepts
Impact of Parameter Variations on Multi-core chips
Post-Silicon Calibration for Large-Volume Products
On the Improvement of Statistical Timing Analysis
Presentation transcript:

 Device and architecture co-optimization – Large search space – Need fast yet accurate power and delay estimator for FPGAs  Trace-based power and delay estimator (Ptrace)  Optimization result – Reduce energy delay product by 18.4% and area by 23% – LUT size 5 provides the maximum power and delay combined yield  Target function driven component analysis (FCA) –Given a target function f(X 1, X 2 ) –Find out the linear decomposition matrix W to minimize the error mean, variance, and skewness of f(·) when ignoring high order dependence –FCA has the same complexity as PCA and ICA but more accurate  Approximate max operation using second order polynomial  Works for all three delay models  More efficient and accurate than that using Fourier series – 20X faster than that using Fourier Series – Computational complexity  O(n 3 ) for quadratic delay model  O(n) for others  Within 2% error compared to MC simulation  Max operation using Fourier series approximation –Approximate PDF of variation sources by Fourier Series –Apply moment matching to reconstruct the canonical form of max operation  All operations are based on either close form formulae or lookup table – Computational complexity O(nK 2 )  Only works for linear and semi- quadratic delay model  Within 5% error compared to MC simulation For the CMOS technology scaling, process variation has become a potential show-stopper if not appropriately handled. These variations introduce significant uncertainty for both circuit performance and leakage power. Statistical modeling, analysis, and optimization for VLSI circuits has thus become the frontier research topic in recent years in combating such variation effects. As the process advances to nanometer technologies and low-energy embedded applications are explored for FPGAs, power consumption becomes a crucial design constraint for FPGAs. It is well known architecture and device setting have great impact on FPGA power and performance. However, how to perform statistical optimization, considering both device and architecture has not been solved by previous works. In addition, some reliability issues, such device aging and soft error rate (SER) may affect the performance of FPGAs. Such impact was not considered in the previous works either. Besides FPGAs, statistical modeling and analysis for ASICs are also hot research topics. There are many works on statistical timing and power modeling and analysis. However, how to efficiently perform statistical static timing analysis (SSTA) for non-linear delay model with non-Gaussian variation sources is still a hard problem. Moreover, most of statistical analysis assumes independent variation sources and apply principle component analysis (PCA) or independent component analysis (ICA) to decompose dependent variation sources. However, some of the variation sources are non-linearly dependent, such as Leff and Vth. In this case, the linear operation (such as PCA or ICA) cannot completely remove dependence. How to handle the non-linear dependent variation sources is another unsolved problem. Spatial correlation is another concern in statistical analysis. Many recent works try to model spatial correlation as a function of distance. However, some recent research work observe that the spatial correlation mainly comes from the deterministic across wafer variation and the pure random spatial variation is not significant. Modeling across wafer variation is also a challenge problem. Ph.D.’09: Statistical Modeling and Optimization for VLSI Circuits Student: Lerong Cheng Advisor: Lei He Co-advisor: Puneet Gupta EDA Lab ( Electrical Engineering Department, UCLA L. Cheng, P. Wong, F. Li, Y. Lin, and L. He, “Device and Architecture Co-Optimization for FPGA Power Reduction,” DAC, P.Wong, L. Cheng, Y. Lin, and L. He, “FPGA Device and Architecture Evaluation Considering Process Variation,” ICCAD, L. Cheng, J. Xiong, and L. He, “FPGA Performance Optimization via Chipwise Placement Considering Process Variations,” FPL, L. Cheng, J. Xiong, and L. He,.Non-Linear Statistical Static Timing Analysis for Non-Gaussian Variation Sources,. DAC, L. Cheng, J. Xiong, and L. He, “Non-Gaussian Statistical Timing Analysis Using Second-Order Polynomial Fitting,” ASPDAC, L. Cheng, Y.Lin, L. He, and Y. Cao, “Trace-Based Framework for Concurrent Development of Process and FPGA Architecture Considering Process Variation and Reliability,” ISFPGA, L. Cheng, P. Gupta, and L. He, “Accounting for Non-linear Dependence Using Function Driven Component Analysis,” ASPDAC Collaborators: Dr. Jinjun Xiong, Dr. Yan Lin, Dr. Fei Li, and Miss Phoebe Wong, Introduction  Block-based SSTA operations –Add (simple) –Max (hard)  Core operation of SSTA  Delay model –Linear, e fficient but not accurate –Quadratic, accurate but not efficient –Quadratic without crossing term (semi-quadratic), efficient and somewhat accurate Analysis of Non-Linear Dependence Statistical Modeling and Optimization for FPGAs References & Collaborators Statistical Static Timing Analysis Modeling of Across-Wafer Variation Switching activity Ratio of short circuit power Critical path structure Circuit element statistics Area Chip level area, delay and power Circuit level delay and power VPR Psim Trace collection Device dependent Device independent  FPGA chipwise placement for timing optimization  Concurrent design of process and FPGA architecture – Develop process and architecture concurrently in order to shorten the time to market – Need to estimate FPGA power and delay from process parameters  Ptrace2 – Based on ITRS Mastar4 transistor model  Analysis result – Device aging leads to 8.5% delay degradation after 10 years – Neither device aging nor process variation has impact on SER – Programmability of FPGAs offer a unique opportunity to leverage process variation and improve circuit performance – Perform placement according to the chipwise variation maps – Improve performance up to 12.1%  Across-wafer variation can be approximated as quadratic function  After subtracting the across-wafer variation, purely random spatial correlation is not significant  In the die point of view, the within wafer is spatially correlated – This variation is not purely random – Cannot be modeled as random correlated variation  New Variation Model – Exactly model the across wafer variation – Only 4 random variables: Xc, Yc, mw, and r – More accurate and efficient than spatial l variation model Loca- tion Our ModelSpatial Correlation model µσ95%T (s)µσ95%T (s) LL-C (10.1X) LR-C (10.5X) UL-C (10.1X) UR-C (10.2X) Target function f(X 1, X 2 ) Samples of X 1, X 2 Joint moments of X 1, X 2 Error of moments of f as function of transfer matrix Nonlinear programming Minimizing error of moments of f Transfer matrix W Target function f(X 1, X 2 ) Samples of X 1, X 2 Transfer matrix W Joint moments of X 1, X 2 Moments of P 1, P 2 ρ ij of P 1, P 2 Function of P 1, P 2 g(P 1, P 2 ) Result with Correct dependence Result assuming ρ ij =0 Error  Linear operation is used to decomposed dependent variation sources  Not accurate with existence of non-linear dependence  Need to estimate the error introduced by ignoring non- linear dependence  Define high order correlation coefficient Circuit delay Comparison Wafer frequency Wafer leakage Across-wafer variation is looked on as spatial correlated in the die point of view PDF comparison Approximate max operation as second order polynomial PDF comparison Fourier Series approximation of PDF Performance improve under different utilization rate Performance improvement histogram PTRACE