1 Design and Impementation of a Sub- threshold BFSK Transmitter By: Suganth Paul # Rajesh Garg $ Sunil P. Khatri $ Sheila Vaidya % # Intel Corporation,

Slides:



Advertisements
Similar presentations
Chapter : Digital Modulation 4.2 : Digital Transmission
Advertisements

A Robust, Fast Pulsed Flip- Flop Design By: Arunprasad Venkatraman Rajesh Garg Sunil Khatri Department of Electrical and Computer Engineering, Texas A.
RF Circuit Design Chris Fuller /7/2012.
FREQUENCY SHIFT KEYING
5/4/2006BAE Analog to Digital (A/D) Conversion An overview of A/D techniques.
Current-Mode Multi-Channel Integrating ADC Electrical Engineering and Computer Science Advisor: Dr. Benjamin J. Blalock Neena Nambiar 16 st April 2009.
Chapter 11 Operational Amplifiers and Applications
1 A Design Approach for Radiation-hard Digital Electronics Rajesh Garg Nikhil Jayakumar Sunil P Khatri Gwan Choi Department of Electrical and Computer.
1 Closed-Loop Modeling of Power and Temperature Profiles of FPGAs Kanupriya Gulati Sunil P. Khatri Peng Li Department of ECE, Texas A&M University, College.
1 A Lithography-friendly Structured ASIC Design Approach By: Salman Goplani* Rajesh Garg # Sunil P Khatri # Mosong Cheng # * National Instruments, Austin,
Kazi Spring 2008CSCI 6601 CSCI-660 Introduction to VLSI Design Khurram Kazi.
1 A Variation-tolerant Sub- threshold Design Approach Nikhil Jayakumar Sunil P. Khatri. Texas A&M University, College Station, TX.
A Self-adjusting Scheme to Determine Optimum RBB by Monitoring Leakage Currents Nikhil Jayakumar* Sandeep Dhar $ Sunil P. Khatri* $ National Semiconductor,
A PLA based Asynchronous Micropipelining Approach for Sub- threshold Circuit Design Authors: Nikhil Jayakumar* Rajesh Garg* Bruce Gamache $ Sunil P. Khatri*
Large Area, High Speed Photo-detectors Readout Jean-Francois Genat + On behalf and with the help of Herve Grabas +, Samuel Meehan +, Eric Oberla +, Fukun.
ANTENNA Subthreshold Transmitter Implementation for Low Power Sensor Platform Gordon D. Burgett Joseph A. Duperre III Rajesh Garg, Ph.D. Student Dr. Sunil.
1 adaptive body bias for reducing process variations nuno alves 19 / october / 2006.
1 Computing with Leakage Currents Nikhil Jayakumar, Kanupriya Gulati, Rajesh Garg and Sunil P. Khatri ECE Department Texas A&M University.
Power, Energy and Delay Static CMOS is an attractive design style because of its good noise margins, ideal voltage transfer characteristics, full logic.
RF Wakeup Sensor – On-Demand Wakeup for Zero Idle Listening and Zero Sleep Delay.
The new E-port interface circuits Filip Tavernier CERN.
BY MD YOUSUF IRFAN.  GLOBAL Positioning System (GPS) receivers for the consumer market require solutions that are compact, cheap, and low power.  This.
General Licensing Class G7A – G7C Practical Circuits Your organization and dates here.
Anthony Gaught Advisors: Dr. In Soo Ahn and Dr. Yufeng Lu Department of Electrical and Computer Engineering Bradley University, Peoria, Illinois May 7,
DARPA Digital Audio Receiver, Processor and Amplifier Group Z James Cotton Bobak Nazer Ryan Verret.
Flow sensor circuitry Eduard Stikvoort 00/1A The work was done in Philips Reaearch Eindhoven.
Design of Robust, Energy-Efficient Full Adders for Deep-Submicrometer Design Using Hybrid-CMOS Logic Style Sumeer Goel, Ashok Kumar, and Magdy A. Bayoumi.
ENGG 6090 Topic Review1 How to reduce the power dissipation? Switching Activity Switched Capacitance Voltage Scaling.
Power Reduction for FPGA using Multiple Vdd/Vth
CAD for Physical Design of VLSI Circuits
ASIC Design Flow – An Overview Ing. Pullini Antonio
Logic Synthesis for Low Power(CHAPTER 6) 6.1 Introduction 6.2 Power Estimation Techniques 6.3 Power Minimization Techniques 6.4 Summary.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
1 5. Application Examples 5.1. Programmable compensation for analog circuits (Optimal tuning) 5.2. Programmable delays in high-speed digital circuits (Clock.
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
Development of DC-DC converter ASICs S.Michelis 1,3, B.Allongue 1, G.Blanchot 1, F.Faccio 1, C.Fuentes 1,2, S.Orlandi 1, S.Saggini 4 1 CERN – PH-ESE 2.
L28:Lower Power Algorithm for Multimedia Systems(2) 성균관대학교 조 준 동
A Robust Pulse-triggered Flip-Flop and Enhanced Scan Cell Design
Introduction to FPGA Created & Presented By Ali Masoudi For Advanced Digital Communication Lab (ADC-Lab) At Isfahan University Of technology (IUT) Department.
Recent Topics on Programmable Logic Array
By Sewvanda Hewa Thumbellage Don, Meshegna Shumye, Owen Paxton, Mackenzie Cook, Jonathon Lee, Mohamed Khelifi, Rami Albustami, Samantha Trifoli 1.
CHAPTER 8 Developing Hard Macros The topics are: Overview Hard macro design issues Hard macro design process Physical design for hard macros Block integration.
IMPLEMENTATION OF MIPS 64 WITH VERILOG HARDWARE DESIGN LANGUAGE BY PRAMOD MENON CET520 S’03.
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA Project Guide: Smt. Latha Dept of E & C JSSATE, Bangalore. From: N GURURAJ M-Tech,
Chapter : Digital Modulation 4.2 : Digital Transmission
Patricia Gonzalez Divya Akella VLSI Class Project.
ASIC buck converter prototypes for LHC upgrades
Greg Alkire/Brian Smith 197 MAPLD An Ultra Low Power Reconfigurable Task Processor for Space Brian Smith, Greg Alkire – PicoDyne Inc. Wes Powell.
Low Power, High-Throughput AD Converters
Low Power, High-Throughput AD Converters
Seok-jae, Lee VLSI Signal Processing Lab. Korea University
S. Bota – Calorimeter Electronics overview - July 2002 Status of SPD electronics Very Front End Review of ASIC runs What’s new: RUN 4 and 5 Next Actions.
Rd07 Conference th June 2007 Florence, Italy 1 High frequency stepdown DC-DC converter with switched capacitors This work is part of the INFN DACEL.
Mackenzie Cook Mohamed Khelifi Jonathon Lee Meshegna Shumye Supervisors: John W.M. Rogers, Calvin Plett 1.
NOISE MEASUREMENTS ON CLICPIX AND FUTURE DEVELOPMENTS Pierpaolo Valerio.
Low Power, High-Throughput AD Converters
High Gain Transimpedance Amplifier with Current Mirror Load By: Mohamed Atef Electrical Engineering Department Assiut University Assiut, Egypt.
Communication 40 GHz Anurag Nigam.
EE434 Jason Adams Mike Dierickx
B.Sc. Thesis by Çağrı Gürleyük
COMPUTER NETWORKS and INTERNETS
VLSI Testing Lecture 5: Logic Simulation
Vishwani D. Agrawal Department of ECE, Auburn University
1 Gbit/s Serial Link 1 Gbit/s Data Link Using Multi Level Signalling
General Licensing Class
Lesson 8: Analog Signal Conversion
Circuit Design Techniques for Low Power DSPs
Post-Silicon Calibration for Large-Volume Products
A New Hybrid FPGA with Nanoscale Clusters and CMOS Routing Reza M. P
Presented by Mohsen Shakiba
Presentation transcript:

1 Design and Impementation of a Sub- threshold BFSK Transmitter By: Suganth Paul # Rajesh Garg $ Sunil P. Khatri $ Sheila Vaidya % # Intel Corporation, Austin, TX $ Department of ECE, Texas A&M University, College Station, TX % Lawrence Livermore National Lab., Livermore, CA

2 Outline  Sub-threshold circuits – the opportunity  Challenges  Process/temperature/voltage variations  Solution – dynamic body bias  Validation via test chip  Design methodology  Silicon results  Conclusions

3 The Opportunity  Compared traditional circuit with sub-threshold (obtained by simply setting VDD < V T )  Performed simulations for 2 different processes on a 21 stage ring oscillator.  Impressive power reduction  Impressive power reduction (100X – 500X)  Power-Delay-Product (P-D-P) improves by as much as 20X  P-D-P is an important metric to compare circuit design styles  Power consumption has become a major issue for recent ICs  There is a large and growing class of applications where power reduction is paramount – not speed.  Such applications are ideal candidates for sub-threshold circuit design

4 Sub-threshold Logic  Ids has an exponential dependence on process, voltage and temperature (PVT)  Need to stabilize the circuit performance by compensating for PVT variations  No approach to compensate sub-threshold delay  Existing approaches compensate sub-threshold currents  To compensate delay, need a representative circuit  Not easy to come up with representative circuit for standard cells

5 Our Solution self-adjusting body-bias to phase-lock the circuit delay to a beat clock.  We propose a technique that uses self-adjusting body-bias to phase-lock the circuit delay to a beat clock. network of PLAs  Use a network of PLAs to implement circuits. common nbulk node  Several PLAs in a cluster share a common nbulk node.  A representative PLA in each cluster is chosen to phase lock the delay of the PLAs to the beat clock  If the delay is too high, a forward body bias is applied to speed up the representative PLA.  If the delay is low, body bias is brought back down to zero to slow down the representative PLA.  All other PLAs exhibit the same delay as the representative PLA, since they all share a common nbulk terminal

6 Objective  Validate and verify flow by designing a sub-threshold circuit for the application  Choose a test application  Low power, low speed  Develop a sub-threshold circuit design flow  Implement our delay compensation scheme to negate PVT variations  Implement the same application using a standard cell based flow on the same die  Fabricate and test the chip (TSMC 0.25 um process)  Compare the sub-threshold circuit with the standard cell circuit in terms of power consumption

7 Test Application - Binary Frequency Shift Keying (BFSK) Transmitter DAC Amplifier Antenna Digital BFSK Modulator Produces two tones f 1 if Input is LOW f 2 if Input is HIGH Binary Input Data Digital Block Implemented Using Sub-threshold Circuits  Specifications  Input bit Rate: R B = 32kbps, Broadcast distance: D = 1000m  FSK tones: f 1 =150kHz, f 2 =450kHz, Channel bandwidth: B = 300kHz

8 Sub-threshold Design Approach  Digital part of the circuit implemented as NPLA  Digital part of the circuit implemented as NPLA (Network of Programmable Logic Arrays)  NPLAs have low delay  Critical path delay easy to find  PLAs have common nbulk node  Circuit level PVT compensation phase locked with the critical path delay  An external Beat Clock (BCLK) signal is phase locked with the critical path delay charge pump that modulates the bulk voltage  Delay controlled by a charge pump that modulates the bulk voltage of transistors in the circuit  Compensates for both inter- and intra-die variations

9 Dynamic NOR-NOR PLA  We use precharged NOR-NOR PLAs as the structure of choice  Wordlines run horizontally  Inputs / their complements and outputs run vertically  Each PLA has a “ completion ” signal that switches low after all the outputs switch  Several PLAs in a cluster share a common nbulk node. Inputs Outputs completion clk Precharge Evaluate

10 Network of PLAs (NPLA) L1 PLA L2 PLA L2 PLA L3 PLA L4 PLA Timing Diagram L1 PLA L2 PLA L3 PLA L4 PLA Combinational Logic Implemented as NPLA Inputs Outputs Throughput = T pchg +n.T eval clk

11 The Charge Pump - PLA “completion” signal lags beat clock - nbulk node gets forward biased - PLA “completion” signal leads beat clock - nbulk goes back to zero bias pullup pulldown

12 Effectiveness of the Approach  We simulated a single PLA from 0ºC to 100ºC. Also applied V T variations (10%) and VDD variations (10%).  The light region shows the variations on delay over all the corners without delay compensation.  The red region shows the delays with the self-adjusting body- bias circuit.

13 Design Flow BFSK Design HDLSynthesis Map to NPLA Logic Verification Integrated Spice Netlist Layout LVSRC Extraction Full Chip Spice Verification Spice Verification: Functional, timing, charge pump Design Of Analog Components

14     98 DFF Sine Lookup Table Depth: 2 9 = 512 Phase Increment Clk Mux Binary Input Phase Accumulator BFSK Design  f out < f clk /2, Nyquist criterion, implies   < 256.  Phase increments chosen based on f clk or left programmable in real time to get Software Defined Radio (SDR) operation.  We fix phase increments to avoid extra input pins required for SDR f out = f clk   512

15 Design Flow BFSK Design HDLSynthesis Map to NPLA Logic Verification Integrated Spice Netlist Layout LVSRC Extraction Full Chip Spice Verification Spice Verification: Functional, timing, charge pump Design Of Analog Components

16 Basic BFSK transmitter Block Diagram DAC Amplifier Antenna Digital BFSK Modulator Produces two tones f 1 if Input is LOW f 2 if Input is HIGH Binary Input Data Digital Block Implemented Using NPLA based Sub-threshold Circuits

17 System Architecture Charge Pump Phase Accum NCO Binary to Thermometer Encoder DFF CLK BEAT CLK CLK DACAmplifier Antenna Digital BFSK Modulator Input Phase Detector Ref. PLA completion Common Bulkn Digital BFSK using NPLA 4 LSBs - Binary 15 MSBs - Thermometer Avoids glitches in DAC o/p

18 Delay Compensated Sub- threshold Design block diagram L1 PLA L2 PLA L2 PLA L3 PLA L4 PLA DFFs Beat Clk Phase Detector Charge Pump Completion of Reference PLA Common nbulk node of a cluster of PLAs, modulated by charge pump Clk L1 PLA L2 PLA L2 PLA NPLA

19 HDL to Schematic of Digital BFSK  Digital BFSK transmitter described using VHDL  VHDL synthesized using FPGA synthesis tool, to get a gate level netlist  This is imported into SIS in “ blif ” format  The “ blif ” file is logically optimized and mapped into NPLA  Technology Independent Optimization done on circuit  Circuit converted to a mult-level network of nodes with 5 or less inputs per node  Circuit traversed from inputs to outputs, and nodes are implemented using PLAs of size (8/6/12)  Using NPLA throughput equation, f clk estimated as 1.2MHz  We choose f 1 ≈0.115* f clk and f 2 = 0.345* f clk

20 Design Flow BFSK Design HDLSynthesis Map to NPLA Logic Verification Integrated Spice Netlist Layout LVSRC Extraction Full Chip Spice Verification Spice Verification: Functional, timing, charge pump Design Of Analog Components

21 System Architecture Charge Pump Phase Accum NCO Binary to Thermometer Encoder DFF CLK BEAT CLK CLK DACAmplifier Antenna Digital BFSK Modulator Input Phase Detector Ref. PLA completion Common Bulkn

22 Thermometer Coded 8-BIT DAC 4 4 LSBs Digital BFSK Output Binary to Thermometer Code Conversion DAC ThermBinary Adjacent Values Differ by 1-bit

23 8-BIT DAC Schematic CM legT 4 - T 18 B3B3 B2B2 B1B1 B0B0 Device size16W 1 8W 1 4W 1 2W 1 W1W1  Currents flow through mirror legs based on input value W1W1  Output current / voltage modulated based by sum of weighted currents through R out  Thermometer codes prevent glitches at output  DAC supply is 0.7V to handle 0.6V digital signals  Rout, Rcm are off-chip resistances

24 Amplifier Schematic  Common Source Amplifer  Supply of 0.7V  Rd, Rs are off-chip resistances  M1 biased by DAC Rout resistor  C L on-chip antenna load 80pF

25 Testability Features added before Integration Charge Pump Phase Accum NCO Binary to Thermometer Encoder DFF CLK BEAT CLK CLK DAC Amplifier Antenna Input Phase Detector Ref. PLA completion Common Bulkn CHIP 8-BIT BFSK Output or 8-BIT DAC Input Bulkn Charge Pump Supply DAC Ouput Amp Ouput

26 Layout  Manual PLA layout for every PLA in design  NPLA routed using SEDSM  I/O pad cells, ESD diodes layout done manually  DAC, amplifier layout done manually  Antenna coil layout done manually

27 PLA Layout Word, Lines Input, Bit Line Output, Lines Transistors, modified based on logic to be implemented

28 I/O PAD CELL Layout I/O PAD Primary ESD Diodes Secondary ESD Diodes I/O Drivers  Fully Compliant with TSMC Design rules  ESD Diodes have guard rings to prevent latchup  Fully Compliant with TSMC Design rules  ESD Diodes have guard rings to prevent latchup

29 Die Photo Digital BFSK output domain, 2V Digital BFSK inputs domain, 0.7V Digital BFSK domain, 0.6V Std Cell domain, 2.5V

30 Experimental Results from Silicon  Output of BFSK transistor is shown  As input changes from 0 to 1, the output frequency changes showing the modulation  Output of BFSK transistor is shown  As input changes from 0 to 1, the output frequency changes showing the modulation  Fclk = 1MHz  F1 = 117kHz  F2 = 347kHz  The adjacent peaks are around -10dB below the fundamental peaks  We found from Matlab Simulations that, signals from the extracted Spice netlist, could be demodulated at the receiver side

31 Results from Silicon  Nbulk kept at 0V, 0.45V  Maximum frequency shows an quadratic dependence on supply Voltage Operating Range

32 Design StyleOperating Voltage Frequency of Operation Avg Current Power Dissipated Sub-threshold0.6V1.05MHz  26.8  W Std Cell2.5V1.05MHz 208  A520  W Power Comparison  Sub-threshold power calculated only for Phase Accumulator, and NCO blocks on 0.6V power supply,  Std Cell implements only this portion of BFSK circuit  Sub-threshold gives 19.4X lesser power

33 Bulkn Node Modulation  Bulk node modulates when beat clock demands speedup or slow-down  Bulk node modulates as supply voltage is changed, so that circuit delay is maintained constant.

34 Conclusion  Validated a sub-threshold circuit design methodology based on dynamic body bias (first-of-kind)  Validated design tools and techniques  First-of-kind design automation flow, will help bring sub- threshold design to mainstream.  We implemented an ultra low power, low data rate wireless BFSK transmitter  The fabricated chip, works as expected, validating our design flow.  We compared the sub-threshold design a with Std Cell based design and showed 19.4X reduction in power.

35 Thank you!!

Backup Slides 36

37 Introduction  Power consumption has become a significant hurdle for recent ICs  Higher power consumption leads to  Shorter battery life  Higher on-chip temperatures – reduced operating life of the chip  There is a large and growing class of applications where power reduction is paramount – not speed.  Such applications are ideal candidates for sub- threshold circuit design  For sub-threshold circuits, VDD ≤ V T

38 TX/RX System Testing TX PCB with subthreshold IC TX antennas RX board RX setup

39 Solving the Problem of Delay Sensitivity to Process, Voltage and Temperature Variations Solving the Problem of Delay Sensitivity to Process, Voltage and Temperature Variations "A Variation-tolerant Sub-threshold Design Approach", Jayakumar, Khatri. Design Automation Conference (DAC) 2005 Anaheim, CA, June

40 An Example Showing Phase Locking  This figure shows how the body bias (and hence the delay of the PLA) changes with changes in VDD.  The adjustment is very quick (within a few clock cycles). VDD change 0.2V to 0.22V VDD change 0.22V to 0.18V

41 Energy and Speed  We may be interested in the minimum energy operating point for the design  Minimizing VDD reduces power but minimum VDD does not mean minimum energy  The optimum VDD value increases with increased logical depth, and with temperature "Minimum Energy Near-threshold Network of PLA based Design", Jayakumar, Khatri. International Conference on Computer Design (ICCD) 2005, Oct 2-5, San Jose, CA.  Reclaiming the speed penalty  Can be done for datapath circuits, using asynchronous micropipelining  Showed that speedup of 7X is possible, with a area overhead of 44% "A PLA based Asynchronous Micropipelining Approach for Subthreshold Circuit Design", Jayakumar, Garg, Gamache, Khatri. IEEE/ACM Design Automation Conference (DAC) 2006, July 24-28, San Francisco, CA.

42 On-chip Antenna  Antenna size needs to be at least a 10 th of the transmit wavelength to radiate effectively  Transmit wavelength around 600m  Due to on-chip space constraints, antenna coil length is only 0.2m  We have the option of using an external antenna  And we had a 60dB safety margin in the link budget analysis.  This could compensate for a lossy antenna

43 Spectrum of Amplifier Tones  Fclk = 1MHz  F1 = 117kHz  F2 = 347kHz  The adjacent peaks are around -10dB below the fundamental peaks  We found from Matlab Simulations that, signals from the extracted Spice netlist, could be demodulated at the receiver side