Feng-Xiang Huang A Design-for-Debug (DfD) for NoC-based SoC Debugging via NoC Hyunbean Yi 1, Sungju Park 2, and Sandip Kundu 1 1 Department of Electrical.

Slides:



Advertisements
Similar presentations
Computer Architecture
Advertisements

Presenter : Cheng-Ta Wu Kenichiro Anjo, Member, IEEE, Atsushi Okamura, and Masato Motomura IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 39,NO. 5, MAY 2004.
A Programmable Adaptive Router for a GALS Parallel System Jian Wu APT Group University of Manchester May 2009.
Using emulation for RTL performance verification
Control path Recall that the control path is the physical entity in a processor which: fetches instructions, fetches operands, decodes instructions, schedules.
Presenter : Shih-Tung Huang 2015/4/30 EICE team Automated Data Analysis Solutions to Silicon Debug Yu-Shen Yang Dept. of ECE University of Toronto Toronto,
1 Fundamentals of Computer Science Sequential Circuits.
Presenter : Ching-Hua Huang 2014/4/14 A Configurable Bus-Tracer for Error Reproduction in Post-Silicon Validation Shing-Yu Chen ; Ming-Yi Hsiao ; Wen-Ben.
Feng-Xiang Huang 2015/5/4 International Symposium Quality Electronic Design (ISQED), th M. H Neishaburi, Zeljko Zilic, McGill University, Quebec.
1 Architectural Complexity: Opening the Black Box Methods for Exposing Internal Functionality of Complex Single and Multiple Processor Systems EECC-756.
Reporter:PCLee With a significant increase in the design complexity of cores and associated communication among them, post-silicon validation.
Feng-Xiang Huang MCORE Architecture implements Real-Time Debug Port based on Nexus Consortium Specification David Ruimy Gonzales Senior Member of Technical.
Hyunbean Yi, Sungju Park, and Sandip Kundu, Fellow, IEEE IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS-I : REGULAR PAPERS, VOL. 57, NO. 7, JULY 2010 Reporter:
Feng-Xiang Huang A Low-Cost SOC Debug Platform Based on On-Chip Test Architectures.
Energy Evaluation Methodology for Platform Based System-On- Chip Design Hildingsson, K.; Arslan, T.; Erdogan, A.T.; VLSI, Proceedings. IEEE Computer.
~ EDA lab ~ Interconnect Verification for SOC Jing-Yang Jou Department of Electronics Engineering National Chiao Tung University Hsinchu, Taiwan
1 Multi-Core Debug Platform for NoC-Based Systems Shan Tang and Qiang Xu EDA&Testing Laboratory.
Presenter: Jyun-Yan Li Multiprocessor System-on-Chip Profiling Architecture: Design and Implementation Po-Hui Chen, Chung-Ta King, Yuan-Ying Chang, Shau-Yin.
1 Design For Debug Using DAFCA system Gadi Glikberg 15/6/06.
1 FPGA Lab School of Electrical Engineering and Computer Science Ohio University, Athens, OH 45701, U.S.A. An Entropy-based Learning Hardware Organization.
University College Cork IRELAND Hardware Concepts An understanding of computer hardware is a vital prerequisite for the study of operating systems.
Feng-Xiang Huang NIFD: Non-Intrusive FPGA Debugger Debugging FPGA ‘Threads’ for Rapid HW/SW Systems Prototyping Hari Angepat, Gage Eads, Christopher Craik.
Presenter : Cheng-Ta Wu Antti Rasmus, Ari Kulmala, Erno Salminen, and Timo D. Hämäläinen Tampere University of Technology, Institute of Digital and Computer.
TOPIC - BIST architectures I
ENGIN112 L26: Shift Registers November 3, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 26 Shift Registers.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Cortex-M3 Debugging System
Presenter : Shao-Cheih Hou Sight count : 11 ASPDAC ‘08.
CS-334: Computer Architecture
Yao Wang, Yu Wang, Jiang Xu, Huazhong Yang EE. Dept, TNList, Tsinghua University, Beijing, China Computing System Lab, Dept. of ECE Hong Kong University.
L i a b l eh kC o m p u t i n gL a b o r a t o r y Trace-Based Post-Silicon Validation for VLSI Circuits Xiao Liu Department of Computer Science and Engineering.
Reporter: PCLee. Assertions in silicon help post-silicon debug by providing observability of internal properties within a system which are.
1 Registers and Counters A register consists of a group of flip-flops and gates that affect their transition. An n-bit register consists of n-bit flip-flops.
MICROPROCESSOR INPUT/OUTPUT
Presenter: Hong-Wei Zhuang On-Chip SOC Test Platform Design Based on IEEE 1500 Standard Very Large Scale Integration (VLSI) Systems, IEEE Transactions.
SMART: A Single- Cycle Reconfigurable NoC for SoC Applications -Jyoti Wadhwani Chia-Hsin Owen Chen, Sunghyun Park, Tushar Krishna, Suvinay Subramaniam,
Presenter : Ching-Hua Huang 2013/9/16 Visibility Enhancement for Silicon Debug Cited count : 62 Yu-Chin Hsu; Furshing Tsai; Wells Jong; Ying-Tsai Chang.
CHAPTER 3 TOP LEVEL VIEW OF COMPUTER FUNCTION AND INTERCONNECTION
High-Level Interconnect Architectures for FPGAs An investigation into network-based interconnect systems for existing and future FPGA architectures Nick.
Top Level View of Computer Function and Interconnection.
High-Level Interconnect Architectures for FPGAs Nick Barrow-Williams.
Presenter: PCLee Post-silicon validation is used to identify design errors in silicon. Its main limitation is real-time observability of the.
ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTEMS
COMPUTER ORGANIZATIONS CSNB123. COMPUTER ORGANIZATIONS CSNB123 Expected Course Outcome #Course OutcomeCoverage 1Explain the concepts that underlie modern.
EEE440 Computer Architecture
Feng-Xiang Huang Test Symposium(ETS), th IEEE European Ko, Ho Fai; Nicolici, Nicola; Department of Electrical and Computer Engineering,
4/19/20021 TCPSplitter: A Reconfigurable Hardware Based TCP Flow Monitor David V. Schuehler.
Computer Hardware A computer is made of internal components Central Processor Unit Internal External and external components.
Improving NoC-based Testing Through Compression Schemes Érika Cota 1 Julien Dalmasso 2 Marie-Lise Flottes 2 Bruno Rouzeyre 2 WNOC
1 Presenter: Min Yu,Lo 2015/12/21 Kumar, S.; Jantsch, A.; Soininen, J.-P.; Forsell, M.; Millberg, M.; Oberg, J.; Tiensyrja, K.; Hemani, A. VLSI, 2002.
Presenter : Shao-Chieh Hou 2012/8/27 Second ACM/IEEE International Symposium on Networks-on-Chip IEEE computer society.
By Fernan Naderzad.  Today we’ll go over: Von Neumann Architecture, Hardware and Software Approaches, Computer Functions, Interrupts, and Buses.
Dr Mohamed Menacer College of Computer Science and Engineering, Taibah University CE-321: Computer.
Lecture 4 General-Purpose Input/Output NCHUEE 720A Lab Prof. Jichiang Tsai.
SCORES: A Scalable and Parametric Streams-Based Communication Architecture for Modular Reconfigurable Systems Abelardo Jara-Berrocal, Ann Gordon-Ross NSF.
ECE 448 Lecture 6 Finite State Machines State Diagrams vs. Algorithmic State Machine (ASM) Charts.
Chap 5. Registers and Counters
Chapter 3 System Buses.  Hardwired systems are inflexible  General purpose hardware can do different tasks, given correct control signals  Instead.
Field Programmable Port Extender (FPX) 1 Modular Design Techniques for the Field Programmable Port Extender John Lockwood and David Taylor Washington University.
RTL Hardware Design by P. Chu Chapter 9 – ECE420 (CSUN) Mirzaei 1 Sequential Circuit Design: Practice Shahnam Mirzaei, PhD Spring 2016 California State.
ELEC 7950 – VLSI Design and Test Seminar
Performed by: Erez Davidi / Aviad Zrihen Instructor: Yaniv Ben-Yitzhak המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory הטכניון - מכון.
Chap 7. Register Transfers and Datapaths
KU College of Engineering Elec 204: Digital Systems Design
CPE/EE 428/528 VLSI Design II – Intro to Testing (Part 2)
CPE/EE 428/528 VLSI Design II – Intro to Testing (Part 3)
Registers and Counters Register : A Group of Flip-Flops. N-Bit Register has N flip-flops. Each flip-flop stores 1-Bit Information. So N-Bit Register Stores.
Lesson 4 Synchronous Design Architectures: Data Path and High-level Synthesis (part two) Sept EE37E Adv. Digital Electronics.
ECEN 248: INTRODUCTION TO DIGITAL SYSTEMS DESIGN
Presentation transcript:

Feng-Xiang Huang A Design-for-Debug (DfD) for NoC-based SoC Debugging via NoC Hyunbean Yi 1, Sungju Park 2, and Sandip Kundu 1 1 Department of Electrical & Computer Engineering, University of Massachusetts, USA 2 Department of Computer Science and Engineering, Hanyang University, Korea Asian Test Symposium, ATS’ th.

Combining Scan and Trace Buffers for Enhancing Real-time Observability in Post- Silicon Debugging A Scan Cell Design for Scan- Based Debugging of an SoC With Multiple Clock Domains NIFD: Non-Intrusive FPGA Debugger Debugging FPGA ‘Threads’ for Rapid HW/SW Systems Prototyping NIFD: Non-Intrusive FPGA Debugger Debugging FPGA ‘Threads’ for Rapid HW/SW Systems Prototyping A Design-for- Debug(DfD) for NoC- based SoC Debugging via NoC

This paper presents design-for-debug (DfD) methods for the reuse of network-on-chip (NoC) as a debug data path in an NoC-based system-on-chip (SoC). We propose on-chip core debug supporting logics which can support transaction-based debug. A debug interface unit is also presented to enable debug data transfer through an NoC between an external debugger and a core-under-debug (CUD). The proposed approach supports debug of designs with multiple clock domains. It also supports collection of trace signatures to facilitate debug of long pattern sequences. Experimental results show that single and multiple stepping through transactions are feasible with moderately low area overhead. We also present simulation result to verify proper operation of the debug components.

[14] An event- based Network- On-Chip monitoring service [15] Transaction Monitoring in Networks on chip: The On-Chip Run-Time Prespective A Design-for- Debug(DfD) for NoC-based SoC DebugginG via NoC [17] Transaction- Based Communication- centric debug [16] A Multi-Core Debug Platform for NoC-based Systems

For more efficient NoC-based SoC debugging, three problems to be solved in this paper are:  Signal propagation delay 。 Deploy transaction based debug strategy  Instead of stopping cores as soon as an event c, such as transaction counts, timer values and the core from which the event is generated are recoded in an SoC.  Inefficient scan dump 。 Reuse the test infrastructure  Monitors internal nodes difficultly 。 Collect signature of select set of internal states.

Debug Architecture Overview  Event occurs make all routers empty and stop all core and then a debug engineer reads out the debug information and selects cores to be debugger using the TAP controller, and dumps scan contents or applies and observes debug data via NoC. Core debug supporters  MDS for the master cores  SDS for the slave cores

CDS and TDIU  Clock gating cell (CGC) and Clock Multiplexer:  Event & Trans. Detector  Transaction Counter:  Timer:  Transaction & Core Stopper:  Debug info. setter  Debug information register: Current TRcont & Timer Latest issued TRcont & Timer

TDCPI stop core, config TAM Inform TDIU that all transactions and store debug information. all TR_completed signal are asserted Setting debug_rdy to high Enable SDS to stop its slave core all TR_completed signal are asserted Setting debug_rdy to high Enable SDS to stop its slave core

Single Transaction Step Debugging  Performed by running and stopping master cores on a transaction basis  Single_TR_step and TR_stop_req_inout signals of the TDIU 。 Directly connected to a debugger, are used for the STSD 。 When TR_stop_req goes high, TR_block in each MDS goes low  Enable a new transaction to be initiated  After all masters and slaves are stopped  Perfome scan dump and observation, and resuming normal operation 。 Completion of an STSD by de-asserting TR_stop_req

Interrupt for debugging  Long pattern sequences can not be simulated. 。 Golden reference values are not known Easily be done  By periodically repeating the debug cycle using the proposed method  By asserting debug_enable and TR_stop_req signals

Master 1 and 2 run at 500 MHz. Slave 1 and Slave 2 run at 250 MHz and 125 MHz, respectively.

Area Overhead  Used the open AMBA-based IP cores with 32-bit separate read and write data buses  Gate count of the TDCPIs is 889. 。 TDCPIs are arranged from 611 to 1519 according to the number of primary inputs and outputs of the core  An MDS and an SDS are implemented with and 6115 gate count.  The average area overhead by adding a TDCPI and a CDS in a core is about 33%.

Proposed a transaction based debugging strategy and presented the core debug supporting logic. This allows debug support for multiple clock domains It can be efficiently performed without adding new parallel paths or using the slow IEEE