3D Interconnect: Architectural Challenges and Opportunities UC SANTA BARBARA Tim Sherwood.

Slides:



Advertisements
Similar presentations
Computer Architecture
Advertisements

1 Wire-driven Microarchitectural Design Space Exploration School of Electrical and Computer Engineering Georgia Institute of Technology Atlanta, GA 30332,
TO COMPUTERS WITH BASIC CONCEPTS Lecturer: Mohamed-Nur Hussein Abdullahi Hame WEEK 1 M. Sc in CSE (Daffodil International University)
1 A Self-Tuning Cache Architecture for Embedded Systems Chuanjun Zhang*, Frank Vahid**, and Roman Lysecky *Dept. of Electrical Engineering Dept. of Computer.
Zhiguo Ge, Weng-Fai Wong, and Hock-Beng Lim Proceedings of the Design, Automation, and Test in Europe Conference, 2007 (DATE’07) April /4/17.
EXTERNAL COMMUNICATIONS DESIGNING AN EXTERNAL 3 BYTE INTERFACE Mark Neil - Microprocessor Course 1 External Memory & I/O.
Smart Refresh: An Enhanced Memory Controller Design for Reducing Energy in Conventional and 3D Die-Stacked DRAMs Mrinmoy Ghosh Hsien-Hsin S. Lee School.
Helper Threads via Virtual Multithreading on an experimental Itanium 2 processor platform. Perry H Wang et. Al.
June 20 th 2004University of Utah1 Microarchitectural Techniques to Reduce Interconnect Power in Clustered Processors Karthik Ramani Naveen Muralimanohar.
OCIN Workshop Wrapup Bill Dally. Thanks To Funding –NSF - Timothy Pinkston, Federica Darema, Mike Foster –UC Discovery Program Organization –Jane Klickman,
Yefu Wang and Kai Ma. Project Goals and Assumptions Control power consumption of multi-core CPU by CPU frequency scaling Assumptions: Each core can be.
A Self-Tuning Cache Architecture for Embedded Systems Chuanjun Zhang, Vahid F., Lysecky R. Proceedings of Design, Automation and Test in Europe Conference.
Alpha Goal: very fast multiprocessor systems, highly scalable Main trick is high-bandwidth, low-latency data access. How to do it, how to do it?
Temperature-Aware Design Presented by Mehul Shah 4/29/04.
Feb 14 th 2005University of Utah1 Microarchitectural Wire Management for Performance and Power in partitioned architectures Rajeev Balasubramonian Naveen.
Research Directions for On-chip Network Microarchitectures Luca Carloni, Steve Keckler, Robert Mullins, Vijay Narayanan, Steve Reinhardt, Michael Taylor.
Network-on-Chip: Communication Synthesis Department of Computer Science Texas A&M University.
Feb 14 th 2005University of Utah1 Microarchitectural Wire Management for Performance and Power in Partitioned Architectures Rajeev Balasubramonian Naveen.
Architectural and Compiler Techniques for Energy Reduction in High-Performance Microprocessors Nikolaos Bellas, Ibrahim N. Hajj, Fellow, IEEE, Constantine.
SYNTHESIS OF NETWORKS ON CHIPS FOR 3D SYSTEMS ON CHIPS Srinivasan Murali, Ciprian Seiculescu, Luca Benini, Giovanni De Micheli Presented by Puqing Wu.
Juanjo Noguera Xilinx Research Labs Dublin, Ireland Ahmed Al-Wattar Irwin O. Irwin O. Kennedy Alcatel-Lucent Dublin, Ireland.
Robust Low Power VLSI Selecting the Right Conference for the BSN FIR Filter Paper Alicia Klinefelter November 13, 2011.
Yao Wang, Yu Wang, Jiang Xu, Huazhong Yang EE. Dept, TNList, Tsinghua University, Beijing, China Computing System Lab, Dept. of ECE Hong Kong University.
Computer System Architectures Computer System Software
Profile-Guided Microarchitectural Floorplanning for Deep Submicron Processor Design Mongkol Ekpanyapong, Jacob R. Minz, Thaisiri Watewai*, Hsien-Hsin S.
February 12, 1998 Aman Sareen DPGA-Coupled Microprocessors Commodity IC’s for the Early 21st Century by Aman Sareen School of Electrical Engineering and.
Introspective 3D Chips S. Mysore, B. Agrawal, N. Srivastava, S. Lin, K. Banerjee, T. Sherwood (UCSB), ASPLOS 2006 Shimin Chen (LBA Reading Group Presentation)
Emerging Technologies: A CompSci Perspective UC SANTA BARBARA Tim Sherwood.
Simultaneous Multithreading: Maximizing On-Chip Parallelism Presented By: Daron Shrode Shey Liggett.
Computer Architecture Challenges Shriniwas Gadage.
Low-Power Wireless Sensor Networks
1 Overview 1.Motivation (Kevin) 1.5 hrs 2.Thermal issues (Kevin) 3.Power modeling (David) Thermal management (David) hrs 5.Optimal DTM (Lev).5 hrs.
Architectures for mobile and wireless systems Ese 566 Report 1 Hui Zhang Preethi Karthik.
SoC TAM Design to Minimize Test Application Time Advisor Dr. Vishwani D. Agrawal Committee Members Dr. Victor P. Nelson, Dr. Adit D. Singh Apr 9, 2015.
MARS A Scan-Island Based Design Enabling Pre-Bond Testability in Die-Stacked Microprocessors Dean L. Lewis Hsien-Hsin S. Lee Georgia Institute of Technology.
Lessons Learned The Hard Way: FPGA  PCB Integration Challenges Dave Brady & Bruce Riggins.
Warped Gates: Gating Aware Scheduling and Power Gating for GPGPUs
1 Computer Architecture Research Overview Rajeev Balasubramonian School of Computing, University of Utah
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
Lecture 1 1 Computer Systems Architecture Lecture 1: What is Computer Architecture?
1 Towards Optimal Custom Instruction Processors Wayne Luk Kubilay Atasu, Rob Dimond and Oskar Mencer Department of Computing Imperial College London HOT.
RF network in SoC1 SoC Test Architecture with RF/Wireless Connectivity 1. D. Zhao, S. Upadhyaya, M. Margala, “A new SoC test architecture with RF/wireless.
Computer Organization & Assembly Language © by DR. M. Amer.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) Reconfigurable Architectures Forces that drive.
Computer Architecture CPSC 350
Power and Control in Networked Sensors E. Jason Riedy and Robert Szewczyk Presenter: Fayun Luo.
System-level power analysis and estimation September 20, 2006 Chong-Min Kyung.
1 Interconnect/Via. 2 Delay of Devices and Interconnect.
DTM and Reliability High temperature greatly degrades reliability
Lev Finkelstein ISCA/Thermal Workshop 6/ Overview 1.Motivation (Kevin) 2.Thermal issues (Kevin) 3.Power modeling (David) 4.Thermal management (David)
Analysis of Cache Tuner Architectural Layouts for Multicore Embedded Systems + Also Affiliated with NSF Center for High- Performance Reconfigurable Computing.
1 Copyright  2001 Pao-Ann Hsiung SW HW Module Outline l Introduction l Unified HW/SW Representations l HW/SW Partitioning Techniques l Integrated HW/SW.
FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Low power design. n Pipelining.
1 - CPRE 583 (Reconfigurable Computing): Reconfigurable Computing Architectures Iowa State University (Ames) CPRE 583 Reconfigurable Computing Lecture.
PERFORMANCE EVALUATION OF LARGE RECONFIGURABLE INTERCONNECTS FOR MULTIPROCESSOR SYSTEMS Wim Heirman, Iñigo Artundo, Joni Dambre, Christof Debaes, Pham.
1 Device Controller I/O units typically consist of A mechanical component: the device itself An electronic component: the device controller or adapter.
1 of 14 Lab 2: Formal verification with UPPAAL. 2 of 14 2 The gossiping persons There are n persons. All have one secret to tell, which is not known to.
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
1 of 14 Lab 2: Design-Space Exploration with MPARM.
Microprocessor Design Process
Die Stacking (3D) Microarchitecture Bryan Black, Murali Annavaram, Ned Brekelbaum, John DeVale, Lei Jiang, Gabriel H. Loh1, Don McCauley, Pat Morrow, Donald.
Hardware Architecture
Fall 2012 Parallel Computer Architecture Lecture 4: Multi-Core Processors Prof. Onur Mutlu Carnegie Mellon University 9/14/2012.
Computer Organization and Architecture Lecture 1 : Introduction
Warped Gates: Gating Aware Scheduling and Power Gating for GPGPUs
Architecture & Organization 1
Architecture & Organization 1
An Automated Design Flow for 3D Microarchitecture Evaluation
COMS 361 Computer Organization
Presentation transcript:

3D Interconnect: Architectural Challenges and Opportunities UC SANTA BARBARA Tim Sherwood

The Role of Architecture Applications Runtime System Architecture Circuit Device Package SW HW Constraints Demands 3D Integration (Noise, Thermal, Yield) (Battery Life, Performance, Programmability )

Lab Overview Intrusion Detection System Server Farm Processor Core Caches, etc. Prototype Acceleration Primitives High Speed Programmable Routers b2 b1 b0 b4 { 2 } b5 b9 b8 { 2,7 } b7 b6 { 2,5 } b3 Intrusion Detection and Prevention Adaptive Hardware Profiling Engines integrated On-Chip Memory Hierarchy Software Defined Wireless Access Point Reconfigurable Security on FPGAs High Throughput MEMS controllers

Lab Overview Software Defined Wireless Access Point Intrusion Detection System Server Farm Processor Core Caches, etc. Prototype Acceleration Primitives High Speed Programmable Routers b2 b1 b0 b4 { 2 } b5 b9 b8 { 2,7 } b7 b6 { 2,5 } b3 Intrusion Detection and Prevention Adaptive Hardware Profiling Engines integrated On-Chip Memory Hierarchy Reconfigurable Security on FPGAs High Throughput MEMS controllers

Potential for Impact from 3D Software Defined Wireless Access Point Intrusion Detection System Server Farm Processor Core Caches, etc. Prototype Acceleration Primitives High Speed Programmable Routers b2 b1 b0 b4 { 2 } b5 b9 b8 { 2,7 } b7 b6 { 2,5 } b3 Intrusion Detection and Prevention Adaptive Hardware Profiling Engines integrated On-Chip Memory Hierarchy Reconfigurable Security on FPGAs High Throughput MEMS controllers 3D Specialization 3D Bandwidth 3D Integration for Latency 3D Integration for Mixed Signal 3D Integration for Mixed Technology 3D Specialization

Potential for Impact from 3D Software Defined Wireless Access Point Intrusion Detection System Server Farm Processor Core Caches, etc. Prototype Acceleration Primitives High Speed Programmable Routers b2 b1 b0 b4 { 2 } b5 b9 b8 { 2,7 } b7 b6 { 2,5 } b3 Intrusion Detection and Prevention Adaptive Hardware Profiling Engines integrated On-Chip Memory Hierarchy Reconfigurable Security on FPGAs High Throughput MEMS controllers 3D Specialization 3D Bandwidth 3D Integration for Latency 3D Integration for Mixed Signal 3D Integration for Mixed Technology 3D Specialization

Presented Works Shashidhar Mysore, Banit Agrawal, Sheng-Chih Lin, Navin Srivastava, Kaustav Banerjee and Timothy Sherwood. Introspective 3D Chips, Proceedings of the Twelfth International Conference on Architectural Support for Programming Languages and Operating Systems ( ASPLOS ), October San Jose, CA Gian Luca Loi, Banit Agrawal, Navin Srivastava, Sheng-Chih Lin, Timothy Sherwood, Kaustav Banerjee. A Thermally- Aware Performance Analysis of Vertically Integrated (3-D) Processor-Memory Hierarchy, Proceedings of the 43nd Design Automation Conference ( DAC ), June San Francisco, CA

Two Specific Opportunities 1) 3D Integration for Performance  Bring Memory Closer to those that use it  More Bandwidth and Lower Latency  Tricky System Level Tradeoffs 2 ) 3D Integration for Specialization  Integration offers unique specialization opportunity  Decouple commodity from niche The ramifications of any radical change requires a careful evaluation that considers all the parameters

temp package total power dynamic power V utilized area communication A Simple Performance “Ecosystem” parallelismfreq leakage app OS or runtime feedback performance No multicore, no spatial variance, no temporal variance, no metrics of cost or error or yield

Two Specific Opportunities 1) 3D Integration for Performance  Bring Memory Closer to those that use it  More Bandwidth and Lower Latency  Tricky System Level Tradeoffs 2 ) 3D Integration for Specialization  Integration offers unique specialization opportunity  Decouple commodity from niche The ramifications of any radical change requires a careful evaluation that considers all the parameters

Basic Savings in 3D Area: 4 Dist: √8 ≈ 2.8 Area: 2 Dist: √4 ≈ 2 + 1L Area: 1 Dist: √2 ≈ L BW: √8 ≈ 2.8 BW: 2√4 ≈ 4 BW: 4√2 ≈ 5.6 On-chip Latency improved, Bandwidth could improve more What about real wires? What about apps? What about temp?

Example Technology Node Banerjee et al. IEEE 2001

3D Wire Delay x D e l a y ( S e c ) Wire length L(um) Vertical via model Horizontal line model Horizontal wire length L Distributed RC delay Vertical wire length

A “Typical” 2D System Design DRAM CPU core L2Unified Cache L2to Main Memory External Bus Board L1 I-Cache L1 D-Cache Memory Controller Memory Bottleneck

A 3D Memory System CPU core L1I-Cache L1D-Cache L2Unified Cache L2to Main Memory vertical interlayer Bus L1to L2vertical interlayer Bus Layer1 2 3to18 Stacked three dimensional main memory 8 bytes to 128 bytes 200 Mhz to 2 Ghz

System-Level Simulation Simulator : Sim-Alpha simulator Processor : Alpha processor Benchmarks: mcf, parser, twolf with Minnespec reduced inputs % main memory access per instruction mcf parser twolf 1.7% % %

Effect of Bus Width and Frequency Execution time (sec) L2 cache size in KBytes 8 bytes bus width (3-D) 16 bytes bus width (3-D) 32 bytes bus width (3-D) 64 bytes bus width (3-D) 128 bytes bus width (3-D) 8 bytes bus width (2-D) mcf Only a few vias required

Effect of Clock Frequency : mcf

Effect of Clock Frequency : parser

Effect of Clock Frequency : twolf

An Example Memory System

Self-consistent Thermal Modeling Insert the initials values of leakage and dynamic power for each layer Calculate the first thermal profile Based on the previous thermal profile calculate the new power dissipation considering I on decrease with temperature I Leakage increase with temperature Calculate the new temperature profile Finish Yes No Is it convergent?

3D Thermally-aware Performance Analysis mcf Temperature constraint Min execution time in2-D 3-D 3-D max chip temperature 2-D max chip temperature T e m p e r a t u r e ( K ) E x e c u t i o n t i m e p e r i n s t r u c t i o n

3D Thermally-aware Performance Analysis twolf Frequency in MHz Maximum frequency allowed due to temperatureconstraint Min execution time in3-D 2-D Temperature constraint 2-D max chip temperature 3-D max chip temperature T e m p e r a t u r e ( K ) E x e c u t i o n t i m e p e r i n s t r u c t i o n

3D Memory Integration Many Unaccounted For Effects  Effect of Multiple Cores and Memory Banks  Spatial Variation  Temporal Variation (thermal load balancing)  All of these are intimately tied to the integration method and packaging How to Manage  Architecture and Software will be increasingly involved  Exposing Variation to higher levels  Huge demand for “models”, “sensors”, and “knobs”  Thermal, Packaging, Application, Architecture all tangled  Need to build models that capture all of these aspects  Models need to be “self consistent”

Two Specific Opportunities 1) 3D Integration for Performance  Bring Memory Closer to those that use it  More Bandwidth and Lower Latency  Tricky System Level Tradeoffs 2 ) 3D Integration for Specialization  Integration offers unique specialization opportunity  Decouple commodity from niche The ramifications of any radical change requires a careful evaluation that considers all the parameters

3D Integration for Introspection Complex interactions across levels of abstraction make debugging, optimizing, securing, and analysis in general difficult The first requirement – visibility  Not just data capture, we need the ability to put together a cohesive picture of system interactions and correlate between them in a sound and non-intrusive manner The hardware/software boundary is uniquely situated  Piece together from low level events What would the programmer wish list look like?

To Integrated Monitoring Hardware L1_BPU Decode Trace Cache Top L2_BPU Bus Control MOBITLB Trace Cache Bottom DTLB L1 Cache Top L2 Cache L1 Cache Bottom FP Exec UROM FP Reg Alloc Rename Instr Q1 Sched Instr Q2 Int Reg Retire Int Exec Mem Ctl What programmers want 32 bit Memory Address 32 bit Memory Value 10 bit Opcodes 2, 5 bit Register Names 2, 32 bit Register Values 10 bits of “status” Everything. 3x 4x 1892 bits per cycle = 1 terrabyte / 4 Ghz

Why programmers cant have it Interconnect is not free  Huge cross chip busses  OptBuf 285um  20,000 buffers Analysis is not free  Significant processing required Extra cost of added heat  $15 budget for cooling Used by developers To Integrated Monitoring Hardware L1_BPU Decode Trace Cache Top L2_BPU Bus Control MOBITLB Trace Cache Bottom DTLB L1 Cache Top L2 Cache L1 Cache Bottom FP Exec UROM FP Reg Alloc Rename Instr Q1 Sched Instr Q2 Int Reg Retire Int Exec Mem Ctl

Cake + Eating It Too Need a way to provide cheap (or high margin) HW to the masses  No paying for developer functionality Get developers the powerful analysis they crave  See everything at executable rate Provide “snap-on” functionality for developers  Separate chip for analysis engine  Only hook it onto “developer” systems Idea is not limited to development systems  Security, Error Correction, Confidentiality, Accelerators, … 3d Integration offers the potential

Thermal Impact

Conclusion: Opportunities+Challenges 3D Integration for Performance  Bring Memory Closer to those that use it  More Bandwidth and Lower Latency  Requires few vias for big impact  Tricky System Level Tradeoffs 3D Integration for Specialization  Integration offers unique specialization opportunity  Requires rethinking of integration process  Decouple commodity from niche Challenges  Cross layer models: from app to package  Cross layer optimization: both static and dynamic  Thermal Management is everybody's problem

NSF CNS , NSF CCF , NSF CCF

Related Work Bryan Black, Murali M. Annavaram, Edward Brekelbaum, John DeVale, Gabriel H. Loh, Lei Jiang, Don McCauley, Pat Morrow, Don Nelson, Daniel Pantuso, Paul Reed, Jeff Rupley, Sadasivan Shankar, John Paul Shen, Clair Webb, "Die Stacking (3D) Microarchitecture," in IEEE International Symposium on Microarchitecture, , PUBLICATIONS on 3D STACKED IC 1. Karthik Balakrishnan, Vidit Nanda, Siddharth Easwar, and Sung Kyu Lim, "Wire Congestion And Thermal Aware 3D Global Placement," IEEE/ACM Asia South Pacific Design Automation Conference, p , (pdf) 2. Jacob Minz, Sung Kyu Lim, and Cheng-Kok Koh, "3D Module Placement for Congestion and Power Noise Reduction," ACM Great Lake Symposium on VLSI, p , (pdf) 3. Jacob Minz, Eric Wong, and Sung Kyu Lim, "Reliability-aware Floorplanning for 3D Circuits," to appear in IEEE International SOC Conference, (pdf) 4. Kiran Puttaswamy and Gabriel H. Loh, "Implementing Caches in a 3D Technology for High Performance Processors", IEEE International Conference on Computer Design, pp , (pdf) 5. Eric Wong and Sung Kyu Lim, "3D Floorplanning with Thermal Vias," to appear in Design, Automation and Test in Europe, Kiran Puttaswamy and Gabriel H. Loh, "Implementing Register Files for High-Performance Microprocessors in a Die-Stacked (3D) Technology," IEEE International Symposium on VLSI, pp , (pdf) 7. Kiran Puttaswamy and Gabriel H. Loh, "The Impact of 3-Dimenstional Integration on the Design of Arithmetic Units," IEEE International Symposium on Circuits and Systems, pp , (pdf) 8. Kiran Puttaswamy and Gabriel H. Loh, "Thermal Analysis of a 3D Die-Stacked High-Performance Microprocessor," ACM/IEEE Great Lakes Symposium on VLSI, 19-24, (pdf) 9. Kiran Puttaswamy and Gabriel H. Loh, "Dynamic Instruction Schedulers in a 3-Dimensional Integration Technology," ACM/IEEE Great Lakes Symposium on VLSI, , (pdf) 10. Yuan Xie, Gabriel H. Loh, Bryan Black and Kerry Bernstein, "Design Space Exploration for 3D Architectures," ACM Journal on Emerging Technologies in Computing Systems, vol.2(2), pp , (pdf) 11. Eric Wong, Jacob Minz, and Sung Kyu Lim, "Decoupling Capacitor Planning and Sizing for Noise and Leakage Reduction," to appear in IEEE International Conference on Computer Aided Design, Bryan Black, Murali M. Annavaram, Edward Brekelbaum, John DeVale, Gabriel H. Loh, Lei Jiang, Don McCauley, Pat Morrow, Don Nelson, Daniel Pantuso, Paul Reed, Jeff Rupley, Sadasivan Shankar, John Paul Shen, Clair Webb, "Die Stacking (3D) Microarchitecture," in IEEE International Symposium on Microarchitecture, , Kiran Puttaswamy, Gabriel H. Loh, "Thermal Herding: Microarchitecture Techniques for Controlling HotSpots in High-Performance 3D-Integrated Processors," in IEEE International Symposium on High-Performance Computer Architecture, Kiran Puttaswamy, Gabriel H. Loh, "Scalability of 3D-Integrated Arithmetic Units in High-Performance Microprocessors," to appear in ACM Design Automation Conference, PUBLICATIONS on MICRO-ARCHITECTURAL FLOORPLANNING 1. Mongkol Ekpanyapong, Jacob Minz, Thaisiri Watewai, Hsien-Hsin S. Lee, and Sung Kyu Lim, "Profile-Guided Microarchitectural Floorplanning for Deep Submicron Processor Design," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 25, No. 7, pp , (pdf) 2. Mongkol Ekpanyapong, Jacob Minz, Thaisiri Watewai, Hsien-Hsin S. Lee, and Sung Kyu Lim, "Profile-Guided Microarchitectural Floorplanning for Deep Submicron Processor Design," ACM Design Automation Conference, p , (pdf) 3. Mongkol Ekpanyapong, Sung Kyu Lim, Chinnakrishnan Ballapuram, and Hsien-Hsin S. Lee, "Wire-driven Microarchitectural Design Space Exploration," IEEE International Symposium on Circuits and Systems, p , (pdf) 4. Michael Healy, Mario Vittes, Mongkol Ekpanyapong, Chinnakrishnan Ballapuram, Sung Kyu Lim, Hsien-Hsin S. Lee, and Gabriel H. Loh, "Microarchitectural Floorplanning Under Performance and Temperature Tradeoff," to appear in Design, Automation and Test in Europe, Michael Healy, Mario Vittes, Mongkol Ekpanyapong, Chinnakrishnan Ballapuram, Sung Kyu Lim, Hsien-Hsin S. Lee, and Gabriel H. Loh, "Multi-Objective Microarchitectural Floorplanning For 2D And 3D ICs," to appear in IEEE Transactions on Computer-Aided Design of Integrated Ciruits and Systems. 6. Fayez Mohamood, Michael Healy, Sung Kyu Lim, and Hsien-Hsin S. Lee, "A Floorplan-Aware Dynamic Inductive Noise Controller for Reliable Processor Design," to appear in IEEE/ACM International Symposium on Microarchitecture, Fayez Mohamood, Michael Healy, Hsien-Hsin Lee, and Sung Kyu Lim, "Noise-Direct: A Technique for Power Supply Noise Aware Floorplanning Using Microarchitecture Profiling," to appear in IEEE/ACM Asia South Pacific Design Automation Conference, PUBLICATIONS on 3D PACKAGING 1. Jacob Minz and Sung Kyu Lim, "Layer Assignment for System-on-Packages," ACM/IEEE Asia and South Pacific Design Automation Conference, p31-37, (pdf) 2. Jacob Minz, Mohit Pathak, and Sung Kyu Lim, "Net and Pin Distribution for 3D Package Global Routing," Design, Automation and Test in Europe, p , (pdf) 3. Ramprasad Ravichandran, Jacob Minz, Mohit Pathak, Siddharth Easwar, and Sung Kyu Lim, "Physical Layout Automation for System-On-Packages," IEEE Electronic Components and Technology Conference, p41-48, (pdf) 4. Pun Hang Shiu, Ramprasad Ravichandran, Siddharth Easwar, and Sung Kyu Lim, "Multi-layer Floorplanning for Reliable System-on-Package," IEEE International Symposium on Circuits and Systems, p69-72, (pdf) 5. Jacob Minz, Sung Kyu Lim, Jinwoo Choi, and Madhavan Swaminathan, "Module Placement for Power Supply Noise and Wire Congestion Avoidance in 3D Packaging," IEEE Electrical Performance of Electronic Packaging, p , (pdf) 6. Jacob Minz and Sung Kyu Lim, "A Global Router for System-on-Package Targeting Layer and Crosstalk Minimization," IEEE Electrical Performance of Electronic Packaging, p99-102, (pdf) 7. Jacob Minz, Eric Wong, and Sung Kyu Lim, "Thermal and Crosstalk-Aware Physical Design For 3D System-On-Package," IEEE Electronic Components and Technology Conference, P , (pdf) 8. Eric Wong, Jacob Minz, and Sung Kyu Lim, "Power Noise-aware 3D Floorplanning for System-On-Package," to appear in IEEE Electrical Performance of Electronic Packaging, (pdf) 9. Sung Kyu Lim, "Physical Design for 3D System-On-Package: Challenges and Opportunities," IEEE Design & Test of Computers, Vol. 22, No. 6, p , (pdf) 10. Jacob Minz, Eric Wong, Mohit Pathak, and Sung Kyu Lim, "Placement and Routing for 3D System-On-Package Designs," to appear in IEEE Transactions on Components and Packaging Technologies. 11. Jacob Minz and Sung Kyu Lim, "Block-level 3D Global Routing With an Application to 3D Packaging," to appear in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 12. Jacob Minz, Somaskanda Thyagaraja, and Sung Kyu Lim, "Optical Routing for 3D System-On-Package," to appear in Design, Automation and Test in Europe, Eric Wong, Jacob Minz, and Sung Kyu Lim, "White Space Management for Thermal Via and Decoupling Capacitor Insertion Targeting 3D System-On-Package," to appear in IEEE Electronic Components and Technology Conference, Eric Wong, Jacob Minz, and Sung Kyu Lim, "Multi-objective Module Placement For 3D System-On-Package," IEEE Transactions on Very Large Scale Integration Systems, Vol. 14, No. 5, pp , 2006