Computer Engineering Department Research Profile Dr. Sadiq M. Sait Computer Engineering Department King Fahd University of Petroleum & Minerals Dr. Sadiq.

Slides:



Advertisements
Similar presentations
Master/Slave Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.
Advertisements

CSE 6590 Department of Computer Science & Engineering York University 1 Introduction to Wireless Ad-hoc Networking 5/4/2015 2:17 PM.
Types of Parallel Computers
1 Computer Engineering Department College of Computer Sciences and Engineering Tuesday 18 November 2008 King Fahd University of Petroleum & Minerals.
Zheming CSCE715.  A wireless sensor network (WSN) ◦ Spatially distributed sensors to monitor physical or environmental conditions, and to cooperatively.
Seven Minute Madness: Special-Purpose Parallel Architectures Dr. Jason D. Bakos.
1 HW/SW Partitioning Embedded Systems Design. 2 Hardware/Software Codesign “Exploration of the system design space formed by combinations of hardware.
Spring 08, Jan 15 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Spring 07, Jan 16 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Emerging Trends in Computer Science Dr. Gurvinder Singh Reader, Deptt of Computer Science & Engineering, GNDU, Amritsar.
1 Multi - Core fast Communication for SoPC Multi - Core fast Communication for SoPC Technion – Israel Institute of Technology Department of Electrical.
October 8, th Asian Test Symposium 2007, Biejing, China XXXXX00XXX XX000101XXXXXXXXXXXXX0XXX1X0 101XXX1011XXXXXX0XXX XX000101XXXXXXXXXXXXX0XXX1XX.
Assignment II Integrated Circuits Design Ping-Hsiu Lee Reagan High School, Houston I. S. D. Deborah Barnett Tidehaven High School, Tidehaven I. S. D. Faculty.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Parallel Programming in C with MPI and OpenMP Michael J. Quinn.
1 Dr. Frederica Darema Senior Science and Technology Advisor NSF Future Parallel Computing Systems – what to remember from the past RAMP Workshop FCRC.
CAD and Design Tools for On- Chip Networks Luca Benini, Mark Hummel, Olav Lysne, Li-Shiuan Peh, Li Shang, Mithuna Thottethodi,
Iterative Algorithms for Low Power VLSI Placement Sadiq M. Sait, Ph.D Department of Computer Engineering King Fahd University of Petroleum.
The Memory/Logic Interface in FPGA’s with Large Embedded Memory Arrays The Memory/Logic Interface in FPGA’s with Large Embedded Memory Arrays Steven J.
ENEE 644 Dr. Ankur Srivastava Office: 1349 A.V. Williams URL: Computer-Aided Design of.
Undergraduate Poster Presentation Match 31, 2015 Department of CSE, BUET, Dhaka, Bangladesh Wireless Sensor Network Integretion With Cloud Computing H.M.A.
1b.1 Types of Parallel Computers Two principal approaches: Shared memory multiprocessor Distributed memory multicomputer ITCS 4/5145 Parallel Programming,
Information and Computer Science Department Research Profile Information and Computer Science Department Research Profile Dr. Sadiq M. Sait Information.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
Juanjo Noguera Xilinx Research Labs Dublin, Ireland Ahmed Al-Wattar Irwin O. Irwin O. Kennedy Alcatel-Lucent Dublin, Ireland.
Embedded Systems Design ICT Embedded System What is an embedded System??? Any IDEA???
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
Low-Power Wireless Sensor Networks
CAD for Physical Design of VLSI Circuits
Energy saving in multicore architectures Assoc. Prof. Adrian FLOREA, PhD Prof. Lucian VINTAN, PhD – Research.
Chapter 3 Parallel Algorithm Design. Outline Task/channel model Task/channel model Algorithm design methodology Algorithm design methodology Case studies.
Computer Engineering Department College of Computer Sciences and Engineering.
ASIP Architecture for Future Wireless Systems: Flexibility and Customization Joseph Cavallaro and Predrag Radosavljevic Rice University Center for Multimedia.
Computer Engineering Department College of Computer Science and Engineering King Fahd University of Petroleum & Minerals.
Tinoosh Mohsenin and Bevan M. Baas VLSI Computation Lab, ECE Department University of California, Davis Split-Row: A Reduced Complexity, High Throughput.
J. Christiansen, CERN - EP/MIC
Chonnam national university VLSI Lab 8.4 Block Integration for Hard Macros The process of integrating the subblocks into the macro.
ELEC692/04 course_des 1 ELEC 692 Special Topic VLSI Signal Processing Architecture Fall 2004 Chi-ying Tsui Department of Electrical and Electronic Engineering.
Super computers Parallel Processing By Lecturer: Aisha Dawood.
RF network in SoC1 SoC Test Architecture with RF/Wireless Connectivity 1. D. Zhao, S. Upadhyaya, M. Margala, “A new SoC test architecture with RF/wireless.
CSC 7600 Lecture 28 : Final Exam Review Spring 2010 HIGH PERFORMANCE COMPUTING: MODELS, METHODS, & MEANS FINAL EXAM REVIEW Daniel Kogler, Chirag Dekate.
Mobile Agent Migration Problem Yingyue Xu. Energy efficiency requirement of sensor networks Mobile agent computing paradigm Data fusion, distributed processing.
EE3A1 Computer Hardware and Digital Design
DIPARTIMENTO DI ELETTRONICA E INFORMAZIONE Novel, Emerging Computing System Technologies Smart Technologies for Effective Reconfiguration: The FASTER approach.
An Architecture and Prototype Implementation for TCP/IP Hardware Support Mirko Benz Dresden University of Technology, Germany TERENA 2001.
Lecture 4 TTH 03:30AM-04:45PM Dr. Jianjun Hu CSCE569 Parallel Computing University of South Carolina Department of.
Computer Engineering Department (KFUPM) Computer Engineering Department Sadiq M. Sait College of Computer Sciences and Engineering.
Tackling I/O Issues 1 David Race 16 March 2010.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 April 28, 2005 Session 29.
Copyright: All rights reserved.Not to be reproduced by any means without prior permission 1 Coordinator Dr. Sadiq M. Sait Department of Computer Engineering.
VU-Advanced Computer Architecture Lecture 1-Introduction 1 Advanced Computer Architecture CS 704 Advanced Computer Architecture Lecture 1.
Introduction to Performance Tuning Chia-heng Tu PAS Lab Summer Workshop 2009 June 30,
QM/BUPT Joint Programme
Microarchitecture.
ELEC 7770 Advanced VLSI Design Spring 2016 Introduction
Architecture & Organization 1
Algorithms for Big Data Delivery over the Internet of Things
University of Technology
Parallel Programming in C with MPI and OpenMP
ELEC 7770 Advanced VLSI Design Spring 2014 Introduction
Architecture & Organization 1
ELEC 7770 Advanced VLSI Design Spring 2012 Introduction
ELEC 7770 Advanced VLSI Design Spring 2010 Introduction
A High Performance SoC: PkunityTM
Chapter 1 Introduction.
Esam Ali Khan M.S. Thesis Defense
Department of Electrical Engineering Joint work with Jiong Luo
Parallel Programming in C with MPI and OpenMP
Seminar Tittles 1-Modeling and Optimization of soft-error reliability of Sequential circuits. 2-Statistical Estimation of Sequential Circuit Activities.
Test Data Compression for Scan-Based Testing
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Computer Engineering Department Research Profile Dr. Sadiq M. Sait Computer Engineering Department King Fahd University of Petroleum & Minerals Dr. Sadiq M. Sait Computer Engineering Department King Fahd University of Petroleum & Minerals

Computer Engineering Faculty n 20 Professorial Rank faculty members 2 Full Professor 2 Associate Professor 16 Assistant Professor n 6 lecturers n 20 Professorial Rank faculty members 2 Full Professor 2 Associate Professor 16 Assistant Professor n 6 lecturers

COE Research Areas n Data Communications & Computer Networks. n Computer Applications: Robotics, Interfacing, Data acquisition, Machine learning, Data Mining. n Digital Design Automation & VLSI System Design & Test. n Computer Architecture & Parallel Processing. n Computer Arithmetic & Cryptography. n Data Communications & Computer Networks. n Computer Applications: Robotics, Interfacing, Data acquisition, Machine learning, Data Mining. n Digital Design Automation & VLSI System Design & Test. n Computer Architecture & Parallel Processing. n Computer Arithmetic & Cryptography.

COE Recent Research Projects: Data Communications & Computer Networks n Wireless Multi-hop Voice over IP over Wi-Fi using Client-Server UDP (user datagram protocol). n Mobile Patient using sensor network. n Wireless Local Area Networks Integration for Mobile Networks Operators. n eTourism Promoter – An Internet Assisted Location Tracker and Map Reader for Tourists. n A Framework for Integration of Web-based Network Management and Management by Delegation. n Radio Resource Management and QoS Control for Wireless Integrated Services Networks. n Adaptive TCP Mechanisms for Wireless Networks. n Engineering Modern Iterative Heuristics to Solve Hard Computer Network Design Problems. n Wireless Multi-hop Voice over IP over Wi-Fi using Client-Server UDP (user datagram protocol). n Mobile Patient using sensor network. n Wireless Local Area Networks Integration for Mobile Networks Operators. n eTourism Promoter – An Internet Assisted Location Tracker and Map Reader for Tourists. n A Framework for Integration of Web-based Network Management and Management by Delegation. n Radio Resource Management and QoS Control for Wireless Integrated Services Networks. n Adaptive TCP Mechanisms for Wireless Networks. n Engineering Modern Iterative Heuristics to Solve Hard Computer Network Design Problems.

COE Recent Research Projects: Computer Applications n Design of a wireless safety system for smart kitchen. n Predicting log properties from seismic data using abductive networks. n Design of an Intelligent Tele-robotic System. n Designing and building a mobile emergency warning system for patients under health care. n Context aware energy management system. n Design of a wireless safety system for smart kitchen. n Predicting log properties from seismic data using abductive networks. n Design of an Intelligent Tele-robotic System. n Designing and building a mobile emergency warning system for patients under health care. n Context aware energy management system.

COE Recent Research Projects: Design Automation & VLSI System Design & Test. n Iterative Heuristics for Timing & Low Power VLSI Standard Cell Placement. n Parallelization of Iterative Heuristics for Low Power VLSI Standard Cell Placement. n Efficient Test Relaxation Based Static Test Compaction Techniques for Combinational and Sequential Circuits. n Efficient Test Data Compression Techniques for Testing Systems-on-Chip. n Segmented Addressable Scan Architecture for Effective Test Data Compression. n Iterative Heuristics for Timing & Low Power VLSI Standard Cell Placement. n Parallelization of Iterative Heuristics for Low Power VLSI Standard Cell Placement. n Efficient Test Relaxation Based Static Test Compaction Techniques for Combinational and Sequential Circuits. n Efficient Test Data Compression Techniques for Testing Systems-on-Chip. n Segmented Addressable Scan Architecture for Effective Test Data Compression.

COE Recent Research Projects: Design Automation & VLSI System Design & Test. n Development of Digital Circuit Techniques for Clock Recovery and Data Re-Timing for High Speed NRZ Source-Synchronous Serial Data Communications. n Fast context switching configurable architectures supporting dynamic reconfiguration for computation intensive applications. n Development of Integrated Micro-electronic Heavy Metal Sensors for Environmental Applications. n Multi-objective Finite State Machine Encoding using Non-Deterministic Evolutionary Algorithms targeting area, low power and testability. n Design and Implementation of Scalable Interconnect Efficient LDPC Error Correcting Codes. n Development of Digital Circuit Techniques for Clock Recovery and Data Re-Timing for High Speed NRZ Source-Synchronous Serial Data Communications. n Fast context switching configurable architectures supporting dynamic reconfiguration for computation intensive applications. n Development of Integrated Micro-electronic Heavy Metal Sensors for Environmental Applications. n Multi-objective Finite State Machine Encoding using Non-Deterministic Evolutionary Algorithms targeting area, low power and testability. n Design and Implementation of Scalable Interconnect Efficient LDPC Error Correcting Codes.

Parallelizing Non-Deterministic Iterative Heuristics to Solve VLSI CAD Problems n CAD Problems such as Floorplanning, Placement, Routing, Scheduling, etc., require an enormous amount of computation time. n Iterative Heuristics such as Genetic Algorithms, Tabu Search, Simulated Evolution, and others have been found effective in solving several NP-hard optimization problems. n Objective: To use a cluster of PCs to solve multi- objective VLSI CAD problems in order to improve quality and reduce run-time. n CAD Problems such as Floorplanning, Placement, Routing, Scheduling, etc., require an enormous amount of computation time. n Iterative Heuristics such as Genetic Algorithms, Tabu Search, Simulated Evolution, and others have been found effective in solving several NP-hard optimization problems. n Objective: To use a cluster of PCs to solve multi- objective VLSI CAD problems in order to improve quality and reduce run-time.

Approach: To employ a Cluster of PCs to Distribute Computationally Intensive Tasks n Clusters of low end PCs are easy to build. n Tools such as MPI and PVM are available for message passing. n Tools such as gprof, Intel’s VTUNE Performance Analyzer, etc., are used for generating profiles for serial codes and determining the part of the code that has the bottlenecks. n Iterative algorithms are non-deterministic, and dividing work load, i.e. partitioning the search space, is a challenge. n The parallelizing model (i.e., Partitioning, Communication, Agglomeration and Mapping) is very well-defined for numerical problems, which are mostly deterministic. This is not the case for Iterative heuristics, which are non-deterministic. n Clusters of low end PCs are easy to build. n Tools such as MPI and PVM are available for message passing. n Tools such as gprof, Intel’s VTUNE Performance Analyzer, etc., are used for generating profiles for serial codes and determining the part of the code that has the bottlenecks. n Iterative algorithms are non-deterministic, and dividing work load, i.e. partitioning the search space, is a challenge. n The parallelizing model (i.e., Partitioning, Communication, Agglomeration and Mapping) is very well-defined for numerical problems, which are mostly deterministic. This is not the case for Iterative heuristics, which are non-deterministic.

Tools used in our Current Cluster n MPICH Library provides a flexible implementation of MPI for easier message-passing interface development on multiple network architectures. n Intel® Trace Collector 5.0 applies event-based tracing in cluster applications with a low-overhead library. Offers performance data, recording of statistics, multi-threaded traces, and automatic instrumentation of binaries on IA-32. n Intel® Trace Analyzer 4.0 provides visual analysis of application activities gathered by the Intel Trace Collector. n TotalView (MPICH) is also used for observing communication between processors. n Also used in Condor (for scheduling jobs on the cluster). n MPICH Library provides a flexible implementation of MPI for easier message-passing interface development on multiple network architectures. n Intel® Trace Collector 5.0 applies event-based tracing in cluster applications with a low-overhead library. Offers performance data, recording of statistics, multi-threaded traces, and automatic instrumentation of binaries on IA-32. n Intel® Trace Analyzer 4.0 provides visual analysis of application activities gathered by the Intel Trace Collector. n TotalView (MPICH) is also used for observing communication between processors. n Also used in Condor (for scheduling jobs on the cluster).

Relationship to Intel’s R&D n COE Department has faculty experienced in VLSI Design. n Two books in the area of iterative algorithms and VLSI Design have been authored by the department faculty. n The Technology Center being proposed in RI will have the state-of-art tools and equipment. n Faculty and students currently interested in HPC and parallelization of heuristics can work together to address industrial and real-world problems. n COE Department has faculty experienced in VLSI Design. n Two books in the area of iterative algorithms and VLSI Design have been authored by the department faculty. n The Technology Center being proposed in RI will have the state-of-art tools and equipment. n Faculty and students currently interested in HPC and parallelization of heuristics can work together to address industrial and real-world problems.

Efficient Test Compaction & Compression Techniques for Comb. & Seq. Circuits n SOC Testing Challenges Reduce amount of test data. Reduce time a defective chip spends on a tester. n Test Compaction & Compression Reduce the size of a test set as much as possible. n Test vector reordering for combinational circuits. Steepen the curve of fault coverage vs. number of test vectors. n SOC Testing Challenges Reduce amount of test data. Reduce time a defective chip spends on a tester. n Test Compaction & Compression Reduce the size of a test set as much as possible. n Test vector reordering for combinational circuits. Steepen the curve of fault coverage vs. number of test vectors.

Efficient Test Compaction & Compression Techniques for Comb. & Seq. Circuits n Efficient Test Relaxation for Combinational & Sequential circuits Enabling technology for test Compaction & Compression Test power reduction n Developed efficient test compaction techniques based on test relaxation. n Test Vector Decomposition Maximizes test compaction by vector clustering techniques Maximizes test width-based compression techniques. n Efficient Test Relaxation for Combinational & Sequential circuits Enabling technology for test Compaction & Compression Test power reduction n Developed efficient test compaction techniques based on test relaxation. n Test Vector Decomposition Maximizes test compaction by vector clustering techniques Maximizes test width-based compression techniques X0XX100X X11XX0X10 0XXXX0XX1 XXXX111XX X0X01XXXX Test Relax X1XXX1X0X XXX00XXX1 1XXX1X0XX

Segmented Addressable Scan: Scan Test Challenges n Test data volume challenge Limited IOs & unlimited increase in transistors Exponential increase in test data volume n Tester pin count challenge Tester cost is almost linear in number of pins n Test time challenge Critical path Hard to parallelize test loading massively n Test power challenge High activity leading to high power consumption. n Test data volume challenge Limited IOs & unlimited increase in transistors Exponential increase in test data volume n Tester pin count challenge Tester cost is almost linear in number of pins n Test time challenge Critical path Hard to parallelize test loading massively n Test power challenge High activity leading to high power consumption.

Segmented Addressable Scan  Aggressive parallelization of scan chains  Reconfigurable partial compatibilities  Special SAS decoder  Overhead: few gates per scan chain  Data volume: 10x ~ 20x compression with small designs for both SAF and TDF Bigger designs have higher compression  Pin count: 2  log 2 S  +1 pins, can be reduced to ONLY 2  Test time: aggressive parallelization  test time reduction  Power consumption selective clocking

Test Data Volume & Test Time (Delay test) Total Data Volume98 MbCompression Ratio SAS Data Volume 32 Segments 7.7 Mb12x 64 Segments 5.3 Mb17x 128 Segments 4.5 Mb22x 256 Segments 3.6 Mb27x $Ms of annual test cost savings

COE Recent Research Projects: Computer Architecture & Parallel Processing n Load Balancing for Parallel Visualization of Blood Head Vessel Angiography on Cluster of PCs. n Shared Channels in Interconnection Networks. n Study of modified Multistage Interconnection Networks for Networks-on-Chips. n Design of a Simulator for a Class of Dynamic Execution Processors. n Beyond Instruction-Level Parallelism in Processor Architecture. n Design and Performance Evaluation of a Distributed Crossbar Scheduler. n Software Pipelining for Reconfigurable Instruction Set Processors. n Load Balancing for Parallel Visualization of Blood Head Vessel Angiography on Cluster of PCs. n Shared Channels in Interconnection Networks. n Study of modified Multistage Interconnection Networks for Networks-on-Chips. n Design of a Simulator for a Class of Dynamic Execution Processors. n Beyond Instruction-Level Parallelism in Processor Architecture. n Design and Performance Evaluation of a Distributed Crossbar Scheduler. n Software Pipelining for Reconfigurable Instruction Set Processors.

COE Recent Research Projects: Computer Arithmetic & Cryptography n High-Performance Arithmetic for Cryptographic Applications. n Design of efficient integrated circuits for the inverse computation in different finite fields. n Design of Elliptic Curve Cryptography Architectures using parallel multipliers. n Secure reliable storage system. n Design, Analysis, and FPGA prototyping of High- Performance Arithmetic for Cryptographic Applications. n High-Performance Arithmetic for Cryptographic Applications. n Design of efficient integrated circuits for the inverse computation in different finite fields. n Design of Elliptic Curve Cryptography Architectures using parallel multipliers. n Secure reliable storage system. n Design, Analysis, and FPGA prototyping of High- Performance Arithmetic for Cryptographic Applications.

Computer Engineering Faculty Research Profile