The Case of Software Defined Radio with MSRA (work-in-progress) Yongguang Zhang with Kun Tan, Fan Yang, Jiansong Zhang, Haitao Wu, Chunyi Peng, Songwu.

Slides:



Advertisements
Similar presentations
System Area Network Abhiram Shandilya 12/06/01. Overview Introduction to System Area Networks SAN Design and Examples SAN Applications.
Advertisements

Yaron Doweck Yael Einziger Supervisor: Mike Sumszyk Spring 2011 Semester Project.
Overview and Basics of Software Defined Radios INSTRUCTOR: Dr. George Collins By Praveen Kumar Chukka
Software Defined Radio Testbed Team may11-18 Members: Alex Dolan, Mohammad Khan, Ahmet Unsal Adviser: Dr. Aditya Ramamoorthy.
OpenRadio A programmable wireless dataplane Manu Bansal Stanford University Joint work with Jeff Mehlman, Sachin Katti, Phil Levis HotSDN ‘12, August 13,
Enabling MAC Protocol Implementations on Software-defined Radios George Nychis, Thibaud Hottelier, Zhuochen Yang, Srinivasan Seshan, and Peter Steenkiste.
1. Overview  Introduction  Motivations  Multikernel Model  Implementation – The Barrelfish  Performance Testing  Conclusion 2.
© ABB Group Jun-15 Evaluation of Real-Time Operating Systems for Xilinx MicroBlaze CPU Anders Rönnholm.
VIA and Its Extension To TCP/IP Network Yingping Lu Based on Paper “Queue Pair IP, …” by Philip Buonadonna.
Purpose of this Workshop We have a compelling new research platform – Enables research that has been difficult in the past – Already generating excitements.
Integrated  -Wireless Communication Platform Jason Hill.
MSD P07302End of Project Review1 Motor Controller Subsystem MSD P07302 Project Sponsor: KGCOE Project Members: D. ShenoyProject Manager S. TallauSoftware.
Wireless Terminal and PC Interface Using VLSI EE452 - Senior Project Members: Chris Brophy Matt Olinger Advisor: Dr. V. Prasad 5/2/02.
Embedded Transport Acceleration Intel Xeon Processor as a Packet Processing Engine Abhishek Mitra Professor: Dr. Bhuyan.
Trend towards Embedded Multiprocessors Popular Examples –Network processors (Intel, Motorola, etc.) –Graphics (NVIDIA) –Gaming (IBM, Sony, and Toshiba)
Ethernet Bomber Ethernet Packet Generator for network analysis Oren Novitzky & Rony Setter Advisor: Mony Orbach Started: Spring 2008 Part A final Presentation.
Ethernet Bomber Ethernet Packet Generator for network analysis Oren Novitzky & Rony Setter Advisor: Mony Orbach Spring 2008 – Winter 2009 Midterm Presentation.
Intel ® Research mote Ralph Kling Intel Corporation Research Santa Clara, CA.
Heterogeneous Computing Dr. Jason D. Bakos. Heterogeneous Computing 2 “Traditional” Parallel/Multi-Processing Large-scale parallel platforms: –Individual.
Router Architectures An overview of router architectures.
PCI-Express Network Sniffer Characterization Presentation Project Period : 2 semesters Students: Neria Wodage Aviel Tubul Advisor: Mony Orbach 17/12/2007.
- 1 - A Powerful Dual-mode IP core for a/b Wireless LANs.
Sven Ubik, Petr Žejdl CESNET TNC2008, Brugges, 19 May 2008 Passive monitoring of 10 Gb/s lines with PC hardware.
Motivation “Every three minutes a woman is diagnosed with Breast cancer” (American Cancer Society, “Detailed Guide: Breast Cancer,” 2006) Explore the use.
COLLABORATIVE EXECUTION ENVIRONMENT FOR HETEROGENEOUS PARALLEL SYSTEMS Aleksandar Ili´c, Leonel Sousa 2010 IEEE International Symposium on Parallel & Distributed.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
Multimedia & Communications ATMEL Bluetooth Background information on Bluetooth technology ATMEL implementation of Bluetooth spec.
LiNK: An Operating System Architecture for Network Processors Steve Muir, Jonathan Smith Princeton University, University of Pennsylvania
Design and Characterization of TMD-MPI Ethernet Bridge Kevin Lam Professor Paul Chow.
Implementation of Parallel Processing Techniques on Graphical Processing Units Brad Baker, Wayne Haney, Dr. Charles Choi.
COMPUTER SCIENCE &ENGINEERING Compiled code acceleration on FPGAs W. Najjar, B.Buyukkurt, Z.Guo, J. Villareal, J. Cortes, A. Mitra Computer Science & Engineering.
A 50-Gb/s IP Router 참고논문 : Craig Partridge et al. [ IEEE/ACM ToN, June 1998 ]
Helmholtz International Center for CBM – Online Reconstruction and Event Selection Open Charm Event Selection – Driving Force for FEE and DAQ Open charm:
F Networked Embedded Applications and Technologies Lab Department of Computer Science and Information Engineering National Cheng Kung University, TAIWAN.
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
The GNU in RADIO Shravan Rayanchu. SDR Getting the code close to the antenna –Software defines the waveform –Replace analog signal processing with Digital.
COT 4600 Operating Systems Fall 2009 Dan C. Marinescu Office: HEC 439 B Office hours: Tu-Th 3:00-4:00 PM.
Computer Architecture Lecture10: Input/output devices Piotr Bilski.
Tessellation: Space-Time Partitioning in a Manycore Client OS Rose Liu 1,2, Kevin Klues 1, Sarah Bird 1, Steven Hofmeyr 3, Krste Asanovic 1, John Kubiatowicz.
Developing a SDR Testbed Alex Dolan Mohammad Khan Ahmet Unsal Project Advisor Dr. Aditya Ramamoorthy.
Heterogeneous Multikernel OS Yauhen Klimiankou BSUIR
1 Abstract & Main Goal המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory The focus of this project was the creation of an analyzing device.
Performance Characterization and Architecture Exploration of PicoRadio Data Link Layer Mei Xu and Rahul Shah EE249 Project Fall 2001 Mentor: Roberto Passerone.
By V. Koutsoumpos, C. Kachris, K. Manolopoulos, A. Belias NESTOR Institute – ICS FORTH Presented by: Kostas Manolopoulos.
I/O Computer Organization II 1 Interconnecting Components Need interconnections between – CPU, memory, I/O controllers Bus: shared communication channel.
An Architecture and Prototype Implementation for TCP/IP Hardware Support Mirko Benz Dresden University of Technology, Germany TERENA 2001.
1 Presented By: Eyal Enav and Tal Rath Eyal Enav and Tal Rath Supervisor: Mike Sumszyk Mike Sumszyk.
Lecture 12: Reconfigurable Systems II October 20, 2004 ECE 697F Reconfigurable Computing Lecture 12 Reconfigurable Systems II: Exploring Programmable Systems.
Chapter 13 – I/O Systems (Pgs ). Devices  Two conflicting properties A. Growing uniformity in interfaces (both h/w and s/w): e.g., USB, TWAIN.
An Efficient Gigabit Ethernet Switch Model for Large-Scale Simulation Dong (Kevin) Jin.
Platform Abstraction Group 3. Question How to deal with different types hardware and software platforms? What detail to expose to the programmer? What.
Ethernet Bomber Ethernet Packet Generator for network analysis
Introduction or Background
Benedikt Wolz, ComNets A Combined Time and Space Division Multiple Access Scheme in a prototypical OFDM-based WiMAX Demonstrator Benedikt Wolz, Benjamin.
Cluster Computers. Introduction Cluster computing –Standard PCs or workstations connected by a fast network –Good price/performance ratio –Exploit existing.
LonWorks Introduction Hwayoung Chae.
System on a Programmable Chip (System on a Reprogrammable Chip)
Heterogeneous Processing KYLE ADAMSKI. Overview What is heterogeneous processing? Why it is necessary Issues with heterogeneity CPU’s vs. GPU’s Heterogeneous.
APE group Many-core platforms and HEP experiments computing XVII SuperB Workshop and Kick-off Meeting Elba, May 29-June 1,
Computer Networking A Top-Down Approach Featuring the Internet Introduction Jaypee Institute of Information Technology.
Freescale™ and the Freescale logo are trademarks of Freescale Semiconductor, Inc. All other product or service names are the property of their respective.
What is CRKIT Framework ? Baseband Processor :  FPGA-based off-the-shelf board  Control up to 4 full-duplex wideband radios  FPGA-based System-on-Chip.
1 Modeling Considerations for the Hardware-Software Co-design of Flexible Modern Wireless Transceivers Benjamin Drozdenko, Matthew Zimmermann, Tuan Dao,
Baseband Platforms - Architecture
NFV Compute Acceleration APIs and Evaluation
High-Level Hardware-Software Co-design of an 802
DETAILED SYSTEM DESIGN
A Wireless Sensor Node SoC with a Profiled Power Management Unit for IR Controllable Digital Consumer Devices Dong-Sun Kim, Member, IEEE, Byung-Soo Kim,
Characteristics of Reconfigurable Hardware
Computer Networking A Top-Down Approach Featuring the Internet
Presentation transcript:

The Case of Software Defined Radio with MSRA (work-in-progress) Yongguang Zhang with Kun Tan, Fan Yang, Jiansong Zhang, Haitao Wu, Chunyi Peng, Songwu Lu Microsoft Research Asia June 2008

Fundamental Challenges of SDR Digital baseband processing Super computation required E.g. traditional a requires > 50Gops Tight real-time response E.g. generate MAC response within 10us High level of programmability Accommodate a large range of heterogeneous PHY algorithms and MAC protocols 2

Impending Technology Sea Change Emerging multi-core/many-core GPP Today’s nVidia GPU: 256 core, 600GFlops Intel 80-core CPU: TFlops, IBM Cell: 200GFlops Predicted to possess 1000 cores in a few years Opportunities Sufficient computational power and parallelism PHY MAC NET … APP processing in one place Question: is general-purpose PC architecture suitable for SDR? 3

Our Project Goals Explore new ways of building DSR based on standard multi-core PC architecture Develop a complete software stack High-speed PHY processing on GPP Support for time-critical MAC control Provide a software system research platform for wireless communication and networking 4

GPP+MEM System Architecture 5 PCIe Bus LPA/Sampling Convertor/DAC/ADC Latency critical radio control (AGC/PD) Antenna SDR mgnt Flexible RF Hardware Network layer and above Link-layer Processing MAC Processing Baseband Processing Multi-core Software Radio Architecture Multi-core Software Radio Architecture

Flexible Radio Control Hardware FPGA board to connect RF and PC bus PCIe-8x, Descriptor-based DMA, 256MB onboard SDRAM cache Basic functions Fast sample transport b/w RF and PC memory AGC Packet Detection Early interrupt trigger Timing control 6 Xilinx Vertex5 FPGA Xilinx Vertex5 FPGA PCI-e Link to PC GPIO: connect to RF

Baseband Processing with GPP Exploit parallelisms in PHY for multi-core Batching for SIMD and inter-symbol parallelism Control batch size to meet latency requirement Exploit large memory and high-speed L2 cache Pre-calculations and lookup tables Convolution encoding (de)scramble, (de)interleaving, (de)mapper, … Two paths Multi-core CPU (intel quad-core) Many-core GPU (nvidia GeForce 8800) 7

Performance Results (intermediate) Hardware throughput Baseband processing speed (over link speed) 8 RxTx PCIe-x4842 Mbps839 Mbps PCIe-x81684 Mbps1675 Mbps ParametersSpeedup 1-core CPU b TX Long preamble, 11Mbps29.59x Short preamble, 11Mbps28.82x b RX Long preamble, 11Mbps2.83x Short preamble, 11Mbps2.67x 128-core GPU a TX54Mbps1.39x a RX (w/o viterbi)54Mbps1.56x a RX (w/ viterbi)54Mbps0.53x

Performance Results (cont'd) Hardware latency 9 DMA ReadDMA WriteInterrupt PCIe-x40.36µs0.72µs7~10µs PCIe-x80.34µs0.68µs7~11µs TX PathRX Path RCB0.75µs1.75µs RxTxTurnaroundTime < 5µs RxTxSwitchTime < 5µs Interrupt Rx Latency. Tx Latency. Packet Detected AGC Settle down PLCP Channel estimation Data demodulation & decoding CRC check and MAC hdr check SIFS. ACK send ACK Prep. PreamblePLCP Header Data GPP H/W

Research Opportunities Operating system support for SDR Resource management for multi-core High-resolution timer, scheduler Programmability Library, reusable components Development tools New thinking in wireless communications under extra computational power 10

MAC Core: Precise Timing Control Kernel service for time-critical MAC processing On-demand dedicate core for MAC state-machine Polling for events when going into time-critical task Microsecond level timing control Two-phase transmission transaction Pre-catch frame symbols in RCB Time synchronization H/W timestamp for transceiving events Library functions to manage time mappings 11

Modular Link Layer Modular software architecture for none time- critical link layer functions Extensibility Code reuse Abstract each component into an object with typed interfaces Decouple functionalities into small objects Objects are interconnected with interfaces 12

Graph Presentation of Link Layer Flexible definitions of interfaces Data passing interfaces vs. control interfaces Versatile connection types One-to-one; one-to-many; many-to-one; many-to-many Components – realized as C++ objects XML description for configuration file 13 Classifier HighQ LowQ Sched ARQ CA IfDataPush IfDataPull IfPreempt IfFlush

GUI-based Graph Generator 14 CCapDe IfDataPush... cap0 CCapDe q0 CQueue... … cap0 q0 IfDataPush 0 Type definition Objects definition Links definition

Summary and Status A new software radio stack based on PC architecture that supports High-speed PHY processing Low latency and time-critical MAC Modular architecture for link layer functions Current status PCIe-based flexible radio control board Prototype and evaluate GPU and CPU based BB modules Prototype for MAC-Core runtime and modular link layer Integrating the whole research platform 15

Thanks and Questions! 16