Www.openfabrics.org Distributed Visualization and Data Resources Enabling Remote Interaction and Analysis Scott A. Friedman UCLA Institute for Digital.

Slides:



Advertisements
Similar presentations
Middleware Support for RDMA-based Data Transfer in Cloud Computing Yufei Ren, Tan Li, Dantong Yu, Shudong Jin, Thomas Robertazzi Department of Electrical.
Advertisements

Unified Wire Felix Marti, Open Fabrics Alliance Workshop Sonoma, April 2008 Chelsio Communications.
Spring 2000CS 4611 Introduction Outline Statistical Multiplexing Inter-Process Communication Network Architecture Performance Metrics.
1 © 2004, Cisco Systems, Inc. All rights reserved. Chapter 3 Ethernet Technologies/ Ethernet Switching/ TCP/IP Protocol Suite and IP Addressing.
Electronic Visualization Laboratory, University of Illinois at Chicago Collaborative Visualization Architecture in Scalable Adaptive Graphics Environment.
Uncovering Performance and Interoperability Issues in the OFED Stack March 2008 Dennis Tolstenko Sonoma Workshop Presentation.
Answers of Exercise 7 1. Explain what are the connection-oriented communication and the connectionless communication. Give some examples for each of the.
Network synchronization of Online Games Li, Zetan.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 OSI Transport Layer Network Fundamentals – Chapter 4.
Introduction Future wireless systems will be characterized by their heterogeneity - availability of multiple access systems in the same physical space.
1/20 Introduction Outline Statistical Multiplexing Inter-Process Communication Network Architecture Performance Metrics Implementation Issues.
RDMA ENABLED WEB SERVER Rajat Sharma. Objective  To implement a Web Server serving HTTP client requests through RDMA replacing the traditional TCP/IP.
Introduction. 2 What Is SmartFlow? SmartFlow is the first application to test QoS and analyze the performance and behavior of the new breed of policy-based.
Ch. 28 Q and A IS 333 Spring Q1 Q: What is network latency? 1.Changes in delay and duration of the changes 2.time required to transfer data across.
Applying Real-time Simulation to Real-time Collaboration Bart Miller.
Multimedia Communications Student: Blidaru Catalina Elena.
Chapter 4: Managing LAN Traffic
THOUGHTS ON DATA MANAGEMENT by Justin Burruss & David Schissel SWIM Workshop November 7-9, 2005 Oak Ridge, TN.
Providing Controlled Quality Assurance in Video Streaming across the Internet Yingfei Dong, Zhi-Li Zhang and Rohit Rakesh Computer Networking and Multimedia.
Chapter Two Application Layer Prepared by: Dr. Bahjat Qazzaz CS Dept. Sept
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.1 ISP Services Working at a Small-to-Medium Business or ISP – Chapter 7.
Globus GridFTP: What’s New in 2007 Raj Kettimuthu Argonne National Laboratory and The University of Chicago.
Towards a Common Communication Infrastructure for Clusters and Grids Darius Buntinas Argonne National Laboratory.
Our Last Class!!  summary  what does the future look like?
TCP Throughput Collapse in Cluster-based Storage Systems
QoS Support in High-Speed, Wormhole Routing Networks Mario Gerla, B. Kannan, Bruce Kwan, Prasasth Palanti,Simon Walton.
The NE010 iWARP Adapter Gary Montry Senior Scientist
1 © 2003, Cisco Systems, Inc. All rights reserved. CCNA 2 Module 9 Basic Router Troubleshooting.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 OSI Transport Layer Network Fundamentals – Chapter 4.
1 Next Few Classes Networking basics Protection & Security.
Interactive Exploration of Large Remote Micro-CT Scans Prohaska, Hutanu, Kähler, Hege (Zuse Institut Berlin)
Electronic visualization laboratory, university of illinois at chicago A Case for UDP Offload Engines in LambdaGrids Venkatram Vishwanath, Jason Leigh.
TCP/IP TCP/IP LAYERED PROTOCOL TCP/IP'S APPLICATION LAYER TRANSPORT LAYER NETWORK LAYER NETWORK ACCESS LAYER (DATA LINK LAYER)
1 Networking Chapter Distributed Capabilities Communications architectures –Software that supports a group of networked computers Network operating.
Managed Object Placement Service John Bresnahan, Mike Link and Raj Kettimuthu (Presenting) Argonne National Lab.
TCP/IP Honolulu Community College Cisco Academy Training Center Semester 2 Version 2.1.
Transport Layer COM211 Communications and Networks CDA College Theodoros Christophides
Paper Review: Latency Evaluation of Networking Mechanisms for Game Traffic Jin, Da-Jhong.
Networking Fundamentals. Basics Network – collection of nodes and links that cooperate for communication Nodes – computer systems –Internal (routers,
Prospects for the use of remote real time computing over long distances in the ATLAS Trigger/DAQ system R. W. Dobinson (CERN), J. Hansen (NBI), K. Korcyl.
1 OSI and TCP/IP Models. 2 TCP/IP Encapsulation (Packet) (Frame)
Mr. P. K. GuptaSandeep Gupta Roopak Agarwal
INTERNET AND PROTOCOLS For more notes and topics visit: eITnotes.com.
IWarp Based Remote Interactive Scientific Visualization CENIC 2008 Oakland, CA Scott A. Friedman UCLA Academic Technology Services Research Computing Technologies.
TCP Traffic Characteristics—Deep buffer Switch
Status of Tokyo LCG tier-2 center for atlas / H. Sakamoto / ISGC07 Status of Tokyo LCG Tier 2 Center for ATLAS Hiroshi Sakamoto International Center for.
Scientific Computing Goals Past progress Future. Goals Numerical algorithms & computational strategies Solve specific set of problems associated with.
UDP: User Datagram Protocol Chapter 12. Introduction Multiple application programs can execute simultaneously on a given computer and can send and receive.
Ethernet Congestion Management Brad Booth, Intel September 2004.
CSE 331: Introduction to Networks and Security Fall 2000 Instructor: Carl A. Gunter Slide Set 2.
Client-server communication Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Network Processing Systems Design
Advisor: Hung Shi-Hao Presenter: Chen Yu-Jen
Balazs Voneki CERN/EP/LHCb Online group
Accelerating Peer-to-Peer Networks for Video Streaming
Computer Networks.
TCP Vegas: New Techniques for Congestion Detection and Avoidance
ISO/OSI Model and Collision Domain
Transport Protocols over Circuits/VCs
Working at a Small-to-Medium Business or ISP – Chapter 7
EE 122: Network Applications
Using SCTP to hide latency in MPI programs
Working at a Small-to-Medium Business or ISP – Chapter 7
Studying TCP and UDP interactions in WiFi networks
Working at a Small-to-Medium Business or ISP – Chapter 7
University of Houston Datacom II Lecture 1B Review Dr Fred L Zellner
TCP/IP Protocol Suite: Review
Computer Networks Topic :User datagram protocol Transmission Control Protocol -Hemashree S( )
Microsoft Virtual Academy
Towards Hyperscale HPC & RDMA
Presentation transcript:

Distributed Visualization and Data Resources Enabling Remote Interaction and Analysis Scott A. Friedman UCLA Institute for Digital Research and Education

2 Motivating Challenge  Remote access to visualization cluster Limited resources ($) for development Re-use as much exiting IB (verb) code as possible (iWarp) Leverage existing 10G infrastructure (UCLA, CENIC) Use layer 3 Pacing critical for consistent user interaction  Remote access to researcher’s data Same limitations as above High performance retrieval of time step data Synchronized to modulo of rendering rate

3 Visualization Cluster  Hydra - Infiniband based  24 rendering nodes (48 nvidia 9800GTX+) 1 high-definition display node 3 visualization center projection nodes 1 remote visualization bridge node*  Research System - primarily High performance interactive rendering (60Hz) Dynamic (on-line) load balancing  System requires both low latency and high bandwidth Perfect for Infiniband! * We will return to this in a minute

4 Network Diagram visualization cluster visualization cluster local viz nodes local viz nodes IBiWarp IB Remote viz node Remote viz node ucla cenic nlr scinet pixel data input bridge node bridge node Acts like a display to the cluster Remote disk node Remote disk node command data

5 Remote Visualization Bridge Node  Bridges from IB to iWarp/10G Ethernet  Appears like a display node to cluster  No change to existing rendering system  Pixel data arrives over IB, which  is sent using iWarp to a remote display node  uses same RDMA protocol Same buffer used for receive and send (bounce) Very simple in principle - sort of… TCP settings can be a challenge  Time step data also passes through (opposite direction) Distributed to appropriate rendering nodes

6 Data flow interaction frames rate control step data remote display remote disk GPU0 GPU1 GPU0 GPU1 GPU0 GPU1 GPU0 GPU1 IB switch IB iWarp Hydra visualization cluster Time step data inbound Frame data outbound WAN SDR IB DDR IB 10GE Data set size and rate determine throughput. Example data is MB time steps. Rates from 10-60hz give data flows from Gbps Display resolution and update rate determine throughput. Example resolution of 2560x1600 is 11.75MB per frame. Rates from 10-60hz give data flows from 0.9 to 5.5Gbps

7 Example data set  Uncompressed video stream 20-60Hz at up to 2560x1600 ~ 1.8Gbps – 5.5Gbps  Time steps Time step size and rate determine flow steps at 20-60Hz at 16MB/step ~ 2.4Gbps – 7.5Gbps  Actual visualization Interactive 100M particle n-body simulation 11 unique processes  Performance measurements are internal Summaries are displayed at completion of run

8 Running Demo

9 Future Work  Latency is a non-issue at UCLA (which is our main focus) ~60 usec across campus  Longer latencies / distances are an issue UCLA to UC Davis ~20ms rtt (CENIC) UCLA to SC07 (Reno) ~14ms rtt (CENIC/NLR) UCLA to SC08 (Austin) ~30ms rtt (CENIC/NLR)  Frames and time step data completely in-flight (at high frame-rates) Not presently compensating for this  Minimizing latency important for interaction Pipelining transfers helps hide latency: lat’ = ( lat / chunks ) chunks + 1 chunks = 1 : lat = 2chunks = 2 : lat = 1.5

10 Lessons Learned  TCP nature of iWarp (some layer 3 related)  Congestion control wreaks havoc on interaction / latencies  Reliability Not needed for video frames Maybe needed for data (depends on application)  Buffer tuning – headache  Layer 9 issues  Better to use UDP?  Is there an unreliable iWarp?  Dedicated/Dynamic circuits?  Obsidian Longbow IB extenders (haven’t tried this)  Understand there may be an IP version out there? (not iWarp, but…)

11 Thank you  Questions? 