Research Opportunities in IP Wide Area Storage

Slides:



Advertisements
Similar presentations
IP Router Architectures. Outline Basic IP Router Functionalities IP Router Architectures.
Advertisements

RAID Redundant Array of Independent Disks
Logically Centralized Control Class 2. Types of Networks ISP Networks – Entity only owns the switches – Throughput: 100GB-10TB – Heterogeneous devices:
Dr. Kalpakis CMSC 621, Advanced Operating Systems. Fall 2003 URL: Distributed System Architectures.
Operating Systems ECE344 Ashvin Goel ECE University of Toronto Disks and RAID.
CS162 Section Lecture 9. KeyValue Server Project 3 KVClient (Library) Client Side Program KVClient (Library) Client Side Program KVClient (Library) Client.
Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.
Determining BC/DR Methods Recovery Time Objective – (RTO) Time needed to recover from a disaster How long can you afford to be without your systems Recovery.
Video over ICN IRTF Interim Meeting Boston, MA Cedric Westphal.
Latency Tolerance: what to do when it just won’t go away CS 258, Spring 99 David E. Culler Computer Science Division U.C. Berkeley.
SQL Server, Storage And You Part 2: SAN, NAS and IP Storage.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
ISCSI Performance in Integrated LAN/SAN Environment Li Yin U.C. Berkeley.
P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar.
1 OASIS: Overlays and Active Services for Internetworked Storage Tal Lavian, Yin Li, George Porter, Mel Tsai, Randy H. Katz Computer Science Division Electrical.
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
ISCSI Performance Experiments Li Yin EECS Department U.C.Berkeley.
Network Hardware for Expanding Network. Expanding Networks When do we need expansion:  Network cable is full of data movements  Printing tasks needs.
Network Support for Cloud Services Lixin Gao, UMass Amherst.
Redundant Array of Independent Disks
Chapter 4: Managing LAN Traffic
New Protocols for Remote File Synchronization Based on Erasure Codes Utku Irmak Svilen Mihaylov Torsten Suel Polytechnic University.
CS An Overlay Routing Scheme For Moving Large Files Su Zhang Kai Xu.
Infrastructure for Better Quality Internet Access & Web Publishing without Increasing Bandwidth Prof. Chi Chi Hung School of Computing, National University.
Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.
World Wide Web Caching: Trends and Technologys Gerg Barish & Katia Obraczka USC Information Sciences Institute, USA,2000.
Storage 101: Bringing Up SAN Garry Moreau Senior Staff Alliance Consultant Ciena Communications (763)
Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.
Configuring File Services. Using the Distributed File System Larger enterprises typically use more file servers Used to improve network performce Reduce.
Topic 3 Analysing network traffic
Distributed Computing Systems CSCI 4780/6780. Scalability ConceptExample Centralized servicesA single server for all users Centralized dataA single on-line.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Internet Protocol Storage Area Networks (IP SAN)
3/12/2013Computer Engg, IIT(BHU)1 PARALLEL COMPUTERS- 2.
© 2009 IBM Corporation Statements of IBM future plans and directions are provided for information purposes only. Plans and direction are subject to change.
© 2007 EMC Corporation. All rights reserved. Internet Protocol Storage Area Networks (IP SAN) Module 3.4.
Cisco Networking Academy Program
Instructor Materials Chapter 6: Quality of Service
Business Continuity & Disaster Recovery
ESE532: System-on-a-Chip Architecture
Determining BC/DR Methods
Lecture: Large Caches, Virtual Memory
CS 147 – Parallel Processing
Introduction to Networks
Gregory Kesden, CSE-291 (Storage Systems) Fall 2017
Gregory Kesden, CSE-291 (Cloud Computing) Fall 2016
Business Continuity & Disaster Recovery
2018 Huawei H Real Questions Killtest
Cisco Networking Academy Program
Cisco Networking Academy Program
RAID RAID Mukesh N Tekwani
Distributed Systems CS
Network Core and QoS.
CS Lecture 2 Network Performance
Lecture: Cache Innovations, Virtual Memory
Architectures of distributed systems Fundamental Models
UNIT IV RAID.
Architectures of distributed systems Fundamental Models
Specialized Cloud Architectures
High Performance Computing
Latency Tolerance: what to do when it just won’t go away
Architectures of distributed systems
Outline Review of Quiz #1 Distributed File Systems 4/20/2019 COP5611.
RAID RAID Mukesh N Tekwani April 23, 2019
Architectures of distributed systems Fundamental Models
Computer Networks Protocols
Improving performance
CSE 542: Operating Systems
Network Core and QoS.
Multiprocessors and Multi-computers
Presentation transcript:

Research Opportunities in IP Wide Area Storage George Porter Li Yin Department of EECS U.C. Berkeley 12/26/2018 SAHARA Retreat

Outline Trends Challenges for wide-area storage Programmability inside networks Common techniques to hide latency Functionality that will benefit applications Network Support for that functionality Reconsidering the programmability model and application space Feedback

Outline Trends Challenges for wide-area storage Programmability inside networks Common techniques to hide latency Functionality that will benefit applications Network Support for that functionality Reconsidering the programmability model and application space Feedback

Storage Wide-Area Networking Metro Area SAN technology works well in relatively small area (metro-wide) There is a desire to implement storage applications in the wide-area Comparable performance as small area storage applications

Outline Trends Challenges for wide-area storage Programmability inside networks Common techniques to hide latency Functionality that will benefit applications Network Support for that functionality Reconsidering the programmability model and application space Feedback

Challenges in Wide-Area Storage Speed of light is constant Long distance implies propagation delay Network dynamics Variation of cross traffic load Routes changes Increasing storage capacity Transmit huge amount of data across wide-area

Challenges in Wide-Area Storage Simple operation: Host writes data to the remote target disk Send data to the remote target Write disk Where is the bottleneck? Disk? Network Link? Distance? Performance Degradation Local Disk Write Operation Time Extra Delay Caused by Network Local Operation Time Extra Delay Caused By the Network Time

Challenges in Wide-Area Storage Three Cases: Case 1: Limited link bandwidth Case 2: Small data set with high bandwidth Case 3: Large data set with high bandwidth

Challenges in Wide-Area Storage Case 1: Limited link bandwidth Transmission Time Local Disk Write Operation Time More data to be transmitted Time Time Extra Delay Caused By the Network

Challenges in Wide-Area Storage Case 1: Limited link bandwidth As more data to be transmitted: The performance degradation caused by the transmission delay gets larger Propagation delay does not matter As the disk getting faster, more bandwidth is required to shift the bottleneck away from the network

Challenges in Wide-Area Storage Case 2: Small data set with high link bandwidth In this case, the throughput is very sensitive to the distance, especially when it becomes of the order of the disk latency Local Disk Write Operation Time Larger Distance Time Time

Challenges in Wide-Area Storage Case 3: Large data set with high link bandwidth Local Disk Write Operation Time More Data to be Transmitted Time Time Time Extra Delay Caused By the Network

Challenges in Wide-Area Storage Case 3: Large data set with high link bandwidth In this case, disk is the bottleneck, the network only introduces the propagation delay which can be ignored as more data to be transmitted As the disk getting faster, more bandwidth is required to shift the bottleneck away from the network

Challenges in Wide-Area Storage Where is the bottleneck? Link Bandwidth Size of data to be transmitted Disk Speed Key issue in the wide-area storage is how to reduce the latency Latency introduced by the network Latency introduced by the storage

Outline Trends Challenges for wide-area storage Programmability inside networks Common techniques to hide latency Functionality that will benefit applications Network Support for that functionality Reconsidering the programmability model and application space Feedback

Common Techniques to Hide Latency Caching Parallelism Pipelining Prefetching … Where and how to implement these techniques for wide-area storage applications?

Code at edge –vs- in the fabric Location of data separated from use of data Idea is to put processing near the data it acts on Better visibility into network conditions, dynamics Big performance gains if we can act on streams of data in the datapath Network processors are more powerful today A good match?

Outline Trends Challenges for wide-area storage Programmability inside networks Common techniques to hide latency Functionality that will benefit applications Network Support for that functionality Reconsidering the programmability model and application space Feedback

Gather Digital animation editing Large dataset visualization Synchronous Asynchronous N-to-1 disk copies (KaZaa) Recreate dataset from multiple sources/disks (scientific experiment) Restore backup Digital animation editing Large dataset visualization

Gather Techniques Network Primitives Digital animation editing Caching Parallelism Prefetching Network Primitives FS semantic information Store block location state in router View into network routes/conditions Table lookup in router Modify disk requests to point to correct locations Join data streams to deliver coherent data to app Orthogonal path selection Synchronous Digital animation editing Large dataset visualization

Gather Digital animation editing Large dataset visualization Synchronous Asynchronous N-to-1 disk copies (KaZaa) Recreate dataset from multiple sources/disks (scientific experiment) Restore backup Digital animation editing Large dataset visualization

Gather Techniques Network Primitives Join data streams to deliver coherent data to app Orthogonal path selection Volume state in routers Replicate SCSI requests Reorder SCSI responses Techniques Pipelining Avoid congestion/optimize for bandwidth Network Primitives FS semantic information Store block location state in router View into network routes/conditions Table lookup in router Modify disk requests to point to correct locations Asynchronous N-to-1 disk copies (KaZaa) Recreate dataset from multiple sources/disks (scientific experiment) Restore backup

Scatter State dissemination Updating mapping tables Synchronous Asynchronous Disaster-recovery application Experimental data unloading State dissemination CDN/web server updating? Gaming? Updating mapping tables

Scatter Techniques Network Primitives State dissemination Delay-sensitive path selection Congestion avoidance Synchronization Network Primitives Network monitoring FS semantic information Store block location state in router View into network routes/conditions Table lookup in router … Synchronous State dissemination CDN/web server updating? Gaming? Updating mapping tables

Scatter State dissemination Updating mapping tables Synchronous Asynchronous Disaster-recovery application Experimental data unloading State dissemination CDN/web server updating? Gaming? Updating mapping tables

Scatter Techniques Network Primitives Disk location/selection Load balancing Physical distance knowledge Network Primitives Network monitoring FS semantic information Store block location state in router View into network routes/conditions Table lookup in router … Asynchronous Disaster-recovery application Experimental data unloading

Outline Trends Challenges for wide-area storage Programmability inside networks Common techniques to hide latency Functionality that will benefit applications Network Support for that functionality Reconsidering the programmability model and application space Feedback

Useful Network Primitives What is reasonable and possible? FS semantic information Store block location state in router View into network routes/conditions Table lookup in router Modify disk requests to point to correct locations Join data streams to deliver coherent data to app Orthogonal path selection Volume state in routers Replicate SCSI requests Reorder SCSI responses Others?

Outline Trends Challenges for wide-area storage Programmability inside networks Common techniques to hide latency Functionality that will benefit applications Network Support for that functionality Reconsidering the programmability model and application space Feedback

Your Feedback? 12/26/2018 SAHARA Retreat