Challenges of Storage in an Elastic Infrastructure. May 9, 2014 Farid Yavari, Storage Solutions Architect and Technologist.

Slides:



Advertisements
Similar presentations
Data Storage Solutions Module 1.2. Data Storage Solutions Upon completion of this module, you will be able to: List the common storage media and solutions.
Advertisements

Ivan Pleština Amazon Simple Storage Service (S3) Amazon Elastic Block Storage (EBS) Amazon Elastic Compute Cloud (EC2)
Flash storage memory and Design Trade offs for SSD performance
NAS vs. SAN 10/2010 Palestinian Land Authority IT Department By Nahreen Ameen 1.
Connect communicate collaborate GN3plus What the network should do for clouds? Christos Argyropoulos National Technical University of Athens (NTUA) Institute.
What’s New: Windows Server 2012 R2 Tim Vander Kooi Systems Architect
Brocade VDX 6746 switch module for Hitachi Cb500
The All-Flash Array for the Next Generation Data Center.
Cloud Computing to Satisfy Peak Capacity Needs Case Study.
© 2009 VMware Inc. All rights reserved Big Data’s Virtualization Journey Andrew Yu Sr. Director, Big Data R&D VMware.
Efficiently store fewer bits. File1 File2 After Dedup: Before Dedup:5TB Chunk Store Non-Optimized Files Optimized file stubs Savings = 4TB 1TB.
1© Copyright 2015 EMC Corporation. All rights reserved. SDN INTELLIGENT NETWORKING IMPLICATIONS FOR END-TO-END INTERNETWORKING Simone Mangiante Senior.
CERN openlab Open Day 10 June 2015 KL Yong Sergio Ruocco Data Center Technologies Division Speeding-up Large-Scale Storage with Non-Volatile Memory.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
“Better together” PowerVault virtualization solutions
© Hitachi Data Systems Corporation All rights reserved. 1 1 Det går pænt stærkt! Tony Franck Senior Solution Manager.
1© Copyright 2013 EMC Corporation. All rights reserved. EMC and Microsoft SharePoint Server Performance Name Title Date.
Distinguish between primary and secondary storage.
1 © Copyright 2009 EMC Corporation. All rights reserved. Agenda Storing More Efficiently  Storage Consolidation  Tiered Storage  Storing More Intelligently.
CERN openlab Open Day 10 June 2015 KL Yong Sergio Ruocco Data Center Technologies Division Speeding-up Large-Scale Storage with Non-Volatile Memory.
Cloud Computing in Large Scale Projects George Bourmas Sales Consulting Manager Database & Options.
© 2009 VMware Inc. All rights reserved VMworld Update Ian Moore - Country Manager Ireland ie.linkedin.com/in/iantmooreiantmoore.
Extreme Networks Confidential and Proprietary. © 2010 Extreme Networks Inc. All rights reserved.
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Enable Cloud with Virtual.
An Examination of Cloud Storage Architectures for Scalable Internet and Cloud Computing Applications Joel Christner (jec2160) COMS-E6125 Spring 2010 Web.
Hyper-V Storage Senthil Rajaram Senior PM Microsoft Corporation.
ICPP 2014 Keynotes Summary 09/24. Data Centric Systems: The Next Paradigm in Computing Speaker: Dr. Tilak Agerwala ◦ Vice President, Data Centric Systems.
Introduction To Windows Azure Cloud
IMT, Inc. Proprietary & Confidential Storage The Year In Review 2008 And looking ahead for 2009 Rob Kobrin Chief Technology Officer Integrated Media Technologies,
IT Infrastructure Chap 1: Definition
N. GSU Slide 1 Chapter 02 Cloud Computing Systems N. Xiong Georgia State University.
Wayne Hogan National Storage Manager Sun Microsystems of Canada, Inc.
Extreme-scale computing systems – High performance computing systems Current No. 1 supercomputer Tianhe-2 at petaflops Pushing toward exa-scale computing.
FlashSystem family 2014 © 2014 IBM Corporation IBM® FlashSystem™ V840 Product Overview.
Large Scale Test of a storage solution based on an Industry Standard Michael Ernst Brookhaven National Laboratory ADC Retreat Naples, Italy February 2,
Eucalyptus: An Open-source Infrastructure for Cloud Computing Rich Wolski Eucalyptus Systems Inc.
The Data Center of the Future Steve Duplessie Founder & Senior Analyst Enterprise Strategy Group, Inc.
Switched Storage Architecture Benefits Computer Measurements Group November 14 th, 2002 Yves Coderre.
Agenda Motion Imagery Challenges Overview of our Cloud Activities -Big Data -Large Data Implementation Lessons Learned Summary.
RIVERBED INTRODUCES NEW PLATFORM FOR ADC-AS-A-SERVICE New Stingray Services Controller Delivers Hyper-Elastic ADC Platform EXTREME ELASTICITY INSTANTLY.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
Rick Claus Sr. Technical Evangelist,
 The End to the Means › (According to IBM ) › 03.ibm.com/innovation/us/thesmartercity/in dex_flash.html?cmp=blank&cm=v&csr=chap ter_edu&cr=youtube&ct=usbrv111&cn=agus.
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. NAS versus SAN NAS – Architecture to provide dedicated file level access.
STORAGE ARCHITECTURE/ MASTER): Where IP and FC Storage Fit in Your Enterprise Randy Kerns Senior Partner The Evaluator Group.
Enabling the Cloud OS Today  New high-density Web Sites with elastic cloud scaling and complete dev-ops experiences  New rich IaaS experience for self-service.
Launch Amazon Instance. Amazon EC2 Amazon Elastic Compute Cloud (Amazon EC2) provides resizable computing capacity in the Amazon Web Services (AWS) cloud.
RobuSTore: Performance Isolation for Distributed Storage and Parallel Disk Arrays Justin Burke, Huaxia Xia, and Andrew A. Chien Department of Computer.
Network Virtualization Policy-Based Isolation QoS Performance Metrics Live & Storage Migrations Cross-Premise Connectivity Dynamic & Multi-Tenant.
An Introduction to GPFS
1 Paolo Bianco Storage Architect Sun Microsystems An overview on Hybrid Storage Technologies.
Software Defined Datacenter – from Vision to Solution
Introduction to Data Analysis with R on HPC Texas Advanced Computing Center Feb
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
St. Petersburg, 2016 Openstack Disk Storage vs Amazon Disk Storage Computing Clusters, Grids and Cloud Erasmus Mundus Master Program in PERCCOM Author:
Extreme Scale Infrastructure
Journey to the HyperConverged Agile Infrastructure
Network Requirements for Resource Disaggregation
MERANTI Caused More Than 1.5 B$ Damage
Organizations Are Embracing New Opportunities
What’s New in VMware vSAN 6.6?
Reducing Risk with Cloud Storage
Storage Virtualization
GGF15 – Grids and Network Virtualization
The Brocade Cloud Manageability Vision
AllDigital Brevity on Microsoft Azure Cloud Platform Supercharges Media Workloads by Encoding During High-Speed File Transmission MICROSOFT AZURE ISV PROFILE:
Cloud computing mechanisms
Internet and Web Simple client-server model
CS 295: Modern Systems Organizing Storage Devices
PayPal Cloud Journey & Architecture
Presentation transcript:

Challenges of Storage in an Elastic Infrastructure. May 9, 2014 Farid Yavari, Storage Solutions Architect and Technologist

The eBay Inc. Portfolio University of Minnesota Storage Workshop

The eBay Inc. Portfolio University of Minnesota Storage Workshop

The eBay Inc. Portfolio University of Minnesota Storage Workshop

The eBay Inc. Portfolio University of Minnesota Storage Workshop

The eBay Inc. Portfolio 2014 University of Minnesota Storage Workshop 6

Size of the Managed Infrastructure 7 SAN ~ FC OLTP environment 16PB NAS ~ 6PB Cloud Object Store ~1EB by 2015 Analytics environment ~270PB ~130 enterprise SAN/NAS/FLASH storage arrays in 3 DCs ~1.4mil peak hour IOPS in SAN environment Thousands of servers with external storage 2014 University of Minnesota Storage Workshop

Foundation of an Elastic Infrastructure 8 Automated Control Plane Resource Pool –Compute –Memory –Storage –Low latency, High bandwidth interconnect Traffic Management –PCI Compliance –Security –QoS Definition: An infrastructure that can spawn, destroy, grow, shrink and move processes dynamically and efficiently within and across data centers University of Minnesota Storage Workshop

Key Technologies to Enable an Elastic Infrastructure 9 Control Plane –Virtualization / Containers/ Hardware –Orchestration of infrastructure resources –Normalization of resources Resource Pool –High Speed Networking (10Gbe, 40Gbe, 100Gbe, beyond) RDMA enabled (routable layer3) Lossless flow control –Memory Class Storage –New Media beyond Virtical Nand Traffic –Virtual Lan –Access Control Image Credit: Open StackOpen Stack 2014 University of Minnesota Storage Workshop

Key Initiatives to Enable an Elastic Infrastructure 10 Separation of Storage and Compute -Hadoop use case Software defined storage, software defined network Cloud, SLA, OLA based services –Standardization –Automation –Show/Chargeback –Self Service 2014 University of Minnesota Storage Workshop

More importantly 11 Simplicity Simplicity Scales Simplicity is the ultimate sophistication 2014 University of Minnesota Storage Workshop

Shifting Paradigm of Storage 12 TechBusBWLat1234 HDD (LFF)SAS/SATA600MB/s 1.2GB/s 3-12ms5-6TB7-8TB10TB? SSDSAS/SATA600MB/s 1.2GB/s ms4-8TB8-16TB24TB36-48TB FlashPCIE2GB/s 3GB/s 2μs - 150μs 1-16TB16TB+ Beyond Nand PCIE2+GB/s1μs -40μs 100’s ns 8-24TB24TB+ Sectors/Block/Cells -> unbound (bytes/ pages) Devices becoming consumables (cell failures vs platter failures, shrinking capacities on failure) Endurance is temporary (next 2-8 years). Interim – controllers will shield users from endurance. long term – media becomes tolerant (no read/ program disturb) 2014 University of Minnesota Storage Workshop

What about Block, File, Object and Databases? 13 Legacy support will keep them around for a long time (Is Cobol/RPG Dead yet?) Block is still the basis of storage, until Software catches up to media (2-4 years delayed after new tech introduced) Object Store is the new Block, File and Database –No Attachments required (Simplicity) –Enforceable Access Control –Only difference between Block, File and Database in an object store is the richness of metadata Maps to new technology better than Block, File and Databases today years: We’re back to the future. Processes have compute and a work area (memory) which may be persistent. Processed data stored in an object store and handed off to further processing or tiered to archival is based on policy and flow control. Scale out across a data center 2014 University of Minnesota Storage Workshop

The Challenge 14 Near Term (3-5 years) –Paradigm shift is starting. –Storage Arrays are understanding flash and flash are becoming flash not disk emulation (slow evolution) –Next steps is to model filesystems, kernels, application to understand flash Failure domains, Performance, Data placement/movement vs RAID Strategic 5+ years –Server model/ Resource model changing Disaggregation of resources brings new challenges Persistent Memory replacing standard memory requires system bring up and fault remediation challenges Object storage evolution and metadata handling – long ways to go 2014 University of Minnesota Storage Workshop

2016 Infrastructure Goals 15 Flash Everywhere: Local, Shared, Cold, Hadoop, Media etc. No FC or Infiniband. ISER/IP, 40Gbps+ethernet Fully REST API driven storage management and automation Hyperscale cloud across all classes of service (QA, Dev, Prod, ec.) Extremely high densities, 140TB+ per flash device, 54PB+ racks Two tiers of flash: High performance, and Archival flash 2014 University of Minnesota Storage Workshop