Content Addressed Storage

Presentation transcript:

Content Addressed Storage
Chapter 9
Copyright © 2009 EMC Corporation. Do not Copy - All Rights Reserved.
ISMDR:BEIT:VIII:chap5.5:CAS: Madhu N PIIT

Chapter Objective
Upon completion of this chapter, you will be able to:
- Describe CAS, fixed content and archives, and traditional storage solutions for archives
- Describe the features and benefits of a CAS-based storage strategy
- List the physical and logical elements of CAS
- Describe the storage and retrieval process for CAS data objects
- Describe the operational environments best suited to CAS solutions

Lesson: CAS Overview
Upon completion of this lesson, you will be able to:
- Define fixed content
- Describe traditional archival solutions and their shortcomings
- Define Content Addressed Storage (CAS)
- List the benefits of CAS

What are Fixed Content and Archives
Digital assets retained for active reference and value: generate new revenues, improve service levels, leverage historical value.
- Electronic documents
  - Contracts, claims, etc.
  - E-mail and attachments
  - Financial spreadsheets
  - CAD/CAM designs
  - Presentations
- Digital records
  - Documents: checks, securities trades, historical preservation
  - Photographs: personal / professional
  - Surveys: seismic, astronomic, geographic
- Rich media
  - Medical: X-rays, MRIs, CTI
  - Video: news / media, movies, security surveillance
  - Audio: voicemail, radio

Challenges of Storing Fixed Content
- Fixed content is growing at more than 90% annually
  - A significant amount of newly created information falls into this category
  - New regulations require retention and data protection
- Long-term preservation is often required (years to decades)
- Simultaneous multi-user online access is preferable to offline storage
- Faster access to fixed content is needed
- Data must be location independent, to enable technology refresh and migration
- Traditional storage methods are inadequate

Traditional Storage Solutions for Archive
- Archival solutions fall into three categories based on the means of access: online, nearline, and offline
- Traditional archival solutions were offline
- The traditional archival process used optical disks and tapes as archival media
- An archive is often stored on a Write Once Read Many (WORM) device, such as a CD-ROM

Shortcomings of Traditional Archiving Solutions
- Tape is slow, and standards are always changing
- Optical is expensive and requires vast amounts of media
- Recovering files from tape and optical media is often time consuming
- Data on tape and optical media is subject to media degradation
- Both solutions require sophisticated media management
- CAS has emerged as an alternative to traditional archiving solutions

What is Content Addressed Storage (CAS)
- Object-oriented, location-independent approach to data storage
- Repository for the "objects"
- Access mechanism to interface with the repository
- Globally unique identifiers provide access to objects
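
The idea of addressing an object by its content rather than by its location can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the slides do not say which algorithm produces the identifier, so SHA-256 stands in for it, and `content_address` and `ObjectRepository` are hypothetical names, not part of any CAS product's API.

```python
import hashlib


def content_address(data: bytes) -> str:
    """Derive an identifier from the bytes alone (SHA-256 is an assumed choice)."""
    return hashlib.sha256(data).hexdigest()


class ObjectRepository:
    """Toy in-memory repository keyed by content address (hypothetical)."""

    def __init__(self):
        self._objects = {}

    def put(self, data: bytes) -> str:
        # The caller never chooses a path or filename; the content decides the key.
        address = content_address(data)
        self._objects[address] = data
        return address

    def get(self, address: str) -> bytes:
        return self._objects[address]


repo = ObjectRepository()
address = repo.put(b"signed contract, final version")
print(address)
print(repo.get(address))
```

Because the identifier depends only on the bytes, the object can move between devices or sites without its address changing, which is what makes the approach location independent.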

Benefits of CAS
- Content authenticity
- Content integrity
- Location independence
- Single-instance storage (SiS)
- Retention enforcement
- Record-level protection and disposition
- Technology independence
- Fast record retrieval
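
Two of these benefits, single-instance storage and content integrity, follow fairly directly from content addressing, as the sketch below tries to show. It is only an illustration under stated assumptions: SHA-256 is an assumed hash, and `SingleInstanceStore` with its `put` and `verify` methods is a hypothetical class, not a vendor interface.

```python
import hashlib


class SingleInstanceStore:
    """Toy store: identical content is kept once (hypothetical, for illustration)."""

    def __init__(self):
        self._blobs = {}
        self.writes_skipped = 0

    def put(self, data: bytes) -> str:
        address = hashlib.sha256(data).hexdigest()  # assumed hash function
        if address in self._blobs:
            # Same bytes already stored: keep the single existing instance.
            self.writes_skipped += 1
        else:
            self._blobs[address] = data
        return address

    def verify(self, address: str) -> bool:
        """Integrity check: re-hash the stored bytes and compare with the address."""
        return hashlib.sha256(self._blobs[address]).hexdigest() == address

    def __len__(self):
        return len(self._blobs)


store = SingleInstanceStore()
first = store.put(b"quarterly report")
second = store.put(b"quarterly report")  # e.g. the same attachment archived twice
print(first == second, len(store), store.verify(first))
```

If the stored bytes were ever altered, re-hashing would no longer reproduce the address, so `verify` would return `False`.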

Lesson Summary
Key points covered in this lesson:
- CAS definition
- Challenges of storing fixed content
- Shortcomings of traditional archiving solutions
- Benefits of CAS

Lesson: CAS Architecture
Upon completion of this lesson, you will be able to:
- Describe CAS architecture
- Describe the physical and logical elements of CAS
- Describe the data storage and retrieval process in a CAS environment
- Describe CAS examples

Physical Elements of CAS
- Storage devices (CAS based)
  - Storage node
  - Access node
- Servers (to which storage devices get connected)
- Client
[Figure labels: Client, Server, API, IP, Access Nodes, Private LAN, Storage Nodes, CAS System]

CAS Terminology
- Application Programming Interface (API): A set of function calls that enables communication between applications or between an application and an operating system
- Binary Large Object (BLOB): The Distinct Bit Sequence (DBS) of user data; it represents the actual content of a file and is independent of the filename and physical location

CAS Terminology (Cont.)
- C-Clip: A package containing the user's data and associated metadata. The C-Clip ID (C-Clip handle or C-Clip reference) is the CA that the system returns to the client application
- Content Address (CA): An identifier that uniquely addresses the content of a file and not its location. Unlike location-based addresses, content addresses are inherently stable: once calculated, they never change and always refer to the same content
- C-Clip Descriptor File (CDF): The additional XML file that the system creates when making a C-Clip. This file includes the content addresses for all referenced BLOBs and associated metadata
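
How these terms relate can be sketched in Python. The slides do not give the real CDF schema or the hash that produces a content address, so the XML element and attribute names below (`cdf`, `metadata`, `attribute`, `blob`) and the use of SHA-256 are assumptions, and `build_cdf` and `make_c_clip` are hypothetical helpers. The sketch only shows the shape of the idea: the CDF lists the content addresses of the BLOBs plus metadata, and the C-Clip ID is itself a content address.

```python
import hashlib
import xml.etree.ElementTree as ET


def address_of(data: bytes) -> str:
    """Assumed content-address function (SHA-256); the real algorithm is not stated."""
    return hashlib.sha256(data).hexdigest()


def build_cdf(blobs, metadata) -> bytes:
    """Build a descriptor listing each BLOB's content address plus metadata.

    The element and attribute names are invented for this sketch.
    """
    root = ET.Element("cdf")
    meta = ET.SubElement(root, "metadata")
    for key in sorted(metadata):
        ET.SubElement(meta, "attribute", name=key, value=metadata[key])
    for blob in blobs:
        ET.SubElement(root, "blob", address=address_of(blob), size=str(len(blob)))
    return ET.tostring(root, encoding="utf-8")


def make_c_clip(blobs, metadata):
    """Return (clip_id, cdf); here the clip ID is taken to be the address of the CDF."""
    cdf = build_cdf(blobs, metadata)
    return address_of(cdf), cdf


clip_id, cdf = make_c_clip([b"x-ray image bytes"], {"department": "radiology"})
print(clip_id)
print(cdf.decode("utf-8"))
```

In this sketch, changing either a BLOB or the metadata changes the CDF and therefore the clip ID, which is one way to read the statement that a content address always refers to the same content.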

How CAS Stores a Data Object
1. Client presents data to the API to be archived
2. Unique Content Address is calculated
3. Object is sent to Centera via the Centera API over IP
4. Centera validates the Content Address and stores the object
5. Acknowledgement is returned to the application
6. Clip ID is retained and stored for future use
[Figure labels: Client, API, Application Server, C-Clip (Object), CDF, CAS System]
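
The store sequence can be approximated with a short Python sketch. It is a simplification under stated assumptions: `CasSystem`, `ClientApi`, and the `catalog` dictionary are hypothetical stand-ins, not the Centera API; SHA-256 is an assumed hash; and the content address is computed over the raw object rather than over a C-Clip Descriptor File.

```python
import hashlib


def content_address(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()  # assumed hash function


class CasSystem:
    """Hypothetical stand-in for the CAS system side."""

    def __init__(self):
        self._objects = {}

    def store(self, address: str, data: bytes) -> str:
        # Validate the content address against the received object before storing.
        if content_address(data) != address:
            raise ValueError("content address does not match object")
        self._objects[address] = data
        # Returning the address plays the role of the acknowledgement.
        return address


class ClientApi:
    """Hypothetical stand-in for the API on the application server."""

    def __init__(self, system: CasSystem):
        self._system = system

    def archive(self, data: bytes) -> str:
        address = content_address(data)            # address is calculated
        return self._system.store(address, data)   # object is sent to the system


system = CasSystem()
api = ClientApi(system)
catalog = {}  # the application's own record of clip IDs
catalog["claim-form"] = api.archive(b"scanned claim form")  # ID kept for future use
print(catalog)
```

The last step matters in practice: because the object has no path or filename inside the CAS, the application must keep the returned ID itself in order to find the object again.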

How CAS Retrieves a Data Object
1. Object is needed by an application
2. Application finds the Content Address of the object to be retrieved
3. Retrieval request is sent to the CAS via the CAS API over IP
4. CAS authenticates the request and delivers the object
[Figure labels: Client, API, Application Server, C-Clip ID, CAS System]
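
The retrieval steps can be sketched the same way. The slides do not describe how authentication is performed, so the allow-list of application names below is purely an assumption, as are the `CasSystem` class, its method names, and the SHA-256 hash; the integrity check on read is an extra step included for illustration.

```python
import hashlib


class CasSystem:
    """Hypothetical CAS system with a naive allow-list for authentication."""

    def __init__(self, authorized_apps):
        self._authorized = set(authorized_apps)
        self._objects = {}

    def store(self, data: bytes) -> str:
        address = hashlib.sha256(data).hexdigest()  # assumed hash function
        self._objects[address] = data
        return address

    def retrieve(self, app_name: str, address: str) -> bytes:
        # Authenticate the request before delivering anything.
        if app_name not in self._authorized:
            raise PermissionError("application is not authorized")
        data = self._objects[address]
        # Re-check the content against its address before handing it back.
        if hashlib.sha256(data).hexdigest() != address:
            raise IOError("stored object failed its integrity check")
        return data


cas = CasSystem(authorized_apps=["records-app"])
catalog = {"claim-form": cas.store(b"scanned claim form")}

# The application looks up the address it retained earlier, then asks the CAS.
address = catalog["claim-form"]
print(cas.retrieve("records-app", address))
```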

CAS Features
Features available with most CAS systems are:
- Integrity checking
- Data protection
  - Local replication
  - Remote replication
- Load balancing
- Scalability
- Self-diagnosis and repair
- Report generation and event notification
- Fault tolerance
- Audit trails
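
Integrity checking and self-repair can work together when each object has a replica, as the following Python sketch suggests. It is not how any particular product implements these features: the two dictionaries standing in for a primary copy and a replica, the `scrub` function, and the SHA-256 hash are all assumptions made for illustration.

```python
import hashlib


def address_of(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()  # assumed hash function


def scrub(primary: dict, replica: dict) -> list:
    """Re-hash every object in both copies and repair a bad copy from a good one.

    Returns the addresses that were repaired. Objects whose copies are both
    bad are left untouched, since there is nothing trustworthy to copy from.
    """
    repaired = []
    for address in sorted(set(primary) | set(replica)):
        primary_ok = address in primary and address_of(primary[address]) == address
        replica_ok = address in replica and address_of(replica[address]) == address
        if primary_ok and not replica_ok:
            replica[address] = primary[address]
            repaired.append(address)
        elif replica_ok and not primary_ok:
            primary[address] = replica[address]
            repaired.append(address)
    return repaired


data = b"surveillance frame"
addr = address_of(data)
primary = {addr: data}
replica = {addr: b"bit rot"}  # simulated media degradation on the replica
print(scrub(primary, replica))
print(replica[addr] == data)
```

The content address is what makes the check possible without any external checksum database: each copy can be validated against its own key.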

Example 1: CAS Healthcare Solution
- Each X-ray image ranges from about 15 MB to over 1 GB
- Patient records are stored online for a period of 60-90 days
- Beyond 90 days, patient records are archived
[Figure labels: Hospital, Patient Studies, Application Server, API, CAS System; stored locally for short-term use (60 days); data stored on CAS]

Example 2: CAS Financial Solution
- Check image size is about 25 KB
- A check imaging service provider may process 50-90 million check images per month
- Checks are stored online for a period of 60 days
- Beyond 60 days, data is archived
[Figure labels: Bank, Application Server, API, CAS System]
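
The figures on this slide translate into a rough capacity estimate, worked through below in Python. The calculation assumes decimal units (1 KB = 1,000 bytes, 1 TB = 10^12 bytes) and a 30-day month, neither of which the slide specifies; the function names are made up for the example. Under those assumptions, 50 to 90 million images of 25 KB each come to about 1.25 to 2.25 TB of new data per month, and a 60-day online window holds about 2.5 to 4.5 TB.

```python
KB = 1000        # decimal kilobyte (an assumption; the slide does not say)
TB = 1000 ** 4   # decimal terabyte

CHECK_IMAGE_BYTES = 25 * KB  # "about 25 KB" per check image


def monthly_volume_tb(images_per_month: int) -> float:
    """New check-image data created per month, in TB."""
    return images_per_month * CHECK_IMAGE_BYTES / TB


def online_volume_tb(images_per_month: int, online_days: int = 60,
                     days_per_month: int = 30) -> float:
    """Data held online before it is archived, assuming a steady monthly rate."""
    return monthly_volume_tb(images_per_month) * online_days / days_per_month


for images in (50_000_000, 90_000_000):
    print(images, monthly_volume_tb(images), online_volume_tb(images))
```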

Lesson Summary
Key points covered in this lesson:
- CAS architecture
- Physical and logical elements of CAS
- CAS storage and retrieval process
- CAS solution examples

Chapter Summary
Key points covered in this chapter:
- Benefits of a CAS-based storage strategy
- Overview of physical and logical elements of CAS
- Storing and retrieving data from CAS
- CAS application examples

Concept in Practice: EMC Centera
Centera Architecture
- Based on RAIN (Redundant Array of Independent Nodes)
[Figure labels: Access Node, Storage Node, Content, Mirrored Content, To Server, Storage Nodes, Access/Storage Nodes (numbered 1 to 6), Ethernet LAN Switch, Private LAN, Ethernet Switch, Power Rails]
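
The figure pairs "Content" with "Mirrored Content" on different storage nodes. A minimal Python sketch of that idea follows. It is not Centera's actual placement logic, which the slides do not describe: the rule of choosing a primary node from the address and mirroring to the next node, the `StorageNode` and `Cluster` classes, and the SHA-256 hash are all assumptions made for illustration.

```python
import hashlib


class StorageNode:
    def __init__(self, name: str):
        self.name = name
        self.objects = {}
        self.online = True


class Cluster:
    """Toy array of independent nodes; every object is kept on two of them."""

    def __init__(self, node_names):
        if len(node_names) < 2:
            raise ValueError("mirroring needs at least two nodes")
        self.nodes = [StorageNode(name) for name in node_names]

    def placement(self, address: str):
        # Assumed rule: the address picks a primary node, the next node holds the mirror.
        first = int(address, 16) % len(self.nodes)
        second = (first + 1) % len(self.nodes)
        return self.nodes[first], self.nodes[second]

    def store(self, data: bytes) -> str:
        address = hashlib.sha256(data).hexdigest()  # assumed hash function
        for node in self.placement(address):
            node.objects[address] = data
        return address

    def retrieve(self, address: str) -> bytes:
        # Serve from whichever copy is on a node that is still online.
        for node in self.placement(address):
            if node.online and address in node.objects:
                return node.objects[address]
        raise LookupError("no online node holds this object")


cluster = Cluster(["node1", "node2", "node3", "node4"])
address = cluster.store(b"archived e-mail")
primary, mirror = cluster.placement(address)
primary.online = False  # simulate a node failure
print(cluster.retrieve(address))
```

With two copies on separate nodes, a single node failure does not make the object unavailable, which is the property the redundant-array design is meant to provide.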