The Content Intelligence Company

Slides:



Advertisements
Similar presentations
T HE V ALUE OF E NTERPRISE S EARCH Robert Gill & Pieter-Jan De Boeck.
Advertisements

School Systems Learn the Value of Document Management to Better Serve Students, Parents and Staff and Presented By:
Presentation by Priyanka Sawarkar
© 2007 IBM Corporation Enterprise Content Management Integrating Content, Process, and Connectivity for Competitive Advantage Malcolm Holden October 2007.
2  Industry trends and challenges  Windows Server 2012: Modern workstyle, enabled  Access from virtually anywhere, any device  Full Windows experience.
© 2007 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice HP TRIM HP Information Management.
© Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. trans for ma tion : a.
Misys Treasury & Capital Markets
1 Exchange Management : Archiving & Storage Do you really need 3rd party archiving with Exchange 2010? James Bushell Symantec Enterprise.
SAP BI ConnectorDuet Enterprise for Microsoft SharePoint and SAP SAP NetWeaver Gateway productivity accelerator for Microsoft Synch enterprise data.
11© 2011 Hitachi Data Systems. All rights reserved. HITACHI DATA DISCOVERY FOR MICROSOFT® SHAREPOINT ® SOLUTION SCALING YOUR SHAREPOINT ENVIRONMENT PRESENTER.
© 2008 Kroll Ontrack Inc.| Ontrack PowerControls 5.1 The ultimate “power tool” for SharePoint administrators.
On Privacy-aware Information Lifecycle Management (ILM) in Enterprises: Setting the Context Marco Casassa Mont Hewlett-Packard.
Agenda Symantec Enterprise Vault 1 Today’s Management Challenges 1 Why Management? 2 The Solution: Symantec Enterprise Vault 3 Benefits & Closing.
Libraries and Institutional Content Management Systems
Mel Pless, Sr. Director, Solutions Consulting Guidance Software, Inc. Let’s Get Right To The Endpoint Leveraging Endpoint Data to Expose,
1 1© 2011 Hitachi Data Systems. All rights reserved. FILE ARCHIVING SOLUTION WITH ARKIVIO® AUTOSTOR® PRESENTER NAME DATE FILE ARCHIVING SOLUTION WITH ARKIVIO®
Why Information Governance….instead of Records & Information Management? Angela Fares, RHIA, CRM, CISA, CGEIT, CRISC, CISM or
By Helen Streck President/CEO Kaizen InfoSource LLC Litigation Readiness: Information Manager’s Role.
© 2008 IBM Corporation ® 1 ECM Product Vision & Strategy Ken Bisconti Vice President, ECM Products and Strategy IBM Software Group February 2009.
STEALTH Content Store for SharePoint using Caringo CAStor  Boosting your SharePoint to the MAX! "Optimizing your Business behind the scenes"
Document Solutions Document Solutions William Zastrow President, CEO FileMark Corporation September 24, 2008 Document Solutions Document Solutions .
Digital Imaging Services Digital Imaging Services – We take information from any format (i.e. Paper, Microfilm, Microfiche, Digital, etc.) and move it.
Archiving s. How to Manage Auto-Archive in Outlook Your Microsoft Outlook mailbox grows as you create and receive items. To manage the space.
March 2014 Basic Content Management Tuffolo Group Perspective TUFFOLO.
Presented to AIIM William Penn Chapter Meeting 5/13/08.
Archiving Best Practices with Symantec Enterprise Vault 8.0 Lou Zeidman Regional Sales Manager, Information Risk Management.
Business Productivity Infrastructure Optimization Campaign 1 Agenda: BPIO Partner Sales Readiness Workshop Day 3: Topic: Enterprise Content management.
Microsoft.com/publicsector Records Management Microsoft Records Management for Government Agencies.
ControlPoint The Eleventh Hour Presentation and ControlPoint Demonstration Abdullah Noman October, 2015.
OpenText EIM for SAP In a Nutshell. OpenText ©2013 All Rights Reserved. 2 An integrated portfolio designed for SAP best-run businesses harnessing market.
Archiving & Enterprise Content Management from Infocrew Solutions Pvt.Ltd.
Integrating Alfresco with Salesforce. Agenda About Technology Services Group Why a Salesforce / Alfresco Integration Use Cases / Examples Technical Architecture.
Copyright © 2013 Avaali. All Rights Reserved. 1 SAP OpenText ECM Solutions: Travel Receipts Management.
5 REASONS TO CLEAN UP THE DIGITAL LANDFILL Presentation to Boston ARMA September 12, 2011 Brent G. Stanley.
Copyright © 2013 Avaali. All Rights Reserved. 1 SAP OpenText ECM Solutions: Vendor Invoice Management (VIM)
Barracuda Networks. Safe Public Cloud Transitions Why Barracuda? The Challenge When organizations move workloads to the public cloud, data protection.
Gabor Fari April 26, 2007.
Alfresco – Protecting Trafigura’s Corporate Assets Beverley Verster, Head of IT Back Office & Corporate Systems.
A/P Processes and Governance: The Building Blocks for Compliance and Productivity Warren D’Avirro – Sales Director David Hutt – Solution Engineer.
10/16/2017 7:22 AM © 2014 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION.
Advanced Endpoint Security Data Connectors-Charlotte January 2016
Portal Content Management (PCM) & Portal Site Management (PSM)
Robert Lejnert HPE Information Management & Governance, CEE.
Technology Market Trends Understanding ECM
Digital Transformation
The effort-saving, cost-cutting, low-overhead, cloud capture platform.
Data Minimization Framework
92% of the world’s data was created in the past 2 years
Taming the Wild Unstructured Data: The Shared Drive Jungle
Brandon Botes #SPSJHB Records Management – Friend or Foe ???
Brandon Botes #SPSDBN Records Management – Friend or Foe ???
Proactive Information Management and eDiscovery
Reducing Cost and Risk During an Investigation
eDiscovery & Information Governance Think Tank
Searchable. Secure. Simple.
Analysing and Classifying Data at Rest
Governance, Risk Management & Compliance (GRC) Market Share, Segmentation, Report 2024
Varonis Overview.
Big Red Cloud Offers a Simple Online Accounts Solution for Business Owners and Bookkeepers Hosted on the Powerful Microsoft Azure Platform MICROSOFT AZURE.
Druva inSync: A 360° Endpoint and Cloud App Data Protection and Information Management Solution Powered by Azure for the Modern Mobile Workforce MICROSOFT.
Searchable. Secure. Simple.
Agolo Summarization Platform Integrates with Microsoft OneDrive to Relate Enterprise Cloud Documents with Real-Time News Summaries OFFICE 365 APP BUILDER.
Managing Content: You Need To Think About More Than Office 365
FileFacets Information Governance Solution Performs High-Quality Automated Enterprise Content Management Migration, Built on Azure MICROSOFT AZURE APP.
Compliant Information Management and the eDiscovery Challenge
Case Study One organization’s journey to full in-place records compliance using SharePoint
Brandon Botes #SPSJHB Records Management – Friend or Foe ???
AI-Powered Information Governance
Microsoft Data Insights Summit
Presentation transcript:

The Content Intelligence Company Eric Rossborough Bytes, Basics and Beyond March 2017

About us Haystac The Content Intelligence Company Privately held & self-funded Launched in 2014 Headquarters in Newton, Massachusetts ~ 20 employees Working with Engineering and Operations, we identified and classified a large set of scanned documents to address regulatory compliance requirements around key topics. We developed a County-wide solution to classify and extract data points for large volume of scanned images as well as electronic stored information (including emails). The Content Intelligence Company

Situation analysis – Some goofy math For a sense of scale - Some goofy math (just for fun): In shared files today at a large US Bank: Estimates in PB = 10,000 TB = 10,000,000 GB = 150,000,000,000 pages of “Dark Data” 1 box of documents = 0.833 ft high 150,000,000,000 pages = 75,000,000 boxes Bank Building in Atlanta = 310 ft = 372 boxes 10PB = 75,000,000 boxes = 201,612 bank towers = 31 miles Distance from surface of earth to stratosphere = 30 miles The Content Intelligence Company

Why Content Analytics Reduce information security risk Reduce potential hack “strike zone” PII, PCI, HCI, etc. Confidential or restricted content Lower storage management costs Reliably identify Relevant vs Redundant, Obsolete, and Trivial (ROT) content Improve accuracy and speed of content searches Consistently apply best practices for Information Governance. Minimize end-user impact on content indexing Eliminates ROT data Accelerate document- based business processes Commercial/retail loan origination Forensic accounting Dynamically classify content according to business value and events Mergers and acquisitions Litigations and e-discovery Audits Report on content for advanced analytics The Content Intelligence Company

Cross-Industry Use Cases Storage management and legacy information cleanup IT cost reduction Information governance Corporate and regulatory compliance Information Security Sensitive PII/PCI/PHI content identification and remediation Retention/disposition content Data Monetization Data Migration Litigation and E-Discovery acceleration Process improvement initiatives Mergers and acquisitions Document analytics The Content Intelligence Company

Cross-Industry Use Cases - Examples Large US Bank – Cost reduction and Information Security 25 PB of content in file shares - $100 M/year expenditure and growing 6 versions of File Net, SharePoint – expensive to maintain, poor user value Large stream of digitized paper coming from business (retail banking in particular) Large Electrical Utility – Info security and governance 6 PB of content in OpenText Content Server + x PB in FileShares Under corporate mandate to universally develop and apply retention and disposition policies Integrated Oil and Gas – Acquisition (Data Migration) Mandate to migrate from ECM (OpenText, SharePoint) and file Shares to Corporate ECM (Documentum) Large Canadian Bank – Migration and Governance Over 3,000 applications running on Notes Corporate mandate to migrate content to Corporate ECM (Documentum) Over 5 PB of contents derived from acquisition Unknown value and risk of content Large volume of PST files The Content Intelligence Company

Our discussion today – Large US Bank Historically, long term storage of XXXX’s information assets (data and e-documents) has supported an environment where structured and unstructured information is over-retained, and disposed of infrequently and inconsistently. User-created records can be stored anywhere Little or no retention or Lifecycle Governance (value vs risk) Lack of search findability Not always secure – can contain PHI, PII orother sensitive information Increased cost for e-discovery, storage, and backups Increased RISK ! The Content Intelligence Company

Current Unstructured Content Environment Problem Statement Using Indāgō Content Analytics - Crawled and Indexed all NAS Drives – both personal and shared drives Presented our findings from a high level review of the primary NAS storage environments: Surfaced current storage size of ~1.9 PB and corresponding managed storage costs of ~$18 MM / yr. Initial estimates have surfaced that operationalizing the disposal of unnecessary data could reduce storage expenses by ~$10MM in year one (with organic growth / ROT reduction assumptions). In an effort to validate the size of the opportunity, there is a need to interrogate storage environments and quantify business benefits associated with disposing of “ROT” data (Redundant, Obsolete, Transient) as defined by corporate policies. Current Unstructured Content Environment The Content Intelligence Company

High Level Findings Environment ROT Summary – 6/08/2016 The Content Intelligence Company

Business Case Unstructured Cleansing * Financial Impact 1) Prohibited File Types 7% Review File List Haystac to ID Data Mitigate 2) Non-Accountable Data 8% Abandoned Home Shares Orphaned Home Shares N/A Data in Common Shares 3) Aging Data 25% Home Share Data 2+ Years (85 TB) Records past retention Common Share or Shared Drives 4) Duplicate Data 10% Home Shares Shared Drives Across Enterprise Financial Impact Using the same data that was originally presented: $10M/y x 50% = $ 5M/y Using 40% New Data Growth Rate (NDGR) $25 M saved over 5 years * Very Conservative percentages with Content Analytics The Content Intelligence Company

What is Haystac Indāgō Comprehensive and scalable Content Analytics Machine learning and Visual Content Intelligence Searches, crawls, profiles and clusters unstructured data repositories File-shares, Email, Google Drive, Enterprise Content Management (ECM), SharePoint, Office365, etc. Identifies ROT and Sensitive data Automatically profiles and clusters relevant data Manages content and metadata in-place within ECM Connectors to FileNet, Documentum, OpenText ContentServer, SharePoint, Google Drive, etc. 600+ files types, including scanned and pdf documents Applies dynamically known or derived classification model Applies visual classification to scanned and pdf content Applies retention policies to content Automatically extracts data points from content Auto-indexes electronic documents Targeted OCR (visual anchor) for scanned and pdf documents The Content Intelligence Company

What is Move to Manage – Process © 2012 Capgemini – All rights reserved What is Move to Manage – Process Identifies ROT, Non-records, Dups and Near-Dups to reduce volume of content to be moved Tags sensitive content 1 File Share File Share File Share File Share FileNet ROT Non-records Dups SharePoint 2 Records Leverage existing protocols, connectors and accelerators File Share File Share File Share File Share The Content Intelligence Company © 2012 Capgemini – All rights reserved

What is Manage in Place – Process © 2012 Capgemini – All rights reserved What is Manage in Place – Process Crawl based inventory of content and meta-data Google Drive Aodocs Likely daily syndication Published reports of meta-data updates Haystac Indago integrates with key ECM systems and classifies content, providing decision support Disposition or management of content will happen at system of record System of Management responsible for CRUD action (Create, Replace, Update, Delete) The Content Intelligence Company © 2012 Capgemini – All rights reserved

Unleash the Power of Content Understand, Classify, Act Director of Sales: Eric Rossborough – erossborough@haystac.com The Content Intelligence Company