Data Administration SIG

Slides:



Advertisements
Similar presentations
Nokia Technology Institute Natural Partner for Innovation.
Advertisements

‘Cash is King’ Mohamed Samee Khan Channel Head - SAP MENA Channel Head - SAP MENA.
Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
Query, Analysis and Reporting Tools Brian BALSER Lamia BENKIRANE Jeralyn PASINABO Dave WILSON MBA 664 April, the 13 th, 2009.
Copyright © 2008 SAS Institute Inc. All rights reserved. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks.
Meeting of the Management of Statistical Information System (MSIS 2014) Innovation – Open Data Initiative of Government of India – Fostering Innovations,
Big Data A big step towards innovation, competition and productivity.
© 2012 TeraMedica, Inc. Big Data: Challenges and Opportunities for Healthcare Joe Paxton Healthcare and Life Sciences Sales Leader.
Basic Concepts in Big Data
Real world cloud computing challenges Giedrius Markevičius Territory account manager.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
August 27, 2008 Platform Market, Business & Strategy.
Big Data. What is Big Data? Big Data Analytics: 11 Case Histories and Success Stories
Business Productivity Infrastructure Optimization The Business Productivity Infrastructure Optimization Campaign For Microsoft Office 2007 Module 25 –
Datawarehouse A sneak preview. 2 Data Warehouse Approach An old idea with a new interest: Cheap Computing Power Special Purpose Hardware New Data Structures.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
1.less than 3 million. 2.less than 10 million. 3.over 23 million. 4.over 100 million. 5.Not sure In the U.S., the number of managers that rely on Information.
© 2007 IBM Corporation IBM Information Management Accelerate information on demand with dynamic warehousing April 2007.
Freedom to think: The Science of Data Dr Quentin Williams.
Big Data: Electronic Gold And why Oreus should invest in Big Data Thomas Snuverink.
What’s the Big Deal about Big Data? Jennifer Lewis Priestley, Ph.D. Professor of Statistics and Data Science.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
BUSINESS INTELLIGENCE & ADVANCED ANALYTICS DISCOVER | PLAN | EXECUTE JANUARY 14, 2016.
What is Big Data? Refers to datasets whose size is beyond the ability of typical databases software tools to capture, store, manage and analyze. Big data.
© 2007 IBM Corporation IBM Software Strategy Group IBM Google Announcement on Internet-Scale Computing (“Cloud Computing Model”) Oct 8, 2007 IBM Confidential.
AN OPPORTUNITY TO ENTER INTO A JOINT VENTURE PARTNERSHIP WITH NTIS FOR DATA INNOVATION NTIS Information Session Department of Commerce Auditorium and Webcast.
© 2016 Global Market Insights, Inc. USA. All Rights Reserved Retail Analytics Market to grow at 19% CAGR from 2017 to 2024: Global Market.
The Rise of Data CHAOS Driving Growth through Digital Transformation.
SAP Process Mining by Celonis
INDUSTRY 4.0: FROM THINGS TO OUTCOMES
Data warehousing AND Data mining PRESENTED by N.GANESH (10QF1A0447)
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
Internal primer Transform Your Products
Introduction BIM Data Mining.
WEBINAR The Top Emerging Technologies To Watch: 2017 To 2021
BIG Data 25 Need-to-Know Facts.
Mike Gualtieri, Principal Analyst
Big-Data Fundamentals
Driving Digital Business with SAP Digital Business Services
1&1 Internet AG: Optimizing Debt Management
Predictions For The Future of Cloud Computing. Cloud Services In India.
CIS 515 STUDY Lessons in Excellence-- cis515study.com.
Information Systems in Organizations 1.1 Introduction to MIS
Information Systems in Organizations 1.1 Introduction to MIS
SAP Business Transformation Study | Healthcare | Providence St
Big Data.
Information Systems in Organizations 1.1 Introduction to MIS
Information Systems in Organizations 1.1 Introduction to MIS
Transforming IT at HM Revenue & Customs
Stop Data Wrangling, Start Transforming Data to Intelligence
High Tech and Industrial Manufacturing
Information Systems in Organizations 1.1 Introduction to MIS
Big Data.
XtremeData on the Microsoft Azure Cloud Platform:
Cloud adoption NECOOST Advisory | June 2017.
Shell – Bringing global business closer together with IBM and SAP
Information Systems and Business Intelligence
Information Systems in Organizations 1.1 Introduction to MIS
Business Intelligence
Big DATA.
Labour Market Information (LMI) What does it tell us?
Information Systems in Organizations 1.1 Introduction to MIS
Information Systems in Organizations 1.1 Introduction to MIS
Analytics, BI & Data Integration
KEY INITIATIVE Financial Data and Analytics
Presentation to Primary Health Alliance 7 June 2019
UNIT 6 RECENT TRENDS.
The Intelligent Enterprise and SAP Business One
Presentation transcript:

Data Administration SIG Mike Fary Enterprise Data Architect University of Chicago November 8, 2012

Big Data Data Management Challenges Abound -or- We thought we had problems before ...

Big Data is not new Late 19th century 1890 census US population reached 63 million How to deal with all that data Machine readable punched cards, invented by Herman Hollerith

“Big data is really about new uses and new insights, not so much the data itself” Rod Smith, IBM technical fellow and VP for Emerging Internet Technologies

OBAMA ADMINISTRATION UNVEILS “BIG DATA” INITIATIVE: ANNOUNCES $200 MILLION IN NEW R&D INVESTMENTS March 29, 2012 Aiming to make the most of the fast-growing volume of digital data, the Obama Administration today announced a “Big Data Research and Development Initiative.” By improving our ability to extract knowledge and insights from large and complex collections of digital data, the initiative promises to help solve some the Nation’s most pressing challenges. To launch the initiative, six Federal departments and agencies today announced more than $200 million in new commitments that, together, promise to greatly improve the tools and techniques needed to access, organize, and glean discoveries from huge volumes of digital data.

To make the most of this opportunity, the White House Office of Science and Technology Policy (OSTP)—in concert with several Federal departments and agencies—created the Big Data Research and Development Initiative to: Advance state-of-the-art core technologies needed to collect, store, preserve, manage, analyze, and share huge quantities of data. Harness these technologies to accelerate the pace of discovery in science and engineering, strengthen our national security, and transform teaching and learning; and Expand the workforce needed to develop and use Big Data technologies.

McKinsey If US healthcare were to use Big Data creatively and effectively to drive efficiency and quality the sector could create more than $300 Billion in value every year.

McKinsey In the developed economies of Europe, government administrators could save more than $149 Billion in operational efficiency improvements alone by using Big Data (not including using Big Data to reduce fraud and errors and boost the collection of tax revenues).

McKinsey Users of services enabled by personal location data could capture $600 Billion in consumer surplus. “Big data: The next frontier for innovation, competition, and productivity” McKinsey Global Institute, May 2011

Big Data WalMart – 1 million customer transactions every hour; into a 2.5 petabyte database Facebook – 40 billion photos Large Synoptic Survey Telescope – due in 2016, 140 terabytes every 5 days

More Big Data Equifax – 800 billion records, 26 petabytes of data 500 million consumers and 81 million businesses worldwide “We know more about you than you would care for us to know” Equifax CIO Dave Webb

With all that data …. How do we manage it ?

Ok, so we’re going to do this … how do we access the data ? what data do we access ? in what format ? is it free ? what do I use to get at it ? if it’s that big, how do I deal with it ?

How do we think about data classification when dealing with Big Data ? public ? confidential ? somewhere in the middle …..

Implications personal privacy legal/liability FERPA HIPAA national security

Data is now available for analysis in raw form Not confined by structured environments like DBMSs Data.gov City of Chicago data portal Chicago restaurant inspection data Others – thousands

Open Government Directive Dec. 8, 2009 required agencies to register at least three new high-value datasets on Data.gov by January 22, 2010 continuing submissions in accordance with Open Government Directive provisions as of 10/9/2012 378,529 data sets

Data.gov How were the datasets in Data.gov selected? data.gov was initially launched in May 2009 with a limited number of Federal datasets and tools. datasets that already enjoy a high degree of consensus around definitions some agencies offered data extraction and mining tools, and widgets

Analytics understanding the problem vs. “Free Wheeling” knowing the data knowing the value of the data

What effect will Big Data have on research at your institution ? pre research analysis analysis of research output post analysis in combination with other research integration with other data – How ?

Effects on Higher Ed University of Kentucky example (SAP HANA) Queries executed 350 times faster Get out of ETL business Reallocating IT staff to assist business users with modeling needs Develop breakthrough business applications and run real-time analytics with SAP HANA. This powerful, in-memory computing platform delivers a next-generation technology stack – all in one package. SAP HANA transforms business by streamlining applications, analytics, planning, predictive analysis, and sentiment analysis on a single platform so business can operate in real-time. SAP and partner solutions powered by SAP HANA have everything you need to build a personalized customer experience and to reach your business goals.

Implications ? Data warehousing vs. analytics ? Exist side by side ? Existing staff skill sets vs. new skills

ECAR Report “Analytics in Higher Education” Jacqueline Bichsel, August 2012  

Who uses the data ? Institutional Research group Executive staff Deans/Faculty What does that say about level of detail quality of the data “freshness” of the data

Questions ?