Magdalena Balazinska C OMPUTER S CIENCE & E NGINEERING -- U OF W ASHINGTON The Hidden Face of Cloud Data.

Slides:



Advertisements
Similar presentations
Virtualization Group FIND Meeting. Does Virtualization aid Security? Depends what you build on top –Were only providing mechanisms, not solutions Better:
Advertisements

The BigFrame Team Duke University, Hong Kong Polytechnic University, and HP Labs.
Windows IT Pro magazine Datacenter solution with lower infrastructure costs and OPEX savings from increased operational efficiencies. Datacenter.
Cloud Computing: Theirs, Mine and Ours Belinda G. Watkins, VP EIS - Network Computing FedEx Services March 11, 2011.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
7.RP - Analyze proportional relationships and use them to solve real-world and mathematical problems. 1. Compute unit rates associated with ratios of.
UNCLASSIFIED: LA-UR Data Infrastructure for Massive Scientific Visualization and Analysis James Ahrens & Christopher Mitchell Los Alamos National.
Microsoft Provides Complete Information Platform, On Your Terms On Premises & Private CloudPublic Cloud Extend any data Extend anywhere.
NJIT Use Case Model Operation Contracts Prepared By: Sumit Sharma.
Clementine Server Clementine Server A data mining software for business solution.
1 A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions Zhihong Zeng, Maja Pantic, Glenn I. Roisman, Thomas S. Huang Reported.
1© Copyright 2015 EMC Corporation. All rights reserved. SDN INTELLIGENT NETWORKING IMPLICATIONS FOR END-TO-END INTERNETWORKING Simone Mangiante Senior.
Resource Management in Data-Intensive Systems Bernie Acs, Magda Balazinska, John Ford, Karthik Kambatla, Alex Labrinidis, Carlos Maltzahn, Rami Melhem,
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Introduction to Data Science Kamal Al Nasr, Matthew Hayes and Jean-Claude Pedjeu Computer Science and Mathematical Sciences College of Engineering Tennessee.
System Center: Accelerating Growth in the hybrid Cloud Microsoft Hosting Service Providers Conversation #2 1.
Copyright © 2014 Pearson Education, Inc. 1 It's what you learn after you know it all that counts. John Wooden Key Terms and Review (Chapter 6) Enhancing.
SM STRATA PRESENTATION Tim Garnto - SVP Engineering, edo Interactive Rob Rosen – Big Data Field Lead, Pentaho.
Security and Privacy Services Cloud computing point of view October 2012.
Karolina Muszyńska. Reverse engineering - looking at the solution to figure out how it works Reverse engineering - breaking something down in order to.
COOPERaTE Control and Optimization of Energy Positive Neighbourhoods Energy Management and Sharing The COOPERaTE Experience COOPERaTE ICT Approach Keith.
Dr. Tucker Balch Associate Professor School of Interactive Computing Computational Investing, Part I 221: Intro to Machine Learning Find out how modern.
MIT DB GROUP. People Sam Madden Daniel Abadi (Yale)Daniel Abadi Magdalena Balazinska (U. Wash.)Magdalena Balazinska.
Benchmarking Interactive Social Networking Actions Shahram Ghandeharizadeh Director of Database Lab Computer Science Department University of Southern.
Use Case Model Operation Contracts Chapter 11 Applying UML and Patterns Craig Larman.
1 ICDM 2004 Business Meeting 11/4/2004 Data Mining on ICDM Submission Data Shusaku Tsumoto Ning Zhong and Xindong Wu.
Application Policy on Network Functions (APONF) G. Karagiannis and T.Tsou 1.
Washington State Office of Insurance Commissioner State Insurance Management & Business Application Project Recap November 2007.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
MIS 451 Building Business Intelligence Systems Data Analysis.
8/20/2013NIST Big Data WG / Roadmap Subgroup1 Architecture Storage Architecture Processing Architecture Resource Managers Architecture Infrastructure Architecture.
Yonglei Tao School of Computing & Info Systems GVSU Ch 7 Design Guidelines.
Information Systems in Organizations
SUPPLY CHAIN OF BIG DATA. WHAT IS BIG DATA?  A lot of data  Too much data for traditional methods  The 3Vs  Volume  Velocity  Variety.
What we know or see What’s actually there Wikipedia : In information technology, big data is a collection of data sets so large and complex that it.
Data Resource Management Agenda What types of data are stored by organizations? How are different types of data stored? What are the potential problems.
noun ; Software Defined Enterprise/SDE/ The enterprise who leverages software to flank their traditional business offerings, or to create entirely new.
Information Systems in Organizations Managing the business: decision-making Growing the business: knowledge management, R&D, and social business.
Computer/Human Interaction Fall 2015 Northeastern University1 Name of Interface Tagline if you have one Team member names and schools/years Team member.
Overview + Digital Strategy + Interactive Engineering + Experience Design + Product Incubation + Data Visualization and Discovery + Data Management.
A New BI Paradigm Data Discovery Brad Peterman Enterprise Client Deployments QlikTech, Inc.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Introduction to OLAP and Data Warehouse Assoc. Professor Bela Stantic September 2014 Database Systems.
Going Hybrid – part 2 Moving to Hybrid Cloud with Windows Azure Virtual Machines & System Center 2012 R2.
BIG DATA. The information and the ability to store, analyze, and predict based on that information that is delivering a competitive advantage.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
What is the Big Data Challenge? Organizations are seeking solutions that combine the real-time analytics capabilities of SAP HANA and accessibility to.
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
6. (supplemental) User Interface Design. User Interface Design System users often judge a system by its interface rather than its functionality A poorly.
Challenges facing data- enabled interdisciplinary training.
H2020 Big Data Lighthouse Pilot DataBio
Data Platform Modernization
Information Systems in Organizations
Data and Analytics Diagram Template
BIG DATA IN ENGINEERING APPLICATIONS
Microsoft Build /22/ :52 PM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
Sas is open (for business)
Data Platform Modernization
Today’s Business Pain Points
Big Data.
Assistant Vice President and Chief Technology Officer
Automation Committee Workshop Presentation 2
Technical Capabilities
Big DATA.
Transformations I can translate an object
Caveonix Solution Diagram Template
DBA Situational Decision Automation Diagram Template
Presentation transcript:

Magdalena Balazinska C OMPUTER S CIENCE & E NGINEERING -- U OF W ASHINGTON The Hidden Face of Cloud Data Management Magdalena Balazinska - University of Washington1

2 Data Management in the Cloud Problem 1: Fast Well defined research problems Pretty well defined research problems Poorly defined research problems Pb 2: Efficient (multitenant) Pb 3: Transformative (for developers and users) Three Types of Challenges

Pb 1: Fast Cloud Data Management Well-known challenges around all V’s – Volume, velocity, variety, etc. But: Need to incentivize and recognize translation of real-world use-cases into research problems – Description of application – Dataset(s) to use for testing solutions – List of current solutions and their performance – It’s too costly when everyone does it over and over again Magdalena Balazinska - University of Washington3

Pb2: Efficient Cloud Data Management Multitenancy problem for Cloud providers: – OLTP: Well defined problems (predict, pack, and migrate) – OLAP: Less well defined problems And what about service developers – Build services on top of existing Cloud resources – These services may have tenants of their own Challenges – Resource management in hierarchy of multitenant systems – Performance, elasticity, costs, SLAs Magdalena Balazinska - University of Washington4

Pb3: Transformative Cloud Data Mgmt New interface to data management – How should Cloud SLAs look like? How to avoid surprises? How to reason about Cloud costs and capabilities? – How to do all types of processing at the same time? Graphs, text, streams, etc. Interactive, batch, visual New levels of sharing across users and tenants – How to manage sharing of data and computation? – Data itself may have a price and/or a license agreement – How to support data discovery? quality assessment? Magdalena Balazinska - University of Washington5