Download presentation
Presentation is loading. Please wait.
Published byLoraine Morgan Modified over 9 years ago
1
The Three E’s of Big Data and What DB People can do About Them UC BERKELEY Michael Franklin – UC Berkeley Beckman Database Get Together October 14, 2013
2
The Big Data Problem - Nutshelled TimeQualityMoney 2 Massive Diverse and Growing Data Massive Diverse and Growing Data Something’s gotta give :
3
The 3 E’s of Big Data:
4
Extreme Elasticity - Machines Option #1 – Build your own Cluster/WSC (US East – Saturday Sept 28 @1:30am) Option #3 – Try your luck on the Spot Market Option #2 – Rent Machines from AWS x Servers needed 46K Servers (2010 estimate)
5
Extreme Elasticity - Algorithms Agarwal et al., BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data. ACM EuroSys 2013.
6
Extreme Elasticity - People 6 Incentives Fatigue, Fraud, & other Failure Modes Latency & Prediction Work Conditions Interface Answer Quality Task Structuring Task Routing
7
Extreme Elasticity Algorithms Approximate Answers ML Libraries and Ensemble Methods Active Learning Machines Cloud Computing – esp. Spot Instances Multi-tenancy Relaxed (eventual) consistency/ Multi-version methods People Dynamic Task and Microtask Marketplaces Visual analytics Manipulative interfaces and mixed mode operation
8
The Challenge
9
The Good News: We already know how to do this (kinda)! SQLResultMQL Model ✦ End Users tell the system what they want, not how to get it
10
Query Planner / Optimizer Runtime ML Developer API ML Library MQL Parser (Contracts) Release d July 2013 initial release: Spring 2014 MLbase: Progress
11
For More Information amplab.cs.berkeley. edu franklin@berkeley.e du UC BERKELEY
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.