Page 1 © Hortonworks Inc – All Rights Reserved Hortonworks Naser Ali UK Building Energy Management Group Hadoop: A Data platform for businesses.
There's a change taking place… allow organizations to shift interactions from Reactive (Post Transaction) to Proactive (Pre Decision).
A shift in Advertising: from mass branding to 1x1 Targeting.
A shift in Financial Services: from Educated Investing to Automated Algorithms.
A shift in Healthcare: from mass treatment to Designer Medicine.
A shift in Retail: from static branding to Real-time Personalization.
A shift in Telco: from break then fix to repair before break.
We estimate that within 3 years 50% of the world's data will reside on Hadoop.
Analysts suggest data is doubling in size every 2–3 years. Traditional or not?
SOURCES: Existing Sources (CRM, ERP, Clickstream, Logs). DATA SYSTEM (REPOSITORIES): RDBMS, EDW, MPP. APPLICATIONS: Business Analytics, Custom Applications, Packaged Applications.
Source: IDC. 2.8 ZB in …; …% from New Data Types; 15x Machine Data by …; … ZB by 2020.
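To make the doubling claim concrete, here is a small Python sketch of what "doubling every 2–3 years" implies over a longer horizon. The function name `growth_factor` and the six-year horizon are illustrative assumptions, not figures from the slide.

```python
def growth_factor(years, doubling_period_years):
    """Return how many times larger the data is after `years`,
    assuming it doubles once every `doubling_period_years`."""
    return 2 ** (years / doubling_period_years)

# Hypothetical six-year horizon, evaluated at both ends of the 2-3 year range.
fast = growth_factor(6, 2)  # doubling every 2 years
slow = growth_factor(6, 3)  # doubling every 3 years
print(f"After 6 years: between {slow:.0f}x and {fast:.0f}x the original volume")
```

Under these assumptions, six years of growth means somewhere between 4 and 8 times the starting volume.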
Hadoop stores and processes the data you typically do not or cannot… 1: Data structure. 2: Cost profile.
OLTP, ERP, CRM Systems; Unstructured documents, s; Clickstream; Server logs; Sentiment, Web Data; Sensor, Machine Data; Geolocation
Hadoop enables scalable compute & storage for all data structures.
Current Reality: Apply schema on write. Dependent on IT. Repeatable Process: SQL. Determine list of questions, design solutions, collect structured data, ask questions from list, detect additional questions.
✚ Augment w/ Hadoop: Apply schema on read. Support range of access patterns to data stored in HDFS: polymorphic access. Iterate over structure. Transform and Analyze.
Right Engine, Right Job: Batch, Interactive, Real-time, In-memory.
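The difference between schema on write and schema on read can be sketched in a few lines of Python. This is a minimal illustration, not Hadoop code: the sample records, the field names and the `read_with_schema` helper are hypothetical, and a plain list of JSON strings stands in for raw files stored in HDFS.

```python
import json

# Raw events are kept exactly as they arrived; no schema is enforced at write time.
RAW_LINES = [
    '{"user": "a", "page": "/home", "ms": 120}',
    '{"user": "b", "page": "/cart"}',
    '{"sensor": "t1", "temp_c": 21.5}',
]

def read_with_schema(lines, fields):
    """Apply a schema at read time: project each raw record onto `fields`,
    filling any missing field with None."""
    for line in lines:
        record = json.loads(line)
        yield {name: record.get(name) for name in fields}

# Two different questions, two different schemas, one copy of the stored data.
page_views = [r for r in read_with_schema(RAW_LINES, ["user", "page"])
              if r["page"] is not None]
temperatures = [r for r in read_with_schema(RAW_LINES, ["sensor", "temp_c"])
                if r["sensor"] is not None]

print(page_views)
print(temperatures)
```

With schema on write, the third record would have to be rejected or the table redesigned before loading. Here a new question only needs a new list of fields at read time.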
Hadoop enables scalable compute & storage with a compelling cost profile.
Chart: Fully-loaded Cost Per Raw TB of Data (Min–Max Cost), axis from $0 to $180,000. Compared: MPP, SAN, Engineered System, NAS, HADOOP, Cloud Storage. Commodity Compute & Storage.
Storage Costs/Compute Costs from $19/GB to $0.23/GB
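As a quick check on the scale of the quoted per-GB figures, the arithmetic can be worked through in Python. The helper names are hypothetical, and the conversion assumes decimal units (1 TB = 1000 GB), which the slide does not specify.

```python
GB_PER_TB = 1000  # assumption: decimal terabytes

def cost_per_tb(cost_per_gb):
    """Convert a per-GB cost in dollars to a per-TB cost in dollars."""
    return cost_per_gb * GB_PER_TB

def savings_ratio(old_cost_per_gb, new_cost_per_gb):
    """How many times cheaper the new per-GB cost is than the old one."""
    return old_cost_per_gb / new_cost_per_gb

high, low = 19.0, 0.23  # the two per-GB figures quoted on the slide
print(f"${cost_per_tb(high):,.0f}/TB versus ${cost_per_tb(low):,.0f}/TB")
print(f"Ratio: about {savings_ratio(high, low):.0f}x")
```

Under that assumption the two figures correspond to $19,000 and $230 per TB, a ratio of roughly 83 to 1.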
The Net Result: A modern data architecture capable of storing, processing, correlating, analysing, matching, aggregating, searching and exposing all data & insights…
…when integrated with the right tools capable of delivering the right results.
Our company: Formed from Yahoo! in June 2011 with 24 lead Hadoop engineers. Delivered HDP 1.0 in July. Now lead Hadoop development globally.
Thank you. Questions?