Download presentation
Presentation is loading. Please wait.
Published byBernice Millicent Cox Modified over 9 years ago
1
BIG DATA – WHAT’S THE BIG DEAL The call would start soon, please be on mute. Thanks for your time and patience.
2
WHO AM I Debarchan, from Calcutta with an Indian heart and a global mind. .NET programmer who fell in love with Open Source, specifically Apache Hadoop Author. Community enthusiast. Cricket and music lover. One who gets really scared and bored if tomorrow is exactly like today. Known is a drop, the unknown is an ocean.
3
WHAT IS BIG DATA?
4
How do I optimize my fleet based on weather and traffic patterns? SOCIAL & WEB ANALYTICS LIVE DATA FEEDS ADVANCED ANALYTICS What’s the social sentiment for my brand or products How do I better predict future outcomes? A NEW SET OF QUESTIONS
5
COMMON BIG DATA CUSTOMER SCENARIOS GAIN COMPETITIVE ADVANTAGE BY MOVING FIRST AND FAST IN YOUR INDUSTRY Web app optimization Smart meter monitoring Equipment monitoring Advertising analysis Life sciences research Fraud detection Healthcare outcomes Weather forecasting Natural resource exploration Social network analysis Churn analysis Traffic flow optimization IT infrastructure optimization Legal discovery
6
THE BIG DATA LIFECYCLE
7
MANAGE ANY DATA, ANY SIZE, ANYWHERE 010101010101010101 1010101010101010 01010101010101 101010101010
8
Extremely large volume of unstructured web logs Ad hoc analysis of logs to prototype patterns Hadoop data cluster feeds large 24TB cube Business users analyze cube data E.g. STRUCTURED & UNSTRUCTURED DATA
9
THE BIG DATA LIFECYCLE
10
ENRICH BY CONNECTING TO THE WORLDS DATA Discove r Combine Refine
11
POWER OF COMBINING THE WORLDS DATA Value
12
E.g. VALUE OF EXTERNAL DATA
13
THE BIG DATA LIFECYCLE
14
INSIGHTS ON ANY DATA, ALL USERS, WHEREVER THEY ARE 010101010101010101 1010101010101010 01010101010101 101010101010
15
INSIGHTS FOR ALL USERS THROUGH FAMILIAR TOOLS PB TB GB
16
16 Application written in java for Big Data Processing Uses the “Map-Reduce” Processing Paradigm Characteristics: How is it different from traditional SQL Server? 1. Optimized for distributed storage and computing of data 2. Highly-scalable (scale out model) 3. Commodity HW-based 4. Open Source Very low cost for acquisition and storage Hadoop Data Analytics Dataflow
17
17 Distributed Storage (HDFS) Distributed Processing (Map Reduce)
18
18 Welcome to the Zoo! Need to Know* StreamInsight Good to Know*
19
19 Feed us back Support Team’s blog: http://blogs.msdn.com/b/bigdatasupport/http://blogs.msdn.com/b/bigdatasupport/ Facebook Page: https://www.facebook.com/MicrosoftBigDatahttps://www.facebook.com/MicrosoftBigData Facebook Group: https://www.facebook.com/groups/bigdatalearnings/https://www.facebook.com/groups/bigdatalearnings/ Twitter: @debarchans Read more: http://en.wikipedia.org/wiki/Hadoop http://en.wikipedia.org/wiki/Big_data Next Session: Apache Hadoop – A deep dive
20
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.