PLATFORM FOR BIG DATA, NOSQL AND RELATIONAL DATA. WHAT MAKES SENSE FOR ME? (+AZURE)

Presentation transcript:

WHAT IS BIG DATA?

RoadDesignator  DrivingStatus
A1              Difficulties

                     Batch Processing    Interactive Analysis            Stream Processing
Query runtime        Minutes to hours    Milliseconds to minutes         Never-ending
Data volume          TBs to PBs          GBs to PBs                      Continuous stream
Programming model    MapReduce           Queries                         DAG
Users                Developers          Analysts and developers         Developers
Originating project  Google MapReduce    Google Dremel                   Twitter Storm
Open source project  Hadoop / Spark      Drill / Shark / Impala / HBase  Storm / Apache S4 / Kafka

A NEW SET OF QUESTIONS
SOCIAL & WEB ANALYTICS: What's the social sentiment for my brand or products?
LIVE DATA FEEDS: How do I optimize my fleet based on weather and traffic patterns?
ADVANCED ANALYTICS: How do I better predict future outcomes?

COMMON BIG DATA CUSTOMER SCENARIOS
GAIN COMPETITIVE ADVANTAGE BY MOVING FIRST AND FAST IN YOUR INDUSTRY
- Web app optimization
- Smart meter monitoring
- Equipment monitoring
- Advertising analysis
- Life sciences research
- Fraud detection
- Healthcare outcomes
- Weather forecasting
- Natural resource exploration
- Social network analysis
- Churn analysis
- Traffic flow optimization
- IT infrastructure optimization
- Legal discovery

persistent | distributed
- In Memory
- Efficient at Random Reads/Writes
- Distributed, large scale data store
- Utilizes Hadoop for persistence
- Both HBase and Hadoop are distributed

MANAGE ANY DATA, ANY SIZE, ANYWHERE

HADOOP INTEGRATED INTO THE DATA PLATFORM

Hadoop architecture: Distributed Storage (HDFS) and Distributed Processing (MapReduce)
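To make the processing half of that architecture concrete, here is a minimal single-process sketch of the MapReduce programming model (map, shuffle, reduce) applied to word counting. It is an illustration only: the function names are hypothetical, and it does not use Hadoop or HDFS.

```python
from collections import defaultdict


def map_phase(split):
    """Emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in split.split()]


def shuffle(pairs):
    """Group emitted values by key, as a framework would between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values seen for one key into a single result."""
    return key, sum(values)


def word_count(splits):
    """Run map over every split, then shuffle and reduce the emitted pairs."""
    pairs = [pair for split in splits for pair in map_phase(split)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data big", "data platform"]))
```

In a real cluster each split would be a block stored on a different node and the map and reduce calls would run in parallel; here they run sequentially in one process.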

INSIGHTS FOR ALL USERS THROUGH FAMILIAR TOOLS PB TB GB

Orders_federation CREATE FEDERATION fed_name(fed_key_label fed_key_type distribution_type)

Orders_federation
Federation Key: The key used for data distribution (int, bigint, guid, varbinary).
Atomic Unit: Represents a single instance of a federation key, that is, all rows in all federated tables with the same federation key value.

Federated Table: Contains only the atomic units that fall within the member's key range.
Reference Table: Non-federated table.

SalesDB, Orders_federation
Orders_Fed [5000, 10000)
ALTER FEDERATION Orders_Fed SPLIT AT (tenant_id=7500)
Result: [5000, 7500) & [7500, 10000)
Dynamic Partitioning
- SPLIT members to spread workloads over more nodes
- DROP members to shrink back to fewer nodes
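The effect of SPLIT on a member's key range can be sketched in a few lines. This is only a model of the range arithmetic shown on the slide, with a hypothetical function name; it does not move any data or talk to a database.

```python
def split_member(member, split_at):
    """Split a half-open key range [low, high) into two members at split_at."""
    low, high = member
    if not low < split_at < high:
        raise ValueError("split point must fall strictly inside the member range")
    # The split value becomes the inclusive lower bound of the second member.
    return (low, split_at), (split_at, high)


if __name__ == "__main__":
    print(split_member((5000, 10000), 7500))
```

Because the ranges are half-open, tenant_id 7500 itself lands in the second member after the split.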

SalesDB, Orders_federation
Orders_Fed [5000, 7500) & [7500, 10000)
USE FEDERATION Orders_Fed (tenant_id=7509)
Built-in Data-Dependent Routing (DDR)
- Ensure apps can discover where the data is just-in-time
- No Shard Map caching
- Guaranteed member routing
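Data-dependent routing amounts to finding the member whose half-open range contains the key. The sketch below assumes the two ranges from the slide and uses a binary search over their lower bounds; the names `MEMBER_RANGES` and `route` are hypothetical, and with USE FEDERATION this lookup is done by the service rather than by application code.

```python
from bisect import bisect_right

# Member ranges from the slide, sorted by lower bound (half-open: [low, high)).
MEMBER_RANGES = [(5000, 7500), (7500, 10000)]
MEMBER_LOWS = [low for low, _ in MEMBER_RANGES]


def route(key):
    """Return the member range that holds the given federation key value."""
    index = bisect_right(MEMBER_LOWS, key) - 1
    if index < 0 or key >= MEMBER_RANGES[index][1]:
        raise KeyError(f"no federation member covers key {key}")
    return MEMBER_RANGES[index]


if __name__ == "__main__":
    print(route(7509))
```

Having the service do the routing is what lets the application avoid caching a shard map that could go stale after a SPLIT.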

Account > Table > Entity
Account: contoso
Table: customers (entities: Name =… = …; Name =… Add=)
Table: photos (entities: Photo ID =… Date =…; Photo ID =… Date =…)

Table Details
Tables: Create, Query, Delete. Tables can have metadata.
Entities:
- Insert
- Update (Merge: partial update; Replace: update entire entity)
- Upsert
- Delete
- Query
- Entity Group Transactions: multiple CUD operations in a single atomic transaction
Not an RDBMS!
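The difference between the two update modes is easy to show with plain dictionaries standing in for entities. This is a sketch of the semantics only; the function names and the sample entity are hypothetical, and no storage client library is involved.

```python
def merge_entity(stored, update):
    """Merge: keep existing properties and overwrite only those in the update."""
    merged = dict(stored)
    merged.update(update)
    return merged


def replace_entity(stored, update):
    """Replace: the update becomes the whole entity; unlisted properties are dropped."""
    return dict(update)


if __name__ == "__main__":
    stored = {"PartitionKey": "Bikes", "RowKey": "Whitewater", "ModelYear": 2009, "Color": "red"}
    update = {"PartitionKey": "Bikes", "RowKey": "Whitewater", "ModelYear": 2010}
    print(merge_entity(stored, update))
    print(replace_entity(stored, update))
```

With the sample values, Merge keeps the Color property while Replace discards it.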

FIRST   LAST    BIRTHDATE
Wade    Wegner  2/2/1981
Nathan  Totten  3/15/1965
Nick    Harris  May 1, 1976
FAV SPORT  Canoeing

FIRST   LAST    BIRTHDATE
Wade    Wegner  2/2/1981
Nathan  Totten  3/15/1965
Nick    Harris  May 1, 1976
Query: ?$filter=Last eq 'Wegner'
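What that filter selects can be sketched locally. The snippet below applies an equality test to the three entities from the slide, held as dictionaries; `filter_eq` is a hypothetical helper and not part of any storage API, and a real query would be evaluated by the service.

```python
customers = [
    {"First": "Wade", "Last": "Wegner", "Birthdate": "2/2/1981"},
    {"First": "Nathan", "Last": "Totten", "Birthdate": "3/15/1965"},
    {"First": "Nick", "Last": "Harris", "Birthdate": "May 1, 1976"},
]


def filter_eq(entities, prop, value):
    """Return entities whose property equals value.

    Entities that lack the property simply do not match, which mirrors
    the fact that entities in one table need not share a schema.
    """
    return [entity for entity in entities if entity.get(prop) == value]


if __name__ == "__main__":
    print(filter_eq(customers, "Last", "Wegner"))
```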

PARTITIONKEY (CATEGORY)  ROWKEY (TITLE)           TIMESTAMP  MODELYEAR
Bikes                    Super Duper Cycle        …          2009
Bikes                    Quick Cycle 200 Deluxe   …          2007
…                        …                        …          …
Canoes                   Whitewater               …          2009
Canoes                   Flatwater                …          2006
PARTITIONKEY (CATEGORY)  ROWKEY (TITLE)           TIMESTAMP  MODELYEAR
Rafts                    14ft Super Tourer        …          1999
…                        …                        …          …
Skis                     Fabrikam Back Trackers   …          2009
…                        …                        …          …
Tents                    Super Palace             …          2008
PARTITIONKEY (CATEGORY)  ROWKEY (TITLE)           TIMESTAMP  MODELYEAR
Bikes                    Super Duper Cycle        …          2009
Bikes                    Quick Cycle 200 Deluxe   …          2007
…                        …                        …          …
Canoes                   Whitewater               …          2009
Canoes                   Flatwater                …          2006
Rafts                    14ft Super Tourer        …          1999
…                        …                        …          …
Skis                     Fabrikam Back Trackers   …          2009
…                        …                        …          …
Tents                    Super Palace             …          2008
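The slide appears to show the same rows held once as a single range and once divided into two ranges by partition key. A rough sketch of that idea, assuming a single boundary at "Rafts" as in the slide: the helper name `assign_server` and the boundary list are hypothetical, and the actual service chooses range boundaries itself.

```python
from bisect import bisect_right

# Lowest partition key served by each server after the first (assumed from the slide).
BOUNDARIES = ["Rafts"]


def assign_server(partition_key, boundaries=BOUNDARIES):
    """Return the index of the server whose key range contains partition_key."""
    return bisect_right(boundaries, partition_key)


if __name__ == "__main__":
    for category in ["Bikes", "Canoes", "Rafts", "Skis", "Tents"]:
        print(category, "-> server", assign_server(category))
```

All entities with the same partition key always land on the same server, which is what makes entity group transactions within one partition possible.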

MANAGE ANY DATA, ANY SIZE, ANYWHERE
- SQL Server Database & Parallel Data Warehouse
- Hadoop on Windows
- Hadoop on Azure
- StreamInsight
- Hadoop Connectors & ETL

Global Physical Infrastructure: servers / network / datacenters
compute: virtual machines, web sites, cloud services
storage: SQL database, noSQL database, blob storage
networking: connect, virtual network, traffic manager
Layers: Frameworks, Services, Fabric, Infrastructure
Regions: N Central US, S Central US, N Europe, W Europe, E Asia, SE Asia + 24 Edge CDN Locations
Automated, Managed Resources, Elastic, Usage Based