Welcome to SQL Saturday Denmark

Slides:



Advertisements
Similar presentations
Brian Alderman | MCT, CEO / Founder of MicroTechPoint Pete Harris | Microsoft Senior Content Publisher.
Advertisements

Introduction to Big Data and Hadoop Name Title Microsoft Corporation.
Get more control & flexibility of the Windows Azure environment Developers IT Pros Easier migration of existing Windows applications to Windows Azure.
Breaking points of traditional approach What if you could handle big data?
Azure.
Microsoft Machine Learning & Data Science Summit
IT Operations Management
1/27/2018 5:13 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Data Platform and Analytics Foundational Training
Building ARM IaaS Application Environment
Data Platform and Analytics Foundational Training
Data Platform and Analytics Foundational Training
Data Platform and Analytics Foundational Training
Successfully migrate existing databases to Azure SQL Database
System Center Marketing
5/22/2018 1:39 AM BRK2156 Power BI Report Server: Self-service BI and enterprise reporting on-premises Christopher Finlan Senior Program Manager © Microsoft.
The story of an IoT solution
Creating Enterprise Grade BI Models with Azure Analysis Services
System Center Marketing
Microsoft Machine Learning & Data Science Summit
Microsoft /2/2018 3:42 PM BRK3129 Query Big Data using the Expanded T-SQL footprint with PolyBase in SQL Server 2016 Casey Karst Program Manager.
Use any Amazon S3 application with Azure Blob Storage
Melbourne Azure Meetup
Developing Hybrid Apps on Microsoft Azure Stack
Microsoft Build /22/ :52 PM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
7/22/2018 9:21 PM BRK3270 Building a Better Data Solution: Microsoft SQL Server and Azure Data Services Joey D’Antoni Principal Consultant Denny Cherry.
Building Analytics At Scale With USQL and C#
Data Platform and Analytics Foundational Training
Installation and database instance essentials
SQL Server Data Tools for Visual Studio Part I: Core SQL Server Tools
IT Operations Management
Design and Implement Cloud Data Platform Solutions
9/13/2018 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.
Exceedra + Azure Mark Rendle Principal Software Architect
Azure.
Cloudy with a Chance of Data
HDInsight makes Hadoop Easy
Office Power Hour New developer APIs and features for Apps for Office
9/21/2018 3:41 AM BRK3180 Architect your big data solutions with SQL Data Warehouse & Azure Analysis Services Josh Caplan & Matt Usher Program Managers.
Overview of Azure Data Lake Store
Azure PowerShell Aaron Roney Senior Program Manager Cormac McCarthy
Microsoft Ignite NZ October 2016 SKYCITY, Auckland.
Dive into Predictive Maintenance using Cortana Intelligence Suite
Microsoft Azure P wer Lunch
Server & Tools Business
Microsoft Connect /22/2018 9:50 PM
TechEd /23/ :44 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered.
Microsoft Connect /24/ :05 AM
Power-up NoSQL with Azure Cosmos DB
Microsoft Virtual Academy
Jim Nakashima Program Manager Cloud Tools
TechEd /15/2019 8:08 PM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
Building and running HPC apps in Windows Azure
Developing for Windows Azure
TechEd /11/ :25 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered.
HDInsight Tools for Visual Studio
Developing Windows Azure Applications with Visual Studio
5/8/2019 3:20 AM bQuery-Tool 3.0 A new and elegant way to create queries and ad-hoc reports on your Baan/Infor ERP LN data. This Baan session is a query.
5/25/2019 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.
Day 2, Session 2 Connecting System Center to the Public Cloud
Server & Tools Business
Server & Tools Business
Microsoft Virtual Academy
Introduction to Azure Data Lake
Server & Tools Business
Visual Data Flows – Azure Data Factory v2
Visual Data Flows – Azure Data Factory v2
Architecture of modern data warehouse
Presentation transcript:

Welcome to SQL Saturday Denmark Azure Data Lake

Thanks you our PLATINIUM sponsors

Thanks you our GOLD and SILVER sponsors

About me Kenneth M. Nielsen Worked with SQL Server since 1999 Data Solution Architect at Microsoft Kenneth.Nielsen@microsoft.com @doktorkermit Linkedin.com/in/KennethMNielsen www.funkylab.com

Agenda Azure Data Lake Store Azure Data Lake Analytics Azure Data Lake Analytics – Using Visual Studio Azure Data Lake Analytics – Using PowerShell Q & A

Data Lake Store

Azure Data Lake Store A hyper scale repository for big data analytics workloads No limits to SCALE Store ANY DATA in its native format HADOOP FILE SYSTEM (HDFS) for the cloud ENTERPRISE READY access control, encryption at rest Optimized for analytic workload PERFORMANCE Data lake store is your new friend for storing data, actually almost unlimited data, and the price, well it cost next to nothing to store data on Azure Any file-format is supported, data is stored in its native format, meaning that you can store, images, json tables, csv, tcv, blobs etc etc. It is build on HDFS, and here it is HDFS for the cloud.

Azure Data Lake Store Any Data Unstructured Semi-structured Structured

Azure Data Lake Store

Azure Data Lake Store HDFS for the cloud New filesystem build from the ground up, based on HADOOP file system Integrates with HDInsight, Hortonworks and Cloudera Supports Files and Folder objects and operations Support for rename, create and delete files and folders. Files system build from the scratch, based on HADOOP files system. Microsoft Azure Data Lake Store is a Hadoop file system that’s compatible with Hadoop Distributed File System (HDFS) and works with the Hadoop ecosystem. Data Lake Store is integrated with Azure Data Lake Analytics and Azure HDInsight and will be integrated with Microsoft offerings like Revolution-R Enterprise; industry-standard distributions like Hortonworks, Cloudera, and MapR; and individual Hadoop projects like Spark, Storm, Flume, Sqoop, and Kafka.

Azure Data Lake Store Unlimited storage Files sizes can be from Gigabytes to Petabytes No limits to scale Data Lake Store has no fixed limits on account size or file size. While other cloud storage offerings might restrict individual file sizes to a few terabytes, Data Lake Store can store very large files that are hundreds of times larger. At the same time, it provides very low latency read/write access and high throughput for scenarios like high-resolution video, scientific, medical, large backup data, event streams, web logs, and Internet of Things (IoT). Collect and store everything in Data Lake Store without restriction or prior understanding of business requirements.

Azure Data Lake Store Security Integrates with Azure Active Directory Audit logs for all operations* Server side Encryption* ACL on files and folders* Enterprise ready security when in GA Access Control List is only at root level at the moment, meaning that a user is granted access to a root folder, and will have access to everything in that root This will be changed when the service goes into GA.

Data Lake Analytics

Azure Data Lake Analytics A elastic analytics service built on Apache YARN that processes all data, at any size No limits to SCALE Includes U-SQL, a language that unifies the benefits of SQL with the expressive power of C# Optimized to work with ADL STORE FEDERATED QUERY across Azure data sources ENTERPRISE READY Role based access control & Auditing Pay PER JOB & Scale PER JOB

U-SQL A new language for Big Data Familiar syntax to millions of SQL & .NET developers Unifies declarative nature of SQL with the imperative power of C# Unifies structured, semi-structured and unstructured data Distributed query support over all data

Language Overview U-SQL Fundamentals All the familiar SQL clauses SELECT | FROM | WHERE GROUP BY | JOIN | OVER Operate on unstructured and structured data Relational metadata objects .NET integration and extensibility U-SQL expressions are full C# expressions Reuse .NET code in your own assemblies Use C# to define your own: Types | Functions | Joins | Aggregators | I/O (Extractors, Outputters)

U-SQL Capabilities Batch Interactive Streaming Machine Learning AVAILABLE NOW Interactive IN PROGRESS Streaming FUTURE Machine Learning FUTURE

U-SQL Distributed Query Azure Data Lake Store READ WRITE Azure Storage Blobs READ WRITE Azure SQL Database READ WRITE Azure SQL Data Warehouse READ WRITE Azure SQL DB in Azure VM READ WRITE

Read the input, write it directly to output (just a simple copy) Rowset @orders = EXTRACT OrderId int, Customer string, Date DateTime, Amount float FROM "/input/orders.txt" USING Extractors.Tsv(); OUTPUT @orders TO "/output/orders_copy.txt" USING Outputters.Tsv(); Apply Schema on read From a file in a Data Lake Easy delimited text handling Write out

Azure Data Lake Pattern Azure Services Data Engineer Data Science VM ADL Storage Visual Studio ADL ADL Storage ADL Analytics Tweets Upload Dataset Get Data From CSV Azure Storage Where CAQS Files are stored, but would load into ADLS directly if ingesting from scratch Data Scientist Data Analyst AML Experiment Power BI Desktop Azure Services

Execution with Requested Parallelism (reserve enough to do 1 vertex at a time) Requested Parallelism = 4 (reserve enough to do 4 vertices at a time)

AVG Vertex execution time Stage Details 252 Pieces of work AVG Vertex execution time 4.3 Billion rows Data Read & Written

ADLAUs Azure Data Lake Analytics Unit Parallelism N = N ADLAUs 1 ADLAU ~= A VM with 2 cores and 6 GB of memory

Visual Studio Data Lake Analytics

Azure Data Lake – Visual Studio Available project types U-SQL project, where you write your statements U-SQL sample project, really extensive project that you can work with on you own account, will give you a head start to getting up to speed on the topic U-SQL unit testing project,

Azure Data Lake – Visual Studio Fully integrates to Solution Explorer

Azure Data Lake – Visual Studio Monitor and manage jobs Browse and manage storage Browse U-SQL catalog Integrates seamlessly with server explorer

Creating U-SQL

Creating U-SQL IntelliSense Supported

Creating U-SQL Code behind enhance your code

Demonstration: Using Visual Studio

Installing Azure PowerShell SMSG Readiness 9/18/2018 Installing Azure PowerShell PowerShell Gallery Recommended approach PowerShell 5.0 supports PowerShell Gallery Windows 10 ships with PowerShell 5.0 Web Platform Installation (WebPI) © 2015 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Installing from the PowerShell Gallery Launch Windows PowerShell ISE as Administrator Install-Module AzureRM Install-AzureRM

Finding the ADL cmdlets Option 1 Get-Command -Module AzureRM.DataLakeStore Get-Command -Module AzureRM.DataLakeAnalytics Option 2 Get-Command *DataLake*

Logging in to Azure $subname = “BDHadoopTeamPMTestDemo” Launch Windows PowerShell ISE $subname = “BDHadoopTeamPMTestDemo” Login-AzureRmAccount –SubscriptionName $subname

ADLS: Listing files in a store $adls = “sqlkonferenz” Get-AzureRmDataLakeStoreChildItem -Account $adls -Path /

ADLS: Upload and download $adls = “sqlkonferenz” Import-AzureRmDataLakeStoreItem -Account $adls -Path d:\somefile.txt -Destination /somefile.txt Export-AzureRmDataLakeStoreItem -Account $adls -Path /somefile.txt -Destination d:\somefile_copy.txt

ADLA: List and submit jobs $adla = “sqlkonferenz” Get-AzureRmDataLakeAnalyticsJob -Account $adla Submit-AzureRmDataLakeAnalyticsJob -Account $adla -Script “…” # U-SQL text -Name myjob Submit-AzureRmDataLakeAnalyticsJob -Account $adla -ScriptPath D:\test.script -Name myjob

ADL Store (ADLS) feature set Account Management Create new account List accounts Update account properties Delete account Transferring Data Upload into store from local disk Download from store to local disk Files and Folders List contents of folder Create Move Delete Does file exist Security Get ACLs Update ACLs Get Owner Set Owner File Content Set file content Append file content Get file content Merge files

ADL Analytics (ADLA) feature set Account Management Create new account List accounts Update account properties Delete account Data Sources Add a data source List data sources Update data source Delete data source Compute List jobs Submit job Cancel job Catalog Items List items in U-SQL catalog Update item Catalog Secrets Create catalog secret List catalog secrets Delete catalog secrets

Demonstration: Using ADL PowerShell

Questions

Please review the event and sessions http://speakerscore.com/ZGVX http://speakerscore.com/XYZ 9/18/2018 | Footer Goes Here