Get data insights faster with Data Wrangling

Slides:



Advertisements
Similar presentations
Using Excel, Excel Service and PerformancePoint
Advertisements

Opening Keynote Presentation An Architecture for Intelligent Trading  Alessandro Petroni – Senior Principal Architect, Financial Services, TIBCO Software.
Rodney Holman Mandip Kaur Information Builders  Company Name: Information Builders  CEO and Founder: Gerald D. Cohen  Address: Two Penn Plaza, New.
Almost 4 decades of Advanced Analytics & DM expertise.
PO320: Reporting with the EPM Solution Keshav Puttaswamy Program Manager Lead Project Business Unit Microsoft Corporation.
Training Workshop Windows Azure Platform. Presentation Outline (hidden slide): Technical Level: 200 Intended Audience: Developers & Technical Decision.
1 Business Intelligence in the Information Age © 2006 Acxiom Corporation. All Rights Reserved. Carmen McKenna-McWilliams Marketing Technology Center of.
OBJECT ORIENTED SYSTEM ANALYSIS AND DESIGN. COURSE OUTLINE The world of the Information Systems Analyst Approaches to System Development The Analyst as.
April 10-12, Chicago, IL Driving Smarter Decisions with Microsoft Big Data Tim Mallalieu Group Program Manager, HDInsight.
Managing Knowledge in Business Intelligence Systems Dr. Jan Mrazek.
April 10-12, Chicago, IL Microsoft Data Explorer for Excel Faisal Mohamood, Lead PM, Microsoft.
Intelligent Performance Management Empowering Your Enterprise Duane E. Presti, CEO PARIS Technologies, Inc.
MICROSOFT CODENAME “DATA EXPLORER”. “Data Explorer” is a self-service experience in the cloud and on the desktop for discovering, transforming and publishing.
Microsoft Project Reporting with Reporting Services.
Datazen – an overview Frank Geisler Please Support Our Sponsors SQL Saturday is made possible with the generous support of these sponsors.
Yes, Data Management Can Be Agile! Michele Goetz, Principal Analyst.
Self-Service Data Integration with Power Query Stéphane Fréchette.
Andy Roberts Data Architect
#SQLSAT454 Using Power BI in Enterprise Andrea
(OBIA) Training & Placement Program By Keen IT To request free demo session please mail us at
1 Data Warehouse Assessments What, Why, and How Noah Subrin Technical Lead SRA International April 24, 2010.
MAKING BUSINESS INTELLIGENT Brought to you by your local PASS Community! Self Service ETL with Power Query Welcome.
Data Warehousing The Easy Way with AWS Redshift
Processing Temporal Telemetry Data -aka- Storing BigData in a Small Space =tg= Thomas H. Grohser, SQL Server MVP, Senior Director - Technical Solutions.
Microsoft Power Query 101 Belinda Allen Smith & Allen Consulting, Inc.
Dumps PDF Perform Data Engineering on Microsoft Azure HD Insight dumps.html Complete PDF File Download From.
Analytics Warehouse P.J. Kelly.
Cloud BI with Azure Analysis Services
Data Platform and Analytics Foundational Training
Deliver business insights with Microsoft Dynamics AX and Power BI
Viewing Data-Driven Success Through a Capability Lens
Cloud BI with Azure Analysis Services
QlikView Connector for Informatica Powercenter An Introduction
How to build a successful Data Lake
7/4/2018 © 2014 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks and/or trademarks.
Encryption in SQL Server
SQL Server Integration Services
Welcome! Power BI User Group (PUG)
Maximize the value of your cloud
9/21/2018 3:41 AM BRK3180 Architect your big data solutions with SQL Data Warehouse & Azure Analysis Services Josh Caplan & Matt Usher Program Managers.
Data Science that’s scale
Operationalize your data lake Accelerate business insight
What is a Data Scientist and How Do I Become One?
Introducing the SQL Server 2016 Query Store
Business Intelligence for Project Server/Online
Oscar AP by Massive Analytic: A Precognitive Analytics Platform for Effortless Data-Driven Decisions. Now Available in Azure Marketplace MICROSOFT AZURE.
Sandy Rivas | Program Manager
R'em All. Use R in Power BI to Deal with Data
Introduction to AWS Redshift
Stop Data Wrangling, Start Transforming Data to Intelligence
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
Advanced Dashboard Creation Using Microsoft SharePoint Server 2010
Cloud BI with Azure Analysis Services
Azure SQL DWH: Tips and Tricks for developers
Delivering an End-to-End Business Intelligence Solution
SQL Server Performance Tuning Nowadays
Why Innovate with Lagom & SAP?
Azure Data Factory v2: What’s new?
Azure SQL DWH: Tips and Tricks for developers
SQL Database on IoT devices could you? should you? would you?
Technical Resources & Training
Introducing Power BI dataflows
Matthew Roche Senior Program Manager STOCKHOLM
Power BI dataflows Beyond the basics
Data Wrangling as the key to success with Data Lake
Data Wrangling for ETL enthusiasts
Microsoft Azure Data Catalog
SQL Like Languages in Azure IoT
Architecture of modern data warehouse
Creating a Seamless Sales to manufacturing customer Experience
Presentation transcript:

Get data insights faster with Data Wrangling Sergiy Lunyakin

SQLSat Kyiv Team Denis Reznik Eugene Polonichko Oksana Tkach Yevhen Nedashkivskyi Mykola Pobyivovk Denis Reznik Eugene Polonichko Oksana Tkach Oksana Borysenko

Sponsor Sessions Starts at 13:00 Don’t miss them, they might be providing some interesting and valuable information! Congress Hall DevArt Conference Hall Simplement Room AC DB Best Predslava1 Intapp NULL means no session in that room at that time 

Sponsors

Session will begin very soon :) Please complete the evaluation form from your pocket after the session. Your feedback will help us to improve future conferences and speakers will appreciate your feedback! Enjoy the conference!

Center of Excellence – Intelligent Enterprise About me SERGIY LUNYAKIN Big Data Architect Center of Excellence – Intelligent Enterprise MS Data Platform MVP MCSE Data Analytics MCSA Cloud Platform

Agenda What is Data Wrangling Place of Data Wrangling Data Wrangling Drivers ETL or Data Wrangling Trifacta Demo

What is Data Wrangling? Data Wrangling is the process of cleaning, structuring and enriching raw data into a desired output for analysis Data Wrangling Question Analyze Insight Discover Refine Publish Q&A 80 % 20 %

Place of Data Wrangling

Data Wrangling Drivers 81% Shorten time to business insight 76% Increase data-driven decision making 53% Improve reaction time to business conditions 49% Operational efficiency for frontline works 43% Gain a single, complete view of relevant data * According to a TDWI’s Best Practices Report on “Improving Data Preparation for Business Analytics”

ETL or Data Wrangling Traditional (ETL) Data Wrangling Done by IT Done by data analysts, data scientists, power users Enterprise reporting Exploratory projects, Data Discovery, Prototyping Long-term projects Quick wins Data Standards Little documentation and governance Metadata & Governance Detailing ETL Requirements, Precursor to ETL build

Choosing a Data Wrangling Tool Forrester Wave™: Data Preparation Tools, Q1 2017

Situating in Data Lake

Common Data Wrangling Use Cases with Trifacta Self-Service data prep. automation Preparation for IT Operationalization Exploratory Analytics

Integration with Hadoop

Integration in Google Cloud Ecosystem Trifacta Interface & Photon Engine Integrated within Google Cloud Ecosystem Access & publish data from/to Google Cloud Storage & BigQuery Compile recipes to Google Cloud Dataflow for fully-managed auto-scaling execution

Trifacta Architecture on AWS

Trifacta Architecture on Microsoft Azure

Execution engines

Technical Approaches to Anyscale Interactivity

Sampling strategy

Trifacta Products

Demo scenario Product Location Date/Time Price Quantity Input Data – Transactions from sales system, customers, zip codes: Product Location Date/Time Price Quantity Goal of the analysis Combine transaction data from multiple year files Join the data with reference datasets Perform a lookup to fill in missing state values Filter data by date Aggregate prices by product and zip code

Demo

Trifacta benefits Empower the people who know the data best Accelerate time to value Lower business risk with more accurate data Unlock innovation using a wider variety of data

Useful Links Trifacta resources: Product Documentation Product editions spec. Resource library Online training Product on Azure Marketplace Product on AWS Marketplace

Q&A

Sponsors