DataOps: DevOps for Analytics

Presentation transcript:

DataOps: DevOps for Analytics

Hi, I’m Steph! An MVP & data science consultant.

DevOps (diagram: Dev, Test, Ops)
What is DevOps? DevOps is bridging the gap between different teams, culturally and technologically, to reduce the risk and cost of software development in an organisation.
Culturally, it means things like ownership of changes, “left-shifting” responsibility for testing, and lower bureaucracy.
Technologically, it means things like unit testing, immutable architecture, continuous integration, continuous deployment, and real-time application monitoring and alerting.

DBs (diagram: DBs, Test, Ops)
What’s DLM? Database Lifecycle Management is the work Microsoft & Redgate are doing to facilitate unit testing, CI, CD, and pipeline management for databases.
Is it DevOps? It depends! Yes, people need to do some of this database work for applications, so DLM needs to be usable in a DevOps shop, but they should also be doing it for data warehouses and other data repositories. It’s vital, but given that it’s not just about development, it should be considered separately.
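As a rough illustration of what unit testing a database in a CI run can look like, here is a minimal sketch. It uses Python's built-in sqlite3 module as a stand-in for a real database project; the schema, table names, and check functions are all hypothetical and are not taken from the Microsoft or Redgate tooling mentioned on the slide.

```python
import sqlite3

# Hypothetical schema script, standing in for a version-controlled database project.
SCHEMA = """
CREATE TABLE customer (
    customer_id INTEGER PRIMARY KEY,
    email       TEXT NOT NULL UNIQUE
);
CREATE VIEW customer_count AS SELECT COUNT(*) AS n FROM customer;
"""


def build_database() -> sqlite3.Connection:
    """Build a throwaway in-memory database from the scripted schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def check_email_must_be_unique() -> bool:
    """The UNIQUE constraint should reject a duplicate email."""
    conn = build_database()
    conn.execute("INSERT INTO customer (email) VALUES ('a@example.com')")
    try:
        conn.execute("INSERT INTO customer (email) VALUES ('a@example.com')")
    except sqlite3.IntegrityError:
        return True
    finally:
        conn.close()
    return False


def check_customer_count_view() -> bool:
    """The view should report the number of rows in the table."""
    conn = build_database()
    conn.executemany(
        "INSERT INTO customer (email) VALUES (?)",
        [("a@example.com",), ("b@example.com",)],
    )
    (n,) = conn.execute("SELECT n FROM customer_count").fetchone()
    conn.close()
    return n == 2
```

Each check rebuilds the database from script, so a CI server could run them on every commit without depending on a shared environment.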

BI (diagram: BI, Test, Ops)
What do people (sort of) use to make decisions? Facts / data / information. These are retrieved from source systems, integrated, transformed, and finally reported on. There are so many steps involved and so many points of failure.
What to do about it? Test, test, test!
But how? Unit test frameworks, metadata-driven approaches, and scripted builds. These enable you to be more RIGHT. Incorporate them, do so in an automated fashion, and you’re bringing Ops and Testing closer. This is DataOps.
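One possible reading of "metadata-driven" testing is sketched here: the checks are described as data, and a small runner applies them to the output of a transformation step. The column names, rule names, and the `run_checks` helper are illustrative assumptions, not part of any framework named in the talk.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical metadata: each entry names a column and the rule it must satisfy.
CHECKS: List[Dict[str, str]] = [
    {"column": "order_id", "rule": "not_null"},
    {"column": "order_id", "rule": "unique"},
    {"column": "amount", "rule": "non_negative"},
]


def not_null(values: list) -> bool:
    return all(v is not None for v in values)


def unique(values: list) -> bool:
    return len(values) == len(set(values))


def non_negative(values: list) -> bool:
    return all(v is not None and v >= 0 for v in values)


# Rule names used in the metadata map to the functions that implement them.
RULES: Dict[str, Callable[[list], bool]] = {
    "not_null": not_null,
    "unique": unique,
    "non_negative": non_negative,
}


def run_checks(rows: List[dict], checks: Optional[List[Dict[str, str]]] = None) -> List[str]:
    """Apply every metadata-defined check and return a description of each failure."""
    failures = []
    for check in (CHECKS if checks is None else checks):
        values = [row.get(check["column"]) for row in rows]
        if not RULES[check["rule"]](values):
            failures.append(f"{check['column']}: {check['rule']}")
    return failures


def reconcile_row_counts(source_rows: List[dict], target_rows: List[dict]) -> bool:
    """A basic integration check: nothing was lost or duplicated in the load."""
    return len(source_rows) == len(target_rows)
```

Adding a new test then means adding a line of metadata rather than writing new code, which is what makes this style easy to automate in a scripted build.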

Analytics (diagram: Analytics, Test, Ops)
Analytics, AKA data science, statistics, data mining, machine learning.
The problem of time: we saw in BI that we need to be RIGHT, but there we’re only ever aiming for RIGHT at the point of presentation. Analytics is aiming to be RIGHT over people and time. It’s about making predictions that are as accurate tomorrow as they are today. That’s a really hard thing, since when you start changing things you can’t tell what would have happened if you hadn’t, unless you experiment. You need to be able to account for changing behaviour over time, changes to data capture, and so on. This means your modelling has to be updateable and refinements easy to deploy, while remaining robust.
How to solve? Coded analysis, regular retraining, and a validation and deployment pipeline are all needed to facilitate ongoing analytics. Analytics has to be automated to be able to cope.
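The retrain, validate, deploy loop described above can be sketched in a few lines. This is a minimal illustration under simplifying assumptions: the "model" is a one-variable least-squares line, the metric is mean absolute error on a holdout set, and the function names (`fit_line`, `retrain_and_gate`) are hypothetical.

```python
from statistics import mean
from typing import Sequence, Tuple

Model = Tuple[float, float]  # (intercept, slope)
Data = Tuple[Sequence[float], Sequence[float]]  # (xs, ys)


def fit_line(xs: Sequence[float], ys: Sequence[float]) -> Model:
    """Ordinary least squares for y = intercept + slope * x (xs must not all be equal)."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return (y_bar - slope * x_bar, slope)


def mean_abs_error(model: Model, xs: Sequence[float], ys: Sequence[float]) -> float:
    """Average absolute difference between predictions and observed values."""
    intercept, slope = model
    return mean(abs(y - (intercept + slope * x)) for x, y in zip(xs, ys))


def retrain_and_gate(current: Model, train: Data, holdout: Data,
                     tolerance: float = 0.0) -> Tuple[Model, str]:
    """Retrain on fresh data; promote the candidate only if holdout error is no worse."""
    candidate = fit_line(*train)
    current_err = mean_abs_error(current, *holdout)
    candidate_err = mean_abs_error(candidate, *holdout)
    if candidate_err <= current_err + tolerance:
        return candidate, "promoted"
    return current, "kept"
```

Run on a schedule, a gate like this lets the model follow changing behaviour while stopping a worse candidate from reaching production.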

DataOps (diagram: Data, Test, Ops)
What do DBs, BI, and Analytics have in common? They’re all about the data.
DataOps is all about bringing rigour and speed of delivery to how we store, process, use, and present data. It’s a vital movement that will enable BI departments to keep pace with the “modern world” and help them scale to meet the increasing demand for data.
So, yeah, #DataOps is totally a thing!

DataOps is moving data roles and ops closer

Current (diagram: People, Tools, Process)
Items shown: Quantified value; Planning; Continuous improvement; Code review; Coping with change; Productivity tools; Satisfaction / fitness metrics; Continuous integration; Teach simplicity; Continuous learning; Automation; Quick to build; Coherent communication; Face to face / virtual

Ideal (diagram: People, Tools, Process)
Items shown: Quantified value; Planning; Continuous improvement; Code review; Coping with change; Continuous learning; Productivity tools; Satisfaction / fitness metrics; Face to face / virtual; Continuous integration; Teach simplicity; Automation; Quick to build; Coherent communication

“To improve is to change; to be perfect is to change often.” (Winston Churchill)

Current (workflow diagram, credit @hadleywickham): Get & Tidy, Transform, Viz, Model, Transform

Ideal (workflow diagram, credit @hadleywickham): Viz, Model, Transform, Get & Tidy, Transform

Process (diagram): Value / idea, Prototype, Dev, Test, Deliver, Prioritise, Release, Ticket, Prioritise, Dev, Test, Release

“It’s never about how you start – it’s always about how you finish.” (The Rock)

Good reading: The Phoenix Project; The Art of Agile; Agile Data Warehouse Design; DataOps.info; DataOps Manifesto

Collaboration: Azure Visual Studio Online; Trello; GitHub; Whiteboard & post-its; Slack
http://itsalocke.com/index.php/database-bi-related-unit-testing-options/

Data
Columns: Cost, Tooling, Learning
SSDT Free Medium-High Medium Redgate DLM Anchor Modeling Low High MSBuild Medium-high CosmosDB
http://itsalocke.com/index.php/database-bi-related-unit-testing-options/
Github.com/stephlocke/MeDriAnchor
http://sqlbits.com/Sessions/Event14/Metadata_Driven_Automation_A_Primer

ETL
Columns: Cost, Tooling, Learning
BIML Free or High High SQL Free Medium Powershell Azure Functions Low SSIS Free or Medium
http://itsalocke.com/index.php/database-bi-related-unit-testing-options/
Github.com/stephlocke/MeDriAnchor
http://sqlbits.com/Sessions/Event14/Metadata_Driven_Automation_A_Primer

Reporting
Columns: Cost, Tooling, Learning
Excel Free/Low Low PowerBI Medium Medium SSRS Free-Medium Fairly low R Free High Medium-high Other ?
http://itsalocke.com/index.php/database-bi-related-unit-testing-options/
http://sqlbits.com/Sessions/Event14/Delivering_Agile_Analytics_with_Azure_Machine_Learning

Cubes
Columns: Cost, Tooling, Learning
Tabular Free or Medium Medium Original Free or High Low High
http://itsalocke.com/index.php/database-bi-related-unit-testing-options/
Github.com/stephlocke/MeDriAnchor
http://sqlbits.com/Sessions/Event14/Metadata_Driven_Automation_A_Primer

Data Science
Columns: Cost, Tooling, Learning
R Free High Python Docker Free/Medium Medium Microsoft ML Free / High Azure ML Free / Low Low H2O Spark

Follow up! @stefflocke @lockedata steph@itsalocke.com