@Djeepy1 Fasten your seatbelt and listen to the (Data) Steward Jean-Pierre Riehl Florian Eiden
Who are we?
Jean-Pierre Riehl: Practice Manager Data & BI, AZEO; MVP SQL Server; President at GUSS
Florian Eiden: Managing Consultant, Data & Analytics, Cellenza; MVP SQL Server; Board Member at GUSS
GUSS: PASS France chapter. Webcasts, Conferences, Afterworks.Pro
Next event: SQLSaturday Paris 2014, September 13th, Tour Montparnasse, Paris, English-speaking track
THE CONTEXT
Self-Service BI
– Corporate BI: Managed Data Warehouse, Company-wide
– Team BI: Shared Models, Department-wide
– Self-Service BI: Quick & Easy, Personal data, Document-centric
Empower Users: Release Data, Release usages
Governance, noun: “Leading the conduct of things or persons”
THE ISSUES
Issue #1: Writer’s block, also known as White Worksheet Syndrome
Issue #2: Too much Data! “I want the Employee’s List”
– Duplicates
– Wrong sources
– Bad Data
– Poor or Bad description
– Etc.
Issue #3: Compliance
– Security
– Encryption
– Anonymization
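To make the anonymization item of Issue #3 concrete, here is a minimal Python sketch of keyed pseudonymization with HMAC-SHA256 from the standard library. It is only an illustration, not something shown in the talk: the key, the field names and the sample rows are all hypothetical, and pseudonymization is a weaker guarantee than full anonymization.

```python
import hashlib
import hmac

# Hypothetical secret for illustration; a real key would live in a vault.
SECRET_KEY = b"replace-with-a-managed-secret"


def pseudonymize(value: str, key: bytes = SECRET_KEY) -> str:
    """Return a stable token for a sensitive value (same input, same token)."""
    normalized = value.strip().lower().encode("utf-8")
    return hmac.new(key, normalized, hashlib.sha256).hexdigest()


def pseudonymize_rows(rows, sensitive_fields):
    """Return copies of the rows with the sensitive fields replaced by tokens."""
    return [
        {k: pseudonymize(v) if k in sensitive_fields else v for k, v in row.items()}
        for row in rows
    ]


# Made-up sample rows, for illustration only.
employees = [
    {"name": "Alice Martin", "department": "Finance"},
    {"name": "alice martin ", "department": "HR"},
]
masked = pseudonymize_rows(employees, {"name"})
```

Because the token is stable, masked datasets can still be joined on the pseudonymized column without exposing the original value.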
How does it scale?
POWER BI: The Microsoft Way
Features and tools
– Discover: Search, access, and transform public and internal data sources with Power Query
– Share: Share datasets and workbooks refreshable from on-premises and cloud-based data sources, with Power BI Sites
– Analyze: Easy data modeling and lightning-fast in-memory analytics with Power Pivot
– Visualize: Bold new interactive data visualizations with Power View and Power Map
– Question (Q&A): Ask questions and get immediate answers with natural language query
– Mobility: Mobile access through HTML5 and touch-optimized apps
Scalable | Manageable | Trusted
Power BI - Big Picture (diagram labels): Power BI O365 Tenant Power BI Admin Center SQL Data Catalog External Data Q&A Cloud On-Prem Oracle … Excel Power BI Sites Power Query Power Pivot Power View Power Map Cloud Power Query Data Refresh Index Search Data Management Gateway
Ideas of costs Q&A All Inclusive (i.e. including Office 2013 ProPlus Licences)
Power BI * vs. On-Prem
There was some SSBI before Power BI
– Data Sources: Gateways & Data Sources; Shared Data Sources (SSRS); Office Data Connection
– Datasets: Queries; Shared Datasets (SSRS); Power Pivot for SharePoint
– Models: Power Pivot; Data Management Gateway; Power Pivot for SharePoint
– Dashboards: Power View; Power View (BISM); SSRS over Power Pivot
* Many more features to come
demo
Tools are only a part of the solution
Good formula: People + Processes + Tools
– “Data governance is between 80 and 95% communication” - Dec 2006 Data Governance Conference
We have the tools, let’s talk about the rest…
If all you have is a hammer…
THE DATA STEWARD
A Steward?
Wikipedia: Stewardship is an ethic that embodies the responsible planning and management of resources.
Data Steward of Gondor… Not our idea, see Matthew Roche for complaints
My pretty typical organization
– Business Divisions: Piercing items, Slashing items, Bludgeoning items, Armor & Shields
– Functional Units: Finance, HR, Legal
– IT: Information Technology
Where are stewards needed in the org?
– Business Divisions: Piercing items, Slashing items, Bludgeoning items, Armor & Shields
– Functional Units: Finance, HR, Legal
– IT: Information Technology
My organization: actual perception
– Business Divisions: Piercing items, Slashing items, Bludgeoning items, Armor & Shields
– Functional Units: Finance, HR, Legal
– IT
Well, let’s be honest about what it looks like
– Business Divisions: Piercing items, Slashing items, Bludgeoning items, Armor & Shields
– Functional Units: Finance, HR, Legal
– IT
For maximum results: local initiatives
– Business Divisions: Piercing items, Slashing items, Bludgeoning items, Armor & Shields
– Functional Units: Finance, HR, Legal
– IT
A warning: difficult places to start
– Business Divisions: Piercing items, Slashing items, Bludgeoning items, Armor & Shields
– Functional Units: Finance, HR, Legal
– IT
Let’s get back to our steward
Why:
– Specific to your company, to be defined in your master plan
How:
– “Responsible planning and management of resources”
What:
– Elect data stewards that will enable, teach, police
Slashing items
Required skills
Skills
– Interpersonal skills
– Good personal organization
– Data-awareness: Data lifecycle specific to the company; General understanding of BI/data technologies; Data merging, cleaning, metadata maintenance
– Training in tools used in the company
A chosen career path
– It’s an actual job, usually part time
– But not just an additional task in the schedule!
The Journey of a Data Steward
The Journey of a Data Steward
Help to find data
– Manage the Data Lake
– Create Data Sources
– Facilitate exploration
– Manage metadata
The Journey of a Data Steward
Manage new data
– Find new Data Sources
– Find new Datasets
Verify new datasets
– Check for Accuracy
– Check for duplicates
– Fix sources and queries
Use of Workflows
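The "Verify new datasets" checks above can be partly automated. Here is a minimal sketch in plain Python, assuming a dataset is a list of dictionaries; the function names, field names and sample rows are hypothetical and only illustrate the duplicate and completeness checks, not any tool from the talk.

```python
from collections import Counter


def _text(value):
    """Normalize a cell to trimmed, lower-case text; None becomes ''."""
    return "" if value is None else str(value).strip().lower()


def find_duplicates(records, key_fields):
    """Return {key: count} for normalized keys that appear more than once."""
    counts = Counter(tuple(_text(r.get(f)) for f in key_fields) for r in records)
    return {key: n for key, n in counts.items() if n > 1}


def rows_missing_values(records, required_fields):
    """Return the indexes of records where a required field is empty."""
    return [
        i for i, r in enumerate(records)
        if any(_text(r.get(f)) == "" for f in required_fields)
    ]


# Made-up sample rows, for illustration only.
employees = [
    {"employee_id": "E001", "email": "a.martin@example.com"},
    {"employee_id": "e001 ", "email": "a.martin@example.com"},
    {"employee_id": "E002", "email": ""},
    {"employee_id": "E003", "email": None},
]

duplicates = find_duplicates(employees, ["employee_id"])
incomplete = rows_missing_values(employees, ["employee_id", "email"])
```

Normalizing before comparing is a deliberate choice here: "E001" and "e001 " are treated as the same employee, which is usually what a steward wants when hunting duplicates.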
Data Workflows (diagram labels): Create, Derive, Approve, Data Hub (Models, OData, Reports, DWH, MDM, etc.), Publish, Sandbox, Enhance, Discovery; roles: Data Steward, Analyst, Developer
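One way to read the workflow slide is as a small state machine in which only the data steward can approve and publish. The sketch below is an assumption-laden illustration in Python: the stage names, the allowed transitions and the role names are hypothetical and are not taken from the slide's diagram.

```python
from dataclasses import dataclass, field
from enum import Enum


class Stage(Enum):
    SANDBOX = "sandbox"
    SUBMITTED = "submitted"
    APPROVED = "approved"
    PUBLISHED = "published"


# Assumed transitions and the roles allowed to trigger each one.
TRANSITIONS = {
    (Stage.SANDBOX, Stage.SUBMITTED): {"analyst", "developer", "data_steward"},
    (Stage.SUBMITTED, Stage.SANDBOX): {"data_steward"},   # sent back for rework
    (Stage.SUBMITTED, Stage.APPROVED): {"data_steward"},
    (Stage.APPROVED, Stage.PUBLISHED): {"data_steward"},
}


@dataclass
class Dataset:
    name: str
    stage: Stage = Stage.SANDBOX
    history: list = field(default_factory=list)

    def move_to(self, target: Stage, role: str) -> None:
        """Move the dataset to a new stage if the step and the role are allowed."""
        allowed = TRANSITIONS.get((self.stage, target))
        if allowed is None:
            raise ValueError(f"{self.stage.value} -> {target.value} is not a valid step")
        if role not in allowed:
            raise PermissionError(f"role {role!r} may not move a dataset to {target.value}")
        self.history.append((self.stage, target, role))
        self.stage = target


dataset = Dataset("employee_list")
dataset.move_to(Stage.SUBMITTED, "analyst")
dataset.move_to(Stage.APPROVED, "data_steward")
dataset.move_to(Stage.PUBLISHED, "data_steward")
```

Keeping the transitions in a single table makes the policy easy to review with the business, which matters more here than the code itself.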
The Journey of a Data Steward
Certify
– Ensure compliance with Corporate Policies
Train & Teach
– Help for modeling
– Help for analysis
demo
Importance of relations: Information Management Platform, IT, Developers, Data Steward, Business Users, Tools
And reality is more complex: Information Management Platform, Sales, IT, Mktg, Production
DATA(source) LIFECYCLE MANAGEMENT
Data Lifecycle Management
Data Source Lifecycle Management
Data Steward:
– Manage, enable
– Teach, assist
– Analyse, merge
Data Source Lifecycle Management
Data asset administration
– Create, Delete
– Update, Maintain
– Give/revoke access
– Refresh schedule
– Monitor
– …
Business metadata understanding
Data manipulation
Applied to: data artifacts
– Data sets: Files (CSV, XLSX)
– Metadata: Data Models, Documentation
– Data sources
– Queries
– …
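The administration tasks listed above (give/revoke access, refresh schedule, monitor) can be pictured as operations on a small catalog entry. This is a hedged Python sketch: the DataAsset class, its fields and the sample values are hypothetical and do not correspond to the Power BI Admin Center or any other product API.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta


@dataclass
class DataAsset:
    name: str
    owner: str
    refresh_every: timedelta      # expected refresh interval
    last_refresh: datetime        # when the data was last refreshed
    readers: set = field(default_factory=set)

    def grant(self, user: str) -> None:
        """Give read access to a user."""
        self.readers.add(user)

    def revoke(self, user: str) -> None:
        """Remove read access; a no-op if the user had none."""
        self.readers.discard(user)

    def can_read(self, user: str) -> bool:
        """The owner can always read; others need an explicit grant."""
        return user == self.owner or user in self.readers

    def is_stale(self, now: datetime) -> bool:
        """Monitoring check: True when the refresh is overdue."""
        return now - self.last_refresh > self.refresh_every


# Made-up values, for illustration only.
asset = DataAsset(
    name="employee_list.csv",
    owner="steward_hr",
    refresh_every=timedelta(days=1),
    last_refresh=datetime(2014, 9, 12, 6, 0),
)
asset.grant("analyst_finance")
stale = asset.is_stale(now=datetime(2014, 9, 13, 9, 0))
```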
CONCLUSION
A matter of governance
Tools are nothing without people and processes
Governance is different in every company
– Decided and sponsored by the executives, embedded in a global strategy
– Adapted to your organization
– The Data Steward as the local implementation of it
How to start tomorrow?
1. Build an Information Management Platform
2. Identify your processes & Org Chart
3. Write the Data Steward “Job Profile”
4. Identify the right people for the job
5. Leverage Self-Service BI
Ask your local experts
All this for what?
A Data Culture
– See Satya Nadella’s April 15th, 2014 presentation in SF
To at last step up in the knowledge pyramid!
– Machine learning \o/
Any questions? Thank You!