Pentaho 7.1.

Slides:



Advertisements
Similar presentations
Thanks to Microsoft Azure’s Scalability, BA Minds Delivers a Cost-Effective CRM Solution to Small and Medium-Sized Enterprises in Latin America MICROSOFT.
Advertisements

System Center 2012 R2 Overview
Securely Synchronize and Share Enterprise Files across Desktops, Web, and Mobile with EasiShare on the Powerful Microsoft Azure Cloud Platform MICROSOFT.
Powered by Microsoft Azure, PointMatter Is a Flexible Solution to Move and Share Data between Business Groups and IT MICROSOFT AZURE ISV PROFILE: LOGICMATTER.
Gaining Unprecedented Visibility into Microsoft Dynamics CRM with Halo’s Pipeline Advisor, Powered by the Microsoft Azure Cloud Platform MICROSOFT AZURE.
Data-Centric Security and User Access Controls for Hadoop on Microsoft Azure MICROSOFT AZURE APP BUILDER PROFILE: BLUETALON BlueTalon provides data-centric.
Please note that the session topic has changed
Self-Service Data Integration with Power Query Stéphane Fréchette.
SQL Server 2016 Integration Services (SSIS)
The Derivitec Risk Portal Provides Powerful, Cost-Effective Risk Management Solutions, Powered by Azure, that Deploy in Minutes MICROSOFT AZURE ISV PROFILE:
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Data-centric security of Blutalon
READ ME FIRST Use this template to create your Partner datasheet for Azure Stack Foundation. The intent is that this document can be saved to PDF and provided.
BUILD BIG DATA ENTERPRISE SOLUTIONS FASTER ON AZURE HDINSIGHT
AuraPortal Cloud Helps Empower Organizations to Organize and Control Their Business Processes via Applications on the Microsoft Azure Cloud Platform MICROSOFT.
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
MICROSOFT AZURE ISV PROFILE: BMC SOFTWARE
Organizations Are Embracing New Opportunities
COMPANY PROFILE: CORENT TECHNOLOGY INC.
Data Platform and Analytics Foundational Training

What is it ? …all via a single, proven Platform-as-a-Service.
ServiceNow Business Offerings
Meemim's Microsoft Azure-Hosted Knowledge Management Platform Simplifies the Sharing of Information with Colleagues, Clients or the Public MICROSOFT AZURE.
DocFusion 365 Intelligent Template Designer and Document Generation Engine on Azure Enables Your Team to Increase Productivity MICROSOFT AZURE APP BUILDER.
Built on Microsoft Azure, 11Ants Retail Analytics Customer Science Solution Delivers Real Growth Opportunities to Retailers with Loyalty Programs MICROSOFT.
Barracuda Networks Creates Next-Generation Security Solutions That Enable Customers to Accelerate Their Adoption of Microsoft Azure MICROSOFT AZURE APP.
Ralleo Enterprise-Grade Solution for Managing Change and Business Transformation Provides Opportunities to Better Analyze Real-Time Data MICROSOFT AZURE.
Vidcoding Introduces Scalable Video and TV Encoding in the Cloud at an Affordable Price by Utilizing the Processing Power of Azure Batch MICROSOFT AZURE.
Docker Birthday #3.
Zhangxi Lin, The Rawls College,
Cherwell Service Management is an IT Service Management Solution that Makes it Easier for Users to Capitalize on Power of Microsoft Azure MICROSOFT AZURE.
Wonderware Online Cost-Effective SaaS Solution Powered by the Microsoft Azure Cloud Platform Delivers Industrial Insights to Users and OEMs MICROSOFT AZURE.
Extensible Platform Microsoft Dynamics 365
Stylelabs Develops the Marketing Content Hub to Offer Enterprises a High-End Marketing Content Management Platform Based on Microsoft Azure MICROSOFT AZURE.
Hosted on Azure, LoginRadius’ Customer Identity
Speaker’s Name, SAP Month 00, 2017
With Help from the Microsoft Azure Cloud,
NGAGE Intelligence Leverages Microsoft Azure Platform to Provide Essential Analytics for Hybrid SharePoint Server/Office 365 Environments MICROSOFT AZURE.
Enterprise security for big data solutions on Azure HDInsight
MyHealthDirect’s Enterprise Scheduling Platform, Based on Microsoft Azure, Improves the Patient Experience and Reduces Patient Readmissions MICROSOFT AZURE.
Operationalize your data lake Accelerate business insight
Take Control of Insurance Product Management: Build, Test, and Launch Any Product Globally 10x Faster, 10x More Cheaply with INSTANDA on Azure Partner.
Replace with Application Image
Welcome! Power BI User Group (PUG)
Oscar AP by Massive Analytic: A Precognitive Analytics Platform for Effortless Data-Driven Decisions. Now Available in Azure Marketplace MICROSOFT AZURE.
Designed for Big Data Visual Analytics, Zoomdata Allows Business Users to Quickly Connect, Stream, and Visualize Data in the Microsoft Azure Platform MICROSOFT.
Yellowfin: An Azure-Compatible Business Intelligence Platform That Connects People with Their Data for Better Decision Making MICROSOFT AZURE APP BUILDER.
Be Better: Achieve Customer Service Excellence and Create a Lean RMA and Returns Process with Renewity RMA and the Power of Microsoft Azure MICROSOFT AZURE.
On-Premises, or Deployed in a Hybrid Environment
Auth0 Is Identity Made Simple for Developers, Built by Developers and Supported by the High Availability and Performance of Microsoft Azure MICROSOFT AZURE.
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
Accelerate Your Self-Service Data Analytics
Welcome! Power BI User Group (PUG)
MyCloudIT Enables Partners to Drive Their Cloud Profitability Using CSP-Enabled Desktop Hosting Automation with Microsoft Azure and Office 365 MICROSOFT.
Crypteron is a Developer-Friendly Data Breach Solution that Allows Organizations to Secure Applications on Microsoft Azure in Just Minutes MICROSOFT AZURE.
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
Data science and machine learning at scale, powered by Jupyter
One-Stop Shop Manages All Technical Vendor Data and Documentation and is Globally Deployed Using Microsoft Azure to Support Asset Owners/Operators MICROSOFT.
Media365 Portal by Ctrl365 is Powered by Azure and Enables Easy and Seamless Dissemination of Video for Enhanced B2C and B2B Communication MICROSOFT AZURE.
XtremeData on the Microsoft Azure Cloud Platform:
Technical Capabilities
Single Cell’s Progenitor Powered by Microsoft Azure Improves Organisational Efficiency with Strategic Procurement, Contract Management, and Analytics MICROSOFT.
Last.Backend is a Continuous Delivery Platform for Developers and Dev Teams, Allowing Them to Manage and Deploy Applications Easier and Faster MICROSOFT.
School Districts Can Analyze and Report on Data Across Multiple Systems with EdWire, a Powerful Integration Solution that Utilizes Microsoft Azure MICROSOFT.
Remedy Integration Strategy Leverage the power of the industry’s leading service management solution via open APIs February 2018.
Customer 360.
Architecture of modern data warehouse
Dynamics 365 Customer Insights
Presentation transcript:

Pentaho 7.1

Data Discovery / Analysis Current State Today Data Engineering Data Prep Analytics Ingestion Processing Blending Data Delivery Data Discovery / Analysis Analysis & Dashboards Administration Security Lifecycle Management Data Provenance Dynamic Data Pipeline Monitoring Automation

Future Vision: A Single Flow Data Engineering Data Prep Analytics Ingestion Processing Blending Data Delivery Data Discovery / Analysis Analysis & Dashboards Administration Security Lifecycle Management Data Provenance Dynamic Data Pipeline Monitoring Automation Looking to build out a single platform where data engineers, data analysts, business analysts and data scientists can enter anywhere and make data useful for the business

Industry Challenges: An Evolving Big Data Landscape Rapidly evolving Big Data technologies and landscape Disjointed Tools Growth in volumes and varieties of data Pentaho 7.1 Those working in data are working in a turbulent, rapidly changing data world. They are facing the challenges of: Growth of volumes and variety of data Rapidly evolving big data technologies and landscape Disjointed tools 7.1 is built to ease these challenges

Analyze data anywhere in the data pipeline Recap: Pentaho 7.0 Analyze data anywhere in the data pipeline Data Prep Data Engineering Analytics Ingestion Processing Blending Data Delivery Data Discovery / Analysis Analysis & Dashboards Administration Security Lifecycle Management Data Provenance Dynamic Data Pipeline Monitoring Automation Bridging the gap between data preparation and analytics with a visual data experience from anywhere in the data pipeline: Bringing analytics into data prep Share analytics during data prep And governance, security, and big data ecosystem support for a blended data world New Spark capabilities Enhanced metadata Injection Hadoop security Simplified configuration, deployment, and administration

Introducing Pentaho 7.1 Adaptive execution and improved visualizations make users more productive and improves big data job performance across the entire data pipeline. Data Prep Data Engineering Analytics Ingestion Processing Blending Data Delivery Data Discovery / Analysis Analysis & Dashboards Administration Security Lifecycle Management Data Provenance Dynamic Data Pipeline Monitoring Automation Increased productivity and job performance for big data Adaptive execution on any engine, starting with Spark Increased cloud support with HDInsight More enterprise-level security for Hortonworks Kerberos impersonation support Ranger support Improved analytics at every stage of the data pipeline Visual Data Exploration enhancements Support for third party visualizations

CHALLENGE: Productivity and Job Performance for Big Data Jobs

Increased User Productivity and Job Performance for Big Data Jobs Adaptive Execution on any engine, starting with Spark Adaptive Execution for Spark Increased cloud support with HDInsight

Adaptive Execution on Any Engine, Starting with Spark Build Once, Execute on Any Engine Challenge: With rapidly changing big data technology, coding on various engines can be time-consuming or impossible with existing resources Solution: Future-proof data integration and analytics development in a drag-and-drop visual development environment, eliminating the need for specialized coding and API knowledge. Seamlessly switch between execution engines to fit data volume and transformation complexity Challenge: With rapidly changing big data technology, coding on various engines can be time-consuming or impossible Enables anyone, not just a Java developer, to work with processing engines such as Spark Build once, Execute on Any Engine. Can easily switch between execution engines, without rewriting transformation logic, based on data complexity Allows data teams to future-proof data integration and analytics development in a drag and drop visual development environment, eliminates the need for specialized coding and API knowledge and enables seamless switching of execution engines to fit data volume and transformation complexity. While the competition, such as Talend, support multiple execution engines, the number of transformation steps that they support decreases when they transition from their native engine to MapReduce and then further reduces when they transition to Spark. Pentaho on the other hand provides coverage of virtually all available steps across execution engines.

Adaptive Execution for Spark Process Big Data Faster on Spark Without Manual Coding Challenge: Finding the talent and time to work with Spark and newer big data technologies Solution: More easily develop Spark applications in PDI using adaptive execution to ingest, process and blend data from a range of big data sources and scale on Spark clusters Challenge: Finding the talent and time to work with Spark and newer big data technologies Develop Spark applications in PDI’s drag-and-drop visual development environment. Enables more people to be more productive Spark applications. Pentaho is the only vendor to support Spark with all data integration steps in a visual drag-and-drop environment.

Increased Cloud Support with HDInsight Store and Process Big Data in the Cloud with HDInsight Challenge: Big Data storage and processing options for deploying in the cloud, on premise, or hybrid Solution: Customers who already use Microsoft Azure HDInsight will be able to seamlessly use Pentaho's capabilities, allowing more options in cloud, on-premise, or hybrid deployments Challenge: Big Data storage and processing options for deploying in the cloud, on premise, or hybrid More options to store – and more importantly, process – big data in hybrid, on-premises, and public cloud environments. Now, potential customers who already use Microsoft Azure HDInsight will be able to use Pentaho's capabilities with support for Microsoft Azure HDInsight, Azure SQL and Azure SQL Server

CHALLENGE: Enterprise-Level Security for Hadoop Deployments

Additional Enterprise-Level Security for Hortonworks Kerberos Impersonation Support Ranger Support

Increased Kerberos Impersonation Support Increase Security with Hortonworks Deployments Challenge: Authentication security vulnerabilities with Hortonworks deployments Solution: Reduced risk, more secure multi-user Hadoop data integration, better big data governance, and cluster protection from intruders Challenge: Authentication security vulnerabilities with Hortonworks deployments With increased Kerberos impersonation support - reduce risk, provide more secure multi-user Hadoop data integration, better big data governance, and cluster protection from intruders

Manage Role Based Permissions on Hortonworks Ranger Support Manage Role Based Permissions on Hortonworks Challenge: Governance and risk with authorization on Hortonworks deployments. Solution: Enterprise-grade compatibility with Ranger for authorization and role-based access to specific data sets on Hortonworks, ensuring business access rules are enforced across Hadoop data and components Challenge: Security and role based permissions on Hortonworks In 7.1, enterprise-grade compatibility with Ranger for authorization and role based access to specific data sets on Hortonworks. Ensures business access rules are enforced across Hadoop data and components, promoting governance, protecting resources, and reducing risk.

CHALLENGE: Multiple Tools and Siloed Processes in Data Prep

Improved Analytics at Every Stage of the Data Pipeline Visual Data Exploration Enhancements Integration with Third Party Visualizations

Recap: Visual Data Exploration Access visualizations during data prep for inspection or prototyping Challenge: Inability to view visualizations without switching in and out of tools Solution: Visual Data Exploration provides access to analytics during data preparation so users can easily spot check data issues on the spot, without switching in and out of tools or waiting until the very end to find data quality problems In addition, IT and the business can collaborate and iterate faster, shortening the cycle from raw data to meaningful analytics. Challenge: Inability to view visualizations without switching in and out of tools Visual Data Exploration provides access to analytics during data preparation so users can easily spot check data issues on the spot, without switching in and out of tools or waiting until the very end to find data quality problems In addition, IT and the business can collaborate and iterate faster, shortening the cycle from raw data to meaningful analytics.

Visual Data Exploration Enhancements Drill-Down Exploration Can click to drill into various hierarchies Access visualizations during data prep for data inspection or prototyping Visual Data Exploration Enhancement: Ability to further drill-down into visualizations within Visual Data Explorer x Visual Data Exploration Enhancement: Ability to further drill-down into visualizations within Visual Data Explorer

Visual Data Exploration Enhancements New Visualizations: Heat Grid, Geo Map, Sunburst Access visualizations during data prep for data inspection or prototyping Visual Data Exploration Enhancement: New visualizations for expanded prototyping – heat grid, geo map, sunburst New visualizations for expanded prototyping – heat grid, geo map, sunburst Heat Grid: shows 2 dimensions and 2 measures at once. Most useful for relative comparisons at the ‘intersection’ of 2 dimensions. Ex: See sales metrics by each combination of month and region (as shown) Sunburst: useful for showing how a measure is distributed across several categories / attributes. Esp. useful for showing multiple levels in hierarchy at once. Ex: breakdown of sales by state (inner slice), and city (outer slice) Geo map: measures represented by dot size/color. Pan, zoom actions. User can now explicitly define lat and long fields when creating a location attribute for a model in annotate stream, with results showing in DE and Analyzer

Integration with 3rd Party Visualizations More Easily Integrate 3rd Party Visualizations Challenge: Easily integrating visualizations that are not out of the box Solution: Integrate visualizations from 3rd party libraries (D3, FusionCharts, Highcharts, etc) with an easier-to-use and more flexible API and documentation Challenge: Easily integrating visualizations that are not out of the box Integrate visualizations from 3rd party libraries (D3, FusionCharts, Highcharts, etc) with an easier-to-use and more flexible API and documentation. More robust framework for developers to use, including samples and documentation. Easier to integrate new visualizations into Pentaho Better developer APIs Reusability of visualizations in Data Explorer and Pentaho Analyzer Better documentation

Demonstration

Questions?

Thank You