Data Mining with R/ORE. Minming Duan.

Presentation transcript:

Data Mining with R/ORE Minming Duan

iTech Solution Profile
Agenda
– R/ORE Overview
– XML output generation using SQL
– Integration with IBP and BIEE
– R vs. SPSS
– FAQ
– Oracle R for Hadoop Connector

Why analysts use R
R is a statistics language similar to Base SAS or SPSS Statistics. The R environment is:
– Powerful
– Extensible
– Graphical
– Extensive in statistics
– Out-of-the-box (OOTB) functionality with many ‘knobs’ but smart defaults
– Easy to install and use
– Free

Limitations of R
R is a client and server bundled together as one executable, like Excel
– Single-user tool
– Not multi-threaded
– Cannot leverage CPU capacity even on a user's laptop/desktop
R requires the data it operates on to be loaded into memory first
– Loading data may not be a limitation given the RAM available on laptops/desktops
– R’s call-by-value semantics mean that, as data flows into functions, many copies of the data are made for each function invocation
– As a result, you quickly run into memory limits
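The memory point above can be seen in a few lines of plain R. This is only a minimal sketch: the function name scale_first and the vector size are assumptions made for illustration, not part of the original slides. Note that R defers the copy until the argument is actually modified, so the cost shows up in functions that change their inputs.

```r
# Call-by-value semantics: a function that modifies its argument works on
# its own copy, so the caller's vector is left unchanged.
scale_first <- function(v) {   # hypothetical helper, for illustration only
  v[1] <- v[1] * 100           # modifying v forces R to duplicate the vector
  v
}

x <- rep(1, 1e6)               # one million doubles, 8 bytes each
y <- scale_first(x)

x[1]                           # the caller's data is untouched
y[1]                           # the change lives in the copy

# Size of one full copy of x; every duplicate costs this much memory again.
print(object.size(x), units = "MB")
```

With a data set close to the size of available RAM, even one such duplicate is enough to exhaust memory, which is the limit the slide describes.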

Why should you be interested in R?
Emerging trends
– It’s the next “big thing” in advanced analytics
– Colleges and universities use R for statistics classes (replacing more traditional software tools)
– Advanced analytics as a critical differentiator of the DWH technology stack
Augment Oracle deployments
– Enhance results with powerful graphics
– Integrate R results and graphics with BI Publisher documents and OBIEE dashboards
A scalable R via Oracle R Enterprise
– Leverage Oracle-engineered solutions
– A viable alternative to SAS/SPSS

Rexer Analytics Survey 2011

Default R GUI

RStudio – Third Party, Open Source IDE

Oracle R Enterprise
Function push-down: data transformation & statistics
– Development: R workspace console. No changes to the user experience
– Production: Oracle statistics engine. Scale to large data sets
– Consumption: OBIEE, Web Services. Embed in operational systems

Oracle R Enterprise
Transparently leverage Hadoop for high-performance analytics to Oracle Big Data Appliance (part of Big Data Connectors software suite)
Function push-down: data transformation & statistics
– R workspace console
– Oracle statistics engine
– OBIEE, Web Services
©2012 Oracle – All Rights Reserved

Oracle R Enterprise – Key messages
Substantial leap forward from incumbent platforms
– Data volume: using SQL and existing DB functionality
– Data heterogeneity: Oracle DB + BDA
– Breadth of analytics: Oracle DB + R packages
– Breadth of user types: R + SQL + BI report developers, DBAs
Enables enterprise-wide consumption of advanced analytics models via integration with Oracle Exalytics
Most integrated and complete suite of Enterprise Advanced Analytics software available in the market today


R vs. SPSS: data loading

R vs. SPSS: processing

R vs. SPSS: modeling

R vs. SPSS: results

R Visualization

R Visualization (continued)

Frequently Asked Questions (FAQ)
What version(s) of R do we support?
– R , however versions R >= will likely work
What does CRAN stand for?
– Comprehensive R Archive Network
Is there a workflow GUI for R?
– Red-R, see
What other GUI front ends are there for R?
Are there R interfaces for ROLAP/MOLAP in Oracle?
– Not yet
Is there an R connector for NoSQL?
– Not yet

FAQ (continued)
Can we use CRAN open source packages in ORE and get the same benefits, e.g., performance, scalability?
– There are benefits, but not the same as from the ORE Transparency Layer
– Users can leverage data parallelism through embedded R execution
What resources are available for learning R / ORE in Oracle?
– See retriever.us.oracle.com
With ORE, is Oracle ANSI SQL enhanced to understand R?
– Using the extensibility framework, SQL table functions exist that can execute R scripts. The SQL syntax itself has not been extended.
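The data parallelism mentioned in this FAQ comes down to partitioning a table and running the same R function on each partition. The following is a minimal base-R sketch of that pattern only. It does not call ORE, it uses the built-in mtcars data as a stand-in for a database table, and the function name fit_one is hypothetical.

```r
# Partition-then-apply: fit the same model independently on each group.
fit_one <- function(df) {            # hypothetical per-partition function
  coef(lm(mpg ~ wt, data = df))      # intercept and slope for one partition
}

parts  <- split(mtcars, mtcars$cyl)  # one data frame per cylinder count
fits   <- lapply(parts, fit_one)     # partitions do not depend on each other
result <- do.call(rbind, fits)       # one row of coefficients per partition

result
```

Because no partition depends on another, the per-partition calls can run side by side. With embedded R execution, that work is done by R engines started on the database server rather than by a local lapply; the exact ORE calls depend on the release and are not shown here.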

FAQ (continued)
How does ORE help Exalytics? Is there integration between the two?
– OBIEE dashboards and BIP documents can execute R scripts to generate data and/or graphs to be displayed
– ORE scripts can generate table data for use in an RPD, and hence through Answers
Where do you get RStudio?
–

Q & A
Copyright © 2008, Oracle and/or its affiliates. All rights reserved.

Thanks!