Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Analytik Mittels R als übergreifende Plattform Detlef E. Schröder Oracle DWH Community.

Slides:



Advertisements
Similar presentations
Fachbereich Elektrotechnik und Informationstechnik
Advertisements

Database Futures Workshop CERN Michael Dahlinger, GSI
Oracle Data Warehouse Mit Big Data neue Horizonte für das Data Warehouse ermöglichen Alfred Schlaucher, Detlef Schroeder DATA WAREHOUSE.
Dream Report: Advanced Manual Data Entry
Data Mining with R/ORE Minming Duan. 2 iTech Solution Profile Agenda R/ORE Overview 1 XML output generation using SQL 4 Integration with IBP and BIEE.
Lecture plan Information retrieval (from week 11)
Statische Code-Analyse mit FxCop & StyleCop ROLAND GRAF SIMON KRANZER.
Migration in die Cloud – so gelingt ein nachhaltiger Umstieg.
Copyright © 2012, Oracle and/or its affiliates. All rights reserved. 1.
Chapters 14 & 15 Internet Databases. E-Commerce  Bringing new products, services, or ideas to market, supporting and enhancing business operations 
IT Project Management, Third Edition Appendix A1 Appendix A: Guide to Using Microsoft Project 2002.
Essbase Reporting Jim Kubik Senior Sales Consultant.
Oracle Enterprise Manager – Cloud Control 12c Simon Keys, The Small Ronnie Martin Lambert, The Large Ronnie.
WebGIS als Instrument für die WOZ-Aufnahmen? Jochen Häußler, European Media Lab.
Architecting for the Internet of Things
Business Intelligence components Introduction. Microsoft® SQL Server™ 2005 is a complete business intelligence (BI) platform that provides the features,
Oracle Advanced Analytics Oracle R Enterprise & Oracle Data Mining
PI Data Access via OLE DB/SQL
Copyright © 2006, SAS Institute Inc. All rights reserved. Enterprise Guide 4.2 : A Primer SHRUG : Spring 2010 Presented by: Josée Ranger-Lacroix SAS Institute.
A Spotfire Demo Gallery with Data Science Dr. Brand Niemann Director and Senior Data Scientist Semantic Community November 13, 2011 DRAFT 1.
Working With Large Datasets in Corporate Settings Ed Bassin
SharePoint 2010 Business Intelligence Module 10: Reporting Services.
Jason G. Caudill Assistant Professor of Business Administration Carson-Newman College.
MapReduce April 2012 Extract from various presentations: Sudarshan, Chungnam, Teradata Aster, …
Almost 4 decades of Advanced Analytics & DM expertise.
PHP With Oracle 11g XE By Shyam Gurram Eastern Illinois University.
Interactive report designer ADO.Net MS SQL Server OleDB ODBC Oracle SQL MySQL Firebird SQL Postgre SQL XML Business Objects What types of data does Stimulsoft.
Hive Facebook 2009.
Create a Homepage Takeovers Bigsizebanner, Contentad and Skyscraper (optional Background and/or Layer)
An Introduction to HDInsight June 27 th,
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1.
Mainframe (Host) - Communications - User Interface - Business Logic - DBMS - Operating System - Storage (DB Files) Terminal (Display/Keyboard) Terminal.
SQL Server 2012 Session: 1 Session: 3 Introduction to SQL Server 2012 Data Management Using Microsoft SQL Server.
Copyright © 2013, SAS Institute Inc. All rights reserved. SAS GLOBAL FORUM: NEW & NOTEWORTHY MATT MALCZEWSKI – COMMUNITIES MANAGER.
SharePoint & BizTalk better together. & BizTalk even better together Update.
Copyright © 2004, SAS Institute Inc. All rights reserved. SAS Stored Processes An analyst’s perspective Sylvain Tremblay SAS Canada 24 February 2006.
Angel Trifonov Yun Lu Ying Wang RICARDO: INTEGRATING R AND HADOOP.
PEPPOL (Pan European Public Procurement Online) by IBM
All about Revolution R Enterprise
© 2013, published by Flat World Knowledge Chapter 10 Understanding Software: A Primer for Managers 10-1.
Progress OpenEdge Analytics360 Predicting The Future Of Your Business Michael Marriage Senior Principal Product Manager
Copyright © 2015, SAS Institute Inc. All rights reserved. THE ELEPHANT IN THE ROOM SAS & HADOOP.
Impala. Impala: Goals General-purpose SQL query engine for Hadoop High performance – C++ implementation – runtime code generation (using LLVM) – direct.
Integrating the Mainframe Liberating Enterprise Data.
23 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Answers: Advanced Features.
2 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. BI Publisher: Technology and Architecture.
OBIEE Architecture.
Oracle OLAP Option Bud Endress Director of Product Management, OLAP.
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
1 Seattle University Master’s of Science in Business Analytics Key skills, learning outcomes, and a sample of jobs to apply for, or aim to qualify for,
Gut zu wissen... Martin Jäckle Leitender Systemberater Solution House Development Framework.
1. 2 Quick Introduction to Oracle R Enterprise ©2011 Oracle – All Rights Reserved.
Slide 1 © 2016, Lera Technologies. All Rights Reserved. SAP BO vs SPLUNK vs OBIEE By Lera Technologies.
Microsoft Power Query: an Excel Users Dream for Data Extraction and Cleansing Presented by: Belinda Allen Smith & Allen Consulting, Inc.
E Copyright © 2006, Oracle. All rights reserved. Using SQL Developer.
Bundesministerium der Justiz - Berlin, Mai 2009 Projekt des BMJ: „Elektronische Arbeitshilfen und Verkündung“ im Rahmen der Einer-für-Alle-Dienstleistung.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Microsoft Power Query 101 Belinda Allen Smith & Allen Consulting, Inc.
© 2009 Oracle Corporation – Proprietary and Confidential Agenda Reporting Overview Performance Workspace Dashboards Reports Drill thru Smartview Excel.
Oracle Big Data eSeminar Series
Bridging the Data Science and SQL Divide for Practitioners
Appendix A: Guide to Using Microsoft Project 2002
Introduction to R Programming with AzureML
Data Platform and Analytics Foundational Training
Redesign Firmen-Websites
Dane Stubben QuintilesIMS Database Manager
Web Browser server client 3-Tier Architecture Apache web server PHP
Server & Tools Business
A Configurable FPGA-Based Traffic Generator for High-Performance Tests of Packet Processing Systems The Sixth International Conference on Internet Monitoring.
Appendix A: Guide to Using Microsoft Project 2002
Presentation transcript:

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Analytik Mittels R als übergreifende Plattform Detlef E. Schröder Oracle DWH Community STCC DB

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Themen  Anforderungen an Datenmanagement  R - Grundsätzliches  Die Datendrehscheibe  Datenbank – Oracle  Hadoop  XML  Office  R- in der Mitte  Fragen und Antworten

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Anforderungen an Datenmanagement Eingabe Hadoop – Big Data klassische Dateien XSLX,TXT,LST,Mail Web Inhalte – Referenzdaten Datenbanken Oracle Streams 3 Verarbeitung Bereinigen Transformieren Analysieren Anreichern Mining Ausgabe Grafiken Tabellen Excel / Office SQL Druck

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Anforderungen an Datenmanagement Eingabe Hadoop – Big Data klassische Dateien XSLX,TXT,LST,Mail Web Inhalte – Referenzdaten Datenbanken Oracle Streams 4 Verarbeitung Bereinigen Transformieren Analysieren Anreichern Mining Ausgabe Grafiken Tabellen Excel / Office SQL Druck

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | R - Grundsätzliches 5

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | R – Überblick R Statistische Programmiersprache users sales histor y retur ns onlin e PCA 5 Factor 3Factor 1Clustering 4 Groups

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Warum sollte man sich für R interessieren Gehört zu den neuen aufkommenden Trends – Next “big thing” in Avanced Analytics – Moderne statistische Programmiersprache – Ausbildungsinstitute und Universitäten nutzen R für die Ausbildung und ersetzen damit traditionelle Tools – Advanced Analytics ist zunehmend kritisches Unterscheidungsmerkmal im DWH Technologie Stack R wird durch Oracle R Enterprise skalierbar Kostengünstige Alternative zu SAS Kernfunktionalität

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Daten-Visualisierung mit R

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Oracle R-Angebote Oracle R Distribution – Free download, pre-installed on Oracle Big Data Appliance, bundled with Oracle Linux – Enterprise support for customers of Oracle R Enterprise, Big Data Appliance, and Oracle Linux – Contribute bug fixes and enhancements to open source R ROracle – Open source Oracle database interface driver for R based on OCI – Maintainer is Oracle – rebuilt from the ground up – Many bug fixes and optimizations Oracle R Enterprise – Transparent access to database-resident data from R – Embedded R script execution through database managed R engines – Statistics engine Oracle R Connector for Hadoop – R interface to Oracle Hadoop Cluster on BDA – Access and manipulate data in HDFS, database, and file system – Write MapReduce functions using R and execute through natural R interface

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Die Datendrehscheibe 10

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Lesen und schreiben in dei Oracle DB Mögliche Szenarien mit Oracle R-Enterprise R Engine Direkten Zugriff auf alle Tabellen in der Datenbank File System R Engine Auslagern der Analysen in die Datenbank Zurückholen der Ergebnisse Anlegen neuer Objekte in der Datenbank R Engine Parallelisierung durch die Datenbank File System R Engine R-Analysen über SQL-Funktionen (Batch) R Engine Parallelisierung durch die Datenbank File System SQL

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | ORE bietet eine “in-database execution” – Funktionalität als transparente Schicht an What’s transparent about it? – R Benutzter benötigen nur R Syntax – Benutzer sehen Datenbank-Objeckte als spezielle R Objekte – Unterstützt weden fast alle R-Funktionen des Basis-Pakets – Unterstützt R's Statistik und Graphik-Pakete Funktional vergleichbar mit SAS DATA STEP, läuft allerdings in-Datenbank! Transparency Layer Support

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | ORE Packages PackageDescription ORETop Level Package for Oracle R Enterprise OREbaseCorresponds to R’s base package OREstatCorresponds to R’s stat package OREgraphicsCorresponds to R’s graphics package OREedaORE’s exploratory data analysis package containing SAS PROC-equivalent functionality ORExmlORE’s package supporting XML translation between R and Oracle Database – internal ORHCOracle R Hadoop Connector

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Invoke in-database aggregation function Client R Engine Other R packages Oracle R package R user on desktop Oracle Database User tables Transparency Layer aggdata <- aggregate(ONTIME_S$DEST, by = list(ONTIME_S$DEST), FUN = length) class(aggdata) head(aggdata) Source data is an ore.frame ONTIME_S, which resides in Oracle Database The aggregate() function has been overloaded to accept ORE frames aggregate() transparently switches between code that works with standard R data.frames and ore.frames Returns an ore.frame In-db stats ©2012 Oracle – All Rights Reserved select DEST, count(*) from ONTIME_S group by DEST

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Gezieltes Ansteuern einer Verarbeitungsvariante (Beispiel Regressions Modell) mod <- ore.doEval( function(param) { library(ORE) ore.connect(user="RQUSER", password="RQUSER„, sid="ORCL", host=" ",port=1521) ore.sync() ore.attach() mod <- lm(ARRDELAY ~ DISTANCE + DEPDELAY, dat) return (mod) }); mod_local <- ore.pull(mod) class(mod_local) summary(mod_local) Daten bleiben im Memory Der Oracle Datenbank mod <- ore.doEval( function(param) { dat <- ore.pull(ONTIME_S) mod <- lm(ARRDELAY ~ DISTANCE + DEPDELAY, dat) return( mod ) }); mod_local <- ore.pull(mod) class(mod_local) summary(mod_local) Daten im Memory der R-Engine auf dem DB-Server Laufzeit: 3 Sekunden Laufzeit: 110 Sekunden

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | R on Hadoop 16

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Integrierte R Umgebung Oracle R Connector for Hadoop Native R MapReduce Native R HDFS Zugriff Mehr Produktivität ORE Client Host R Engine Hadoop Cluster Software R Engine MapReduce Nodes HDFS Oracle Big Data Appliance Oracle Exadata R Engine ORE ORHC

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | R on Files TXT – Read.table > meinetabelle <- read.table("/Pfad/zur/Tabelle.txt", header=TRUE) CSV – Read.csv XLS – read.xls und andere ODBC – RODBC für alle Files, für die es ODBC Verbindungen gibt 18

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | R on http Rcurl – Zeilenweise lesen eines http – Requests – Verarbeiten von <> Tags Standard Informationen auf Wikipedia... 19

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | R – in der Mitte

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | SQL – Datenbank als Integration SQL auf R - Funktionen SQL auf Datenbankobjekte SQL auf Hadoop durch die DB (Big Data SQL) 21

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | XML – als Integration Für Grafiken und Tabellen Als Basis für alles was XML spricht Berichtswesen – Z.B: OBIP – Oracle BI Publisher 22

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Anzeige über Business Intelligence Als Funktion oder gespeicherte Ergebnisse Das Kundenranking wird mit in den Berichten angezeigt

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Einbindung von R-Grafiken in OBIEE Mit Parametern ©2011 Oracle – All Rights Reserved

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. | Grafiken exportieren JPG, PNG, BMP, TIFF, SVG, EPS,... Direkt in ein Verzeichnis für weitere Nutzung 25

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |

XLS(X) – In die Officewelt Aus R in Excel Für Tabellen und Berechnungen Öffnet in die Anwenderschicht – write.xls() oder xlsReadWrite als Package 30

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |