R For The SQL Developer Kevin Feasel Manager, Predictive Analytics

Slides:



Advertisements
Similar presentations
1 Migrating from Access to SQL Server Simon Kingston, CSU / NPS NRGIS.
Advertisements

Data Analytics and Dynamic Languages Lee E. Edlefsen, Ph.D. VP of Engineering 1.
Insert Presentation Title Heremm.dd.yy, City, ST SQL Reporting Services – Building the Report Kevin Ford Services Consaultant Accela, Inc.
Basics of Web Databases With the advent of Web database technology, Web pages are no longer static, but dynamic with connection to a back-end database.
DTS Conversion to SSIS Conversion Best Practices Mike Davis
240-Current Research Easily Extensible Systems, Octave, Input Formats, SOA.
ONENOTE ST CENTURY LEARNING TOOLS Tips and tools for working with OneNote in the Classroom.
NCBI Genome Workbench Chuong Huynh NIH/NLM/NCBI Sao Paulo, Brasil July 15, 2004 Slides from Michael Dicuccio’s Genome Workbench.
Datalayer Notebook Allows Data Scientists to Play with Big Data, Build Innovative Models, and Share Results Easily on Microsoft Azure MICROSOFT AZURE ISV.
DATA MINING Pandas. Python Data Analysis Library A library for data analysis of (mostly) tabular data Gives capabilities similar to Excel and SQL but.
Session ID: Expert 114 BOBJ on HANA Dr. Bjarne Berg.
Boost Developer Productivity with a 360- Degree View of Every Software Change by Using FinditEZ, Certified Microsoft Platform Ready for SQL Azure MICROSOFT.
SQL SERVER 2008 Installation Guide A Step by Step Guide Prepared by Hassan Tariq.
Entity Framework Database Connection with ASP Notes from started/getting-started-with-ef-using-mvc/creating-an-
Blog: R YOU READY FOR.
Dato Confidential 1 Danny Bickson Co-Founder. Dato Confidential 2 Successful apps in 2015 must be intelligent Machine learning key to next-gen apps Recommenders.
EQuIS and Tableau Getting the most out of your tools.
9/24/2017 7:27 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
How to Get Started With Python
DevOps with ASP.NET Core and Entity Framework Core
Stop Doing That! Common T-SQL Anti-Patterns
Python for data analysis Prakhar Amlathe Utah State University
IBM Predictive Analytics Virtual Users’ Group Meeting March 30, 2016
Data Virtualization Demoette… ODBC Clients
Stress Free Deployments with Octopus Deploy
AP CSP: Cleaning Data & Creating Summary Tables
Foundations of Data Science
A session on the adjacent possible
Metis Data Science Meetup:
Working With Azure Batch AI
Stop Doing That! Common T-SQL Anti-Patterns
LOCO Extract – Transform - Load
Data Insights Big Data v3.1 Customer Flyer
Getting Started with R.
Development Changes in Dynamics 365 for Finance and Operations
Logo here Module 3 Microsoft Azure Web App. Logo here Module Overview Introduction to App Service Overview of Web Apps Hosting Web Applications in Azure.
Stop Doing That! Common T-SQL Anti-Patterns
Introduction to R Programming with AzureML
6 Benefits of Using Microsoft Access Database. Microsoft Access is an efficient program that helps companies to carry out complex business processes in.
SQL Server Data Tools for Visual Studio Part I: Core SQL Server Tools
Prepared by Kimberly Sayre and Jinbo Bi
Microsoft Build /22/2018 4:06 AM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
Order Management For Shippers.
CTI Contact Center For CustomerSoft ESP
What’s New in Colectica 5.3 Part 1
Network Visualization
Principles of report writing
Dane Stubben QuintilesIMS Database Manager
Teacher Academy Workshops
Making PowerShell Useful
Stop Doing That! Common T-SQL Anti-Patterns
Cloud Foundry on Azure and Azure Stack
Accelerate Your Self-Service Data Analytics
Thank you Sponsors.
Microsoft Virtual Academy
EMSE 6574 – Programming for Analytics: Python 101 – Python Enviornments Joel Klein.
Genome Workbench Chuong Huynh NIH/NLM/NCBI New Delhi, India
Making PowerShell Useful
Chapter 7 –Implementation Issues
Building your First Cube with SSAS
Get More Than CRUD From Your ORM
Predictive Models with SQL Server Machine Learning Services
Topic 11 Lesson 1 - Analyzing Data in Access
Finding and using code online
DEV322 Visual Studio 2005 C# IDE Enhancements
Server & Tools Business
REST Easy - Instant APIs for Your Database
Windows Forms in Visual Studio 2005: An in-depth look at key features
SSDT, Docker, and (Azure) DevOps
Open data in teaching and education
Presentation transcript:

R For The SQL Developer Kevin Feasel Manager, Predictive Analytics ChannelAdvisor

Who Am I? What Am I Doing Here? Curated SQL https://curatedsql.com Tribal SQL http://tribalsql.com @feaselkl

What Is R? R is a language focused around performing statistical analysis, predictive modeling, and data cleansing. R is an off-shoot of the S language and is built on top of C.

What Is R? There are two major branches of R of interest to us: base R and Microsoft R. "Base" R is managed by the R Consortium and is entirely open-source. Microsoft takes base R and adds additional libraries and support.

Why Use R? R provides several advantages as a data analysis Domain Specific Language (DSL): R has a large number of built-in functions for performing operations, including aggregates, statistical analysis, and graphing and plotting. The R ecosystem is vast. With CRAN (CPAN or NuGet for R), thousands of open-source packages are available to help you solve common data cleansing, data analysis, and plotting problems. R language constructs make set-based analysis and operation easy, improving performance and reducing the transition period for SQL Server developers. R helps you go well beyond simple Excel analysis and pivot  tables

Notebooks R integrates well with the idea of notebooks. Notebooks are a way of mixing Markdown-enabled text and language snippets to make your thoughts clear to others. You can create and share notebooks, allowing others easily to test your process and follow along. Notebooks are also an excellent teaching mechanism. Today's talk will look at Jupyter Notebooks. Jupyter (which name derives from a combination of the languages Julia, Python, and R) is a great framework because it has support for dozens of languages. Microsoft uses Jupyter Notebooks for its Azure Machine Learning product.

Motivation My goals in this talk: Introduce you to the R ecosystem, including programs, libraries, and places to learn more. Introduce you to the R language and show how to connect to SQL Server, as well as a few things you can do with R. Introduce you to notebooks and show how they can serve as pedagogical or scientific purposes. Get you thinking about ways you could use R in your environment today. Note that R is not the only data analysis language you could learn. Julia and Python are also great languages, and there are very good closed- source, commercial tools like SAS

Motivation Call logging plot:

Motivation CPU usage plot:

Motivation Columnstore index updates in SQL Server 2016:

Introducing R Installing R and Tools Learning the Basics Connecting to SQL Server Getting a Taste of R

Getting the Right Version of R There are two core versions of R: open-source base R and Microsoft R (nee Revolution R). Selected features: Version Parallelism Data Size Deployment Base R Parallel library Memory Shiny Microsoft R Open MKL w/o ScaleR Microsoft R Client ScaleR, 2 threads Memory; can connect to R Server Microsoft R Server Full parallelism Memory or disk DeplyR/Shiny

Choosing Your IDE There is one big IDE available: RStudio. RStudio is a standalone installation and provides a nice development interface for R. Microsoft has also made available R Tools for Visual Studio (RTVS), a Visual Studio plug-in. It offers some interesting features like making SQL Server R Services integration easier, and it integrates with other Visual Studio projects.

Jupyter We will also install Jupyer Notebooks and use it during this talk. Installing Jupyter takes a few steps, but the links for this talk include a step-by-step walkthrough. The easiest way to install Jupyter is to use Anaconda, a data science suite for Python. Jupyter also comes with Visual Studio 2017 if you install the Data Science tools.

Introducing R Installing R and Tools Learning the Basics Connecting to SQL Server Getting a Taste of R

Learning About Notebooks Instead of spending a lot of time talking theory, let's investigate R using notebooks. Notebooks allow us to combine code and explanatory text (using Markdown to help with formatting). The most important thing about notebooks is that they are repeatable, meaning that I should be able to hand you a notebook and have you run it all the way through, getting the same results I do. Notebooks help scientists defend their hypotheses and allow others to replicate their experiments.

Demo Time

Introducing R Installing R and Tools Learning the Basics Connecting to SQL Server Getting a Taste of R

Connecting to SQL Server Connecting to a SQL Server database (or any other relational database) is easy with R. The first step is to install the RODBC pacakage to give your R code ODBC support. From there, you can connect to a system data source that you've defined in your ODBC Data Sources. You could also pass in a connection string if you don't want to set up a DSN.

Demo Time

Introducing R Installing R and Tools Learning the Basics Connecting to SQL Server Getting a Taste of R

Getting a Taste of R No single talk will expose the full gamut of what you can do with R, but this next section will try to hit a few of the highlights. If this feels a bit overwhelming, don't fret: you can grab the notebook and try it out yourself. This notebook will cover the analysis of restaurant data for Wake County, North Carolina over a multi-year period.

Demo Time

Wrapping Up R is a powerful language for performing analysis. We've seen just a few of the many valuable uses of R. To learn more, go here: https://CSmore.info/on/r And for help, contact me: feasel@catallaxyservices.com | @feaselkl