Course Lab Introduction to IBM Watson Analytics

Slides:



Advertisements
Similar presentations
PROJECT RISK MANAGEMENT
Advertisements

RGS-IBG Online CPD course in GIS Analysing Data in ArcGIS Session 6.
Predictive Data Modeling A CASE STUDY FOR DATA MODELING.
Introduction to Data Mining Data mining is a rapidly growing field of business analytics focused on better understanding of characteristics and.
Microsoft Enterprise Consortium Data Mining Concepts Introduction to Directed Data Mining: Decision Trees Prepared by David Douglas, University of ArkansasHosted.
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
Beyond Opportunity; Enterprise Miner Ronalda Koster, Data Analyst.
Introduction to Directed Data Mining: Decision Trees
CHAPTER 11 Managerial Support Systems. CHAPTER OUTLINE  Managers and Decision Making  Business Intelligence Systems  Data Visualization Technologies.
NSW Curriculum and Learning Innovation Centre Tinker with Tinker Plots Elaine Watkins, Senior Curriculum Officer, Numeracy.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
by B. Zadrozny and C. Elkan
DTIC Discovery Tools 28 March 2012 Moderator: Kapin L. Ferguson.
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Zhangxi Lin ISQS Texas Tech University Note: Most slides are from Decision Tree Modeling by SAS Lecture Notes 5 Auxiliary Uses of Trees.
Analytics for Smart Decisions The Wonderful World of Big Data, Business Analytics and Business Intelligence - Demystified.
Chapter 11 Business Intelligence Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall 11-1.
Decision making Under Risk & Uncertainty. PAWAN MADUSHANKA MADUSHAN WIJEMANNA.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
What is Data Mining? process of finding correlations or patterns among dozens of fields in large relational databases process of finding correlations or.
CISB113 Fundamentals of Information Systems Data Management.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
BY SANDY. WHAT IS DATAMINING TYPES OF DATAMINING TOOLS OVERVIEW OF TIBCO TIBCO SPOTFIRE MINER DATA ANALYSIS EXPLORE DATA MANIPULATE DATA CHART VIEW.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 4 An Excel-based Data Mining Tool (iData Analyzer) Jason C. H. Chen, Ph.D. Professor of MIS.
Today’s Goals Answer questions about homework and lecture 2 Understand what a query is Understand how to create simple queries using Microsoft Access 2007.
MIS2502: Data Analytics Advanced Analytics - Introduction.
® IBM Software Group © 2009 IBM Corporation Essentials of Modeling with the IBM Rational Software Architect, V7.5 Module 15: Traceability and Static Analysis.
Using Blackboard as a Tool to Teach Online Technology Skills in College Classrooms Dr. Victoria Haddad Adjunct Professor, College of Technology Wilmington.
© 2015 Ex Libris | Confidential & Proprietary Yoel Kortick | Senior Librarian Primo Analytics.
1 Determining How Costs Behave. 2 Knowing how costs vary by identifying the drivers of costs and by distinguishing fixed from variable costs are frequently.
Oracle Advanced Analytics
Once Upon a Time: The Story of a Successful BI Implementation
Significance of Findings and Discussion
Sourcing Event Tool Kit Matrix Pricing & Tiered Pricing User Guide
SNS COLLEGE OF TECHNOLOGY
Customer Support Strategic Pillars
MIS2502: Data Analytics Advanced Analytics - Introduction
Dealing with data qualitative data The main report
Chapter 11: Project Risk Management
Course Lab Introduction to IBM Watson Explorer
Course Lab Introduction to IBM Watson Explorer
Fundamentals of Information Systems
E-Commerce Theories & Practices
Designing a Housing Microfinance Loan Product
Learning Objectives: Understand the interdependence of DM and KM
IBM WATSON ANALYTICS Training
Applications of Data Mining in Software Engineering
NBA Draft Prediction BIT 5534 May 2nd 2018
MIS5101: Data Analytics Advanced Analytics - Introduction
Tools of Software Development
Regional Architecture Development for Intelligent Transportation
Regional Architecture Development for Intelligent Transportation
Course Lab Introduction to IBM Watson Analytics
Critical Path Method Farrokh Alemi, Ph.D.
Course Lab Introduction to IBM Watson Analytics
Spreadsheets, Modelling & Databases
Reporting Site Manager User Guide February 2019.
JMP 11 added new features and improvements to CCB and MSA.
MIS2502: Data Analytics Introduction to Advanced Analytics
Multiple Regression – Split Sample Validation
Shortlisting Applications
TECHNOLOGY ASSESSMENT
Shortlisting Applications
Big DATA.
MIS2502: Data Analytics Introduction to Advanced Analytics and R
Yining ZHAO Computer Network Information Center,
Chapter 9 Excel Extension: Now You Try!
Information system analysis and design
Presentation transcript:

Course Lab Introduction to IBM Watson Analytics University of Rome «La Sapienza» Course of Business Intelligence - 2018 Course Lab Introduction to IBM Watson Analytics

Ing. Ivonne E. Vereau Tolino Lab Speakers Ing. Vittorio Carullo Software Architect IBM Watson Squad Senior Member v.carullo@it.ibm.com Ing. Ivonne E. Vereau Tolino Software Engineer IBM Software Services Specialist ivonne_vereau@it.ibm.com

Target & Scope Familiarize with a «real» software used in large enterprises Accomplish small but significant use cases in BI arena Understand the impact of tools over team productivity Introduce advanced topics like the use of “non structured” information Lab sessions will be held on Thursday, starting from September 27, 2018 , 4 - 6 pm

Labs Schedule Presentation of the tool and its basic features Lab 1: Introduction to Watson Analytics Understanding and use of the tool features or conducting BI use cases Lab 2: Working with Data Lab 3: Analyze and Discover Lab 4: Predict and take decisions Lab 5: Report and visualization Use of the tool for Advanced Analytics Lab 6: Working with Social Media Lab 7: Introduction to Content Analytics Lab 8: Putting all together Today’s topic is highlighted!

Reference Materials Explore IBM Watson Analytics https://public.dhe.ibm.com/software/data/sw- library/analytics/watsonanalyticsgallery/ A gallery of use cases with related datasets and supporting data visualizations IBM Knowledge Center https://www.ibm.com/support/knowledgecenter/en/SS4QC9/com.ibm.sol utions.wa.doc/welcome.html A technical reference for product features

Today’s Contents: Analyze and Discover (Part 2) Predictive analysis Decision rules and Decision tree

1. Predictive Analysis

Identify key drivers When you select a Starting Point or submit a question within the Discovery set, Watson analytics will build a linear regression model to quantify the impact that each field and each potential combination of fields, has on our dependent variable Dependent variable = target Strength essentially captures how well each driver can explain the variance in our target  Predictive power! Drivers = Factors impacting the target

The “spiral” visualization After creating the Discovery, you will be able to navigate the data from different perspectives using a visualization. To best identify the predictive aspects of the data, it is recommended to use the Spiral Visualization The Spiral Visualization opens displaying a target in the center of the spiral, with icons surrounding it.

The “spiral” visualization (cont.) You can identify the factors impacting the key performance indicator, by hovering over the surrounding icons in the spiral. In some cases, there is a combination of factors having impact, or there might be only a single factor. Combination factors and single factors are represented by different icons. Factors closes to the center of spiral have the highest predictive strength than a single factor.

The “spiral” visualization (cont.)

Target to analyze You can change target When changing target, also Drivers change.

Analysis of factors Now, that you know which factors are predictors of the target, you can continue your analysis of the factors using the table.

Predictive analysis: Hands on Upload spreadsheet into Watson Analytics «WA_Fn UseC_ Marketing Campaign Eff UseC_ FastF.csv» Let’s look at the data asset and understand the meaning of columns

Predictive analysis: Hands on Pose a natural question «What drives SalesInThousands?» Choose Spiral visualization Within Discovery Set, identify Drivers Change the target from «SalesInThousands» to «MarketSize» Identify new Drivers

2. Decision rules and Decision tree

How to view decision rules and the decision tree in a predictive model? To analyse the predictive factors of a specific target choose a Data Asset. Based on the dataset, Watson Analytics generates a predictive model that includes: Associated Decision Rules Decision Tree visualizations

Decision rules Decision rules are a set of statistically generated profiles, with each profile showing you a group of factors that are used that to classify records.

Decision rules (cont.) This classification helps identify which combination of factors are probable to result in a specific outcome for the target field

Decision rules (cont.) The Decision rules tab also provides an overall predictive strength for all the records in the data set.

Decision rules (cont.) In this case, the first profile predicts an outcome with the highest average case call duration. This outcome is predicted to occur when Agent Training Level is No Training and Case Type is Request and when Case Area is Hardware.

Decision rules (cont.) You can also identify what combination of factors will predict an outcome with the lowest average case call duration. This can be done by changing the sort on the predicted value column, from the current descending, to ascending.

Decision tree The decision tree shows you patterns of characteristics that lead to a certain outcome, which you can think of as profiles.

Decision tree (cont.) Reading from left to right, each branch in the tree is a unique pattern that leads to the likelihood of an outcome occurring in the past. Each level of the tree (from left to right) has a higher predictive strength than the subsequent levels and branches.

Decision tree (cont.) You can examine a pattern by collapsing nodes to their lowest leaf level, and then expanding the nodes to analyze predicted outcomes at each level.

Decision tree (cont.) The tree also includes a color gradient at each level to emphasize the target values. As shown by the gradient legend, a darker color indicates higher average values, while a lighter color indicates lower values.

Decision tree (cont.) At the end of the branch, you can identify that the lowest level in this branch is Agent Training Level, and the training level with the highest average case call duration is No training.

Decision tree (cont.) As you look back through the branch, you can identify that the levels and values showing the highest average call duration, are the same factors, profile, and values you previously identified on the Decision rules tab.

Predictive analysis It is up to you as to which view you use to interpret decision rules. Typically, most users prefer the natural language view of the Decision rules tab. And what about you, what do you prefer?

Predictive Analysis: Hands on Upload “Service Agent Performance dataset” to Watson Analytics Let’s look at the data asset and understand the meaning of columns

Predictive Analysis: Hands on Within the Decision rules tab, analyse the target “Case Call Duration”, check the profiles and tell what is the factor that drives the greatest duration call. Within the Decision tree, using the color gradient, identify which case type has the highest average case call duration. Change target to analyse, for example: Service Satisfaction and check what it changes.