It’s All About Me From Big Data Models to Personalized Experience

Slides:



Advertisements
Similar presentations
Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.
Advertisements

 Firewalls and Application Level Gateways (ALGs)  Usually configured to protect from at least two types of attack ▪ Control sites which local users.
Introduction To System Analysis and Design
CASE Tools CIS 376 Bruce R. Maxim UM-Dearborn. Prerequisites to Software Tool Use Collection of useful tools that help in every step of building a product.
Oakkar Fall The Need for Decision Engine Automate business processes Implement complex business decision logic Separation of rules and process Business.
UNIT-V The MVC architecture and Struts Framework.
Walter Hop Web-shop Order Prediction Using Machine Learning Master’s Thesis Computational Economics.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Chun-Hung Chou
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Cmpe 589 Spring Software Quality Metrics Product  product attributes –Size, complexity, design features, performance, quality level Process  Used.
1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.
Chapter 6: Foundations of Business Intelligence - Databases and Information Management Dr. Andrew P. Ciganek, Ph.D.
Lecture 9: Knowledge Discovery Systems Md. Mahbubul Alam, PhD Associate Professor Dept. of AEIS Sher-e-Bangla Agricultural University.
1 A Static Analysis Approach for Automatically Generating Test Cases for Web Applications Presented by: Beverly Leung Fahim Rahman.
Copyright © 2012, SAS Institute Inc. All rights reserved. ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY,
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Objectives: Terminology Components The Design Cycle Resources: DHS Slides – Chapter 1 Glossary Java Applet URL:.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.../publications/courses/ece_8443/lectures/current/lecture_02.ppt.
Finite State Machines (FSM) OR Finite State Automation (FSA) - are models of the behaviors of a system or a complex object, with a limited number of defined.
Data Mining and Decision Support
Chong Ho Yu.  Data mining (DM) is a cluster of techniques, including decision trees, artificial neural networks, and clustering, which has been employed.
Machine Learning in CSC 196K
Csci 418/618 Simulation Models Dr. Ken Nygard, IACC 262B
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
AZURE MACHINE LEARNING Bringing New Value To Old Data SQL Saturday #
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.
Introduction to Machine Learning, its potential usage in network area,
Prepared by John Swearingen
CIS 375 Bruce R. Maxim UM-Dearborn
Bhakthi Liyanage SQL Saturday Atlanta 15 July 2017
Machine Learning with Spark MLlib
Software Testing.
SNS COLLEGE OF TECHNOLOGY
Definition CASE tools are software systems that are intended to provide automated support for routine activities in the software process such as editing.
Chapter 11: Artificial Intelligence
Introduction to Quantitative Analysis
Chapter 11: Artificial Intelligence
Introduction Characteristics Advantages Limitations
Siemens Enables Digitalization: Data Analytics & Artificial Intelligence Dr. Mike Roshchin, CT RDA BAM.
School of Computer Science & Engineering
Microsoft SharePoint Server 2016
Supervised Time Series Pattern Discovery through Local Importance
Reading: Pedro Domingos: A Few Useful Things to Know about Machine Learning source: /cacm12.pdf reading.
DEFECT PREDICTION : USING MACHINE LEARNING
GoF Design Patterns (Ch. 26). GoF Design Patterns Adapter Factory Singleton Strategy Composite Façade Observer (Publish-Subscribe)
OpenWorld 2018 How to Create Chatbots with OMCe
By Thomas Hartmann, Assad Moawad, Francois Fouquet, Yves Le Traon
AI in Cyber-security: Examples of Algorithms & Techniques
Object-Oriented Analysis
Intro to Machine Learning
Roberto Battiti, Mauro Brunato
Introduction Artificial Intelligent.
Objective of This Course
Verification and Validation Unit Testing
Machine Learning Telepathy for Shift Right Approach
Lecture 1: Multi-tier Architecture Overview
Overview of Machine Learning
iSRD Spam Review Detection with Imbalanced Data Distributions
An Introduction to Software Architecture
GoF Design Patterns (Ch. 26)
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Intro to Machine Learning
Machine Learning for Space Systems: Are We Ready?
Rohan Yadav and Charles Yuan (rohany) (chenhuiy)
Dr. Arslan Ornek MATHEMATICAL MODELS
An Introduction to Data Science using Python
AI Builder for Power Platform
Getting Started with Microsoft Azure Machine Learning
Presentation transcript:

It’s All About Me From Big Data Models to Personalized Experience Yao Morin, Ph.D.

Go from this…

… to this …

Roots as a Desktop App (and old) 30 Million users filed their taxes with TurboTax 5 Million used desktop 25 Million used online TurboTax is 25 years old Roots as a Desktop App (and old)

SERVICES

Business Logic and TurboTax Hard-coded business logic Fixed UI flow Domain knowledge embedded

Experience A Experience B We know what you PREFER

We serve up what’s RELEVANT to you

We know when you need HELP

How can we tailor the experience just for YOU?

Marriage between Data Science and Dynamic and Responsive Frontend

What is Data Science? It is multidisciplinary study and incorporates various techniques and theories from many fields, such as statistics, mathematics, artificial intelligence, data engineering, etc. Answers questions based on data instead of assumptions extract meaning from data and explain phenomenon uncover patterns from data and develop predictive models

From business problems to models E2E goals definition Model KPI, Input/ Output definition Model creation and offline evaluation Online model coding & validation Integration/ Experience QA Online evaluation Result analysis Training/ test set preprocessing Algorithm & method selection Model training/ parameters selection KPI measurement/ accuracy assessment

Data model building cycle Training/ test set preprocessing KPI measurement/ accuracy assessment Algorithm & method selection Model training/ parameters selection

Identify data Features - what information do you have From data inventory and/or domain experts Examples: Demographic, behavioral or geographic data, etc. Labels ( for supervised learning ): what you want to predict What kind of products to recommend Whether a customer buys a product How a customer reacts to an experience

Pre-processing data “Encoding” categorical data ZIP code, feelings, occupations dummy coding, bucketing, and others Imputations – “filling in” missing data ML estimations, stochastic regression, multiple imputation Other cleaning

Learning the relationship between features and labels through data Model training Learning the relationship between features and labels through data

Not this kind of relationship

Labels = f(Features) But this kind of relationship Regressors Classifiers, etc.

Model evaluation Evaluate model performance against model-specific performance metrics with hold-out data and iterate on Model type Hyperparameters Features …

Example: Training a model User data Training Set Preprocessing Model Training (Random Forest) Separate into training and validation sets Model Metric Labels Validation Set Preprocessing Model Validation ( FP/FN)

Advantages of data models To have dynamic personalized experience, we need to decide what to show out of a large variety of possible experiences, in an algorithmic way. Data models solve this: Connect user data to user preferences Machine learning is automated and handles the complexity

Limitations of data models Uncertainties May not be suitable when applications require 100% accurate May need to build in safeguards for applications that require high accuracy Vulnerable to inaccurate, missing or insufficient data

Traditional process flow User Requests Logic Pages Send information about the user Dispatcher If… else… logic blocks Static flow Static pages Hide/show DOM elements

Dynamic process flow User Requests Model Service Platform Player Send information about the user Hosts models Processes user requests based on user data received Consume received decision and generate final user experience

Design With Data Science Mindset Not Static Configurable Scalability Maintainability Data science and static do not mix Do not hardcode paths/pages Data science works well with configurable components Use templates Experiences should support large amounts of variability Use templates (again!) A refresh of design should not break underlying logic Build experiences with separation of logic and design

How do we apply Data Science to TurboTax UI?

Dynamic Views { type: template } Truly Dynamic UI Traditional Dynamic UI Dynamic Data Dynamic Data + + { type: template } Dynamic Semantic Templates Static Templates = = Dynamic Site Dynamic Site

Dynamic Flow Statically Defined Routes/States Dynamic Finite State Machine Relationships between pages are pre-determined Entry points into the app are pre-determined All flow and variation in the application is hard coded Relationships among data are pre-determined Entry points are determined dynamically Flow though the application is completely data driven

FUEGO Data science model enabled Semantically defined dynamic experiences Dynamic application flow Device agnostic representation of the UI Device specific applications to render the UI