Presentation is loading. Please wait.

Presentation is loading. Please wait.

© 2002 page 1 Data Mining Tools For ZLE Copying and Use Restrictions: Material under this presentation is the Intellectual Property of HP Corporation and.

Similar presentations


Presentation on theme: "© 2002 page 1 Data Mining Tools For ZLE Copying and Use Restrictions: Material under this presentation is the Intellectual Property of HP Corporation and."— Presentation transcript:

1 © 2002 page 1 Data Mining Tools For ZLE Copying and Use Restrictions: Material under this presentation is the Intellectual Property of HP Corporation and Genus Software. Any use of the this material, in part or whole, except in context of Genus Data Mining Integrator and Data Mart Builder, without written permission from HP and Genus is prohibited.

2 © 2002 page 2 agenda data mining in ZLE solutions ZLE data mining toolkit toolkit demonstration agenda

3 © 2002 page 3 title text Meta Group process of identifying and/or extracting previously unknown, non-trivial, unanticipated, important information from large sets of data Gartner Group process of discovering meaningful new correlations, patterns and trends by sifting through large amounts of data stored in repositories, using pattern recognition technologies, statistical and mathematical techniques

4 © 2002 page 4 title text role –determine most effective responses to business events ZLE facilitates mining by providing –a rich, integrated, current data source –an integrated operational environment into which models can be deployed data mining helps to realize the full business value of a ZLE system

5 © 2002 page 5 derive attributes identify and define business opportunity create case set deploy model profile data transform data assess performance train models typically about 75% of process ZLE data mining process understand the opportunity –identify and define business opportunity prepare data –profile and understand data –derive attributes –transform data –create case set build models –train models –assess model performance use models –deploy model –monitor model performance monitor model performance

6 © 2002 page 6 agenda data mining in ZLE solutions ZLE data mining toolkit toolkit demonstration agenda

7 © 2002 page 7 the ZLE data mining toolkit goal: –provide tools that facilitate ZLE data mining –reduce process cycle times dramatically three tools being developed by Genus Software: –data preparation –data transfer –model deployment partners: Genus, MicroStrategy, SAS product names: –Genus Mining Integrator for NonStop SQL (all three tools) –Genus Mart Builder for NonStop SQL (first two tools only)

8 © 2002 page 8 part of Genus toolkit ZLE data mining analytical cycle Data Store (NonStop SQL) Data Preparation (profiling/transforming data) Model Deployment (written to DB tables) Data Transfer (fast parallel streams) Mining Mart (Tru64/Windows) Scoring Engine Rules Engine Agg. Engine Interaction Manager Real-Time Scoring (using the Recommender) part of ZDK 3 Modeling (SAS Enterprise Miner) available from SAS

9 © 2002 page 9 agenda data mining in ZLE solutions ZLE data mining toolkit toolkit demonstration agenda

10 © 2002 page 10 toolkit demonstration credit card fraud detection example opportunity: use ZLE data store data to predict, in real-time, which credit card purchases are likely to be fraudulent use tools to: –build a case set table with one row describing each purchase –transfer table to SAS server for modeling –deploy predictive model to ZLE data store –execute model in real-time to make fraud predictions steps described, including many tool screen shots

11 © 2002 page 11 based on the MicroStrategy (MSI) Business Intelligence toolset, leverages GUI, logical data model support, SQL generation, etc. uses NonStop SQL/MX DBMS, leverages sampling, TRANSPOSE, statistical functions, … custom tool developed by Genus using MSI SDK for NonStop SQL operations and functionality not supported by MSI tools toolkit data preparation solution

12 © 2002 page 12 two main ZLE data preparation tasks 1.profile tables –column names and types –partitioning information, attributes, key structure, … –column values 2.transform source tables –derive new attributes –aggregate to appropriate level –clean data –pivot –combine to form case set

13 © 2002 page 13 the MicroStrategy desktop

14 © 2002 page 14 MSI profile report: fraud vs. billing state

15 © 2002 page 15 NonStop SQL/MX sampling source table sampling –insert into CustSamp select * from Cust sample random 1 percent clusters of 10 blocks union select * from Cust where CardNo in (select CardNo from FrdFlg) enables interactive and exploratory data prep cleanly integrated into SQL performed efficiently in DP2 easily accessible through Genus tool

16 © 2002 page 16 creating a materialized sample table using the Genus Data Mart Builder

17 © 2002 page 17 identifying source and sample method

18 © 2002 page 18 specifying materialized sample table

19 © 2002 page 19 transforming source data Billions of Purchase s Millions of Accounts Purchase StoreAccount Purchase History Item Summary Fraud Aggregate and Pivot

20 © 2002 page 20 result: a case set for modeling Hundreds of Attributes One Row Per Purchase Mix of Fraud and No-Fraud Purchases

21 © 2002 page 21 MSI Datamart report summarizing items

22 © 2002 page 22 data transfer tool Data StoreMining Mart NonStop SQL/MX ASCII filesSAS data set data transfer tool task: transfer case set from data store to mining mart coordinator –design HTML HTTP JDBC Web browser client Web server Web App. receive SAS import transferreceive SAS import transferreceive SAS import transfer receive SAS import transfer

23 © 2002 page 23 data transfer specification screen

24 © 2002 page 24 transfer monitoring

25 © 2002 page 25 modeling in SAS enterprise miner

26 © 2002 page 26 body copy model export score converter node generates Java model code reporter node exports code and HTML report to project directory

27 © 2002 page 27 NonStop SQL/MX Data Store SAS Open Metadata server File/SAS server SAS Enterpris e Miner Mining Mart model deployment tool task –copy model information to a ZLE Data Store Model export/registration –design HTML HTTP JDBC access Web browser client File/registry access Web Server Web App.

28 © 2002 page 28 starting the model deployment tool

29 © 2002 page 29 connecting to a Data Store

30 © 2002 page 30 a list of models in the Data Store

31 © 2002 page 31 viewing a deployed model

32 © 2002 page 32 selecting a SAS report directory

33 © 2002 page 33 viewing available reports

34 © 2002 page 34 viewing an Enterprise Miner report

35 © 2002 page 35 deploying a model

36 © 2002 page 36 deployment confirmation

37 © 2002 page 37 real-time scoring using the Recommender Scoring Engine Aggregation Engine Rules Engine Model Aggregates Model Scores Deployed Models Business Rules Aggregate Definitions Offers / Advice Customer Data Interaction Manager

38 © 2002 page 38 how to get the data mining tools Product Names –Genus Mining Integrator for NonStop SQL (Data Preparation, Data Transfer, and Model Deployment tools) –Genus Mart Builder for NonStop SQL (first two tools only) Can be ordered through HP, support provided by Genus Availability: calendar Q4 2002 For more information, contact –kevin.christian@hp.com (Product Manager)kevin.christian@hp.com –susan.mcconkey@hp.com (Program Manager)susan.mcconkey@hp.com –angad@genussoft.com (Development)angad@genussoft.com


Download ppt "© 2002 page 1 Data Mining Tools For ZLE Copying and Use Restrictions: Material under this presentation is the Intellectual Property of HP Corporation and."

Similar presentations


Ads by Google