Megaputer Intelligence 2000. 3. 27 인공지능연구실 석사 2 학기 최윤정

Slides:



Advertisements
Similar presentations
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Advertisements

How Come It Takes Me So Long to Get Answers to Simple Questions About My Business? Technologies for Business Intelligence Introduction to Microsoft Access.
Chapter 1 Business Driven Technology
PolyAnalyst Data and Text Mining tool Your Knowledge Partner TM www
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
Introduction to Data Mining with XLMiner
Mining data with PolyAnalyst © 1999 Megaputer intelligence, Inc. learn to profit from data.
Classifier Decision Tree A decision tree classifies data by predicting the label for each record. The first element of the tree is the root node, representing.
Week 9 Data Mining System (Knowledge Data Discovery)
Clementine Server Clementine Server A data mining software for business solution.
Data Mining By Archana Ketkar.
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
CS2032 DATA WAREHOUSING AND DATA MINING
Building Knowledge-Driven DSS and Mining Data
Knowledge Discovery Centre: CityU-SAS Partnership 1 Speakers: Prof Y V Hui, CityU Dr H P Lo, CityU Dr Sammy Yuen, CityU Dr K W Cheng, SAS Institute Mr.
Data Mining – Intro.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Data Mining: A Closer Look
Chapter 5 Data mining : A Closer Look.
Lecture-8/ T. Nouf Almujally
Working With Databases. Questions to Answer about a Database System What functions the marketing database is expected to perform? What is the initial.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
Computer Software.
Application of SAS®! Enterprise Miner™ in Credit Risk Analytics
Data Mining Techniques
Overview DM for Business Intelligence.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Enabling Organization-Decision Making
1.Knowledge management 2.Online analytical processing 3. 4.Supply chain management 5.Data mining Which of the following is not a major application.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
ROOT: A Data Mining Tool from CERN Arun Tripathi and Ravi Kumar 2008 CAS Ratemaking Seminar on Ratemaking 17 March 2008 Cambridge, Massachusetts.
3 Objects (Views Synonyms Sequences) 4 PL/SQL blocks 5 Procedures Triggers 6 Enhanced SQL programming 7 SQL &.NET applications 8 OEM DB structure 9 DB.
PO320: Reporting with the EPM Solution Keshav Puttaswamy Program Manager Lead Project Business Unit Microsoft Corporation.
11 C H A P T E R Artificial Intelligence and Expert Systems.
© 2010 Pearson Addison-Wesley. All rights reserved. Addison Wesley is an imprint of Designing the User Interface: Strategies for Effective Human-Computer.
Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Decision Support Systems Chapter 10.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
Managing Knowledge in Business Intelligence Systems Dr. Jan Mrazek.
11 3 / 12 CHAPTER Databases MIS105 Lec15 Irfan Ahmed Ilyas.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Ihr Logo Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
1 (21) EZinfo Introduction. 2 (21) EZinfo  A Software that makes data analysis easy  Reveals patterns, trends, groups, outliers and complex relationships.
The CRISP Data Mining Process. August 28, 2004Data Mining2 The Data Mining Process Business understanding Data evaluation Data preparation Modeling Evaluation.
1 Topics about Data Warehouses What is a data warehouse? How does a data warehouse differ from a transaction processing database? What are the characteristics.
1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
1 Categories of data Operational and very short-term decision making data Current, short-term decision making, related to financial transactions, detailed.
10-1 Identify the changes taking place in the form and use of decision support in business Identify the role and reporting alternatives of management information.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Chapter 4 Decision Support System & Artificial Intelligence.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
DATA RESOURCE MANAGEMENT
Data Mining and Decision Support
CS507 Information Systems. Lesson # 11 Online Analytical Processing.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
Business Intelligence Overview. What is Business Intelligence? Business Intelligence is the processes, technologies, and tools that help us change data.
Managing Data Resources File Organization and databases for business information systems.
Systems Analysis and Design in a Changing World, Fifth Edition
Data Mining Functionalities
Decision Support Systems
Data Warehouse.
PolyAnalyst Data and Text Mining tool
Dr. Morgan C. Wang Department of Statistics
Supporting End-User Access
CHAPTER 7: Information Visualization
Improving Decision Making and Managing Knowledge
Presentation transcript:

Megaputer Intelligence 인공지능연구실 석사 2 학기 최윤정

Outline  Overview Overview  Technology Technology  PolyAnalyst solution overview PolyAnalyst solution overview  Customer cases Customer cases  Future developments Future developments

Megaputers… Overview 1989 년 모스크바 주립대학 AI 연구소  Knowledge Discovery  Semantic 정보검색 및 분석에 기반을 둠 년 Polyanalyst1.0 개발 2000

Technology O Subject-Oriented analytical systems O Statistical packages O Neural Networks O Evolutionary Programming O Memory Based Reasoning(MBR) O Decision Tress O Genetic Algorithms

데이타마이닝과 지식탐사를 위한 툴과 semantic Text 분석, information retrieval 을 위한 툴 제공 O PolyAnalyst 4.0 O PolyAnalyst COM O TextAnalyst 1.5 O TextAnalyst Com O MegaSearch tm Product

PolyAnalyst overview

Features in more detail  multi-strategy data mining suite O utilizing the latest achievements in knowledge discovery O with a broad selection of exploration engines O powerful data manipulation and visualization tools O Modeling O Predicting O Clustering O Classifying O Explaining

PolyAnalyst workplace Multiple machine learning algorithms can be accessed through pull-down and pop-up menus, or control buttons The project data, charts, discovered rules, and system reports are represented by icons held in separate containers

Learning algorithms Find Dependencies PolyNet Predictor Cluster Find Laws Classify Discriminate Linear Regression Identifies a set of the most influential predictors and determines outliers Predicts values of the target variable - a hybrid of GMDH and Neural Net algorithms Separates groups of similar records and finds the best clustering variables Finds an explicit model for the relation predicting the target variable Assigns cases to two different classes by utilizing Fuzzy Logic Determines what characteristics of a specified data set distinguish it from the rest of the data Stepwise linear regression - correctly treats categorical and yes/no variables New algorithm: robustly classifies records into multiple categories Memory Based Reasoning PolyAnalyst COM

Find Dependencies Outliers Most influential variables determined Predicted target value for a cell All considered variables

Cluster Variables providing the best clustering Individual clusters Cluster sequential number Number of points in a cluster

PolyNet Predictor R^2 = 0.93 Linear Regression R^2 = 0.86 Similar to all other PolyAnalyst algorithms the best PN model is found as an optimal solution in terms of The following graphs display the accuracy of PN and LR models developed to predict relative performance of computers from different manufacturers: Predicted vs. Actual target variable

Classify Mass mailing Targeted mailing PolyAnalyst Lift chart illustrates an increase in the response to a campaign based on the discovered model - instead of random mailing % of maximal possible response Mass mailing Targeted mailing Profit ($) PolyAnalyst Gain chart helps optimize the profit obtained in a direct marketing campaign

Linear Regression Yes/no variable taken into account correctly Partial contributions of individual terms in the linear regression formula

Discriminate algorithm  Determines what features of a selected data set distinguish it from the rest of the data  Requires no preset target variable  Can be powered by Find Laws PolyNet Predictor Linear Regression Memory-Based Reasoning  Performs classification to multiple categories  Is based on identifying similar cases in the previous history  Implemented only in PolyAnalyst COM (available in the end of March 1999)

Data Access  PolyAnalyst works with ODBC-compliant databases: Oracle, DB2, Informix, Sybase, MS SQL Server, etc.  A customized version works with IBM Visual Warehouse Solution and Oracle Express  Data and exploration results can be exchanged with MS Excel  CSV or DBF format files  New data can be added to the project when necessary

Visualization Data can be displayed in various visual formats:  Histograms  Line and point plots with zoom and drill-through capabilities  Colored charts for three dimensions  Interactive rule-graphs with sliders help visualizing and manipulating multi-variable relations  Frequencies charts provide for a quick and thorough visualization of the distribution of categorical, integer, or yes/no variables  Lift and Gain charts are very useful in marketing applications

Histograms and Frequencies Histogram displays distribution of numerical variables Frequencies chart displays distribution of categorical and yes/no variables

2D charts and Rule-graphs Sliders help visualize effects of other variables in more than two-dimensional models The Find Laws model (red line) for a product market share dependence on the price predicts a dramatic change in the formula when the product goes on promotion

PolyAnalyst platforms  Standalone system: PolyAnalyst Power - Windows 95/98/NT PolyAnalyst Pro - Windows NT PolyAnalyst Lite - Windows 95/98/NT PolyAnalyst IBM OS/2  Client/Server system: PolyAnalyst Knowledge Server - Windows NT or OS/2 Client - Windows NT, 95, 98, or OS/2

Sample customer cases

PolyAnalyst supports medical projects at 3M Timothy Nagle Consulting Scientist 3M Corporation St. Paul, MN, USA “ Analytical engines do an excellent job of finding relations amongst many fields without overfitting. I found the user interface both intuitive and easy to use. Megaputer support is outstanding. The inevitable problems one expects with a complex system are dealt with immediately. ”

PolyAnalyst helps improving flight control system at Boeing James Farkas Senior Navigation Engineer The Boeing Company Kent, WA, USA “ PolyAnalyst provides quick and easy access for inexperienced users to powerful modeling tools. The user interface is intuitive and new users come up to speed very quickly. Interfaces to spreadsheet tools provide flexibility needed to work solutions as a team. ”

PolyAnalyst facilitates marketing research at Indiana University Raymond Burke E.W. Kelley Professor of BA Kelley Business School Indiana University Bloomington, IN, USA “ PolyAnalyst provides a unique and powerful set of tools for data mining applications, including promotion response analysis, customer segmentation and profiling, and cross-selling analysis. Unlike neural network programs, PolyAnalyst displays a symbolic representation of the relationship between the independent and dependent variables - a critical advantage for business applications. ”

PolyAnalyst helps medical research at the University of Wisconsin-Madison Prof. Roger L. Brown Director of RDSU University of Wisconsin Madison, WI, USA “ PolyAnalyst suite enabled our researchers to search their data for rules and structure while providing a symbolic knowledge of the structure, the detail they needed. The software has provided very interesting results for one of our projects, which had been presented at a major cardiology meeting. ”

PolyAnalyst enjoys international success “ PolyAnalyst scores extremely well by providing a complete environment in which almost any research worker could data mine his or her own data. It is a very useful product, potentially with a wide user base, and it appears to me to be unique. ” “ PolyAnalyst proves capable of providing models for building reliable trading strategies even for a difficult to predict FOREX market. PolyAnalyst is a leader in reliability, accuracy, and diversity of automatically built models. ” Alexander Fomenko Director Analytical Dept Killiney Investments Europe Rep. Moscow, Russia David McIlroy Analytical Department Master Foods Olen, Belgium

Product Price $$ Custom-build own PolyAnalyst system!

Product Price $$(continue) Custom-build own PolyAnalyst system! - COM 모듈은 어플리케이션을 작성하는데 적당 - 각각의 필요한 알고리즘에 해당하는 Tool Kit 을 구입할 수 있음

Future developments  New machine learning algorithms: Memory Based Reasoning Weighted variable Clustering and Classification  PolyAnalyst COM built on the basis of Component Object Model - an integrated kit for simple development of decision support applications utilizing advanced PolyAnalyst algorithms (see PCAI Magazine, March 99, p )  Enhanced graphics (Snake and Boxplot charts) and data import and manipulation

PolyAnalyst evaluation