10/20021 Enterprise and Business Intelligence Systems (e.bis.business.utah.edu) Research Lab, UA -> UU Director Olivia R. Liu Sheng, Ph.D. Emma Eccles.

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

Supporting End-User Access
1.Data categorization 2.Information 3.Knowledge 4.Wisdom 5.Social understanding Which of the following requires a firm to expend resources to organize.
Data Mining Glen Shih CS157B Section 1 Dr. Sin-Min Lee April 4, 2006.
Data Mining Sangeeta Devadiga CS 157B, Spring 2007.
Building an Intelligent Web: Theory and Practice Pawan Lingras Saint Mary’s University Rajendra Akerkar American University of Armenia and SIBER, India.
SESSION 10 MANAGING KNOWLEDGE FOR THE DIGITAL FIRM.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Association Rule Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Data Mining By Archana Ketkar.
Database – Part 2 Dr. V.T. Raja Oregon State University.
Introduction to SAP R/3.
Data Mining – Intro.
1 Data and Knowledge Management. 2 Data Management: A Critical Success Factor The difficulties and the process Data sources and collection Data quality.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Knowledge Portals and Knowledge Management Tools
Data Mining.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
FALL 2012 DSCI5240 Graduate Presentation By Xxxxxxx.
Module 3: Business Information Systems Chapter 11: Knowledge Management.
Business Intelligence, Data Mining and Data Analytics/Predictive Analytics By: Asela Thomason IS 495 Summer 2015.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Chun-Hung Chou
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
Chapter 1 Introduction to Data Mining
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Lecture 9: Knowledge Discovery Systems Md. Mahbubul Alam, PhD Associate Professor Dept. of AEIS Sher-e-Bangla Agricultural University.
1 1 Slide Introduction to Data Mining and Business Intelligence.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
DATA MINING 1. 2 Data Mining Extracting or “mining” knowledge from large amounts of data Data mining is the process of autonomously retrieving useful.
Marketing Research Marketing Information Systems.
Data Warehousing Data Mining Privacy. Reading Bhavani Thuraisingham, Murat Kantarcioglu, and Srinivasan Iyer Extended RBAC-design and implementation.
Data Mining By Dave Maung.
Principles of Data Mining. Introduction: Topics 1. Introduction to Data Mining 2. Nature of Data Sets 3. Types of Structure Models and Patterns 4. Data.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
1 Improving quality of graduate students by data mining Asst. Prof. Kitsana Waiyamai, Ph.D. Dept. of Computer Engineering Faculty of Engineering, Kasetsart.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Data Warehousing Lecture-30 What can Data Mining do? Virtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research.
Chapter 1 An Introduction to Information Systems
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
MIS2502: Data Analytics Advanced Analytics - Introduction.
Academic Year 2014 Spring Academic Year 2014 Spring.
Foundations of Information Systems in Business
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
1 2. Knowledge Management. 2  Structuring of knowledge enables effective and efficient problem solving dynamic learning strategic planning decision making.
Data Mining - Introduction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Data Mining.
Data Mining – Intro.
MIS2502: Data Analytics Advanced Analytics - Introduction
DATA MINING © Prentice Hall.
Sangeeta Devadiga CS 157B, Spring 2007
MANAGING KNOWLEDGE FOR THE DIGITAL FIRM
Data Warehousing and Data Mining
I don’t need a title slide for a lecture
Supporting End-User Access
Data Warehousing Data Mining Privacy
Presentation transcript:

10/20021 Enterprise and Business Intelligence Systems (e.bis.business.utah.edu) Research Lab, UA -> UU Director Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential Chair of Business School of Accounting and Information Systems David Eccles School of Business University of Utah ,

10/20022 e.bis Research Focus Enterprise Systems E-procurement technology Web content caching and storage mgmt Enterprise application integration Process modeling and re-use System security and risk management Portal design and management Business Intelligence Systems Decision support systems Data/web mining Knowledge management Knowledge refreshing Personalization

10/20023 e.bis Research Output Models Methods Technology Analyses Fueled by Applications!

10/20024 Faculty Olivia R. Liu Sheng, Ph.D.UU Paul Hu, Ph.D.UU Ph.D. students and Post Docs Xiao Fang, 5 th -yr Ph.D. studentUA Lin Lin, 3 rd -yr Ph.D. studentUA Wei Gao, 3 rd -yr Ph.D. studentUA Hua Su, post-docUA Xiaoyun Sun, 1 st -yr Ph.D. studentUA Zhongmin Ma, 1 st -yr Ph.D. studentUU 6 to 10 Master and UG students per yr International and industrial collaborators

Web Mining for Knowledge Management

10/20026 The automated process of discovering relationships and patterns in data Related terms: knowledge discovery in database (KDD), machine learning A step in the knowledge discovery process consisting of particular algorithms (methods) that under some acceptable objective, produces a particular enumeration of patterns (models) over the data. An iterative process within which progress is defined by “discovery”, through either automatic or manual methods The application of statistical and artificial intelligence techniques (algorithms) for discovering patterns and regularities in large volumes of data. What is Data Mining?

10/20027 Why Data Mining Data Visualization Needs Going beyond business charts (e.g., pie, line, bar charts) Maps, trees, 2-D, and 3-D Type of knowledge (more abstract) and the level of sophistication in required computation, e.g., Which buyers are likely to be late on future payments? Which sellers are likely to be late on future deliveries? If a seller increases product-in-week by x units, how much % of sales increase can be expected. Which buyers are similar in their buying powers and product and contract preferences? Frequency in discovering and applying the knowledge is met with bottlenecks in human processing Decision support for buyers, sellers and market hosts at each transaction decision point

10/20028 Taxonomies of Data Mining By Tasks By Data

10/20029 Data Mining Tasks Time-series Analysis Analyzing large set of time-series data to find certain regularities and interesting characteristics. Association/Sequential Patterns The discovery of co-occurrence correlations among a set of items. Clustering Identifying clusters embedded in the data, where a cluster is a collection of data objects that are “similar” to one another. Classification Analyzing a set of training data and constructing a model for each class based on the features in the data. Class Description Providing a concise and succinct summarization of a collection of data.

10/ Market Basket (Association Rule) Analysis market basket A market basket is a collection of items purchased by a customer in an individual customer transaction, which is a well-defined business activity Ex: a customer’s visit a grocery store an online purchase from a virtual store such as ‘Amazon.com’

10/ Market Basket (Association Rule) Analysis Market basket analysis Market basket analysis is a common analysis run against a transaction database to find sets of items, or itemsets, that appear together in many transactions. Each pattern extracted through the analysis consists of an itemset and the number of transactions that contain it.Applications: improve the placement of items in a store the layout of mail-order catalog pages the layout of Web pages others?

10/ Clustering Clustering Clustering distributes data into several groups so that similar objects fall into the same group. For example, we can cluster customers based on their purchase behavior. Applications: customer, web content, document and gene segmentation

10/ Classification Example: Classification classifies data into pre-defined outcome classes

10/ Classification Car Type in {sports} High Low Age <25 High Applications: customer profiling, shopping prediction Diagnostic decision support

10/ By Data Structured alphanumeric data Buyer, supplier, product, order, bank acct Image data Satellite, patient, document, handwriting, facial, etc. Spatial data Map, traffic, geological, CAD, graphics, etc.

10/ By Data, Cont’d Temporal data Time series, population, stock, inventory, sales, etc. Spatial and temporal data – trajectory Text – documents, web pages, etc. Video/audio – surveillance video, voice, music, etc.

10/ Web (Data) Mining Web data – generated or used by the Web Web content - static or dynamic Web structure – hyperlinks Web usage – web access log

10/ Why is Web Mining Important? Rich data gathering and access medium A variety of important applications Information retrieval Ecommerce – CRM, SCM, etc. Knowledge management Interesting challenges Scalability – global, multi-lingual, growth Agility of knowledge

10/ What is “knowledge”? Relationships and patterns in data Organized, analyzed and understandable Truths, beliefs, perspectives, concepts, procedures, judgments, expectations, methodologies, heuristics, restrictions, know-how Applicable to problem solving and decision making DBs, documents, policies and procedures as well as the un-captured, tacit expertise and experience Actionable, at the right place and right time!!!

10/ What is Knowledge Management? Views: Process (KM activities) Goal (Operational efficiency and innovations) Methodology (formalization, control and technology) Delphi Group: “Leveraging collective wisdom to increase responsiveness and innovation.”

10/ What is a KM program? Processes Organizational structure and policies Management theories and methodologies Information assurance Technologies and resources Implementation, training and change management Measurement, maintenance and evolution A multi-disciplinary effort!!! Managerial and cultural Technological and engineering esources, support and technology for –Creation, acquisition, organization, storage, retrieval, visualization and sharing of knowledge

10/ KM Process Identify Collect Organize Represent Store Locate Retrieve Extract Discover Visualize Interpret Share Transfer Adapt Apply Monitor Evaluate Create

10/ Data Mining & KM Data mining  discover knowledge Data mining  support management of KM infrastructure (Personalized) content management Security management Workflow management Scalable performance

10/ Web Mining & KM Web mining  discover knowledge Web mining  support management of web KM portal R&D Intranet Consulting B2B, B2C, e-government, e-financing, e-risk management

Web Mining & Knowledge Refreshing

10/ Data Step 1: Selection Step 2: Cleaning & Preprocessing Step 3: Transformation Step 4: Data Mining Step 5: Interpretation & Evaluation Target Data Preprocessed Data Transformed Data Patterns Discovered Knowledge The KDD Process

10/ Data Step 1: Selection Step 2: Cleaning & Preprocessing Step 3: Transformation Step 4: Data Mining Step 5: Interpretation & Evaluation Target Data Preprocessed Data Transformed Data Patterns Discovered Knowledge Types of Domain Knowledge DBA Knowledge Domain Expert Knowledge Data Mining Expert Knowledge

10/ Fundamental Problems The size of the database is significantly large The number of rules resulting from mining activity is also large The knowledge derived from a database reflects only the current state of the database 

10/ Issues in the KDD Process Data Step 2: Cleaning & Preprocessing Step 3: Transformation Step 4: Data Mining Step 5: Interpretation & Evaluation Target Data Preprocessed Data Transformed Data Patterns Scalability Discovered Knowledge Agility

10/ Knowledge Refreshing The process to efficiently update discovered knowledge as data and domain knowledge change. Goals – Up-to-date knowledge (Agility) – Knowledge Re-use (Scalability)

10/ Data Target Data Preprocessed Data Transformed Data Patterns Discovered Knowledge Type of Changes DBA Knowledge Domain Expert Knowledge Data Mining Expert Knowledge NEW

10/ Knowledge Refreshing Needs assessment Monitoring vs. analytic approaches Monitoring/estimate changes in knowledge to determine if and when to re-mine Incremental data mining (learning) How to leverage knowledge previously discovered from data mining to improve computational efficiency and quality of knowledge