Application of Apriori Algorithm to Derive Association Rules Over Finance Data Set Presented By Kallepalli Vijay Instructor: Dr. Ruppa Thulasiram.

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

An Introduction to Data Mining
Unit 7: Store and Retrieve it Database Management Systems (DBMS)
Overview of Data Mining & The Knowledge Discovery Process Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Data Mining Techniques Cluster Analysis Induction Neural Networks OLAP Data Visualization.
Database – Part 3 Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Mr. Sakthi Angappamudali.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Week 9 Data Mining System (Knowledge Data Discovery)
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
Data Mining Knowledge Discovery in Databases Data 31.
Database – Part 2b Dr. V.T. Raja Oregon State University External References/Sources: Data Warehousing – Sakthi Angappamudali at Standard Insurance; BI.
Artificial intelligence 4 Expert systems 4 Neural nets 4 Data base mining.
Data Mining By Archana Ketkar.
Data Mining Adrian Tuhtan CS157A Section1.
Database – Part 2 Dr. V.T. Raja Oregon State University.
Pattern Recognition Lecture 20: Data Mining 3 Dr. Richard Spillman Pacific Lutheran University.
TURKISH STATISTICAL INSTITUTE INFORMATION TECHNOLOGIES DEPARTMENT (Muscat, Oman) DATA MINING.
CSCI 347 / CS 4206: Data Mining Module 01: Introduction Topic 03: Stages in Data Mining.
Data Mining Dr. Chang Liu. What is Data Mining Data mining has been known by many different terms Data mining has been known by many different terms Knowledge.
Repository Method to suit different investment strategies Alma Lilia Garcia & Edward Tsang.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
Ch5 Mining Frequent Patterns, Associations, and Correlations
Topics Covered: Data Processing Data Processing Information Examples of data and information Examples of data and information Difference between data.
Chapter 1 Introduction to Data Mining
INTRODUCTION TO DATA MINING MIS2502 Data Analytics.
Automatically Extracting Data Records from Web Pages Presenter: Dheerendranath Mundluru
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Unit 2: The Stock Market Lesson 7: Economic Indicators Day 2.
Data Mining: Software Helping Business Run
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
Expert Systems with Applications 34 (2008) 459–468 Multi-level fuzzy mining with multiple minimum supports Yeong-Chyi Lee, Tzung-Pei Hong, Tien-Chin Wang.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Pattern Discovery of Fuzzy Time Series for Financial Prediction -IEEE Transaction of Knowledge and Data Engineering Presented by Hong Yancheng For COMP630P,
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
CISB113 Fundamentals of Information Systems Data Management.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Mining Frequent Patterns. What Is Frequent Pattern Analysis? Frequent pattern: a pattern (a set of items, subsequences, substructures, etc.) that occurs.
Extracting value from grey literature Processes and technologies for aggregating and analysing the hidden Big Data treasure of the organisations.
Academic Year 2014 Spring Academic Year 2014 Spring.
Data Mining Copyright KEYSOFT Solutions.
Waqas Haider Bangyal. 2 Source Materials “ Data Mining: Concepts and Techniques” by Jiawei Han & Micheline Kamber, Second Edition, Morgan Kaufmann, 2006.
Miloš Kotlar 2012/115 Single Layer Perceptron Linear Classifier.
Fundamentals of Information Systems, Sixth Edition Chapter 1 Part A An Introduction to Information Systems in Organizations.
13.4 Information and Data. Characteristics and Classifications of Information There are many ways in which information can be classified, this can be.
Smart Web Search Agents Data Search Engines >> Information Search Agents - Traditional searching on the Web is done using one of the following three: -
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Introduction.  Instructor: Cengiz Örencik   Course materials:  myweb.sabanciuniv.edu/cengizo/courses.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Data Mining - Introduction Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
Survey on Different Data Mining Techniques for E- Crimes
Data Mining Functionalities
Data Mining.
Introduction BIM Data Mining.
Mining Dependent Patterns
By Arijit Chatterjee Dr
Applying Deep Neural Network to Enhance EMPI Searching
DATA MINING © Prentice Hall.
Data Mining Jim King.
MIS 451 Building Business Intelligence Systems
I don’t need a title slide for a lecture
Association Rule Mining
CSE591: Data Mining by H. Liu
Presentation transcript:

Application of Apriori Algorithm to Derive Association Rules Over Finance Data Set Presented By Kallepalli Vijay Instructor: Dr. Ruppa Thulasiram

Agenda Introduction Motivation Background Solution Results Conclusion and Future work

Introduction Stock exchanges maintain a log of events – rise in stock price, option value, number of stocks sold Investors predict the market trends based on available log information Stock market is highly unpredictable The values in the log change very drastically with time

Introduction (Con…) If investors are given adequate information – regarding stock market trends Investors can invest money accordingly – maximum profits.

Motivation Data in the log is very huge Data contains hidden details Manually identifying the hidden details – cumbersome process Apply Apriori to retrieve hidden details Prior deriving large item data has to be classified

Background Data mining: A process of extracting unknown patterns, facts and relations from large database Data mining means knowledge discovery from large databases Association rules in data mining involves in detecting which items tend to occur together in transactions

Background (Con…) Association rules in data mining was first proposed by Agrawal, Imielinski and Swami in 1993 Ex: Customer who purchase one item are likely to purchase another item. Consider A transaction is a set of items – T = {i 1, i 2,……i t } – T  I, where I is the set of all possible items {i 1, i 2,……i n } – P  Q, Where P  I, Q  I, and P  Q 

Solution Finance data is quantitative Classify data into regular intervals Map the classified data to an index value Index values range from 0 through 143 – 0 through 35 represent stocks opening value – 36 through 71 represent stocks day high value – 72 through 107 represent stocks day low value – 108 through 143 represent stocks closing value

Solution (Con…) Finance data of the Apple Computers Inc is read from a text file Eg1: Stock opening value ranging between 10.0 and 10.5 is mapped to an index value 0 Eg2: Stock high value ranging between 10.0 and 10.5 is mapped to an index value 36 Apply Apriori algorithm on the mapped indices to derive association rules

Solution (Con…)

Results

Results (Con…)

Conclusions & Future work Manually identifying hidden details is a tedious process Classified the collected data into regular intervals Applied apriori algorithm to derive large item sets Derived large item sets and projected to the user in user readable form

Conclusions & Future work (Con…) Classification of data plays important role Correctness of association rules depends on the classification of the data Selecting the length of the interval for classification is difficult Fuzzy logic can applied on the data for classification

Thanks!