CS 548 FALL 2016 Association Rules Showcase by Deepan Sanghavi, Dhaval Dholakia, Peter Wang & Karan Somaiah Napanda Showcasing work by Gang Li ,Rob Law.

Slides:



Advertisements
Similar presentations
Data Mining Techniques Association Rule
Advertisements

Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
Association Rules Spring Data Mining: What is it?  Two definitions:  The first one, classic and well-known, says that data mining is the nontrivial.
LOGO Association Rule Lecturer: Dr. Bo Yuan
CS548 Spring 2015 Association Rule Mining Showcase Showcasing work by Ting, Pan, and Chou on "Finding Ideal Menu Items Assortments: An Empirical Application.
Sampling Large Databases for Association Rules ( Toivenon’s Approach, 1996) Farzaneh Mirzazadeh Fall 2007.
Efficient Mining of Both Positive and Negative Association Rules Xindong Wu (*), Chengqi Zhang (+), and Shichao Zhang (+) (*) University of Vermont, USA.
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
Reduced Support Vector Machine
Data Mining By Archana Ketkar.
Fast Algorithms for Association Rule Mining
Data Mining: A Closer Look Chapter Data Mining Strategies 2.
Data Mining BS/MS Project Clustering for Market Segmentation Presentation by Mike Calder.
Mining Association Rules between Sets of Items in Large Databases presented by Zhuang Wang.
Basic Data Mining Techniques
ICMLC2007, Aug. 19~22, 2007, Hong Kong 1 Incremental Maintenance of Ontology- Exploiting Association Rules Ming-Cheng Tseng 1, Wen-Yang Lin 2 and Rong.
VLDB 2012 Mining Frequent Itemsets over Uncertain Databases Yongxin Tong 1, Lei Chen 1, Yurong Cheng 2, Philip S. Yu 3 1 The Hong Kong University of Science.
Association Rules. CS583, Bing Liu, UIC 2 Association rule mining Proposed by Agrawal et al in Initially used for Market Basket Analysis to find.
Association Rule By Kenneth Leung. Data Mining The process of extracting valid, previously unknown, comprehensible, and actionable information from large.
Treatment Learning: Implementation and Application Ying Hu Electrical & Computer Engineering University of British Columbia.
60-520:Seminar Presented By Robyat Hossain University of Windsor February 10, 2006 Market Basket Analysis: An Approach to Association Rule Mining in Multiple.
Health and wellness tourism: trends and strategies Paolo Grigolli.
EXAM REVIEW MIS2502 Data Analytics. Exam What Tool to Use? Evaluating Decision Trees Association Rules Clustering.
CSE4334/5334 DATA MINING CSE4334/5334 Data Mining, Fall 2014 Department of Computer Science and Engineering, University of Texas at Arlington Chengkai.
Association Rules presented by Zbigniew W. Ras *,#) *) University of North Carolina – Charlotte #) ICS, Polish Academy of Sciences.
Discriminative Frequent Pattern Analysis for Effective Classification By Hong Cheng, Xifeng Yan, Jiawei Han, Chih- Wei Hsu Presented by Mary Biddle.
Association Rules Carissa Wang February 23, 2010.
Chap 6: Association Rules. Rule Rules!  Motivation ~ recent progress in data mining + warehousing have made it possible to collect HUGE amount of data.
Data Mining Association Rules Mining Frequent Itemset Mining Support and Confidence Apriori Approach.
Subgraph Search Over Uncertain Graphs Erşan Demircioğlu.
Differential Analysis on Deep Web Data Sources Tantan Liu, Fan Wang, Jiedan Zhu, Gagan Agrawal December.
Mining Association Rules in Large Database This work is created by Dr. Anamika Bhargava, Ms. Pooja Kaul, Ms. Priti Bali and Ms. Rajnipriya Dhawan and licensed.
Chapter 1 marketing is all around us Section 1.1
Data Mining Functionalities
Data Mining.
By Arijit Chatterjee Dr
Data Based Decision Making
An Empirical Examination of Transaction- and Firm-Level Influences on the Vertical Boundaries of the Firm Leiblein, Michael.
2.01-Understand the impact of the hospitality and tourism destinations on an economy. Ms. Osteen.
Data Mining Jim King.
A Research Oriented Study Report By :- Akash Saxena
Association Rules Repoussis Panagiotis.
Byung Joon Park, Sung Hee Kim
Association Rules.
2.01-Understand the impact of the hospitality and tourism destinations on an economy. Ms. Osteen.
Association Rules Zbigniew W. Ras*,#) presented by
I. Association Market Basket Analysis.
Market Basket Analysis and Association Rules
Market Basket Many-to-many relationship between different objects
Association Rule Mining
Data Mining II: Association Rule mining & Classification
Sangeeta Devadiga CS 157B, Spring 2007
Mining Frequent Itemsets over Uncertain Databases
CS 548 Sequence Mining Showcase By Bian Du, Wa Gao, and Cam Jones
Farzaneh Mirzazadeh Fall 2007
Analysis of Customer Behavior and Service Modeling
Using Data Mining To Improve Company Strategies
Approximate Frequency Counts over Data Streams
Frequent patterns and Association Rules
(Understanding customer behavior Using data mining skills)
جستجو در وب عميق ارائه‌دهنده: حسين شريفي‌پناه
China Outbound Tourism Market China Outbound Tourism Market.
Market Basket Analysis and Association Rules
I. Association Market Basket Analysis.
Administrative Data and their Use in Economic Statistics
Lecture 11 (Market Basket Analysis)
Presented by : SaiVenkatanikhil Nimmagadda
Conducting Marketing Research
15-826: Multimedia Databases and Data Mining
Survey on Coverage Problems in Wireless Sensor Networks - 2
Presentation transcript:

CS 548 FALL 2016 Association Rules Showcase by Deepan Sanghavi, Dhaval Dholakia, Peter Wang & Karan Somaiah Napanda Showcasing work by Gang Li ,Rob Law ,Jia Rong & Huy Quan Vu on “Incorporating Both Positive and Negative Association Rules into the Analysis of Outbound Tourism in Hong Kong ”, Journal of Travel & Tourism Marketing, 27:812–828, 2010

References Rob Law Catherine Cheung Ada Lo, (2004),"The relevance of profiling travel activities for improving destination marketing strategies", International Journal of Contemporary Hospitality Management, Vol. 16 Iss 6 pp. 355 – 362. Ceglar, A., & Roddick, J. F. (2006). Association mining, ACM Computing Survey, 38(2), article no. 5. GÜL GÖKAY EMEL , ÇAĞATAN TAŞKIN & ÖMER AKAT (2007) Profiling a Domestic Tourism Market by Means of Association Rule Mining, Anatolia, 18:2, 334-342. King, B., & Tang, C. H. (2009). China’s outbound tourism during the 1980s—A socio-political perspective. Anatolia: An International Journal of Tourism and Hospitality Research, 20(1), 18–32. Min, H., Min, H., & Eman, A. (2002). A data mining approach to developing the profiles of hotel customers. International Journal of Contemporary Hospitality Management, 14(6), 274–285. Savasere, A., Omiecinski, E., & Navathe, S. (1998). Mining for strong negative associations in a large database of customer transactions. In Proceedings of the Fourteenth International Conference on Data Engineering (pp. 494–502). Washington, DC: IEEE Computer Society. Wu, X. D., Zhang, C., & Zhang, S. (2004). Efficient mining of both positive and negative association rules. ACM Transactions on Information Systems, 22(3), 381–405. Song, H., Romilly, P., & Liu, X. (2000). An empirical study of outbound tourism demand in the U.K. Applied Economics, 32, 611–624

Motivation Increased spending power for the people. Travel and tourism sector is heavily influenced by personal and national income Travel is primarily viewed as a luxury product. Analysis of traveler’s behavior is important.

Association Rules Finds correlation among items to form rules Rules: “A ⇒ B” Data Sources: purchase repositories or transaction databases Every rule has a support and confidence MBA Earlier Study: Hotel guest and their behavior Travel package customer likely to buy

Disadvantages of Association Rules & Improvement Sometimes do not cover all promising situations Negative associations : A ⇒ ¬B, ¬A ⇒ B or ¬A ⇒ ¬B Determine which travel packages are of no interest to certain groups of people.

Data collection 3 largest tourist surveys (2007, 2008, 2009) 2007 & 2008: Training 2009: Testing Random digit dialing method Generate a sample list of residential tel (>=16 years old) Attributes: Education Level Age Groups House Size Monthly Income Travel Experience

Problem Definition & Algorithm

Problem Definition Negative associations - conflicts between different products Associations Challenges: Combined infrequent item sets are not considered Does not discover negative relationships Computational cost and produce results which are not targeted for application Mining Targeted Positive and Negative Rules Positive item sets- support(item) > threshold Method: Promising item sets identification Rules extraction

Identifying Promising Item-Sets Infrequent item sets are considered only if they are part of target In combination, each item should be frequent Process: Frequent 1-item-sets F(1) = {A(1), A(2)…..} If [sup(A(K)(i) ∪ Tj) ≥ δs] then [A(K)(i) ∪ Tj ] OR [A(K)(i) ∪ ¬ Tj ] Leverage Threshold Different combination of frequent item sets F(k) is considered with Tj and steps 2 and 4 are repeated Leverage Used for pruning the sets PS(X⇒Y)=leverage(X⇒Y)=supp(X⇒Y)−supp(X)supp(Y)=P(X∧Y)−P(X)P(Y) Measures difference of X and Y appearing together in the data set and what would be expected if X and Y where statistically dependent

Extracting Promising Rules Previous process results Promising positive item set F(1), F(2)... Promising negative frequent item set l(1), l(2)… Usually, confidence value is used to measure the strength of rules CPIR (Conditional Probability Increment Ratio) Value is between -1 and 1 Close to 0 signifies item is independent of target Positive higher value signifies positive dependency Negative higher value signifies negative dependency

Example: Frequent item sets w.r.t. target: δs =0.2, δi =0.05, and δc =0.55 Frequent item sets w.r.t. target: F(1)= {A1 M, A1 X, B1 M, B2 M, C1 M ……} Using leverage and support, 2 promising 1-item sets are generated: F(1) = {A1 X, A2 X, B1 X, B2 X ..}, l1 = {D2 ¬X, A1 ¬M, A2 ¬M ..} Based on the above F(1) sets, we generate initial 2-item sets: F(2)= {A1B1X, A1B2X, A1D1 X, A2B1X…} Rules: D1 ⇒ X, with CPIR value of 0.563 (Rule 1) D2 ⇒ ¬X, with CPIR value of 0.583(Rule 4) Interpreting the Rule: D1-High income D2-Low income X-Will have a outbound travel High income travelers have a higher likelihood of outbound travelers (Rule 1 and 4)

Results & Findings

Results & Findings

Targeted Rules on the overall dataset

Targeted Rules for Travelers to Mainland China

Discussion & Implications

Discussion and Implication Major difference Allow negative items to be included in the rules Negative rules Extend traditional association rules Considering infrequent items Provide more number of rules Loose instead of strict output Comparison with decision trees