Core Methods in Educational Data Mining HUDK4050 Fall 2014.

Slides:



Advertisements
Similar presentations
Frequent Itemset Mining Methods. The Apriori algorithm Finding frequent itemsets using candidate generation Seminal algorithm proposed by R. Agrawal and.
Advertisements

Relationship Mining Association Rule Mining Week 5 Video 3.
Mining Biological Data
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
LOGO Association Rule Lecturer: Dr. Bo Yuan
IT 433 Data Warehousing and Data Mining Association Rules Assist.Prof.Songül Albayrak Yıldız Technical University Computer Engineering Department
10 -1 Lecture 10 Association Rules Mining Topics –Basics –Mining Frequent Patterns –Mining Frequent Sequential Patterns –Applications.
ICDM'06 Panel 1 Apriori Algorithm Rakesh Agrawal Ramakrishnan Srikant (description by C. Faloutsos)
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 March 16, 2012.
Data Mining Techniques Cluster Analysis Induction Neural Networks OLAP Data Visualization.
Rakesh Agrawal Ramakrishnan Srikant
CS 590M Fall 2001: Security Issues in Data Mining Lecture 5: Association Rules, Sequential Associations.
DATA MINING -ASSOCIATION RULES-
Summarization of Frequent Pattern Mining. What is FPM? Why being frequent is so important? Application of FPM Decision make/Business Software Debugging.
Association Rule Mining (Some material adapted from: Mining Sequential Patterns by Karuna Pande Joshi)‏
2/8/00CSE 711 data mining: Apriori Algorithm by S. Cha 1 CSE 711 Seminar on Data Mining: Apriori Algorithm By Sung-Hyuk Cha.
© Vipin Kumar CSci 8980 Fall CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance Computing Research Center Department of Computer.
Educational Data Mining and DataShop John Stamper Carnegie Mellon University 1 9/12/2012 PSLC Corporate Partner Meeting 2012.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
『 Data Mining 』 By Jung, hae-sun. 1.Introduction 2.Definition 3.Data Mining Applications 4.Data Mining Tasks 5. Overview of the System 6. Data Mining.
Apriori algorithm Seminar of Popular Algorithms in Data Mining and Machine Learning, TKK Presentation Lauri Lahti.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
1 Apriori Algorithm Review for Finals. SE 157B, Spring Semester 2007 Professor Lee By Gaurang Negandhi.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Course on Data Mining: Seminar Meetings Page 1/17 Course on Data Mining ( ): Seminar Meetings Ass. Rules EpisodesEpisodes Text Mining
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Special Topics in Educational Data Mining HUDK5199 Spring 2013 March 25, 2012.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Outline Knowledge discovery in databases. Data warehousing. Data mining. Different types of data mining. The Apriori algorithm for generating association.
Fast Algorithms for Mining Association Rules Rakesh Agrawal and Ramakrishnan Srikant VLDB '94 presented by kurt partridge cse 590db oct 4, 1999.
Association Rule Mining
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 March 16, 2012.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
『 Personalization of Supermarket Product Recommendations 』 김용수.
Special Topics in Educational Data Mining HUDK5199 Spring, 2013 April 17, 2012.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Special Topics in Educational Data Mining HUDK5199 Spring, 2013 April 24, 2012.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
By Shivaraman Janakiraman, Magesh Khanna Vadivelu.
Core Methods in Educational Data Mining HUDK4050 Fall 2014.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 April 15, 2013.
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Mining Path Traversal Patterns with User Interaction for Query Recommendation 龚赛赛
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Core Methods in Educational Data Mining
Presentation transcript:

Core Methods in Educational Data Mining HUDK4050 Fall 2014

Factor Analysis.vs. Clustering What’s the difference?

Factor Analysis: Any Questions?

What… Are the general advantages of structure discovery algorithms (clustering, factor analysis) Compared to supervised/prediction modeling methods?

What… Are the general advantages of structure discovery algorithms (clustering, factor analysis) Compared to supervised/prediction modeling methods? What are the disadvantages?

Association Rule Mining

Today’s Class The Land of Inconsistent Terminology

Association Rule Mining Try to automatically find simple if-then rules within the data set Another method that can be applied when you don’t know what structure there is in your data Unlike clustering, association rules are often obviously actionable

Association Rule Metrics Support Confidence What do they mean? Why are they useful?

Association Rule Metrics Interestingness What are some interestingness metrics? Why are they needed?

Why is interestingness needed? Possible to generate large numbers of trivial associations – Students who took a course took its prerequisites (Vialardi et al., 2009) – Students who do poorly on the exams fail the course (El-Halees, 2009)

Association Rule Metrics What do Merceron & Yacef argue?

Association Rule Metrics What do Luna-Bazaldua and colleagues argue?

Association Rule Metrics What are the merits and drawbacks to each of these approaches?

Arbitrary Cut-offs The association rule mining community differs from most other methodological communities by treatingn cut-offs for support and confidence as arbitrary Researchers typically adjust them to find a desirable number of rules to investigate, ordering from best-to- worst… Rather than arbitrarily saying that all rules over a certain cut-off are “good” What are the strengths and weaknesses of this approach?

Any questions on apriori algorithm?

Let’s do an example Volunteer please?

Someone pick Support Confidence

Generate Frequent Itemset ABCFABDGABEFBEGH BDIJBCDJDEFJABCD DEGJDEGJABCEABCF BCDJBCDEDEFKDEGH

Was the choice of support level appropriate? ABCFABDGABEFBEGH BDIJBCDJDEFJABCD DEGJDEGJABCEABCF BCDJBCDEDEFKDEGH

Re-try with lower support ABCFABDGABEFBEGH BDIJBCDJDEFJABCD DEGJDEGJABCEABCF BCDJBCDEDEFKDEGH

Generate Rules From Frequent Itemset ABCFABDGABEFBEGH BDIJBCDJDEFJABCD DEGJDEGJABCEABCF BCDJBCDEDEFKDEGH

Questions? Comments?

Rules in Education What might be some reasonable applications for Association Rule Mining in education?

Assignment B8 Sequential Pattern Mining Due Monday

Next Class Monday, December 8: Sequential Pattern Mining Readings Baker, R.S. (2014) Big Data and Education. Ch. 5, V4. Srikant, R., Agrawal, R. (1996) Mining Sequential Patterns: Generalizations and Performance Improvements. Research Report: IBM Research Division. San Jose, CA: IBM. [pdf][pdf] Perera, D., Kay, J., Koprinska, I., Yacef, K., Zaiane, O. (2009) Clustering and Sequential Pattern Mining of Online Collaborative Learning Data. IEEE Transactions on Knowledge and Data Engineering, 21, [pdf][pdf] Shanabrook, D.H., Cooper, D.G., Woolf, B.P., Arroyo, I. (2010)Identifying High-Level Student Behavior Using Sequence- based Motif Discovery. Proceedings of the 3rd International Conference on Educational Data Mining,

The End