Summary „Data mining” Vietnam national university in Hanoi, College of technology, Feb.2006
Main topics: Definition, principles and functionalities of data mining systems Data mining role in KDD processes Data preprocessing and data cleaning methods Association rules Classification methods Clustering methods
Data preprocessing and data cleaning Discretization methods Data reduction methods Missing values Outlier elimination
Association rules Definition, possible applications Apriori search for frequent patterns and association rules Modifications of apriori algorithms: hash tree, Apriori-Tid, Apriori-Hybrid FP-tree method
Classification methods Instance-based classification techniques Bayesian classifiers Decision tree methods Decision rules methods Classifier evaluation techniques
Clustering methods K-means and K-medoids algorithms Hierarchical clustering Density clustering