Hanan Ayad Supervisor Prof. Mohamed Kamel

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

PARTITIONAL CLUSTERING
Query Specific Fusion for Image Retrieval
Selecting Preservation Strategies for Web Archives Stephan Strodl, Andreas Rauber Department of Software.
Information Retrieval in Practice
IR & Metadata. Metadata Didn’t we already talk about this? We discussed what metadata is and its types –Data about data –Descriptive metadata is external.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
A Similarity Evaluation Technique for Data Mining with Ensemble of Classifiers Seppo Puuronen, Vagan Terziyan International Workshop on Similarity Search.
Presented by Zeehasham Rasheed
Multimedia Data Mining Arvind Balasubramanian Multimedia Lab (ECSS 4.416) The University of Texas at Dallas.
Multimedia Data Mining Arvind Balasubramanian Multimedia Lab The University of Texas at Dallas.
Overview of Search Engines
Ensemble Clustering.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
CHAMELEON : A Hierarchical Clustering Algorithm Using Dynamic Modeling
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
Special topics on text mining [ Part I: text classification ] Hugo Jair Escalante, Aurelio Lopez, Manuel Montes and Luis Villaseñor.
A compression-boosting transform for 2D data Qiaofeng Yang Stefano Lonardi University of California, Riverside.
Web Mining: Phrase-based Document Indexing and Document Clustering Khaled Hammouda, Ph.D. Candidate Mohamed Kamel, Supervisor, PI PAMI Research Group University.
Text Mining: Fast Phrase-based Text Indexing and Matching Khaled Hammouda, Ph.D. Student PAMI Research Group University of Waterloo Waterloo, Ontario,
Character Identification in Feature-Length Films Using Global Face-Name Matching IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 11, NO. 7, NOVEMBER 2009 Yi-Fan.
Cooperative Classifiers Rozita Dara Supervisor: Prof. Kamel Pattern Analysis and Machine Intelligence Lab University of Waterloo.
March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
Data Mining and Decision Support
Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.
CLUSTERING HIGH-DIMENSIONAL DATA Elsayed Hemayed Data Mining Course.
Enhanced hypertext categorization using hyperlinks Soumen Chakrabarti (IBM Almaden) Byron Dom (IBM Almaden) Piotr Indyk (Stanford)
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Feature Generation and Selection in SRL Alexandrin Popescul & Lyle H. Ungar Presented By Stef Schoenmackers.
IR Theory: Web Information Retrieval. Web IRFusion IR Search Engine 2.
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
Information Retrieval in Practice
Combining Bagging and Random Subspaces to Create Better Ensembles
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
What Is Cluster Analysis?
Gedas Adomavicius Jesse Bockstedt
Introduction Multimedia initial focus
Rule Induction for Classification Using
Fast Kernel-Density-Based Classification and Clustering Using P-Trees
Data and Applications Security Developments and Directions
Federated & Meta Search
Associative Query Answering via Query Feature Similarity
A Consensus-Based Clustering Method
Special Topics in Data Mining Applications Focus on: Text Mining
William Norris Professor and Head, Department of Computer Science
RankClus: Integrating Clustering with Ranking for Heterogeneous Information Network Analysis Yizhou Sun, Jiawei Han, Peixiang Zhao, Zhijun Yin, Hong Cheng,
ISI Web of Knowledge Early updates
Image Segmentation Techniques
Gerd Kortemeyer, William F. Punch
William Norris Professor and Head, Department of Computer Science
Data Warehousing and Data Mining
Information Organization: Clustering
Towards a Personal Briefing Assistant
Jiawei Han Department of Computer Science
traveling salesman problem
Clustering Wei Wang.
Topic 5: Cluster Analysis
Topological Signatures For Fast Mobility Analysis
CSE572: Data Mining by H. Liu
CSE591: Data Mining by H. Liu
Presentation transcript:

Hanan Ayad Supervisor Prof. Mohamed Kamel Developing Methods for Combining Multiple Clustering of Patterns Towards the discovery of natural clusters Hanan Ayad Supervisor Prof. Mohamed Kamel

Outline Motivations Research Summary 2003 Publications Application to Learning Objects Diverse Sources of Information Multiple Clustering in LO Process Overview

Motivations Multiple clustering solutions Enhance Quality Different clustering methods Selection of learning parameters (e.g. NNets) Random starts, random ordering of patterns Enhance Quality Compensatory effects in clustering methods Repeated fine decompositions Distributed clustering Feature-Distributed Multiple partial views Random subspaces Alternative feature reductions Data-Distributed Overlapping subsets of patterns

Research Summary Measure co-associations between patterns based on their co-clustering - voting Development of combination rules based on shared co-associations – Shared nearest neighbors (binary votes, weighted votes, sum rule, product rule, rank-based rule) Determine strength-of-association Accumulate local neighborhood densities of patterns Patterns weights are inversely proportional to their local neighbourhood densities (number and weights of relationships)

Research Summary, Cont’d Pruning of associations for efficiency and assessment of behaviour. Effect on mutuality of relationships Improving quality using subsets of patterns relations Study of convergence and stability Induce a graph, representing the patterns weighted relationships, and the patterns own weights. Weighted Shared nearest neighbors Graph (WSnnG) The graph is partitioned resulting in an integrated clustering of the objects Use of the graph partitioning package METIS. Minimize edge-cut subject to weights of vertices being equally distributed among the clusters

2003 Publications H. Ayad and M. Kamel. "Finding Natural Clusters Using Multi-Clusterer Combiner Based on Shared Nearest Neighbors“. Multiple Classifier Systems: Fourth International Workshop, MCS 2003, Guildford, Surrey, United Kingdom, June 11-13. Proceedings. H. Ayad, and M. Kamel. Refined Shared Nearest Neighbors Graph for Combining Multiple Data Clusterings", The 5th International Symposium on Intelligent Data Analysis IDA 2003. Berlin, Germany. Proceedings. LNCS. Springer. August, 2003 H. Ayad, and M. Kamel. Development of New Methods for Combining Cluster Ensembles. On going Journal Paper.

Application to Learning Objects Diverse Sources of Information Meta Data Standardized Indexing, content structure and organization Intelligent Content Mining Natural Language Understanding Image Analysis and Understanding Automatic Speech Recognition Statistical Learning Re-Use/Learning Scenarios Dynamic assembly, object are grouped and regrouped with other objects. Are any “re-usable”, self-contained pieces of educational material, including text, audio clips, images, videos, animations, which can be dynamically assembled and used in technology supported learning. Are organized in Repositories. A learning object repository organizes educational content using a standardized indexing system. Items contained within a repository can include text, audio clips, images, videos, animations, and more complex combinations of digital elements. The Learning Object Metadata standards focus on the minimal set of attributes needed to allow these Learning Objects to be managed, located, and evaluated. The standards will accommodate the ability for locally extending the basic fields and entity types. Intelligent Content Mining generate new Meta Data for the Learning Objects. It is possible for any given Learning Object to have more than one set of Learning Object Metadata Different re-use or learning scenarios are rich and diversified sources of information, can be aggregated and used for search and categorization

Application to Learning Objects Multiple Clustering in LO Clusters of Learning Objects Multiple distributed taxonomies Info. Sources: different sets of meta data, different re-use/assembling scenarios Dynamic environment, clusters based on partial views Combining of multiple clustering Mining complex web of relationships integrating multiple objects clustering Discovery of combined multi-view clusters.

Application to Learning Objects Combining Multiple Objects Clustering Process Overview Meta Data 1 Re-use Scenarios s Re-use Scenarios 1 Meta Data m . Clustering 1 . Combining Multiple Objects Clustering Integrated Learning Objects Clusters Learning Objects Clustering r